| | |
Last updated on November 2, 2025. This conference program is tentative and subject to change
Technical Program for Friday December 12, 2025
| |
| FrA01 |
Galapagos I |
| Biomedical and Healthcare Systems |
Regular Session |
| Chair: Dabbene, Fabrizio | CNR-IEIIT |
| Co-Chair: Borri, Alessandro | CNR-IASI |
| |
| 09:30-09:45, Paper FrA01.1 | |
| An Adaptive Control Approach to Treatment Selection for Substance Use Disorders |
|
| Pulick, Eric | University of Wisconsin - Madison |
| Mintz, Yonatan | University of Wisconsin Madison |
Keywords: Healthcare and medical systems, Emerging control applications
Abstract: Despite the massive costs and widespread harms of substance use, most individuals with substance use disorders (SUDs) are untreated. Digital therapeutics platforms are an emerging low-cost, low-barrier means to expand treatment. There is a growing body of research showing how treatment providers can identify patients who need SUD support, but very little work addressing how providers should choose the most appropriate treatment for each patient. Because SUD treatment involves months or years of voluntary compliance from the patient, treatment adherence is a critical consideration for the provider. In this paper, we propose algorithms that a provider can use to match the burden-level of daily proposed treatments to the time-varying engagement state of the patient to promote adherence. We propose structured models for a patient’s engagement over time and their treatment adherence decisions. Using these models we pose a stochastic control formulation of the treatment-provider’s burden selection problem. We propose an adaptive control approach that estimates unknown patient parameters as new data are observed. We show that these estimates are consistent and propose algorithms using these estimates to make appropriate treatment recommendations.
|
| |
| 09:45-10:00, Paper FrA01.2 | |
| Long-Term Diabetes Prevention Via Physical Activity: An Output-Feedback MPC Approach |
|
| De Paola, Pierluigi Francesco | Consiglio Nazionale Delle Ricerche (CNR) |
| Borri, Alessandro | CNR-IASI |
| Paglialonga, Alessia | Consiglio Nazionale Delle Ricerche (CNR) |
| Dabbene, Fabrizio | CNR-IEIIT |
| Palumbo, Pasquale | University of Milano-Bicocca |
Keywords: Healthcare and medical systems, Predictive control for nonlinear systems, Sampled-data control
Abstract: Extensive clinical evidence supports the beneficial role of physical activity in delaying the progression of type-2 diabetes. However, current clinical recommendations remain largely qualitative, failing to account for the patient’s evolving condition and lacking a quantitative framework for real-time, personalized prescriptions. In this work, we propose an original model-based approach to the control of diabetes progression via physical activity, based on a control-theoretical formulation of the benefits of the exercise, leveraging a sampled-data observer-based model predictive control framework. We design the control law on a compact, widespread model of diabetes evolution, whilst the effectiveness of the proposed control strategy is tested in silico by closing the loop on a population of virtual subjects simulated by a different, higher-dimensional model of diabetes regulation under exercise. The validation procedure also accounts for the effect of additional non-idealities, including quantized measurements and disturbances, and clearly shows the efficacy of a suitably designed physical activity to prevent diabetes progression. To the best of our knowledge, this work proposes for the first time an output-feedback approach leveraging physical activity for long-term glucose regulation.
|
| |
| 10:00-10:15, Paper FrA01.3 | |
| Stackelberg Evolutionary Games with Modulated Leadership: A Three-Agent Framework for Tumor-Immune Dynamics |
|
| Romano, Chiara | University of L'Aquila |
| Di Benedetto, Maria Domenica | University of L'Aquila |
| Borri, Alessandro | CNR-IASI |
Keywords: Healthcare and medical systems, Modeling, Game theory
Abstract: In many real-world systems, agents interact in complex ways that challenge traditional control and game-theoretic approaches. Building on Stackelberg evolutionary game theory, which models the interplay between a rational leader and an adaptive follower, we introduce a three-agent framework with modulated leadership, where a passive modulator dynamically aligns with either the leader or the follower, thereby influencing the leader’s strategic choices. The main contribution of this work is the application of the proposed framework to a critical biomedical challenge: tumor growth control. Particularly, we develop a novel tumor-immune model capturing evolution, interactions, and environmental factors. We then frame tumor control as a strategic problem involving the physician (leader), the evolving tumor (follower), and the immune system, which can alternatively support treatment and promote tumor survival. Leveraging this framework, we design combination therapies that dynamically adjust chemotherapy and anti-PD1 immunotherapy. Numerical simulations demonstrate the potential of the proposed approach in oncology and, more broadly, in other fields involving dynamic, multi-agent interactions.
|
| |
| 10:15-10:30, Paper FrA01.4 | |
| Robust Interval Predictor for the BIS Regulation on Anesthesia Machines Considering Two Drugs |
|
| Mera, Manuel | Esime Upt Ipn |
| Ramirez-Barrios, Miguel | Instituto Politecnico Nacional |
| Aviles, Jesus David | Universidad Autónoma De Baja California |
| Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Keywords: Biomedical, Compartmental and Positive systems, Control applications
Abstract: A novel interval predictor-based controller is presented to solve the Bispectral Index (BIS) regulation problem in anesthetic procedures where propofol and remifentanil are employed together. The resulting control scheme does not require observing or filtering the drug concentrations on the patient's effect--site from the BIS signal. Knowing a feasible interval of initial conditions is enough to ensure the convergence of the regulation errors to some compact sets around the origin despite perturbations in the pharmacodynamics and pharmacokinetics models. Some simulations, considering different in silico patients, are included to illustrate the implementability of the methods.
|
| |
| 10:30-10:45, Paper FrA01.5 | |
| Towards a Reduced-Order Model for Anesthesia: Identification of Propofol-Remifentanil Effect As a Single Synergic Drug |
|
| Milanesi, Marco | Dipartimento Di Ingegneria Meccanica E Industriale |
| Consolini, Luca | Università Di Parma |
| Di Credico, Giulia | University of Parma |
| Latronico, Nicola | University of Brescia |
| Laurini, Mattia | University of Parma |
| Paltenghi, Massimiliano | ASST Spedali Civili Brescia |
| Schiavo, Michele | Università Degli Studi Di Brescia |
| Visioli, Antonio | University of Brescia |
Keywords: Biomedical, Identification, Optimization
Abstract: For personalized anesthesia, it is crucial to have accurate pharmacokinetic and pharmacodynamic (PK/PD) models of anesthetic drugs. Typically, total intravenous anesthesia (TIVA) relies on the combined administration of propofol and remifentanil, which affect the bispectral index (BIS). This paper proposes a model reduction method that exploits the use of a fixed administration ratio between propofol and remifentanil, which is consistent with the clinical practice. Using a fourth-order ARX model, we estimate the effect-site concentration and fit the BIS response using a Hill function, thus implicitly considering the coadministration as if it was a single drug. The identification process is performed using a branch and bound approach, leveraging BIS data obtained during the induction of anesthesia. The identified model is then validated during the maintenance phase of anesthesia against models that are usually employed in clinical practice, where propofol and remifentanil are separately modeled. Results obtained on real BIS data show the effectiveness of the proposed approach.
|
| |
| 10:45-11:00, Paper FrA01.6 | |
| Deep Learning Model Predictive Control for Deep Brain Stimulation in Parkinson's Disease |
|
| Steffen, Sebastian | University of Oxford |
| Cannon, Mark | University of Oxford |
Keywords: Biomedical, Learning, Predictive control for nonlinear systems
Abstract: We present a nonlinear data-driven Model Predictive Control (MPC) algorithm for deep brain stimulation (DBS) for the treatment of Parkinson's disease (PD). Although DBS is typically implemented in open-loop, closed-loop DBS (CLDBS) uses the amplitude of neural oscillations in specific frequency bands (e.g. beta 13-30 Hz) as a feedback signal, resulting in improved treatment outcomes with reduced side effects and slower rates of patient habituation to stimulation. To date, CLDBS has only been implemented in vivo with simple algorithms, such as proportional, proportional-integral, and thresholded switching control. Our approach employs a multi-step predictor based on differences of input-convex neural networks to model the future evolution of beta oscillations. The use of a multi-step predictor enhances prediction accuracy over the optimization horizon and simplifies online computation. In tests using a simulated model of beta-band activity response and data from PD patients, we achieve reductions of more than 20% in both tracking error and control activity in comparison with existing CLDBS algorithms. The proposed control strategy provides a generalizable data-driven technique that can be applied to the treatment of PD and other diseases targeted by CLDBS, as well as to other neuromodulation techniques.
|
| |
| 11:00-11:15, Paper FrA01.7 | |
| Why Do Border Controls Fail to Stabilise Epidemics? |
|
| Parag, Kris Varun | Imperial College London |
| Hoscheit, Patrick | INRAE |
Keywords: Biological systems, Biomedical, Emerging control applications
Abstract: Can border controls and travel restrictions suppress epidemics? This is an important and recurring question for public health policymaking and pandemic preparedness. Although there is a consensus that infections imported from external locations are critical to sustaining local epidemic growth and driving the endemic equilibria of the disease, there remains ongoing debate on the effectiveness of interventions aimed at curbing importations. Most studies contributing to this debate rely on complex metapopulation models that preclude generalisable insights and formal control principles are rarely used. Here we demonstrate how classical control theory applied to an analytic but flexible and widely used transmission model provides compelling evidence that border and travel restrictions fail to stabilise spread. These restrictions always enter as precompensators and hence fail to shift the epidemic poles. Consequently, regions cannot respond to emergent outbreaks in isolation. We find that coordinating interventions across both the local and external regions converts precompensation into feedback control and derive new criteria for overall stability. We further develop formulae specifying how border controls shape performance, such as disease equilibria and total infections. During growing epidemic phases, cooperative control is crucial for stability. Once stability is assured, border control can be sufficient to meet performance targets.
|
| |
| 11:15-11:30, Paper FrA01.8 | |
| Insulin Sensitivity Management in Artificial Pancreas: A Switching Control Strategy Approach – an in Silico Study |
|
| Cavallo, Maria Sofia | University of Bergamo |
| Licini, Nicola | University of Bergamo |
| Previdi, Fabio | Università Degli Studi Di Bergamo |
| Ferramosca, Antonio | Univeristy of Bergamo |
Keywords: Biomedical, Switched systems, Predictive control for linear systems
Abstract: Management of insulin sensitivity variability poses a significant challenge in achieving optimal blood glucose control for Type 1 Diabetes Mellitus (T1DM) patients using Artificial Pancreas (AP) systems. Traditional control strategies, particularly those employing Linear Time-Invariant (LTI) models in Model Predictive Control (MPC), although effective, do not adequately address the pronounced circadian fluctuations in insulin sensitivity. This study proposes an innovative switching MPC strategy leveraging multiple linear models, each corresponding to distinct daily periods (i.e., morning, afternoon, and evening) to dynamically adapt insulin dosing. The flexibility of the switching algorithm allows transitions between models within predefined, physiologically appropriate time windows. Performance evaluation, conducted via simulations using the UVA/Padova T1DM simulator, demonstrates that the proposed switching control strategy substantially reduces hypoglycemic episodes and stabilizes glucose variability compared to traditional single-model MPC approaches. This adaptive methodology shows promise in enhancing the safety and efficacy of glucose management, paving the way for improved quality of life and reduced diabetes-related complications.
|
| |
| FrA02 |
Oceania II |
| Data-Driven Verification and Control with Provable Guarantees I |
Invited Session |
| Chair: Nejati, Amy | Newcastle University |
| Co-Chair: Jungers, Raphaël M. | University of Louvain |
| Organizer: Lavaei, Abolfazl | Newcastle University |
| Organizer: Nejati, Amy | Newcastle University |
| Organizer: Jungers, Raphaël M. | University of Louvain |
| Organizer: Abate, Alessandro | University of Oxford |
| |
| 09:30-09:45, Paper FrA02.1 | |
| Online Complexity Estimation for Repetitive Scenario Design (I) |
|
| Berger, Guillaume O. | UCLouvain |
| Jungers, Raphaël M. | University of Louvain |
Keywords: Data driven control, Statistical learning, Randomized algorithms
Abstract: We consider the problem of repetitive scenario design where one has to solve repeatedly a scenario design problem and can adjust the sample size (number of scenarios) to obtain a desired level of risk (constraint violation probability). We propose an approach to learn on the fly the optimal sample size based on observed data consisting in previous scenario solutions and their risk level. Our approach consists in learning a function that represents the pdf (probability density function) of the risk as a function of the sample size. Once this function is known, retrieving the optimal sample size is straightforward. We prove the soundness and convergence of our approach to obtain the optimal sample size for the class of fixed-complexity scenario problems, which generalizes fully-supported convex scenario programs that have been studied extensively in the scenario optimization literature. We also demonstrate the practical efficiency of our approach on a series of challenging repetitive scenario design problems, including non-fixed-complexity problems, nonconvex constraints and time-varying distributions.
|
| |
| 09:45-10:00, Paper FrA02.2 | |
| Underapproximative Methods for the Order Reduction of Zonotopes |
|
| Lützow, Laura | Technical University Munich |
| Kochdumper, Niklas | Cite University Paris |
| Althoff, Matthias | Technische Universität München |
Keywords: Model/Controller reduction, Optimization, Uncertain systems
Abstract: Zonotopes are a widely used set representation in set-based computations due to their compact representation size and their closure under many relevant set operations. However, certain set operations, such as the Minkowski sum, increase the zonotope order, which in turn increases the computational cost of further computations. To address this issue, various order reduction techniques have been proposed, most of which focus on overapproximating the original zonotope. While overapproximations are crucial for safety verification, some applications - such as reachset-conformant identification and backward reachability analysis - require underapproximations (also referred to as inner-approximations). Besides providing a comprehensive survey of existing underapproximative order reduction methods, we propose four novel reduction methods in this paper. We analyze the computational cost of all methods and evaluate the tightness of the resulting underapproximations through numerical experiments on more than 2000 randomly generated zonotopes. The results demonstrate that our proposed methods achieve a favorable balance between computational efficiency and approximation accuracy, making them well-suited for applications in control, estimation, and system identification.
|
| |
| 10:00-10:15, Paper FrA02.3 | |
| Safe Domains of Attraction for Discrete-Time Nonlinear Systems: Characterization and Verifiable Neural Network Estimation (I) |
|
| Serry, Mohamed | University of Waterloo |
| Li, Haoyu | University of Illinois, Urbana-Champaign |
| Zhou, Ruikun | University of Waterloo |
| Zhang, Huan | UIUC |
| Liu, Jun | University of Waterloo |
Keywords: Lyapunov methods, Neural networks, Stability of nonlinear systems
Abstract: Analysis of nonlinear autonomous systems typically involves estimating domains of attraction, which has been a topic of extensive research interest for decades. Despite this, accurately estimating domains of attraction for nonlinear systems remains a challenging task, where existing methods are conservative or limited to low-dimensional systems. The estimation becomes even more challenging when accounting for state constraints. In this work, we propose a framework to accurately estimate safe (state-constrained) domains of attraction for discrete-time autonomous nonlinear systems. In establishing this framework, we first derive a new Zubov equation, whose solution corresponds to the exact safe domain of attraction. The solution to the aforementioned Zubov equation is shown to be unique and continuous over the whole state space. We then present a physics-informed approach to approximating the solution of the Zubov equation using neural networks. To obtain certifiable estimates of the domain of attraction from these neural network approximate solutions, we propose a verification framework that can be implemented using standard verification tools (e.g., alpha,!beta-CROWN and dReal). To illustrate its effectiveness, we demonstrate our approach through numerical examples concerning nonlinear systems with state constraints.
|
| |
| 10:15-10:30, Paper FrA02.4 | |
| Post-Design Verification in the Scenario Approach (I) |
|
| Care', Algo | University of Brescia |
| Campi, M. C. | University of Brescia |
| Garatti, Simone | Politecnico Di Milano |
Keywords: Data driven control, Statistical learning, Uncertain systems
Abstract: The scenario approach is an established data-driven design framework equipped with a powerful theory that links design complexity to generalization properties. A key feature of this method is that training data are used both for design and for certifying the design’s reliability, without resorting to additional data points. This paper takes a step further: the goal is not only to certify the properties considered at the design stage, but also to guarantee new properties of interest in post-design usage. To this end, we introduce a two-level framework of appropriateness: baseline appropriateness, which guides the design process, and post-design appropriateness, which serves as a criterion for a posteriori evaluation. The paper fully develops a post-design verification framework that works without resorting to additional test data and illustrates it on an H2 problem.
|
| |
| 10:30-10:45, Paper FrA02.5 | |
| Learning Robust Safety Controllers for Uncertain Input-Affine Polynomial Systems (I) |
|
| Akbarzadeh, Omid | Newcastle University |
| Ashoori, MohammadHossein | Newcastle University |
| Lavaei, Abolfazl | Newcastle University |
Keywords: Data driven control, Robust control, Formal Verification/Synthesis
Abstract: This paper offers a direct data-driven approach for learning robust control barrier certificates (R-CBCs) and robust safety controllers (R-SCs) for discrete-time input-affine polynomial systems with unknown dynamics under unknown-but-bounded disturbances. The proposed method relies on data from input-state observations collected over a finite-time horizon while satisfying a specific rank condition. Our data-driven scheme enables the synthesis of R-CBCs and R-SCs directly from observed data, bypassing the need for explicit modeling of the system's dynamics and thus ensuring robust system safety against disturbances within an infinite time horizon. The proposed approach is formulated as a sum-of-squares (SOS) optimization problem, providing a structured design framework. Two case studies showcase our method's capability to provide robust safety guarantees for unknown input-affine polynomial systems under bounded disturbances, demonstrating its practical effectiveness.
|
| |
| 10:45-11:00, Paper FrA02.6 | |
| Continuous-Time Data-Driven Barrier Certificate Synthesis (I) |
|
| Rickard, Luke | University of Oxford |
| Abate, Alessandro | University of Oxford |
| Margellos, Kostas | University of Oxford |
Keywords: Lyapunov methods, Data driven control, Neural networks
Abstract: We consider the problem of verifying safety for continuous-time dynamical systems. Developing upon recent advancements in data-driven verification, we use only a finite number of sampled trajectories to learn a barrier certificate, namely a function which verifies safety. We train a safety-informed neural network to act as this certificate, with an appropriately designed loss function to encompass the safety conditions. In addition, we provide probabilistic generalisation guarantees from discrete samples of continuous trajectories, to unseen continuous ones. Numerical investigations demonstrate the efficacy of our approach and contrast it with related results in the literature.
|
| |
| 11:00-11:15, Paper FrA02.7 | |
| Model Order Reduction from Data with Certification (I) |
|
| Samari, Behrad | Newcastle University |
| Nejati, Amy | Newcastle University |
| Lavaei, Abolfazl | Newcastle University |
Keywords: Reduced order modeling, Data driven control, Formal Verification/Synthesis
Abstract: This paper introduces a data-driven scheme to construct reduced-order models (ROMs) of linear dynamical systems with unknown mathematical models. Our methodology leverages data and establishes similarity relations between output trajectories of unknown systems and their data-driven ROMs via the notion of simulation functions (SFs), capable of formally quantifying their closeness. To achieve this, under a rank condition readily fulfillable using data, we collect only two input-state trajectories from unknown systems to construct both ROMs and SFs, while offering correctness guarantees. We demonstrate that the proposed ROMs derived from data can be leveraged for controller synthesis endeavors while effectively ensuring high-level logic properties over unknown dynamical models. We showcase our data-driven findings across a range of benchmark scenarios involving various unknown physical systems, demonstrating the enforcement of diverse complex properties.
|
| |
| 11:15-11:30, Paper FrA02.8 | |
| Enforcing Input-Output Behavior in Data-Driven Moment Matching (I) |
|
| Bhattacharjee, Debraj | Imperial College London |
| Moreschini, Alessio | Imperial College London |
| Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Reduced order modeling, Modeling, Model/Controller reduction
Abstract: While approximating dynamical systems, one may be interested in preserving physical insights of specific states of the underlying system, which can be achieved by fixing the output map of the approximated model. In this paper we derive a class of reduced-order models that achieve moment matching while enforcing an output map chosen by the designer. We show that such models can be constructed even without the knowledge of the underlying system by using data generated from some experiments. In doing so, we highlight how the proposed method can be utilized to enforce input-output behavior, such as passivity, negative imaginary, and/or finite gain, in the reduced-order model, thus providing provable guarantees on various user-specified requirements. We demonstrate this aspect through several examples.
|
| |
| FrA03 |
Oceania III |
| Constrained Learning for Safe Control |
Invited Session |
| Chair: Johansson, Karl H. | KTH Royal Institute of Technology |
| Co-Chair: Bastianello, Nicola | KTH Royal Institute of Technology |
| Organizer: Harting, Alice | KTH Royal Institute of Technology |
| Organizer: Barreau, Matthieu | KTH |
| Organizer: Bastianello, Nicola | KTH Royal Institute of Technology |
| Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
| |
| 09:30-09:45, Paper FrA03.1 | |
| Incentive Design for Safe Nash Equilibrium Learning in Large Populations Via Control Barrier Functions |
|
| Xiao, Yifeng | University of California, Berkeley |
| Certorio, Jair | University of Maryland |
| Martins, Nuno C. | University of Maryland |
| Shoukry, Yasser | University of California, Irvine |
| Nuzzo, Pierluigi | University of Southern California |
Keywords: Agents-based systems, Lyapunov methods, Emerging control applications
Abstract: Many decision-making scenarios in engineering, sociology, and epidemiology, among other fields, can be effectively modeled by a large population of learning agents, captured by an evolutionary dynamics model (EDM), which can possibly act non-cooperatively and interact with an exogenous dynamical system (ES). In these systems, the agents’ collective behavior can drive the ES into undesirable states and significantly compromise its stability and safety. In this paper, we propose a method using control barrier functions to design payoffs that can guide the population’s strategy selection while ensuring safe and stable behavior of the ES. We first address the need to avoid undesirable population states in the EDM. We then extend our approach to coupled EDM-ES systems, where the designed payoffs can prevent undesirable states and guarantee convergence to a target state. Finally, we establish general conditions under which the designed payoffs remain effective across different classes of EDMs, ensuring broad applicability. Numerical simulations validate our approach in both standalone EDMs and coupled EDM-ES systems that satisfy the proposed conditions.
|
| |
| 09:45-10:00, Paper FrA03.2 | |
| Predictive Lagrangian Optimization for Constrained Reinforcement Learning |
|
| Zhang, Tianqi | Tsinghua University |
| Yuan, Puzhen | Tsinghua Uniersity |
| Zhan, Guojian | Tsinghua University |
| Lin, Ziyu | Tsinghua Uniersity |
| Lyu, Yao | Tsinghua University |
| Qin, Zhenzhi | Tsinghua Uniersity |
| Duan, Jingliang | University of Science and Technology Beijing |
| Zhang, Liping | Department of Mathematical Sciences, Tsinghua University, Beijin |
| Li, Shengbo Eben | Tsinghua University |
Keywords: Reinforcement learning, Learning
Abstract: Constrained optimization is popularly seen in reinforcement learning (RL) for addressing complex control tasks. From the perspective of dynamic system, iteratively solving a constrained optimization problem can be framed as the temporal evolution of a feedback control system. Classical constrained optimization methods, such as penalty and Lagrangian approaches, inherently use proportional and integral feedback controllers. In this paper, we propose a more generic equivalence framework to build the connection between constrained optimization and feedback control system, for the purpose of developing more effective constrained RL algorithms. Firstly, we define that each step of the system evolution determines the Lagrange multiplier by solving a multiplier feedback optimal control problem (MFOCP). In this problem, the control input is multiplier, the state is policy parameters, the dynamics is described by policy gradient descent, and the objective is to minimize constraint violations. Then, we introduce a multiplier guided policy learning (MGPL) module to perform policy parameters updating. And we prove that the resulting optimal policy, achieved through alternating MFOCP and MGPL, aligns with the solution of the primal constrained RL problem, thereby establishing our equivalence framework. Furthermore, we point out that the existing PID Lagrangian is merely one special case within our framework that utilizes a PID controller. We also accommodate the integration of other various feedback controllers, thereby facilitating the development of new algorithms. As a representative, we employ model predictive control (MPC) as the feedback controller and consequently propose a new algorithm called predictive Lagrangian optimization (PLO). Numerical experiments demonstrate its superiority over the PID Lagrangian method, achieving a larger feasible region up to 7.2% and a comparable average reward.
|
| |
| 10:00-10:15, Paper FrA03.3 | |
| Designing Positive Excitation Signals for the Safe Identification of Nonlinear Integrating Systems (I) |
|
| Jenneskens, Jan | DIFFER |
| Ceelen, Lennard | Dutch Institute for Fundamental Energy Research |
| Schoukens, Maarten | Eindhoven University of Technology |
| van Berkel, Matthijs | Dutch Institute for Fundamental Energy Research |
Keywords: Identification for control, Identification
Abstract: Designing identification experiments for nonlinear systems with only positive excitation is challenging. Large excitations improve the signal-to-noise ratio but risk deviating the system from the desired operating point due to their increased mean. To address this, we propose an algorithm that minimizes the integral of the perturbation, balancing the signal-to-noise ratio and the deviation from the target. We validate this approach with a nonlinear diffusion system identification simulation, which is illustrative for fusion energy applications.
|
| |
| 10:15-10:30, Paper FrA03.4 | |
| Trajectory-Based Barrier Certificates for Monotone Systems (I) |
|
| Galarza-Jimenez, Felipe | University of Colorado, Boulder |
| Zamani, Majid | University of Colorado Boulder |
| Jafarpour, Saber | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis, Data driven control, Uncertain systems
Abstract: We propose a data-driven approach to construct barrier certificates for unknown monotone systems using a finite number of system trajectories, while ensuring formal correctness guarantees. Given multiple system trajectories, we construct a family of monotone basis functions that remain non-increasing along all trajectories. Using this class of basis functions, we develop a sampling-based optimization approach to construct barrier certificates, establishing system safety with formal guarantees without requiring additional simulated data or assuming Lipschitz continuity of the system. Furthermore, for the class of polynomial barrier functions, we show that the sampling-based approach can be efficiently reformulated as a linear program with significantly fewer constraints. Finally, we demonstrate the effectiveness of our approach through numerical examples.
|
| |
| 10:30-10:45, Paper FrA03.5 | |
| Feasibility Informed Advantage Weighted Regression for Persistent Safety in Offline Reinforcement Learning |
|
| Koirala, Prajwal | Iowa State University |
| Jiang, Zhanhong | Iowa State University |
| Sarkar, Soumik | Iowa State University |
| Fleming, Cody | Iowa State University |
Keywords: Reinforcement learning, Data driven control, Robotics
Abstract: Safe offline reinforcement learning aims to learn policies that maximize cumulative rewards while adhering to safety constraints, using only offline data for training. A key challenge is balancing safety and performance, particularly when the policy encounters out-of-distribution (OOD) states and actions, which can lead to safety violations or overly conservative behavior during deployment. To address these challenges, we introduce Feasibility Informed Advantage Weighted Actor-Critic (FAWAC), a method that prioritizes persistent safety in constrained Markov decision processes (CMDPs). FAWAC formulates policy optimization with feasibility conditions derived specifically for offline datasets, enabling safe policy updates in non-parametric policy space, followed by projection into parametric space for constrained actor training. By incorporating a cost-advantage term into Advantage Weighted Regression (AWR), FAWAC ensures that the safety constraints are respected while maximizing performance. Additionally, we propose a strategy to address a more challenging class of problems that involves tempting datasets where trajectories are predominantly high-rewarded but unsafe. Empirical evaluations on standard benchmarks demonstrate that FAWAC achieves strong results, effectively balancing safety and performance in learning policies from the static datasets.
|
| |
| 10:45-11:00, Paper FrA03.6 | |
| Closed-Loop Neural Operator-Based Observer of Traffic Density (I) |
|
| Harting, Alice | KTH Royal Institute of Technology |
| Johansson, Karl H. | KTH Royal Institute of Technology |
| Barreau, Matthieu | KTH |
Keywords: Observers for nonlinear systems, Distributed parameter systems, Traffic control
Abstract: We consider the problem of traffic density estimation with sparse measurements from stationary roadside sensors. Our approach uses Fourier neural operators to learn macroscopic traffic flow dynamics from high-fidelity data. During inference, the operator functions as an open-loop predictor of traffic evolution. To close the loop, we couple the open-loop operator with a correction operator that combines the predicted density with sparse measurements from the sensors. Simulations with the SUMO software indicate that, compared to open-loop observers, the proposed closed-loop observer exhibits classical closed-loop properties such as robustness to noise and ultimate boundedness of the error. This shows the advantages of combining learned physics with real-time corrections, and opens avenues for accurate, efficient, and interpretable data-driven observers.
|
| |
| 11:00-11:15, Paper FrA03.7 | |
| Probabilistically Safe and Efficient Model-Based Reinforcement Learning |
|
| Airaldi, Filippo | Delft University of Technology |
| De Schutter, Bart | Delft University of Technology |
| Dabiri, Azita | Delft University of Technology |
Keywords: Reinforcement learning, Optimal control
Abstract: This paper proposes tackling safety-critical stochastic Reinforcement Learning (RL) tasks with a sample-based, model-based approach. At the core of the method lies a Model Predictive Control (MPC) scheme that acts as function approximation, providing a model-based predictive control policy. To ensure safety, a probabilistic Control Barrier Function (CBF) is integrated into the MPC controller. To approximate the effects of stochasticies in the optimal control formulation and to fulfil the probabilistic CBF condition, a sample-based approach with guarantees is employed. Furthermore, to counterbalance the additional computational burden due to sampling, a learnable terminal cost formulation is included in the MPC objective. An RL algorithm is deployed to learn both the terminal cost and the CBF constraint. Results from a numerical experiment on a constrained LTI problem corroborate the effectiveness of the proposed methodology in reducing computation time while preserving control performance and safety.
|
| |
| 11:15-11:30, Paper FrA03.8 | |
| Neural Barrier Certificates for Monotone Systems |
|
| Alavi, Amirreza | University of Colorado Boulder |
| Nadali, Alireza | University of Colorado, Boulder |
| Zamani, Majid | University of Colorado Boulder |
| Jafarpour, Saber | University of Colorado Boulder |
Keywords: Data driven control, Neural networks, Autonomous systems
Abstract: Barrier certificates are real-valued functions used to formally verify the safety of dynamical systems. For systems with unknown dynamics, data-driven barrier certificates have been developed to guarantee safety using only a finite set of data. However, most existing methods rely on knowledge of Lipschitz bound of the system and require a fine discretization of the state set, leading to high sample complexity. In this paper, we propose a novel data-driven framework for learning barrier certificates for unknown monotone systems. Our approach is based on a suitable embedding of barrier certificates into a higher-dimensional space. By leveraging interval analysis, this embedding enables us to establish data-driven safety certificates for monotone systems. Unlike existing methods, our framework is independent of Lipschitz continuity and quantization parameters of the state set, allowing for arbitrary state-space discretization and thereby alleviating extensive sampling requirements. To efficiently construct barrier certificates, we introduce suitable neural network architectures and train them using an appropriately designed loss function. We illustrate our approach through two case studies, demonstrating that our method successfully finds separable embedded barrier certificates while substantially reducing sample complexity compared to conventional neural barrier certificate methods, all while maintaining formal correctness guarantees.
|
| |
| FrA04 |
Oceania IV |
| Bounded Rationality in Human-AI Decision-Making |
Invited Session |
| Chair: Akyol, Emrah | SUNY Binghamton |
| Co-Chair: Vasconcelos, Marcos M. | Florida State University |
| Organizer: Vasconcelos, Marcos M. | Florida State University |
| Organizer: Akyol, Emrah | SUNY Binghamton |
| |
| 09:30-09:45, Paper FrA04.1 | |
| Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding (I) |
|
| Senthil Kumar, Nishanth Venkatesh | Cornell University |
| Bang, Heeseung | Cornell University |
| Malikopoulos, Andreas A. | Cornell University |
Keywords: Statistical learning, Reinforcement learning, Game theory
Abstract: In this paper, we expand the Bayesian persuasion framework to account for unobserved confounding variables in sender-receiver interactions. While traditional models typically assume that belief updates follow Bayesian principles, real-world scenarios often involve hidden variables that impact the receiver’s belief formation and decision-making. We conceptualize this as a sequential decision-making problem, where the sender and receiver interact over multiple rounds. In each round, the sender communicates with the receiver, who also interacts with the environment. Crucially, the receiver’s belief update is affected by an unobserved confounding variable. By reformulating this scenario as a Partially Observable Markov Decision Process (POMDP), we capture the sender’s incomplete information regarding both the dynamics of the receiver’s beliefs and the unobserved confounder. We prove that finding an optimal observation-based policy in this POMDP is equivalent to solving for an optimal signaling strategy in the original persuasion framework. Furthermore, we demonstrate how this reformulation facilitates the application of proximal learning for off-policy evaluation (OPE) in the persuasion process. This advancement enables the sender to evaluate alternative signaling strategies using only observational data from a behavioral policy, thus eliminating the necessity for costly new experiments.
|
| |
| 09:45-10:00, Paper FrA04.2 | |
| Preference-Centric Route Recommendation: Equilibrium, Learning, and Provable Efficiency (I) |
|
| Yang, Ya-Ting | New York University |
| Pan, Yunian | New York University |
| Zhu, Quanyan | New York University |
Keywords: Transportation networks, Game theory, Learning
Abstract: Traditional approaches to modeling and predicting traffic behavior often rely on Wardrop Equilibrium (WE), assuming non-atomic traffic demand and neglecting correlations in individual decisions. However, the growing role of real-time human feedback and adaptive recommendation systems calls for more expressive equilibrium concepts that better capture user preferences and the stochastic nature of routing behavior. In this paper, we introduce a preference-centric route recommendation framework grounded in the concept of Borda Coarse Correlated Equilibrium (BCCE), wherein users have no incentive to deviate from recommended strategies when evaluated by Borda scores—pairwise comparisons encoding user preferences. We develop an adaptive algorithm that learns from dueling feedback and show that it achieves mathcal{O}(T^{frac{2}{3}}) regret, implying convergence to the BCCE under mild assumptions. We conduct empirical evaluations using a case study to illustrate and justify our theoretical analysis. The results demonstrate the efficacy and practical relevance of our approach.
|
| |
| 10:00-10:15, Paper FrA04.3 | |
| A Hierarchical Cognitive Framework for Strategic Communication (I) |
|
| Anand, Anju | Binghamton University |
| Akyol, Emrah | SUNY Binghamton |
Keywords: Game theory
Abstract: This paper analyzes the problem of communication between boundedly rational senders with heterogeneous objectives and partial information and a full information fully rational receiver. Particularly, we explore a setting where there are K different types of senders with varying levels of available information, degree of strategic behavior, and cognitive hierarchy: i) a non-strategic agent with an honest response (level-0), ii) a k-th level kin [1:K-1] strategic agent that believes the population is Poisson distributed over the lower cognitive types. We model each of these scenarios as a strategic communication of a 2-dimensional source (possibly correlated source and bias components) with quadratic distortion measures. The numerical results we obtained via the proposed method validate our theoretical observations.
|
| |
| 10:15-10:30, Paper FrA04.4 | |
| Optimizing Engagement in Recommender Systems with Private Preferences and Mismatched Payoffs |
|
| Vasconcelos, Marcos M. | Florida State University |
| Camara, Odilon | University of Southern California |
Keywords: Game theory, Stochastic systems, Human-in-the-loop control
Abstract: A large share of online content is consumed via social media and other digital platforms. Given the overwhelming amount of content, platforms deploy recommendation systems that operate under uncertainty about user preferences. We introduce a novel information disclosure model of user–platform interaction. The user privately knows his preferences and chooses whether to disclose them to the platform. The platform observes the available content and selects one of them to recommend, aiming to maximize expected engagement. A conflict arises because, when the platform delivers polarizing content, it often increases engagement while reducing user utility. We construct a symmetric Perfect Bayesian Equilibrium characterized by simple threshold functions, and identify conditions under which platforms obtain higher engagement when content creators produce more extreme content. This result helps explain the ongoing escalation of extreme content on social media platforms.
|
| |
| 10:30-10:45, Paper FrA04.5 | |
| Bi-Threshold Decision-Making Dynamics |
|
| Aghaeeyan, Azadeh | Brock University |
| Ramazi, Pouria | University of Calgary |
Keywords: Game theory
Abstract: In a two-strategy decision-making problem, a bi-threshold individual adopts a strategy only if the prevalence of adopters lies between her two thresholds. This study investigates the asymptotic behavior of a heterogeneous bi-threshold population, where the thresholds vary among the individuals. First, we show through an example that the proportion of adopters in finite populations may fluctuate. We then use available results in the stochastic approximation theory and approximate the finite discrete dynamics by corresponding continuous-time dynamics. Our analysis shows that the continuous-time dynamics always equilibrate. This implies that the amplitudes of the fluctuations in the proportion of adopters in a finite bi-threshold population almost surely diminish when the population size grows to infinity. The result underlines the advantage of analyzing simpler continuous-time dynamics to get insights into the long-term behavior of finite discrete bi-threshold populations.
|
| |
| 10:45-11:00, Paper FrA04.6 | |
| Convergent Q-Learning for Infinite-Horizon General-Sum Markov Games through Behavioral Economics |
|
| Zhang, Yizhou | California Institute of Technology |
| Mazumdar, Eric | California Institute of Technology |
Keywords: Game theory, Reinforcement learning, Markov processes
Abstract: Risk-aversion and bounded rationality are two key characteristics of human decision-making. Risk-averse quantal-response equilibrium (RQE) is a solution concept that incorporates these features, providing a more realistic depiction of human decision making in various strategic environments compared to a Nash equilibrium. Furthermore a class of RQE have recently been shown in Mazumdar et al., 2024 to be universally computationally tractable in all finite-horizon Markov games, allowing for the development of multi-agent reinforcement learning algorithms with convergence guarantees. In this paper, we expand upon the study of RQE and analyze their computation in both two-player normal form games and discounted infinite-horizon Markov games. For normal form games we adopt a monotonicity-based approach allowing us to generalize previous results. We first show uniqueness and Lipschitz continuity of RQE with respect to player's payoff matrices under monotonicity assumptions, and then provide conditions on the players' degrees of risk aversion and bounded rationality that ensure monotonicity. We then focus on discounted infinite-horizon Markov games. We define the risk-averse quantal-response Bellman operator and prove its contraction under further conditions on the players' risk-aversion, bounded rationality, and temporal discounting. This yields a Q-learning based algorithm with convergence guarantees for all infinite horizon general-sum games.
|
| |
| 11:00-11:15, Paper FrA04.7 | |
| Fatigue and Task Load Dependent Decision Referrals for Joint Binary Classification in Human-Automation Teams |
|
| Seraj, Raihan | McGill University |
| Le Ny, Jerome | Polytechnique Montréal |
| Mahajan, Aditya | McGill University |
Keywords: Human-in-the-loop control, Stochastic optimal control
Abstract: We consider a human-automation team jointly solving binary classification tasks over multiple time stages. At each stage, the automation observes the data for a batch of classification tasks, classifies a subset of them and refers the others to the human. The human's performance depends on task load and fatigue, where fatigue is modeled as a controlled Markov process dependent on the past task loads. We formulate the automation's problem of deciding which tasks to refer as a Markov decision process and present a sampling-based approximate dynamic program that leverages task independence across time and the structure of the recently obtained single-stage optimal allocation policy. We then present a numerical study comparing our solution against a baseline policy that does not explicitly account for fatigue dynamics.
|
| |
| 11:15-11:30, Paper FrA04.8 | |
| How Can an Influencer Maximize Her Social Power? |
|
| Wang, Lingfei | KTH Royal Institute of Technology |
| Xing, Yu | KTH Royal Institute of Technology |
| Yi, Yuhao | Sichuan University |
| Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Network analysis and control, Agents-based systems, Human-in-the-loop control
Abstract: This article considers Friedkin-Johnsen (FJ) model with an external influencer. The influencer is totally stubborn (i.e., her own opinion never changes), and can par- ticipate in the opinion evolution by adding links to a fixed number of agents in the FJ model. The problem is to investigate how the influencer selects agents to maximize her social power, which represents the influence of her initial opinion on other agents’ final opinions. This problem is shown to be equivalent to maximize the absorbing probability to the influencer for a Markov chain. The solution is analytically characterized for the cases of small and large stubbornness, with a single agent to be selected. Moreover, the social power of the influencer is proved to be monotone and submodular with the selected agent set, and a greedy algorithm is then proposed to generate an approximated solution. Random walks are used to speed up the algorithm for large network size. The effectiveness of the greedy algorithm is further shown by a numerical example.
|
| |
| FrA05 |
Galapagos II |
Advances in Rigidity Theory, Multi-Agent Formations, and Distributed
Localization |
Invited Session |
| Chair: Chen, Liangming | Southern University of Science and Technology (SUSTech) |
| Co-Chair: Sun, Zhiyong | Peking University (PKU) |
| Organizer: Chen, Liangming | Southern University of Science and Technology (SUSTech) |
| Organizer: Sun, Zhiyong | Peking University (PKU) |
| |
| 09:30-09:45, Paper FrA05.1 | |
| Equilibrium-Driven Smooth Separation and Navigation of Marsupial Robotic System |
|
| Hu, Bin-Bin | University of Groningen |
| Jayawardhana, Bayu | University of Groningen |
| Cao, Ming | University of Groningen |
Keywords: Cooperative control, Autonomous systems, Robotics
Abstract: In this paper, we propose an equilibrium-driven controller that enables a marsupial carrier-passenger robotic system to achieve smooth carrier-passenger separation and then to navigate the passenger robot toward a predetermined target point. To achieve smooth separation and navigation, we design a third-order potential gradient for the passenger's controller as a function of the carrier-passenger and carrier-target distances in the moving carrier robot's coordinate frame, which introduces three equilibrium points in the closed-loop error system, each corresponding to one of the following three scenarios: the passenger robot is (i) staying on the carrier robot, (ii) positioned between the carrier robot and the target, and (iii) reaching the target. Initially, the passenger robot is confined to the carrier robot, namely within the attraction region of the equilibrium point~(i). By appropriately decreasing the carrier-target distance, the equilibrium point (ii) vanishes, and the attraction and repulsion regions of equilibrium points (i) and (iii) change accordingly. This transition guides the passenger robot to leave the equilibrium point (i) and approach the equilibrium point (iii), which enables a smooth carrier-passenger separation and seamless navigation to the target. Furthermore, rigorous convergence analysis is also provided. Finally, 2D and 3D simulations are conducted to demonstrate the effectiveness and adaptability of the proposed controller facing obstacle in the surroundings.
|
| |
| 09:45-10:00, Paper FrA05.2 | |
| A Formal Analysis of Control-Communication Codesign for Formation Systems with TDMA Communications (I) |
|
| Chen, Yaru | National University of Defense Technology |
| Cong, Yirui | National University of Defense Technology |
| Sun, Zhiyong | Peking University (PKU) |
| Wang, Xiangke | National University of Defense Technology |
Keywords: Cooperative control, Control over communications, Control of networks
Abstract: For formation control systems based on wireless communications, communication interaction and control performance are highly coupled, while traditional methods do not take this coupling into account. To fill this gap, we study the formation control problem of second-order multi-agent systems (MASs) with uncertain dynamics (affected by unknown but bounded process noises) and commonly-used time division multiple access (TDMA) communications. A formal analysis of system modeling, control-communication codesign, and formation system stability is provided. Firstly, we establish the communication model of MASs, which characterizes the effects of TDMA technology, agent dynamics, and physical layer of communications. Based on the communication model, we propose a codesign of the formation control and the communication power control to achieve bounded formation stability, where the boundedness property of the transmit power and the delay-based formation controller are analyzed. Finally, a numerical simulation is provided to validate the effectiveness of the proposed codesign method.
|
| |
| 10:00-10:15, Paper FrA05.3 | |
| Safe Air-Ground Coordination Control under Hybrid Cyberattacks Via Reinforcement Learning and Self-Triggered Communication (I) |
|
| Ren, Ziming | Beihang University |
| Liu, Hao | Beihang University |
| Sun, Zhiyong | Peking University (PKU) |
Keywords: Cooperative control, Intelligent systems, Agents-based systems
Abstract: The safe optimal coordination control problem is addressed for partially-unknown input-constrained air-ground systems under hybrid cyberattacks. Adversaries can launch denial-of-service attacks to prevent data transmission and channel manipulation attacks to tamper with interaction data. A unified distributed observer-based optimal control framework is first proposed for the heterogeneous vehicles. To achieve consensus under hybrid cyberattacks, the Zeno-free switching-type self-triggered observer is constructed based on only viable faulting neighborhood data. Then, optimal input-constrained control policies are learned via an on-policy actor-critic neural network-based learning algorithm. Sufficient conditions to guarantee the stability of the closed-loop system under the modeled hybrid cyberattacks are established. Numerical examples validate the effectiveness of the developed approach.
|
| |
| 10:15-10:30, Paper FrA05.4 | |
| On the Equivalence between Signed Angle Rigidity and Bearing Rigidity (I) |
|
| Huang, Jinpeng | Chongqing University |
| Jing, Gangshan | Chongqing University |
Keywords: Distributed control, Control of networks, Cooperative control
Abstract: In recent years, numerous extended rigidity theories have been developed by considering different types of local constraints, to adapt to formation control in different scenarios. Among them, the signed angle constraint has attracted considerable interest for its exceptional ability to stabilize coordinate-free formations with high degrees of freedom. Unfortunately, the combinatorial aspects of signed angle rigidity remain largely unexplored. In this paper, we study signed angle rigidity theory in the context of frameworks and prove its complete equivalence to bearing rigidity theory in the plane for the first time. Building on this equivalence, we propose a distributed formation controller based on signed angle measurements only, which achieves global stability as long as the formation framework is infinitesimally signed angle rigid. A simulation example demonstrates the effectiveness of our results.
|
| |
| 10:30-10:45, Paper FrA05.5 | |
| Angle-Based Target Tracking Using Kalman Filter and Fully Actuated Control (I) |
|
| Zhang, Chi | Southern University of Science and Technology |
| Ren, Weijie | Southern University of Science and Technology |
| Chen, Liangming | Southern University of Science and Technology (SUSTech) |
| Duan, Guangren | Harbin Institute of Technology |
Keywords: Estimation, Cooperative control, Autonomous systems
Abstract: This paper introduces a framework of localizing and tracking a non-cooperative target by using two pursuers which are assumed to have measurements of self-displacements and interior angles of the triangle formed by the target and the pursuers. First, based on the geometric relationship between the target and pursuers, an {measurement} model is derived from the pursuers' measured interior angles and self-displacements. Combined with the state-transition model, a linear time-varying Kalman filter is designed to estimate the relative position and velocity between the target and the pursuers. Then, a fully actuated control law is designed by using the parametric design approach to achieve the target tracking task, which dynamically adjusts pursuers' trajectories. Finally, simulations demonstrate the proposed framework’s localization and tracking performance for targets with dynamic maneuvering capability.
|
| |
| 10:45-11:00, Paper FrA05.6 | |
| A Constraint-Driven Approach to Line Flocking: The V Formation As an Energy-Saving Strategy |
|
| Beaver, Logan E. | Old Dominion University |
| Kroninger, Christopher | U.S. Army Research Laboratory |
| Dorothy, Michael | US Army Research Laboratory |
| Malikopoulos, Andreas A. | Cornell University |
Keywords: Networked control systems, Optimization, Agents-based systems
Abstract: In this article, we present a constraint-driven control algorithm that minimizes the energy consumption of individual agents and yields an emergent V formation. As the formation emerges from the decentralized interaction between agents, our approach is robust to the spontaneous addition or removal of agents to the system. We start from an analytical model for the trailing upwash behind a fixed-wing uncrewed aerial vehicle (UAV), and we derive the optimal air speed for trailing UAVs to maximize their travel endurance. Next, we prove that simply flying at the optimal airspeed will never lead to emergent flocking behavior and propose a new decentralized “anseroid” behavior that yields emergent V formations. We encode these behaviors in a constraint-driven control algorithm that minimizes the locomotive power of each UAV. Finally, we prove that UAVs initialized in an approximate V or echelon formation will converge under our proposed control law, and we demonstrate that this emergence occurs in real time in simulation and in physical experiments with a fleet of Crazyflie quadrotors despite the noise and disturbances inherent in physical systems.
|
| |
| 11:00-11:15, Paper FrA05.7 | |
| Image-Based Leader-Follower Formation Tracking Control of Nonholonomic Mobile Robots with Field-Of-View Constraint |
|
| Zeng, Haifeng | Xiamen University |
| Jiang, Yi | Huazhong University of Science and Technology |
| Yu, Xiao | Xiamen University |
Keywords: Constrained control, Visual servo control, Nonholonomic systems
Abstract: This paper investigates the leader-follower formation tracking control problem for nonholonomic mobile robots under two critical constraints: the unavailability of robots' position measurements and the absence of inter-robot communication. The lack of positional information renders the formation tracking error immeasurable directly, while the communication restriction prevents the follower from acquiring the leader's real-time velocity. To address these challenges, we propose a novel vision-based feedback control scheme that relies exclusively on monocular visual perception with inherent field-of-view (FOV) constraints. The proposed method achieves the desired formation configuration by regulating visual feature points in the image plane, eliminating the need for both relative pose estimation and the leader's velocity information. Based on the deigned barrier functions, the proposed controller guarantees the leader always remains within the follower's visual coverage domain. Experimental results on multiple differential-drive mobile robots illustrate the effectiveness and practical applicability of the proposed control strategy.
|
| |
| 11:15-11:30, Paper FrA05.8 | |
| On the Stabilization of Directed Formation Using Geometric Algebra Approach |
|
| Sahebsara, Farid | George Mason University |
| Green, Mikhalib | Louisiana State University |
| Barbalata, Corina | Louisiana State University |
Keywords: Agents-based systems, Autonomous systems, Lyapunov methods
Abstract: Distance-based formation control on minimally rigid graphs often encounters ambiguous shapes, where agents form incorrect shape formations due to multiple equilibrium points in the error dynamics. Recent studies on distance-based controllers, even those that introduce extra control variables, struggle to resolve this fundamental issue fully, typically requiring specific conditions or failing to manage reflections effectively. In this paper, we address both flip and reflection ambiguities in formation control by reexamining core geometric constraints and integrating them with traditional bearing-based formation control methods. We propose a novel controller that guarantees convergence to the correct formation without imposing any additional conditions. Numerical simulations demonstrate the controller's effectiveness in avoiding formation ambiguities.
|
| |
| FrA06 |
Oceania I |
| Safety Filters for Autonomous Systems I |
Invited Session |
| Chair: Lederer, Armin | ETH Zurich |
| Co-Chair: Li, Ming | KTH Royal Institute of Technology |
| Organizer: Herbert, Sylvia | UC San Diego (UCSD) |
| Organizer: Lederer, Armin | National University of Singapore |
| Organizer: Li, Ming | KTH Royal Institute of Technology |
| Organizer: Liu, Siyuan | Eindhoven University of Technology |
| Organizer: Sun, Zhiyong | Peking University (PKU) |
| |
| 09:30-09:45, Paper FrA06.1 | |
| Backward Control Barrier Certificates (I) |
|
| Murali, Vishnu | University of Colorado Boulder |
| Zamani, Majid | University of Colorado Boulder |
Keywords: Hybrid systems, Optimization
Abstract: This paper introduces a notion of backward control barrier certificates to synthesize safety controllers for deterministic systems. Barrier certificates and control barrier certificates play a fundamental role in the automated design of controllers to ensure the safety of dynamical systems. The simultaneous search for control barrier certificates and their controllers typically face challenges in automation as they involve quantifier alternation between the states (for all states) and the inputs (there exists an input) as well as bilinearity between the unknown certificate and control input. In this paper, we show that one may simultaneously search for both a certificate and controller effectively for deterministic systems via standard sum-of-squares approaches without the need for quantifier alternation. Here, we treat the input as a disturbance and build an invariant set over the unsafe set of states rather the the initial set. This set is invariant in the backward direction rather than forward, and hence we dub these as backward control barrier certificates. Ensuring that the initial set is not in this invariant set guarantees the existence of a controller to ensure safety. We show how one may automate the search for these certificates, and discuss some strategies to implement their corresponding safety controllers. Finally, we demonstrate the efficacy of our approach with some case studies.
|
| |
| 09:45-10:00, Paper FrA06.2 | |
| Incremental Composition of Learned Control Barrier Functions in Unknown Environments (I) |
|
| Lutkus, Paul | University of Southern California |
| Anantharaman, Deepika | None |
| Tu, Stephen | University of Southern California |
| Lindemann, Lars | University of Southern California |
Keywords: Autonomous systems, Nonlinear output feedback, Constrained control
Abstract: We consider the problem of safely exploring a static and unknown environment while learning valid control barrier functions (CBFs) from sensor data. Existing works either assume known environments, target specific dynamics models, or use a-priori valid CBFs, and thus provide limited safety guarantees for general control-affine systems during exploration. We present a method for safely exploring by incrementally composing a global CBF from local CBFs. The challenge here is that local CBFs may not have well-defined end behavior outside their training domain, i.e. local CBFs may be positive (indicating safety) in regions where no training data is available. We show that well-defined end behavior can be obtained when local CBFs are parameterized by compactly-supported radial basis functions. For learning local CBFs, we collect sensor data, e.g. LiDAR capturing obstacles in the environment, and augment it with simulated data from a safe oracle controller. Our work complements recent efforts to learn CBFs from safe demonstrations, where learned safe sets are limited to their training domains, by demonstrating how to grow the safe set over time as more data becomes available. We evaluate our approach on two simulated systems, where our method successfully explores an unknown environment while maintaining safety throughout the entire execution.
|
| |
| 10:00-10:15, Paper FrA06.3 | |
| Risk-Aware Robot Control in Dynamic Environments Using Belief Control Barrier Functions (I) |
|
| Han, Shaohang | KTH Royal Institute of Technology |
| Vahs, Matti | KTH |
| Tumova, Jana | KTH Royal Institute of Technology |
Keywords: Autonomous robots, Stochastic systems, Formal Verification/Synthesis
Abstract: Ensuring safety for autonomous robots operating in dynamic environments can be challenging due to factors such as unmodeled dynamics, noisy sensor measurements, and partial observability. To account for these limitations, it is common to maintain a belief distribution over the true state. This belief could be a non-parametric, sample-based representation to capture uncertainty more flexibly. In this paper, we propose a novel form of Belief Control Barrier Functions (BCBFs) specifically designed to ensure safety in dynamic environments under stochastic dynamics and a sample-based belief about the environment state. Our approach incorporates provable concentration bounds on tail risk measures into BCBFs, effectively addressing possible multimodal and skewed belief distributions represented by samples. Moreover, the proposed method demonstrates robustness against distributional shifts up to a predefined bound. We validate the effectiveness and real-time performance (approximately 1kHz) of the proposed method through two simulated underwater robotic applications: object tracking and dynamic collision avoidance.
|
| |
| 10:15-10:30, Paper FrA06.4 | |
| Control Synthesis for Multiple Reach-Avoid Tasks Via Hamilton-Jacobi Reachability Analysis (I) |
|
| Chen, Yu | Shanghai Jiao Tong University |
| Li, Shaoyuan | Shanghai Jiao Tong University |
| Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Formal Verification/Synthesis, Hybrid systems
Abstract: We investigate the control synthesis problem for continuous-time control-affine systems under a class of multiple reach-avoid (MRA) tasks. Specifically, the MRA task requires the system to reach a series of target regions in a specified order while satisfying state constraints between each pair of target arrivals. This problem is more challenging than standard reach-avoid tasks, as it requires considering the feasibility of future reach-avoid tasks during the planning process. To solve this problem, we define a series of value functions by solving a cascade of time-varying reach-avoid problems characterized by Hamilton-Jacobi variational inequalities. We prove that the super-level set of the final value function computed is exactly the feasible set of the MRA task. Additionally, we demonstrate that the control law can be effectively synthesized by ensuring the non-negativeness of the value functions over time. The effectiveness of the proposed approach is illustrated through two case studies on robot planning problems.
|
| |
| 10:30-10:45, Paper FrA06.5 | |
| A Comparative Study of Artificial Potential Fields and Reciprocal Control Barrier Function-Based Safety Filters (I) |
|
| Li, Ming | KTH Royal Institute of Technology |
| Sun, Zhiyong | Peking University (PKU) |
| Weiland, Siep | Eindhoven Univ. of Tech |
Keywords: Lyapunov methods, Constrained control, Optimization algorithms
Abstract: In this paper, we demonstrate that controllers designed by artificial potential fields (APFs) can be derived from reciprocal control barrier function quadratic program (RCBF-QP) safety filters. By integrating APFs within the RCBF-QP framework, we explicitly establish the relationship between these two approaches. Specifically, we first introduce the concepts of tightened control Lyapunov functions (T-CLFs) and tightened reciprocal control barrier functions (T-RCBFs), each of which incorporates a flexible auxiliary function. We then utilize an attractive potential field as a T-CLF to guide the nominal controller design, and a repulsive potential field as a T-RCBF to formulate an RCBF-QP safety filter. With appropriately chosen auxiliary functions, we show that controllers designed by APFs and those derived by RCBF-QP safety filters are equivalent. Based on this insight, we further generalize the APF-based controllers (equivalently, RCBF-QP safety filter-based controllers) to more general scenarios without restricting the choice of auxiliary functions. Finally, we present a collision avoidance example to clearly illustrate the connection and equivalence between the two methods.
|
| |
| 10:45-11:00, Paper FrA06.6 | |
| Distributed Risk-Sensitive Safety Filters for Uncertain Discrete-Time Systems |
|
| Lederer, Armin | National University of Singapore |
| Noorani, Erfaun | University of Maryland College Park |
| Krause, Andreas | ETH Zurich |
Keywords: Uncertain systems, Distributed control, Constrained control
Abstract: Ensuring safety in multi-agent systems is a significant challenge, particularly in settings where centralized coordination is impractical. In this work, we propose a novel risk-sensitive safety filter for discrete-time multi-agent systems with uncertain dynamics that leverages control barrier functions (CBFs) defined through value functions. Our approach relies on centralized risk-sensitive safety conditions based on exponential risk operators to ensure robustness against model uncertainties. We introduce a distributed formulation of the safety filter by deriving two alternative strategies: one based on worst-case anticipation and another on proximity to a known safe policy. By allowing agents to switch between strategies, feasibility can be ensured. Through detailed numerical evaluations, we demonstrate the efficacy of our approach in maintaining safety without being overly conservative.
|
| |
| 11:00-11:15, Paper FrA06.7 | |
| Secure Safety Filter Design for Sampled-Data Nonlinear Systems under Sensor Spoofing Attacks |
|
| Tan, Xiao | California Institute of Technology |
| Ong, Pio | California Institute of Technology |
| Tabuada, Paulo | University of California at Los Angeles |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Cyber-Physical Security, Resilient Control Systems
Abstract: This paper presents a secure safety filter design for constrained nonlinear systems under sensor spoofing attacks. Existing approaches primarily focus on linear systems which limits their applications in real-world scenarios. In this work, we investigate the extension of our previous results to nonlinear systems. We introduce exact observability maps that provide an abstraction independent of any particular state estimation algorithm, and extend them to a secure version capable of handling sensor attacks. The generalization to nonlinear systems also applies to the relaxed observability case, with slightly relaxed guarantees. More importantly, we propose a secure safety filter design in both exact and relaxed cases, which incorporates secure state estimation and a control barrier function-enabled safety filter. The proposed approach provides theoretical guarantees for enforcing state constraints in the presence of sensor attacks. We numerically validate our analysis on a unicycle vehicle equipped with redundant yet partly compromised sensors.
|
| |
| 11:15-11:30, Paper FrA06.8 | |
| Analytical Construction of CBF-Based Safety Filters for Simultaneous State and Input Constraints |
|
| Fisher, Peter | Massachusetts Institute of Technology |
| Annaswamy, Anuradha M. | Massachusetts Inst. of Tech |
Keywords: Constrained control, Lyapunov methods
Abstract: We consider the problem of meeting multiple state and input constraints simultaneously in nth-order integrator systems for any n >= 1. Our approach consists of a series of recursive filters which result in analytic high-order CBFs for each state constraint, where the recursion is specially designed to enable compatibility of constraints. We further provide a tuning algorithm for the filter parameters and show that it converges in finitely many steps. While we focus our discussions mainly on problems with a single control input, we discuss extensions to multiple inputs as well and validate in simulation on a linearized quadrotor.
|
| |
| FrA07 |
Capri I |
| Recent Achievement and Perspective Directions in Sliding Mode Control I |
Invited Session |
| Chair: Hsu, Liu | COPPE/UFRJ |
| Co-Chair: Polyakov, Andrey | Inria, Univ. Lille |
| Organizer: Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Organizer: Hsu, Liu | COPPE/UFRJ |
| |
| 09:30-09:45, Paper FrA07.1 | |
| Composite Lyapunov Function-Based Integral Sliding--Mode Control for Constrained and Uncertain Linear Systems (I) |
|
| Salgado, Ivan | Instituto Politecnico Nacional |
| Mera, Manuel | Esime Upt Ipn |
| Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Keywords: Variable-structure/sliding-mode control
Abstract: This article proposes a nonlinear solution for stabilizing uncertain linear systems subjected to state constraints and external disturbances. First, a nonlinear control strategy based on Integral Sliding Modes deals with the problem of bounded coupled disturbances. Then, a linear controller provides a solution for uncoupled disturbances and state constraints. The second stage of the controller applies a Composite Lyapunov Function to derive the gains needed to enforce asymptotic convergence to zero while fulfilling state constraints and delimiting the safe set where the system trajectories do not violate these restrictions. Compared to traditional quadratic Barrier functions, Composite Lyapunov functions offer a less conservative estimation of a safety set.
|
| |
| 09:45-10:00, Paper FrA07.2 | |
| Finite-Dimensional Filippov Method for Infinite-Dimensional SMC System (I) |
|
| Polyakov, Andrey | Inria, Univ. Lille |
Keywords: Variable-structure/sliding-mode control
Abstract: The paper deals with a well-posedness analysis for discontinuous infinite-dimensional systems, which can be regularized by means of the conventional (finite-dimensional) Filippov method. The study is inspired by sliding mode control theory. Sliding mode controllers for infinite-dimensional systems with finite- and infinite-dimensional sliding surfaces are designed. Theoretical results are supported by an example of an output-based sliding mode control for heat system.
|
| |
| 10:00-10:15, Paper FrA07.3 | |
| Output Feedback Control of Nonlinear Systems Via Lipschitz Continuous Sliding Modes |
|
| Texis-Loaiza, Oscar | Brandenburg University of Technology Cottbus-Senftenberg (BTU C |
| Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
| Estrada, Manuel A. | Facultad De Ingeniería, Universidad Nacional Autónoma De México |
| Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Levant, Arie | Tel-Aviv University |
Keywords: Nonlinear output feedback, Variable-structure/sliding-mode control, Lyapunov methods
Abstract: The proposed Lipschitz-continuous arbitrary- relative-degree output-feedback control theoretically exactly compensates for Lipschitz-continuous non-vanishing perturbations. The finite-time stability of the closed-loop system is proved via Lyapunov methods, exploiting the weighted homogeneity features of the control and observer. Moreover, the observer and the controller can be independently designed, establishing a specific separation principle. The feasibility of the method is demonstrated by simulation.
|
| |
| 10:15-10:30, Paper FrA07.4 | |
| A Second-Order Observer Based on Dissipativity and Homogeneity Properties for Uncertain Nonlinear Systems (I) |
|
| Texis-Loaiza, Oscar | Brandenburg University of Technology Cottbus-Senftenberg (BTU C |
| Mercado Uribe, José Angel | Brandenburgische Technische Universität Cottbus-Senftenberg |
| Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
| Schiffer, Johannes | Brandenburg University of Technology |
Keywords: Variable-structure/sliding-mode control
Abstract: In this paper, we propose a global second-order dissipative homogeneous observer (DH-observer) that can estimate the states of an uncertain nonlinear system with unknown inputs in finite time. The system must be strongly observable with respect to unknown inputs (UIs), uniformly in the known input. To handle the UIs and non-Lipschitz nonlinearities, the proposed observer combines dissipative and homogeneous properties. The convergence of the DH-observer to the system states is proven through a Lyapunov function. Finally, to showcase the effectiveness of the proposed method, the paper includes an academic example.
|
| |
| 10:30-10:45, Paper FrA07.5 | |
| Regularization of Non-Overshooting Quasi-Continuous Sliding Mode Control for Chattering Suppression at Equilibrium |
|
| Ruderman, Michael | University of Agder |
| Efimov, Denis | Inria |
Keywords: Nonlinear output feedback, Variable-structure/sliding-mode control, Stability of nonlinear systems
Abstract: Robust finite-time feedback controller, recently introduced by the authors for second-order systems, can be seen as a non-overshooting quasi-continuous sliding mode control. The paper proposes a regularization scheme to suppress inherent chattering due to discontinuity in the origin, in favor of practical applications. A detailed analysis with ISS and iISS proofs are provided along with supporting numerical results.
|
| |
| 10:45-11:00, Paper FrA07.6 | |
| An Accelerated Heavy-Ball-Based Adaptive Observer for Uncertain Nonlinear Systems |
|
| Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
| Efimov, Denis | Inria |
| Ushirobira, Rosane | Inria |
Keywords: Adaptive systems, Estimation, Identification
Abstract: This paper presents the design of an accelerated adaptive observer for simultaneously estimating states and constant parameters in a class of uncertain nonlinear systems subject to external disturbances. The proposed observer consists of a high-order sliding-mode (HOSM) observer for state estimation and a heavy-ball-based algorithm for identifying unknown constant parameters. Despite the presence of bounded external disturbances that do not satisfy a relative degree one condition, the observer locally estimates both state and parameter vectors in finite time. The closed-loop stability analysis is performed using a Lyapunov function approach, input-to-state stability properties, and a finite-time small-gain theorem. The effectiveness of the proposed estimation algorithm is demonstrated through simulation results.
|
| |
| 11:00-11:15, Paper FrA07.7 | |
| On the Discretization of the Implicit Lyapunov Function-Based Control |
|
| Labbadi, Moussa | Aix-Marseille University |
| Efimov, Denis | Inria |
Keywords: Lyapunov methods, Numerical algorithms, Stability of linear systems
Abstract: An implicit Euler discretization scheme of the control law from [1] is proposed, which preserves all the main properties of the continuous-time counterpart (hyperexponential rate of convergence, compensation of matched disturbances or their attenuation, robustness to measurement noises). Additionally, the sample-and-hold implementation of the control, along with its stability analysis, is provided.
|
| |
| 11:15-11:30, Paper FrA07.8 | |
| Robust Nonsingular Predefined-Time Terminal Sliding Mode Control for Perturbed Chains of Integrators |
|
| Deng, Yang | Tsinghua University |
| Moulay, Emmanuel | Université De Poitiers |
| Lechappe, Vincent | INSA Lyon |
| Chen, Zhang | Tsinghua University |
| Liang, Bin | Tsinghua University |
| Plestan, Franck | Ecole Centrale De Nantes-LS2N |
Keywords: Variable-structure/sliding-mode control, Robust control, Nonlinear systems
Abstract: This paper develops a nonsingular predefined-time terminal sliding mode control scheme based on smooth sliding surfaces with artificial time-delays. The proposed approach can achieve predefined-time control for perturbed chains of integrators with matched and mismatched disturbances. Especially, for double integrators, an initial-condition-based sufficient condition is provided to preserve the prescribed-time stability in the presence of bounded matched disturbances.
|
| |
| FrA08 |
Oceania V |
| Reinforcement Learning III |
Regular Session |
| Chair: Mahajan, Aditya | McGill University |
| Co-Chair: Stella, Leonardo | University of Birmingham |
| |
| 09:30-09:45, Paper FrA08.1 | |
| Stochastic Reinforcement Learning with Stability Guarantees for Control of Unknown Nonlinear Systems |
|
| Quartz, Thanin | University of Waterloo |
| Zhou, Ruikun | University of Waterloo |
| De sterck, Hans | University of Waterloo |
| Liu, Jun | University of Waterloo |
Keywords: Reinforcement learning, Stability of nonlinear systems, Optimal control
Abstract: Designing a stabilizing controller for nonlinear systems is a challenging task, especially for high-dimensional problems with unknown dynamics. Traditional reinforcement learning algorithms applied to stabilization tasks tend to drive the system close to the equilibrium point. However, these approaches often fall short of achieving true stabilization and result in persistent oscillations around the equilibrium point. In this work, we propose a reinforcement learning algorithm that stabilizes the system by learning a local linear representation of the dynamics. The main component of the algorithm is integrating the learned gain matrix directly into the neural policy. We demonstrate the effectiveness of our algorithm on several challenging high-dimensional dynamical systems. In these simulations, our algorithm outperforms popular reinforcement learning algorithms, such as soft actor-critic (SAC) and proximal policy optimization (PPO), and successfully stabilizes the system. To support the numerical results, we provide a theoretical analysis of the feasibility of the learned algorithm for both deterministic and stochastic reinforcement learning settings, along with a convergence analysis of the proposed learning algorithm. Furthermore, we verify that the learned control policies indeed provide asymptotic stability for the nonlinear systems.
|
| |
| 09:45-10:00, Paper FrA08.2 | |
| Convergence of Regularized Agent-State Based Q-Learning in POMDPs |
|
| Sinha, Amit | McGill University |
| Geist, Matthieu | Earth Species Project |
| Mahajan, Aditya | McGill University |
Keywords: Reinforcement learning, Markov processes, Stochastic optimal control
Abstract: In this paper, we present a framework to understand the convergence of commonly used Q-learning reinforcement learning algorithms in practice. Two salient features of such algorithms are: (i)~the Q-table is recursively updated using an agent state (such as the state of a recurrent neural network) which is not a belief state or an information state and (ii)~policy regularization is often used to encourage exploration and stabilize the learning algorithm. We investigate the simplest form of such Q-learning algorithms which we call regularized agent-state-based Q-learning (RASQL) and show that it converges under mild technical conditions to the fixed point of an appropriately defined regularized MDP, which depends on the stationary distribution induced by the behavioral policy. We also show that a similar analysis continues to work for a variant of RASQL that learns periodic policies. We present numerical examples to illustrate that the empirical convergence behavior matches with the proposed theoretical limit.
|
| |
| 10:00-10:15, Paper FrA08.3 | |
| Adversarial Decoy Placement for Strategic State Perturbations in Artificial Intelligence Driven Defense |
|
| Kazeminajafabadi, Armita | Northeastern University |
| Everett, Michael | Northeastern University |
| Lan, Tian | George Washington University |
| Bastian, Nathaniel D. | United States Military Academy |
| Imani, Mahdi | Northeastern University |
Keywords: Markov processes, Reinforcement learning, Computer/Network Security
Abstract: With the growing sophistication of cybersecurity threats, artificial intelligence driven defense solutions have gained significant attention, particularly reinforcement learning based approaches, which make sequential decisions based on the latest network information. Despite their success in adaptability, scalability, and real-time response, these methods remain highly vulnerable to adversarial deception—strategic manipulations that distort the defender’s perception of the network security state. This paper introduces a budget-constrained adversarial model, in which an attacker deploys decoys to mislead the defender about the true state of network compromises, disrupting effective and timely decision-making. Deception is formulated as an augmented Markov Decision Process, allowing an intelligent adversary to account for the defender’s perception of network compromises and anticipate defensive actions. An optimal deception policy is derived, enabling the strategic placement of decoys to maximize long-term security degradation while evading detection. A worst-case theoretical bound is established to quantify the long-term impact of deception on network security. Numerical experiments demonstrate that even minimal decoy-based deception significantly weakens network security, particularly against deterministic defense policies.
|
| |
| 10:15-10:30, Paper FrA08.4 | |
| PrELIN: Provably Efficient Local-Information Networked Multi-Agent Reinforcement Learning |
|
| Chu, Ziyue | University of Birmingham |
| Jose, Sharu Theresa | University of Birmingham |
| Stella, Leonardo | University of Birmingham |
Keywords: Reinforcement learning, Statistical learning, Decentralized control
Abstract: Recently, there has been a surge of interest in decentralized learning approaches to tackle complex collaborative tasks in multi-agent systems. One of the most promising approaches is multi-agent reinforcement learning (MARL). Yet, as the number of agents becomes larger, the sample complexity in MARL increases exponentially, making scalability a fundamental issue. Networked MARL algorithms can address this issue by leveraging a communication network for information exchange between the agents. For homogeneous network MARL, previous research established a regret upper-bound sqrt{MH^4SAT}. Recent approaches rely on global knowledge about the structure of the communication network, which poses a serious limitation when it is not known or changes depending on the task. In this paper, we overcome this limitation by proposing a novel networked MARL algorithm with an upper-confidence bound (UCB) exploration strategy, called provably efficient local-information networked (PrELIN) MARL, that does not require any global information but only relies on the local interactions between the agents. Furthermore, we derive the regret and sample complexity for our algorithm and show that the regret bound may still remain sublinear.
|
| |
| 10:30-10:45, Paper FrA08.5 | |
| A Multi-Agent Reinforcement Learning Approach Based on Local Shapley Value |
|
| Shi, Jinxuan | Shanghai Jiao Tong University |
| Liu, Zhe | Shanghai Jiao Tong University |
| Jin, Kefan | Shanghai Jiaotong University |
| Wang, Hesheng | Shanghai Jiao Tong University |
Keywords: Reinforcement learning, Cooperative control, Autonomous robots
Abstract: Multi-Agent Reinforcement Learning (MARL) has been extensively applied in cooperative control tasks, yet credit assignment remains a critical challenge. Existing approaches predominantly employ global reward sharing mechanisms or rely on monolithic neural network architectures, which may lead to inaccurate credit attribution for individual agents and consequently result in suboptimal learning efficiency. In this paper, we propose the Local Shapley Policy Optimization (LSPO) method to address the multi-agent credit assignment problem. The Shapley values are introduced to reasonably assess the contribution of each agent. Furthermore, we present a method for calculating local Shapley values to address the issue of dimensional explosion, and we theoretically demonstrate that local Shapley values can effectively approximate global Shapley values. Extensive experiments conducted on several cooperative multi-agent benchmarks demonstrate that our approach achieves superior task performance compared to existing state-of-the-art methods. Empirical results reveal that the proposed LSPO framework substantially enhances both learning efficiency and coordination capabilities in complex multi-agent environments.
|
| |
| 10:45-11:00, Paper FrA08.6 | |
| Swarm-Level Task Learning Via Generalized Moments in Reinforcement Learning with Reward Machines |
|
| Meshkat Alsadat, Shayan | Arizona State University |
| Xu, Zhe | Arizona State University |
Keywords: Reinforcement learning, Markov processes
Abstract: Swarm-level task learning provides a basis for learning complex tasks in swarm systems using swarm-level features. We propose a method, SwaRM-L, to learn reward machines (RMs) that encode non-Markovian reward functions. We use reward machines to specify the task and its temporal structure. Our approach enables a swarm of agents to learn an RM that eventually becomes equivalent to ground truth RM (i.e., the specified task) in this environment (agents have no access to the ground truth RM and only learn it through environment interaction). We use generalized moments (GMs) to characterize swarm features and estimate the RM state. Each agent maintains an estimated GM to contribute to the collective learning process. The agents use a gossip algorithm to communicate with neighbors and update their estimated GMs. We prove that our method converges to an optimal policy and learns an equivalent RM to the ground truth RM within the environment. We evaluate our proposed method in three case studies involving forty agents with homogeneous dynamics. Our results demonstrate the effectiveness of our method in learning complex swarm behaviors.
|
| |
| 11:00-11:15, Paper FrA08.7 | |
| Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise |
|
| Chandak, Siddharth | Stanford University |
| Haque, Shaan Ul | Georgia Tech |
| Bambos, Nicholas | Stanford University |
Keywords: Reinforcement learning, Stochastic optimal control, Optimization algorithms
Abstract: Two-time-scale Stochastic Approximation (SA) is an iterative algorithm with applications in reinforcement learning and optimization. Prior finite time analysis of such algorithms has focused on fixed point iterations with mappings contractive under Euclidean norm. Motivated by applications in reinforcement learning, we give the first mean square bound on non linear two-time-scale SA where the iterations have arbitrary norm contractive mappings and Markovian noise. We show that the mean square error decays at a rate of O(1/n^{2/3}) in the general case, and at a rate of O(1/n) in a special case where the slower timescale is noiseless. Our analysis uses the generalized Moreau envelope to handle the arbitrary norm contractions and solutions of Poisson equation to deal with the Markovian noise. By analyzing the SSP Q-Learning algorithm, we give the first O(1/n) bound for an algorithm for asynchronous control of MDPs under the average reward criterion. We also obtain a rate of O(1/n) for Q-Learning with Polyak-averaging and provide an algorithm for learning Generalized Nash Equilibrium (GNE) for strongly monotone games which converges at a rate of O(1/n^{2/3}).
|
| |
| 11:15-11:30, Paper FrA08.8 | |
| Action-Priority Driven Policy Optimization for Flexible Job Shop Scheduling Problem |
|
| Zheng, Wenjun | The Chinese Unversity of Hong Kong, Shenzhen |
| Cai, Weilin | The Chinese University of Hong Kong, Shenzhen |
| Qu, Wei | The Chinese University of Hong Kong, Shenzhen |
| Mao, Jianfeng | The Chinese University of Hong Kong, Shenzhen |
Keywords: Manufacturing systems and automation, Reinforcement learning, Discrete event systems
Abstract: Efficient scheduling significantly impacts productivity and operational performance, particularly in flexible resource allocation scenarios. The Flexible Job Shop Scheduling Problem (FJSP) extends traditional scheduling by allowing each operation to be processed on multiple machines, enhancing practical applicability but increasing computational complexity. Existing methods, ranging from exact solvers to heuristic/metaheuristic algorithms, either incur high computational costs or yield suboptimal solutions. Recent reinforcement learning (RL)-based approaches attempt a balance between optimality and efficiency; however, intrinsic characteristics of FJSP remain underutilized for better solution quality. This paper introduces the Action-Priority Driven Policy Optimization (APDPO) method, explicitly designed to address three critical challenges: achieving superior solutions, improving interpretability via an intuitive policy representation, and enhancing computational efficiency through direct, gradient-free policy updates. A policy improvement theorem ensures performance enhancement under specified conditions. Comparative evaluations confirm that APDPO outperforms existing approaches by delivering better solution quality, reduced computational requirements, and greater decision transparency.
|
| |
| FrA09 |
Oceania VIII |
| Statistical Learning |
Regular Session |
| Chair: Befekadu, Getachew K. | The Catholic University of America |
| Co-Chair: Romao, Licio | Technical University of Denmark |
| |
| 09:30-09:45, Paper FrA09.1 | |
| Off-Policy Evaluation Using Information Borrowing and Context-Based Switching |
|
| Dasgupta, Sutanoy | Intuit |
| Niu, Yabo | University of Houston |
| Panaganti, Kishan | Caltech |
| Kalathil, Dileep | Texas A&M University (TAMU) |
| Pati, Debdeep | University of Wisconsin Madison |
| Mallick, Bani K. | Texas a & M University |
Keywords: Statistical learning, Estimation, Reinforcement learning
Abstract: We consider the off-policy evaluation (OPE) problem in contextual bandits, where the goal is to estimate the value of a target policy using the data collected by a logging policy. Most popular approaches to the OPE are variants of the doubly robust (DR) estimator obtained by combining a direct method (DM) estimator and a correction term involving the inverse propensity score (IPS). Existing algorithms primarily focus on strategies to reduce the variance of the DR estimator arising from large IPS. We propose a new approach called the Doubly Robust with Information borrowing and Context-based switching (DR-IC) estimator that focuses on reducing both bias and variance. The DR-IC estimator replaces the standard DM estimator with a parametric reward model that borrows information from the `closer’ contexts through a correlation structure that depends on the IPS. The DR-IC estimator also adaptively interpolates between this modified DM estimator and a modified DR estimator based on a context-specific switching rule. We give provable guarantees on the performance of the DR-IC estimator. We also demonstrate the superior performance of the DR-IC estimator compared to the state-of-the-art OPE algorithms on a number of benchmark problems.
|
| |
| 09:45-10:00, Paper FrA09.2 | |
| Bridging Conformal Prediction and Scenario Optimization |
|
| O'Sullivan, Niall | University of Oxford |
| Romao, Licio | Technical University of Denmark |
| Margellos, Kostas | University of Oxford |
Keywords: Randomized algorithms, Statistical learning, Optimization algorithms
Abstract: Conformal prediction and scenario optimization constitute two important classes of statistical learning frameworks to certify decisions made using data. They have found numerous applications in control theory, machine learning and robotics. Despite intense research in both areas, and apparently similar results, a clear connection between these two frameworks has not been established. By focusing on the so-called vanilla conformal prediction, we show rigorously how to choose appropriate score functions and set predictor map to recover well-known bounds on the probability of constraint violation associated with scenario programs. We also show how to treat ranking of nonconformity scores as a one-dimensional scenario program with discarded constraints, and use such connection to recover vanilla conformal prediction guarantees on the validity of the set predictor. We also capitalize on the main developments of the scenario approach, and show how we could analyze calibration conditional conformal prediction under this lens. Our results establish a theoretical bridge between conformal prediction and scenario optimization.
|
| |
| 10:00-10:15, Paper FrA09.3 | |
| A Finite-Sample Bound for Identifying Partially Observed Linear Switched Systems from a Single Trajectory |
|
| Racz, Daniel | HUN-REN SZTAKI |
| Petreczky, Mihaly | UMR CNRS 9189, Ecole Centrale De Lille |
| Daroczy, Balint | Institute for Computer Science and Control (SZTAKI), Hungarian R |
Keywords: Statistical learning, Stochastic systems
Abstract: We derive a finite-sample probabilistic bound on the parameter estimation error of a system identification algorithm for Linear Switched Systems. The algorithm estimates Markov parameters from a single trajectory and applies a variant of the Ho-Kalman algorithm to recover the system matrices. Our bound guarantees statistical consistency under the assumption that the true system exhibits quadratic stability. The proof leverages the theory of weakly dependent processes. To the best of our knowledge, this is the first finite-sample bound for this algorithm in the single-trajectory setting.
|
| |
| 10:15-10:30, Paper FrA09.4 | |
| PAC Learnability of Scenario Decision-Making Algorithms: Necessary Conditions and Sufficient Conditions |
|
| Berger, Guillaume O. | UCLouvain |
| Jungers, Raphaël M. | University of Louvain |
Keywords: Statistical learning, Data driven control, Optimization
Abstract: We investigate the Probably Approximately Correct (PAC) property of scenario decision algorithms, which refers to their ability to produce decisions with an arbitrarily low risk of violating unknown safety constraints, provided a sufficient number of realizations of these constraints are sampled. While several PAC sufficient conditions for such algorithms exist in the literature---such as the finiteness of the VC dimension of their associated classifiers, or the existence of a compression scheme---it remains unclear whether these conditions are also necessary. In this work, we demonstrate through counterexamples that these conditions are not necessary in general. These findings stand in contrast to binary classification learning, where analogous conditions are both sufficient and necessary for a family of classifiers to be PAC. Furthermore, we extend our analysis to stable scenario decision algorithms, a broad class that includes practical methods like scenario optimization. Even under this additional assumption, we show that the aforementioned conditions remain unnecessary. Furthermore, we introduce a novel quantity, called the dVC dimension, which serves as an analogue to the VC dimension for scenario decision algorithms. We prove that the finiteness of this dimension is a PAC necessary condition for scenario decision algorithms. This allows to (i) guide algorithm users and designers to recognize algorithms that are not PAC, and (ii) contribute to a comprehensive characterization of PAC scenario decision algorithms.
|
| |
| 10:30-10:45, Paper FrA09.5 | |
| A Naive Aggregation Algorithm for Improving Generalization in a Class of Learning Problems |
|
| Befekadu, Getachew K. | The Catholic University of America |
Keywords: Learning, Estimation, Statistical learning
Abstract: In this brief paper, we present a naive aggregation algorithm for a typical learning problem with expert advice setting, in which the task of improving generalization, i.e., model validation, is embedded in the learning process as a sequential decision-making problem. In particular, we consider a class of learning problem of point estimations for modeling high-dimensional nonlinear functions, where a group of experts update their parameter estimates using the discrete-time version of gradient systems, with small additive noise term, guided by the corresponding subsample datasets obtained from the original dataset. Here, our main objective is to provide conditions under which such an algorithm will sequentially determine a set of mixing distribution strategies used for aggregating the experts' estimates that ultimately leading to an optimal parameter estimate, i.e., as a consensus solution for all experts, which is better than any individual expert's estimate in terms of improved generalization or learning performances. Finally, as part of this work, we present some numerical results for a typical case of nonlinear regression problem.
|
| |
| 10:45-11:00, Paper FrA09.6 | |
| Derandomizing Simultaneous Confidence Regions for Band-Limited Functions by Improved Norm Bounds and Majority-Voting Schemes |
|
| Csáji, Balázs Cs. | HUN-REN SZTAKI |
| Horváth, Bálint | SZTAKI |
Keywords: Statistical learning, Identification, Stochastic systems
Abstract: Band-limited functions are fundamental objects that are widely used in systems theory and signal processing. In this paper we refine a recent nonparametric, nonasymptotic method for constructing simultaneous confidence regions for band-limited functions from noisy input-output measurements, by working in a Paley-Wiener reproducing kernel Hilbert space. Kernel norm bounds are tightened using a uniformly-randomized Hoeffding's inequality for small samples and an empirical Bernstein bound for larger ones. We derive an approximate threshold, based on the sample size and how informative the inputs are, that governs which bound to deploy. Finally, we apply majority voting to aggregate confidence sets from random subsamples, boosting both stability and region size. We prove that even per-input aggregated intervals retain their simultaneous coverage guarantee. These refinements are also validated through numerical experiments.
|
| |
| 11:00-11:15, Paper FrA09.7 | |
| Neural Contextual Bandits under Delayed Feedback Constraints |
|
| Moghimi, Mohammadali | University of Birmingham |
| Jose, Sharu Theresa | University of Birmingham |
| Moothedath, Shana | Iowa State University |
Keywords: Learning, Statistical learning, Reinforcement learning
Abstract: This paper presents a new algorithm for neural contextual bandits (CBs) that addresses the challenge of delayed reward feedback, where the reward for a chosen action is revealed after a random, unknown delay. This scenario is common in applications such as online recommendation systems and clinical trials, where reward feedback is delayed because the outcomes or results of a user’s actions (such as recommendations or treatment responses) take time to manifest and be measured. The proposed algorithm, called textit{Delayed NeuralUCB}, uses upper confidence bound (UCB)-based exploration strategy. Under the assumption of independent and identically distributed sub-exponential reward delays, we derive an upper bound on the cumulative regret over T-length horizon, that scales as O(tilde{d} sqrt{T log T}+ tilde{d}^{3/2} D_+log(T)^{3/2}) where tilde{d} denotes the effective dimension of the neural tangent kernel matrix, and D_+ depends on the expected delay Ebb[tau]. We further consider a variant of the algorithm, called Delayed NeuralTS, that uses Thompson Sampling based exploration. Numerical experiments on real-world datasets, such as MNIST and Mushroom, along with comparisons to benchmark approaches, demonstrate that the proposed algorithms effectively manage varying delays and are well-suited for complex real-world scenarios.
|
| |
| 11:15-11:30, Paper FrA09.8 | |
| Model Selection for Inverse Reinforcement Learning Via Structural Risk Minimization |
|
| Qu, Chendi | Shanghai Jiao Tong University |
| He, Jianping | Shanghai Jiao Tong University |
| Duan, Xiaoming | Shanghai Jiao Tong University |
| Chen, Jiming | Zhejiang University |
Keywords: Reinforcement learning, Optimal control, Statistical learning
Abstract: Inverse reinforcement learning (IRL) usually assumes the reward function model is pre-specified as a weighted sum of features and estimates the weighting parameters only. However, how to select features and determine a proper reward model is nontrivial and experience-dependent. A simplistic model is less likely to explain the intention, while a model with high complexity leads to substantial computation cost and potential overfitting. This paper addresses this trade-off in the model selection for IRL problems by introducing the structural risk minimization (SRM) framework from statistical learning. SRM selects an optimal reward model from a hypothesis set minimizing both estimation error and complexity. To formulate an SRM scheme for IRL, we estimate the policy gradient from given demonstration as the empirical risk, and establish the upper bound of Rademacher complexity as the penalty for hypothesis function classes. The SRM learning guarantee is further presented. In particular, we provide the explicit form for the linear weighted sum setting. Simulations demonstrate the performance and efficiency of our algorithm.
|
| |
| FrA10 |
Oceania VII |
| Distributed and Decentralized Control IV |
Regular Session |
| Chair: Gasparri, Andrea | Roma Tre University |
| Co-Chair: Moreira, Marcos V. | Universidade Federal Do Rio De Janeiro |
| |
| 09:30-09:45, Paper FrA10.1 | |
| Distributed Resource Allocation for Human-Autonomy Teaming under Coupled Constraints |
|
| Yao, Yichen | George Mason University |
| Mbagna-Nanko, Ryan | Clemson University |
| Wang, Yue | Clemson University |
| Wang, Xuan | George Mason University |
Keywords: Human-in-the-loop control, Agents-based systems, Distributed control
Abstract: Abstract— This paper studies the optimal resource allocation problem within a multi-agent network composed of both autonomous agents and humans. The main challenge lies in the globally coupled constraints that link the decisions of autonomous agents with those of humans. To address this, we propose a novel reformulation that transforms these coupled constraints into decoupled local constraints defined over the system’s communication graph. Building on this reformulation, and incorporating a human response model that captures human-robot interactions while accounting for individual pref- erences and biases, we develop a fully distributed algorithm. This algorithm guides the states of the autonomous agents to equilibrium points which, when combined with the human responses, yield a globally optimal resource allocation. We provide both theoretical analysis and numerical simulations to validate the effectiveness of the proposed approach.
|
| |
| 09:45-10:00, Paper FrA10.2 | |
| Complete Decentralization of Linear Quadratic Gaussian Control for the Discrete Wave Equation |
|
| McCurdy, Addie | University of Colorado Boulder |
| Jensen, Emily | University of Colorado, Boulder |
Keywords: Decentralized control, Distributed control, Optimal control
Abstract: The linear quadratic Gaussian (LQG) control problem for the linear wave equation on the unit circle with fully distributed actuation and partial state measurements is considered. An analytical solution to a spatial discretization of the problem is obtained. The main result of this work illustrates that for specific parameter values, the optimal LQG policy is completely decentralized, meaning only a measurement at spatial location i is needed to compute an optimal control signal to actuate at this location. The relationship between performance and decentralization as a function of parameters is explored. Conditions for complete decentralization are related to metrics of kinetic and potential energy quantities and control effort.
|
| |
| 10:00-10:15, Paper FrA10.3 | |
| Scalable Power Management of Data Centers Via Proximal Gradient Optimization |
|
| Zeger, Emi | Stanford University |
| Bambos, Nicholas | Stanford University |
| Pilanci, Mert | Stanford |
Keywords: Decentralized control, Iterative learning control, Optimization
Abstract: Data centers are hitting a bottleneck in available power as the demand for computations increases, and sustainable power management has emerged as a key challenge in green computing. We propose algorithms to scalably manage power across servers and solutions to guarantee and hasten convergence in the power trajectory. This makes power use more efficient and stable, and provides a broad class of pricing functions to discourage excessive power consumption. A foundational framework is established to optimizing power-aware servers. We provide proofs of convergence with guarantees on rates. Simulations support our theory.
|
| |
| 10:15-10:30, Paper FrA10.4 | |
| The Price of Simplicity: Analyzing Decoupled Policies for Multi-Location Inventory Control |
|
| John, Yohan | University of California, Santa Barbara |
| Shah, Vade | University of California, Santa Barbara |
| Preiss, James | University of California, Santa Barbara |
| Alizadeh, Mahnoosh | University of California Santa Barbara |
| Marden, Jason R. | University of California, Santa Barbara |
Keywords: Decentralized control, Optimal control, Building and facility automation
Abstract: What is the performance cost of using simple, decoupled control policies in inherently coupled systems? Motivated by industrial refrigeration systems, where centralized compressors exhibit economies of scale yet traditional control employs decoupled room-by-room temperature regulation, we address this question through the lens of multi-location inventory control. Here, a planner manages multiple inventories to meet stochastic demand while minimizing costs that are coupled through nonlinear ordering functions reflecting economies of scale. Our main contributions are: (i) a surprising equivalence result showing that optimal stationary base-stock levels for individual locations remain unchanged despite coupling when restricting attention to decoupled strategies; (ii) tight performance bounds for simple decoupled policies relative to optimal coupled policies, revealing that the worst-case ratio depends solely on the degree of nonlinearity in the ordering cost function; and (iii) an analysis of a practical online algorithm that achieves competitive performance without solving complex dynamic programs. Numerical simulations demonstrate that while decoupled policies significantly outperform their worst-case guarantees in typical scenarios, they still exhibit meaningful suboptimality compared to fully coordinated strategies. These results provide actionable guidance for system operators navigating the trade-off between control complexity and operational efficiency in coupled systems.
|
| |
| 10:30-10:45, Paper FrA10.5 | |
| Average Consensus Over Time-Varying Directed Graphs with Broadcast Communication |
|
| Oliva, Gabriele | University Campus Bio-Medico of Rome |
| Gasparri, Andrea | Roma Tre University |
| Hadjicostis, Christoforos N. | University of Cyprus |
Keywords: Decentralized control, Distributed control, Time-varying systems
Abstract: This paper proposes a novel protocol for achieving arithmetic average consensus over time-varying directed graphs using broadcast-only communication, where agents cannot selectively address out-neighbors. The proposed approach extends existing methods to compute the dominant left eigenvector of an infinite product of time-varying row-stochastic matrices in a fully distributed manner. By leveraging a local rescaling strategy, our algorithm ensures that agents autonomously compute correction offsets that drive the consensus value toward the arithmetic average of their initial states. The method circumvents the impractical requirement of point-to-point communication and eliminates the need for column stochastic normalization of the weights, which requires each node to know its local out-neighborhood; the proposed method only imposes a mild requirement on knowing an upper bound on the total number of agents. Theoretical convergence guarantees are provided, and simulations confirm the solution’s effectiveness.
|
| |
| 10:45-11:00, Paper FrA10.6 | |
| Principled Learning-To-Communicate with Quasi-Classical Information Structures |
|
| Liu, Xiangyu | University of Maryland, College Park |
| You, Haoyi | University of Maryland, College Park |
| Zhang, Kaiqing | University of Maryland, College Park |
Keywords: Decentralized control, Stochastic optimal control, Reinforcement learning
Abstract: Learning-to-Communicate (LTC) in partially observable environments has emerged and received increasing attention in deep multi-agent reinforcement learning, where the control and communication strategies are jointly learned. On the other hand, the impact of communication has been extensively studied in control theory. In this paper, we seek to formalize and better understand LTC by bridging these two lines of work, through the lens of information structures (ISs). To this end, we formalize LTC in decentralized partially observable Markov decision processes (Dec-POMDPs) under the common-information-based framework from decentralized stochastic control, and classify LTC problems based on the ISs before (additional) information sharing. We first show that non-classical LTCs are computationally intractable in general, and thus focus on quasi-classical (QC) LTCs. We then propose a series of conditions for QC LTCs, violating which can cause computational hardness in general. Further, we develop provable planning and learning algorithms for QC LTCs, and show that some examples of QC LTCs satisfying the above conditions can be solved with quasi-polynomial time and samples. Along the way, we also establish some relationship between (strictly) QC IS and the condition of having strategy-independent common-information-based beliefs (SI-CIBs), as well as solving Dec-POMDPs without computationally intractable oracles but beyond those with the SI-CIB condition, which may be of independent interest.
|
| |
| 11:00-11:15, Paper FrA10.7 | |
| Polynomial Time Verification of Decentralized State-Based Transparency |
|
| Mayer, Patrícia | Federal University of Santa Catarina |
| Cabral, Felipe G. | Federal University of Santa Catarina |
| Lima, Publio Macedo Monteiro | Universidade Federal De Santa Catarina |
| Moreira, Marcos V. | Universidade Federal Do Rio De Janeiro |
Keywords: Discrete event systems, Automata
Abstract: Recently, the definition of Property-Based Transparency has been proposed in the literature for Discrete Event Systems. Property-Based Transparency is a utility notion that establishes that a legitimate receiver must know if a given property of interest is true in the system before it becomes false. Since the satisfaction of the property of interest is associated with useful states of the system, we call the proposed definition of transparency with respect to the set of useful states State-Based Transparency (ST). In this paper, the ST property is generalized to consider a decentralized approach, leading to the definition of state-based co-transparency. A polynomial-time algorithm to verify state-based co-transparency is proposed. The verification algorithm proposed in this paper can also be applied to the centralized case.
|
| |
| 11:15-11:30, Paper FrA10.8 | |
| Several Performance Bounds on Decentralized Online Optimization Are Highly Conservative and Potentially Misleading |
|
| Meunier, Erwan | UCLouvain |
| Hendrickx, Julien M. | UCLouvain |
Keywords: Optimization algorithms, Distributed control, Learning
Abstract: We analyze Decentralized Online Optimization algorithms using the Performance Estimation Problem approach which allows, to automatically compute exact worst- case performance of optimization algorithms. Our analysis shows that several available performance guarantees are very conservative, sometimes by multiple orders of magnitude, and can lead to misguided choices of algorithm. Moreover, at least in terms of worst-case performance, some algorithms appear not to benefit from inter-agent communications for a significant period of time. We show how to improve classical methods by tuning their step-sizes, and find that we can save up to 20% on their actual worst-case performance regret.
|
| |
| FrA11 |
Oceania VI |
| Network Analysis and Control I |
Regular Session |
| Chair: Caines, Peter E. | McGill University |
| Co-Chair: Paganini, Fernando | Universidad ORT Uruguay |
| |
| 09:30-09:45, Paper FrA11.1 | |
| Transmission Neural Networks: Approximate Receding Horizon Control for Virus Spread on Networks |
|
| Gao, Shuang | Polytechnique Montreal |
| Caines, Peter E. | McGill University |
Keywords: Control of networks, Network analysis and control, Stochastic systems
Abstract: Transmission Neural Networks (TransNNs) proposed by Gao and Caines (2022) serve as both virus spread models over networks and neural network models with tuneable activation functions. This paper establishes that TransNNs provide upper bounds on the infection probability generated from the associated Markovian stochastic Susceptible-Infected- Susceptible (SIS) model with 2^n state configurations where n is the number of nodes in the network, and can be employed as an approximate model for the latter. Based on such an approximation, a TransNN-based receding horizon control approach for mitigating virus spread is proposed and we demonstrate that it allows significant computational savings compared to the dynamic programming solution to Markovian SIS model with 2^n state configurations, as well as providing less conservative control actions compared to the TransNN-based optimal control. Finally, numerical comparisons among (a) dynamic programming solutions for the Markovian SIS model, (b) TransNN-based optimal control and (c) the proposed TransNN-based receding horizon control are presented.
|
| |
| 09:45-10:00, Paper FrA11.2 | |
| A Passivity Analysis for Nonlinear Consensus on Digraphs |
|
| Yue, Feng-Yu | Technion |
| Zelazo, Daniel | Technion - Israel Institute of Technology |
Keywords: Network analysis and control, Cooperative control, Networked control systems
Abstract: This work presents a passivity-based analysis for the nonlinear output agreement problem in network systems over directed graphs. We reformulate the problem as a convergence analysis on the agreement submanifold. First, we establish how passivity properties of individual agents and controllers determine the passivity of their associated system relations. Building on this, we introduce the concept of submanifold-constrained passivity and develop a novel compensation theorem that ensures output convergence to the agreement submanifold. Unlike previous approaches, our approach can analyze the network system with arbitrary digraphs and any passive agents. We apply this framework to analyze the output agreement problem for network systems consisting of nonlinear and passive agents. Numerical examples support our results.
|
| |
| 10:00-10:15, Paper FrA11.3 | |
| Collective Decision-Making Dynamics in Hypernetworks |
|
| Fontan, Angela | KTH Royal Institute of Technology |
| Zhang, Silun | KTH Royal Institute of Technology |
Keywords: Network analysis and control, Nonlinear systems, Agents-based systems
Abstract: This work describes a collective decision-making dynamical process in a multiagent system under the assumption of cooperative higher-order interactions within the community, modeled as a hypernetwork. The nonlinear interconnected system is characterized by saturated nonlinearities that describe how agents transmit their opinion state to their neighbors in the hypernetwork, and by a bifurcation parameter representing the community's social effort. We show that the presence of higher-order interactions leads to the unfolding of a pitchfork bifurcation, introducing an interval for the social effort parameter in which the system exhibits bistability. With equilibrium points representing collective decisions, this implies that, depending on the initial conditions, the community will either remain in a deadlock state (with the origin as the equilibrium point) or reach a nontrivial decision. A numerical example is given to illustrate the results.
|
| |
| 10:15-10:30, Paper FrA11.4 | |
| Integer Control Approximations for Graphon Dynamical Systems |
|
| Köhler, Martin T. | Technical University of Braunschweig |
| Makarow, Artemi | Technische Universität Braunschweig |
| Kirches, Christian | Technical University of Braunschweig |
Keywords: Network analysis and control, Control of networks, Large-scale systems
Abstract: Graphons generalize graphs and define a limit object of a converging graph sequence. The notion of graphons allows for a generic representation of coupled network dynamical systems. We are interested in approximating optimal switching controls for graphon dynamical systems. To this end, we apply a decomposition approach comprised of a relaxation and a reconstruction step. We extend the sum-up rounding algorithm to operate on a finite partition of the continuous vertex set in a graphon setting and restore integer feasibility for a given relaxed control solution. Finally, we derive an L1 bound for the state variables for structurally similar graphs and approximated switching controls. We verify our claims by simulating the Lotka-Volterra equations on a graph.
|
| |
| 10:30-10:45, Paper FrA11.5 | |
| Nonlinear Average Consensus Over Circulating Directed Hypergraphs |
|
| Liuzza, Davide | University of Sannio |
| Della Rossa, Fabio | Politecnico Di Milano |
| Lo Iudice, Francesco | Università Di Napoli Federico II |
| De Lellis, Pietro | University of Naples Federico II |
Keywords: Network analysis and control, Control of networks
Abstract: Consensus dynamics are ubiquitous in control theory, with applications spanning from technological fields to social sciences. In this Letter, we evaluate the impact of higher-order, many-body interactions on the ability of the network to attain consensus. Modeling multi-body interactions with circulating directed hypergraphs, we provide for the first time global conditions for average consensus in the presence of time-varying and possibly non-smooth coupling protocols. Our results are then applied to study the effect of homophily in opinion dynamics.
|
| |
| 10:45-11:00, Paper FrA11.6 | |
| Dissatisfaction Dynamics in Directed Bipartite Networks of Captive User Communities |
|
| Kibangou, Alain | Univ. Grenoble Alpes |
| Kalaoane, Retsepile | University of Johannesburg |
Keywords: Compartmental and Positive systems, Network analysis and control, Model Validation
Abstract: Socio-technical systems providing essential services, like water, energy, and transportation, often create captive user segments due to limited alternatives. This can lead to user dissatisfaction and complex social dynamics. Building on a recent dynamic model, this paper extends the analysis of captive user dissatisfaction to networks of interconnected communities, using a directed bipartite leader–follower structure. We investigate the steady-state properties of the model and analyze how leader communities can mitigate or amplify dissatisfaction among their followers. A key insight is that consensus in dissatisfaction levels does not stem from network topology but from the alignment of a Dissatisfaction Index, a new metric introduced in this letter that reflects each community's perception of service quality
|
| |
| 11:00-11:15, Paper FrA11.7 | |
| Scalable Sensor Placement for Cyclic Networks with Observability Guarantees: Application to Water Distribution Networks |
|
| van Gemert, Jarne Jeannetta Huberta | University of Technology Eindhoven |
| Breschi, Valentina | Eindhoven University of Technology |
| Yntema, Doekle | Wetsus, Centre of Excellence in Water Technology |
| Keesman, Karel J. | Wageningen University |
| Lazar, Mircea | Eindhoven University of Technology |
Keywords: Network analysis and control, Large-scale systems, Computational methods
Abstract: Optimal sensor placement is essential for state estimation and effective network monitoring. As known in the literature, this problem becomes particularly challenging in large-scale undirected or bidirected cyclic networks with parametric uncertainties, such as water distribution networks (WDNs), where pipe resistance and demand patterns are often unknown. Motivated by the challenges of cycles, parametric uncertainties, and scalability, this paper proposes a sensor placement algorithm that guarantees structural observability for cyclic and acyclic networks with parametric uncertainties. By leveraging a graph-based strategy, the proposed method efficiently addresses the computational complexities of large-scale networks. To demonstrate the algorithm's effectiveness, we apply it to several EPANET benchmark WDNs. Most notably, the developed algorithm solves the sensor placement problem with guaranteed structural observability for the L-town WDN with 1694 nodes and 124 cycles in under 0.1 seconds.
|
| |
| 11:15-11:30, Paper FrA11.8 | |
| Dynamic Load Balancing for Cloud Systems under Heterogeneous Setup Delays |
|
| Paganini, Fernando | Universidad ORT Uruguay |
| Goldsztajn, Diego | Universidad ORT Uruguay |
Keywords: Control of networks, Optimization, Lyapunov methods
Abstract: We consider a distributed cloud service deployed at a set of distinct server pools. Arriving jobs are classified into heterogeneous types, in accordance with their setup times which are differentiated at each of the pools. A dispatcher for each job type controls the balance of load between pools, based on decentralized feedback. The system of rates and queues is modeled by a fluid differential equation system, and analyzed via convex optimization. A first, myopic policy is proposed, based on task delay-to-service. Under a simplified dynamic fluid queue model, we prove global convergence to an equilibrium point which minimizes the mean setup time; however queueing delays are incurred with this method. A second proposal is then developed based on proximal optimization, which explicitly models the setup queue and is proved to reach an optimal equilibrium, devoid of queueing delay. Results are demonstrated through a simulation example.
|
| |
| FrA12 |
Oceania X |
| Optimization Algorithms III |
Regular Session |
| Chair: Tabuada, Paulo | University of California at Los Angeles |
| Co-Chair: Yang, Tao | Northeastern University |
| |
| 09:30-09:45, Paper FrA12.1 | |
| Sequential QCQP for Bilevel Optimization with Line Search |
|
| Sharifi, Sina | Johns Hopkins University |
| Yazdandoost Hamedani, Erfan | University of Arizona |
| Fazlyab, Mahyar | Johns Hopkins University |
Keywords: Optimization algorithms, Machine learning, Emerging control applications
Abstract: Bilevel optimization involves a hierarchical structure where one problem is nested within another, leading to complex interdependencies between levels. We propose a single-loop, tuning-free algorithm that guarantees anytime feasibility, i.e., approximate satisfaction of the lower-level optimality condition, while ensuring descent of the upper-level objective. At each iteration, a convex quadratically-constrained quadratic program (QCQP) with a closed-form solution yields the search direction, followed by a backtracking line search inspired by control barrier functions to ensure safe, uniformly positive step sizes. The resulting method is scalable, requires no hyperparameter tuning, and converges under mild local regularity assumptions. We establish an O(1/k) ergodic convergence rate in terms of a first-order stationary metric and demonstrate the algorithm’s effectiveness on representative bilevel tasks.
|
| |
| 09:45-10:00, Paper FrA12.2 | |
| Time-Varying Distributed Optimization with Derivative Estimation |
|
| Silvestre, Joao Pedro | University of California, Los Angeles |
| Marchi, Matteo | University of California, Los Angeles |
| Tabuada, Paulo | University of California at Los Angeles |
Keywords: Optimization algorithms
Abstract: Optimization algorithms for time-varying cost functions extend classical methods, which cannot track dynamic minima. However, they typically assume access to the partial derivative of the cost function with respect to time. This is an assumption not satisfied in many situations where, although the time-varying parameters can be measured, either their time-derivatives or their model are unknown. Distributed optimization algorithms for time-varying costs typically inherit these limitations and introduce additional challenges, such as requiring each node to measure the time-varying parameters its local cost depends on. This work addresses these issues by first developing a centralized time-varying optimization algorithm that ensures convergence to a bounded neighborhood of the optimal trajectory. We then extend it to a distributed framework, relaxing the aforementioned measurement requirements. The effectiveness of our approach is validated through numerical examples of varying complexity.
|
| |
| 10:00-10:15, Paper FrA12.3 | |
| Distributed Constrained Online Nonconvex Optimization with Compressed Communication |
|
| Zhang, Kunpeng | Northeastern University |
| Xu, Lei | Northeastern University |
| Yi, Xinlei | College of Electronics and Information Engineering, Tongji Unive |
| Cao, Ming | University of Groningen |
| Johansson, Karl H. | KTH Royal Institute of Technology |
| Chai, Tianyou | Northeastern University |
| Yang, Tao | Northeastern University |
Keywords: Optimization algorithms, Agents-based systems, Communication networks
Abstract: This paper considers distributed online nonconvex optimization with time-varying inequality constraints over a network of agents. For time-varying graphs, we propose a distributed online primal--dual algorithm with compressed communication to efficiently utilize communication resources. We show that the proposed algorithm establishes an mathcal{O}( {{T^{max { {1 - {theta _1},{theta _1}} }}}} ) network regret bound and an mathcal{O}( {T^{1 - {theta _1}/2}} ) network cumulative constraint violation bound, where T is the number of iterations and {theta _1} in ( {0,1} ) is a user-defined trade-off parameter. When Slater's condition holds, the network cumulative constraint violation bound is reduced to mathcal{O}( {T^{1 - {theta _1}}} ). These bounds are comparable to the state-of-the-art results established by existing distributed online algorithms with perfect communication for distributed online convex optimization with (time-varying) inequality constraints. Finally, a simulation example is presented to validate the theoretical results.
|
| |
| 10:15-10:30, Paper FrA12.4 | |
| Intrinsic Successive Convexification: Trajectory Optimization on Smooth Manifolds |
|
| Kraisler, Spencer | University of Washington |
| Mesbahi, Mehran | University of Washington |
| Acikmese, Behcet | University of Washington |
Keywords: Algebraic/geometric methods, Optimization algorithms, Optimal control
Abstract: A fundamental issue at the core of trajectory optimization on smooth manifolds is handling the implicit manifold constraint within the dynamics. The conventional approach is to enforce the dynamic model as a constraint. However, we show that this approach leads to significantly redundant operations, as well as being heavily dependent on the state space representation. Specifically, we propose an intrinsic successive convexification methodology for optimal control on smooth manifolds. This so-called iSCvx is then applied to a representative example involving attitude trajectory optimization for a spacecraft subject to non-convex constraints.
|
| |
| 10:30-10:45, Paper FrA12.5 | |
| Collaborative Bayesian Optimization Via Wasserstein Barycenters |
|
| Zhan, Donglin | Columbia University |
| Zhang, Haoting | University of California, Berkeley |
| Righter, Rhonda | University of California, Berkeley |
| Zheng, Zeyu | UC Berkeley |
| Anderson, James | Columbia University |
Keywords: Optimization algorithms, Machine learning, Stochastic systems
Abstract: Motivated by the growing need for black-box optimization and data privacy, we introduce a collaborative Bayesian optimization (BO) framework that addresses both of these challenges. In this framework agents work collaboratively to optimize a function they only have oracle access to. In order to mitigate against communication and privacy constraints, agents are not allowed to share their data but can share their Gaussian process (GP) surrogate models. To enable collaboration under these constraints, we construct a central model to approximate the objective function by leveraging the concept of Wasserstein barycenters of GPs. This central model integrates the shared models without accessing the underlying data. A key aspect of our approach is a collaborative acquisition function that balances exploration and exploitation, allowing for the optimization of decision variables collaboratively in each iteration. We prove that our proposed algorithm is asymptotically consistent and that its implementation via Monte Carlo methods is numerically accurate. Through numerical experiments, we demonstrate that our approach outperforms other baseline collaborative frameworks and is competitive with centralized approaches that do not consider data privacy.
|
| |
| 10:45-11:00, Paper FrA12.6 | |
| Properties of Fixed Points of Generalised Extra Gradient Methods Applied to Min-Max Problems |
|
| Farzin, Amir Ali | Australian National University |
| Pun, Yuen-Man | Australian National University |
| Braun, Philipp | The Australian National University |
| Shames, Iman | Australian National University |
Keywords: Optimization algorithms, Numerical algorithms, Machine learning
Abstract: This paper studies properties of fixed points of generalised Extra-gradient (GEG) algorithms applied to min- max problems. We discuss connections between saddle points of the objective function of the min-max problem and GEG fixed points. We show that, under appropriate step-size selections, the set of local saddle points (local Nash equilibria) is a subset of locally stable fixed points of GEG. Convergence properties of the GEG algorithm are obtained through a stability analysis of a discrete-time dynamical system. The results when compared to existing methods are illustrated through numerical examples.
|
| |
| 11:00-11:15, Paper FrA12.7 | |
| Online Convex Optimization and Integral Quadratic Constraints: An Automated Approach to Regret Analysis |
|
| Jakob, Fabian | University of Stuttgart |
| Iannelli, Andrea | University of Stuttgart |
Keywords: Optimization algorithms, Robust control
Abstract: We propose a novel approach for analyzing dynamic regret of first-order online convex optimization (OCO) algorithms for strongly convex and Lipschitz-smooth objectives. Crucially, we provide a general analysis that is applicable to a wide range of first-order algorithms that can be expressed as an interconnection of a linear dynamical system in feedback with a first-order oracle. By leveraging Integral Quadratic Constraints (IQCs), we derive a semi-definite program which, when feasible, provides a regret guarantee for the online algorithm. For this, the concept of variational IQCs is introduced as the generalization of IQCs to time-varying monotone operators. Our bounds capture the temporal rate of change of the problem in the form of the path length of the time-varying minimizer and the objective function variation. In contrast to standard results in OCO, our results do not require neither the assumption of gradient boundedness, nor that of a bounded feasible set. Numerical analyses showcase the ability of the approach to capture the dependence of the regret on the function class condition number.
|
| |
| 11:15-11:30, Paper FrA12.8 | |
| Automatically Stopping Bayesian Optimization under Uncertain Surrogate Model Means |
|
| He, Hans | Virginia Tech |
| Koppel, Alec | JP Morgan Chase |
| Bedi, Amrit Singh | University of Maryland |
| Farhood, Mazen | Virginia Tech |
| Stilwell, Daniel J. | Virginia Tech |
Keywords: Statistical learning, Optimization algorithms, PID control
Abstract: Stopping criteria for Bayesian optimization (BO) automatically terminate the optimization algorithm when a near-optimal solution has likely been reached, avoiding unnecessary expenditure of computational resources. Existing criteria, however, only guarantee accuracy given a well-specified Gaussian process surrogate model, an assumption that often does not hold in practice. We propose a stopping criterion for Bayesian optimization when the mean function of the Gaussian process surrogate model is uncertain, but modeled with a prior. The prior induces a probability distribution over surrogate models, which our criterion uses to evaluate the probability that a model will be stopped under an existing stopping criterion. We demonstrate that our criterion will eventually terminate BO, with high probability of achieving a desired accuracy to the optimal value. Empirical analysis on a controller tuning problem for an autonomous underwater vehicle suggests that our method can terminate BO with high accuracy solutions, particularly for low dimensional problems.
|
| |
| FrA13 |
Oceania IX |
| Game Theory IV |
Regular Session |
| Chair: Yu, Wenwu | Southeast University |
| Co-Chair: Wisniewski, Rafal | Aalborg University |
| |
| 09:30-09:45, Paper FrA13.1 | |
| Distributed Nash Equilibrium Seeking for Multi-Cluster Aggregative Games Over Time-Varying Unbalanced Graphs |
|
| Lu, Tian | Southeast University |
| Zhao, Jingzhao | Southeast University |
| Liu, Hongzhe | Southeast University |
| Xu, Wenying | Southeast University |
| Yu, Wenwu | Southeast University |
Keywords: Game theory, Agents-based systems
Abstract: The letter studies a multi-cluster aggregative game, where multiple clusters exist and each cluster consists of a group of agents. Each cluster serves as a noncooperative player, and agents within the same cluster aim to minimize the sum of their individual cost functions cooperatively. The local cost function of each agent depends on both its own decision and the aggregate decision of each cluster. The objective is to find the Nash equilibrium (NE) of the game in a distributed manner over time-varying unbalanced graphs. To this end, a distributed discrete-time NE seeking algorithm is developed, incorporating a novel tracking technique to estimate the aggregate decision of each cluster and the push-sum protocol to accommodate time-varying unbalanced graphs. The linear convergence of the proposed algorithm is established rigorously via multi-step contraction analysis and linear systems of inequalities. Finally, numerical simulations of an Energy Internet System validate the effectiveness of the algorithm.
|
| |
| 09:45-10:00, Paper FrA13.2 | |
| Finding Approximate Correlated Equilibrium for Decentralized Control |
|
| Misra, Rahul | Aalborg University |
| Wisniewski, Rafal | Aalborg University |
| Kallesøe, Carsten Skovmose | Grundfos |
Keywords: Game theory, Decentralized control, Reinforcement learning
Abstract: We address the problem of finding an optimal decentralized control strategy, defined as achieving approximate Correlated Equilibrium, a generalization of Nash equilibrium. Each controller uses a Reinforcement Learning algorithm within a Markov game framework, abstracting continuous states into discrete ones. Controllers observe the system's discrete state and incurred costs, updating their strategies privately. We prove convergence to approximate Correlated Equilibrium. Our algorithm is applied to a water distribution network with two controllers managing a tank's water level in a decentralized setting, using real-world consumption data. Previously, this problem was deemed to be computationally challenging owing to the discrete controls (number of pumps switched on), resulting in a Mixed Integer optimization. The designed controllers aim to ensure stability, safety, and energy minimization. By training over multiple episodes, optimal weighting parameters for objectives are determined using Blackwell's Approachability theorem. Simulation results show the attainment of approximate correlated equilibrium in accordance with our theoretical results.
|
| |
| 10:00-10:15, Paper FrA13.3 | |
| Up Your Game: Training Games with Efficient Nash Equilibrium with Deep Learning |
|
| Kantorovich, Ariel | Tel Aviv University |
| Bistritz, Ilai | Tel Aviv University |
Keywords: Game theory, Machine learning, Cooperative control
Abstract: We consider a game with N cooperative players that have a global objective. Simple distributed algorithms such as gradient play often converge to a Nash equilibrium (NE). However, a NE typically suffers from poor global performance. Converging to the global optimum requires explicit communication and coordination. In this paper, we propose a new approach to improve the performance at NE. Our method uses machine learning offline training to design games with efficient NE. In particular, we use a dataset of games with parameters coming from a certain distribution (e.g., uniformly random player locations). We then train a deep neural network (DNN) where the input is the local measurement available to the player and the output is the reward parameters that best approximate the global objective. We demonstrate our approach for two %three classes of games: energy games and wireless power control games. Our approach offers significant performance boosts while requiring no communication between the players and no complexity increase in real-time.
|
| |
| 10:15-10:30, Paper FrA13.4 | |
| Nonlinear Mean Field Games with Multiple Major Agents and Multiple Populations of Minor Agents |
|
| Zhang, Xuanping | Beihang University |
| Ren, Lu | Beihang University |
| Yao, Wang | Beihang University |
| Zhang, Xiao | Beihang University |
Keywords: Mean field games, Large-scale systems, Nonlinear systems
Abstract: The paper studies a nonlinear mean field game (MFG) model based on McKean-Vlasov (MV) approximation, which involves multiple major agents and multiple populations of minor agents. Since the interactions occur not only between major–minor agents but also among major–major agents and minor–minor populations, complicating equilibrium existence analysis, we first cast the MFG as two sets of adjoint stochastic McKean–Vlasov equations coupled via the mean field behavior. Subsequently, a new norm on the product probability measure space is constructed to prove that one set of equations exists solutions, while the existence of solutions to the other set is proved based on Pontryagin stochastic maximum principle. Then, within the established product normed space, Banach's fixed point theorem is utilized to prove the existence of equilibrium for this MFG, which is manifested as solutions to two sets of coupled equations. Finally, an epsilon-Nash equilibrium is proved for the finite agent situation, in which epsilon to 0 while all population sizes go to infty, and a numerical experiment under certain settings is carried out.
|
| |
| 10:30-10:45, Paper FrA13.5 | |
| Population-Aware Online Mirror Descent for Mean-Field Games with Common Noise by Deep Reinforcement Learning |
|
| Wu, Zida | University of California, Los Angeles |
| Lauriere, Mathieu | NYU Shanghai |
| Geist, Matthieu | Earth Species Project |
| Pietquin, Olivier | Google Brain |
| Mehta, Ankur | University of California Los Angeles |
Keywords: Mean field games, Game theory, Reinforcement learning
Abstract: Mean Field Games (MFGs) offer a powerful framework for studying large-scale multi-agent systems. Yet, learning Nash equilibria in MFGs remains a challenging problem, particularly when the initial distribution is unknown or when the population is subject to common noise. In this paper, we introduce an efficient deep reinforcement learning (DRL) algorithm designed to achieve population-dependent Nash equilibria without relying on averaging or historical sampling, inspired by Munchausen RL and Online Mirror Descent. The resulting policy is adaptable to various initial distributions and sources of common noise. Through numerical experiments on seven canonical examples, we demonstrate that our algorithm exhibits superior convergence properties compared to state-of-the-art algorithms, particularly a DRL version of Fictitious Play for population-dependent policies. The performance in the presence of common noise underscores the robustness and adaptability of our approach.
|
| |
| 10:45-11:00, Paper FrA13.6 | |
| Incentive Analysis for Agent Participation in Federated Learning |
|
| Yi, Lihui | Northwestern University |
| Niu, Xiaochun | Northwestern University |
| Wei, Ermin | Northwestern Univeristy |
Keywords: Game theory, Modeling, Learning
Abstract: Federated learning (FL) offers a decentralized approach to machine learning, where multiple agents collaboratively train a model while preserving data privacy. In this paper, we investigate the decision-making and equilibrium behavior in FL systems, where agents choose between participating in global training or conducting independent local training. The problem is first modeled as a stage game and then extended to a repeated game to analyze the long-term dynamics of agent participation. For the stage game, we characterize the participation patterns and identify Nash equilibrium, revealing how data heterogeneity influences the equilibrium behavior—specifically, agents with similar data qualities will participate in FL as a group. We also derive the optimal social welfare strategy and show that it lies in a neighborhood of Nash equilibrium. In the repeated game, we propose a privacy-preserving, computationally efficient myopic strategy. This strategy enables agents to make practical decisions under bounded rationality and converges to a neighborhood of Nash equilibrium of the stage game in finite time. By combining theoretical insights with practical strategy design, this work provides a realistic and effective framework for guiding and analyzing agent behaviors in FL systems.
|
| |
| 11:00-11:15, Paper FrA13.7 | |
| Multi-Potential Games: Bipartite Conflicts and Nash Equilibria Equivalence Analysis |
|
| Liu, Aixin | Shanghai Jiao Tong University |
| Wang, Lin | Shanghai Jiao Tong University |
| Cao, Ming | University of Groningen |
Keywords: Boolean control networks and logic networks, Game theory, Algebraic/geometric methods
Abstract: A multi-potential game extends the classical potential game by grouping players into distinct subsets. Within each subset, given that the strategies of players in other subsets remain fixed, the interactions among this subset's players form a potential subgame characterized by its own restricted potential function. In this paper, we first establish that when the conflict relationships among players exist and follow structural balance, the multi-potential game achieves the minimal potential index of two. Then, we prove that the pure strategy Nash equilibria in the multi-potential framework, called the pure strategy multi-potential Nash equilibria, are a subset of those in the original game. Furthermore, we establish the conditions under which the set of pure strategy Nash equilibria in the original game coincides with the set of pure strategy multi-potential Nash equilibria. Finally, we provide an illustrative example to demonstrate our findings.
|
| |
| FrA14 |
Galapagos III |
| Robotics and Autonomous Systems III |
Regular Session |
| Chair: Vermillion, Christopher | University of Michigan |
| Co-Chair: Diagne, Mamadou | University of California San Diego |
| |
| 09:30-09:45, Paper FrA14.1 | |
| Priority-Driven Constraints Softening in Safe MPC for Perturbed Systems |
|
| Quan, Yingshuai | Chalmers University of Technology |
| Jeddi, Mohammad | University of Modena and Reggio Emilia |
| Prignoli, Francesco | Università Di Modena E Reggio Emilia |
| Falcone, Paolo | Chalmers University of Technology |
Keywords: Autonomous vehicles, Predictive control for linear systems, Robotics
Abstract: This paper presents a safe model predictive control (SMPC) framework designed to ensure the satisfaction of hard constraints, for systems perturbed by an external disturbance. Such safety guarantees are ensured, despite the disturbance, by online softening a subset of adjustable constraints defined by the designer. The selection of the constraints to be softened is made online based on a predefined priority assigned to each adjustable constraint. The design of a learning-based algorithm enables real-time computation while preserving the original safety properties. Simulations results, obtained from an automated driving application, show that the proposed approach provides guarantees of collision-avoidance hard constraints despite the unpredicted behaviors of the surrounding environment.
|
| |
| 09:45-10:00, Paper FrA14.2 | |
| Trajectory Planning for Pseudo-Omnidirectional Vehicles: A Near-Optimal Approach |
|
| Xi, Wang | Shanghai Jiao Tong University |
| Guo, Jiaming | Shanghai Jiao Tong University |
| Wang, Chenyang | Shanghai Jiao Tong University |
| Wu, Shukun | Shanghai Jiao Tong University |
| He, Jianping | Shanghai Jiao Tong University |
Keywords: Autonomous vehicles, Automotive systems
Abstract: Pseudo-omnidirectional vehicles exhibit broad application in robotics, yet their inherent complex dynamic characteristics impose fundamental challenges on motion planning and control. In this work, we formulate the trajectory planning as a nonlinear model predictive control problem with bilinear constraints. This formulation effectively addresses multiple model-specific challenges induced by nonlinear kinematics, and features strong generality. To meet real-time computational requirements, we propose an efficient algorithm through structural analysis of the nonlinear optimization problem. The algorithm constructs a sequence of convex optimization subproblems to approximate solutions of the original problem, featuring both implementation simplicity and rapid convergence. Furthermore, the algorithm is proven to converge to stationary points of the original optimization problem. Numerical experiments validate the effectiveness and computational efficiency of our method.
|
| |
| 10:00-10:15, Paper FrA14.3 | |
| Trajectory Planning Based on Time-Varying Convex Liftings of Dynamic Environments |
|
| Konyalioglu, Turan | CentraleSupélec, L2S, Ampere Software Technology |
| Olaru, Sorin | CentraleSupélec |
| Niculescu, Silviu-Iulian | University Paris-Saclay, CNRS, CentraleSupelec, Inria |
| Mustaki, Simon | LS2N / Renault |
| Ballesteros-Tolosana, Iris | CentraleSupélec/Renault |
| Flores, Carlos | UC Berkeley |
Keywords: Autonomous vehicles, Constrained control, Predictive control for linear systems
Abstract: This paper presents a novel and integrated framework for motion planning and control in time-varying environments. The proposed method combines a time-parameterized convex-lifting-based path planning approach—including state space partitioning, graph generation, pathfinding, and safe corridor generation—with a spatiotemporal formulation to guide motion planning. To ensure dynamic feasibility, the framework is extended with a Model Predictive Control (MPC) scheme that computes safe trajectories within the identified corridors. A replanning strategy is introduced to adapt to the unpredictable behaviors of surrounding vehicles and maintain the feasibility of trajectories. Simulation results in an overtaking scenario demonstrate the effectiveness of the approach, with the ego vehicle successfully performing trajectory tracking and replanning while satisfying dynamic constraints.
|
| |
| 10:15-10:30, Paper FrA14.4 | |
| Constant-Sum High-Order Barrier Functions for Safety between Parallel Boundaries |
|
| Kim, Kwang Hak | University of California San Diego |
| Diagne, Mamadou | University of California San Diego |
| Krstic, Miroslav | University of California, San Diego |
Keywords: Constrained control, Autonomous systems, Autonomous vehicles
Abstract: This paper takes a step towards addressing the difficulty of constructing Control Barrier Functions (CBFs) for parallel safety boundaries. A single CBF for both boundaries has been reported to be difficult to validate for safety, and we identify why this challenge is inherent. To overcome this, the proposed method constructs separate CBFs for each boundary. We begin by presenting results for the relative degree one case and then extend these to higher relative degrees using the CBF backstepping technique, establishing conditions that guarantee safety. Finally, we showcase our method by applying it to a unicycle system, deriving a simple, verifiable condition to validate the target CBFs for direct implementation of our results.
|
| |
| 10:30-10:45, Paper FrA14.5 | |
| An Internal Model Control System for Clothoid Tracking |
|
| Mimmo, Nicola | University of Bologna |
| Frego, Marco | Free University of Bozen-Bolzano |
Keywords: Output regulation, Autonomous vehicles
Abstract: This paper deals with clothoid tracking via the In ternal Model Principle (IMP). The ability to track these curves impacts efficiency, e.g., traffic flow, number of worked pieces in CNC machines, etc. To solve the clothoid trajectory tracking problem, this paper presents a new design method for IMP-based controllers. The theoretical arguments are corroborated via the simulation of a robotic walker.
|
| |
| 10:45-11:00, Paper FrA14.6 | |
| Three-Dimensional Repulsive Vector Field Strategy for Collision Avoidance in Quadrotors Trajectory Tracking |
|
| Martinez-Ramirez, Marco Antonio | CINVESTAV |
| Trujillo-Flores, Miguel Angel | Instituto Tecnológico Autónomo De México |
| Shao, Xiaodong | Beihang University |
| Romero, Jose Guadalupe | Instituto Tecnológico Autónomo De México |
| Rodríguez-Cortés, Hugo | CINVESTAV-IPN |
Keywords: Autonomous vehicles, Agents-based systems, Lyapunov methods
Abstract: Unmanned aerial vehicles are now widely adopted in various applications, often requiring coexistence with other vehicles. This article proposes a repulsive vector field-based collision avoidance for quadrotors performing independent trajectory-tracking tasks in the same airspace. The trajectory tracking control is designed in the quadrotor’s attitude configuration space and utilizes the Immersion and Invariance technique to compensate for constant unknown disturbances; moreover, it operates with a collision avoidance strategy that is activated when a threshold distance between quadrotors is exceeded. The collision avoidance strategy is based on the direction of a three-dimensional repulsive vector field (RVF) constructed around each vehicle. The trajectory tracking and collision avoidance strategies are experimentally evaluated.
|
| |
| 11:00-11:15, Paper FrA14.7 | |
| Clarity-Driven Ergodic Control for Persistent Tip-And-Cue Missions with Synchronized Rendezvous |
|
| Li, David | University of Michigan |
| Govindarajan, Kavin | University of Michigan |
| Vermillion, Christopher | University of Michigan |
Keywords: Optimization, Cooperative control, Autonomous vehicles
Abstract: This paper presents a persistent control methodology for a sustainably powered host/agent network that executes tip-and-cue oceanographic observing operations within a spatiotemporally evolving environment. Specifically, a renewably powered host vessel simultaneously serves as a recharging platform for an autonomous aerial vehicle (AAV), while also performing broad surveillance of an evolving mission domain. When the AAV is on board the host vessel, the mission trajectory (termed the “nominal” trajectory) is selected based on a clarity-driven ergodic planner. When a location of interest (termed a “tip” location) is detected by the host vessel, the AAV is dispatched to provide detailed observation of that location. This necessitates a replanning operation (of the “rendezvous” trajectory) wherein a rendezvous point is selected to maximize the mutual long-horizon benefit to the host and agent. Because the mutually beneficial rendezvous point will, in general, deviate from the original ergodic trajectory, another replanning operation (of the nominal trajectory) is completed on rendezvous. In this paper, we demonstrate the efficacy of the combination of the ergodic trajectory planner and rendezvous planner for a solar-powered host vessel (the SeaTrac SP-48 ASV) and a quadrotor (Agilicious) agent vehicle. In particular, the combined control system is shown to significantly outperform a line-transect strategy and an ergodic controller wherein the rendezvous point is constrained to lie on the nominal mission trajectory.
|
| |
| 11:15-11:30, Paper FrA14.8 | |
| Uncertainty Removal in Verification of Nonlinear Systems against Signal Temporal Logic Via Incremental Reachability Analysis |
|
| Besset, Antoine | ENSTA Paris |
| Tillet, Joris | ENSTA |
| Alexandre dit Sandretto, Julien | ENSTA ParisTech |
Keywords: Formal Verification/Synthesis, Uncertain systems, Nonlinear systems
Abstract: A framework is presented for the verification of Signal Temporal Logic (STL) specifications over continuous-time nonlinear systems under uncertainty. Based on reachability analysis, the proposed method addresses indeterminate satisfaction caused by over-approximated reachable sets or incomplete simulations. STL semantics is extended via Boolean interval arithmetic, enabling the decomposition of satisfaction signals into unitary components with traceable uncertainty markers. These are propagated through the satisfaction tree, supporting precise identification even in nested formulas. To improve efficiency, only the reachable sets contributing to uncertainty are refined, identified through the associated markers. The framework allows online or offline monitoring to adapt to incremental system evolution while avoiding unnecessary recomputation. A case study on a nonlinear oscillator demonstrates a significant reduction in satisfaction ambiguity, highlighting the effectiveness of the approach.
|
| |
| FrA15 |
Capri II |
| Stochastic Systems II |
Regular Session |
| Chair: Avrachenkov, Konstantin E. | INRIA Sophia Antipolis |
| Co-Chair: Lahijanian, Morteza | University of Colorado Boulder |
| |
| 09:30-09:45, Paper FrA15.1 | |
| Distributions and Direct Parametrization for Stable Stochastic State-Space Models |
|
| Al Ahdab, Mohamad | Aalborg University |
| Tan, Zheng-Hua | Aalborg University |
| Leth, John | Aalborg University |
Keywords: Stochastic systems, Statistical learning, Stability of linear systems
Abstract: We present a direct parametrization for continuous-time stochastic state-space models that ensures external stability via the stochastic bounded-real lemma. Our formulation facilitates the construction of probabilistic priors that enforce almost-sure stability, which are suitable for sampling-based Bayesian inference methods. We validate our work with a simulation example and demonstrate its ability to yield stable predictions with uncertainty quantification.
|
| |
| 09:45-10:00, Paper FrA15.2 | |
| On Polynomial Stochastic Barrier Functions: Bernstein versus Sum-Of-Squares |
|
| Amorese, Peter | University of Colorado Boulder |
| Lahijanian, Morteza | University of Colorado Boulder |
Keywords: Stochastic systems, Lyapunov methods, Optimization
Abstract: Stochastic Barrier Functions (SBFs) certify the safety of stochastic systems by formulating a functional optimization problem, which state-of-the-art methods solve using Sum-of Squares (SoS) polynomials. This work focuses on polynomial SBFs and introduces a new formulation based on Bernstein polynomials and provides a comparative analysis of its theoretical and empirical performance against SoS methods. We show that the Bernstein formulation leads to a linear program (LP), in contrast to the semi-definite program (SDP) required for SoS, and that its relaxations exhibit favorable theoretical convergence properties. However, our empirical results reveal that the Bernstein approach struggles to match SoS in practical performance, exposing an intriguing gap between theoretical advantages and real-world feasibility.
|
| |
| 10:00-10:15, Paper FrA15.3 | |
| Causal Conditional Directed Information in a Point Process Network |
|
| Rong, Xinhui | The University of Melbourne |
| Nair, Girish N. | University of Melbourne |
Keywords: Stochastic systems, Information theory and control, Estimation
Abstract: The causal conditional (CC) directed information (DI) can test the Granger causality for stochastic processes. However, for point process networks, the CCDI has only been developed for a trivariate network. In this work, we develop the general multivariate CCDI under Kramer’s framework by developing the CC likelihood, which is characterized by marginal intensity functions. We establish several Granger causality equivalences, unifying and generalizing existing results. Further, for Hawkes networks, we develop an estimation method of CCDI requiring only one point process trajectory, by developing an analytical formula for the marginal intensity functions, and an ergodic theorem for the CCDI.
|
| |
| 10:15-10:30, Paper FrA15.4 | |
| Stability of Polling Systems for a Large Class of Markovian Switching Policies |
|
| Avrachenkov, Konstantin E. | INRIA Sophia Antipolis |
| Das, Kousik | Indian Institute of Technology Bomaby |
| Veeraruna, Kavitha | IIT Bombay, India |
| Singh, Vartika | University of Colorado Colorado Springs |
Keywords: Queueing systems, Stochastic systems, Lyapunov methods
Abstract: We consider a polling system with two queues, where a single server is attending the queues in a cyclic order and requires non-zero switching times to switch between the queues. Our aim is to identify a fairly general and comprehensive class of Markovian switching policies that renders the system stable. Potentially a class of policies that can cover the Pareto frontier related to individual-queue-centric performance measures like the stationary expected number of waiting customers in each queue; for instance, such a class of policies is identified recently for a polling system near the fluid regime (with large arrival and departure rates), and we aim to include that class. We also aim to include a second class that facilitates switching between the queues at the instance the occupancy in the opposite queue crosses a threshold and when that in the visiting queue is below a threshold (this inclusion facilitates the design of `robust' polling systems). Towards this, we consider a class of two-phase switching policies, which include the above mentioned classes. In the maximum generality, our policies can be represented by eight parameters, while two parameters are sufficient to represent the aforementioned classes. We provide simple conditions to identify the sub-class of switching policies that ensure system stability. By numerically tuning the parameters of the proposed class, we illustrate that the proposed class can cover the Pareto frontier for the stationary expected number of customers in the two queues.
|
| |
| 10:30-10:45, Paper FrA15.5 | |
| Verifying Probabilistic Regions of Attraction with Neural Lyapunov Functions for Stochastic Systems |
|
| Su, Yun | University of Waterloo |
| De Sterck, Hans | University of Waterloo |
| Liu, Jun | University of Waterloo |
Keywords: Neural networks, Lyapunov methods, Stochastic systems
Abstract: Leveraging a stochastic extension of Zubov's equation, we develop a physics-informed neural network (PINN) approach for learning a neural Lyapunov function that captures the largest probabilistic region of attraction (ROA) for stochastic systems. We then provide sufficient conditions for the learned neural Lyapunov functions that can be readily verified by satisfiability modulo theories (SMT) solvers, enabling formal verification of both local stability analysis and probabilistic ROA estimates. By solving Zubov's equation for the maximal Lyapunov function, our method provides more accurate and larger probabilistic ROA estimates than traditional sum-of-squares (SOS) methods. Numerical experiments on nonlinear stochastic systems validate the effectiveness of our approach in training and verifying neural Lyapunov functions for probabilistic stability analysis and ROA estimates.
|
| |
| 10:45-11:00, Paper FrA15.6 | |
| Bayesian Diagnosability and Active Fault Identification |
|
| Kong, Chun-Wei | University of Colorado Boulder |
| McMahon, Jay | University of Colorado |
| Lahijanian, Morteza | University of Colorado Boulder |
Keywords: Fault diagnosis, Fault detection, Stochastic systems
Abstract: We study fault identification in discrete-time nonlinear systems subject to additive Gaussian white noise. We introduce a Bayesian framework that explicitly accounts for unmodeled faults under reasonable assumptions. Our approach hinges on a new quantitative diagnosability definition, revealing when passive fault identification (FID) is fundamentally limited by the given control sequence. To overcome such limitations, we propose an active FID strategy that designs control inputs for better fault identification. Numerical studies on a two-water tank system and a Mars satellite with complex and discontinuous dynamics demonstrate that our method significantly reduces failure rates with shorter identification delays compared to purely passive techniques.
|
| |
| 11:00-11:15, Paper FrA15.7 | |
| Controlled Supermartingale Functions for Stochastic Differential Equations: Inference and Applications |
|
| Ghanbarpour Mamaghani, Masoumeh | University of Colorado Boulder |
| Berger, Guillaume O. | UCLouvain |
| Sankaranarayanan, Sriram | University of Colorado, Boulder |
Keywords: Formal Verification/Synthesis, Stochastic systems, LMIs
Abstract: We study the problem of constructing controlled supermartingale functions to synthesize feedback laws that guarantee safety properties of stochastic differential equations (SDE) with control inputs. SDEs are widely used to model continuous time stochastic processes with applications ranging from financial markets to biology. In this paper, we extend classic notions from martingale theory for stochastic processes to prove that a given SDE will not exit a safe region over some finite time horizon with high probability. Our notion considers time-varying supermartingale functions that provide sharper probability bounds when compared to those that are time-independent. Furthermore, we study the controlled version of these supermartingales and the problem of synthesizing feedback control law that will maintain the state within a safe set with high probability over a given finite time horizon. We provide a projection-based algorithm for synthesizing polynomial, time-varying controlled supermartingales and corresponding feedback laws using sum-of-square (SOS) programming techniques. We implement our approach on some challenging numerical examples to demonstrate how it can synthesize control feedback laws that provide upper bounds on the probability of safety violations over a given time horizon.
|
| |
| 11:15-11:30, Paper FrA15.8 | |
| Tangent Space Parametrization for Stochastic Differential Equations on SO(n) |
|
| Wang, Xi | University of New South Wales |
| Solo, Victor | University of New South Wales |
Keywords: Algebraic/geometric methods, Stochastic systems, Numerical algorithms
Abstract: In this paper, we study the numerical simulation of stochastic differential equations (SDEs) on the special orthogonal Lie group SO(n). We propose a geometry-preserving numerical scheme based on the stochastic tangent space parametrization (S-TaSP) method for state-dependent multiplicative SDEs on SO(n). The convergence analysis of the S-TaSP scheme establishes a strong convergence order of O(delta^((1-epsilon)/2)), which matches the convergence order of the previous stochastic Lie Euler-Maruyama scheme while avoiding the computational cost of the exponential map. Numerical simulation illustrates the theoretical results.
|
| |
| FrA16 |
Capri III |
| Predictive Control for Nonlinear Systems III |
Regular Session |
| Chair: Olaru, Sorin | CentraleSupélec |
| Co-Chair: van Berkel, Matthijs | Dutch Institute for Fundamental Energy Research |
| |
| 09:30-09:45, Paper FrA16.1 | |
| Recursively Feasible MPC for Underactuated Euler-Lagrange Systems with Application to Snake Robot Locomotion |
|
| Gushkov, Ivan | NTNU |
| Schmidt-Didlaukies, Henrik M. | Norwegian University of Science and Technology |
| Lysø, Mads Erlend Bøe | Norwegian University of Science and Technology (NTNU) |
| Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
| Gravdahl, Jan Tommy | Norwegian Univ. of Science & Tech |
| Koehler, Matthias | University of Stuttgart |
| Mair, Jonas | University of Stuttgart |
Keywords: Predictive control for nonlinear systems, Robotics, Control applications
Abstract: This paper presents a control design approach for underactuated Euler-Lagrange (EL) systems that combines foundational ideas from passivity-based control and energy shaping with Model Predictive Control (MPC). By leveraging the intrinsic energy structure and passivity properties of EL systems, we construct a recursively feasible MPC. This hybrid design guarantees recursive feasibility by enforcing a terminal constraint on the shaped energy and exploiting the forward invariance of energy sub-level sets. We prove that a suitable terminal region always exists under mild system assumptions. The effectiveness of the approach is demonstrated through a case study on snake robot locomotion, highlighting how physical insight and energy-based reasoning can be used transparently throughout the control design process. We discuss the computation of terminal energy levels, and present simulation results validating the approach.
|
| |
| 09:45-10:00, Paper FrA16.2 | |
| Tube-Based Robust Nonlinear Model Predictive Control of Anaerobic Co-Digestion |
|
| Carecci, Davide | Politecnico Di Milano |
| Dewasme, Laurent | Université De Mons |
| La Bella, Alessio | Politecnico Di Milano |
| Vande Wouwer, Alain | Université De Mons |
| Ferretti, Gianni | Politecnico Di Milano |
Keywords: Predictive control for nonlinear systems, Robust control, Biotechnology
Abstract: To match the growing demand for bio-methane production, anaerobic digesters need to embrace the co-digestion of different feedstocks; in addition, to improve the techno-economic performance, an optimal and time-varying adaptation of the input diet is required. These operation modes constitute a very hard challenge for the limited instrumentation and control equipment typically installed aboard full-scale plants. A model-based predictive approach may be able to handle such control problem, but the identification of reliable predictive models is limited by the low information content typical of the data available from full-scale plants’ operations, which entail high parametric uncertainty. In this work, the application of a tube-based robust nonlinear model predictive control (NMPC) is proposed to regulate bio-methane production over a period of diet change in time, while warranting safe operation and dealing with uncertainties. In view of its upcoming validation on a true small pilot-scale plant, the NMPC capabilities are assessed via numerical simulations designed to resemble as much as possible the experimental setup, along with some practical final considerations.
|
| |
| 10:00-10:15, Paper FrA16.3 | |
| Learning the MPC Objective Function from Human Preferences |
|
| Krupa, Pablo | IMT School for Advanced Studies |
| El Hasnaouy, Hasna | IMT School for Advanced Studies Lucca |
| Zanon, Mario | IMT Institute for Advanced Studies Lucca |
| Bemporad, Alberto | IMT School for Advanced Studies Lucca |
Keywords: Predictive control for nonlinear systems, Predictive control for linear systems, Machine learning
Abstract: In Model Predictive Control (MPC), the objective function plays a central role in determining the closed-loop behavior of the system, and must therefore be designed to achieve the desired closed-loop performance. However, in real-world scenarios, its design is often challenging, as it requires balancing complex trade-offs and accurately capturing a performance criterion that may not be easily quantifiable in terms of an objective function. This paper explores preference-based learning as a data-driven approach to constructing an objective function from human preferences over trajectory pairs. We formulate the learning problem as a machine learning classification task to learn a surrogate model that estimates the likelihood of a trajectory being preferred over another. The approach provides a surrogate model that can directly be used as an MPC objective function. Numerical results show that we can learn objective functions that provide closed-loop trajectories that align with the expressed human preferences.
|
| |
| 10:15-10:30, Paper FrA16.4 | |
| Constraint Horizon in Model Predictive Control |
|
| Andre do Nascimento, Allan | University of Oxford |
| Wang, Han | University of Oxford, ETH Zurich |
| Papachristodoulou, Antonis | University of Oxford |
| Margellos, Kostas | University of Oxford |
Keywords: Predictive control for nonlinear systems, Optimal control
Abstract: In this work, we propose a Model Predictive Control (MPC) formulation incorporating two distinct horizons: a prediction horizon and a constraint horizon. This approach enables a deeper understanding of how constraints influence key system properties such as suboptimality, without compromising recursive feasibility and constraint satisfaction. In this direction, our contributions are twofold. First, we provide a framework to estimate closed-loop optimality as a function of the number of enforced constraints. This is a generalization of existing results by considering partial constraint enforcement over the prediction horizon. Second, when adopting this general framework under the lens of safety-critical applications, our method improves conventional Control Barrier Function (CBF) based approaches. It mitigates myopic behaviour in Quadratic Programming (QP)-CBF schemes, and resolves compatibility issues between Control Lyapunov Function (CLF) and CBF constraints via the prediction horizon used in the optimization. We show the efficacy of the method via numerical simulations for a safety critical application.
|
| |
| 10:30-10:45, Paper FrA16.5 | |
| Safe Navigation Using NMPC Based on a K-Invariant Set and Its Exploration Features |
|
| Zhao, Zhixin | University Paris Saclay |
| Girard, Antoine | CNRS |
| Olaru, Sorin | CentraleSupélec |
Keywords: Predictive control for nonlinear systems, Numerical algorithms
Abstract: This paper presents a framework for safe navigation in cluttered, unknown environments. The control layer employs a new Nonlinear Model Predictive Control (NMPC) framework based on K-invariant sets, which guarantees that the trajectories of nonlinear systems satisfy safety constraints despite the unknown environment. The recursive feasibility is guaranteed by ensuring the existence of a backup trajectory at each sampling instance. The backup trajectories return to K-invariant sets, which are designed offline but can be parameterized online by leveraging the system's translation and rotation invariance. The planning layer uses an improved convex lifting method, employing the limited information provided by the sensors to find a path. This method is both efficient and exhibits versatile exploration features. It generates an interconnected graph at each computation step, based on which the shortest path to a goal can be extracted, or alternatively, a shortest path to an unexplored region can be identified.
|
| |
| 10:45-11:00, Paper FrA16.6 | |
| Precision UAV Formation Control Via PGPE-Enhanced NMPC |
|
| Olivieri, Pierriccardo | Politecnico Di Milano |
| Sanchini, Andrea | Politecnico Di Milano |
| Spica, Riccardo | MBDA Italia |
| Gatti, Nicola | Politecnico Di Milano |
| Formentin, Simone | Politecnico Di Milano |
Keywords: Predictive control for nonlinear systems, Robotics, Reinforcement learning
Abstract: In formation control for unmanned aerial vehicles (UAVs), a fleet of drones is arranged in a predefined geometric configuration that must be maintained throughout the flight, while avoiding collisions with other drones and obstacles. In real-world applications, the need for quick deployment of UAV fleets often makes controller parameter tuning a significant challenge. In this paper, we introduce an end-to-end formation controller based on Nonlinear Model Predictive Control (NMPC), enhanced by a reinforcement learning algorithm for optimal hyperparameter tuning. Specifically, we adapt the Policy Gradient with Parameter-based Exploration (PGPE) algorithm to the formation control context. This method offers a fast and scalable solution for parameter tuning that does not require a differentiable controller and can be customized to the specific needs of the deployer. To validate our approach, we conduct simulation experiments using a realistic quadrotor model in a three-dimensional environment with static obstacles. Our results demonstrate the effectiveness and advantages of our method in comparison to state-of-the-art algorithms.
|
| |
| 11:00-11:15, Paper FrA16.7 | |
| Data-Enabled Predictive Control for Nonlinear Systems Based on a Koopman Bilinear Realization |
|
| Xiong, Zuxun | University of Oxford |
| Yuan, Zhenyi | University of California, San Diego |
| Miao, Keyan | University of Oxford |
| Wang, Han | University of Oxford, ETH Zurich |
| Cortes, Jorge | UC San Diego |
| Papachristodoulou, Antonis | University of Oxford |
Keywords: Data driven control, Nonlinear systems, Predictive control for nonlinear systems
Abstract: This paper extends the Willems' Fundamental Lemma to nonlinear control-affine systems using the Koopman bilinear realization. This enables us to bypass the Extended Dynamic Mode Decomposition (EDMD)-based system identification step in conventional Koopman-based methods and design controllers for nonlinear systems directly from data. Leveraging this result, we develop a Data-Enabled Predictive Control framework for nonlinear systems with unknown dynamics. A case study demonstrates that our direct data-driven control method achieves improved optimality compared to conventional Koopman-based methods. Furthermore, in examples where an exact Koopman realization with a finite-dimensional lifting function set of the controlled nonlinear system does not exist, our method exhibits advanced robustness to finite Koopman approximation errors compared to existing methods.
|
| |
| 11:15-11:30, Paper FrA16.8 | |
| MPC Strategies for Density Profile Control with Pellet Fueling in Nuclear Fusion Tokamaks under Uncertainty |
|
| Orrico, Christopher Anthony | Eindhoven University of Technology |
| Varadarajan, Hari Prasad | Eindhoven University of Technology |
| van Berkel, Matthijs | Dutch Institute for Fundamental Energy Research |
| Ceelen, Lennard | Dutch Institute for Fundamental Energy Research |
| Bosman, Thomas | DIFFER |
| Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
| Krishnamoorthy, Dinesh | Norwegian University of Science and Technology (NTNU) |
Keywords: Emerging control applications, Energy systems, Predictive control for linear systems
Abstract: Control of the density profile based on pellet fueling for the ITER nuclear fusion tokamak involves a multi-rate nonlinear system with safety-critical constraints, input delays, and discrete actuators with parametric uncertainty. To address this challenging problem, we propose a multi-stage MPC (msMPC) approach to handle uncertainty in the presence of mixed-integer inputs. While the scenario tree of msMPC accounts for uncertainty, it also adds complexity to an already computationally intensive mixed-integer MPC (MI-MPC) problem. To achieve real-time density profile controller with discrete pellets and uncertainty handling, we systematically reduce the problem complexity by (1) reducing the identified prediction model size through dynamic mode decomposition with control, (2) applying principal component analysis to reduce the number of scenarios needed to capture the parametric uncertainty in msMPC, and (3) utilizing the penalty term homotopy for MPC (PTH-MPC) algorithm to reduce the computational burden caused by the presence of mixed-integer inputs. We compare the performance and safety of the msMPC strategy against a nominal MI-MPC in plant simulations, demonstrating the first predictive density control strategy with uncertainty handling, viable for real-time pellet fueling in ITER.
|
| |
| FrA17 |
Capri IV |
| Stability of Nonlinear Systems III |
Regular Session |
| Chair: Shakib, Fahim | Imperial College London |
| Co-Chair: van den Eijnden, Sebastiaan | Eindhoven University of Technology |
| |
| 09:30-09:45, Paper FrA17.1 | |
| On Non-Euclidean Contraction Theory for Constrained Differential Inclusions |
|
| Ochoa, Daniel E. | University of California Santa Cruz |
| Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Nonlinear systems, Hybrid systems, Stability of nonlinear systems
Abstract: This paper develops methods for studying contraction properties of constrained differential inclusions (CDIs) in finite-dimensional settings using non-Euclidean norms and compatible regular pairings. We introduce uniform and non-uniform pre-contraction concepts for CDIs, generalize one-sided Lipschitz conditions for set-valued maps relative to constraint sets, and prove that CDIs satisfying these conditions exhibit suitable contraction properties. The analysis employs jointly continuous regular pairings and constructs surrogate time-varying measurable set-valued maps that, for any given solution to the CDI, select values of the flow-map compatible with the one-sided Lipschitz inequality relative to that solution. This construction guarantees that for each solution there exists a contractive counterpart.
|
| |
| 09:45-10:00, Paper FrA17.2 | |
| Exploiting Structure in MIMO Scaled Graph Analysis |
|
| de Groot, Timo | Technische Universiteit Eindhoven |
| Oomen, Tom | Eindhoven University of Technology |
| van den Eijnden, Sebastiaan | Eindhoven University of Technology |
Keywords: Stability of nonlinear systems, Nonlinear systems, LMIs
Abstract: Scaled relative graphs offer a graphical tool for analysis of nonlinear feedback systems. Of specific interest for stability analysis is the scaled graph, a special case of the scaled relative graph, related to non-incremental system properties. Although recently substantial progress has been made in scaled graph analysis, at present their use in multivariable feedback systems is limited by conservatism. In this paper, we aim to reduce this conservatism by introducing multipliers and exploit system structure in the analysis with scaled graphs. In particular, we use weighted inner products to arrive at a weighted scaled graph and combine this with a commutation property to formulate a stability result for multivariable feedback systems. We present a method for computing the weighted scaled graph of Lur'e systems based on solving sets of linear matrix inequalities, and demonstrate a significant reduction in conservatism through an example.
|
| |
| 10:00-10:15, Paper FrA17.3 | |
| Global Exponential Stability for Discrete-Time Nonlinear Systems Using SDC Parametrization |
|
| Cusimano, Valerio | CNR-IASI, Italian National Research Council - Institute for Syst |
| Cacace, Filippo | Università Campus Biomedico Di Roma |
| d'Angelo, Massimiliano | Università Mercatorum |
| Germani, Alfredo | Universita' Dell'Aquila |
Keywords: Stability of nonlinear systems, Lyapunov methods, Algebraic/geometric methods
Abstract: Since the mid-1990s, State-Dependent Riccati Equation (SDRE) strategies have gained prominence as systematic and effective methodologies for designing nonlinear controllers, observers, and filters. These approaches address many of the limitations inherent in traditional techniques while offering computationally efficient algorithms that have demonstrated remarkable success across a wide range of practical and impactful applications. In this paper, we leverage the State-Dependent-Coefficient (SDC) parametrization to establish a novel global asymptotic (and exponential) stability result for discrete-time nonlinear systems with a controllable SDC representation. The proof is based on geometrical tools and spectral decompositions. Numerical simulations validate the results.
|
| |
| 10:15-10:30, Paper FrA17.4 | |
| State Dimension Reduction of Recurrent Equilibrium Networks with Contraction and Robustness Preservation |
|
| Shakib, Fahim | Imperial College London |
Keywords: Model/Controller reduction, Reduced order modeling, Stability of nonlinear systems
Abstract: Recurrent equilibrium networks (RENs) are effective for learning the dynamics of complex dynamical systems with certified contraction and robustness properties through unconstrained learning. While this opens the door to learning large-scale RENs, deploying such large-scale RENs in real-time applications on resource-limited devices remains challenging. Since a REN consists of a feedback interconnection of linear time-invariant (LTI) dynamics and static activation functions, this article proposes a projection-based approach to reduce the state dimension of the LTI component of a trained REN. One of the two projection matrices is dedicated to preserving contraction and robustness by leveraging the already-learned REN contraction certificate. The other projection matrix is iteratively updated to improve the accuracy of the reduced-order REN based on necessary h2-optimality conditions for LTI model reduction. Numerical examples validate the approach, demonstrating significant state dimension reduction with limited accuracy loss while preserving contraction and robustness.
|
| |
| 10:30-10:45, Paper FrA17.5 | |
| Stabilization of Nonlinear Parameter Varying Systems Using LMIs: An Observer-Based Controller Framework |
|
| Mohite, Shivaraj | Research Assitant, RPTU, Kaiserslautern, Germany |
| Alma, Marouane | CRAN Lorraine University |
| Liu, Steven | University of Kaiserslautern Landau |
Keywords: Linear parameter-varying systems, Stability of nonlinear systems, Observers for nonlinear systems
Abstract: This paper investigates the observer-based stabilization problem for a class of disturbance-affected Nonlinear Parameter-Varying (NLPV) systems. We introduce a novel observer-controller framework that leverages Linear Matrix Inequality (LMI) conditions to guarantee robust performance. Unlike previous works that focus solely on state estimation, our approach integrates a controller based on the estimated states to ensure asymptotic stability and optimal disturbance attenuation. Through the deployment of the parameter-dependent Lyapunov (PDL) function and mathcal{H}_infty criterion, a new LMI condition is synthesized to achieve the objective. Further, due to the judicious use of a variant of Young's inequality, the reformulated Lipschitz property, and the matrix multiplier method presented in our previous works, the established LMI encompasses extra degrees of freedom from a feasibility point of view. The performance of the method is validated through simulations on a nonlinear time-varying pendulum system, demonstrating enhanced noise attenuation and faster convergence compared to existing approaches.
|
| |
| 10:45-11:00, Paper FrA17.6 | |
| Contraction Analysis of Almost Surely Monotone Discrete-Time Systems with Unknown Time Delay |
|
| Kawano, Yu | Hiroshima University |
| Hosoe, Yohei | Kyoto University |
Keywords: Stability of nonlinear systems, Markov processes, Delay systems
Abstract: In this paper, we utilize monotonicity to simplify contraction analysis of discrete-time nonlinear time-delay systems with parameters following stochastic processes. First, we extend the concept of almost sure monotonicity to the time-delay systems, which can be verified via analysis of the corresponding prolonged systems. Next, we introduce a novel notion of uniform incremental asymptotic stability (UIAS) in the first moment to the time-delay systems and develop its sufficient condition. By virtue of almost sure monotonicity, if this condition holds, the time-delay systems are UIAS in the first moment for any finite length of delay. Moreover, the proposed result suggests that UIAS in the first moment has a strong connection with stability of the averaged deterministic systems. We formalize this observation for almost surely cooperative systems by establishing the equivalence between UIAS and uniform incremental asymptotic mean stability.
|
| |
| 11:00-11:15, Paper FrA17.7 | |
| Harnessing Membership Function Dynamics for Local Stability Analysis and Estimation of Domains of Attraction for Discrete-Time T-S Fuzzy Systems |
|
| Quilles Marinho, Yara | Korea Advanced Institute of Science and Technology |
| Lee, Donghwan | KAIST |
| Oliveira, Ricardo C. L. F. | University of Campinas - UNICAMP |
| Peres, Pedro L. D. | University of Campinas |
Keywords: LMIs, Stability of nonlinear systems, Fuzzy systems
Abstract: This letter investigates the stability analysis of discrete-time Takagi-Sugeno fuzzy systems by incorporating the dynamics of membership functions (MFs) into an augmented state vector. Using a polytopic representation for the MFs, a novel Lyapunov function that enables the derivation of less conservative linear matrix inequality-based stability conditions is proposed. Unlike existing approaches, the proposed method eliminates the need for predefined MF variation bounds, leveraging their inherent dynamics to refine stability analysis and domain of attraction (DOA) estimation. Numerical examples demonstrate the effectiveness of the proposed approach in reducing conservatism and enlarging the estimated DOA compared to existing methods.
|
| |
| 11:15-11:30, Paper FrA17.8 | |
| Leaky-Integrator Echo State Network Incremental ISS Stability Analysis |
|
| Deng, Hao | University Paris-Saclay |
| Stoica, Cristina | CentraleSupélec/L2S, Univ. Paris-Saclay |
| Chadli, M. | University of Paris-Saclay - UEVE |
Keywords: LMIs
Abstract: This paper proposes a novel incremental input-to-state stability condition for a discrete-time leaky-integrator echo state network. The derived condition is further utilized for control design through Linear Matrix Inequalities (LMIs). The corresponding observer design LMI condition is also derived. A numerical simulation showcases the effectiveness of the proposed approach.
|
| |
| FrA18 |
Aruba I+II+III |
| Hybrid Systems |
Regular Session |
| Chair: Sanfelice, Ricardo G. | University of California at Santa Cruz |
| Co-Chair: Poveda, Jorge I. | University of California, San Diego |
| |
| 09:30-09:45, Paper FrA18.1 | |
| On Sufficient Lyapunov Conditions for Fixed-Time Stability of Hybrid Dynamical Systems |
|
| Tang, Michael | University of California, San Diego |
| Krstic, Miroslav | University of California, San Diego |
| Poveda, Jorge I. | University of California, San Diego |
Keywords: Stability of hybrid systems, Lyapunov methods, Hybrid systems
Abstract: We study the property of fixed-time stability (FxTS) for hybrid dynamical systems (HDS) that combine continuous-time and discrete-time dynamics, possibly modeled via set-valued maps. For such systems, we provide sufficient Lyapunov characterizations for certifying FxTS of general closed sets. In this work, FxTS of a closed set is defined to require two properties: Lyapunov stability and fixed time convergence. For fixed time convergence, the system's trajectories must converge to the set of interest within a finite hybrid time that is uniformly bounded over all initializations. Our main contributions are two-fold: we first show that if the HDS admits solutions that flow for sufficiently long time, as well as a suitable Lyapunov function with a sufficient ``fixed-time'' decrease along flows and nonincrease during jumps, then the HDS will exhibit FxTS. Afterwards, we extend this result by showing that, under a mild dwell time condition, FxTS can be established for HDS for which the Lyapunov function is allowed to mildly increase during jumps. While our theoretical results pave the way for the general analysis of a broad class of fixed-time stable hybrid algorithms and systems, we illustrate them using two representative cases: a hybrid system with dwell-time conditions on the jumps, and systems exhibiting arbitrarily fast switching between a finite number of fixed-time stable vector fields.
|
| |
| 09:45-10:00, Paper FrA18.2 | |
| Optimal Pulse Patterns through a Hybrid Optimal Control Perspective |
|
| Miller, Jared | University of Stuttgart |
| Karamanakos, Petros | Tampere University |
Keywords: Power electronics, Algebraic/geometric methods, Hybrid systems
Abstract: Optimal pulse patterns (OPPs) are a modulation method in which the switching angles and levels of a switching signal are computed via an offline optimization procedure to minimize a performance metric, typically the harmonic distortions of the load current. Additional constraints can be incorporated into the optimization problem to achieve secondary objectives, such as the limitation of specific harmonics or the reduction of power converter losses. The resulting optimization problem, however, is highly nonconvex, featuring a trigonometric objective function and constraints as well as both real- and integer-valued optimization variables. This work casts the task of OPP synthesis for a multilevel inverter as an optimal control problem of a hybrid system. This problem is in turn lifted into a convex but infinite-dimensional conic program of occupation measures using established methods in convex relaxations of optimal control. Lower bounds on the minimum achievable harmonic distortion are acquired by solving a sequence of semidefinite programs via the moment-sum-of-squares hierarchy, where each semidefinite program scales in a jointly linear manner with the numbers of permitted switching transitions and inverter voltage levels.
|
| |
| 10:00-10:15, Paper FrA18.3 | |
| Inverse-Optimal Input-To-State Stabilizing Control for Hybrid Systems |
|
| Montenegro Gonzalez, Carlos | University of California, Santa Cruz |
| J. Leudo, Santiago | University of California, Santa Cruz |
| Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Hybrid systems, Lyapunov methods, Stability of hybrid systems
Abstract: This paper studies the problem of control design methods to stabilize a hybrid system under disturbances as an inverse-optimality problem. Sufficient conditions are given to guarantee input-to-state stability of a hybrid system with disturbances only. Next, it is shown that a hybrid system, with inputs and disturbances, can be rendered input-to-state controlled stable under the existence of a control Lyapunov function (CLF) using pointwise minimum norm (min-norm) stabilizing feedback laws. Finally, it is proven that every CLF is a meaningful value function for a two-player zero-sum hybrid game in the context of stability, and that every pointwise min-norm stabilizing control feedback law is optimal for such a game with meaningful cost functionals. The main results are illustrated in a one-degree-of-freedom juggling system.
|
| |
| 10:15-10:30, Paper FrA18.4 | |
| Towards Global Stabilization of a Hovercraft Model Using Hybrid Systems and Discontinuous Feedback Laws |
|
| Ballaben, Riccardo | University of Trento |
| Astolfi, Alessandro | Imperial College & Univ. of Rome |
| Braun, Philipp | The Australian National University |
| Zaccarian, Luca | LAAS-CNRS |
Keywords: Maritime control, Hybrid systems, Lyapunov methods
Abstract: In this paper we propose discontinuous control laws to globally stabilize a target position for a hovercraft. Equations of motion of the hovercraft are derived through a kinematic model approximation of a dynamic model taken from the literature. Discontinuous feedback laws for the kinematic and the dynamic model are derived and analyzed using a hybrid systems formulation of the closed-loop dynamics. Numerical simulations confirm the theoretical results and validate the kinematic model as a accurate simplification of the dynamic model.
|
| |
| 10:30-10:45, Paper FrA18.5 | |
| A Hybrid Orbital Stabilizer with Guaranteed Basin of Attraction for Mechanical Systems with Underactuation Degree One |
|
| Dias Navarro, Luiz | University of Toronto |
| Maggiore, Manfredi | University of Toronto |
Keywords: Robotics, Stability of hybrid systems, Algebraic/geometric methods
Abstract: We enhance a recently proposed hybrid controller that asymptotically stabilizes a class of closed orbits (so-called oscillations) for mechanical control systems with underactuation degree one. The controller in question enforces virtual holonomic constraint (VHCs) within a parametric family and instantiates new VHCs at certain events so as to asymptotically stabilize the target orbit. In this paper we propose a novel feedback mechanism in the jump map of the hybrid controller ensuring the asymptotic stabilization of oscillations with guaranteed basin of attraction. We demonstrate the new controller with a four degrees-of-freedom robot mimicking a child on a swing. For this robot, we are able to stabilize a large range of oscillations with guaranteed basin of attraction.
|
| |
| 10:45-11:00, Paper FrA18.6 | |
| Geometric Hybrid Dynamical Systems on Intrinsic Manifolds: Part I - Framework and Hybrid Lyapunov Theorem |
|
| Jirwankar, Piyush Prabhakar | University of California Santa Cruz |
| Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Stability of hybrid systems, Algebraic/geometric methods
Abstract: This paper develops the hybrid Lyapunov theorem for geometric hybrid dynamical systems, namely hybrid inclusions evolving on intrinsic C1-manifolds. We present various nonsmooth analysis notions, which aid in the formulation of sufficient conditions for existence of solutions. We also present topological definitions of uniform global stability, attractivity, and asymptotic stability of a nonempty, compact set A. Finally, the Lyapunov theorem provides relaxed conditions for uniform global asymptotic stability of the set A. An example illustrates the concepts and results.
|
| |
| 11:00-11:15, Paper FrA18.7 | |
| Geometric Hybrid Dynamical Systems on Intrinsic Manifolds: Part II - Hybrid Invariance Principle |
|
| Jirwankar, Piyush Prabhakar | University of California Santa Cruz |
| Ochoa, Daniel E. | University of California Santa Cruz |
| Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Stability of hybrid systems, Algebraic/geometric methods
Abstract: This paper establishes a hybrid invariance principle for geometric hybrid dynamical systems, modeled as hybrid inclusions, evolving on intrinsic C1-manifolds. Building on previous results on Lyapunov stability for hybrid systems presented in the companion paper [1], we introduce the theoretical tools required for invariance-based stability analysis. These include set-convergence concepts for manifolds, characterization of nominal well-posedness, and analysis of ω-limit set properties for precompact solutions. Our variational analysis approach preserves the intrinsic geometric structure of the underlying space while handling set-valued dynamics without requiring uniqueness of solutions. This theory particularly addresses challenges in disconnected manifolds and quotient spaces.
|
| |
| 11:15-11:30, Paper FrA18.8 | |
| Observer-Based Control for Lur'e Systems under Aperiodic Sampling |
|
| Gomes da Silva Jr, Joao Manoel | Universidade Federal Do Rio Grande Do Sul |
| Ferrante, Francesco | Universita Degli Studi Di Perugia |
| Palmeira, Alessandra Helena Kimura | UFRGS |
Keywords: Hybrid systems, LMIs, Sampled-data control
Abstract: This paper focuses on observer-based control for Lur’e type systems with slope bounded nonlinearities under aperiodic sampling. In this setting, we design a hybrid observer that provides an estimate of the state based only on aperiodic samples of the plant output. The estimate provided by the observer is used in a sampled-data feedback control architecture. By introducing a timer state, the closed-loop system is modeled as a hybrid dynamical system. By using a quadratic timer-dependent Lyapunov function, we provide sufficient conditions in the form of timer-dependent inequalities ensuring closed-loop asymptotic stability. By relying on an affine dependence on the timer variable, these conditions are turned into a finite collection of linear matrix inequalities that can be used to design the observer gain. A numerical example illustrates the application of the proposed results.
|
| |
| FrA19 |
Ibiza IV |
| Optimal Control VII |
Regular Session |
| Chair: Mangharam, Rahul | University of Pennsylvania |
| Co-Chair: Nie, Yuanbo | University of Sheffield |
| |
| 09:30-09:45, Paper FrA19.1 | |
| Reliable Solution to Dynamic Optimization Problems Using Integrated Residual Regularized Direct Collocation |
|
| Nie, Yuanbo | University of Sheffield |
| Kerrigan, Eric C. | Imperial College London |
Keywords: Optimal control, Numerical algorithms, Predictive control for nonlinear systems
Abstract: Direct collocation (DC) is a widely used method for solving dynamic optimization problems (DOPs), but its implementation simplicity and computational efficiency are limited for challenging problems. For DOPs involving singular arcs, DC solutions often exhibit significant fluctuations along the singular arc, accompanied by large residual errors between collocation points, where the dynamic constraints are enforced as equality constraints. In this paper, we introduce the direct transcription method of integrated residual regularized direct collocation (IRRDC). This approach enforces dynamic constraints using a combination of point-wise residual constraints (expressed as either equalities or inequalities) and a penalty term on the integrated residual error, which helps reduce errors between collocation points. IRRDC retains the implementation simplicity of DC while improving both solution accuracy and efficiency, particularly for challenging problem types. Through the examples, we demonstrate that for problems where traditional DC results in excessive fluctuations, IRRDC effectively suppresses fluctuations and yields solutions with greater accuracy - at least two orders of magnitude lower in various error measures in relation to the dynamic and path constraints.
|
| |
| 09:45-10:00, Paper FrA19.2 | |
| Bang–Bang Optimal Control of Vaccination in Metapopulation Epidemics with Linear Cost Structures |
|
| Machado Moschen, Lucas | Imperial College London |
| Aronna, María Soledad | Fundação Getulio Vargas |
Keywords: Optimal control, Constrained control, Systems biology
Abstract: In this letter, we study optimal vaccination in a heterogeneous metapopulation epidemic model with state and mixed control-state constraints and a linear cost on the control, which better reflects real‐world expenses compared to quadratic penalties. Within a general SIR / SEIR K-patch framework, we formulate the resulting control problem on a finite horizon, prove the existence of optimal control, and derive the necessary conditions via Pontryagin's maximum principle extended to handle both shipment and per-region capacity limits. We use Pontryagin's result to rule out the existence of singular arcs and obtain a complete characterization of the optimal policy as bang-bang, with at most one switching time per region per week. Numerical experiments on networks of 3 to 8 cities confirm that the predicted switching structure arises from the linear cost assumption, and we compare these findings with optimal quadratic-cost policies.
|
| |
| 10:00-10:15, Paper FrA19.3 | |
| Minimax Optimistic Planning for Continuous-Action Noncooperative Sequential Control |
|
| Bogdan, Sever | Technical University of Cluj-Napoca |
| Herzal, Radu | Technical University of Cluj-Napoca |
| Satheeskumar Varma, Vineeth | CNRS |
| Busoniu, Lucian | Technical University of Cluj-Napoca |
Keywords: Optimal control, Numerical algorithms
Abstract: An adversarial setting is considered where two agents influence in turn a dynamical system using continuous, scalar actions. One agent seeks to maximize an infinite-horizon value, while the other aims to minimize it, leading to a sequential noncooperative game. To search for the minimax-optimal solution, we introduce Minimax Optimistic Planning for Continuous actions (MOPC), an algorithm that iteratively refines the most promising regions of the space of action sequences. Our analysis shows that MOPC provides computable bounds on the solution quality. Furthermore, we prove these bounds converge at well-characterized rates as computation grows, where the convergence rates are driven by a measure of problem complexity. We show how to pose in our framework -- and thereby near-optimally solve -- a class of problems in which opinions in a social network are influenced by two competing marketers. In experiments on such problems, MOPC outperforms uniform allocation over time.
|
| |
| 10:15-10:30, Paper FrA19.4 | |
| Quattro: Transformer-Accelerated Iterative Linear Quadratic Regulator Framework for Fast Trajectory Optimization |
|
| Wang, Yue | University of Southampton |
| Wang, Haoyu | University of Southampton |
| Li, Zhaoxing | University of Southampton |
Keywords: Optimal control, Neural networks, Embedded systems
Abstract: Real-time optimal control remains a fundamental challenge in robotics, especially for nonlinear systems with stringent performance requirements. As one of the representative trajectory optimization algorithms, the iterative Linear Quadratic Regulator (iLQR) faces limitations due to their inherently sequential computational nature, which restricts the efficiency and applicability of real-time control for robotic systems. While existing parallel implementations aim to overcome the above limitations, they typically demand additional computational iterations and high-performance hardware, leading to only modest practical improvements. In this paper, we introduce Quattro, a transformer-accelerated iLQR framework employing an algorithm-hardware co-design strategy to predict intermediate feedback and feedforward matrices. It facilitates effective parallel computations on resource-constrained devices without sacrificing accuracy. Experiments on cart-pole and quadrotor systems show an algorithm-level acceleration of up to 5.3x and 27x per iteration, respectively. When integrated into a Model Predictive Control (MPC) framework, Quattro achieves overall speedups of 2.8x for the cart-pole and 17.8x for the quadrotor compared to the one that applies traditional iLQR. Transformer inference is deployed on FPGA to maximize performance, achieving further up to 20.8x speedup over prevalent embedded CPUs with over 11x power reduction than GPU and low hardware resource overhead.
|
| |
| 10:30-10:45, Paper FrA19.5 | |
| Robust Global Exponential Attitude Tracking on SO(3) Via Optimal MRP-Based Hybrid Feedback |
|
| Martins, Luis | Instituto Superior Técnico |
| Cardeira, Carlos | IDMEC/Instituto Superior Tecnico |
| Oliveira, Paulo | Instituto Superior Técnico |
Keywords: Hybrid systems, Stability of nonlinear systems, Optimal control
Abstract: This work presents an innovative robust attitude tracking controller for fully actuated rigid bodies. The approach relies on the modified Rodrigues parameters (MRP) description, which is a double covering of the three-dimensional rotation group SO(3), and the feedback linearization technique to formulate a Lyapunov-based quadratic programming problem. The resulting optimal controller ensures the exponential decay of the attitude tracking errors with minimum control effort and operates within a hybrid system structure to overcome the topological obstructions of the rotation group, rendering the attitude tracking dynamics globally exponentially stable on the MRP space. By combining this MRP-based optimal controller with a hybrid dynamic path-lifting mechanism, this novel approach yields an equivalent tracking result on SO(3). Furthermore, the resulting closed-loop system is nominally robust to perturbations, including external disturbances, parameter variations, or measurement noise. Numerical simulations demonstrate the performance and efficiency of the optimal hybrid strategy.
|
| |
| 10:45-11:00, Paper FrA19.6 | |
| Adversarially Robust Optimal Safe Predefined-Time Stabilization: A Game-Theoretic Approach |
|
| Kokolakis, Nick-Marios T. | University of Pennsylvania |
| Mangharam, Rahul | University of Pennsylvania |
Keywords: Constrained control, Stability of nonlinear systems, Optimal control
Abstract: Safe predefined-time stability characterizes parameter-dependent nonlinear dynamical systems whose trajectories starting in a given set of admissible states remain in the set of admissible states and converge to an equilibrium point in a predefined time. In this paper, we develop a game-theoretic control framework to address a robust optimal safe predefined-time stabilization problem for parameter-dependent nonlinear affine dynamical systems subject to an adversary with nonquadratic cost functionals. In particular, the robust optimal safe predefined-time stabilization problem is formulated as a two-player zero-sum differential game, wherein the controller is a minimizing player and the adversary is a maximizing player. Sufficient conditions for the existence of a saddle-point solution to the zero-sum game and closed-loop system safe predefined-time stability are derived. Specifically, safe predefined-time stability of the closed-loop system is guaranteed via a barrier Lyapunov function satisfying a differential inequality while serving as a solution to the steady-state Hamilton-Jacobi-Isaacs equation ensuring Nash equilibrium. Finally, an illustrative numerical example is provided to demonstrate the efficacy of the developed framework.
|
| |
| 11:00-11:15, Paper FrA19.7 | |
| Agile Temporal Discretization for Symbolic Optimal Control |
|
| Janssens, Adrien | UCLouvain |
| Banse, Adrien | UCLouvain |
| Calbert, Julien | UCLouvain |
| Jungers, Raphaël M. | University of Louvain |
Keywords: Formal Verification/Synthesis, Optimal control, Hybrid systems
Abstract: As control systems grow in complexity, abstraction-based methods have become essential for designing controllers with formal guarantees. However, a key limitation of these methods is their reliance on discrete-time models, typically obtained by discretizing continuous-time systems with a fixed timestep. This discretization leads to two major problems: when the timestep is small, the abstraction includes numerous stuttering and spurious trajectories, making controller synthesis suboptimal or even infeasible; conversely, a large time step may also render control design infeasible due to a lack of flexibility. In this work, drawing inspiration from Reinforcement Learning concepts, we introduce temporal abstractions, which allow for a flexible timestep. We provide a method for constructing such abstractions and formally establish their correctness in controller design. Furthermore we show how to apply these to optimal control under reachability specifications. Finally we showcase our methods on two numerical examples, highlighting that our approach leads to controllers that achieve a lower worst-case control cost.
|
| |
| 11:15-11:30, Paper FrA19.8 | |
| Optimal Control of Hybrid Systems Via Measure Relaxations |
|
| Buehrle, Etienne | Karlsruhe Institute of Technology |
| Tas, Ömer Sahin | Karlsruhe Institute of Technology (KIT) |
| Stiller, Christoph | University of Karlsruhe |
Keywords: Hybrid systems, LMIs, Formal Verification/Synthesis
Abstract: We propose an approach to trajectory optimization for piecewise polynomial systems based on the recently proposed graphs of convex sets framework. We instantiate the framework with a convex formulation of optimal control based on occupation measures, resulting in a convex relaxation resembling the discrete shortest-paths linear program that can be solved efficiently to global optimality. This approach improves scalability to large numbers of discrete modes compared to the NP-hard mixed- integer formulation. We use this to plan trajectories under linear temporal logic specifications, comparing the computed cost lower bound to a nonconvex optimization approach with fixed mode sequence. In our numerical experiments, we find that this bound is typically in the vicinity of the nonconvex solution. While the method inherits the limitations of semidefinite programming, the runtime speedup is significant compared to the often intractable mixed-integer formulation. Our implementation is available at https://github.com/ebuehrle/hpoc.
|
| |
| FrA20 |
Asia I+II+III+IV |
| Distributed Multi-Armed Bandits |
Tutorial Session |
| Chair: Liu, Ji | Stony Brook University |
| Co-Chair: Liu, Ji | Stony Brook University |
| Organizer: Liu, Ji | Stony Brook University |
| Organizer: Suttle, Wesley | Stony Brook University |
| Organizer: Koppel, Alec | JP Morgan Chase |
| |
| 09:30-11:30, Paper FrA20.1 | |
| A Tutorial on Distributed Multi-Armed Bandits (I) |
|
| Liu, Ji | Stony Brook University |
Keywords: Cooperative control, Reinforcement learning, Resilient Control Systems
Abstract: This tutorial paper provides an overview of distributed multi-armed bandit problems in networks of multiple agents. Each agent repeatedly selects an action from a fixed set of choices and receives a random reward, aiming to balance exploration and exploitation. The agents are allowed to communicate only with their neighbors, where the neighbor relations are described by a possibly time-varying graph. Two main settings are considered. Two main settings are discussed. In the first setting, agents observe identical reward distributions for each action. We present two distributed algorithms based on the classical UCB and KL-UCB methods. It is shown that each agent can achieve a lower logarithmic regret compared to the standard single-agent case, as long as the agent has at least one neighbor and the communication graph is strongly connected. The improvement in regret is related to the size of the agent's local neighborhood and the structure of the network. These algorithms can be further modified to be fully resilient to adversarial agents who may inject untrustworthy information, using communication redundancy. In the second setting, agents observe different reward distributions for the same actions. The goal is to minimize cumulative expected regret with respect to the true rewards, defined as the average of all agents' mean rewards. A distributed algorithm is introduced that guarantees optimal regret performance for each agent when the network remains jointly connected over time. All proposed algorithms operate in a fully distributed manner and do not require any global knowledge of the network. This tutorial highlights the theoretical foundations, algorithmic designs, and resilience properties of distributed bandit algorithms, with an emphasis on performance guarantees under limited communication and local observations. Open problems and possible future directions are also discussed.
|
| |
| FrB01 |
Galapagos I |
| Quantum Estimation, Control and Learning |
Invited Session |
| Chair: Dong, Daoyi | Australian National University |
| Co-Chair: Amini, Nina H. | CNRS, L2S, CentraleSupelec |
| Organizer: Wang, Yuanlong | Chinese Academy of Sciences |
| Organizer: Dong, Daoyi | Australian National University |
| Organizer: Amini, Nina H. | CNRS, L2S, CentraleSupelec |
| Organizer: Qi, Bo | CAS |
| |
| 14:00-14:15, Paper FrB01.1 | |
| Gradient-Based Optimization for Linear Quantum Systems: Applications to LQG Control and Ground-State Problems (I) |
|
| Lee, Yunyan | Australian National University |
| Petersen, Ian R. | Australian National University |
| Dong, Daoyi | Australian National University |
Keywords: Quantum information and control, Optimization algorithms, Stochastic optimal control
Abstract: This paper presents a gradient-based optimization approach to solve optimal control problems in linear quantum systems. Utilizing an algebraic Lyapunov equation, we derive the partial derivatives of the cost function with respect to the steady-state covariance matrix. Furthermore, we formulate the gradient for the cost function when applied to a coherent quantum Linear Quadratic Gaussian (LQG) control problem. Through numerical simulations, we verify that our method effectively solves the LQG control problem.
|
| |
| 14:15-14:30, Paper FrB01.2 | |
| Efficient Control Design in Open Quantum Systems under Parametric Uncertainty (I) |
|
| Fan, Yidian | Tsinghua University |
| Wu, Re-Bing | Tsinghua University |
Keywords: Quantum information and control, Robust control, Uncertain systems
Abstract: Robust control design in open quantum systems under parametric uncertainty is essential for practical quantum applications. In this work, we propose an efficient algorithm for identifying robust control solutions in high-dimensional quantum systems. The algorithm reduces the computational complexity by utilizing the Suzuki-Trotter expansion to accelerate the solution of system dynamics. Our simulation results demonstrate that the proposed algorithm can achieve control solutions of high fidelity and robustness with improved efficiency.
|
| |
| 14:30-14:45, Paper FrB01.3 | |
| Quantum State and Detector Tomography through Closed and Open Quantum Systems (I) |
|
| Xiao, Shuixin | University of Melbourne |
| Wang, Yuanlong | Chinese Academy of Sciences |
| Petersen, Ian R. | Australian National University |
| Dong, Daoyi | Australian National University |
Keywords: Quantum information and control
Abstract: The estimation of all the parameters in an unknown quantum state or measurement device, known as quantum state tomography (QST) and quantum detector tomography (QDT), is fundamental to the characterization and control of quantum systems. This paper presents a unified state space framework for QST and QDT based on observable measurement results at different evolution moments, applicable to both closed and Markovian open quantum systems. We derive lower bounds on the number of sampling points required for a unique estimation and analyze the computational complexity and mean squared error (MSE) scaling. As the estimator may yield unphysical results, we introduce new correction techniques to enforce physicality, which may also improve the infidelity scaling. The effectiveness of the proposed methods is validated through numerical simulations.
|
| |
| 14:45-15:00, Paper FrB01.4 | |
| Information Control and Online Estimation for Unital Quantum Systems (I) |
|
| Clouatre, Maison | Massachusetts Institute of Technology (MIT) |
| Marano, Stefano | University of Salerno |
| Win, Moe Z. | Massachusetts Institute of Technology (MIT) |
Keywords: Quantum information and control
Abstract: Information control and online estimation are studied in this paper, focusing on unital quantum systems. To infer the value of an unknown parameter characterizing a quantum state, control is applied to the quantum system and then a measurement is taken. To optimize the estimation procedure, control is selected with the aim of maximizing the Fisher information, about the unknown parameter, extracted from the measurement. We introduce the notion of ``controlled Fisher information (CFI)’’ to describe the maximum Fisher information attainable by control. A method for computing the CFI is designed and an analytic upper bound on the CFI is derived. Necessary and sufficient conditions for the admissibility (i.e., parameter independence) of controls achieving the CFI are given. For scenarios in which the admissibility conditions are not met, we devise an online joint estimation and control algorithm that, with high probability, achieves the CFI after a sufficient number of measurements.
|
| |
| 15:00-15:15, Paper FrB01.5 | |
| Ergodic Properties of Quantum Markov Semigroups (I) |
|
| Mousset, Nicolas | CentraleSupélec, Université Paris-Saclay |
| Amini, Nina H. | CNRS, L2S, CentraleSupelec |
Keywords: Quantum information and control, Markov processes
Abstract: In this paper, we study the ergodic theorem for infinite-dimensional quantum Markov semigroups, originally introduced by Frigerio and Verri in 1982, and its latest version developed by Carbone and Girotti in 2021. We provide a sufficient condition that ensures exponential convergence towards the positive recurrent subspace, a well-known result for irreducible quantum Markov semigroups in finite-dimensional Hilbert spaces. Several illustrative examples are presented to demonstrate the application of the ergodic theorem. Moreover, we show that the positive recurrent subspace plays a crucial role in the study of global asymptotic stability.
|
| |
| 15:15-15:30, Paper FrB01.6 | |
| Controlling Ensemble States of a Quantum System |
|
| Kaushik, Vishesh | Indian Institute of Technolog Bombay |
| Khaneja, Navin | IIT Bombay |
Keywords: Quantum information and control, Emerging control applications
Abstract: We develop a method for controlling the ensemble states of a two-level quantum system on the Bloch sphere. A spectrum of resonance frequencies parametrizes the ensemble. The evolution of the ensemble states under externally applied controls is uniquely determined by their respective resonance frequencies. Therefore, the control fields must be designed to simultaneously evolve the ensemble states across the domain of resonance frequencies. The objective is to evolve the discretized quantum ensemble states from an arbitrary initial dispersion to a targeted final configuration on the Bloch sphere. The proposed method holds potential for broad applicability in areas such as nuclear magnetic resonance (NMR), electron spin resonance (ESR), quantum computing and information processing, sampling-based learning control (SLC), quantum sensing and metrology, and quantum error correction.
|
| |
| 15:30-15:45, Paper FrB01.7 | |
| On the Robustness of Feedback Stabilization for Discrete-Time Quantum Trajectories Beyond QND Measurements |
|
| Bompais, Mael | University of Nottingham |
| Amini, Nina H. | CNRS, L2S, CentraleSupelec |
Keywords: Quantum information and control, Markov processes, Lyapunov methods
Abstract: In this paper, we present several refinements of the results in [1] concerning feedback stabilization of invariant subspaces of quantum trajectories. First, we consider a more realistic dynamics that accounts for potential measurement imperfections. Second, the strong controllability assumption can be replaced by a simpler and more natural condition whenever the channel is primitive on its minimal invariant subspaces. Finally, we show that the control strategy can be based on the estimated trajectory (starting from an estimated initial state) while still ensuring exponential stabilization towards the target subspace at the same rate, thereby demonstrating the robustness of the approach.
|
| |
| 15:45-16:00, Paper FrB01.8 | |
| Control-Enhanced Quantum Metrology Guided by Pontryagin's Minimum Principle |
|
| Hu, Shouliang | Australian National University |
| Ma, Hailan | University of New South Wales |
| Lee, Yunyan | Australian National University |
| Dong, Daoyi | Australian National University |
| Petersen, Ian R. | Australian National University |
Keywords: Quantum information and control
Abstract: Quantum metrology utilizes quantum resources to enhance the precision of parameter estimation. A central objective is to maximize the Quantum Fisher information (QFI), which determines the ultimate estimation precision based on the Cram´er-Rao bound. In this work, we formulate QFI optimization as an optimal control problem and apply Pontryagin's Minimum Principle (PMP) to derive the optimal control protocol. We show that when the control amplitude is constrained below a certain threshold, the optimal solution reduces to a constant control. This strategy enables the QFI to scale quadratically with the total evolution time T, thereby improving estimation precision with longer probe durations.
|
| |
| FrB02 |
Oceania II |
| Data-Driven Verification and Control with Provable Guarantees II |
Invited Session |
| Chair: Lavaei, Abolfazl | Newcastle University |
| Co-Chair: Nejati, Amy | Newcastle University |
| Organizer: Lavaei, Abolfazl | Newcastle University |
| Organizer: Nejati, Amy | Newcastle University |
| Organizer: Jungers, Raphaël M. | University of Louvain |
| Organizer: Abate, Alessandro | University of Oxford |
| |
| 14:00-14:15, Paper FrB02.1 | |
| Kernel-Based Error Bounds of Bilinear Koopman Surrogate Models for Nonlinear Data-Driven Control |
|
| Strässer, Robin | University of Stuttgart |
| Schaller, Manuel | Technische Universität Chemnitz |
| Berberich, Julian | University of Stuttgart |
| Worthmann, Karl | Technische Universität Ilmenau |
| Allgöwer, Frank | University of Stuttgart |
Keywords: Data driven control, Nonlinear systems identification, Robust control
Abstract: We derive novel deterministic bounds on the approximation error of data-based bilinear surrogate models for unknown nonlinear systems. The surrogate models are constructed using kernel-based extended dynamic mode decomposition to approximate the Koopman operator in a reproducing kernel Hilbert space. Unlike previous methods that require restrictive assumptions on the invariance of the dictionary, our approach leverages kernel-based dictionaries that allow us to control the projection error via pointwise error bounds, overcoming a significant limitation of existing theoretical guarantees. The derived state- and input-dependent error bounds allow for direct integration into Koopman-based robust controller designs with closed-loop guarantees for the unknown nonlinear system. Numerical examples illustrate the effectiveness of the proposed framework.
|
| |
| 14:15-14:30, Paper FrB02.2 | |
| Data-Driven Distributed Output Synchronization of Heterogeneous Discrete-Time Multi-Agent Systems (I) |
|
| Fattore, Giulio | University of Padova |
| Valcher, Maria Elena | Universita' Di Padova |
Keywords: Data driven control, Cooperative control, Output regulation
Abstract: Given an autonomous exosystem, generating a reference output, we consider the problem of designing a distributed data-driven control law for a family of discrete time heterogeneous LTI agents, connected through a directed graph, in order to synchronize the agents’ outputs to the reference one. Agents split into two categories: leaders, with direct access to the exosystem output, and followers, that only receive information from their neighbors. All agents aim to achieve output synchronization by means of a state feedback that makes use of their own states as well as of an estimate of the exogenous system state, provided by an internal state observer. Such an observer has a different structure for leaders and followers. Necessary and sufficient conditions for the existence of a solution are first derived in the model-based set-up and then in a data-driven context.
|
| |
| 14:30-14:45, Paper FrB02.3 | |
| Certified Learning of Incremental ISS Controllers for Unknown Nonlinear Polynomial Dynamics (I) |
|
| Zaker, Mahdieh | Newcastle University |
| Angeli, David | Imperial College |
| Lavaei, Abolfazl | Newcastle University |
Keywords: Stability of nonlinear systems, Data driven control, Formal Verification/Synthesis
Abstract: Incremental input-to-state stability (delta-ISS) offers a robust framework to ensure that small input variations result in proportionally minor deviations in the state of a nonlinear system. This property is essential in practical applications where input precision cannot be guaranteed. However, analyzing delta-ISS demands precise knowledge of system dynamics to assess the state's incremental response to input changes, posing a challenge in real-world scenarios where mathematical models are unknown. In this work, we develop a data-driven approach to design delta-ISS Lyapunov functions together with their corresponding delta-ISS controllers for continuous-time input-affine nonlinear systems with polynomial dynamics, ensuring the delta-ISS property is achieved without requiring knowledge of the system dynamics. In our data-driven scheme, we collect only two sets of input-state trajectories from sufficiently excited dynamics. By fulfilling a specific rank condition, we design delta-ISS controllers using the collected samples through formulating a sum-of-squares optimization program. The effectiveness of our data-driven approach is evidenced by its application to a physical case study.
|
| |
| 14:45-15:00, Paper FrB02.4 | |
| Verifying Robust Neural Lyapunov-Barrier Functions Via Counter Example Guided Abstraction Refinement (rNLBF-CEGAR) (I) |
|
| Sharma, Vivek | University of Illinois Urbana Champaign |
| Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Keywords: Formal Verification/Synthesis, Neural networks, Lyapunov methods
Abstract: In this paper, we introduce a novel framework for simultaneously learning and verifying certificates for uncertain nonlinear control affine systems, utilizing robust control Lyapunov barrier functions (rCLBF) to jointly certify stability and safety. To ensure dense verification of Lyapunov (barrier) conditions in continuous state space, we exploit the special structure of the derivative condition of rCLBF to derive a novel specification that could be densely verified with neural network (NN) verifiers, endowing such certificates with formal guarantees. We first over-approximate the verification domain for the derivative condition and then use counter-example guided abstraction refinement (CEGAR) to eliminate spurious counterexamples (false-negatives) generated by the NN verifiers to prevent exponential blow-up of the verification problem and improve the efficiency of the verification process. We demonstrate the efficacy of our framework in generating certificates with formal guarantees on challenging benchmarks.
|
| |
| 15:00-15:15, Paper FrB02.5 | |
| Data-Driven Abstraction and Synthesis for Stochastic Systems with Unknown Dynamics (I) |
|
| Nazeri, Mahdi | University of Oxford and Max Planck Institute for Software Syste |
| Badings, Thom | University of Oxford |
| Schmuck, Anne-Kathrin | MPI-SWS |
| Soudjani, Sadegh | Max Planck Institute for Software Systems |
| Abate, Alessandro | University of Oxford |
Keywords: Data driven control, Formal Verification/Synthesis, Stochastic systems
Abstract: We study the automated abstraction-based synthesis of correct-by-construction control policies for stochastic dynamical systems with unknown dynamics. Our approach is to learn an abstraction from sampled data, which is represented in the form of a finite Markov decision process (MDP). In this paper, we present a data-driven technique for constructing finite-state interval MDP (IMDP) abstractions of stochastic systems with unknown nonlinear dynamics. As a distinguishing and novel feature, our technique only requires (1) noisy state-input-state observations and (2) an upper bound on the system's Lipschitz constant. Combined with standard model-checking techniques, our IMDP abstractions enable the synthesis of policies that satisfy probabilistic temporal properties (such as "reach-while-avoid") with a predefined confidence. Our experimental results show the effectiveness and robustness of our approach.
|
| |
| 15:15-15:30, Paper FrB02.6 | |
| Data-Driven MPC with Stability Guarantees Using Extended Dynamic Mode Decomposition |
|
| Bold, Lea | Technische Universität Ilmenau |
| Gruene, Lars | University of Bayreuth |
| Schaller, Manuel | Technische Universität Chemnitz |
| Worthmann, Karl | Technische Universität Ilmenau |
Keywords: Data driven control, Predictive control for nonlinear systems, Stability of nonlinear systems
Abstract: For nonlinear (control) systems, extended dynamic mode decomposition (EDMD) is a popular method to obtain data-driven surrogate models. Its theoretical foundation is the Koopman framework, in which one propagates observable functions of the state to obtain a linear representation in an infinite-dimensional space. In this work, we prove practical asymptotic stability of a (controlled) equilibrium for EDMD-based model predictive control, in which the optimization step is conducted using the data-based surrogate model. To this end, we derive novel bounds on the estimation error that are proportional to the norm of state and control. This enables us to show that, if the underlying system is cost controllable, this stabilizablility property is preserved. We conduct numerical simulations illustrating the proven practical asymptotic stability.
|
| |
| 15:30-15:45, Paper FrB02.7 | |
| Data-Driven Feedback Linearization Using the Koopman Generator |
|
| Gadginmath, Darshan | University of California, Riverside |
| Krishnan, Vishaal | University of California, Riverside |
| Pasqualetti, Fabio | University of California, Irvine |
Keywords: Feedback linearization, Data driven control, Predictive control for nonlinear systems
Abstract: This article contributes a theoretical framework for data-driven feedback linearization of nonlinear control-affine systems. We unify the traditional geometric perspective on feedback linearization with an operator-theoretic perspective involving the Koopman operator. We first show that if the distribution of the control vector field and its repeated Lie brackets with the drift vector field is involutive, then there exists an output and a feedback control law for which the Koopman generator is finite-dimensional and locally nilpotent. We use this connection to propose a data-driven algorithm ‘Koopman generator-based feedback linearization (KGFL)’ for feedback linearization of single-input systems. Particularly, we use experimental data to identify the state transformation and control feedback from a dictionary of functions for which feedback linearization is achieved in a least-squares sense. We also propose a single-step data-driven formula which can be used to compute the linearizing transformations. When the system is feedback linearizable and the chosen dictionary is complete, our data-driven algorithm provides the same solution as model-based feedback linearization. Finally, we provide numerical examples for the data-driven algorithm and compare it with model-based feedback linearization. We also numerically study the effect of the richness of the dictionary and the size of the dataset on the effectiveness of feedback linearization.
|
| |
| 15:45-16:00, Paper FrB02.8 | |
| Localized Data-Driven Distributed Controller Design for Positive Stabilization |
|
| Iwata, Takumi | Hiroshima University |
| Azuma, Shun-ichi | Kyoto University |
| Nagahara, Masaaki | Hiroshima University |
Keywords: Compartmental and Positive systems, Data driven control, Networked control systems
Abstract: This paper proposes a localized data-driven distributed controller design method for positive stabilization of network systems. In particular, we consider a network system consisting of multiple unknown agents, each of which has a local controller to be designed. In addition, we assume that each agent has access to the local time-series data and communicates with neighboring agents. In this situation, we address the problem of finding a protocol in which each agent autonomously updates its local controller such that the entire network system is positive and stable. In this paper, we provide a solution to the problem based on the projected consensus algorithm.
|
| |
| FrB03 |
Oceania III |
| Collaboration, Safety, and Security in Multi-Agent Systems |
Invited Session |
| Chair: Lindemann, Lars | University of Southern California |
| Co-Chair: Butler, Brooks A. | University of California, Irvine |
| Organizer: Butler, Brooks A. | University of California, Irvine |
| Organizer: Pare, Philip E. | Purdue University |
| |
| 14:00-14:15, Paper FrB03.1 | |
| Automatic and Scalable Safety Verification Using Interval Reachability with Subspace Sampling |
|
| Gould, Brendan | Georgia Institute of Technology |
| Harapanahalli, Akash | Georgia Institute of Technology |
| Coogan, Samuel | Georgia Institute of Technology |
Keywords: Computational methods, Autonomous systems
Abstract: Interval refinement is a technique for reducing the conservatism of traditional interval based reachability methods by lifting the system to a higher dimension using new auxiliary variables and exploiting the introduced structure through a refinement procedure. We present a novel, efficiently scaling, automatic refinement strategy based on a subspace sampling argument and motivated by reducing the number of interval operations through sparsity. Unlike previous methods, we guarantee that refined bounds shrink as additional auxiliary variables are added. This additionally encourages automation of the lifting phase by allowing larger groups of auxiliary variables to be considered. We implement our strategy in JAX, a high-performance computational toolkit for Python and demonstrate its efficacy on several examples, including regulating a multi-agent platoon to the origin while avoiding an obstacle.
|
| |
| 14:15-14:30, Paper FrB03.2 | |
| Hamilton's Rule for Enabling Altruism in Multi-Agent Systems (I) |
|
| Butler, Brooks A. | University of California, Irvine |
| Egerstedt, Magnus | University of California, Irvine |
Keywords: Cooperative control, Agents-based systems, Networked control systems
Abstract: This paper explores the application of Hamilton’s rule to altruistic decision-making in multi-agent systems. Inspired by biological altruism, we introduce a framework that evaluates when individual agents should incur costs to benefit their neighbors. By adapting Hamilton’s rule, we define agent "fitness" in terms of task productivity rather than genetic survival. We formalize altruistic decision-making through a graph-based model of multi-agent interactions and propose a solution using collaborative control Lyapunov functions. The approach ensures that altruistic behaviors contribute to the collective goal-reaching efficiency of the system. We illustrate this framework on a multi-agent way-point navigation problem, where we show through simulation how agent importance levels influence altruistic decision-making, leading to improved coordination in navigation tasks.
|
| |
| 14:30-14:45, Paper FrB03.3 | |
| Nonconvex Obstacle Avoidance Using Efficient Sampling-Based Distance Functions (I) |
|
| Lutkus, Paul | University of Southern California |
| Chong, Michelle | Eindhoven University of Technology |
| Lindemann, Lars | University of Southern California |
Keywords: Autonomous robots, Constrained control, Output regulation
Abstract: We consider nonconvex obstacle avoidance where a robot described by nonlinear dynamics and a nonconvex shape has to avoid potentially nonconvex obstacles. Obstacle avoidance is a fundamental problem in robotics and well studied in control. However, existing solutions are computationally expensive (e.g., model predictive controllers), neglect nonlinear dynamics (e.g., graph-based planners), use diffeomorphisms to obtain convex problems (e.g., for star shapes), or are conservative as convex overapproximations are used. We instead provide an efficient yet non-conservative solution for general nonconvex obstacles.Our approach is based on efficient computation of the distance between the robot and obstacles via sampling-based distance functions. We quantify the sampling error and show that, for certain systems, sampling-based distance functions are valid nonsmooth control barrier functions. We also study how to deal with disturbances on the robot dynamics in our setting. Finally, we illustrate our method on a robot navigation task involving an omnidirectional robot and nonconvex obstacles. We also analyze performance and computational efficiency of our controller as a function of the number of samples.
|
| |
| 14:45-15:00, Paper FrB03.4 | |
| A Primal-Dual Gradient Descent Approach to the Connectivity Constrained Sensor Coverage Problem (I) |
|
| Bock Agerman, Mathias | KTH Royal Institute of Technology |
| Zhang, Ziqiao | Purdue University |
| Kim, Jong Gwang | Kennesaw State University |
| Sundaram, Shreyas | Purdue University |
| Brinton, Christopher | Purdue University |
Keywords: Optimization, Sensor networks, Optimization algorithms
Abstract: Sensor networks play a critical role in many situational awareness applications. In this paper, we study the problem of determining sensor placements to balance coverage and connectivity objectives over a target region. Leveraging algebraic graph theory, we formulate a novel optimization problem to maximize sensor coverage over a spatial probability density of event likelihoods while adhering to connectivity constraints. To handle the resulting non-convexity under constraints, we develop an augmented Lagrangian-based gradient descent algorithm inspired by recent approaches to efficiently identify points satisfying the Karush-Kuhn-Tucker (KKT) conditions. We establish convergence guarantees by showing necessary assumptions are satisfied in our setup, including employing Mangasarian-Fromowitz constraint qualification to prove the existence of a KKT point. Numerical simulations under different probability densities demonstrate that the optimized sensor networks effectively cover high-priority regions while satisfying desired connectivity constraints.
|
| |
| 15:00-15:15, Paper FrB03.5 | |
| Jointly Computation and Communication-Efficient Distributed Learning (I) |
|
| Ren, Xiaoxing | Cornell University |
| Bastianello, Nicola | KTH Royal Institute of Technology |
| Johansson, Karl H. | KTH Royal Institute of Technology |
| Parisini, Thomas | Imperial C., Aalborg U. & Univ. of Trieste |
Keywords: Learning, Optimization, Agents-based systems
Abstract: We address distributed learning problems over undirected networks. Specifically, we focus on designing a novel ADMM-based algorithm that is jointly computation- and communication-efficient. Our design guarantees computational efficiency by allowing agents to use stochastic gradients during local training. Moreover, communication efficiency is achieved as follows: i) the agents perform multiple training epochs between communication rounds, and ii) compressed transmissions are used. We prove exact linear convergence of the algorithm in the strongly convex setting. We corroborate our theoretical results by numerical comparisons with state of the art techniques on a classification task.
|
| |
| 15:15-15:30, Paper FrB03.6 | |
| Secure Formation Control of Multi-Agent System against FDI Attack Using Fixed-Time Convergent Reinforcement Learning |
|
| Gong, Zhenyu | Northwestern Polytechnical University |
| Yang, Feisheng | Northwestern Polytechnical University |
| Yuan, Yuan | Northwestern Polytechnical University |
| Ma, Qian | Nanjing University of Science and Technology |
| Zheng, Wei Xing | Western Sydney University |
Keywords: Cooperative control, Optimal control, Cyber-Physical Security
Abstract: In this article, a fixed-time convergent reinforcement learning (RL) algorithm is proposed to accomplish the secure formation control of a second-order multi-agent system (MAS) under the false data injection (FDI) attack. To alleviate the FDI attack on the control signal, a zero-sum graphical game is introduced to analyze the attack-defense process, in which the secure formation controller intends to minimize the common performance index function, whereas the purpose of the attacker is the opposite. Attaining the optimal secure formation control policy located at the Nash equilibrium depends on solving the game-associated coupled Hamilton-Jacobi-Isaacs equation. Taking into account fixed-time convergence, a critic-only online RL algorithm with the experience replay technique is designed. Meanwhile, the corresponding convergence and stability proofs are provided. A simulation example is presented to show the effectiveness of the devised scheme.
|
| |
| 15:30-15:45, Paper FrB03.7 | |
| Resilient Multi-Agent Systems against Denial of Service Attacks Via Adaptive and Activatable Network Layers |
|
| Babu Venkateswaran, Deepalakshmi | University of Central Florida |
| Qu, Zhihua | Univ. of Central Florida |
Keywords: Resilient Control Systems, Distributed control, Networked control systems
Abstract: This extended abstract presents a resilient control framework for multi-agent systems operating under Denial of Service (DoS) attacks. The proposed approach includes two core components: a real-time connectivity detection algorithm that identifies disruptions using only local neighbor information, and a dynamic multi-layer communication architecture that activates hidden backup layers when connectivity is lost. The framework is fully distributed, scalable, and requires no global coordination. The distributed framework is validated on a 1000-node Erdõs-Rényi network and an IEEE 123-bus power system, demonstrating improved resilience, consensus accuracy, and reduced control effort under persistent DoS conditions.
|
| |
| 15:45-16:00, Paper FrB03.8 | |
| Efficient State Estimation of a Networked FlipIt Model (I) |
|
| Collins, Brandon | University of Colorado Colorado Springs |
| Thomas Gherna, Thomas Gherna | University of Colorado Colorado Springs |
| Paarporn, Keith | University of Colorado, Colorado Springs |
| Xu, Shouhuai | University of Colorado Colorado Springs |
| Brown, Philip N. | University of Colorado Colorado Springs |
Keywords: Boolean control networks and logic networks, Networked control systems
Abstract: The Boolean Kalman Filter and associated Boolean Dynamical System Theory have been proposed to study the spread of infection on computer networks. Such models feature a network where attacks propagate through, an intrusion detection system that provides noisy signals of the true state of the network, and the capability of the defender to clean a subset of computers at any time. The Boolean Kalman Filter has been used to solve the optimal estimation problem, by estimating the hidden true state given the attack-defense dynamics and noisy observations. However, this algorithm is intractable because it runs in exponential time and space with respect to the network size. We address this feasibility problem by proposing a mean-field estimation approach, which is inspired by the epidemic modeling literature. Although our approach is heuristic, we prove that our estimator exactly matches the optimal estimator in certain non-trivial cases. We conclude by using simulations to show both the run-time improvement and estimation accuracy of our approach.
|
| |
| FrB04 |
Oceania IV |
| Multi-Agent Optimization & Learning |
Invited Session |
| Chair: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
| Co-Chair: Pavel, Lacra | University of Toronto |
| Organizer: Aggarwal, Shubham | University of Illinois, Urbana Champaign |
| Organizer: Bastopcu, Melih | Bilkent University |
| Organizer: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
| Organizer: Maity, Dipankar | University of North Carolina at Charlotte |
| |
| 14:00-14:15, Paper FrB04.1 | |
| Online Bandit Non-Cooperative Games with Arbitrary Delays (I) |
|
| Liu, Wenting | Tongji University |
| Lei, Jinlong | Tongji University |
| Yi, Peng | Tongji University |
| Pavel, Lacra | University of Toronto |
| Hu, Xiaoming | KTH Royal Institute of Technology |
Keywords: Game theory, Learning, Autonomous systems
Abstract: This paper considers online bandit games with arbitrary delays, where the cost functions of all self-interested players are time-varying. Specifically, the players lack an explicit model of the game and can only learn their actions based on the sole available feedback of delayed cost values. To address this challenging setting, a novel learning algorithm named Cumulative Bandit Online Learning with arbitrary delays (CBOL-ad) is proposed. We conduct regret analysis for time-varying games where the player-specific problem is convex, explicitly revealing the influence of time delays and game structure on the regret bound. In particular, under certain delay conditions, our bound can achieve the same order as that in online bandit optimization problems without considering delays. In addition, numerical simulations are provided to illustrate the algorithmic performance.
|
| |
| 14:15-14:30, Paper FrB04.2 | |
| Multi-Cluster Distributed Optimization in Open Multi-Agent Systems Over Directed Graphs with Acknowledgement Messages (I) |
|
| Makridis, Evagoras | University of Cyprus |
| Oliva, Gabriele | University Campus Bio-Medico of Rome |
| Charalambous, Themistoklis | University of Cyprus |
Keywords: Optimization algorithms, Agents-based systems, Large-scale systems
Abstract: In this paper, we tackle the problem of distributed optimization over directed networks in open multi-agent systems (OMAS), where agents may dynamically join or leave, causing persistent changes in network topology and problem dimension. These disruptions not only pose significant challenges to maintaining convergence and stability in distributed optimization algorithms, but could also break the network topology into multiple clusters, each one associated with its own set of objective functions. To address this, we propose a novel Open Distributed Optimization Algorithm with Gradient Tracking (OPEN-GT), which employs: (a) a dynamic mechanism for detecting active out-neighbors through acknowledgement messages, and (b) a fully distributed max-consensus procedure to spread information regarding agent departures, in possibly unbalanced directed networks. We show that when all active agents execute OPEN-GT, the optimization process in each formed cluster remains consistent, while the agents converge to their cluster-wide optimal solution if there exists a time after which the network remains unchanged. Finally, we validate our approach in a simulated environment with dynamically changing agent populations, demonstrating its resilience to network variations and its ability to support distributed optimization under OMAS dynamics.
|
| |
| 14:30-14:45, Paper FrB04.3 | |
| A Swarmalator-Based Multiplayer Pursuit-Evasion Game (I) |
|
| Adams, Takuma | The University of Melbourne |
| Cullen, Andrew Craig | University of Melbourne |
| Alpcan, Tansu | The University of Melbourne |
Keywords: Game theory, Agents-based systems, Biologically-inspired methods
Abstract: Multiplayer pursuit-evasion games are a promising framework for studying inter-agent coordination in domains like autonomous navigation and aerial defence. However, many models rely on explicit coordination, a simplistic assumption that does not adequately reflect the complex communication environment of real-world problems. We elucidate this gap by formulating a multiplayer nonlinear differential game using a physics-based spatio-temporal swarming model to explore how indirect communication drives spatial coordination. We introduce novel locally optimal state feedback strategies, derived via a Hamilton-Jacobi-Bellman approach, that promote synchronisation amongst peers to enhance coordination using mutual observation without requiring explicit communication. Finally, we demonstrate that these strategies harness spatio-temporal dynamics to improve team performance in a three-player scenario, establishing a strong foundation for future numerical evaluations.
|
| |
| 14:45-15:00, Paper FrB04.4 | |
| A Game-Theoretic Framework for Network Formation in Large Populations (I) |
|
| Dayanikli, Gokce | University of Illinois Urbana-Champaign |
| Lauriere, Mathieu | NYU Shanghai |
Keywords: Game theory, Mean field games, Control of networks
Abstract: In this paper, we study a model of network formation in large populations. Each agent can choose the strength of interaction (i.e. connection) with other agents to find a Nash equilibrium. Different from the recently-developed theory of graphon games, here each agent's control depends not only on her own index but also on the index of other agents. After defining the general model of the game, we focus on a special case with piecewise constant graphs and we provide optimality conditions through a system of forward-backward stochastic differential equations. Furthermore, we show the uniqueness and existence results. Finally, we provide numerical experiments to discuss the effects of different model settings.
|
| |
| 15:00-15:15, Paper FrB04.5 | |
| Markov Potential Game Construction and Multi-Agent Reinforcement Learning with Applications to Autonomous Driving |
|
| Yan, Huiwen | University of Missouri |
| Liu, Mushuang | Virginia Tech |
Keywords: Game theory, Reinforcement learning, Autonomous vehicles
Abstract: Markov games (MGs) provide a mathematical foundation for multi-agent reinforcement learning (MARL), enabling self-interested agents to learn their optimal policies while interacting with others in a shared environment. However, due to the complexities of an MG problem, seeking (Markov perfect) Nash equilibrium (NE) is often very challenging for a general-sum MG. Markov potential games (MPGs), which are a special class of MGs, have appealing properties such as guaranteed existence of pure NEs and guaranteed convergence of gradient play algorithms, thereby leading to desirable properties for many MARL algorithms in their NE-seeking processes. However, the question of how to construct MPGs has been open. This paper provides sufficient conditions on the reward design and on the Markov decision process (MDP), under which an MG is an MPG. Numerical results on autonomous driving applications are reported.
|
| |
| 15:15-15:30, Paper FrB04.6 | |
| Resilient Consensus in Open Multi-Agent Systems |
|
| Tang, Yuanyi | Beijing Institute of Technology |
| Lv, Yuezu | Beijing Institute of Technology |
| Zhou, Jialing | Beijing Institute of Technology |
| Ogorzałek, Maciej | Jagiellonian University |
Keywords: Distributed control, Cooperative control, Networked control systems
Abstract: The advancements in resilient consensus research have largely relied on the assumption of a timeinvariant agent set, limiting their applicability in dynamic environments. This paper extends the resilient consensus problem to a more dynamic setting by incorporating agent arrivals and departures, modeling an open multi-agent system. To ensure resilient consensus in such systems, we propose a novel algorithm that separately designs update schemes for cooperative remaining agents and newly joining agents. Furthermore, we establish and analyze the sufficient condition for resilient consensus under malicious attacks in open multi-agent systems. Numerical examples are provided to validate the effectiveness of our theoretical findings.
|
| |
| 15:30-15:45, Paper FrB04.7 | |
| Deviation between Team-Optimal Solution and Nash Equilibrium in Flow Assignment Problems (I) |
|
| Xu, Gehui | Cornell University |
| Bai, Ting | Cornell University |
| Malikopoulos, Andreas A. | Cornell University |
| Parisini, Thomas | Imperial C., Aalborg U. & Univ. of Trieste |
Keywords: Game theory
Abstract: We investigate the relationship between the team-optimal solution and the Nash equilibrium (NE) to assess the impact of strategy deviation on team performance. As a working use case, we focus on a class of flow assignment problems in which each source node acts as a cooperating decision maker (DM) within a team that minimizes the team cost based on the team-optimal strategy. In practice, some selfish DMs may prioritize their own marginal cost and deviate from NE strategies, thus potentially degrading the overall performance. To quantify this deviation, we explore the deviation bound between the team-optimal solution and the NE in two specific scenarios: (i) when the team-optimal solution is unique and (ii) when multiple solutions do exist. This helps DMs analyze the factors influencing the deviation and adopting the NE strategy within a tolerable range. Furthermore, in the special case of a potential game model, we establish the consistency between the team-optimal solution and the NE. Once the consistency condition is satisfied, the strategy deviation does not alter the total cost, and DMs do not face a strategic trade-off. Finally, we validate our theoretical analysis through some simulation studies.
|
| |
| 15:45-16:00, Paper FrB04.8 | |
| Gossip-Based Estimation of Centroid and Common Reference Frame in Open Multi-Agent Systems |
|
| Miele, Andrea | Roma Tre University |
| Franceschelli, Mauro | University of Cagliari |
| Gasparri, Andrea | Roma Tre University |
Keywords: Agents-based systems, Autonomous systems, Distributed control
Abstract: Decentralized estimation of centroid and common reference frame in multi-agent systems is a challenging problem, particularly when agents do not have access to global positioning data. This challenge intensifies in Open Multi-Agent Systems (OMAS), where the network composition dynamically changes due to agents joining or leaving, causing fluctuations in the number of participants. This paper presents a novel, decentralized gossip-based algorithm that enables agents in OMAS to collaboratively estimate both the centroid and a common reference frame in a 2-D environment where the number of agents may fluctuate over time. Notably, our approach remains robust despite noisy distance measurements and intermittent participation, as it relies on asynchronous, local pairwise interactions. Designed to accommodate the dynamic nature of network topologies, our algorithm can be employed for real-world applications where agents can join or leave the system due to failures, resource limitations or external environmental factors.
|
| |
| FrB05 |
Galapagos II |
| Submodular Optimization and Assignment in Multi-Agent and Robotic Systems |
Invited Session |
| Chair: Kia, Solmaz S. | University of California Irvine (UCI) |
| Co-Chair: Siami, Milad | Northeastern University |
| Organizer: Tzoumas, Vasileios | University of Michigan, Ann Arbor |
| Organizer: Kia, Solmaz S. | University of California Irvine (UCI) |
| Organizer: Smith, Stephen L. | University of Waterloo |
| |
| 14:00-14:15, Paper FrB05.1 | |
| Submodular Swarm Assignment for Multi-Agent Systems under Linear Temporal Logic Constraints (I) |
|
| Niu, Luyao | University of Washington |
| Cheng, Shiyu | Washington University in St. Louis |
| Ramasubramanian, Bhaskar | Western Washington University |
| Bushnell, Linda | University of Washington |
| Clark, Andrew | Washington University in St. Louis |
| Poovendran, Radha | University of Washington |
Keywords: Large-scale systems, Distributed control
Abstract: Multi-agent systems need to satisfy complex tasks where collaboration among agents is required. These tasks are often specified using formal methods such as Linear Temporal Logic (LTL). However, enforcing LTL constraints in multi-agent systems presents computational challenges, particularly in large-scale systems, since the size of the joint multi-agent state space scales exponentially with the number of agents. In this paper, we develop a scalable and efficient framework to assign agents to swarms to satisfy LTL specifications. The central idea of our approach is to map multi-agent planning to a combinatorial optimization problem of assigning agents to swarms during each time step, which reduces computational complexity. Further, we demonstrate that swarm assignments can be computed using submodular optimization, leading to a computationally efficient greedy algorithm with a provable (1-frac{1}{e}) optimality guarantee. We extend our approach to a fully distributed scenario, enabling agents to form swarms in a decentralized manner while preserving near-optimal performance. The proposed framework is validated through theoretical analysis and simulations. Compared to a mixed-integer programming-based baseline, our approach is over 10^6 times faster when there are more than 10 agents, while still guaranteeing the satisfaction of LTL formula.
|
| |
| 14:15-14:30, Paper FrB05.2 | |
| Subcarrier Selection for WiFi CSI Sensing Via Submodular Optimization (I) |
|
| Behzad, Kian | Northeastern University |
| Zandi, Rojin | Northeastern University |
| Mordad, Maral | Northeastern University |
| Siami, Milad | Northeastern University |
Keywords: Optimization, Machine learning, Robotics
Abstract: WiFi-based sensing is a promising tool for activity recognition, localization, and human-robot interaction, enabled by the rich Channel State Information (CSI) of modern standards. However, the high dimensionality of CSI, often hundreds of subcarriers, creates challenges in computation, communication, and storage. We propose a submodular optimization framework for subcarrier selection, formulating the task as a combinatorial problem and exploiting the diminishing-returns property of submodular functions. A simple greedy algorithm provides near-optimal selection with theoretical guarantees, while analysis of a convex relaxation shows optimal solutions are sparse and boundary-optimal. We validate our method on the multimodal RoboMNIST dataset for robot activity recognition, showing that it reduces CSI dimensionality while preserving high accuracy and outperforming heuristic baselines. These results demonstrate the effectiveness of submodular optimization for scalable and resource-efficient WiFi sensing in robotic and IoT systems.
|
| |
| 14:30-14:45, Paper FrB05.3 | |
| Offline and Online Distributed Submodular Maximization under a Partitioned Matroid Constraint (I) |
|
| Ye, Lintao | Huazhong University of Science and Technology |
| Du, Bin | Nanjing University of Aeronautics and Astronautics |
| Liu, Zhi-Wei | Huazhong University of Sci & Tech |
Keywords: Optimization, Optimization algorithms
Abstract: We study a distributed framework for solving submodular maximization under a partitioned matroid constraint. A group of agents are connected by a graph and each agent needs to choose a subset from its local ground set. The goal of the agents is to maximize a global objective function that is submodular with respect to the union of the sets chosen by all the agents. We propose a distributed algorithm to solve the problem that lets the agents communicate over the graph and return a global solution that is a 1/(1+c) approximation of the optimal solution in a finite number of communication rounds among the agents, where c is the curvature of the submodular function. We further consider an online setting of the problem where the global objective function can change over a time horizon T. We propose an online distributed algorithm for this setting with frac{1}{1+c}-regret that scales as sqrt{T}.
|
| |
| 14:45-15:00, Paper FrB05.4 | |
| ResQue Greedy: Rewiring Sequential Greedy for Improved Submodular Maximization (I) |
|
| Vendrell Gallart, Joan | University of California Irvine |
| Kuhnle, Alan | Texas A&M University |
| Kia, Solmaz S. | University of California Irvine (UCI) |
Keywords: Optimization, Optimization algorithms, Numerical algorithms
Abstract: This paper introduces Rewired Sequential Greedy (ResQue Greedy), an enhanced approach for submodular maximization under cardinality constraints. By integrating a novel set curvature metric within a lattice-based framework, ResQue Greedy identifies and corrects suboptimal decisions made by the standard sequential greedy algorithm. Specifically, a curvature-aware rewiring strategy is employed to dynamically redirect the solution path, leading to improved approximation performance over the conventional sequential greedy algorithm without significantly increasing computational complexity. Numerical experiments demonstrate that ResQue Greedy achieves tighter near-optimality bounds compared to the traditional sequential greedy method.
|
| |
| 15:00-15:15, Paper FrB05.5 | |
| Resource Allocation with Multi-Team Collaboration Based on Hamilton's Rule |
|
| Karam, Riwa | University of California, Irvine |
| Lin, Ruoyu | University of California, Irvine |
| Butler, Brooks A. | University of California, Irvine |
| Egerstedt, Magnus | University of California, Irvine |
Keywords: Autonomous robots, Cooperative control, Agents-based systems
Abstract: This paper presents a multi-team collaboration strategy based on Hamilton's rule from ecology that facilitates resource allocation among multiple teams, where agents are considered as shared resource among all teams that must be allocated appropriately. We construct an algorithmic framework that allows teams to make bids for agents that consider the costs and benefits of transferring agents while also considering relative mission importance for each team. This framework is applied to a multi-team coverage control mission to demonstrate its effectiveness. It is shown that the necessary criteria of a mission evaluation function are met by framing it as a function of the locational coverage cost of each team with respect to agent gain and loss, and these results are illustrated through simulations.
|
| |
| 15:15-15:30, Paper FrB05.6 | |
| Distributed Multi-Task Assignment for Multi-Agent System and Distributed Transportation Control |
|
| Takamizawa, Soya | Keio University |
| Tsuge, Shunsuke | Kawasaki Heavy Industries, Ltd |
| Namerikawa, Toru | Keio University |
Keywords: Autonomous systems, Autonomous robots, Cooperative control
Abstract: We conduct research on a distributed task assignment algorithm, which is one of the distributed decision-making methods in multi-agent system. We consider a mission in which multi-agents transport multiple packages. In particular, we deal with a problem in which one agent can carry multiple packages depending on its loading capacity, and there are heavy packages which require multiple agents to transport. To achieve the goal, we combine two methods. The first is Grouping, a method of calculating the groups of packages that one agent can carry at a time. The second is two types of lists which record task start time of agents. In consequence, we decrease the task start time and agents' moving distance. We also consider transportation control, by using consensus based control to second-order system.
|
| |
| 15:30-15:45, Paper FrB05.7 | |
| Energy-Aware Task Allocation for Teams of Multi-Mode Robots |
|
| Ito, Takumi | Institute of Science Tokyo |
| Funada, Riku | Tokyo Institute of Technology |
| Sampei, Mitsuji | Institute of Science Tokyo |
| Notomista, Gennaro | University of Waterloo |
Keywords: Robotics, Autonomous systems, Optimal control
Abstract: This work proposes a novel multi-robot task allocation framework for robots that can switch between multiple modes, e.g., flying, driving, or walking. We first provide a method to encode the multi-mode property of robots as a graph, where the mode of each robot is represented by a node. Next, we formulate a constrained optimization problem to decide both the task to be allocated to each robot as well as the mode in which the latter should execute the task. The robot modes are optimized based on the state of the robot and the environment, as well as the energy required to execute the allocated task. Moreover, the proposed framework is able to encompass kinematic and dynamic models of robots alike. Furthermore, we provide sufficient conditions for the convergence of task execution and allocation for both robot models.
|
| |
| FrB06 |
Oceania I |
| Safety Filters for Autonomous Systems II |
Invited Session |
| Chair: Sun, Zhiyong | Peking University (PKU) |
| Co-Chair: Liu, Siyuan | KTH Royal Institute of Technology |
| Organizer: Herbert, Sylvia | UC San Diego (UCSD) |
| Organizer: Lederer, Armin | National University of Singapore |
| Organizer: Li, Ming | KTH Royal Institute of Technology |
| Organizer: Liu, Siyuan | Eindhoven University of Technology |
| Organizer: Sun, Zhiyong | Peking University (PKU) |
| |
| 14:00-14:15, Paper FrB06.1 | |
| Pyspect: An Extensible Toolbox for Automatic Construction of Temporal Logic Trees Via Reachability Analysis (I) |
|
| Munhoz Arfvidsson, Kaj | KTH Royal Institute of Technology |
| Hadjiloizou, Loizos | KTH Royal Institute of Technology |
| Jiang, Frank J. | Royal Institute of Technology |
| Johansson, Karl H. | KTH Royal Institute of Technology |
| Mårtensson, Jonas | KTH Royal Institute of Technology |
Keywords: Formal Verification/Synthesis, Computational methods, Autonomous systems
Abstract: In this paper, we present pyspect, a Python toolbox that simplifies the use of reachability analysis for temporal logic problems. Currently, satisfying complex requirements in cyber-physical systems requires significant manual effort and domain expertise to develop the underlying reachability programs. This high development effort limits the broader adoption of reachability analysis for complex verification problems. To address this, pyspect provides a method-agnostic approach to performing reachability analysis for verifying a temporal logic specification via temporal logic trees (TLTs). It enables the specification of complex safety and liveness requirements using high-level logic formulations that are independent of any particular reachability technique or set representation. As a result, pyspect allows for the comparison of different reachability implementations, such as Hamilton-Jacobi and Hybrid Zonotope-based reachability analysis, for the same temporal logic specification. This design separates the concerns of implementation developers (who develop numerical procedures for reachability) and end-users (who write specifications). Through a simple vehicle example, we demonstrate how pyspect simplifies the synthesis of reachability programs, promotes specification reusability, and facilitates side-by-side comparisons of reachability techniques for complex tasks.
|
| |
| 14:15-14:30, Paper FrB06.2 | |
| Safe Control of Second-Order Systems with Linear Positional Constraints |
|
| Alyaseen, Mohammed | UCSD |
| Atanasov, Nikolay | University of California, San Diego |
| Cortes, Jorge | UC San Diego |
Keywords: Nonlinear output feedback
Abstract: Control barrier functions (CBFs) offer a powerful tool for enforcing safety specifications in control synthesis. This paper deals with the problem of constructing valid CBFs. Given a second-order system and any desired safety set with linear boundaries in the position space, we construct a provably control-invariant subset of this desired safety set. The constructed subset does not sacrifice any positions allowed by the desired safety set, which can be nonconvex. We show how our construction can also meet safety specification on the velocity. We then demonstrate that if the system satisfies standard Euler- Lagrange systems properties then our construction can also handle constraints on the allowable control inputs. We finally show the efficacy of the proposed method in a numerical example of keeping a 2D robot arm safe from collision.
|
| |
| 14:30-14:45, Paper FrB06.3 | |
| Barrier Certificates for Unknown Systems with Latent States and Polynomial Dynamics Using Bayesian Inference (I) |
|
| Lefringhausen, Robert | Technical University of Munich |
| Noel Aziz Hanna, Sami Leon | Technical University Munich, Chair of Information-Oriented Contr |
| August, Elias | Reykjavik University |
| Hirche, Sandra | Technische Universität München |
Keywords: Uncertain systems, Machine learning, Estimation
Abstract: Certifying safety in dynamical systems is crucial, but barrier certificates — widely used to verify that system trajectories remain within a safe region — typically require explicit system models. When dynamics are unknown, data-driven methods can be used instead, yet obtaining a valid certificate requires rigorous uncertainty quantification. For this purpose, existing methods usually rely on full-state measurements, limiting their applicability. This paper proposes a novel approach for synthesizing barrier certificates for unknown systems with latent states and polynomial dynamics. A Bayesian framework is employed, where a prior in state-space representation is updated using output data via a targeted marginal Metropolis–Hastings sampler. The resulting samples are used to construct a barrier certificate through a sum-of-squares program. Probabilistic guarantees for its validity with respect to the true, unknown system are obtained by testing on an additional set of posterior samples. The approach and its probabilistic guarantees are illustrated through a numerical simulation.
|
| |
| 14:45-15:00, Paper FrB06.4 | |
| Data-Driven Input-Output Control Barrier Functions |
|
| Bajelani, Mohammad | The University of British Columbia |
| van Heusden, Klaske | University of British Columbia |
Keywords: Constrained control, Data driven control, Identification
Abstract: Control Barrier Functions (CBFs) offer a framework for ensuring set invariance and designing constrained control laws. However, crafting a valid CBF relies on system-specific assumptions and the availability of an accurate system model, underscoring the need for systematic data-driven synthesis methods. This paper introduces a data-driven approach to synthesizing a CBF for discrete-time LTI systems using only input-output measurements. The method begins by computing the maximal control invariant set using an input-output data-driven representation, eliminating the need for precise knowledge of the system’s order and explicit state estimation. The proposed CBF is then systematically derived from this set, which can accommodate multiple input-output constraints. Furthermore, the proposed CBF is leveraged to develop a minimally invasive safety filter that ensures recursive feasibility with an adaptive decay rate. To improve clarity, we assume a noise-free dataset, though the method can be extended using data-driven reachability to capture noise effects and handle uncertainty. Finally, the effectiveness of the proposed method is demonstrated on an unknown time-delay system.
|
| |
| 15:00-15:15, Paper FrB06.5 | |
| Interpolation-Inspired Closure Certificates (I) |
|
| Oumer, Mohammed Adib | University of Colorado Boulder |
| Murali, Vishnu | University of Colorado Boulder |
| Zamani, Majid | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis, Hybrid systems, Optimization
Abstract: Barrier certificates, a form of state invariants, provide an automated approach for safety verification of dynamical systems. Similar to barrier certificates, recent works explore the notion of closure certificates, a form of transition invariants, to verify dynamical systems against ω-regular properties including safety. A closure certificate, defined over state pairs of a dynamical system, is a real-valued function whose zero superlevel set characterizes an inductive transition invariant of the system. The search for such a certificate can be effectively automated by assuming it to be within a specific template class, e.g. a polynomial of a fixed degree, and then using optimization techniques such as sum-of-squares programming (SOS) or satisfiability-modulo-theory solvers (SMT) to find it. Unfortunately, one may not be able to find such a certificate for a fixed template. In such a case, one must change the template, e.g. increase the degree of the polynomial. In this paper we consider a notion of multiple closure certificates, dubbed interpolation-inspired closure certificates. An interpolation-inspired closure certificate consists of a set of functions which jointly over-approximate a transition invariant by first considering one-step transitions, then two, and repeat until a transition invariant is obtained. The advantage of interpolation-inspired closure certificates is that they allow us to prove properties, even when a single function for a fixed template cannot be found using standard approaches. We present an SOS programming approach to search for these set of functions and demonstrate the effectiveness of our proposed method for verifying safety and persistence (or refuting recurrence) over case studies.
|
| |
| 15:15-15:30, Paper FrB06.6 | |
| Designing Control Barrier Functions for Underactuated Euler--Lagrange Systems Using Dynamic Safety Margins |
|
| Freire, Victor | University of Colorado Boulder |
| Debarshi, Sauranil | University of Colorado, Boulder |
| Nicotra, Marco M | University of Colorado Boulder |
Keywords: Constrained control
Abstract: This letter shows how to design control barrier functions for underactuated and fully-actuated Euler-Lagrange systems subject to state and input constraints. The proposed method uses passivity-based considerations to limit the total energy available to the system and ensure constraint satisfaction. The approach can handle multiple state and input constraints regardless of relative degree.
|
| |
| 15:30-15:45, Paper FrB06.7 | |
| Risk-Aware Safety Verification and Robustness Analysis of Neural Network |
|
| Kishida, Masako | National Institute of Informatics |
Keywords: Neural networks, Uncertain systems, Stochastic systems
Abstract: Ensuring the safety of neural networks under input uncertainty is a fundamental challenge in safety-critical applications. This paper presents a risk-aware variant of Fazlyab’s neural network safety verification, which certifies robustness to input uncertainties using quadratic constraints and semidefinite programming. The proposed approach integrates the Worst-Case Conditional Value-at-Risk to explicitly account for tail risk. This not only expands the freedom of the input uncertainty geometry but also the applicability of Fazlyab’s approach to scenarios where tail-risk is crucial, such as automotive systems and medical applications. Applications of the proposed approach to closed-loop reachability analysis of control systems, dynamical system analysis, and classification are demonstrated through numerical experiments.
|
| |
| 15:45-16:00, Paper FrB06.8 | |
| Safe Navigation in Unmapped Environments for Robotic Systems with Input Constraints |
|
| Safari, Amirsaeid | University of Kentucky |
| Hoagg, Jesse B. | University of Kentucky |
Keywords: Optimal control, Constrained control, Nonlinear systems
Abstract: This paper presents an approach for navigation and control in unmapped environments under input and state constraints using a composite control barrier function (CBF). We consider the scenario where real-time perception feedback (e.g., LiDAR) is used online to construct a local CBF that models local state constraints (e.g., local safety constraints such as obstacles) in the a priori unmapped environment. The approach employs a soft-maximum function to synthesize a single time-varying CBF from recently obtained local CBFs. Next, input constraints are transformed into controller-state constraints through the use of control dynamics. Then, we use a soft-minimum function to compose the input constraints with the time-varying CBF that models the a priori unmapped environment. This composition yields a single relaxed CBF, which is used in a constrained optimization to obtain an optimal control that satisfies the state and input constraints. The approach is validated through simulations of a nonholonomic ground robot that is equipped with LiDAR and navigates an unmapped environment.
|
| |
| FrB07 |
Capri I |
| Recent Achievement and Perspective Directions in Sliding Mode Control Ii |
Invited Session |
| Chair: Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Co-Chair: Reger, Johann | TU Ilmenau |
| Organizer: Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Organizer: Hsu, Liu | COPPE/UFRJ |
| |
| 14:00-14:15, Paper FrB07.1 | |
| Homogeneous in the Bi-Limit Output Feedback Control for a Class of SISO LTV Systems |
|
| Meléndez-Pérez, René | Universidad Nacional Autónoma De México |
| Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
Keywords: Variable-structure/sliding-mode control
Abstract: In this letter, we construct an output feedback control scheme tailored to a class of linear time-varying (LTV) systems with a single output and single matched non-vanishing perturbation. The control is shown to be capable of stabilizing the origin of the system in finite-time, or even more, in fixed-time. The result is obtained by constructing a nonlinear homogeneous in the bi-limit controller and observer. For a particular choice of parameters and gains, a high-order sliding mode (HOSM) controller is obtained, capable of global convergence when the system perturbation is bounded. The closed-loop stability of the system is demonstrated by means of a bl-homogeneous Lyapunov function, which allows us to conclude a principle of separation between the observer and controller design. The effectiveness of the proposal is illustrated by an academic example.
|
| |
| 14:15-14:30, Paper FrB07.2 | |
| Generalized Super-Twisting Observer for a Class of Interconnected Nonlinear Systems with Uncertainties |
|
| Tafat, Rania | Technische Universität Chemnitz |
| Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
| Streif, Stefan | Technische Universität Chemnitz |
Keywords: Observers for nonlinear systems, Lyapunov methods, Variable-structure/sliding-mode control
Abstract: The Generalized Super-Twisting Observer (GSTO) is extended for a strongly observable class of nonlinearly interconnected systems with bounded uncertainties/perturbations. A nonsmooth strong Lyapunov function is used to prove the finite-time convergence of the proposed observer to the true system's trajectories, in the presence of the uncertainties. A case study on the interaction between two food production systems is presented, comparing the proposed observer with the High Gain observer. The results emphasize the critical role of the GSTO’s discontinuous term in achieving exact estimation.
|
| |
| 14:30-14:45, Paper FrB07.3 | |
| Local Stability Analysis for Sliding Mode Control with Unbounded Perturbations - Dynamic Sliding Mode Design Revisited (I) |
|
| Tietze, Niclas | Technische Universität Ilmenau |
| Wulff, Kai | TU Ilmenau |
| Reger, Johann | TU Ilmenau |
Keywords: Variable-structure/sliding-mode control, Stability of nonlinear systems, Lyapunov methods
Abstract: We analyse the local stability properties of sliding mode control designs with a dynamic sliding variable. We apply both, first-order and super-twisting control to a system which is subject to unbounded state- and time-depended perturbations. Our stability analysis yields an estimate of the region of attraction given by invariant sets. Moreover we discuss design aspects such as the influence of the initialisation of the dynamic sliding variable and integrator state of the STA on the estimated region of attraction as well as the avoidance of the algebraic loop by suitable parametrisation of the STA. We illustrate our results by an example.
|
| |
| 14:45-15:00, Paper FrB07.4 | |
| Finite-Time Output Feedback of a Class of Second-Order Systems Via Sliding Mode Observer and Non-Separation Principle Design (I) |
|
| Chen, Weile | Southeast University |
| Li, Shihua | Southeast University |
Keywords: Variable-structure/sliding-mode control, Uncertain systems, Nonlinear output feedback
Abstract: In this paper, the finite-time output feedback stabilization problem for a class of second-order disturbed nonlinear uncertain systems is addressed. First, a high-order sliding mode based finite-time extended state observer (SMESO) is designed to estimate the system's unknown state and external disturbance within finite time. Subsequently, a finite-time composite controller is developed based on the SMESO, where the estimated state is utilized for finite-time feedback control, and the estimated disturbance is applied for feedforward compensation. However, due to system uncertainties, the stability of the SMESO cannot be guaranteed independently of the controller. To resolve this, a non-separation principle design integrating the SMESO and the composite controller is proposed. A Lyapunov function is constructed to rigorously prove the finite-time stability of the entire closed-loop system. Finally, numerical simulations validate the algorithm's finite-time convergence and enhanced anti-disturbance capability through comparisons with existing linear and nonsmooth algorithms.
|
| |
| 15:00-15:15, Paper FrB07.5 | |
| Robust Exact Sliding Mode Controller: A Generalization of the Super-Twisting Controller Based on a Disturbance Model (I) |
|
| Andritsch, Benedikt | Graz University of Technology |
| Koch, Stefan | Graz University of Technology |
| Reichhartinger, Markus | Graz University of Technology |
| Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
| Horn, Martin | Graz University of Technology |
Keywords: Variable-structure/sliding-mode control
Abstract: In this paper a controller design method is pre- sented that generalizes the super-twisting controller to cope with a broader class of disturbances. The disturbance is assumed to be described by a linear time-invariant system with bounded unknown input. Inspired by ideas of internal model control the disturbance model is incorporated into the controller. The controller contains the characteristic injection terms of the robust exact differentiator of arbitrary order. Moreover, the resulting closed-loop error dynamics are equivalent to the linearly transformed error dynamics of the robust exact differentiator. This equivalence provides for a formal stability proof of the closed loop. The effectiveness of the method is demonstrated in simulations of two mechanical examples.
|
| |
| 15:15-15:30, Paper FrB07.6 | |
| Optimal Adaptive Second-Order Differentiation (I) |
|
| Seeber, Richard | Graz University of Technology |
| Haimovich, Hernan | CONICET and Universidad Nacional De Rosario |
Keywords: Variable-structure/sliding-mode control
Abstract: A causal second-order differentiator achieving exactness from the beginning, robustness almost from the beginning, and the theoretically lowest worst-case differentiation error in presence of measurement noise at every time instant is constructed for the first time. The constructed differentiator is based on the online adaptation of the interval length of a finite-difference differentiator according to an estimate of the noise amplitude. The only tuning parameter of the differentiator is a bound on the third-order derivative of the signals whose first and second derivatives are to be estimated. All the differentiator's properties are formally proven and a simulation example is provided for illustration.
|
| |
| 15:30-15:45, Paper FrB07.7 | |
| Lyapunov Redesign Based SMC Design for a Class of Perturbed Periodic LTV Systems (I) |
|
| Sumenkov, Oleg | Sirius University of Science and Technology |
| Tarabukin, Ivan | Sirius University of Science and Technology |
| Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Gusev, Sergei V. | Sirius University |
Keywords: Variable-structure/sliding-mode control, Uncertain systems, Lyapunov methods
Abstract: The paper proposes a Sliding Mode controller (SMC) design method based on Lyapunov Redesign for a class of Periodic Linear Time-Varying (PLTV) systems subject to matched uncertainties and non-vanishing disturbances. First, the existence of Lyapunov function for a PLTV with state matrix uncertainty only is shown using positive definite solutions of the periodic Riccati differential equation (PRDE). Then, Lyapunov Redesign methodology is used to generate the set of periodic time-variant subspaces (sliding surfaces), where the closed-loop system exhibits exponential stability. Finally, the resulting SMC law guarantees the finite-time stability of the sliding surface and robustness with respect to matched non-vanishing disturbances and input matrix uncertainty. The effectiveness of the proposed method is demonstrated through the orbital stabilization of a cart-pendulum system oscillating around its unstable equilibrium.
|
| |
| 15:45-16:00, Paper FrB07.8 | |
| On Delay Robustness of ILF-Based Hyperexponential Control |
|
| Zimenko, Konstantin | ITMO University |
| Efimov, Denis | Inria |
| Polyakov, Andrey | Inria, Univ. Lille |
| Kremlev, Artem | ITMO University |
Keywords: Delay systems, Nonlinear systems, Stability of nonlinear systems
Abstract: The stability of hyperexponential control based on the implicit Lyapunov function method under input delay is analyzed. Using the theory of homogeneous systems, a nonlinear feedback is proposed that ensures global asymptotic stability with respect to a compact set containing the origin for any bounded input delay. The tuning of control parameters and estimation of the convergence set in the presence of input delay are based on solving linear matrix inequalities. For this control a parameter adjustment method is proposed to ensure stability with respect to an arbitrarily small set containing the origin. In the absence of input delay, the proposed feedback guarantees hyperexponential stability at the origin.
|
| |
| FrB08 |
Oceania V |
| Machine Learning I |
Regular Session |
| Chair: Panagou, Dimitra | University of Michigan, Ann Arbor |
| Co-Chair: D'Innocenzo, Alessandro | University of L'Aquila |
| |
| 14:00-14:15, Paper FrB08.1 | |
| Free Parametrization of L2-Bounded State Space Models |
|
| Massai, Leonardo | EPFL |
| Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Learning, Stability of nonlinear systems, Neural networks
Abstract: State space models (SSMs) have emerged as a powerful architecture in machine learning and control, fea- turing stacked layers where each consists of a linear time- invariant (LTI) discrete-time system followed by a nonlinearity. While SSMs offer computational efficiency and excel in long- sequence predictions, their widespread adoption in applications like system identification and optimal control is hindered by the challenge of ensuring their stability and robustness properties. We introduce L2RU, a novel parametrization of SSMs that guarantees input-output stability and robustness by enforcing a prescribed L2-bound for all parameter values. This design elim- inates the need for complex constraints, allowing unconstrained optimization over L2RUs by using standard methods such as gradient descent. Leveraging tools from system theory and con- vex optimization, we derive a non-conservative parametrization of square discrete-time LTI systems with a specified L2-bound, forming the foundation of the L2RU architecture. Additionally, we enhance its performance with a bespoke initialization strategy optimized for long input sequences. Through a system identification task, we validate L2RU’s superior performance, showcasing its potential in learning and control applications.
|
| |
| 14:15-14:30, Paper FrB08.2 | |
| Exact Learning of Linear Model Predictive Control Laws Using Oblique Decision Trees with Linear Predictions |
|
| Ren, Jiayang | The University of British Columbia |
| Mao, Qiangqiang | University of British Columbia |
| Zhao, Tianwei | University of British Columbia |
| Cao, Yankai | The University of British Columbia |
Keywords: Predictive control for linear systems, Machine learning, Learning
Abstract: Model Predictive Control (MPC) is a powerful strategy for constrained multivariable systems but faces computational challenges in real-time deployment due to its online optimization requirements. While explicit MPC and neural network approximations mitigate this burden, they suffer from scalability issues or lack interpretability, limiting their applicability in safety-critical systems. This work introduces a data-driven framework that directly learns the Linear MPC control law from sampled state-action pairs using Oblique Decision Trees with Linear Predictions (ODT-LP), achieving both computational efficiency and interpretability. By leveraging the piecewise affine structure of Linear MPC, we prove that the Linear MPC control law can be replicated by finite-depth ODT-LP models. We develop a gradient-based training algorithm using smooth approximations of tree routing functions to learn this structure from grid-sampled Linear MPC solutions, enabling end-to-end optimization. Input-to-state stability is established under bounded approximation errors, with explicit error decomposition into learning inaccuracies and sampling errors to inform model design. Numerical experiments demonstrate that ODT-LP controllers match MPC's closed-loop performance while reducing online evaluation time by orders of magnitude compared to MPC, explicit MPC, neural network, and random forest counterparts. The transparent tree structure enables formal verification of control logic, bridging the gap between computational efficiency and certifiable reliability for safety-critical systems.
|
| |
| 14:30-14:45, Paper FrB08.3 | |
| Learning Verifiable Control Policies Using Relaxed Verification |
|
| Chaudhury, Puja | Northeastern University |
| Estornell, Alexander | Northeastern University |
| Everett, Michael | Northeastern University |
Keywords: Formal Verification/Synthesis, Neural networks, Machine learning
Abstract: To provide safety guarantees for learning-based control systems, recent work has developed formal verification methods to apply after training ends. However, if the trained policy does not meet the specifications, or there is conservatism in the verification algorithm, establishing these guarantees may not be possible. Instead, this work proposes to perform verification throughout training to ultimately aim for policies whose properties can be evaluated throughout runtime with lightweight, relaxed verification algorithms. The approach is to use differentiable reachability analysis and incorporate new components into the loss function. Numerical experiments on a quadrotor model and unicycle model highlight the ability of this approach to lead to learned control policies that satisfy desired reach-avoid and invariance specifications.
|
| |
| 14:45-15:00, Paper FrB08.4 | |
| Provably-Safe Neural Network Training Using Hybrid Zonotope Reachability Analysis |
|
| Chung, Long Kiu | Georgia Institute of Technology |
| Kousik, Shreyas | Georgia Institute of Technology |
Keywords: Formal Verification/Synthesis, Robust control, Machine learning
Abstract: Even though neural networks are being increasingly deployed in safety-critical control applications, it remains difficult to enforce constraints on their output, meaning that it is hard to guarantee safety in such settings. While many existing methods seek to verify a neural network's satisfaction of safety constraints, few address how to correct an unsafe network. The handful of works that extract a training signal from verification cannot handle non-convex sets, and are either conservative or slow. To begin addressing these challenges, this work proposes a neural network training method that can encourage the exact image of a non-convex input set for a neural network with rectified linear unit (ReLU) nonlinearities to avoid a non-convex unsafe region. This is accomplished by reachability analysis with scaled hybrid zonotopes, a modification of the existing hybrid zonotope set representation that enables parameterized scaling of non-convex polytopic sets with a differentiable collision check via mixed-integer linear programs (MILPs). The proposed method was shown to be effective and fast for networks with up to 240 neurons, with the computational complexity dominated by inverse operations on matrices that scale linearly in size with the number of neurons and complexity of input and unsafe sets. We demonstrate the practicality of our method by training a forward-invariant neural network controller for an affine dynamical system with a non-convex input set, as well as generating safe reach-avoid plans for a black-box dynamical system.
|
| |
| 15:00-15:15, Paper FrB08.5 | |
| Sample Efficient Certification of Discrete-Time Control Barrier Functions |
|
| Mulagaleti, Sampath Kumar | IMT School for Advanced Studies Lucca |
| Del Prete, Andrea | University of Trento |
Keywords: Formal Verification/Synthesis, Autonomous systems, Statistical learning
Abstract: Control Invariant (CI) sets are essential for safety certification of dynamical systems. Control Barrier Functions (CBFs) provide a powerful framework for con- structing such sets, but their computation often requires solving intractable robust optimization problems, typically addressed through sampling. We propose a sample-efficient verification procedure based on Lipschitz continuity to certify candidate CBFs, i.e., verify if they satisfy the robust constraints. The approach is demonstrated and validated through a numerical example.
|
| |
| 15:15-15:30, Paper FrB08.6 | |
| Data Driven Finite Abstractions by Simulation Relations with Probabilistic Guarantees Using Regression Trees |
|
| D'Innocenzo, Alessandro | University of L'Aquila |
| Rehman, Khalil Ul | University of L'Aquila |
| Zacchia Lun, Yuriy | Università Degli Studi Dell’Aquila |
Keywords: Model/Controller reduction, Machine learning, Hybrid systems
Abstract: In this paper we propose a novel methodology to construct, given trajectories measured from a dynamical system, a finite abstraction by means of a transition system. We prove that our abstraction is a textit{simulation} of the original dynamical system, providing quantified probabilistic guarantees derived using the scenario approach. We test our methodology on a benchmark on hybrid systems showing that it strongly reduces the cardinality of the abstraction states with respect to a uniform grid, and is thus very promising for handling abstractions of large dimensional systems.
|
| |
| 15:30-15:45, Paper FrB08.7 | |
| How to Adapt Control Barrier Functions? a Learning-Based Approach with Applications to a VTOL Quadplane |
|
| Kim, Taekyung | University of Michigan |
| Beard, Randal W. | Brigham Young Univ |
| Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Neural networks, Flight control
Abstract: In this paper, we present a novel theoretical framework for online adaptation of Control Barrier Function (CBF) parameters, i.e., of the class K functions included in the CBF condition, under input constraints. We introduce the concept of locally validated CBF parameters, which are adapted online to guarantee finite-horizon safety, based on conditions derived from Nagumo’s theorem and tangent cone analysis. To identify these parameters online, we integrate a learning-based approach with an uncertainty-aware verification process that account for both epistemic and aleatoric uncertainties inherent in neural network predictions. Our method is demonstrated on a VTOL quadplane model during challenging transition and landing maneuvers, showcasing enhanced performance while maintaining safety.
|
| |
| 15:45-16:00, Paper FrB08.8 | |
| Line-Of-Sight Guidance: Learning to Look Ahead in Three Dimensions |
|
| Foseid, Eirik Lothe | Norwegian University of Science and Technology |
| Basso, Erlend Andreas | Norwegian University of Science and Technology |
| Schmidt-Didlaukies, Henrik M. | Norwegian University of Science and Technology |
| Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
| Gravdahl, Jan Tommy | Norwegian Univ. of Science & Tech |
Keywords: Machine learning, Maritime control, Stability of nonlinear systems
Abstract: This letter investigates line-of-sight (LOS) guidance algorithms for three-dimensional path-following. We prove that a spatial LOS guidance algorithm ensures input-to-state stability (ISS) of the closed-loop system with respect to the lateral velocity. Building on this theoretical foundation, we propose an enhanced LOS algorithm where the lookahead distance is parameterized using a neural network. This approach optimizes performance based on vehicle states and local path characteristics, which serve as inputs to the neural network, while preserving the stability guarantees. The effectiveness of our proposed method is validated through a simulation study using a high-fidelity six degree-of-freedom model of an autonomous underwater vehicle (AUV), demonstrating improved path-following performance while maintaining the stability guarantees of the original approach.
|
| |
| FrB09 |
Oceania VIII |
| Estimation I |
Regular Session |
| Chair: Charalambous, Themistoklis | University of Cyprus |
| Co-Chair: Rodrigues, Luis | Concordia University |
| |
| 14:00-14:15, Paper FrB09.1 | |
| On the Relation between Observability of a Class of Linear Systems with Norm Outputs and Navigation with Range Measurements |
|
| Rodrigues, Luis | Concordia University |
Keywords: Observers for nonlinear systems, Linear systems, Estimation
Abstract: This paper formulates sufficient conditions for observability of a class of linear time invariant systems with a norm output. The observability of these systems depends on the input and will be studied for any input that satisfies a linear autonomous difference equation. It is shown that the observability conditions are related to the navigation problem of finding a position fix from several range measurements by intersecting spheres. This offers a new geometric interpretation of observability. Moreover, it will be shown that the initial condition of the system can still be estimated in the least squares sense if noise is added to the output. An example shows the application of the methodology. Additionally the example shows that the initial condition of the noiseless system is obtained by inverting a lower dimensional upper triangular matrix. Thus, the proposed method yields a matrix inversion with less computations when compared to the augmented state linear time-varying system approaches in the literature. A second example shows how the proposed methodology can be extended to perform state estimation.
|
| |
| 14:15-14:30, Paper FrB09.2 | |
| Adaptive Parameter Estimation-Based Observers of Linear Time-Varying Differential Algebraic Equations Systems |
|
| Ortega, Romeo | ITAM |
| Bobtsov, Alexey | ITMO University |
| Castanos, Fernando | Cinvestav |
| Nikolaev, Nikolay | Technion – Israel Institute of Technology |
Keywords: Estimation, Time-varying systems, Differential-algebraic systems
Abstract: In this paper, we apply the recently developed generalized parameter estimation-based observer design technique for state-affine systems to the practically important case of linear time-varying differential-algebraic equations (DAE) systems with uncertain parameters. We proceed from the general description of the system given by the Standard Canonical Form and try to develop a comprehensive theory for the design of adaptive observers for these systems.
|
| |
| 14:30-14:45, Paper FrB09.3 | |
| SafeSLAM: Trustworthy Localization Via Zonotopic Memory |
|
| Kogel, Tanner | University of Texas at Dallas |
| Wagner, Jonas | University of Texas at Dallas |
| Koeln, Justin | University of Texas at Dallas |
| Ruths, Justin | University of Texas at Dallas |
Keywords: Estimation, Uncertain systems, Autonomous robots
Abstract: The sensor fusion required to localize a robot in an unknown environment brings together relative odometry updates (how far the robot has moved) and relative measurements of features, or landmarks, in the environment. The robot location and trajectory can be inferred by simultaneously corroborating these noisy relative measurements over time. Here we leverage the dependency preserving ``memory'' of zonotopic sets to exactly calculate the set of all feasible agent poses and landmark locations under unknown but bounded uncertainty. Our safeSLAM implementation enables it to be used in pipelines for safety verification. We demonstrate our method in two examples and show that conventional approaches, which identify the maximum likelihood solution using a probabilistic formulation, cannot provide a guarantee that the solution is actually a feasible state of the robot.
|
| |
| 14:45-15:00, Paper FrB09.4 | |
| Parameter Robustness in Data-Driven Estimation of Dynamical Systems |
|
| Pandey, Ayush | University of California, Merced |
Keywords: Estimation, Model Validation, Uncertain systems
Abstract: We study the robustness of system estimation to parametric perturbations in system dynamics and initial conditions. We define the problem of sensitivity-based parametric uncertainty quantification in dynamical system estimation. The main contribution of this paper is the development of a novel robustness metric for estimation of parametrized linear dynamical systems with and without control actions. For the computation of this metric, we delineate the uncertainty contributions arising from control actions, system dynamics, and initial conditions. Furthermore, to validate our theoretical findings, we establish connections between these new results and the existing literature on the robustness of model reduction. This work provides guidance for selecting estimation methods based on tolerable levels of parametric uncertainty and paves the way for new cost functions in data-driven estimation that reward sensitivity to a desired subset of parameters while penalizing others.
|
| |
| 15:00-15:15, Paper FrB09.5 | |
| An Interval Observer Design and Optimization Method for Discrete Linear Time-Invariant Systems |
|
| Xu, Feng | Tsinghua University |
Keywords: Estimation, Linear systems, Uncertain systems
Abstract: This paper proposes a gain optimization design method for an interval observer (IO) of discrete linear timeinvariant (LTI) systems. First, in order to describe the quality of interval estimation, a vector norm-based specification is used to measure the interval size. Second, based on the specification,a gain optimization design method is proposed for the IO. At the end of this paper, the effectiveness of the proposed method is illustrated by numerical examples.
|
| |
| 15:15-15:30, Paper FrB09.6 | |
| Shape-Aware Extended Object Tracking Via Gaussian Process Implicit Surfaces |
|
| Ernst, Eugen | University of Stuttgart |
| Pfaff, Florian | University of Stuttgart |
Keywords: Estimation, Filtering, Uncertain systems
Abstract: We propose a framework for Extended Object Tracking (EOT) that integrates Gaussian Process Implicit Surfaces (GPIS) with particle filtering to track objects with complex, non-star-convex geometries. Unlike traditional EOT methods that rely on parametric or star-convex shape assumptions, our approach models object extents as probabilistic signed distance functions. The GPIS model incorporates surface normals and propagates measurement uncertainty, enabling robust geometry-aware state estimation. To address the computational demands of GPIS, we introduce an online data acquisition strategy that adaptively selects informative observations. Simulation results across a range of shape categories demonstrate that our method outperforms existing EOT techniques in both tracking accuracy and shape reconstruction fidelity.
|
| |
| 15:30-15:45, Paper FrB09.7 | |
| Remote Estimation for Markov Jump Linear Systems: A Distributionally Robust Approach |
|
| Tzortzis, Ioannis | University of Cyprus |
| Charalambous, Themistoklis | University of Cyprus |
| Charalambous, Charalambos D. | University of Cyprus |
Keywords: Estimation, Uncertain systems, Markov processes
Abstract: This paper considers the problem of remote state estimation for Markov jump linear systems in the presence of uncertainty in the posterior mode probabilities. Such uncertainty may arise when the estimator receives noisy or incomplete measurements over an unreliable communication network. To address this challenge, the estimation problem is formulated within a distributionally robust framework, where the true posterior is assumed to lie within a total variation distance ball centered at the nominal posterior. The resulting minimax formulation yields an estimator that extends the classical MMSE solution with additional terms that account for mode uncertainty. A tractable implementation is developed using a distributionally robust variant of the first-order generalized pseudo-Bayesian algorithm. A numerical example is provided to illustrate the applicability and effectiveness of the approach.
|
| |
| 15:45-16:00, Paper FrB09.8 | |
| Control Synthesis in Partially Observable Environments for Complex Perception-Related Objectives |
|
| Xuan, Zetong | University of Florida |
| Wang, Yu | University of Florida |
Keywords: Markov processes, Estimation, Automata
Abstract: Perception-related tasks often arise in autonomous systems operating in partially observable environments. This work studies the control synthesis problem for optimal policy for complex perception-related objectives in environments modeled by partially observable Markov decision processes (POMDPs). To formally express these objectives, we introduce co-safe linear inequality temporal logic (sc-iLTL), which can define complex tasks that are formed by logical concatenation of atomic propositions as linear inequalities on the belief space of the POMDPs. Our solution to the control synthesis problem is to transform the sc-iLTL objectives into reachability objectives by constructing the product of the belief MDP and a deterministic finite automaton (DFA) built from the sc-iLTL objective. Then, to address the scalability issue due to the product, we introduce a Monte Carlo Tree Search (MCTS) method that is guaranteed to find the optimal policy. Finally, we demonstrate the applicability of our method through a drone-probing case study.
|
| |
| FrB10 |
Oceania VII |
| Cooperative Control in Networks and Agent-Based Systems |
Regular Session |
| Chair: Su, Lanlan | University of Sheffield |
| Co-Chair: Tzes, Anthony | New York University Abu Dhabi |
| |
| 14:00-14:15, Paper FrB10.1 | |
| Passivity Compensation: A Distributed Approach for Consensus Analysis in Heterogeneous Networks |
|
| Su, Yongkang | University of Sheffield |
| Khong, Sei Zhen | National Sun Yat-Sen University |
| Su, Lanlan | University of Manchester |
Keywords: Cooperative control, Communication networks, Nonlinear output feedback
Abstract: This paper investigates a passivity-based approach to output consensus analysis in heterogeneous networks composed of non-identical agents coupled via nonlinear interactions, in the presence of measurement and/or communication noise. Focusing on agents that are input-feedforward passive (IFP), we first examine whether a shortage of passivity in some agents can be compensated by a passivity surplus in others, in the sense of preserving the passivity of the transformed open-loop system defined by the agent dynamics and network topology. We show that such compensation is only feasible when at most one agent lacks passivity, and we characterise how this deficit can be offset using the excess passivity within the group of agents. For general networks, we then investigate passivity compensation within the feedback interconnection by leveraging the passivity surplus in the coupling links to locally compensate for the lack of passivity in the adjacent agents. In particular, a distributed condition, expressed in terms of passivity indices and coupling gains, is derived to ensure output consensus of the interconnected network.
|
| |
| 14:15-14:30, Paper FrB10.2 | |
| Adaptive Containment Control of Nonlinear Multi-Agent Systems with State Constraints |
|
| Zhu, Chenhong | Shandong Normal University, Southeast University |
| Wen, Guanghui | Southeast University |
| Zheng, Wei Xing | Western Sydney University |
| Lei, Xuqiang | Southeast University |
Keywords: Agents-based systems, Cooperative control, Lyapunov methods
Abstract: This paper investigates the finite-time adaptive containment control problem for a class of nonlinear multi-agent systems subject to full state constraints. Unlike existing literature that predominantly focused on multi-agent systems with integrator-type high-order linear dynamics or unconstrained nonlinear dynamics, the adaptive containment control problem of multi-agent systems with high-order nonlinear dynamics and state constraints is resolved in this paper. Based on the finite-time Lyapunov stability theory, a finite-time distributed observer is first designed to estimate the information of leaders. Subsequently, by employing the backstepping technique, novel finite-time adaptive containment controllers are constructed to drive followers into a small neighborhood of the convex hull defined by the output of leaders within finite time, while rigorously maintaining all system states inside predefined bounded regions. Finally, theoretical findings are verified through illustrative numerical results.
|
| |
| 14:30-14:45, Paper FrB10.3 | |
| A Consensus Algorithm for Systems Evolving on Lie Groups |
|
| Krishna, Akhil | Research Engineer (New York University Abu Dhabi) |
| Khorrami, Farshad | NYU Tandon School of Engineering |
| Tzes, Anthony | New York University Abu Dhabi |
Keywords: Cooperative control, Agents-based systems, Distributed control
Abstract: In this paper, a consensus algorithm is proposed for interacting multi-agents, which can be modeled as simple Mechanical Control Systems (MCS) evolving on a general Lie group. The standard Laplacian flow consensus algorithm for double integrator systems evolving on Euclidean spaces is extended to a general Lie group. A tracking error function is defined on a general smooth manifold for measuring the error between the configurations of two interacting agents. The stability of the desired consensus equilibrium is proved using a generalized version of Lyapunov theory and LaSalle's invariance principle applicable for systems evolving on a smooth manifold. The proposed consensus control input requires only the configuration information of the neighboring agents and does not require their velocities and inertia tensors. The design of tracking error function and consensus control inputs are demonstrated through an application of attitude consensus problem for multiple communicating rigid bodies. The consensus algorithm is numerically validated by demonstrating the attitude consensus problem.
|
| |
| 14:45-15:00, Paper FrB10.4 | |
| A Class of Optimal Directed Graphs for Network Synchronization |
|
| Lu, Susie | MIT |
| Liu, Ji | Stony Brook University |
Keywords: Cooperative control, Distributed control, Network analysis and control
Abstract: In a paper by Nishikawa and Motter, a quantity called the normalized spread of the Laplacian eigenvalues is used to measure the synchronizability of certain network dynamics. Through simulations, and without theoretical validation, it is conjectured that among all simple directed graphs with a fixed number of vertices and arcs, the optimal value of this quantity is achieved if the Laplacian spectrum satisfies a specific pattern. This paper proves that the conjectured Laplacian spectrum is always achievable by a class of almost regular directed graphs. For a few special cases, it is also shown that the corresponding value of the quantity is indeed optimal.
|
| |
| 15:00-15:15, Paper FrB10.5 | |
| Attitude Synchronization for Multi-Agent Systems on SO(3) Using Vector Measurements |
|
| Boughellaba, Mouaad | Lakehead University |
| Berkane, Soulaimane | University of Quebec in Outaouais |
| Tayebi, Abdelhamid | Lakehead University |
Keywords: Cooperative control, Distributed control, Nonlinear systems
Abstract: In this paper, we address the problem of leaderless attitude synchronization for a group of rigid body systems evolving on SO(3), relying on local measurements of some inertial (unit-length) vectors. The interaction graph among agents is assumed to be undirected, acyclic, and connected. We first present a distributed attitude synchronization scheme designed at the kinematic level of SO(3), followed by an extended scheme designed at the dynamic level. Both schemes are supported by a rigorous stability analysis, which establishes their almost global asymptotic stability properties. Finally, numerical simulations demonstrate the effectiveness of both distributed attitude synchronization schemes.
|
| |
| 15:15-15:30, Paper FrB10.6 | |
| Multi-Agent Consensus with Non-Commensurate Time Delay: Lambert W Function Approach |
|
| Badran, Layan | Concordia University |
| Aryankia, Kiarash | Concordia University |
| Selmic, Rastko | Concordia University |
Keywords: Agents-based systems, Cooperative control
Abstract: This paper investigates the effect of constant time delay in weakly connected multi-agent systems modeled by double integrator dynamics. A novel analytical approach is proposed to establish an upper bound on the permissible time delay that ensures stability and consensus convergence. The analysis employs the Lambert W function method in higher-dimensional systems to derive explicit conditions under which consensus is achieved. The theoretical results are rigorously proven and provide insight into the allowable delay margins. The analysis applies to general leaderless undirected network topologies. The framework also accounts for complex and realistic delays, including non-commensurate communication delays. Numerical examples are provided to demonstrate the effectiveness of the proposed method.
|
| |
| 15:30-15:45, Paper FrB10.7 | |
| LMI-Based Static Output Feedback Control of Uncertain Multi-Agent Systems |
|
| Rostami Alkhorshid, Daniel | University of Brasília |
| Tognetti, Eduardo Stockler | University of Brasilia |
| Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Keywords: Cooperative control, LMIs, Uncertain systems
Abstract: This work presents the design of static output feedback (SOF) decentralized protocols for the leader-follower tracking problem in multi-agent systems subject to bounded heterogeneous time-varying uncertainties. We propose an approach that ensures input-to-state stability of the closed-loop system with respect to uncertainties such that the synchronization errors between the leader and followers converge to a positively invariant and attractive set. The SOF design is formulated as a convex optimization problem expressed in terms of linear matrix inequalities (LMIs), making it computationally efficient. Additionally, the proposed method relaxes the requirement for full network knowledge by employing a network matrix decomposition via variable transformation. This design framework is applicable regardless of the number of agents, making it well-suited for large-scale networks. Numerical examples illustrate the effectiveness of the proposed approach.
|
| |
| 15:45-16:00, Paper FrB10.8 | |
| An Extension of Taylor’s Model: Hierarchical Convergence under Static and Dynamic Biases |
|
| D'Alfonso, Luigi | Università Della Calabria |
| Merzi, Mehmet Alp | University of Calabria |
| Fedele, Giuseppe | Università Della Calabria |
Keywords: Agents-based systems, Cooperative control, Network analysis and control
Abstract: This work revisits the classical Taylor model for opinion dynamics in networks of interacting agents by embedding a hierarchical structure that governs the flow of influence across multiple strata of a multi-agent system. Unlike traditional frameworks that assume uniform exposure to static external inputs, we propose a model in which agents are structured into distinct tiers, each governed by the outcomes of those above. This allows for a refined understanding of how stubborn agents shape collective opinion, both in static and temporally varying scenarios. The model accommodates the presence of dynamic external biases and accounts for estimation errors in the observation of evolving leadership signals. Numerical studies validate the effectiveness and containment capabilities of the model.
|
| |
| FrB11 |
Oceania VI |
| Network Analysis and Control II |
Regular Session |
| Chair: Stella, Leonardo | University of Birmingham |
| Co-Chair: Fontan, Angela | KTH Royal Institute of Technology |
| |
| 14:00-14:15, Paper FrB11.1 | |
| Global Synchronization of Multi-Agent Systems with Nonlinear Interactions |
|
| Couthures, Anthony | University of Lorraine |
| Satheeskumar Varma, Vineeth | CNRS |
| Lasaulce, Samson | CNRS |
| Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Keywords: Network analysis and control, Agents-based systems
Abstract: The paper addresses the synchronization of multi-agent systems with continuous-time dynamics interacting through a very general class of monotonic continuous signal functions that covers estimation biases, approximation of discrete quantization, or state-dependent estimation. Our analysis reveals that, in the setup under consideration, synchronization equilibria are exactly the fixed points of the signal function. We also derive intuitive stability conditions based on whether the signal underestimates or overestimates the state of the agents around these fixed points. Moreover, we show that network topology plays a crucial role in asymptotic synchronization. These results provide interesting insights into the interplay between communication nonlinearity and network connectivity, paving the way for advanced coordination strategies in complex systems.
|
| |
| 14:15-14:30, Paper FrB11.2 | |
| Stability of Open Multi-Agent Systems Over Dynamic Signed Graphs |
|
| Sekercioglu, Pelin | KTH Royal Institute of Technology |
| Fontan, Angela | KTH Royal Institute of Technology |
| Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Network analysis and control, Switched systems, Lyapunov methods
Abstract: This paper addresses the synchronization problem in open multi-agent systems containing both cooperative and antagonistic interactions. In these systems, new agents can join and new interactions can be formed over time. Moreover, the types of interactions, cooperative or antagonistic, may change. To model these structural changes, we represent the system as a switched system interconnected over a dynamic signed graph. Using the signed edge-based agreement protocol and constructing strict Lyapunov functions for signed edge-Laplacian matrices with multiple zero eigenvalues, we establish global asymptotic stability of the synchronization errors. Numerical simulations validate our theoretical results.
|
| |
| 14:30-14:45, Paper FrB11.3 | |
| Model Predictive Control for Coupled Adoption-Opinion Dynamics |
|
| Alutto, Martina | Politecnico Di Torino |
| Xu, Qiulin | Tokyo Institute of Technology |
| Dabbene, Fabrizio | CNR-IEIIT |
| Ishii, Hideaki | University of Tokyo |
| Ravazzi, Chiara | National Research Council of Italy (CNR) |
Keywords: Network analysis and control, Optimal control, Stability of nonlinear systems
Abstract: This paper investigates an optimal control problem for an adoption-opinion model that couples opinion dynamics with a compartmental adoption framework on a multilayer network to study the diffusion of sustainable behaviors. Adoption evolves through social contagion and perceived benefits, while opinions are shaped by social interactions and feedback from adoption levels. Individuals may also stop adopting virtuous behavior due to external constraints or shifting perceptions, affecting overall diffusion. After the stability analysis of equilibria, both in the presence and absence of adopters, we introduce a Model Predictive Control (MPC) framework that optimizes interventions by shaping opinions rather than directly enforcing adoption. This nudge-based control strategy allows policymakers to influence diffusion indirectly, making interventions more effective and scalable. Numerical simulations demonstrate that, in the absence of control, adoption stagnates, whereas MPC-driven interventions sustain and enhance adoption across communities.
|
| |
| 14:45-15:00, Paper FrB11.4 | |
| Synchronization of Multi-Agent Hybrid Systems with Synchronous State-Dependent Jumps |
|
| Cellier-Devaux, Alexandre | Cnrs - Lagepp |
| Zaccarian, Luca | LAAS-CNRS |
| Astolfi, Daniele | Cnrs - Lagepp |
| Andrieu, Vincent | Université De Lyon |
Keywords: Hybrid systems, Control of networks, Network analysis and control
Abstract: In this paper, we investigate the problem of synchronization of multi-agent systems described by hybrid non-linear input affine dynamics. We consider networks described by undirected connected graphs without leader. We present a set of sufficient conditions based on an LMI approach in order to design a state-feedback distributed control law. Then, by exploiting the properties of the graph incidence matrix we provide an optimization of the tuning parameters. The incidence matrix properties also allow us to construct a Lyapunov function to establish synchronization of the hybrid network.
|
| |
| 15:00-15:15, Paper FrB11.5 | |
| Cluster Consensus for Signed Networks with Inherent Nonlinear Dynamics |
|
| Develer, Ümit | Bogazici University |
| Cihan, Onur | Marmara University |
| Akar, Mehmet | Bogazici University |
Keywords: Stability of nonlinear systems, Distributed control, Network analysis and control
Abstract: In this paper, we examine the cluster consensus challenge in networks where first and second-order systems interact through both cooperation and competition and exhibit inherent nonlinear dynamics. We establish a sufficient condition, derived from Lyapunov theory, for a single controller parameter that guarantees cluster consensus in a first-order multi-agent network with a signed directed graph that might not have a spanning tree. We expand the stability analysis to include second-order nonlinear systems, which utilize a two-parameter distributed control approach. By employing the extended graph representation of a given signed network, we determine the exact number of clusters and the composition of each cluster from the primary and secondary subgraphs of the extended system. This paper establishes a novel contribution by being the first to address cluster consensus for inherently nonlinear networks operating on signed directed graphs that do not contain a spanning tree.
|
| |
| 15:15-15:30, Paper FrB11.6 | |
| Bio-Inspired Collective Decision-Making on a Multi-Population |
|
| Baar, Wouter | University of Groningen |
| Stella, Leonardo | University of Birmingham |
| Bauso, Dario | University of Groningen |
Keywords: Stability of nonlinear systems, Network analysis and control, Biological systems
Abstract: In recent years, there has been an increasing interest towards the theory and applications of decision-making in multi-agent systems, where the interactions among multiple groups of individuals exhibit complex behaviors. However, a large body of works considers only a single homogeneous population, limiting the applications in real settings. To this end, we develop a general framework for collective decision-making on a networked multi-population. We study this problem in populations with a large number of agents, where each agent has to choose one of two available options, or remain uncommitted. The contribution of this paper is threefold. First, we develop a framework for collective decision-making on a networked multi-population where the transition rates depend on the neighboring populations. Second, we characterize the equilibria and find the conditions for local asymptotic stability. Finally, we study globally stability for the equilibrium where all players are committed to neither option.
|
| |
| 15:30-15:45, Paper FrB11.7 | |
| Graph Resistance-Based Approach to Identify Articulation Sets |
|
| Dennisselvan, Sahaya Aarti | Indian Institute of Technology Bombay |
| Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Keywords: Network analysis and control, Control of networks
Abstract: Connectivity is a critical aspect of a multi-agent system (MAS), as it determines the system's ability to successfully accomplish cooperative tasks. Consequently, it is essential to develop algorithms that design networks or graphs that not only ensure nominal connectivity but also provide guarantees of connectivity despite node failures. This characteristic of maintaining connectivity even in the face of node failures is referred to as biconnectivity. In this paper, we utilize the concept of graph resistance to ensure connectivity in the presence of emph{articulation nodes/sets} when node failures occur. First, an algorithm is presented to identify articulation nodes/sets within the graph. Subsequently, we introduce a connectivity index to quantify the extent of connectivity loss when nodes fail. Additionally, we propose a method for designing a network that preserves connectivity against node failures with addition of a minimal number of edges.
|
| |
| 15:45-16:00, Paper FrB11.8 | |
| Towards Modular Scattering-Based Design of Dissipative Networks with Time Delays |
|
| Polushin, Ilia G. | Western University |
Keywords: Nonlinear systems, Network analysis and control, Delay systems
Abstract: Results related to modular scattering-based design of complex dissipative networks in the case where at least some of the subsystems are interconnected over delayed communication channels are presented. Conditions for well-posedness of dissipative networks with delayed communication channels are formulated in terms of subsystems behaviors. Conditions on the subsystems that guarantee an interconnection to satisfy the target property of weak dissipativity with a quadratic supply rate and additional internal stability-like properties are established in the form of a linear matrix inequality as well as in the form of a graph separation condition. A procedure for the design of scattering transformations is developed which guarantees that the scattering-based interconnection with communication delays satisfies the target property. Iterative application of the proposed techniques allows for modular design of large scale dissipative networks with overall stability properties in the presence of communication delays.
|
| |
| FrB12 |
Oceania X |
| Optimization Algorithms IV |
Regular Session |
| Chair: Coutinho, Daniel | Universidade Federal De Santa Catarina |
| Co-Chair: Geng, Sijia | Johns Hopkins University |
| |
| 14:00-14:15, Paper FrB12.1 | |
| A Finsler’s Lemma Application to DLMI Optimization |
|
| Bhiri, Bassem | Université De Gabès-CONPRI |
| Coutinho, Daniel | Universidade Federal De Santa Catarina |
Keywords: Optimization algorithms, Sampled-data control, LMIs
Abstract: This paper suggests a new numerical technique to deal with infinite dimensional Differential Linear Matrix Inequality over a real compact. For this aim, we use an extended robust version of Finsler's Lemma to convert a quadratic time-dependent constraint over a finite time interval into an efficient, tractable Linear Matrix Inequality. Then, the proposed approach is used to obtain a new computationally efficient sufficient condition for finite time stability linear time invariant systems.
|
| |
| 14:30-14:45, Paper FrB12.3 | |
| Distributed Online Algorithms for Economic Power Dispatch |
|
| Mi, Yingjie | The University of Sydney |
| Yuan, Deming | Nanjing University of Science and Technology |
| Ratnam, Elizabeth | Australian National University |
| Verbic, Gregor | Unversity of Sydney |
| Shi, Guodong | The University of Sydney |
Keywords: Power systems, Optimization algorithms, Distributed control
Abstract: We extend the classical economic dispatch problem in power systems to an online optimization framework. In this setting, costs and constraints are revealed locally at decentralized generating units only after dispatch decisions are implemented. The formulation admits sequential and robust solutions to economic dispatch under uncertainty, and further generalizes to a distributed online optimization problem with a global coupling constraint enforcing supply–demand balance. For both full-information and bandit feedback, we develop distributed algorithms and prove that they achieve sublinear dynamic regret relative to the path length of the optimal offline solutions.
|
| |
| 14:45-15:00, Paper FrB12.4 | |
| Unified Control Scheme for Optimal Allocation of GFM and GFL Inverters in Power Networks |
|
| Geng, Sijia | Johns Hopkins University |
| Chatterjee, Sushobhan | Johns Hopkins University |
Keywords: Power systems, Smart grid, Optimization algorithms
Abstract: With the rapid adoption of emerging inverter-based resources in power systems, it is crucial to understand their dynamic interactions across the network and impacts on stability. To avoid combinatorial challenges and the limitation of case-by-case studies, this paper proposes a systematic and efficient method for determining optimal allocation of grid-forming and grid-following inverters in power networks. The approach leverages a novel unified grid-forming/following inverter control and formulates an optimization problem to ensure stability and maximal energy dissipation during transient periods. An iterative algorithm is developed to solve the optimization problem. The resulting optimal droop gains for the unified inverters provide insights into the network's needs for grid-forming and grid-following resources. A three-bus system is used to demonstrate the optimality and dynamic performance of the proposed method.
|
| |
| 15:00-15:15, Paper FrB12.5 | |
| Feedback Optimization with State Constraints through Control Barrier Functions |
|
| Delimpaltadakis, Giannis | Eindhoven University of Technology |
| Mestres, Pol | California Institute of Technology |
| Cortes, Jorge | UC San Diego |
| Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Keywords: Constrained control, Optimization algorithms, Optimization
Abstract: Recently, there has been a surge of research on a class of methods called feedback optimization. These are methods to steer the state of a control system to an equilibrium that arises as the solution of an optimization problem. Despite the growing literature on the topic, the important problem of enforcing state constraints at all times remains unaddressed. In this work, we present the first feedback-optimization method that enforces state constraints. The method combines a class of dynamics called safe gradient flows with high-order control barrier functions. We provide a number of results on our proposed controller, including well-posedness guarantees, anytime constraint-satisfaction guarantees, equivalence between the closed-loop's equilibria and the optimization problem's critical points, and local asymptotic stability of optima.
|
| |
| 15:15-15:30, Paper FrB12.6 | |
| Tikhonov Regularized Exterior Penalty Methods for Hierarchical Variational Inequalities |
|
| Marschner, Meggie | University of Mannheim |
| Staudigl, Mathias | Universität Mannheim |
Keywords: Optimization, Optimization algorithms
Abstract: We consider nested variational inequalities consisting in a (upper-level) variational inequality whose feasible set is given by the solution set of another (lower-level) variational inequality. This class of hierarchical equilibrium contains a wealth of important applications, including purely hierarchical convex bilevel optimization problems and certain multi-follower games. Working within a real Hilbert space setting, we develop a double loop prox-penalization algorithm with strong convergence guarantees towards a solution of the nested VI problem. We present various application that fit into our framework and present also some preliminary numerical results.
|
| |
| 15:30-15:45, Paper FrB12.7 | |
| Reactive Power Flow Optimization in AC Drive Systems |
|
| Chandrasekaran, Sanjay | ETH Zurich |
| Arghir Scheifele, Catalin-Ionel | Rheinmetall Air Defence |
| Joerg, Pieder | ABB Medium Voltage Drives |
| Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
| Mastellone, Silvia | University of Applied Science Northwestern Switzerland FHNW |
Keywords: Power electronics, Optimization algorithms, Constrained control
Abstract: This paper explores a limit avoidance approach in the case of modulation (input) and current (output) constraints with the aim of enhancing system availability of AC drives. Drawing from the observation that, in a certain range of reactive power, there exists a trade-off between current and modulation magnitude, we exploit this freedom and define a constrained optimization problem. We propose two approaches: one in the form of an activation-function, and another that uses online feedback optimization to set the reactive power dynamically. Both methods compromise reactive power tracking accuracy for increased system robustness. Through a high-fidelity simulation, we compare the benefits of the two methods, highlighting their effectiveness in industrial applications.
|
| |
| 15:45-16:00, Paper FrB12.8 | |
| Optimal Balancing of Tropical Discrete-Event Systems through Feedback Control |
|
| Maia, Carlos-Andrey | Universidade Federal De Minas Gerais |
Keywords: Discrete event systems
Abstract: Dynamical Tropical systems are described by means of Tropical Algebra (for instance, Min- or Max-plus ones), which is a kind of idempotent semifield. For such systems, we are interested in the study of general algebraic properties ensuring optimal balancing through feedback control. By balancing, we mean that all events, or transitions, occur at the same rate, meaning that there is no sub-product accumulation inside the system. In this context, after formulating the problem for Tropical Semifields, the first result is the development, thanks to Residuation Theory, of the expression of the maximum feedback matrix expressed in terms of a vector parameter, ensuring that the closed-loop matrix has a desired eigenvalue. Under the assumption of controllability and boundedness of the controllability matrix, we develop a method to properly choose this maximum feedback matrix. In order to illustrate the method, we present a solution for the problem of balancing two unconnected networks by means of feedback control.
|
| |
| FrB13 |
Oceania IX |
| Game Theory V |
Regular Session |
| Chair: Bose, Subhonmesh | University of Illinois at Urbana Champaign |
| Co-Chair: Zhu, Quanyan | New York University |
| |
| 14:00-14:15, Paper FrB13.1 | |
| Invariance and Concentration Properties of Gradient-Based Learning in Games |
|
| Lotidis, Kyriakos | Stanford University |
| Mertikopoulos, Panayotis | French National Center for Scientific Research (CNRS) |
| Bambos, Nicholas | Stanford University |
| Blanchet, Jose | Stanford University |
Keywords: Game theory, Optimization algorithms
Abstract: In this paper, we study the long-run behavior of learning in strongly monotone games with stochastic, gradient-based feedback. For concreteness, we focus on the stochastic projected gradient (SPG) algorithm, and we examine the asymptotic distribution of its iterates when the method is run with constant step-size updates (the de facto choice for practical deployments of the algorithm). In contrast to variants of the method with a vanishing step-size case, SPG with a constant step-size does not converge: instead, it reaches a neighborhood of the game's Nash equilibrium at an exponential rate, and then, due to persistent noise, it fluctuates in its vicinity without converging (occasionally moving away on rare occasions). We provide a theoretical quantification of this behavior by analyzing the Markovian structure of the process. Namely, we show that, regardless of the algorithm's initialization, the distribution of its iterates converges at a geometric rate to a unique invariant measure which is concentrated in a neighborhood of the game's Nash equilibrium. More explicitly, we quantify the degree of this concentration and the rate of convergence of the algorithm's empirical frequency of play to the invariant measure of the process in Wasserstein distance, and we provide explicit bounds in terms of the method's step-size, the variance of the noise entering the process, and the geometric features of the game's payoff landscape.
|
| |
| 14:15-14:30, Paper FrB13.2 | |
| Two-Player Dynamic Potential LQ Games with Sequentially Revealed Costs |
|
| Chen, Yitian | Australian National University |
| Molloy, Timothy L. | Australian National University |
| Shames, Iman | Australian National University |
Keywords: Game theory, Optimal control, Linear systems
Abstract: We investigate a novel finite-horizon linear-quadratic (LQ) feedback dynamic potential game with a priori unknown cost matrices played between two players. The cost matrices are revealed to the players sequentially, with the potential for future values to be previewed over a short time window. We propose an algorithm that enables the players to predict and track a feedback Nash equilibrium trajectory, and we measure the quality of their resulting decisions by introducing the concept of price of uncertainty. We show that under the proposed algorithm, the price of uncertainty is bounded by horizon-invariant constants. The constants are the sum of three terms; the first and second terms decay exponentially as the preview window grows, and the third depends on the magnitude of the differences between the cost matrices for each player. Through simulations, we illustrate that the resulting price of uncertainty initially decays at an exponential rate as the preview window lengthens, then remains constant for large time horizons.
|
| |
| 14:30-14:45, Paper FrB13.3 | |
| Inverse Reinforcement Learning for Mean-Field Games with Average Reward Criterion |
|
| Alkır, Şevket Kaan | Bilkent University |
| Saldi, Naci | Bilkent University |
Keywords: Mean field games, Reinforcement learning, Markov processes
Abstract: We study the inverse reinforcement learning problem for discrete-time, infinite-horizon mean-field games with an average-reward criterion. Unlike the forward setting, where the reward function is known, IRL assumes access only to expert demonstrations that are optimal under some unknown reward function. The objective is to recover the reward structure that explains these expert behaviors. Our approach is based on the maximum causal entropy principle, which selects the least biased policy among those consistent with the observed demonstrations. We show that the resulting non-convex formulation is equivalently reformulated as a convex optimization problem over occupation measures. Furthermore, we establish that the dual objective is smooth and strongly convex over compact sets and derive a variational representation using a log-partition formulation. Finally, we propose a first-order algorithm for solving the dual problem and recovering an entropy-maximizing equilibrium policy.
|
| |
| 14:45-15:00, Paper FrB13.4 | |
| Initial Error Tolerant Distributed Mean Field Control under Partial and Discrete Information |
|
| Jin, Yuxin | Beihang University |
| Wang, Haotian | Beihang University |
| Yao, Wang | Beihang University |
| Zhang, Xiao | Beihang University |
Keywords: Large-scale systems, Mean field games, Estimation
Abstract: In this paper, an initial error tolerant distributed mean field control method under partial and discrete information is introduced, where each agent only has discrete observations on its own state. First, we study agents' behavior in linear quadratic mean field games (LQMFGs) under heterogeneous erroneous information of the initial mean field state (MF-S), and formulate the relationships between initial errors and systemic deviations. Next, by capturing the initial error affection on the private trajectory of an agent, we give a distributed error estimation method based on maximum likelihood estimation (MLE), where each agent estimates information errors only based on discrete observations on its private trajectory. Furthermore, we establish an error-based segmented state estimation method, design the initial error tolerant distributed mean field control method (IET-DMFC), and analyze the error distribution of the state estimation. Finally, simulations are performed to verify the efficiency of the algorithm and the consistent properties.
|
| |
| 15:00-15:15, Paper FrB13.5 | |
| Harnessing Information in Incentive Design |
|
| Velicheti, Raj Kiriti | University of Illinois at Urbana Champaign |
| Bose, Subhonmesh | University of Illinois at Urbana Champaign |
| Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Optimization algorithms, Stochastic systems
Abstract: Incentive design deals with interaction between a principal and an agent where the former can shape the latter's utility through a policy commitment. It is well known that the principal faces an information rent when dealing with an agent that has informational advantage. In this work, we embark on a systematic study of the effect of information asymmetry in incentive design games. Specifically, we first demonstrate that it is in principal's interest to decrease this information asymmetry. To mitigate this uncertainty, we let the principal gather information either by letting the agent shape her belief (aka Information Design), or by paying to acquire it. Providing solutions to all these cases we show that while introduction of uncertainty increases the principal's cost, letting the agent shape its belief can be advantageous. We study information asymmetry and information acquisition in both matrix games and quadratic Gaussian game setups.
|
| |
| 15:15-15:30, Paper FrB13.6 | |
| Voltage Support Procurement in Transmission Grids: Incentive Design Via Online Bilevel Games |
|
| Jiang, Zhisen | Tsinghua University |
| Bolognani, Saverio | ETH Zurich |
| Belgioioso, Giuseppe | KTH Royal Institute of Technology |
Keywords: Power systems, Game theory, Computational methods
Abstract: The integration of distributed energy resources into transmission grid operations presents a complex challenge, particularly in the context of reactive power procurement for voltage support. This paper addresses this challenge by formulating the voltage regulation problem as a Stackelberg game, where the Transmission System Operator (TSO) designs incentives to guide the reactive power responses of Distribution System Operators (DSOs). We utilize a gradient-based iterative algorithm that updates the incentives to ensure that DSOs adjust their reactive power injections to maintain voltage stability. We incorporate principles from online feedback optimization to enable real-time implementation, utilizing voltage measurements in both TSO's and DSOs' policies. This approach not only enhances the robustness against model uncertainties and changing operating conditions but also facilitates the co-design of incentives and automation. Numerical experiments on a 5-bus transmission grid demonstrate the effectiveness of our approach in achieving voltage regulation while accommodating the strategic interactions of self-interested DSOs.
|
| |
| 15:30-15:45, Paper FrB13.7 | |
| A Bregman Method for Mixed-Strategy Generalized Nash Equilibrium Seeking |
|
| Ananduta, Wicak | Flemish Institute for Technological Research (VITO) |
| Grammatico, Sergio | Delft Univ. of Tech |
Keywords: Optimization algorithms, Variational methods, Game theory
Abstract: We consider the problem of computing a mixed-strategy generalized Nash equilibrium (MS-GNE) for a class of games where each agent has both continuous and integer decision variables. Specifically, we propose a novel Bregman forward-reflected-backward splitting and design a semi-decentralized algorithm that exploits the problem structure. Technically, we prove convergence to a variational MS-GNE under mere monotonicity and Lipschitz continuity assumptions. Finally, we show the performance of our algorithms via numerical experiments.
|
| |
| 15:45-16:00, Paper FrB13.8 | |
| Bilateral Cognitive Security Games in Networked Control Systems under Stealthy Injection Attacks |
|
| Nguyen, Anh Tung | Uppsala University |
| Zhu, Quanyan | New York University |
| Teixeira, André M. H. | Uppsala University |
Keywords: Cyber-Physical Security, Game theory, Optimization
Abstract: This paper studies a strategic security problem in networked control systems under stealthy false data injection attacks. The security problem is modeled as a bilateral cognitive security game between a defender and an adversary, each possessing cognitive reasoning abilities. The adversary with an adversarial cognitive ability attacks strategically selected interconnections of the system with the aim of disrupting the network performance while remaining stealthy to the defender. Meanwhile, the defender with a defensive cognitive ability monitors strategically selected nodes to impose the stealthiness constraint with the purpose of minimizing the worst-case disruption caused by the adversary. Within the proposed bilateral cognitive security framework, the preferred cognitive levels of the two strategic agents are formulated in terms of two newly proposed concepts, cognitive mismatch and cognitive resonance. Moreover, we propose a method to compute the policies for the defender and the adversary with arbitrary cognitive abilities. A sufficient condition is established under which an increase in cognitive levels does not alter the policies for the defender and the adversary, ensuring convergence. The obtained results are validated through numerical simulations.
|
| |
| FrB14 |
Galapagos III |
| Robotics and Autonomous Systems IV |
Regular Session |
| Chair: Wendel, Eric | Boston University, Charles Stark Draper Laboratory |
| Co-Chair: d'Addato, Giulia | University of Trento |
| |
| 14:00-14:15, Paper FrB14.1 | |
| Real-Time Optimal Control Via Transformer Networks and Bernstein Polynomials |
|
| MacLin, Gage | University of Iowa |
| Cichella, Venanzio | University of Iowa |
| Patterson, Andrew | NASA Langley Research Center |
| Gregory, Irene | NASA Langley Research Center |
Keywords: Autonomous systems, Optimal control, Machine learning
Abstract: In this paper, we propose a Transformer-based framework for approximating solutions to infinite-dimensional optimization problems: calculus of variations problems and optimal control problems. Our approach leverages offline training on data generated by solving a sample of infinite-dimensional optimization problems using composite Bernstein collocation. Once trained, the Transformer efficiently generates near-optimal, feasible trajectories, making it well-suited for real-time applications. In motion planning for autonomous vehicles, for instance, these trajectories can serve to warm-start optimal motion planners or undergo rigorous evaluation to ensure safety. We demonstrate the effectiveness of this method through numerical results on a classical control problem and an online obstacle avoidance task. This data-driven approach offers a promising solution for real-time optimal control of nonlinear, nonconvex systems.
|
| |
| 14:15-14:30, Paper FrB14.2 | |
| Stable Autonomous Visual Navigation: An Expert Prediction Approach |
|
| Wendel, Eric | Boston University, Charles Stark Draper Laboratory |
| Baillieul, John | Boston Univ |
| Hollmann, Joseph | The Charles Stark Draper Laboratory, Inc |
Keywords: Autonomous systems, Predictive control for nonlinear systems, Vision-based control
Abstract: The proposed algorithm guarantees safe navigation of unknown visual scenes by autonomous robotic systems equipped with a monocular digital camera. Our methodology exploits synergistic connections between the design of almost globally stable nonlinear control systems and the design of exponentially weighted policies for sequential decision problems under the expert prediction protocol. These connections yield practical tools aiding the design of safe algorithms for autonomous visual navigation. Our algorithms are implemented in Warp and demonstrated in the Isaac Lab simulation environment.
|
| |
| 14:30-14:45, Paper FrB14.3 | |
| Motion Planning for Information Acquisition Via Continuous-Time Successive Convexification |
|
| Uzun, Samet | University of Washington |
| Acikmese, Behcet | University of Washington |
| Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Autonomous systems, Constrained control, Optimal control
Abstract: We address motion planning for a mobile agent to acquire information from multiple monitored targets using sensors with limited capabilities. To represent sensor limitations, such as range, field of view, and allowed acquisition directions, we introduce a nonnegative metric that quantifies the rate of information acquisition and is positive only when these limitations are satisfied. To enable optimization-based trajectory generation, we impose temporal logic specifications to ensure that the information acquisition metric and its gradient are nonzero over some time interval. This enables the application of continuous-time successive convexification to solve the motion planning problem. We demonstrate the proposed approach in a case study of a quadrotor that must acquire information on multiple targets with sensor range, field of view, and acquisition direction constraints.
|
| |
| 14:45-15:00, Paper FrB14.4 | |
| On Tensor-Based Polynomial Hamiltonian Systems |
|
| Cui, Shaoxuan | University of Groningen |
| Zhang, Guofeng | The Hong Kong Polytechnic University |
| Jardón-Kojakhmetov, Hildeberto | University of Groningen |
| Cao, Ming | University of Groningen |
Keywords: Autonomous systems, Nonlinear systems
Abstract: It is known that a linear system with a system matrix A constitutes a Hamiltonian system with a quadratic Hamiltonian if and only if A is a Hamiltonian matrix. This provides a straightforward method to verify whether a linear system is Hamiltonian or whether a given Hamiltonian function corresponds to a linear system. These techniques fundamentally rely on the properties of Hamiltonian matrices. Building on recent advances in tensor algebra, this paper generalizes such results to a broad class of polynomial systems. As the systems of interest can be naturally represented in tensor forms, we name them "tensor-based polynomial systems". Our main contribution is that we formally define Hamiltonian cubical tensors and characterize their properties. Crucially, we demonstrate that a tensor-based polynomial system is a Hamiltonian system with a polynomial Hamiltonian if and only if all associated system tensors are Hamiltonian cubical tensors—a direct parallel to the linear case. Additionally, we establish a computationally tractable stability criterion for tensor-based polynomial Hamiltonian systems. Finally, we validate all theoretical results through numerical examples and provide a further intuitive discussion.
|
| |
| 15:00-15:15, Paper FrB14.5 | |
| Behavior-Aware Online Prediction of Obstacle Occupancy Using Zonotopes |
|
| Carrizosa-Rendon, Alvaro | Universitat Politècnica De Catalunya |
| Zhou, Jian | Linköping University |
| Frisk, Erik | Linkoping Univ |
| Puig, Vicenc | Universitat Politècnica De Catalunya |
| Nejjari, Fatiha | Universitat Politecnica De Catalunya |
Keywords: Autonomous systems, Autonomous vehicles
Abstract: Predicting the motion of surrounding vehicles is key to achieving safe autonomous driving, especially in scenarios where road geometry is unknown or no prior information about the surrounding obstacles is available. This paper proposes a novel method to accurately and efficiently predict the occupancy sets of surrounding vehicles based on online observations of their behaviors. The approach is divided in two stages: First, a zonotopic set is computed to represent the estimated behavior of the vehicle. This is achieved using optimal observers and solving a linear programming (LP) problem. Then, a reachability analysis is performed to predict the corresponding zonotopic occupancy sets over a fixed prediction horizon. The effectiveness of the method has been validated through simulations in an urban environment, showing accurate and compact predictions without relying on assumptions or prior training data.
|
| |
| 15:15-15:30, Paper FrB14.6 | |
| Decoupled Phase Space Flexing in Autonomous Hamiltonian Systems |
|
| Boodram, Oliver | University of Colorado Boulder |
| Scheeres, Daniel J. | The University of Colorado |
Keywords: Algebraic/geometric methods, Aerospace, Autonomous systems
Abstract: Hamiltonian dynamical systems can be studied with the tools of symplectic geometry, which constrain how volumes of space deform. A better understanding of phase space flexing outlines how control strategies can leverage its natural deformation to reduce control effort. This letter establishes analytical forecasts for the stretching of phase space in the vicinity of a reference trajectory. For autonomous Hamiltonian systems, we leverage symplectic invariants and exterior algebraic forms to predict the expansion and contraction of phase space along both the local flow direction and the local gradient direction. This isolates an area-preserving sub-pocket of phase space spanned by these two directions. The stretching results are demonstrated numerically and linked to control effort in the two-body problem.
|
| |
| 15:30-15:45, Paper FrB14.7 | |
| Shaping Opinion Dynamics for Driving Navigation in Shared Human-Robot Spaces |
|
| d'Addato, Giulia | University of Trento |
| Palopoli, Luigi | University of Trento |
| Fontanelli, Daniele | University of Trento |
Keywords: Autonomous systems, Robotics, Stability of nonlinear systems
Abstract: We propose a strategy to shape opinion dynamics and guide navigation in shared human-robot environments. Unlike traditional methods that passively adapt to human motion, our approach actively guides human decisions with minimal intervention, steering the system towards a desired equilibrium. Using a Lyapunov-based stability framework, the strategy ensures robust convergence while preserving natural human-robot interactions. Simulation results confirm the effectiveness of this method, demonstrating how it achieves efficient, socially-aware navigation with subtle adjustments to the robot's behaviour. This framework paves the way for improved collaboration and coordination in dynamic, human-populated spaces.
|
| |
| 15:45-16:00, Paper FrB14.8 | |
| Continuously Ordered Hierarchies of Algorithmic Information in Digital Twinning and Signal Processing |
|
| Böck, Yannik | Technical University of Munich |
| Boche, Holger | Technische Universitaet Muenchen |
| Fitzek, Frank | Technical University of Dresden |
Keywords: Computational methods, Formal Verification/Synthesis, Autonomous systems
Abstract: We consider a fractional-calculus example of a continuous hierarchy of algorithmic information in the context of its potential applications in digital twinning. Digital twinning refers to different emerging methodologies in control engineering that involve the creation of a digital replica of some physical entity. From the perspective of computability theory, the problem of ensuring the digital twin's integrity -- i.e., keeping it in a state where it matches its physical counterpart -- entails a notion of algorithmic information that determines which of the physical system's properties we can reliably deduce by algorithmically analyzing its digital twin. The present work investigates the fractional calculus of periodic functions -- particularly, we consider the Wiener algebra -- as an exemplary application of the algorithmic-information concept. We establish a continuously ordered hierarchy of algorithmic information among spaces of periodic functions -- depending on their fractional degree of smoothness -- in which the ordering relation determines whether a certain representation of some function contains ``more'' or ``less'' information than another. Additionally, we establish an analogous hierarchy among ell-p-spaces, which form a cornerstone of (traditional) digital signal processing. Notably, both hierarchies are (mathematically) ``dual'' to each other. From a practical perspective, our approach ultimately falls into the category of formal verification and (general) formal methods.
|
| |
| FrB15 |
Capri II |
| Constrained Control I |
Regular Session |
| Chair: Ong, Pio | California Institute of Technology |
| Co-Chair: Russo, Antonio | Università Degli Studi Di Bergamo |
| |
| 14:00-14:15, Paper FrB15.1 | |
| Guaranteeing In-Polytope Permanence Times of Nonlinear System Trajectories Via Approximate Solutions |
|
| Cinto, Felipe | Cifasis, Unr - Conicet |
| Vallarella, Alexis J. | CIFASIS, UNR-CONICET |
| Russo, Antonio | Università Degli Studi Di Bergamo |
| Haimovich, Hernan | CONICET and Universidad Nacional De Rosario |
Keywords: Numerical algorithms, Switched systems, Simulation
Abstract: Causing the system trajectory to remain within a given region over some time interval is a key requirement of many control strategies. This may involve prediction of the future state evolution. However, general nonlinear systems often lack analytical solutions, making numerical approximations the only available tool. Even if error bounds for approximate solutions exist, these only ensure that the exact solution remains within the given region at discrete time instants. This fact may nonetheless not be sufficient to guarantee that the exact continuous-time solution remains within the region at all intermediate times. In this work, we develop a method to guarantee that the exact solution remains within a convex set throughout a time interval, by employing approximate solutions. When the convex set is a polytope, every step of the method becomes easily and completely computable. The applicability of the method is illustrated by means of numerical examples.
|
| |
| 14:15-14:30, Paper FrB15.2 | |
| Terminal Time Constrained Guidance against Stationary Target with Bounded Input and Rate |
|
| Singh, Swati | Indian Institute of Technology Bombay |
| Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
| Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Keywords: Aerospace, Control applications
Abstract: This paper presents a nonlinear guidance scheme to address the problem of achieving precise target interception at a desired impact time against stationary targets in a three-dimensional setting. Unlike most existing approaches that assume the interceptor to possess unbounded control, the proposed guidance design explicitly incorporates both magnitude and rate constraints (physical bounds) of the interceptor's actuators into the guidance law. This approach makes the guidance law more applicable to real-world scenarios. By including actuator constraints in the design, the overall effectiveness of the interceptor is enhanced. The proposed method utilizes an input-affine magnitude and rate saturation (MRS) model to effectively enforce these constraints. Guidance commands are derived by integrating the MRS model into the kinematics and are validated through performing simulations under various engagement scenarios.
|
| |
| 14:30-14:45, Paper FrB15.3 | |
| On the Properties of Optimal-Decay Control Barrier Functions |
|
| Ong, Pio | California Institute of Technology |
| Cohen, Max | California Institute of Technology |
| Molnar, Tamas G. | Wichita State University |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Lyapunov methods
Abstract: Control barrier functions provide a powerful means for synthesizing safety filters that ensure safety framed as forward set invariance. Key to CBFs' effectiveness is the simple inequality on the system dynamics: dot{h} geq - alpha(h). Yet determining the class mathcal{K}^e function alpha is a user defined choice that can have a dramatic effect on the resulting system behavior. This paper formalizes the process of choosing alpha using optimal-decay control barrier functions (OD-CBFs). These modify the traditional CBF inequality to: dot{h} geq - omega alpha(h), where omega geq 0 is automatically determined by the safety filter. A comprehensive characterization of this framework is elaborated, including tractable conditions on OD-CBF validity, control invariance of the underlying sets in the state space, forward invariance conditions for safe sets, and discussion on optimization-based safe controllers in terms of their feasibility, Lipschitz continuity, and closed-form expressions. The framework also extends existing higher-order CBF techniques, addressing safety constraints with vanishing relative degrees. The proposed method is demonstrated on a satellite control problem in simulation.
|
| |
| 14:45-15:00, Paper FrB15.4 | |
| Characterization and Computation of a Control Invariant Set for Discrete-Time Switched Systems under Waiting-Time Constraints |
|
| Perez, Mara | CONICET-UNL |
| Sanchez, Ignacio | CONICET |
| Anderson, Alejandro | CONICET-INTEC-UNL |
| González, Alejandro H. | CONICET-Universidad Nacional Del Litoral |
| Actis, Marcelo | UNL-FIQ |
Keywords: Switched systems, Constrained control, Numerical algorithms
Abstract: This work addresses the study of control invariant sets for discrete-time switched systems subject to minimal and maximal waiting-time constraints. We formally define and characterize such sets within an admissible region for linear control switched systems of q modes, demonstrating that they can be expressed as the union of q component sets. These component sets permit transitions between the modes without violating either the invariance property or the waiting-time constraints. Furthermore, we propose an algorithm for computing the maximal control invariant set under waiting-time constraints within a target region and validate our approach through numerical simulations.
|
| |
| 15:00-15:15, Paper FrB15.5 | |
| Anti-Windup Compensation for Nonlinear Dynamic Inversion Rigid Body Attitude Control |
|
| Soltani, Ali | University of Southampton |
| Turner, Matthew C. | University of Southampton |
| Richards, Christopher | University of Louisville |
Keywords: Aerospace, Constrained control, Stability of nonlinear systems
Abstract: The rigid body attitude stabilization problem with constrained control inputs has been studied by many researchers. However, if perfect eigen-axis rotation in rest-to-rest maneuvers is also desirable, the control design problem becomes more challenging and, to the best of the authors' knowledge, has not yet been addressed. In this paper, an anti-windup compensation approach to this problem is developed. A nonlinear dynamic inversion control is used to obtain satisfactory unconstrained performance and this is supplemented by an anti-windup compensator when constraints are encountered. The compensator provides global mathcal{L}_2 performance under reasonable conditions. A highlight of the approach is that the anti-windup compensator can have a nonlinear structure, giving flexibility in the choice of its parameters. Simulation results demonstrate the effectiveness of the proposed scheme as well as the performance improvement achieved using a compensator with state-dependent parameters.
|
| |
| 15:15-15:30, Paper FrB15.6 | |
| Approximation-Free Control for Signal Temporal Logic Specifications Using Spatiotemporal Tubes |
|
| Das, Ratnangshu | Indian Institute of Science, Bangalore |
| Choudhury, Subhodeep | BITS Pilani K. K. Birla Goa Campus |
| Jagtap, Pushpak | Indian Institute of Science |
Keywords: Constrained control, Uncertain systems
Abstract: This paper presents a spatiotemporal tube (STT)-based control framework for satisfying Signal Temporal Logic (STL) specifications in unknown control-affine systems. We formulate STL constraints as a robust optimization problem (ROP) and recast it as a scenario optimization program (SOP) to construct STTs with formal correctness guarantees. We also propose a closed-form control law that operates independently of the system dynamics, and ensures the system trajectory evolves within the STTs, thereby satisfying the STL specifications. The proposed approach is validated through case studies and comparisons with state-of-the-art methods, demonstrating superior computational efficiency, trajectory quality, and applicability to complex STL tasks.
|
| |
| 15:30-15:45, Paper FrB15.7 | |
| Stabilization on the 2-Sphere under Geodesically Convex Constraints |
|
| Sawant, Mayur | Lakehead University |
| Tayebi, Abdelhamid | Lakehead University |
Keywords: Constrained control, Stability of nonlinear systems, Autonomous systems
Abstract: We address the problem of constrained stabilization to a desired point on the 2-sphere under geodesically strongly convex constraints. The proposed feedback control law is composed of an attractive vector field that guides the state along a geodesic toward the desired point and a repulsive vector field that drives the state away from the unsafe regions. We show that the target point is almost globally asymptotically stable under the proposed continuous time-invariant control law. Simulation results are provided to illustrate the effectiveness of the theoretical developments.
|
| |
| 15:45-16:00, Paper FrB15.8 | |
| Adapting First-Order Motion Planners to Second-Order Dynamical Systems |
|
| Sawant, Mayur | Lakehead University |
| Tayebi, Abdelhamid | Lakehead University |
Keywords: Autonomous robots, Constrained control, Stability of nonlinear systems
Abstract: This paper extends first-order motion planners to robots governed by second-order dynamics. Two control schemes are proposed based on the knowledge of a scalar function whose negative gradient aligns with a given first-order motion planner. When such a function is known, the first-order motion planner is combined with a damping velocity vector with a dynamic gain to extend the safety and convergence guarantees of the first-order motion planner to second-order systems. If no such function is available, we propose an alternative control scheme ensuring that the error between the robot's velocity and the first-order motion planner converges to zero. The theoretical developments are supported by simulation results demonstrating the effectiveness of the proposed approaches.
|
| |
| FrB16 |
Capri III |
| Observers for Nonlinear Systems I |
Regular Session |
| Chair: Goppert, James | Purdue University |
| Co-Chair: Hamel, Tarek | I3S-CNRS-UCA |
| |
| 14:00-14:15, Paper FrB16.1 | |
| Attitude Estimation Using Scalar Measurements |
|
| Alnahhal, Hassan | Department Computer Science and Engineering, University of Quebe |
| Benahmed, Sifeddine | Capgemini Engineering |
| Berkane, Soulaimane | University of Quebec in Outaouais |
| Hamel, Tarek | I3S-CNRS-UCA |
Keywords: Estimation, Kalman filtering, Robotics
Abstract: This paper revisits the problem of orientation estimation for rigid bodies through a novel framework based on scalar measurements. Unlike traditional vector-based methods, the proposed approach enables selective utilization of only the reliable axes of vector measurements while seamlessly incorporating alternative scalar modalities such as Pitot tubes, barometers with range sensors, and landmark-based constraints. The estimation problem is reformulated within a linear time-varying (LTV) framework, allowing the application of a deterministic linear Kalman filter. This design guarantees Global Uniform Exponential Stability (GES) under the Uniform Observability (UO) condition. Simulation results demonstrate the effectiveness of the proposed approach in achieving robust and accurate attitude estimation, even with partial vector measurements that simulate sensor axis failure.
|
| |
| 14:15-14:30, Paper FrB16.2 | |
| Observer Design for Optical Flow-Based Visual-Inertial Odometry with Almost-Global Convergence |
|
| Bouazza, Tarek | Laboratoire I3S UCA-CNRS |
| Berkane, Soulaimane | University of Quebec in Outaouais |
| Hua, Minh-Duc | I3s Uca-Cnrs Umr7271 |
| Hamel, Tarek | I3S-CNRS-UCA |
Keywords: Observers for nonlinear systems, Sensor fusion, Kalman filtering
Abstract: This paper presents a novel cascaded observer architecture that combines optical flow and IMU measurements to perform continuous monocular visual-inertial odometry (VIO). The proposed solution estimates body-frame velocity and gravity direction simultaneously by fusing velocity direction information from optical flow data with gyro and accelerometer data. This fusion is achieved using a globally exponentially stable Riccati observer, which operates under persistently exciting translational motion conditions. The estimated gravity direction in the body frame is then employed, along with an optional magnetometer measurement, to design a complementary observer on SO(3) for attitude estimation. The resulting interconnected observer architecture is shown to be almost globally asymptotically stable. To extract the velocity direction from sparse optical flow data, a gradient descent algorithm is developed to solve a constrained minimization problem on the unit sphere. The effectiveness of the proposed algorithms is validated through simulation results.
|
| |
| 14:30-14:45, Paper FrB16.3 | |
| A Geometric Approach for Pose and Velocity Estimation Using IMU and Inertial/Body-Frame Measurements |
|
| Benahmed, Sifeddine | Capgemini Engineering |
| Berkane, Soulaimane | University of Quebec in Outaouais |
| Hamel, Tarek | I3S-CNRS-UCA |
Keywords: Estimation, Observers for nonlinear systems, Robotics
Abstract: This paper addresses accurate pose estimation (orientation and position) and linear velocity for a rigid body using a combination of generic inertial-frame and/or body-frame measurements along with an Inertial Measurement Unit (IMU). By embedding the original state space mathrm{SO}(3)timesR^3times R^3 into the higher-dimensional Lie group mathrm{SE}_5(3), we reformulate the state dynamics and measurement models within a structured geometric framework. This embedding enables a natural decoupling of the geometric error dynamics, where the translational error dynamics exhibit a structure analogous to that of a continuous-time Kalman filter, allowing for a time-varying gain design based on the continuous Riccati equation. Under the assumption of uniform observability, we establish that the proposed observer on mathrm{SE}_5(3) guarantees almost-global asymptotic stability. The effectiveness of the approach is demonstrated in simulations for a practical scenario of GPS-aided inertial navigation systems (INS). Overall, the proposed method significantly simplifies the design of nonlinear geometric observers for INS, offering a unified and robust approach to state estimation.
|
| |
| 14:45-15:00, Paper FrB16.4 | |
| Hyperbolic Functions and the Modulating Function Method |
|
| Gonçalves Accioli, Davi | University of Southern Denmark, SDU |
| Jouffroy, Jerome | University of Southern Denmark |
Keywords: Estimation, Algebraic/geometric methods, Observers for nonlinear systems
Abstract: The modulating function method is an integral transform method that has been used for state and parameter estimation of lumped systems, as well as estimation of distributed and fractional systems. The kernel of the modulation operator for input-output systems is called the modulating function, the selection of which dictates the resulting filter characteristics. Since this selection is crucial in the filter design process, different functions have been proposed over the years, until a different approach was introduced: obtaining the modulating function as a solution to an auxiliary problem. This approach is, in fact, common within mathematics, and well-known functions, such as hyperbolic sine and cosine, are obtained as the solution to a particular auxiliary system. Intriguingly, hyperbolic functions generate 4 new types of modulating functions in a simple manner. Moreover, 12 types of mixed hyperbolic modulating functions are obtained by using different hyperbolic functions to satisfy the boundary conditions. Lastly, an application example shows that hyperbolic modulating functions can display significantly lower bias than polynomial modulating functions.
|
| |
| 15:00-15:15, Paper FrB16.5 | |
| Saltation-Based Analysis of Estimation Error in Observers for Hybrid Systems with Unknown Jump Times |
|
| Alleaume, Valentin | Mines Paris - PSL |
| Bernard, Pauline | Mines Paris - PSL |
| Di Meglio, Florent | Mines Paris PSL |
Keywords: Hybrid systems, Observers for nonlinear systems, Estimation
Abstract: This paper highlights some key conditions for local state estimation of hybrid dynamical systems with unknown jump times, for which the observer jumps cannot be triggered at the same time as the system's. We provide sufficient conditions -- including appropriate transversality conditions of flows with respect to guard maps -- to ensure observer solutions with sufficiently small initial error exist at least during one flow-jump cycle of the system, and we study the propagation of infinitesimal estimation errors throughout this cycle. We show that their evolution through non-simultaneous jumps depends on two distinct saltation matrices, depending whether the observer jumps before or after the system: one is similar to the classical saltation matrix derived in the literature, while the other differs due to the mismatch of the derivative of the output around jump times. We illustrate on a linear toy example the importance of ensuring transversality and contraction with both saltation matrices to ensure observer convergence.
|
| |
| 15:15-15:30, Paper FrB16.6 | |
| Improving the Accuracy of Adaptive Observers Using Additional Filters |
|
| Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
| Efimov, Denis | Inria |
| Ushirobira, Rosane | Inria |
Keywords: Adaptive systems, Estimation, Observers for nonlinear systems
Abstract: In this paper, we present the design of an adaptive observer with additional filtering dynamics for simultaneously estimating states and constant parameters in a class of affine–state nonlinear systems affected by external perturbations. The proposed estimator integrates Luenberger–like observers for state estimation with a gradient–based algorithm for identifying unknown constant parameters, enhanced by additional linear filters. In the ideal case, the adaptive observer ensures exponential convergence to the true state and parameter values, while also exhibiting input–to–state stability in the presence of bounded external perturbations. Stability analysis of the closed–loop system is conducted using a Lyapunov function approach, relying on standard persistence of excitation conditions. The effectiveness of the improved estimation algorithm is validated through simulations, demonstrating a better accuracy compared to the conventional adaptive observer.
|
| |
| 15:30-15:45, Paper FrB16.7 | |
| Leveraging Flapping Kinematics for Attitude Estimation in SO(3) |
|
| Sarkar, Dhirodaatto | Purdue University |
| Hyun, Nak-seung Patrick | Purdue University |
Keywords: Observers for nonlinear systems, Adaptive systems, Robotics
Abstract: In nature, avian animals navigate space by flapping their wings, their flapping frequency ranging from 10 Hz for larger birds to 200 Hz for small insects. Inspired by these fast oscillation-driven biological mechanisms, many scientists have designed and created numerous synthetic flapping-wing vehicles (FWVs). However, sensing autonomy using an on-board IMU is inherently challenging due to continuous large measurement perturbations on both velocity and acceleration. Learning the cycle-averaged orientation of the body plays a key role in designing a controller for the FWV. In this paper, we propose a decomposition of orientation of a hovering FWV into the averaged frame and the fast oscillating frame. By leveraging the pitching kinematics of the bird during hover, we propose a cascaded adaptive complementary filter in the special orthogonal group SO(3), where the perturbations exist entirely in a one-parameter group on SO(3), and prove the asymptotic stability of the proposed observer. The results are verified in simulation with high flapping frequency of 30 Hz, for which a conventional complementary filter with an appended geometric averaging filter fails to converge to the true average.
|
| |
| 15:45-16:00, Paper FrB16.8 | |
| A Closed-Form Matrix Exponential for SE_n(3) with Applications to Strapdown Inertial Navigation |
|
| Lin, Li-Yu | Purdue University |
| Pant, Kartik Anand | Purdue University |
| Perseghetti, Benjamin | Rudis Laboratories |
| Goppert, James | Purdue University |
Keywords: Numerical algorithms, Aerospace, Algebraic/geometric methods
Abstract: A large class of nonlinear systems in engineering and robotics evolves on geometric manifolds, such as fixed-wing aircraft, quadrotors, etc., whose kinematic motion can be described using Lie groups, in particular SE_n(3). The existing techniques involve numerical integration methods such as Runge-Kutta integration to propagate their motion forward in time. However, these methods are approximate solutions and do not respect the geometric constraints of the nonlinear differential equation, leading to higher computational costs and numerical integration errors. To this end, we propose a geometric closed-form approach to solve the initial value problem for a class of nonlinear systems that evolves on the SE_n(3) Lie group. Through numerical simulations, we show that our closed-form solution is more efficient than the Runge-Kutta 4th-order (RK-4) integrator and reduces floating-point operations by 45% in a strapdown-inertial-navigation (SINS) case study. We believe that the efficiency and accuracy gains of our approach warrant the general adoption of this method for numerical integration in applications such as SINS.
|
| |
| FrB17 |
Capri IV |
| Stability of Nonlinear Systems IV |
Regular Session |
| Chair: Katriniok, Alexander | Eindhoven University of Technology |
| Co-Chair: Montanari, Arthur | Northwestern University |
| |
| 14:00-14:15, Paper FrB17.1 | |
| Counterexample-Guided Synthesis of Robust Discrete-Time Control Barrier Functions |
|
| Shakhesi, Erfan | Eindhoven University of Technology |
| Katriniok, Alexander | Eindhoven University of Technology |
| Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Keywords: Lyapunov methods, Robust control, Neural networks
Abstract: Learning-based methods have gained popularity for training candidate Control Barrier Functions (CBFs) to satisfy the CBF conditions on a finite set of sampled states. However, since the CBF is unknown a priori, it is unclear which sampled states belong to its zero-superlevel set and must satisfy the CBF conditions, and which ones lie outside it. Existing approaches define a set in which all sampled states are required to satisfy the CBF conditions, thus introducing conservatism. In this letter, we address this issue for robust discrete-time CBFs (R-DTCBFs). Furthermore, we propose a class of R-DTCBFs that can be used in an online optimization problem to synthesize safe controllers for general discrete-time systems with input constraints and bounded disturbances. To train such an R-DTCBF that is valid not only on sampled states but also across the entire region, we employ a verification algorithm iteratively in a counterexample-guided approach. We apply the proposed method to numerical case studies.
|
| |
| 14:15-14:30, Paper FrB17.2 | |
| Stability Analysis of the Newton-Raphson Controller for a Class of Differentially Flat Systems |
|
| Niu, Kaicheng | Georgia Institute of Technology |
| Wardi, Yorai | Georgia Institute of Technology |
| Abdallah, Chaouki T. | Georgia Institute of Technology |
Keywords: Stability of nonlinear systems, Nonlinear systems, Lyapunov methods
Abstract: The Newton-Raphson controller, recently proposed by authors of this paper, is a tracking controller based on an output prediction and a Newton-Raphson algorithm. Wheres its early versions have focused on performance verifications by simulation and experimental means, convergence proofs have been confined to linear systems. This paper makes the first step towards convergence proofs for nonlinear systems. It assumes that the plant system is differentially flat, and it uses the flat system for computing the predictor. The theoretical results include local stability of output regulation, and stability of a discretized tracking controller for time-dependent target trajectories. Simulation results on the inverted pendulum (cartpole model) and the kinematic bicycle support the theoretical derivations.
|
| |
| 14:30-14:45, Paper FrB17.3 | |
| Open-Loop Control Design for Contraction in Affine Nonlinear Systems |
|
| Arkhis, Mohamed Yassine | Inria Centre at the University of Lille |
| Efimov, Denis | Inria |
Keywords: Lyapunov methods, Stability of nonlinear systems, Time-varying systems
Abstract: This paper presents two main contributions. First, it is shown that if a nonlinear time-varying system is contractive, then it is incrementally exponentially stable. Second, building on this result, a method is proposed, for designing feedforward inputs that induce contraction or incremental exponential stability in affine-in-control systems. The effectiveness of the approach is demonstrated through illustrative examples.
|
| |
| 14:45-15:00, Paper FrB17.4 | |
| Ensuring Stability in Bilinear Structured State-Space Models Via IQCs: A Free Parameterisation Approach |
|
| Gupta, Vaibhav | École Polytechnique Fédérale De Lausanne (EPFL) |
| Zakwan, Muhammad | ETH Zurich |
| Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
| Karimi, Alireza | EPFL |
Keywords: Neural networks, Stability of nonlinear systems, Nonlinear systems identification
Abstract: Recently, a novel class of Recurrent Neural Net- works (RNNs) known as Structured State-space Models (SSMs) has emerged, leveraging dynamical system properties. While most SSM architectures use linear time-invariant systems as the recurrent unit, bilinear systems offer a more expressive alter- native. Although existing studies impose structural restrictions on the bilinear systems, stability is not guaranteed, potentially leading to unstable or ill-posed training. This paper introduces a generic bilinear system as the recurrent unit for SSMs. A stability condition based on Integral Quadratic Constraints (IQCs) is derived to ensure the model’s stability during and after the training. To this purpose, a free parameterisation of this stability condition is provided, enabling the use of gradient-based optimisation algorithms. Moreover, a Parallel Scan algorithm is provided for forward propagation to enhance the training efficiency. The effectiveness of the proposed architecture is demonstrated by applying it to the non-linear system identification task for an F-16 ground vibration benchmark while incorporating the prior regarding the system stability into the learning process.
|
| |
| 15:00-15:15, Paper FrB17.5 | |
| Set Invariance in Model-Following High-Gain Control - Decoupling the Region of Attraction and Precision |
|
| Tietze, Niclas | Technische Universität Ilmenau |
| Wulff, Kai | TU Ilmenau |
| Reger, Johann | TU Ilmenau |
Keywords: Nonlinear systems, Stability of nonlinear systems, Robust control
Abstract: In classical single-loop high-gain set-point control, the (estimated) region of attraction depends on the gain of the controller which also determines the tracking precision. However, as we increase the gain to enforce good precision, the estimated region of attraction decreases. We consider nonlinear systems in Byrnes-Isidori subject to unbounded perturbations and apply the well-established two-degree-of-freedom architecture of model-following control (MFC) to overcome this inherent coupling. Our analysis of the local stability of the closed loop shows that the estimate of the region of attraction depends on both the controller gain and the initialisation of the model, which is an additional degree of freedom in the MFC. It turns out that the initialisation can be utilised either to increase the region of attraction compared to a single-loop high-gain design, or to render it independent of the desired precision. Thereby decoupling the region of attraction from the desired precision.
|
| |
| 15:15-15:30, Paper FrB17.6 | |
| Strictifying Storage Functions for Autonomous Discrete-Time Systems Using Observers |
|
| Benchebba, Mohamed Aymane | Université De Lorraine |
| Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
| Andrieu, Vincent | Université De Lyon |
| Astolfi, Daniele | Cnrs - Lagepp |
Keywords: Lyapunov methods, Stability of nonlinear systems, Observers for nonlinear systems
Abstract: This paper addresses the strictification of storage functions for autonomous discrete-time nonlinear systems. Given a storage function that does not increase along solutions, we present conditions under which we can construct an alternative storage function that strictly decreases along solutions. The results include, as a special case, the situation where one starts with a weak Lyapunov function and seeks to modify it into a strict Lyapunov function, thereby certifying properties stronger than mere stability. The approach relies on combining the original storage function with an auxiliary function derived from the assumed knowledge of an observer. We provide sufficient conditions for the existence of this auxiliary function and construct it for a class of smooth systems, as well as for general backward distinguishable nonlinear systems.
|
| |
| 15:30-15:45, Paper FrB17.7 | |
| Distributed Lyapunov Functions for Nonlinear Networks |
|
| Wang, Yiming | Northwestern University |
| Montanari, Arthur | Northwestern University |
| Motter, Adilson E. | Northwestern University |
Keywords: Lyapunov methods, Stability of nonlinear systems, Network analysis and control
Abstract: Nonlinear networks are often multistable, exhibiting coexisting stable states with competing regions of attraction (ROAs). As a result, ROAs can have complex ``tentacle-like'' morphologies that are challenging to characterize analytically or computationally. In addition, the high dimensionality of the state space prohibits the automated construction of Lyapunov functions using state-of-the-art optimization methods, such as sum-of-squares (SOS) programming. In this letter, we propose a distributed approach for the construction of Lyapunov functions based solely on local information. To this end, we establish an augmented comparison lemma that characterizes the existence conditions of partial Lyapunov functions, while also accounting for residual effects caused by the associated dimensionality reduction. These theoretical results allow us to formulate an SOS optimization that iteratively constructs such partial functions, whose aggregation forms a composite Lyapunov function. The resulting composite function provides accurate convex approximations of both the volumes and shapes of the ROAs. We validate our method on networks of van der Pol and Ising oscillators, demonstrating its effectiveness in characterizing high-dimensional systems with non-convex ROAs.
|
| |
| 15:45-16:00, Paper FrB17.8 | |
| On Classical Solutions in the Stabilization Problem for Nonholonomic Control Systems with Time-Varying Feedback Laws |
|
| Zuyev, Alexander | Max Planck Institute for Dynamics of Complex Systems |
| Grushkovskaya, Victoria | University of Klagenfurt |
Keywords: Lyapunov methods, Algebraic/geometric methods, Nonholonomic systems
Abstract: We consider the stabilization problem for driftless control-affine systems under the bracket-generating condition. In our previous works, a class of time-varying feedback laws has been constructed to stabilize the equilibrium of a nonholonomic system under rather general controllability assumptions. This stabilization scheme is based on the sampling concept, which is not equivalent to the classical definition of solutions for the corresponding nonautonomous closed-loop system. In the present paper, we refine the previous results by presenting sufficient conditions for the convergence of classical solutions of the closed-loop system to the equilibrium. Our theoretical findings are applied to a multidimensional driftless control-affine system and illustrated through numerical simulations.
|
| |
| FrB18 |
Aruba I+II+III |
| Stability of Linear Systems I |
Regular Session |
| Chair: Dashkovskiy, Sergey | University of Würzburg |
| Co-Chair: Anderson, James | Columbia University |
| |
| 14:00-14:15, Paper FrB18.1 | |
| The Relationship between ISS of Difference and Differential Linear Equations |
|
| Dashkovskiy, Sergey | University of Würzburg |
| Hütter, Gianluca | Institute of Mathematics, University of Würzburg |
Keywords: Hybrid systems, Stability of hybrid systems, Stability of linear systems
Abstract: We establish a spectral characterization for Input-to-State stability of linear dynamical systems on time scales, that generalizes the Schur and Hurwitz criterion of discrete and continuous systems respectively. We then use this characterization to study the connection between the ISS property of linear systems on families of time scales and ISS of the corresponding continuous time system.
|
| |
| 14:15-14:30, Paper FrB18.2 | |
| Numerical Stability Analysis of a Distributed Thermal Model for Pouch Cell Assemblies Via Gershgorin Circle Theorem |
|
| Trivella, Andrea | Politecnico Di Milano |
| Radrizzani, Stefano | Politecnico Di Milano |
| Corno, Matteo | Politecnico Di Milano |
Keywords: Stability of linear systems, Energy systems
Abstract: Thermal simulation plays a key role in battery pack design, supporting cooling system sizing, temperature control and thermal failure prevention. However, accurate thermal models involving the numerical solution of a large number of partial differential equations (PDEs) can exhibit numerical instabilities that are difficult to diagnose and resolve. This letter applies the Gershgorin Circle Theory to evaluate the numerical stability of a distributed-parameter thermal model (DTM), which simulates the temperature distribution evolution of a multi-body assembly composed of a Li-ion pouch cell, its tabs, and a compression aluminum plate. A discrete state-space representation of the system is developed before calculating the Gershgorin circles. This method identifies the stability boundaries of the DTM as a function of the physical parameters of the system, e.g., thermal conductivities and thermal resistances, and simulation parameters, e.g., mesh size and sampling step. The proposed approach provides an efficient tool for optimizing simulation settings, ensuring convergence and reliability in thermal simulations of complex multi-element Li-ion pouch cell assembly.
|
| |
| 14:30-14:45, Paper FrB18.3 | |
| Integral Gains for Non-Autonomous Wazewski Systems |
|
| Atamas, Ivan | University of Würzburg |
| Dashkovskiy, Sergey | University of Würzburg |
| Slyn'ko, Vitalii | S.P. Timoshenko Institute of Mechanics |
Keywords: Stability of linear systems, Time-varying systems, Lyapunov methods
Abstract: In this work we consider linear non-autonomous systems of Wazewski type on Hilbert spaces and provide a new approach to study their stability properties by means of a decomposition into subsystems and conditions implied on the interconnection properties. These conditions are of the small-gain type but the appoach is based on a conceptually new notion which we call integral gain. This notion is introduced for the first time in this paper. We compare our approach with known results from the literature and demonstrate advantages of our results.
|
| |
| 14:45-15:00, Paper FrB18.4 | |
| Exponential Stability of Linear Non-Autonomous Coupled Systems |
|
| Dashkovskiy, Sergey | University of Würzburg |
| Kulish, Dmytro | JMU |
| Slyn'ko, Vitalii | S.P. Timoshenko Institute of Mechanics |
Keywords: Stability of linear systems, Time-varying systems
Abstract: We propose a method for studying the exponential stability of a linear non-autonomous interconnected system of differential equations based on a new concept of “integral gain.” This method utilizes non-coercive Lyapunov functions and Banach’s fixed-point theorem for operators in pseudometric spaces. The examples demonstrate the advantage of the small integral gain conditions obtained in the theorem over previously known small-gain conditions.
|
| |
| 15:00-15:15, Paper FrB18.5 | |
| Stability Conditions for Linear Discrete-Time Positive Switched Systems with Nonuniform Dwell-Time Ranges |
|
| De Iuliis, Vittorio | University of L'Aquila |
| Manes, Costanzo | Universita' Dell'Aquila |
| Pepe, Pierdomenico | University of L' Aquila |
Keywords: Switched systems, Compartmental and Positive systems, Stability of linear systems
Abstract: This paper deals with linear discrete-time positive switched systems and establishes novel sufficient conditions for global exponential stability under nonuniform dwell-time ranges on switches digraphs. The conditions we introduce in the paper apply to any system, regardless of the stability properties of its discrete modes. We first propose a set of easy to check conditions in the form of linear programming problems which require a test for each admissible dwell-time. Then, we show how such tests can be relaxed for stable and antistable subsystems, on which the conditions need to be checked only on the minimum and maximum dwell-times, respectively.
|
| |
| 15:15-15:30, Paper FrB18.6 | |
| Learning Stabilizing Policies Via an Unstable Subspace Representation |
|
| Toso, Leonardo Felipe | Columbia University |
| Ye, Lintao | Huazhong University of Science and Technology |
| Anderson, James | Columbia University |
Keywords: Stability of linear systems, Learning, Optimization
Abstract: We study the problem of learning to stabilize (LTS) a linear system. Policy gradient methods for control assume access to an initial stabilizing policy. Designing such a policy for an unknown system is one of the most fundamental problems in control, and it may be as hard as learning the optimal policy. Existing work on the LTS problem requires a volume of data that scales quadratically with the ambient dimension. We propose a two-phase approach that first learns the left unstable subspace of the system and then solves a series of discounted linear quadratic regulator (LQR) problems on the learned unstable subspace. By targeting just the unstable dynamics, we reduce the effective dimension of the problem. We provide non-asymptotic guarantees for both phases and demonstrate that operating on the unstable subspace reduces sample complexity. In particular, when the number of unstable modes is much smaller than the state dimension, LTS on the unstable subspace substantially speeds up stabilization.
|
| |
| 15:30-15:45, Paper FrB18.7 | |
| Stability-Preserving Parametric Model Reduction by Moment Matching |
|
| Zhang, Hanqing | Imperial College London |
| Shakib, Fahim | Imperial College London |
| Scarciotti, Giordano | Imperial College London |
Keywords: Model/Controller reduction, Stability of linear systems, Reduced order modeling
Abstract: This article proposes a novel parametric model reduction method that preserves exponential stability while achieving parametric moment matching at user-specified interpolation points across the entire user-selected parameter space. Building on the parametric interconnection-based moment matching framework, our approach leverages the flexibility of free parameters in parametric reduced-order models to ensure the preservation of exponential stability, independent of the state dimension of the parametric reduced-order model or the location of the interpolation points. A numerical example is presented to demonstrate the effectiveness and applicability of the proposed approach.
|
| |
| 15:45-16:00, Paper FrB18.8 | |
| Stabilizing Non-Discerning Control Design for Discrete-Time Linear Switched Systems |
|
| Khalin, Anatolii | CentraleSupelec Rennes |
| Bourdais, Romain | CentraleSupelec - IETR |
Keywords: Switched systems, Stability of hybrid systems, Control applications
Abstract: The distinguishability between the modes of a switched system is an essential concept, particularly for the synthesis of observers. In this paper, we propose to use its opposite, specifically to protect against cyber-physical attacks that aim to obtain information about the system. More precisely, we focus on synthesizing non-discerning control laws that stabilize the switched system.We establish the conditions for the existence of such a law and propose a procedure for its design. The academic examples demonstrate the described application.
|
| |
| FrB19 |
Ibiza IV |
| Optimal Control VIII |
Regular Session |
| Chair: Andrianesis, Panagiotis | Boston University |
| Co-Chair: Babazadeh, Maryam | University of Kaiserslautern-Landau |
| |
| 14:00-14:15, Paper FrB19.1 | |
| Retraction Maps in Optimal Control of Nonholonomic Systems |
|
| Anahory Simões, Alexandre | IE University |
| Barbero-Linan, Maria | Technical University of Madrid |
| Bloch, Anthony M. | Univ. of Michigan |
| Colombo, Leonardo Jesus | Spanish National Research Council |
| Martin de Diego, David | High Council for Scientific Research-CSIC |
Keywords: Nonholonomic systems, Algebraic/geometric methods, Optimal control
Abstract: In this paper, we compare the performance of different numerical schemes in approximating Pontryagin's Maximum Principle's necessary conditions for the optimal control of {fully actuated} nonholonomic systems. Retraction maps are used as a seed to construct geometric integrators for the corresponding Hamilton's equations. First, we obtain an intrinsic formulation of a discretization map on a distribution. Then, we illustrate this construction on a particular example for which the performance of different symplectic integrators is examined and compared with non-symplectic integrators.
|
| |
| 14:15-14:30, Paper FrB19.2 | |
| An Optimistic Planning Algorithm for Switched Discrete-Time LQR |
|
| Granzotto, Mathieu | University of Melbourne |
| Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
| Nesic, Dragan | University of Melbourne |
| Daafouz, Jamal | Université De Lorraine, CRAN, CNRS |
| Busoniu, Lucian | Technical University of Cluj-Napoca |
Keywords: Switched systems, Optimal control, Lyapunov methods
Abstract: We introduce TROOP, a tree-based Riccati optimistic online planner, that is designed to generate near-optimal control laws for discrete-time switched linear systems with switched quadratic costs. The key challenge that we address is balancing computational resources against control performance, which is important as constructing near-optimal inputs often requires substantial amount of computations. TROOP addresses this trade-off by adopting an online best-first search strategy inspired by A *, allowing for efficient estimates of the optimal value function. The control laws obtained guarantee both near-optimality and stability properties for the closed-loop system. These properties depend on the planning depth, which determines how far into the future the algorithm explores and is closely related to the amount of computations. TROOP thus strikes a balance between computational efficiency and control performance, which is illustrated by numerical simulations on an example.
|
| |
| 14:30-14:45, Paper FrB19.3 | |
| Approximation of Diffeomorphisms for Quantum State Transfers |
|
| Pozzoli, Eugenio | ENS Rennes, CNRS |
| Scagliotti, Alessandro | Technical University of Munich; Munich Center for Machine Learni |
Keywords: Quantum information and control, Optimal control, Algebraic/geometric methods
Abstract: In this paper, we seek to combine two emerging standpoints in control theory. On the one hand, recent advances in infinite-dimensional geometric control have unlocked a method for controlling (with arbitrary precision and in arbitrarily small times) state transfers for bilinear Schrödinger PDEs posed on a Riemannian manifold M. In particular, these arguments rely on controllability results in the group of the diffeomorphisms of M. On the other hand, using tools of Gamma-convergence, it has been proved that we can phrase the retrieve of a diffeomorphism of M as an ensemble optimal control problem. More precisely, this is done by employing a control-affine system for simultaneously steering a finite swarm of points towards the respective targets. Here we blend these two theoretical approaches and numerically find control laws driving state transitions (such as eigenstate transfers) in a bilinear Schrödinger PDE posed on the torus. Such systems have experimental relevance and are currently used to model rotational dynamics of molecules, and cold atoms trapped in periodic optical lattices.
|
| |
| 14:45-15:00, Paper FrB19.4 | |
| Mode-Prefix-Based Control of Switched Linear Systems with Applications to Fault Tolerance |
|
| Padmanabhan, Ram | University of Illinois Urbana-Champaign |
| Aspeel, Antoine | University of Michigan |
| Ozay, Necmiye | Univ. of Michigan |
| Ornik, Melkior | University of Illinois Urbana-Champaign |
Keywords: Fault tolerant systems, Switched systems, Optimal control
Abstract: In this paper, we consider the problem of designing prefix-based optimal controllers for switched linear systems over finite horizons. This problem arises in fault-tolerant control, when system faults result in abrupt changes in dynamics. We consider a class of mode-prefix-based linear controllers that depend only on the history of the switching signal. The proposed optimal control problems seek to minimize both expected performance and worst-case performance over switching signals. We show that this problem can be reduced to a convex optimization problem. To this end, we synthesize one controller for each switching signal under a prefix constraint that ensures consistency between controllers. Then, system level synthesis is used to obtain a convex program in terms of the system-level parameters. In particular, it is shown that the prefix constraints are linear in terms of the system-level parameters. Finally, we apply this framework for optimal control of a fighter jet model suffering from system faults, illustrating how fault tolerance is ensured.
|
| |
| 15:00-15:15, Paper FrB19.5 | |
| Numerical Trajectory Optimization of Airborne Wind Energy Systems with Stroboscopic Averaging Methods |
|
| Harzer, Jakob | Albert Ludwig University of Freiburg |
| De Schutter, Jochem | ALU Freiburg |
| Diehl, Moritz | University of Freiburg |
Keywords: Energy systems, Optimal control, Numerical algorithms
Abstract: Airborne wind energy (AWE) systems harvest energy from high-altitude winds using tethered wings. The flight trajectory of pumping AWE systems consists of a reel-out phase where the wing flies fast crosswind loops while slowly reeling out the tether, and a reel-in phase where the wing directly flies back to its original starting point. The number of crosswind loops typically considered in state-of-the-art trajectory optimization approaches is artificially low in order to limit computational complexity. We tackle this problem using the stroboscopic averaging method (SAM), that provides a numerically cheap simulation of the reel-out phase, based on only a small subset of full crosswind loop simulations. We introduce regularization terms that safeguard the accuracy of the SAM approximation and present how the full reel-out trajectory can be reconstructed from the SAM-based solution. The accuracy of the reconstruction is then quantified with a dynamically accurate closed-loop simulation based on model predictive control. In numerical experiments, we achieve the same level of accuracy and efficiency in optimizing 30-loop trajectories as state-of-the-art approaches achieve with five-loop trajectories.
|
| |
| 15:15-15:30, Paper FrB19.6 | |
| Lyapunov-Based Online Optimal Control of a Class of Cascaded Systems with an Infinite Horizon Time-Average Performance Guarantee |
|
| Santosuosso, Luca | TU Graz |
| Andrianesis, Panagiotis | Boston University |
Keywords: Optimal control, Lyapunov methods, Power systems
Abstract: In this paper, we investigate the optimal control of a class of cascaded systems, where mass and/or energy flow through subsystems connected in series. In a stochastic setting, model predictive control (MPC) is a widely adopted approach for tackling this problem. However, it can be computationally intensive and, perhaps more importantly, relies on the characterization of uncertainty. To overcome these potential limitations, we propose a Lyapunov-based online optimal control method, which operates in real time without requiring prior characterization of the uncertainty. By relaxing time-coupling constraints, the proposed online method becomes suboptimal compared to offline optimization with perfect foresight, i.e., the ideal solution to the optimal control problem. Our main result establishes an infinite horizon time-average performance guarantee, demonstrating bounded suboptimality of the proposed method. We apply the Lyapunov-based online optimal control method to the economic dispatch of cascaded hydropower plants, and show that it can outperform both greedy online control and popular MPC.
|
| |
| 15:30-15:45, Paper FrB19.7 | |
| Data-Driven Positivity-Preserving Optimal State-Feedback Control Via Successive Convex Programming |
|
| Babazadeh, Maryam | University of Kaiserslautern-Landau |
| Mishra, Vikas Kumar | Technische Universitat Kaiserlautern |
| Bajcinca, Naim | University of Kaiserslautern |
Keywords: Optimization algorithms, Data driven control, Optimal control
Abstract: This letter introduces a successive convex programming approach to solve the data-driven optimal linear quadratic regulator (LQR) problem with positivity-preserving constraints. While positivity-preserving stabilization of linear time-invariant (LTI) systems is convex due to the existence of a linear copositive or diagonal Lyapunov matrix, this property does not apply to positivity preserving LQR performance. Adding positivity constraints disrupts the convexity of both model-based and data-driven optimal state-feedback designs. This study is the first to address the data-driven LQR problem with positivity constraints. Assuming that the system dynamical model is unknown but the measurement data are available, this work derives an equivalent formulation for the non-convex data-based problem and presents a successive convex optimization method using epsilon-perturbed semi-definite programs (SDPs) to monotonically decrease the cost and ensure convergence to a stationary point of the original problem. The method also supports structured controller design. Finally, its optimality, convergence, sensitivity to initialization, and comparison with the existing methods are analyzed through numerical case studies, demonstrating its effectiveness in data-driven control synthesis.
|
| |
| 15:45-16:00, Paper FrB19.8 | |
| Approximate Hamilton-Jacobi Reachability Analysis for a Class of Two-Timescale Systems, with Application to Biological Models |
|
| Hirsch, Dylan | UC San Diego (UCSD) |
| Herbert, Sylvia | UC San Diego (UCSD) |
Keywords: Model/Controller reduction, Game theory, Optimal control
Abstract: Hamilton-Jacobi reachability (HJR) is an exciting framework used for control of safety-critical systems with nonlinear and possibly uncertain dynamics. However, HJR suffers from the curse of dimensionality, with computation times growing exponentially in the dimension of the system state. Many autonomous and controlled systems involve dynamics that evolve on multiple timescales, and for these systems, singular perturbation methods can be used for model reduction. However, such methods are more challenging to apply in HJR due to the presence of an underlying differential game. In this work, we leverage prior work on singularly perturbed differential games to identify a class of systems which can be readily reduced, and we relate these results to the quantities of interest in HJR. We demonstrate the utility of our results on two examples involving biological systems, where dynamics fitting the identified class are frequently encountered.
|
| |
| FrB20 |
Asia I+II+III+IV |
| Graphons in Systems and Control |
Tutorial Session |
| Chair: Chen, Xudong | Washington University in St. Louis |
| Co-Chair: Belabbas, Mohamed Ali | University of Illinois at Urbana-Champaign |
| Organizer: Belabbas, Mohamed Ali | University of Illinois at Urbana-Champaign |
| Organizer: Chen, Xudong | Washington University in St. Louis |
| |
| 14:00-16:00, Paper FrB20.1 | |
| Structural System Theory Over Random Networks (I) |
|
| Belabbas, Mohamed Ali | University of Illinois at Urbana-Champaign |
| Chen, Xudong | Washington University in St. Louis |
Keywords: Networked control systems, Network analysis and control, Decentralized control
Abstract: In this tutorial, we will introduce notions of structural systems theory, namely structural controllability and structural stability, and show how to study them over random network models, namely Erdos-Renyi random graphs and graphons. We give some proofs when they illustrate some universal techniques.
|
| |
| FrC01 |
Galapagos I |
| Quantum Information and Control |
Regular Session |
| Chair: Aguiar, A. Pedro | Faculty of Engineering, University of Porto |
| Co-Chair: Nurdin, Hendra I | UNSW Australia |
| |
| 16:30-16:45, Paper FrC01.1 | |
| Quantum Projection Filtering for State Estimation and Feedback Stabilization in Open Quantum Systems |
|
| Ramadan, Ibrahim | Laas-Cnrs |
| Amini, Nina H. | CNRS, L2S, CentraleSupelec |
| Mason, Paolo | CNRS, Laboratoire Des Signaux Et Systèmes |
Keywords: Filtering, Quantum information and control, Stochastic systems
Abstract: In this paper, we examine an open quantum system that is subject to indirect and imperfect measurements. Specifically, for quantum non-demolition (QND) measurements, we show that the evolution of the system is confined to a properly selected manifold. This allows us to express the exact solution of the quantum filter equation in terms of a lower-dimensional stochastic differential equation, significantly reducing the computational complexity. Motivated by a quantum state reduction result for the predicted dynamics under QND measurement, we propose, in the case when the Hamiltonian and the coupling operator do not commute, a projection filter approach for the quantum state estimation. The efficiency of the proposed quantum projection filter is verified by numerical simulations. The latter suggest that, even when paired with a stabilizing feedback control mechanism that depends on the projected state, the filter maintains its robustness. More generally, our numerical results suggest the effectiveness of the adopted quantum projection filter approach as a method for state estimation and feedback stabilization of open quantum systems.
|
| |
| 16:45-17:00, Paper FrC01.2 | |
| Regulation of a Continuously Monitored Quantum Harmonic Oscillator with Inefficient Detectors |
|
| Sabbagh, Ralph | University of California, Irvine |
| Movilla Miangolarra, Olga | University of Calfornia, Irvine |
| Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Quantum information and control, Stochastic optimal control
Abstract: We study the control problem of regulating the purity of a quantum harmonic oscillator in a Gaussian state via weak measurements. Specifically, we assume time-invariant Hamiltonian dynamics and that control is exerted via the back-action induced from monitoring the oscillator's position and momentum observables; the manipulation of the detector measurement strengths regulates the purity of the target Gaussian quantum state. After briefly drawing connections between Gaussian quantum dynamics and stochastic control, we focus on the effect of inefficient detectors and derive closed-form expressions for the transient and steady-state dynamics of the state covariance. We highlight the degradation of attainable purity that is due to inefficient detectors, as compared to that dictated by the Robertson-Schrödinger uncertainty relation. Our results suggest that quantum correlations can enhance the purity at steady-state. The quantum harmonic oscillator represents a basic system where analytic formulae may provide insights into the role of inefficient measurements in quantum control; the gained insights are pertinent to measurement-based quantum engines and cooling experiments.
|
| |
| 17:00-17:15, Paper FrC01.3 | |
| Physical Reduced Stochastic Equations for Continuously Monitored Non-Markovian Quantum Systems with a Markovian Embedding |
|
| Nurdin, Hendra I | UNSW Australia |
Keywords: Quantum information and control, Stochastic systems, Filtering
Abstract: An effective approach to modeling non-Markovian quantum systems is to embed a principal (quantum) system of interest into a larger quantum system. A widely employed embedding is one that uses another quantum system, referred to as the auxiliary system, which is coupled to the principal system, and both the principal and auxiliary can be coupled to quantum white noise processes. The principal and auxiliary together form a quantum Markov system and the quantum white noises act as a bath (environment) for this system. Recently it was shown that the conditional evolution of the principal system in this embedding under continuous monitoring by a travelling quantum probe can be expressed as a system of coupled stochastic differential equations (SDEs) that involve only operators of the principal system. The reduced conditional state of the principal only (conditioned on the measurement outcomes) is determined by the "diagonal" blocks of this coupled systems of SDEs. It is shown here that the "off-diagonal" blocks can be exactly eliminated up to their initial conditions, leaving a reduced closed system of SDEs for the diagonal blocks only. Under additional conditions the off-diagonal initial conditions can be made to vanish. This new closed system of equations, which includes an integration term involving a two-time stochastic kernel, represents the non-Markovian stochastic dynamics of the principal system under continuous-measurement. The system of equations determine the reduced conditional state of the principal only and may be viewed as a stochastic Nakajima-Zwanzig type of equation for continuously monitored non-Markovian quantum systems.
|
| |
| 17:15-17:30, Paper FrC01.4 | |
| Decoherence Time Maximization and Partial Isolation for Open Quantum Harmonic Oscillator Memory Networks |
|
| Vladimirov, Igor G. | Australian National University |
| Petersen, Ian R. | Australian National University |
| Shi, Guodong | The University of Sydney |
Keywords: Quantum information and control, Networked control systems, Stochastic optimal control
Abstract: This paper considers a network of open quantum harmonic oscillators which interact with their neighbours through direct energy and field-mediated couplings and also with external quantum fields. The position-momentum dynamic variables of the network are governed by linear quantum stochastic differential equations associated with the nodes of a graph whose edges specify the interconnection of the component oscillators. Such systems can be employed as quantum memories with an engineered ability to approximately retain initial conditions over a bounded time interval. We use the quantum memory decoherence time defined previously in terms of a fidelity threshold on a weighted mean-square deviation for a subset (or linear combinations) of network variables from their initial values. This approach is applied to maximizing a high-fidelity asymptotic approximation of the decoherence time over the direct energy coupling parameters of the network. The resulting optimality condition is a set of linear equations for blocks of a sparse matrix associated with the edges of the direct energy coupling graph of the network. We also discuss a setting where the quantum network has a subset of dynamic variables which are affected by the external fields only indirectly, through a complementary “shielding” system. The partially isolated subnetwork has a longer decoherence time in the high-fidelity limit, which is also amenable to optimization.
|
| |
| 17:30-17:45, Paper FrC01.5 | |
| An Approach to Control Design for Two-Level Quantum Ensemble Systems |
|
| Liang, Ruikang | Sorbonne University |
| Cheng, Gong | Tongji University |
Keywords: Quantum information and control, Large-scale systems
Abstract: Quantum ensemble systems arise in a variety of applications, including NMR spectroscopy and robust quantum control. While their theoretical properties have been extensively studied, relatively little attention has been given to the explicit construction of control inputs. In this paper, we address this gap by presenting a fully implementable control strategy for a one-parameter family of driftless two-level quantum systems. The proposed method is supported by rigorous analysis that guarantees accurate approximation of target distributions on SU(2). Convergence properties are established analytically, and numerical simulations are provided to demonstrate the effectiveness of the approach.
|
| |
| 17:45-18:00, Paper FrC01.6 | |
| Discrete Ricci Flow for Detecting Gaps in Adiabatic Quantum Processes |
|
| Rompokos, Athanasios | University of Southern California |
| Bogdan, Paul | University of Southern California |
| Jonckheere, Edmond | University of Southern California |
Keywords: Quantum information and control, Algebraic/geometric methods
Abstract: Adiabatic Quantum Processes have been known to solve combinatorial optimization problems intractable by classical computational methods by exploiting the adiabatic theorem. The solution of the optimization problem is encoded in the ground state of a final Hamiltonian, which in turn can be targeted by adiabatically transitioning from the known ground state of an easy-to-prepare Hamiltonian to the ground state of the problem Hamiltonian. Our focus is to characterize topologically the minimum gap between the ground state and the first excited state, which is the bottleneck of the process in terms of its speed. Since the gap occurs at topologically steep ascent/descent of the ground/first excited states, it is proposed to utilize the curvature uniformization process of the Ricci flow to identify such gaps as areas of heavy curvature transport. Our results indicate that such gaps can be detected automatically in various Adiabatic Quantum Processes.
|
| |
| 18:00-18:15, Paper FrC01.7 | |
| State-Constrained Optimal Control for Coherence Preservation in Multi-Level Open Quantum Systems |
|
| Binandeh Dehaghani, Nahid | Aalborg University |
| Aguiar, A. Pedro | Faculty of Engineering, University of Porto |
| Wisniewski, Rafal | Aalborg University |
Keywords: Quantum information and control
Abstract: This paper addresses the optimal control of quantum coherence in multi-level systems, modeled by the Lindblad master equation, which captures both unitary evolution and environmental dissipation. We develop an energy minimization framework to control the evolution of a qutrit (three-level) quantum system while preserving coherence between states. The control problem is formulated using Pontryagin’s Minimum Principle in the form of Gamkrelidze, incorporating state constraints to ensure coherence remains within desired bounds. Our approach accounts for Markovian decoherence, demonstrating that the Lindblad operator is non-unital, which reflects the irreversible decay processes inherent in the system. The results provide insights into effectively maintaining quantum coherence in the presence of dissipation.
|
| |
| 18:15-18:30, Paper FrC01.8 | |
| Sample Complexity Bounds for Scalar Parameter Estimation under Quantum Differential Privacy |
|
| Farokhi, Farhad | The University of Melbourne |
Keywords: Quantum information and control
Abstract: This paper presents tight upper and lower bounds for minimum number of samples (copies of a quantum state) required to attain a prescribed accuracy (measured by error variance) for scalar parameters estimation using unbiased estimators under quantum local differential privacy for qubits. In the small privacy budget epsilon regime, i.e., epsilonll 1, the sample complexity scales as Theta(epsilon^{-2}). This bound matches that of classical parameter estimation under local differential privacy. The lower bound however loosens in the large privacy budget regime, i.e., epsilongg 1. The upper bound for the minimum number of samples is then generalized to qudits (with dimension d) resulting in sample complexity of mathcal{O}(depsilon^{-2}).
|
| |
| FrC02 |
Oceania II |
| Data-Driven Verification and Control with Provable Guarantees III |
Invited Session |
| Chair: Lavaei, Abolfazl | Newcastle University |
| Co-Chair: Nejati, Amy | Newcastle University |
| Organizer: Lavaei, Abolfazl | Newcastle University |
| Organizer: Nejati, Amy | Newcastle University |
| Organizer: Jungers, Raphaël M. | University of Louvain |
| Organizer: Abate, Alessandro | University of Oxford |
| |
| 16:30-16:45, Paper FrC02.1 | |
| On the Regret of Model Predictive Control with Imperfect Inputs |
|
| Liu, Changrui | Delft University of Technology |
| Shi, Shengling | Delft University of Technology |
| De Schutter, Bart | Delft University of Technology |
Keywords: Optimal control, Predictive control for nonlinear systems
Abstract: Implementing model predictive control (MPC) in practice faces many subtle but prevalent problems, including modeling errors, solver errors, and actuator faults. In essence, the real control input applied to the system always deviates from the ideal one based on a perfect controller, resulting in an imperfect controller. In this work, we provide a general analysis to quantify the suboptimality of MPC for Lipschitz-continuous nonlinear systems due to imperfect control inputs in terms of textit{dynamic regret}. Based on a general assumption about how the imperfect controller may improve over time, sublinear regret upper bounds are established for cases where the closed-loop system under the ideal controller is Lipschitz-contractive (i.e., its Lipschitz constant is smaller than one). In addition, we also discuss how the regret scales when the closed-loop system under the oracle controller is not Lipschitz-contractive. The results provide insights into designing suitable MPC strategies, especially for learning-based MPC.
|
| |
| 16:45-17:00, Paper FrC02.2 | |
| Learning Safe Data-Driven Control Barrier Functions for Unknown Continuous Systems |
|
| Zhu, Feiya | Northeastern University |
| Pati, Tarun | Northeastern University |
| Yong, Sze Zheng | Northeastern University |
Keywords: Data driven control, Constrained control, Robust control
Abstract: This letter presents a semi-parametric approach for learning safe data-driven control barrier functions (SDD-CBFs) for unknown continuous systems from noisy data. By leveraging optimization theory, interval and mixed-monotone bounding, and probably approximately correct (PAC) learning, we learn at design time both parametric control barrier functions (CBFs) and their non-parametric CBF conditions from noisy data with a mixed-integer linear program (MILP) to ensure robust safety despite generalization errors with a high probability. Moreover, we propose an online safety filter for minimally modifying any nominal controller for safety that reduces to computationally efficient quadratic programming.
|
| |
| 17:00-17:15, Paper FrC02.3 | |
| A Compositional Algorithm for Computing a Switched System Representation of Neural Network Controllers (I) |
|
| García Soto, Miriam | Universidad Complutense De Madrid |
| Prabhakar, Pavithra | Kansas State University |
Keywords: Neural networks, Switched systems, Hybrid systems
Abstract: Our broad motivation is to utilize the large body of work on verification techniques for switched affine systems towards verification of neural network-controlled systems. To this end, we explore the problem of computing a switched affine system (SAS) representation of neural network-controlled discrete-time linear dynamical systems by providing a compositional algorithm that computes the piecewise affine (PWA) representation of the neural network. Our algorithm relies on two subroutines - one that computes the PWA representation of a single layer of a neural network, and the other that computes the compositions of PWA representations. We introduce the concept of a composition ordering represented as a binary tree that specifies the order in which the layers of the neural network are composed, and use that to compute the PWA representation of the whole neural network. Our experimental evaluation highlights the critical parameters of the network affecting the runtime complexity. Finally, we illustrate the application of the PWA representation computation toward stability analysis of a neural network-controlled discrete-time linear dynamical system.
|
| |
| 17:15-17:30, Paper FrC02.4 | |
| Neural Barrier Certificates for Stochastic Control Systems (I) |
|
| Taheri, Sara | Ludwig Maximilian University of Munich |
| Zamani, Majid | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis, Stochastic systems, Neural networks
Abstract: This paper investigates controller synthesis for discrete-time stochastic control systems with unknown dynamics and unknown noise distribution, using control barrier certificates (CBCs) to ensure safety. Due to uncertainties and the lack of precise models, traditional model-based safety verification methods are often impractical for real-world cyber-physical systems. To address these challenges, we propose a data-driven framework that represents both the CBC and the associated controller as neural networks trained on a finite set of samples. Unlike prior work, our method jointly synthesizes the CBC and the controller without relying on model knowledge or predefined functional forms, and without requiring a posteriori validation. Our approach provides a probabilistic safety guarantee by establishing a lower bound on the probability of satisfying the safety specifications with a desired confidence level. Furthermore, by introducing a Lipschitz continuity-based validity condition and integrating it into the training process, we ensure that safety requirements hold not only for the training data but across the entire state set. Finally, we validate our approach on an inverted pendulum, demonstrating formal safety guarantees under uncertainty.
|
| |
| 17:30-17:45, Paper FrC02.5 | |
| Correct-By-Construction Barrier Certificate Synthesis for Safety Verification of Continuous Dynamical Systems (I) |
|
| Panja, Promit | Karlsruhe Institute of Technology |
| Platzer, Andre | Carnegie Mellon University |
Keywords: Formal Verification/Synthesis
Abstract: Barrier certificates are a powerful tool for verifying the safety of dynamical systems by certifying that trajectories remain within a safe region while avoiding unsafe states. However, due to the parametric nature of their synthesis procedure and the inaccuracies resulting from the required numerical solvers, constructing valid barrier certificates is challenging, requiring post-synthesis verification to confirm their correctness. This paper presents a combined framework for correct-by-construction barrier certificate synthesis by combining barrier certificate synthesis witnessing dynamical systems safety with polynomial synthesis witnessing the resulting real arithmetic via a combination of semidefinite programming (SDP) and Gröbner bases for the Real Nullstellensatz. This is achieved by exploiting a sum-of-squares relaxation for both the barrier certificate and Real Nullstellensatz witness construction procedures by solving a single combined sum-of-squares optimization program, hence eliminating the need for post-synthesis verification. We also show how a monomial basis for the witness can be chosen. We demonstrate the effectiveness of our framework on some examples, showcasing its ability to certify safety by synthesizing correct-by-construction barrier certificates.
|
| |
| 17:45-18:00, Paper FrC02.6 | |
| Learning Koopman-Based Stability Certificates for Unknown Nonlinear Systems (I) |
|
| Zhou, Ruikun | University of Waterloo |
| Meng, Yiming | University of Illinois Urbana-Champaign |
| Zeng, Zhexuan | Huazhong University of Science and Techonology |
| Liu, Jun | University of Waterloo |
Keywords: Learning, Lyapunov methods, Stability of nonlinear systems
Abstract: Koopman operator theory has gained significant attention in recent years for identifying discrete-time nonlinear systems by embedding them into an infinite-dimensional linear vector space. However, providing stability guarantees while learning the continuous-time dynamics, especially under conditions of relatively low observation frequency, remains a challenge within the existing Koopman-based learning frameworks. To address this challenge, we propose an algorithmic framework to simultaneously learn the vector field and Lyapunov functions for unknown nonlinear systems, using a limited amount of data sampled across the state space and along the trajectories at a relatively low sampling frequency, potentially as low as 10 Hz or less. The proposed framework builds upon recently developed high-accuracy Koopman generator learning for capturing transient system transitions and physics-informed neural networks for training Lyapunov functions. We show that the learned Lyapunov functions can be formally verified using a satisfiability modulo theories (SMT) solver and provide less conservative estimates of the region of attraction compared to existing methods.
|
| |
| 18:00-18:15, Paper FrC02.7 | |
| Bisimulation-Based Reduction of Neural Network Controllers (I) |
|
| Gupta, Lipsy | Kansas State University |
| Prabhakar, Pavithra | Kansas State University |
Keywords: Neural networks, Computer-aided control design, Formal Verification/Synthesis
Abstract: We consider the problem of constructing a neural network that is conformant with a given architecture and is closest to a given neural network, where the distance is formalized using the notion of approximate bisimulation on neural networks. We present two methods to solve the problem. Our first approach is based on reducing it to a mixed integer linear program (MILP); this returns the closest neural network and the distance. However, since MILP is an NP-complete problem, this approach is limited to smaller networks. We explore an alternate approach based on k-means clustering which provides a reduced system faster, however, is not guaranteed to return the closest. Our experimental evaluation shows that the second approach provides a reasonably close value as compared to the first approach. We also show that closed-loop behaviors with the original network and the reduced network are sufficiently close in practice.
|
| |
| 18:15-18:30, Paper FrC02.8 | |
| Recurrent Control Barrier Functions: A Path towards Nonparametric Safety Verification (I) |
|
| Liu, Jixian | Johns Hopkins University |
| Mallada, Enrique | Johns Hopkins University |
Keywords: Data driven control, Formal Verification/Synthesis, Robotics
Abstract: Ensuring the safety of complex dynamical systems often relies on Hamilton-Jacobi (HJ) Reachability Analysis or Control Barrier Functions (CBFs). Both methods require computing a function that characterizes a safe set that can be made (control) invariant. However, the computational burden of solving high-dimensional partial differential equations (for HJ Reachability) or large-scale semidefinite programs (for CBFs) makes finding such functions challenging. In this paper, we introduce the notion of Recurrent Control Barrier Functions (RCBFs), a novel class of CBFs that leverages a recurrent property of the trajectories, i.e., coming back to a safe set, for safety verification. Under mild assumptions, we show that the RCBF condition holds for the signed‑distance function, turning function design into set identification. Notably, the resulting set need not be invariant to certify safety. We further propose a data-driven nonparametric method to compute safe sets that is massively parellizable, and trades off conservativeness against computational cost.
|
| |
| FrC03 |
Oceania III |
| Event-Triggered and Self-Triggered Control Theory |
Invited Session |
| Chair: Yao, Ningshi | George Mason University |
| Co-Chair: Khorasani, Khashayar | Concordia University |
| Organizer: Malisoff, Michael | Louisiana State University |
| Organizer: Nowzari, Cameron | George Mason University |
| Organizer: Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
| Organizer: Yao, Ningshi | George Mason University |
| |
| 16:30-16:45, Paper FrC03.1 | |
| Dynamic Periodic Event-Triggered Control for the Stabilization of Nonlinear Rational Systems (I) |
|
| Lisbôa, Cristyan | UFRGS |
| Flores, Jeferson Vieira | UFRGS |
| Moreira, Luciano Gonçalves | IFSUL |
| Tarbouriech, Sophie | LAAS-CNRS |
Keywords: Nonlinear systems, Networked control systems, LMIs
Abstract: This paper addresses the regional stabilization of continuous-time nonlinear rational systems with dynamic periodic event-triggered control. Based on the differential-algebraic representation of the controlled system, the looped-functional approach, and Lyapunov arguments, we derive constructive conditions in the form of linear matrix inequalities to regionally stabilize the origin of the closed-loop system. These conditions are cast into a convex optimization problem, providing a systematic co-design approach, to optimally compute the control law gains and the triggering parameters aiming at reducing the control updates regarding the periodic time-triggered strategy. A numerical example illustrates the proposed method.
|
| |
| 16:45-17:00, Paper FrC03.2 | |
| Semi-Global Exponential Stabilization of Nonlinear Time-Delay Systems Via Sampled-Data Event-Triggered Controllers (I) |
|
| Di Ferdinando, Mario | University of L'Aquila |
| Borri, Alessandro | CNR-IASI |
| Di Gennaro, Stefano | University of L'Aquila |
| Pepe, Pierdomenico | University of L' Aquila |
Keywords: Sampled-data control, Delay systems, Stability of nonlinear systems
Abstract: In this paper, the sampled-data event-triggered stabilization problem of nonlinear time-delay systems is addressed. In particular, a methodology for the design of sampled-data event-triggered exponential stabilizers is provided for locally Lipschitz nonlinear systems not necessarily affine in the control input and in presence of state delays. As a first step, the new notion of Steepest Descent Exponential Feedbacks (SDEFs) is introduced. Then, it is shown that there exists a suitably fast sampling such that the digital implementation of SDEFs, updated through a proposed event-triggered mechanism, ensures the semi-global exponential stability property of the corresponding closed-loop system. The stabilization in the sample-and-hold sense theory and the Halanay's inequality approach are used as tools to prove the results. In the proposed methodology, the inter-sampling system behaviour as well as time-varying sampling intervals are taken into account. Moreover, delay-free systems are included as a special case. The results are validated through a numerical example.
|
| |
| 17:00-17:15, Paper FrC03.3 | |
| Quantized Distributed Estimation with Event-Triggered Communication and Packet Loss (I) |
|
| Wang, Ying | KTH Royal Institute of Technology |
| Zhao, Yanlong | Academy of Mathematics and Systems Science, Chinese Academyof Sci |
| Zhang, Ji-Feng | Chinese Academy of Sciences |
| Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Estimation, Networked control systems, Linear systems
Abstract: This paper focuses on the problem of quantized distributed estimation with event-triggered communication and packet loss, aiming to reduce the number of transmitted bits. The main challenge lies in the inability to differentiate between an untriggered event and a packet loss occurrence.This paper proposes an event-triggered distributed estimation algorithm with quantized communication and quantized measurement,in which it introduces a one-bit information reconstruction method to deal with packet loss. The almost sure convergence and convergence rate of the proposed algorithm are established. Besides, it is demonstrated that the global average communication bit-rate decreases to zero over time. Moreover, the trade-off between communication rate and convergence rate is revealed, providing guidance for designing the communication rate required to achieve the algorithm's convergence rate.A numerical example is supplied to validate the findings.
|
| |
| 17:15-17:30, Paper FrC03.4 | |
| On Consistency-Preserving Stochastic Event Trigger Design (I) |
|
| Schmitt, Eva Julia | Otto Von Guericke University Magdeburg (OVGU) |
| Noack, Benjamin | Otto Von Guericke University Magdeburg (OVGU) |
Keywords: Kalman filtering, Estimation, Sensor networks
Abstract: Event-based transmissions and estimation can effectively reduce the burden on the communications system in spatially distributed sensing and estimation setups while maintaining good estimation performance. This can be achieved by transmitting only those sensor measurements that fulfill a predefined transmission policy and using a remote estimator that can exploit knowledge of the triggering condition in non-transmission instants. However, it is crucial to ensure the consistency of the remote estimator to obtain reliable estimates. Depending on the transmission policy, this is difficult to achieve. In this paper, a generalized framework for stochastic event triggers is provided and design criteria for triggering policies are deduced that allow for simple estimator design. The results obtained are evaluated with the help of system simulations using a novel triggering policy developed under the proposed design criteria and show its advantages.
|
| |
| 17:30-17:45, Paper FrC03.5 | |
| Observer-Based Event-Triggered Control of 2x2 Linear Hyperbolic PDEs with Switching Dynamic Triggering (I) |
|
| Rathnayake, Bhathiya | Student (University of California San Diego) |
| Diagne, Mamadou | University of California San Diego |
Keywords: Backstepping, Stability of hybrid systems, Switched systems
Abstract: This paper introduces a novel observer-based dynamic event-triggered control (ETC) approach for 2x2 linear hyperbolic PDEs with collocated sensing and actuation. The proposed method guarantees global exponential stability (GES) under ETC using PDE backstepping, with stability estimates provided in the spatial L2 norm of the states. The triggering mechanism explicitly enforces a lower bound on the time interval between consecutive events, ruling out the possibility of Zeno behavior, and introduces a dynamic variable with appropriate switching dynamics. Events are triggered when the dynamic variable crosses zero from positive, after which the variable is immediately reset to an appropriate nonnegative value. By employing a novel Lyapunov functional that incorporates the L^2 norms of the observer and observer error target system states, the switching dynamic variable, and a state-independent dynamic reset variable modulated by the square of the control input sampling error, the GES of the closed-loop system is established under the zero-order hold application of the backstepping control between events. The well-posedness of the closed-loop system is also proven. A simulation study is conducted to illustrate the theory.
|
| |
| 17:45-18:00, Paper FrC03.6 | |
| Zero Dynamics Attack Detection and Isolation in Cyber-Physical Systems with Event-Triggered Communication |
|
| Eslami, Ali | Concordia University |
| Khorasani, Khashayar | Concordia University |
Keywords: Agents-based systems, Cooperative control, Networked control systems
Abstract: This paper investigates the problem of Zero Dynamics (ZD) cyber-attack detection and isolation in Cyber-Physical Systems (CPS). By utilizing the notion of auxiliary systems with event-based communications, we will develop a detection mechanism capable of detecting and isolating the ZD cyber-attack even when the attackers have full knowledge of the dynamics of the auxiliary system and can launch False Data Injection (FDI) attacks on all the communication channels. More specifically, we will utilize a self-triggering rule for the communication channels connecting the auxiliary system with the Command & Control (C&C) center, leveraging its properties to detect the ZD cyberattack. Finally, the effectiveness and capabilities of our approach are verified and demonstrated through simulation case studies.
|
| |
| 18:00-18:15, Paper FrC03.7 | |
| Comparison Perspectives for Time-, Event-, and Self-Triggered Control |
|
| Meister, David | University of Stuttgart |
| Lang, Simon | University of Stuttgart |
| Allgöwer, Frank | University of Stuttgart |
Keywords: Networked control systems, Sampled-data control
Abstract: Event-based control, subsuming event- and self-triggered control, has the potential to reduce the average triggering rate compared to periodic control while still fulfilling a control objective, e.g., control performance. Different formulations of the inherent performance triggering rate trade-off have been studied to formally show this advantage. One of them compares control performance of triggering schemes under equal average triggering rates. Utilizing this perspective to compare an event-based control scheme to the best periodic control scheme yields the notion of consistency. Alternatively, triggering schemes can be compared with respect to a weighted cost including both control performance and triggering rate. In this work, we discuss the relationship between these different perspectives on comparing triggering schemes, with a particular focus on comparing event-based and periodic control. As a direct outcome, we provide different ways of interpreting consistency (or its absence). Moreover, we discuss key factors in triggering scheme comparisons which can serve as a conceptual basis for the future development of evaluation methods for triggering schemes. With our work, we raise awareness for the importance of triggering scheme evaluations that go beyond numerical examples for one specific parameter choice.
|
| |
| 18:15-18:30, Paper FrC03.8 | |
| Event-Triggered Source Seeking Control for Nonholonomic Systems (I) |
|
| Rodrigues, Victor Hugo Pereira | State University of Rio De Janeiro (UERJ) |
| Oliveira, Tiago Roux | State University of Rio De Janeiro |
| Krstic, Miroslav | University of California, San Diego |
Keywords: Extremum seeking, Nonholonomic systems, Sampled-data control
Abstract: This paper introduces an event-triggered source seeking control (ET-SSC) for autonomous vehicles modeled as the nonholonomic unicycle. The classical source seeking control is enhanced with static-triggering conditions to enable aperiodic and less frequent updates of the system's input signals, offering a resource-aware control design. Our convergence analysis is based on time-scaling combined with Lyapunov and averaging theories for systems with discontinuous right-hand sides. ET-SSC ensures exponentially stable behavior for the resulting average system, leading to practical asymptotic convergence to a small neighborhood of the source point. We guarantee the avoidance of Zeno behavior by establishing a minimum dwell time to prevent infinitely fast switching. The performance optimization is aligned with classical continuous-time source seeking algorithms while balancing system performance with actuation resource consumption. Our ET-SSC algorithm, the first of its kind, allows for arbitrarily large inter-sampling times, overcoming the limitations of classical sampled-data implementations for source seeking control.
|
| |
| FrC04 |
Oceania IV |
| Intelligent Decision-Making and Advanced Control for Unmanned Systems |
Invited Session |
| Chair: Chen, Zhang | Tsinghua University |
| Co-Chair: Liu, Hao | Beihang University |
| Organizer: Liu, Hao | Beihang University |
| Organizer: Chen, Zhang | Tsinghua University |
| Organizer: Jia, (Samuel) Qing-Shan | Tsinghua University |
| |
| 16:30-16:45, Paper FrC04.1 | |
| Critic-Only Learning-Based Optimal Visual Servoing Control for Quadrotors with Safe Constraints (I) |
|
| Yi, Xinning | Beihang University |
| Liu, Hao | Beihang University |
| Xue, Shibei | Shanghai Jiao Tong University |
Keywords: Iterative learning control, Control applications, Optimal control
Abstract: This paper investigates the constrained optimal visual servoing control problem for quadrotors tracking moving ground targets without direct position measurements. To ensure compliance with safe constraints, the quadrotor visual servoing model is transformed using a barrier function, and formulated as a time-varying optimal control problem for both the image feature and attitude systems. The proposed optimal control law is derived using an integral learning-based approach, and implemented through a modified critic-only neural network with time-related basis functions. Simulation results validate the effectiveness of the proposed control approach.
|
| |
| 16:45-17:00, Paper FrC04.2 | |
| Multilateral Collaborative Teleoperation with Self-Tuning Authority Via Prescribed Performance Control (I) |
|
| Wang, Ziwei | Lancaster University |
| Fei, Haolin | Lancaster University |
| Xue, Tao | Tsinghua University |
| Lam, Hak-Keung | King's College London |
| Guo, Yao | Shanghai Jiao Tong University |
| Williams, Darren | Lancaster University |
| Xiao, Bo | Dr |
| Yeatman, Eric | Imperial College London |
Keywords: Constrained control, Adaptive control, Fuzzy systems
Abstract: Most teleoperation strategies focus on time-delay stability of single-local/single-remote (SL/SR) systems but lack mechanisms to correct delay-induced manipulation errors. Moreover, state constraints further complicate control design. To tackle these issues, we propose an effective solution to multi-local/single-remote teleoperation systems with constraint requirements. The human-robot and robot-environment interaction processes are first modeled as interval type-2 (IT2) polynomial-fuzzy-model-based (PFMB) systems, where the nonlinearity and parameter uncertainty are effectively captured by the polynomial system matrices with IT2 membership functions. A critic-neural-network-based distribution adjusts operator authority weights using periodic position synchronization and force-tracking metrics. An adaptive IT2 PFMB control framework enhances transient/steady-state performance while enforcing asymmetric constraints through a novel bias state transformation technique, converting asymmetric constraints into symmetric ones for tractable handling. We have rigorously shown that position synchronization and force tracking performance can be guaranteed by introducing a fuzzy Lyapunov-Krasovskii functional. The simulation results have verified the merits of the proposed control structure.
|
| |
| 17:00-17:15, Paper FrC04.3 | |
| Closed-Loop Energy Optimized Control of a Dielectric Elastomer Soft Robot Via Policy Iteration |
|
| Massenio, Paolo Roberto | Polytechnic University of Bari |
| Soleti, Giovanni | Saarland University |
| Ruderman, Michael | University of Agder |
| Naso, David | Politecnico Di Bari |
| Rizzello, Gianluca | Saarland University |
Keywords: Robotics, Optimal control, Nonlinear systems
Abstract: This paper presents an energy optimized closed-loop control strategy for a soft robot based on dielectric elastomer actuators (DEAs). While soft robotics offer advantages in flexibility, compliance, and safety compared to conventional solutions, the development of energy-efficient actuation strategies remains a key challenge. Existing methods primarily focus on trajectory optimization or structural design, with limited attention to control strategies. This work addresses this issue by combining an energy-based inner stabilizing controller with an outer optimal control loop to optimize transient performance and energy consumption. A nonlinear cost functional that balances electro-mechanical energy dissipation and dynamic performance is proposed. By embedding the DEAs' electrical dynamics and using a Policy Iteration-based approach, the method designs an additional control input with time-varying bounds, constrained by the voltage used by the stabilizing loop. Simulation results show significant improvements in dynamic performance, energy efficiency, and tuning flexibility.
|
| |
| 17:15-17:30, Paper FrC04.4 | |
| LiDAR-Inertial SLAM-Based Navigation and Safety-Oriented AI-Driven Control System for Skid-Steer Robots |
|
| Shahna, Mehdi Heydari | Tampere University |
| Haaparanta, Eemil | Tampere University |
| Mustalahti, Pauli | Tampere University |
| Mattila, Jouni | Tampere University |
Keywords: Robust adaptive control, Autonomous robots, Sensor fusion
Abstract: Integrating artificial intelligence (AI) and stochastic technologies into the mobile robot navigation and control (MRNC) framework while adhering to rigorous safety standards presents significant challenges. To address these challenges, this paper proposes a comprehensively integrated MRNC framework for skid-steer wheeled mobile robots (SSWMRs), in which all components are actively engaged in real-time execution. The framework comprises: 1) a LiDAR-inertial simultaneous localization and mapping (SLAM) algorithm for estimating the current pose of the robot within the built map; 2) an effective path-following control system for generating desired linear and angular velocity commands based on the current pose and the desired pose; 3) inverse kinematics for transferring linear and angular velocity commands into left and right side velocity commands; and 4) a robust AI-driven (RAID) control system incorporating a radial basis function network (RBFN) with a new adaptive algorithm to enforce in-wheel actuation systems to track each side motion commands. To further meet safety requirements, the proposed RAID control within the MRNC framework of the SSWMR constrains AI-generated tracking performance within predefined overshoot and steady-state error limits, while ensuring robustness and system stability by compensating for modeling errors, unknown RBF weights, and external forces. Experimental results verify the proposed MRNC framework performance for a 4,836 kg SSWMR operating on soft terrain.
|
| |
| 17:30-17:45, Paper FrC04.5 | |
| Swinging Control of a Robot Wheel with Soft Actuators |
|
| Mitterbach, Philipp | Eindhoven University of Technology |
| Kuling, Irene | Technical University of Eindhoven |
| Pogromsky, A. Yu. | Eindhoven University of Technology |
Keywords: Robotics, Nonlinear systems, Control applications
Abstract: Soft robotics enables the development of lightweight robots that can be fabricated cost-effectively using additive manufacturing techniques. However, the continuum nature of soft actuators, governed by partial differential equations, introduces significant control challenges. This is because many first-principles control methods, which are designed for mechanical systems without soft actuators, are not applicable to those soft robots. To address this, a soft robotic wheel demonstrator was designed to achieve motion by dynamically shifting its center of gravity through pneumatic actuation. The system is modeled as a pendulum-on-a-wheel, enabling first-principles analysis via Euler-Lagrange equations. A swinging control strategy, which is inspired by Fradkov’s energy control approach, is developed to control the system's behavior. A proof is provided, showing that for almost all initial conditions, energy converges to the target level, ensuring controlled rotation of the wheel. Simulations validate the method’s effectiveness in achieving efficient energy utilization and controlled motion.
|
| |
| 17:45-18:00, Paper FrC04.6 | |
| Path Efficiency Enhanced Exploration Method for Mobile Robot in Unknown Unstructured Environments |
|
| Hu, Yiming | Huazhong University of Science and Technology |
| Wang, Shuting | Huazhong University of Science and Technology |
| Xie, Yuanlong | Huazhong University of Science and Technology |
| Zhang, Youmin | Concordia University |
| Cheng, Xiang | Huazhong University of Science and Technology |
Keywords: Robotics, Autonomous robots, Agents-based systems
Abstract: Autonomous exploration of unknown environments is one of the fundamental capabilities of intelligent mobile robots, which affects their adaptability in complex environment navigation tasks. Traditional frontier-based exploration methods prioritize exploration coverage while neglecting path cost, whereas some novel machine learning-based approaches often require substantial training and tuning to maintain effectiveness. To address this issue, this paper integrates the frontier-based method with artificial potential fields to develop a new local navigation decision-making approach, enabling the robot to respond quickly to its surroundings while simultaneously performing exploration and obstacle avoidance. Subsequently, the newly designed global exploration strategy successfully selects the optimal frontier with both the exploration value and the shortest path cost based on the minimum potential energy principle, thus prioritizing the exploration of a local area before visiting others. Simulations and experiments demonstrate that in unstructured scenarios with irregular layouts, the proposed method ensures good exploration coverage while significantly reducing path cost compared to existing methods, thereby improving the efficiency of the robot.
|
| |
| 18:00-18:15, Paper FrC04.7 | |
| A Harmonic Potential Field-Based Method for Robot Exploration in Narrow Environments |
|
| Tarantos, Spyridon | New York University Abu Dhabi |
| Panetsos, Fotis | New York University Abu Dhabi |
| Rodopoulos, Dimitrios | New York University Abu Dhabi |
| Kyriakopoulos, Kostas J. | National Tech. Univ. of Athens |
Keywords: Robotics, Autonomous robots, Autonomous vehicles
Abstract: In this work, we propose an exploration approach appropriate for robots called to operate in narrow environments using sensors with limited field-of-view (FOV). It consists of two modules, namely, a controller that generates reference velocity commands consistent with a harmonic potential field (HPF) built in the robot configuration space, and an optimization-based controller (OBC) that intervenes between the HPF-based controller and the robot in order to enforce robot safety. Building the HPF in the configuration space permits us to control both the robot position and orientation, taking into account the robot shape and the limitations imposed by the sensor FOV. In order to ensure that the time required for the solution of the HPF will not jeopardize the robot safety, the OBC equipped with constraints, built upon appropriate control barrier functions, guarantees that the robot will remain within the explored region considered for the generation of the HPF until the field is updated with the latest information. The effectiveness of the proposed method in exploring safely narrow environments is demonstrated in simulations, highlighting its real-time capabilities.
|
| |
| 18:15-18:30, Paper FrC04.8 | |
| Adaptive Integral-Gain Controller for Robust Quadrotor Navigation with Fourier Neural Network Compensation |
|
| Tevera Ruiz, Alejandro | Heudiasyc CNRS, Université De Technologie De Compiègne |
| Sanchez-Orta, Anand | CINVESTAV |
| Castillo, Pedro | Univ De Technologie De Compiegne |
| Chazot, Jean Daniel | Universite De Technologie De Compiegne |
| Munoz-Vazquez, Aj | Texas A&M University |
Keywords: Robotics, Neural networks, Autonomous vehicles
Abstract: In this paper, an adaptive integral-gain controller with Fourier Neural Network (NN) compensation is proposed to automatically handle changing operational conditions, particularly aerodynamic disturbances in quadrotor navigation. The proposed controller compensates for external disturbances by approximating the residual error dynamics, ensuring that the parameters are bounded over time without prior knowledge. Theoretical guarantees are provided in the sense of Lyapunov analysis, considering the closed-loop system with adaptive and NN mechanisms. Numerical results demonstrate that the proposed approach can properly manage quasiperiodic disturbances. Experimental results, using a Parrot AR Drone in torque mode, validate the controller under different operational scenarios and conditions, showing superior tracking accuracy and faster recovery from sustained and intermittent disturbances compared to its non-adaptive counterpart.
|
| |
| FrC05 |
Galapagos II |
| Precision Mechatronics |
Invited Session |
| Chair: Özparpucu, Mehmet Can | Cigus GmbH |
| Co-Chair: Al Janaideh, Mohammad | University of Guelph |
| Organizer: Keulen, Jurrien | University of Groningen |
| Organizer: Jayawardhana, Bayu | University of Groningen |
| Organizer: Rakotondrabe, Micky | ENIT Tarbes, INPT, University of Toulouse |
| Organizer: Boudaoud, Mokrane | Sorbonne Université |
| Organizer: Al Janaideh, Mohammad | University of Guelph |
| |
| 16:30-16:45, Paper FrC05.1 | |
| Data-Based Encryption Iterative Learning Heading Control for Unmanned Surface Vehicles |
|
| Chen, Chen | Jiangnan University |
| Zhao, Huarong | Jiangnan University |
| Shan, Jinjun | York University |
| Yu, Hongnian | Edinburgh Napier University |
Keywords: Data driven control, Iterative learning control, Networked control systems
Abstract: This paper investigates a data-driven iterative learning heading control problem for unmanned surface vehicles (USVs) with encoding-decoding mechanisms. First, a compact form dynamic linearized model of the USV is established using dynamic linearization techniques and redefined outputs. Then, an encoding-decoding scheme is designed, which encodes the data before transmission and decodes the data received by the controller. This strategy compresses the data and offers protection from potential breaches of information. Finally, the convergence of the designed method is theoretically analyzed, and simulation results demonstrate its effectiveness in enhancing heading control performance.
|
| |
| 16:45-17:00, Paper FrC05.2 | |
| Frequency Response Analysis of General Zero-Crossing Reset Control Systems |
|
| van Eijk, Luke Franciscus | Delft University of Technology |
| Kostic, Dragan | ASM Pacific Technology |
| HosseinNia, S. Hassan | Delft University of Technology |
Keywords: Nonlinear output feedback, Numerical algorithms, Data driven control
Abstract: This article introduces an output prediction method for a general class of closed-loop reset control systems. The considered type of system consists of a linear time-invariant (LTI) part which is connected in feedback with a reset controller that (partially) resets (a part of) its states when its input is equal to zero. Given some practical assumptions on the reset element's input signal, the system output can be accurately predicted when the system is subject to a sinusoidal input. One benefit of this approach is that it provides an intuitive frequency-domain representation of the system. Another benefit is that output prediction can be done based solely on a frequency-response function (FRF) of the LTI part of the system. This article also introduces an accurate and computationally efficient algorithm which can -- based on the predicted output -- compute a closed-loop pseudo-sensitivity. This pseudo-sensitivity represents the ratio between the maximum absolute value of the system's output and the amplitude of its input, similar to the closed-loop sensitivity functions for LTI systems.
|
| |
| 17:00-17:15, Paper FrC05.3 | |
| Estimating Poles of Motion Systems from Output-Only Measurements Using Generalized Transmissibility Operators (I) |
|
| Aljanaideh, Khaled | Jordan University of Science and Technology |
| Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics, Linear systems, Modeling
Abstract: Transmissibility operators are mathematical objects that relate two subsets of outputs of a dynamic system. Transmissibility operators are used in applications where the dynamics of the underlying system and the input excitation are not available. Therefore, output measurements collected using sensors, which represent the only available information about the dynamic system in this case, can be used to identify transmissibility operators. Generalized transmissibility operators, which have been recently introduced, provide a more general mathematical characterization of transmissibility operators by relaxing an assumption that requires knowledge of the dimension of the excitation signal to construct meaningful transmissibility operators. Although generalized transmissibility operators are constructed from the zeros of the underlying system and not the poles, we show in this paper that the determinant of the difference between two generalized transmissibility operators constructed between the same outputs but under different input locations can be used to determine the poles of the underlying system. We apply the proposed approach to determine the poles of a motion system from two transmissibility operators.
|
| |
| 17:15-17:30, Paper FrC05.4 | |
| A Data-Driven LMI-Based Approach for Feedback Control of Discrete-Time Lur'e Systems with an L2-Gain Bound |
|
| Perci, Yasmin M. | Universidade Estadual De Campinas (UNICAMP) |
| Oliveira, Ricardo C. L. F. | University of Campinas - UNICAMP |
| Peres, Pedro L. D. | University of Campinas |
| Tarbouriech, Sophie | LAAS-CNRS |
Keywords: Data driven control, Stability of nonlinear systems, LMIs
Abstract: This paper proposes an LMI-based data-driven approach to compute a stabilizing control law assuring an upper bound on the l2-gain for discrete-time Lur'e systems with sector bounded nonlinearities. The control law is composed by a state feedback and a term proportional to the output of the nonlinear block. The design conditions, constructed from experimental data, provide feedback gains through slack variables, allowing the incorporation of structural constraints such as decentralization to be imposed without introducing extra conservatism in the Lyapunov function. The effectiveness of the approach is illustrated by means of numerical examples.
|
| |
| 17:30-17:45, Paper FrC05.5 | |
| PID-Like Controllers for a Class of Uncertain Underactuated Nonlinear Systems with Application to Wafer-Handling Robot |
|
| Al Saaideh, Mohammad | Memorial University of Newfoundland |
| Boker, Almuatazbellah | Virginia Tech |
| Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics
Abstract: We develop an output feedback tracking controller for a class of underactuated nonlinear systems, specifically applied to a wafer handling robot in semiconductor manufacturing. The proposed controller exhibits performance similar to a PID controller, incorporating proportional, derivative, and integral components to shape both transient and steady-state behavior. Unlike traditional approaches, the controller does not require any stability assumptions on the system’s zero dynamics, making it well-suited for non-minimum phase systems. The design process begins with a full-state feedback controller to achieve the tracking objective. This is then extended into an output feedback control framework by integrating an Extended High-Gain Observer (EHGO). The EHGO employs a cascade observer structure, estimating the unmeasured output from available system measurements to ensure accurate feedback. As a case study, we demonstrate that for a relative degree-two system, the state feedback controller simplifies into a PID-like structure, enhancing ease of implementation. The proposed control strategy is validated through its application to a wafer-handling robot that is described as a flexible-joint system, showcasing its ability to achieve precise trajectory tracking while compensating for uncertainties and external disturbances. The results highlight the controller’s robustness and adaptability, making it a viable solution for high-precision wafer transfer operations in semiconductor fabrication.
|
| |
| 17:45-18:00, Paper FrC05.6 | |
| Time-Optimal Control for Braking Series-Elastic Actuators |
|
| Özparpucu, Mehmet Can | Cigus GmbH |
Keywords: Optimal control, Robotics, Mechatronics
Abstract: In this paper, we solve the time-optimal synthesis problem for braking an undamped linear series-elastic actuator under a maximum motor torque constraint. In particular, by analysing the switching locus and the time-2nπ-controllable sets to the origin we identify geometrical structures that divide the state-space into distinct regions where the time-optimal control is equal to either the minimum or maximum torque.
|
| |
| 18:00-18:15, Paper FrC05.7 | |
| Switching PID Strategy for Positioning Systems with Non-Symmetric Friction and Noisy Velocity Measurements |
|
| Lorigiola, Riccardo | University of Padova |
| Borzone, Tommaso | Eni |
| Bruschetta, Mattia | University of Padova |
| Cenedese, Angelo | University of Padova |
Keywords: Control applications, Mechatronics, Manufacturing systems and automation
Abstract: This paper proposes a model-free, switching PID control strategy for point-to-point motion in positioning systems, where asymmetric friction and noisy velocity measurements are present. Traditional low-level control methods, such as PID, struggle with friction and noise in real-world conditions, particularly due to the difficulty in tuning parameters. The proposed approach employs a two-stage state-machine design to manage friction asymmetries and measurement noise, without requiring friction estimation or complex models. Experimental results on an industrial system demonstrate the approach’s simplicity, robustness, and independence from system-specific parameters, offering a practical solution for high-precision, repeatable motion control in the presence of challenging disturbances.
|
| |
| 18:15-18:30, Paper FrC05.8 | |
| Learning Sparse Rational Feedforward Controllers |
|
| Ickenroth, Tjeerd | Eindhoven University of Technology |
| Oomen, Tom | Eindhoven University of Technology |
Keywords: Iterative learning control, Mechatronics, Identification for control
Abstract: Iterative Learning Control (ILC) with basis function techniques are capable of improving tracking performance and task flexibility. The aim of this paper is to design a systematic approach to enable automatic rational basis function selection for feedforward learning. A sparse optimization framework is proposed to identify the most relevant rational basis functions from a large candidate set. The ILC algorithm that employs sparse optimization is able to automatically select relevant rational basis functions and is validated on an example motion system.
|
| |
| FrC06 |
Oceania I |
| Encrypted Control and Optimization |
Invited Session |
| Chair: Schulze Darup, Moritz | TU Dortmund University |
| Co-Chair: Kim, Junsoo | Seoul National University of Science and Technology |
| Organizer: Schulze Darup, Moritz | TU Dortmund University |
| Organizer: Alexandru, Andreea B. | Duality Technologies |
| Organizer: Kim, Junsoo | Seoul National University of Science and Technology |
| |
| 16:30-16:45, Paper FrC06.1 | |
| Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis |
|
| Suh, Jihoon | Purdue University |
| Jang, Yeongjun | Seoul National University |
| Teranishi, Kaoru | Purdue University |
| Tanaka, Takashi | Purdue University |
Keywords: Reinforcement learning, Markov processes, Networked control systems
Abstract: We propose an efficient encrypted policy synthesis to develop privacy-preserving model-based reinforcement learning. We first demonstrate that the relative-entropy-regularized reinforcement learning framework offers a computationally convenient linear and ``min-free'' structure for value iteration, enabling a direct and efficient integration of fully homomorphic encryption with bootstrapping into policy synthesis. Convergence and error bounds are analyzed as encrypted policy synthesis propagates errors under the presence of encryption-induced errors including quantization and bootstrapping. Theoretical analysis is validated by numerical simulations. Results demonstrate the effectiveness of the RERL framework in integrating FHE for encrypted policy synthesis.
|
| |
| 16:45-17:00, Paper FrC06.2 | |
| A Polynomial-Based QCQP Solver for Encrypted Optimization (I) |
|
| Schlor, Sebastian | University of Stuttgart |
| Iannelli, Andrea | University of Stuttgart |
| Kim, Junsoo | Seoul National University of Science and Technology |
| Shim, Hyungbo | Seoul National University |
| Allgöwer, Frank | University of Stuttgart |
Keywords: Control Systems Privacy, Optimization, Optimization algorithms
Abstract: In this paper, we present a novel method for solving a class of quadratically constrained quadratic optimization problems using only additions and multiplications. This approach enables solving constrained optimization problems on private data since the operations involved are compatible with the capabilities of homomorphic encryption schemes. To solve the constrained optimization problem, a sequence of polynomial penalty functions of increasing degree is introduced, which are sufficiently steep at the boundary of the feasible set. Adding the penalty function to the original cost function creates a sequence of unconstrained optimization problems whose minimizer always lies in the admissible set and converges to the minimizer of the constrained problem. A gradient descent method is used to generate a sequence of iterates associated with these problems. For the algorithm, it is shown that the iterate converges to a minimizer of the original problem, and the feasible set is positively invariant under the iteration. Finally, the method is demonstrated on an illustrative cryptographic problem, finding the smaller value of two numbers, and the encrypted implementability is discussed.
|
| |
| 17:00-17:15, Paper FrC06.3 | |
| Taking Advantage of Rational Canonical Form for Faster Ring-LWE Based Encrypted Controller with Recursive Multiplication (I) |
|
| Song, Donghyeon | Seoul National University |
| Jang, Yeongjun | Seoul National University |
| Lee, Joowon | Seoul National University |
| Kim, Junsoo | Seoul National University of Science and Technology |
Keywords: Cyber-Physical Security, Computer/Network Security, Control Systems Privacy
Abstract: This paper aims to provide an efficient implementation of encrypted linear dynamic controllers that perform recursive multiplications on a Ring-Learning With Errors (Ring-LWE) based cryptosystem. By adopting a system-theoretical approach, we significantly reduce both time and space complexities, particularly the number of homomorphic operations required for recursive multiplications. Rather than encrypting the entire state matrix of a given controller, the state matrix is transformed into its rational canonical form, whose sparse and circulant structure enables that encryption and computation are required only on its nontrivial columns. Furthermore, we propose a novel method to ``pack'' each of the input and the output matrices into a single polynomial, thereby reducing the number of homomorphic operations. Simulation results demonstrate that the proposed design enables a remarkably fast implementation of encrypted controllers.
|
| |
| 17:15-17:30, Paper FrC06.4 | |
| Robust Privacy-Preserving Cloud-Based Control Using Reed-Solomon Codes (I) |
|
| Naseri, Amir Mohammad | Concordia University |
| Lucia, Walter | Concordia University |
| Youssef, Amr | Concordia University |
| Franze, Giuseppe | Universita' Della Calabria |
Keywords: Control Systems Privacy, Cyber-Physical Security
Abstract: Cloud-based networked control systems are emerging as a promising approach for managing complex processes by offloading control tasks to remote cloud platforms. While this architecture offers flexibility and scalability, it also raises significant privacy concerns, particularly regarding the sensitive measurements transmitted from the plant to third-party cloud providers, where the control algorithms are executed. In multi-cloud setups, an effective privacy-preserving solution is provided by the Shamir secret sharing scheme. When applied to networked control systems, this approach allows the measurement vector (the secret) to be split among multiple clouds without compromising the overall control law, which can be successfully and efficiently reconstructed from computations performed on the clouds. However, existing Shamir-based schemes have limited robustness, as they only ensure the reconstruction of the correct control input in the presence of missing shares but cannot handle cases where some shares are corrupted. This work presents a robust privacy-preserving computational scheme for multi-cloud control systems using a robust Reed-Solomon secret sharing scheme. The proposed solution enables the reconstruction of the control input despite missing or corrupted shares while preserving the secrecy of the measurements sent to the clouds. The effectiveness and potential of this approach are demonstrated through simulations involving a remotely controlled differential-drive robot.
|
| |
| 17:30-17:45, Paper FrC06.5 | |
| Encrypted Coordination for Distributed Building Thermal Control |
|
| Mahuze, Richard | Cornell University |
| Zhang, K. Max | Cornell University |
Keywords: Smart grid, Networked control systems, Control Systems Privacy
Abstract: Coordinated control of electric heating, ventilation, and air conditioning (HVAC) systems enhances grid flexibility but introduces significant privacy risks through smart meter data exposure and communication vulnerabilities. While distributed methods, such as the alternating direction method of multipliers (ADMM), mitigate some of these risks, iterative coordination steps remain susceptible. This paper proposes an encrypted ADMM framework utilizing Brakerski--Fan--Vercauteren (BFV) homomorphic encryption along with a secure random-chain aggregation protocol to safeguard individual building power consumption data during coordination. Co-simulation with EnergyPlus Functional Mock-up Units (FMUs) demonstrates that the approach effectively enforces global power constraints and reduces costs while maintaining user privacy against semi-honest adversaries with manageable computational overhead.
|
| |
| 17:45-18:00, Paper FrC06.6 | |
| Privacy-Preserving Fusion Estimation Over Multiple Markov Fading Channels |
|
| Huang, Jie | Tsinghua University |
| Gao, Chen | Tsinghua University |
| Liu, Jason J. R. | The University of Hong Kong |
Keywords: Control Systems Privacy, Filtering, Networked control systems
Abstract: This paper studies the privacy-preserving fusion estimation (PPFE) problem over multiple Markov fading channels (multi-MFCs) and proposes a distributed privacy-preserving mechanism (PPM). By combining an encoding-decoding scheme with Markov jump filter (MJF) design, the proposed mechanism ensures that legitimate user's estimation errors remain bounded while forcing eavesdropper's estimation errors to diverge. Within the distributed PPM design, a scalar parameter is introduced to amplify the divergence of eavesdropper's estimation errors. Additionally, a reference time updating mechanism utilizes Critical Events (CEs) to trigger decoding deviations in the eavesdropper's results, thereby ensuring data privacy. Extending confidentiality guarantees from single-channel scenarios to multi-MFC systems, this work proves that the legitimate user has bounded local and global estimation error covariance, whereas the eavesdropper experiences divergent global errors. A numerical example illustrates the exponential divergence of eavesdropper's estimation errors and the convergence of legitimate user's error covariance, demonstrating the effectiveness of the proposed PPFE mechanism.
|
| |
| 18:00-18:15, Paper FrC06.7 | |
| Accurate Performance Analysis of Distributed Adaptive Differential Privacy Estimation |
|
| Gan, Die | Chinese Academy of Science |
| Chen, Shuning | Academy of Mathematics and Systems Science, Chinese Academy |
| Liu, Nian | Academy of Mathematics and Systems Science, Chinese Academy of S |
| Zhao, Cheng | Academy of Mathematics and Systems Science, Chinese Academy of S |
| Ji, Xianchao | National Genomics Data Center |
Keywords: Identification, Time-varying systems, Control Systems Privacy
Abstract: The incorporation of differential privacy mechanisms within the distributed parameter estimation problem has attracted significant attention. However, existing differential privacy algorithms predominantly focus on deriving upper bounds on estimation error, which lack precise quantification of error magnitudes.In this paper,we propose a distributed privacy-preserving least mean squares (LMS) algorithm with a noise injection mechanism for estimating unknown time-varying parameters in stochastic regression models.To derive the accurate performance of the algorithm, we first establish the theoretical upper bound for the estimation error under the cooperative excitation condition, which requires neither independence nor stationarity of the regression vectors. The mean square estimation error matrix is then approximated through a linear deterministic difference matrix equation, rigorously quantifying the relationship between noise njection and estimation accuracy. Finally, a simulation example is provided to verify the effectiveness of the proposed algorithm.
|
| |
| 18:15-18:30, Paper FrC06.8 | |
| Distributed Finite-Horizon Optimal Control for Consensus with Differential Privacy Guarantees |
|
| Ma, Yuwen | University College London |
| Wang, Yongqiang | Clemson University |
| Spurgeon, Sarah K. | University College London |
| Chen, Boli | University College London |
Keywords: Control Systems Privacy, Control over communications, Networked control systems
Abstract: This paper addresses the problem of privacy- preserving consensus control for multi-agent systems (MAS) using differential privacy. We propose a novel distributed finite-horizon linear quadratic regulator (LQR) framework, in which agents share individual state information while preserving the confidentiality of their local pairwise weight matrices, which are considered sensitive data in MAS. Protecting these matrices effectively safeguards each agent’s private cost function and control preferences. Our solution injects consensus error-dependent Laplace noise into the communicated state information and employs a carefully designed time-dependent scaling factor in the local cost functions. This approach guarantees bounded consensus and achieves rigorous ϵ-differential privacy for the weight matrices without relying on specific noise distribution assumptions. Additionally, we analytically characterize the trade-off between consensus accuracy and privacy level, offering clear guidelines on how to enhance consensus performance through appropriate scaling of the LQR weight matrices and the privacy budget.
|
| |
| FrC07 |
Capri I |
| Recent Achievement and Perspective Directions in Sliding Mode Control Iii |
Invited Session |
| Chair: Hsu, Liu | COPPE/UFRJ |
| Co-Chair: Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Organizer: Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Organizer: Hsu, Liu | COPPE/UFRJ |
| |
| 16:30-16:45, Paper FrC07.1 | |
| Multi-Layer Barrier Function-Based Adaptive Super-Twisting Controller (I) |
|
| Vie, Antoine Thibault | Technical University of Denmark |
| Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Galeazzi, Roberto | Technical University of Denmark |
| Papageorgiou, Dimitrios | Technical University of Denmark |
Keywords: Variable-structure/sliding-mode control
Abstract: This article presents an adaptive Super-Twisting Sliding Mode Control framework for uncertain first-order systems, with rate-bounded perturbations, where the bound is constant but unknown. Positive definite barrier functions, when used in self-tuning super-twisting controllers may introduce some conservatism in relation to initial estimations of the perturbation rate bound. Moreover, discrete time implementation of the algorithm does not necessarily guarantee the boundedness of the closed-loop trajectories when sudden changes in the perturbation occur in between two time samples. The salient features of the proposed methodology pertain to extending the use of positive semidefinite barrier functions to Super-Twisting controller adaptation and the employment of a "nested barriers" scheme that ensures boundedness of the solutions even for "unfavourable" perturbations-to-sampling time ratios. The stability of the closed-loop system is assessed via Lyapunov analysis and simulations demonstrate the efficacy of the proposed framework.
|
| |
| 16:45-17:00, Paper FrC07.2 | |
| Implicit Discretization of a PID-Like Sliding-Mode Controller for Double Integrator Systems (I) |
|
| Zapf, Martin | Graz University of Technology, Siemens Healthineers AG |
| Andritsch, Benedikt | Graz University of Technology |
| Koch, Stefan | Graz University of Technology |
| Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
| Horn, Martin | Graz University of Technology |
Keywords: Variable-structure/sliding-mode control
Abstract: This paper presents the fully implicit (backward-Euler) discretization of a PID-like sliding-mode controller for undisturbed double-integrator systems. The method inherits the third-order sliding accuracy of the continuous-time PID-like controller while eliminating discretization chattering. The controller is compared in simulation studies to an explicit Euler discretization and a cascaded implementation with a super-twisting controller in the inner loop. The results reveal unique strengths and weaknesses of each approach. It is finally applied to patient positioning of a real-world magnetic resonance imaging system, demonstrating its practical applicability.
|
| |
| 17:00-17:15, Paper FrC07.3 | |
| A Filtering Super-Twisting Controller with Noise Rejection (I) |
|
| Fornaro, Pedro | Centre for Ocean Energy Research - Maynooth University, Ireland |
| Mosquera, Facundo Daniel | LEICI Institute, National University of La Plata - CONICET and M |
| Evangelista, Carolina Alejandra | CONICET and LEICI, Facultad De Ingeniería, Universidad Nacional |
| Puleston, Paul Frederick | Universidad Nacional De La Plata |
| Ringwood, John V. | Maynooth University, Ireland |
Keywords: Variable-structure/sliding-mode control, Control applications, Emerging control applications
Abstract: This paper addresses the design of a filtering Super-Twisting (FST) controller with noise rejection. To effectively achieve noise rejection and improve ST performance, the proposed control structure includes a zero-phase sliding-mode filter, capable of rejecting unbounded measurement noise. The features of the FST controller reduce the control effort required to steer the sliding variable to zero, without compromising the control robustness. The convergence of the FST is demonstrated, and a numerical example based on wave energy systems is presented to illustrate the effectiveness of the proposal.
|
| |
| 17:15-17:30, Paper FrC07.4 | |
| Feedback Homogenization and Robust Stabilization of Switched Linear MIMO Systems |
|
| Labbadi, Moussa | Aix-Marseille University |
| Polyakov, Andrey | Inria, Univ. Lille |
| Efimov, Denis | Inria |
Keywords: Switched systems, Lyapunov methods
Abstract: In this paper, we investigate the stabilization of a class of switched linear systems subject to perturbations. We introduce a generalized homogenization approach for switched linear MIMO systems using linear feedback. A control algorithm is proposed to achieve exponential, finite-time, or nearly fixed-time stabilization of switched linear MIMO systems through the homogenization technique. Matrix inequalities are provided for the generator matrix of dilations and for tuning the gains. Additionally, we analyze the robustness of the proposed control algorithm against system uncertainties and disturbances. The theoretical findings are validated through numerical examples.
|
| |
| 17:30-17:45, Paper FrC07.5 | |
| A Robust Discrete-Time Internal Model-Based Controller for Continuous-Time LTI Systems with Uncertain Periodic Disturbances (I) |
|
| Azimi, Atabak | Graz University of Technology |
| Koch, Stefan | Graz University of Technology |
| Efimov, Denis | Inria |
| Reichhartinger, Markus | Graz University of Technology |
Keywords: Variable-structure/sliding-mode control, Output regulation, Robust control
Abstract: This paper introduces a robust discrete-time controller that combines internal model-based control with sliding mode techniques to reject periodic and matched disturbances in linear time-invariant systems. By explicitly accounting for discretization effects inherent in digital implementations, the controller bridges continuous-time dynamics and sampled-data operation. A systematic design approach is developed to handle uncertainties in the exosystem, ensuring bounded tracking error and improved disturbance rejection. Simulation studies on an active suspension system illustrate that the proposed method outperforms traditional internal model-based and sliding mode controllers, yielding reduced overshoot and tighter error bounds.
|
| |
| 17:45-18:00, Paper FrC07.6 | |
| Unit-Vector Control Design under Saturating Actuators (I) |
|
| Vitório, Andevaldo | Universidade Federal Do Amazonas |
| Coutinho, Pedro Henrique Silva | State University of Rio De Janeiro |
| Bessa, Iury | Federal University of Amazonas |
| Rodrigues, Victor Hugo Pereira | State University of Rio De Janeiro (UERJ) |
| Estrada, Antonio | Secihti - Centro De Ingenieria Y Desarrollo Industrial, Queretar |
| Oliveira, Tiago Roux | State University of Rio De Janeiro |
Keywords: Variable-structure/sliding-mode control, LMIs, Constrained control
Abstract: This paper deals with unit vector control design for multivariable polytopic uncertain systems under saturating actuators. For that purpose, we propose LMI-based conditions to design the unit vector control gain such that the origin of the closed-loop system is finite-time stable. Moreover, an optimization problem is provided to obtain an enlarged estimate of the region of attraction of the equilibrium point for the closed-loop system, where the convergence of trajectories is ensured even in the presence of saturation functions. Numerical simulations illustrate the effectiveness of the proposed approach.
|
| |
| 18:00-18:15, Paper FrC07.7 | |
| Sliding Mode Control Techniques for Voltage Source Converters under Low-Voltage Ride-Through Scenarios (I) |
|
| Gutiérrez-Florensa, Joan | University College Dublin |
| Anderson-Azzano, Jorge Luis | UNLP |
| Puleston, Paul Frederick | Universidad Nacional De La Plata |
| Ortega, Álvaro | Universidad Pontificia Comillas |
| Sigrist, Lukas | Universidad Pontificia Comillas |
Keywords: Variable-structure/sliding-mode control
Abstract: The present paper evaluates different sliding mode control techniques for the negative-sequence current injection of voltage source converters under low-voltage ride-through scenarios. Under such scenarios those devices are required, from the grid code specifications, to inject reactive power to help mitigate voltage sag situations. The implicit unbalanced and faulted conditions of this scenario demands a robust enough control strategy to ensure system stability. The ro- bustness requirements and the variety of scenarios justify the consideration of sliding mode for current control. However, the control method is also required to exhibit a smooth and rapid performance during steady-state operation conditions. Among the studied cases an adaptive super-twisting algorithm present better results for this application when considering long-term operation. The implementation of these control techniques might imply challenges on line protections. HiL application is discussed in order to assess the challenges and key considerations that need to be addressed.
|
| |
| 18:15-18:30, Paper FrC07.8 | |
| The Power Tower Function, a New Tool for the Control Design |
|
| Ghanes, Malek | Centrale Nantes |
| Barbot, Jean Pierre | Ecole Centrale Nantes & CNRS |
Keywords: Variable-structure/sliding-mode control
Abstract: In this paper, we propose a new function, the power tower function truncated at order 2, for designing control laws, and this in continuous time and under sampling. The idea behind is first to allow linking and bringing together with one control strategy the following properties: - fixed-finite time convergence, - recursive approach, and - non-matching perturbations. In doing so, a new sliding-mode control is obtained because the power tower function on order 2 is equivalent to the sgn function on the sliding manifold. In order to highlight the usefulness of the power tower function on the design of control laws, the stabilization of a double integrator subject to perturbations d1 and d2 where d1 is non-matching is treated. We have opted for a recursive control law (backstepping) to emphasize that it is possible, under certain conditions, to use the power tower function iteratively. Recursivity is made possible by using Filippov solutions in the iterative calculation of Lyapunov functions. Both fixed and finitetime convergences are then proved and an overestimation of the fixed-time is computed. The second part of this work is devoted to control design in the context of sampling control based on the power tower function. Simulation results are given to show the well founded of the proposed control.
|
| |
| FrC08 |
Oceania V |
| Machine Learning II |
Regular Session |
| Chair: Uribe, Cesar A. | Rice University |
| Co-Chair: Massai, Leonardo | EPFL |
| |
| 16:30-16:45, Paper FrC08.1 | |
| Traffic Flow Reconstruction from Limited Collected Data |
|
| Baloul, Nail | Ecole Nationale Des Ponts Et Chaussées |
| Hayat, Amaury | Ecole Des Ponts Paristech |
| Liard, Thibault | LS2N, École Centrale De Nantes |
| Lissy, Pierre | Ecole Des Ponts Et Chaussees |
Keywords: Machine learning, Nonlinear systems identification, Optimization
Abstract: We propose an efficient method for reconstructing traffic density with low penetration rate of probe vehicles. Specifically, we rely on measuring only the initial and final positions of a small number of cars which are generated using microscopic dynamical systems. We then implement a machine learning algorithm from scratch to reconstruct the approximate traffic density. This approach leverages learning techniques to improve the accuracy of density reconstruction despite constraints in available data. For the sake of consistency, we will prove that, if only using data from dynamical systems, the approximate density predicted by our learned-based model converges to a well-known macroscopic traffic flow model when the number of vehicles approaches infinity.
|
| |
| 16:45-17:00, Paper FrC08.2 | |
| Neural Spline Operators for Risk Quantification in Stochastic Systems |
|
| Wang, Zhuoyuan | Carnegie Mellon University |
| Romagnoli, Raffaele | Duquesne University |
| Azizzadenesheli, Kamyar | Purdue University |
| Nakahira, Yorie | Carnegie Mellon University |
Keywords: Machine learning, Neural networks, Stochastic systems
Abstract: Accurately quantifying long-term risk probabilities in diverse stochastic systems is essential for safety-critical control. However, existing sampling-based and partial differential equation (PDE)-based methods often struggle to handle complex varying dynamics. Physics-informed neural networks learn surrogate mappings for risk probabilities from varying system parameters of fixed and finite dimensions, yet can not account for functional variations in system dynamics. To address these challenges, we introduce physics-informed neural operator (PINO) methods to risk quantification problems, to learn mappings from varying functional system dynamics to corresponding risk probabilities. Specifically, we propose Neural Spline Operators (NeSO), a PINO framework that leverages B-spline representations to improve training efficiency and achieve better initial and boundary condition enforcements, which are crucial for accurate risk quantification. We provide theoretical analysis demonstrating the universal approximation capability of NeSO. We also present two case studies, one with varying functional dynamics and another with high-dimensional multi-agent dynamics, to demonstrate the efficacy of NeSO and its significant online speed-up over existing methods. The proposed framework and the accompanying universal approximation theorem are expected to be beneficial for other control or PDE-related problems beyond risk quantification.
|
| |
| 17:00-17:15, Paper FrC08.3 | |
| Distributed Optimization with Quantization for Computing Wasserstein Barycenters |
|
| Krawtschenko, Roman | Humboldt University |
| Uribe, Cesar A. | Rice University |
| Gasnikov, Alexander | Moscow Institute of Physics and Technology (State University) |
| Dvurechensky, Pavel | Weierstrass Institute for Applied Analysis and Stochastics |
Keywords: Machine learning, Optimization algorithms, Large-scale systems
Abstract: We study the problem of the decentralized computation of (entropy-regularized) semi-discrete fixed-support Wasserstein barycenters over a network. Building upon recent primal-dual approaches, we propose a sampling gradient quantization scheme that allows efficient communication and computation of approximate barycenters in a distributed manner. The iteration, sample, computational, and communication complexities of the proposed algorithm are shown, including dependency on the support size, the number of distributions, the desired accuracy, and the characteristics of the network. Numerical results validate our algorithmic analysis.
|
| |
| 17:15-17:30, Paper FrC08.4 | |
| On Model Protection in Federated Learning against Eavesdropping Attacks |
|
| Maity, Dipankar | University of North Carolina at Charlotte |
| Chakrabarti, Kushal | Tata Consultancy Services Research |
Keywords: Machine learning, Control Systems Privacy, Optimization algorithms
Abstract: In this study, we investigate the protection offered by Federated Learning algorithms against eavesdropping adversaries. In our model, the adversary is capable of intercepting model updates transmitted from clients to the server, enabling it to create its own estimate of the model. Unlike previous research, which predominantly focuses on safeguarding client data, our work shifts attention to protecting the client model itself. Through a theoretical analysis, we examine how various factors—such as the probability of client selection, the structure of local objective functions, global aggregation at the server, and the eavesdropper’s capabilities—impact the overall level of protection. We further validate our findings through numerical experiments, assessing the protection by evaluating the model accuracy achieved by the adversary. Finally, we compare our results with methods based on differential privacy, underscoring their limitations in this specific context.
|
| |
| 17:30-17:45, Paper FrC08.5 | |
| Juggler: Multitask Learning with Task Performance Constraints |
|
| Bar-el Avidan, Ella | Tel Aviv University |
| Bistritz, Ilai | Tel Aviv University |
Keywords: Machine learning, Stochastic systems, Optimization algorithms
Abstract: Consider multitask learning (MTL), in which N models share parts of their architecture (e.g., a backbone with a head for each task). Our goal is to train the overall model so that the training loss of each task falls below a given threshold. However, we may harm others when varying the shared parameters to help one task. A weighted total loss could balance between the different tasks to achieve the target threshold. Nevertheless, the weights that correctly balance the tasks are unknown in advance. To overcome that, we propose a scheme that divides the total training time into epochs of increasing length. The scheme adjusts the weights every epoch based on the performance of the tasks at the end of the epoch. We prove that our scheme asymptotically converges to a model that satisfies the target loss constraints (if feasible), provided the learning rate, control step size, and epoch lengths are properly tuned. We experiment on deep neural networks to demonstrate that our scheme is effective even beyond our theoretical assumptions.
|
| |
| 17:45-18:00, Paper FrC08.6 | |
| Boosting the Transient Performance of Reference Tracking Controllers with Neural Networks |
|
| Kirsch, Nicolas | EPFL |
| Massai, Leonardo | EPFL |
| Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Neural networks, Machine learning
Abstract: Reference tracking is a fundamental goal in many control systems, particularly those with complex nonlinear dynamics. While traditional control strategies can achieve steady-state accuracy, they often fall short in explicitly optimizing transient performance. Neural network controllers have emerged as a flexible solution to handle nonlinearities and disturbances, yet they typically lack formal guarantees on closed-loop stability and performance. To bridge this gap, the recently proposed Performance Boosting (PB) framework offers a principled way to optimize generic transient costs while preserving the mathcal{L}_p-stability of nonlinear systems. In this work, we extend the PB framework to tackle reference tracking problems. First, we characterize the complete set of nonlinear controllers that retain the tracking properties of a given baseline reference-tracking controller. Next, we show how to optimize transient costs while searching within subsets of tracking controllers that incorporate expressive neural network models. We also analyze the robustness of the proposed method under uncertainties in the system dynamics. Finally, numerical experiments on a robotic system illustrate the performance gains of our approach compared to the standard PB framework.
|
| |
| 18:00-18:15, Paper FrC08.7 | |
| Scalable Decomposition for Stability Analysis of Feedback Systems with High-Dimensional Neural Network Components |
|
| Wang, Zichen | University of Illinois Urbana Champaign |
| Seiler, Peter | University of Michigan, Ann Arbor |
| Dullerud, Geir E. | University of Minnesota |
| Hu, Bin | University of Illinois at Urbana-Champaign |
Keywords: Neural networks, Robust control, Machine learning
Abstract: Stability conditions of dynamical systems with neural network components can often be formulated as semidefinite programs (SDPs). However, such SDP-based conditions typically scale poorly with the depth and width of the neural networks, leading to severe computational challenges. In this paper, we address this scalability issue by developing a matrix decomposition approach for contraction-based stability analysis of feedback systems with high-dimensional neural network modules. By leveraging novel analytical derivations, we develop new SDP conditions whose computational complexity depends solely on the final layer width, achieving effective independence from network depth and overall architecture size.This depth-agnostic framework substantially enhances the scalability of stability analysis for systems with large neural networks. We demonstrate the scalability and effectiveness of our methods through a series of numerical studies on systems with large neural network components, accompanied by comprehensive comparisons against existing stability conditions.
|
| |
| 18:15-18:30, Paper FrC08.8 | |
| Robust Convolution Neural ODEs Via Contractivity-Promoting Regularization |
|
| Zakwan, Muhammad | ETH Zurich |
| Xu, Liang | Shanghai University |
| Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Pattern recognition and classification, Machine learning, Learning
Abstract: Neural networks can be fragile to input noise and adversarial attacks. In this work, we consider Convolutional Neural Ordinary Differential Equations (NODEs) – a family of continuous-depth neural networks represented by dynamical systems - and propose to use contraction theory to improve their robustness. For a contractive dynamical system two trajectories starting from different initial conditions converge to each other exponentially fast. Contractive Convolutional NODEs can enjoy increased robustness as slight perturbations of the features do not cause a significant change in the output. Contractivity can be induced during training by using a regularization term involving the Jacobian of the system dynamics. To reduce the computational burden, we show that it can also be promoted using carefully selected weight regularization terms for a class of NODEs with slope-restricted activation functions. The performance of the proposed regularizers is illustrated through benchmark image classification tasks on MNIST and FashionMNIST datasets, where images are corrupted by different kinds of noise and attacks.
|
| |
| FrC09 |
Oceania VIII |
| Estimation II |
Regular Session |
| Chair: Alessandri, Angelo | University of Genoa |
| Co-Chair: Romero, Jose Guadalupe | Instituto Tecnológico Autónomo De México |
| |
| 16:30-16:45, Paper FrC09.1 | |
| Design of Robust Moving Horizon Estimators for Linear Systems Using Incremental Input/Output-To-State Stability |
|
| Alessandri, Angelo | University of Genoa |
Keywords: Estimation, LMIs, Numerical algorithms
Abstract: This paper builds upon recent advances in the robust stability analysis of moving-horizon estimators for discrete-time nonlinear systems, with a particular focus on linear systems and quadratic cost functions. Toward this end, we carefully select the weights for the cost function, adhering to the established conditions, and leveraging the property of incremental input/output-to-state stability, a key property required for systems targeted by robust estimation. Within this framework, a new design method based on optimization is proposed, incorporating linear matrix inequalities as constraints within a tailored optimization problem, to guide the selection of appropriate parameters for the moving-horizon cost function. The effectiveness of the resulting moving-horizon estimators is evaluated through a numerical case study, demonstrating superior performance compared to the traditional Luenberger observer.
|
| |
| 16:45-17:00, Paper FrC09.2 | |
| Simultaneous Inertia Estimation and Trajectory Tracking Control for the Quadrotor Rotational Dynamics |
|
| Romero, Jose Guadalupe | Instituto Tecnológico Autónomo De México |
| Martinez-Ramirez, Marco Antonio | CINVESTAV |
| Gándara-Sánchez, J. Antonio | CINVESTAV |
| Rodríguez-Cortés, Hugo | CINVESTAV-IPN |
Keywords: Estimation, Robust adaptive control, Autonomous vehicles
Abstract: This work presents an innovative composite Proportional Integral-Derivative (PID) controller for quadrotor rotational dynamics, which simultaneously ensures trajectory tracking and estimates unknown inertia parameters. The controller combines a Passivity-based control law with an updated adaptive law, which involves a new Linear Regression Equation (LRE) based on the power balance parametrization technique and the recently reported Least Squares plus Dynamic Regressor Extension and Mixing (LS+DREM) estimator. The excellent performance of the proposed controllers is validated through real-time experimental flights.
|
| |
| 17:00-17:15, Paper FrC09.3 | |
| Distributed Simultaneous Centroid Estimation and Formation Tracking Control Using Relative Position Measurement |
|
| Choopojcharoen, Thanacha | University of Waterloo |
| Selim, Erman | Ege University |
| Fidan, Baris | University of Waterloo |
Keywords: Estimation, Decentralized control, Distributed control
Abstract: This paper presents a distributed framework for rigid formation control and trajectory tracking in n-dimensional space (n = 2, 3), addressing the challenge of limited sensing capabilities where each agent measures only the relative positions of its neighbors. The proposed approach integrates estimation and control in a two-layer algorithm that enables decentralized coordination among agents. The first layer employs a consensus-based self-estimation law, ensuring each agent’s position estimate converges exponentially to its true value under minimal localization assumptions. The second layer extends this capability, allowing agents to estimate the positions of their peers, resulting in global knowledge of the formation. Using these estimates, the formation’s centroid and orientation are computed and tracked along predefined trajectories. The control design includes three laws: one for maintaining inter-agent distances, one for guiding the centroid along a positional trajectory, and one for aligning the formation’s orientation, using unit complex numbers in 2D and unit quaternions in 3D. Lyapunov-based analysis establishes exponential convergence for all estimation components. Simulations demonstrate the formation’s ability to maintain geometric integrity, track trajectories, and achieve desired orientations, all within a distributed framework.
|
| |
| 17:15-17:30, Paper FrC09.4 | |
| When Atomic Norm Meets the G-Filter: A General Framework for Line Spectral Estimation |
|
| Zhu, Bin | Sun Yat-Sen University |
| Tang, Jiale | Sun Yat-Sen University |
Keywords: Estimation
Abstract: This paper proposes a novel approach for line spectral estimation which combines Georgiou’s filter bank (G-filter) with atomic norm minimization (ANM). A key ingredient is a Carathéodory–Fejér-type decomposition for the covariance matrix of the filter output. The resulting optimization problem can be characterized via semidefinite programming and contains the standard ANM for line spectral estimation as a special case. Simulations show that our approach outperforms the standard ANM in terms of recovering the number of spectral lines when the signal-to-noise ratio is no lower than 0 dB and the G-filter is suitably designed.
|
| |
| 17:30-17:45, Paper FrC09.5 | |
| Leader Selection and Control Design for Topology Estimation of Dynamical Networks |
|
| Wang, Nana | Royal Institute of Technology (KTH) |
| Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Estimation, Control of networks, Agents-based systems
Abstract: We propose a framework for selecting leaders and employing active control to guarantee accurate topology estimation in finite time for dynamical networks. After determining the optimal or suboptimal solution of the minimum leader number which renders strongly structurally controllable, two topology estimation algorithms with active control design schemes are proposed. The first, employing the integral of the states and control input and building an equation of its topology matrix, based on the dynamics of the original network, gives a unique solution for a symmetric topological matrix and the subspace of an asymmetric topological matrix. The second, building upon an auxiliary network and comparing the difference with the original network, provides a guarantee for accurate topology estimation for both stable and unstable dynamical networks in finite time. Finally, a relevant simulation example verifies the performance of the proposed methods.
|
| |
| 17:45-18:00, Paper FrC09.6 | |
| Optimal Control of Calibration Via Fisher Information |
|
| Blomqvist, Karl | Swedish Defense Research Agency |
| Hamberg, Johan | Swedish Defence Research Agency |
Keywords: Information theory and control, Estimation, Optimal control
Abstract: A coordinate independent technique of recursively choosing recalibration of a measurement system is presented. The approach is likelihood based and is formulated in terms of Information Geometry. A Riemannian metric is used for representing the importance of precision in different directions. The technique is illustrated in a localization scenario, where the error model is derived from the von Mises-Fisher family.
|
| |
| 18:00-18:15, Paper FrC09.7 | |
| Accelerated SPSA-Based Consensus Algorithm for Mutual Device Positioning |
|
| Chernov, Andrey | Saint Petersburg State University |
| Erofeeva, Victoria | Institute for Problems in Mechanical Engineering of RAS |
| Granichin, Oleg | Saint Petersburg State University |
| Sudomir, Anastasiia | Saint Petersburg State University |
Keywords: Adaptive control, Agents-based systems, Estimation
Abstract: Positioning systems are crucial in multiple fields such as Internet of Things due to their wide-ranging applications across smart environments and industries. Existing methods for global and local positioning are not effective in certain scenarios. This paper presents an advanced mutual positioning solution that combines a randomized stochastic optimization algorithm tailored for dynamic systems under unknown-but-bounded disturbances with a consenus method. The proposed method addresses measurement noise while maintaining computational efficiency. The approach is validated through numerical simulations, demonstrating its effectiveness in real-time positioning tasks within complex networks.
|
| |
| 18:15-18:30, Paper FrC09.8 | |
| A Decentralized Variational Estimation Scheme for Relative Rotational Motion in a Multi-Agent Network of Rigid Body Vehicles |
|
| Sanyal, Amit | Syracuse University |
| Safaei Hashkavaei, Nazanin | Syracuse University |
| Srinivasu, Neon | Syracuse University |
| Sukumar, Srikant | Indian Institute of Technology Bombay |
Keywords: Algebraic/geometric methods, Estimation, Variational methods
Abstract: In this article, a decentralized relative attitude state observer using rotation matrices is designed for a multi-vehicle system. This system is modeled as a multi-agent rigid body system (MARBS). Several challenges arise in the analysis of multi-vehicle systems modeled as MARBS: one of these challenges is estimating the relative rotational motions of vehicles in such systems. Here, a relative attitude and relative angular velocity estimation scheme based on the Lagrange-d’Alembert principle of variational mechanics, is designed. Estimates of absolute attitudes and angular velocities of agents (vehicles) in the system are shared with other vehicles they are communicating with. These estimates are used to design a Lagrangian, obtained as the difference between a kinetic energy-like term and a potential energy-like function. Thereafter, a decentralized variational estimation scheme is designed to estimate the relative attitude motion states and its stability is shown analytically. This estimation scheme is then discretized in the form of a Lie group variational integrator (LGVI) for numerical simulations, by applying the discrete Lagrange-d’Alembert principle. Finally, numerical simulations indicate that the estimated states converge to a bounded neighborhood of the true states of the multi-vehicle system in the presence of noise in absolute state estimates.
|
| |
| FrC11 |
Oceania VI |
| Network Analysis and Control III |
Regular Session |
| Chair: Pequito, Sergio | Instituto Superior Tecnico, University of Lisbon |
| Co-Chair: Soudjani, Sadegh | Max Planck Institute for Software Systems |
| |
| 16:30-16:45, Paper FrC11.1 | |
| Optimal Interventions in Opinion Dynamics on Large-Scale, Time-Varying, Random Networks |
|
| Cianfanelli, Leonardo | Politecnico Di Torino |
| Como, Giacomo | Politecnico Di Torino |
| Fagnani, Fabio | Politecnico Di Torino |
| Ozdaglar, Asu | MIT |
| Parise, Francesca | Cornell University |
Keywords: Control of networks, Network analysis and control
Abstract: We consider two optimization problems in which a planner aims to influence the average transient opinion in the Friedkin-Johnsen dynamics on a network by intervening on the agents' innate opinions. Solving these problems requires full network knowledge, which is often not available because of the cost involved in collecting this information or due to privacy considerations. For this reason, we focus on intervention strategies that are based on statistical instead of exact knowledge of the network. We focus on a time-varying random network model where the network is resampled at each time step and formulate two intervention problems in this setting. We show that these problems can be casted into mixed integer linear programs in the type space, where the type of a node captures its out- and in-degree and other local features of the nodes, and provide a closed form solution for one of the two problems. The integer constraints may be easily removed using probabilistic interventions leading to linear programs. Finally, we show by a numerical analysis that there are cases in which the derived optimal interventions on time-varying networks can lead to close to optimal interventions on fixed networks.
|
| |
| 16:45-17:00, Paper FrC11.2 | |
| Coupling Induced Stabilization of Network Dynamical Systems and Switching |
|
| Mouyebe, Moise R | University of Michigan |
| Bloch, Anthony M. | Univ. of Michigan |
Keywords: Switched systems, Network analysis and control, Stability of nonlinear systems
Abstract: This paper investigates the stability and stabilization of diffusively coupled network dynamical systems. We leverage Lyapunov methods to analyze the role of coupling in stabilizing or destabilizing network systems. We derive critical coupling parameter values for stability and provide sufficient conditions for asymptotic stability under arbitrary switching scenarios, thus highlighting the impact of both coupling strength and network topology on the stability analysis of such systems. Our theoretical results are supported by numerical simulations.
|
| |
| 17:00-17:15, Paper FrC11.3 | |
| On the Strong Structural Controllability of Switched Linear Systems |
|
| K C, Rajendra Prasad | Indian Institute of Technology Kharagpur |
| Dilip, Sanand | IIT Kharagpur |
| Athalye, Chirayu D. | BITS Pilani, K K Birla Goa Campus |
Keywords: Switched systems, Control of networks, Networked control systems
Abstract: Structural controllability becomes important in the study of large scale complex systems such as networked systems. In this paper, we consider strong structural controllability of switched linear systems. We give sufficient conditions and a separate necessary condition for the strong structural controllability of switched linear systems. We leverage results of [1] for the strong structural controllability of LTI systems to obtain these conditions. Both these conditions are given using the graph theoretic framework and they can be checked in polynomial time. We briefly discuss some issues about finding the graph theoretic conditions which are both necessary and sufficient.
|
| |
| 17:15-17:30, Paper FrC11.4 | |
| Sensor Placement Effects on Distributed Kalman Filtering in Cyclic Networks |
|
| Yu, Lirui | University of Alberta |
| Zheng, Yiming | University of Alberta |
| Zhao, Qing | Univ. of Alberta |
| Shu, Zhan | University of Alberta |
Keywords: Sensor networks, Network analysis and control, Distributed control
Abstract: This paper investigates the impact of sensor placement on the performance of distributed Kalman filters (DKFs) in cyclic wireless sensor networks (WSNs). We show that in networks with limited sensing coverage, improper sensor placement can significantly degrade estimation accuracy, regardless of the consensus algorithm used. Focusing on the high-pass dynamic average consensus (HP-DAC) protocol, we derive an analytical expression for the filter output for cycle graphs. For cyclic networks, we characterize the locations of zero entries in the eigenvector matrix of the Laplacian and establish design guidelines for optimal sensor placement in both small- and large-scale settings. Theoretical results are validated through simulations, demonstrating that proper sensor placement can significantly enhance DKF estimation performance in sparse WSNs.
|
| |
| 17:30-17:45, Paper FrC11.5 | |
| Sensor Placement in District Heating Networks Using Frequency-Domain Gramians |
|
| Sibeijn, Max | Delft University of Technology |
| Pequito, Sergio | Instituto Superior Tecnico, University of Lisbon |
| Boskos, Dimitris | TU Delft |
| Khosravi, Mohammad | Delft University of Technology |
Keywords: Time-varying systems, Network analysis and control, Energy systems
Abstract: District heating networks (DHNs) are essential in providing efficient heating services to urban areas through networked pipes. The performance of these systems critically depends on the strategic placement of thermal storage buffers (actuators) and temperature sensors throughout the network. Due to the inherent slow dynamics of thermal transport, these systems exhibit significant delays and periodic behaviors that necessitate time-varying analysis approaches. This paper presents a frequency-domain framework for optimal actuator and sensor placement in DHNs, focusing on metrics derived from frequential Gramians. We provide rigorous analysis of two key metrics, namely the trace and log-determinant of the frequential Gramian, establishing submodularity properties and performance guarantees for greedy selection algorithms. Our theoretical framework naturally handles both the periodic nature of DHNs and their slow transients, outperforming standard approaches in estimation accuracy.
|
| |
| 17:45-18:00, Paper FrC11.6 | |
| Synchronization and Regulation of Epileptic Dynamics in Networked Bi-Stable Oscillators |
|
| Liu, Zonglin | University of Kassel |
| Yuzhen, Qin | Donders Institute |
| van Gerven, Marcel | Radboud University |
| Stursberg, Olaf | University of Kassel |
Keywords: Networked control systems, Network analysis and control, Biological systems
Abstract: This paper studies networks of diffusively-coupled bi-stable oscillators. Each oscillator has been shown in the literature to be ideal for capturing epileptic dynamics, since its stable equilibrium can describe normal neural activity, while its stable limit cycle signifies seizure events. To investigate seizure propagation in brain networks, synchronization is investigated. A sufficient condition is established, under which the oscillators are synchronized. Interestingly, a synchronized network inherits bistability from individual oscillators. It can transition between a stable equilibrium and a limit-cycle manifold, capturing network-level onsets and offsets of generalized seizures. To prevent the network from converging into the limit-cycle manifold, a controller that targets a subset of the oscillators is proposed, which is proven to globally stabilize the equilibrium. Numerical studies are conducted to validate the theoretical findings.
|
| |
| 18:00-18:15, Paper FrC11.7 | |
| Computation of Feasible Assume-Guarantee Contracts: A Resilience-Based Approach |
|
| Monir, Negar | Newcastle University |
| Ait Si, Youssef | Mohammed VI Polytechnic University |
| Das, Ratnangshu | Indian Institute of Science, Bangalore |
| Jagtap, Pushpak | Indian Institute of Science |
| Saoud, Adnane | University Mohammed VI Polytechnic |
| Soudjani, Sadegh | Max Planck Institute for Software Systems |
Keywords: Network analysis and control, Formal Verification/Synthesis, Computational methods
Abstract: We propose a resilience-based framework for computing feasible assume-guarantee contracts that ensure the satisfaction of temporal specifications in interconnected discrete-time systems. Interconnection effects are modeled as structured disturbances. We use a resilience metric, the maximum disturbance under which local specifications hold, to refine assumptions and guarantees across subsystems iteratively. We first demonstrate correctness and monotone refinement of guarantees for two subsystems. Then, we extend our approach to general networks of L subsystems using weighted combinations of interconnection effects. We instantiate the framework on linear systems by meeting finite-horizon safety, exact-time reachability, and finite-horizon reachability specifications, and on nonlinear systems by fulfilling general finite-horizon specifications. Our approach is demonstrated through numerical linear examples and a nonlinear DC microgrid case study, showcasing the impact of our framework on verifying temporal logic specifications with compositional reasoning.
|
| |
| 18:15-18:30, Paper FrC11.8 | |
| Distributed Incast Detection in Data Center Networks |
|
| Zheng, Yiming | University of Alberta |
| Qi, Haoran | University of Alberta |
| Yu, Lirui | University of Alberta |
| Shu, Zhan | University of Alberta |
| Zhao, Qing | Univ. of Alberta |
Keywords: Communication networks, Fault detection, Network analysis and control
Abstract: Incast traffic in data centers can lead to severe performance degradation, such as packet loss and increased latency. Effectively addressing incast requires prompt and accurate detection. Existing solutions, including MA-ECN, BurstRadar and Pulser, typically rely on fixed thresholds of switch port egress queue lengths or their gradients to identify microburst caused by incast flows. However, these queue length related methods often suffer from delayed detection and high error rates. In this study, we propose a distributed incast detection method for data center networks at the switch-level, leveraging a probabilistic hypothesis test with an optimal detection threshold. By analyzing the arrival intervals of new flows, our algorithm can immediately determine if a flow is part of an incast traffic from its initial packet. The experimental results demonstrate that our method offers significant improvements over existing approaches in both detection speed and inference accuracy.
|
| |
| FrC12 |
Oceania X |
| Optimization and Computational Methods |
Regular Session |
| Chair: Pohl, Volker | Technische Universität München |
| Co-Chair: Paternain, Santiago | Rensselaer Polytechnic Institute |
| |
| 16:30-16:45, Paper FrC12.1 | |
| On the Convexification of Non-Linear Optimization Problems under Performance Guarantees |
|
| Boche, Holger | Technische Universitaet Muenchen |
| Pohl, Volker | Technische Universität München |
| Poor, H. Vincent | Princeton Univ |
Keywords: Optimization algorithms, Computational methods, Extremum seeking
Abstract: The convexification operator is the mapping that determines for every given continuous function on the real axis its greatest convex minorant. It is needed to convexify non-convex optimization problems in order to apply powerful convex optimization techniques to solve them. This paper shows that the convexification operator cannot be implemented on a Turing machine. In particular, we show that there exist piecewise linear continuous functions with a unique global minimum such that their greatest convex minorant is not Turing computable. Furthermore, for these functions there do not even exist monotonically increasing, computable sequences of convex computable continuous functions that converge to the greatest convex minorant. The same results also hold for the least concave majorants of continuous functions.
|
| |
| 16:45-17:00, Paper FrC12.2 | |
| A Bi-Level Optimization Method for Redundant Dual-Arm Minimum Time Problems |
|
| Fried, Jonathan | Rensselaer Polytechnic Institute |
| Paternain, Santiago | Rensselaer Polytechnic Institute |
Keywords: Optimization algorithms, Robotics
Abstract: In this work, we present a method for minimizing the time required for a redundant dual-arm robot to follow a desired relative Cartesian path at constant path speed by optimizing its joint trajectories, subject to position, velocity, and acceleration limits. The problem is reformulated as a bi-level optimization whose lower level is a convex, closed-form subproblem that maximizes path speed for a fixed trajectory, while the upper level updates the trajectory using a single-chain kinematic formulation and the subgradient of the lower-level value. Numerical results demonstrate the effectiveness of the proposed approach.
|
| |
| 17:00-17:15, Paper FrC12.3 | |
| Privacy-Preserving Convex Optimization: When Differential Privacy Meets Stochastic Programming |
|
| Dvorkin, Vladimir | University of Michigan |
| Fioretto, Ferdinando | University of Virginia |
| Van Hentenryck, P. | Georgia Institute of Technology |
| Pinson, Pierre | Imperial College London |
| Kazempour, Jalal | Technical University of Denmark |
Keywords: Control Systems Privacy, Optimization, Power systems
Abstract: Convex optimization finds many applications where optimization results may expose private data (e.g., health records, commercial information). To guarantee privacy to optimization data owners, we develop a new privacy-preserving perturbation strategy for convex optimization programs by combining stochastic (chance-constrained) programming and differential privacy. Unlike standard noise-additive strategies, which perturb either optimization data or result, we formulate optimization variables as functions of a random perturbation using linear decision rules; we then optimize these rules to accommodate the perturbation within the feasible region using chance constraints. The perturbation becomes feasible and makes adjacent—in the sense of some distance function—optimization datasets statistically similar in randomized optimization results, thereby enabling privacy guarantees.
|
| |
| 17:15-17:30, Paper FrC12.4 | |
| A Laplace Duality for Integration |
|
| Lasserre, Jean B. | LAAS-CNRS and Institute of Mathematics, University OfToulouse |
Keywords: Computational methods, Numerical algorithms, Optimization
Abstract: We consider the integral v(y)=int_{K_y}f(x)dx on a domain K_y={xinmathbb{R}^d: g(x)leq y}, where g is nonnegative and K_y is compact for all yin [0,+infty). Under some assumptions, we show that for every yin (0,infty) there exists a distinguished scalar lambda_yin (0,+infty) such that v(y)=int_{mathbb{R}^d}f(x)exp(-lambda_y,g(x)),dx, which is the counterpart analogue for integration of Lagrangian duality for optimization. A crucial ingredient is the Laplace transform, the analogue for integration of Legendre-Fenchel transform in optimization. In particular, if both f and g are positively homogeneous then lambda_y is a simple explicitly rational function of y. In addition if g is quadratic form then computing v(y) reduces to computing the integral of f with respect to a specific Gaussian measure for which exact and approximate numerical methods (e.g. cubatures) are available.
|
| |
| 17:30-17:45, Paper FrC12.5 | |
| Mechanics As a Convex Quadratic Programming Problem with Application to Incompressible Flows |
|
| Anand, Kshitij | University of California, Irvine |
| Taha, Haithem | University of California, Irvine |
Keywords: Computational methods, Optimization, Variational methods
Abstract: Gauss’s principle of least constraint transforms a dynamics problem into a pure minimization framework. We show that this minimization problem is a Strongly Convex Quadratic Programming (SCQP) problem whose necessary condition is Newton’s equation of motion. The principle of minimum pressure gradient (PMPG) is to incompressible flows what Gauss’s principle is to particle and rigid-body dynamics. The principle asserts that an incompressible flow evolves from one instant to another by minimizing the L2-norm of the pressure gradient force. That is, Navier-Stokes equation is the first-order necessary condition for minimizing the pressure gradient cost. Here, we show that the PMPG transforms the incompressible fluid mechanics problem into a pure minimization framework, allowing one to determine the evolution of the flow field by solely focusing on minimizing the cost—without directly invoking the Navier-Stokes equation. Moreover, we formulate the resulting minimization problems from Gauss’s principle and the PMPG as a SCQP problem—one of the most computationally tractable classes in nonlinear optimization, which has a rich literature with many efficient algorithms. This formulation eliminates the daunting task of solving the Poisson equation in pressure at each time step. Rather, it replaces it with a SCQP problem.
|
| |
| 17:45-18:00, Paper FrC12.6 | |
| Separated Representation of Approximated Koopman Operator Using Tensor-Product Bases |
|
| Xu, Zhi | Purdue University |
| Dai, Ran | Purdue University |
Keywords: Numerical algorithms, Computational methods, Optimization
Abstract: This paper explores the use of separated representation to improve the scalability of the Koopman operator for both continuous- and discrete-time dynamical systems. We propose an approximation method for the Koopman operator using tensor-product bases. The approach comprises two main steps. In the first step, we utilize the Galerkin method to derive a tensor-based approximate Koopman operator, which subsequently aids in the development of a decomposition algorithm for achieving a separable operator. The second step finds the separated representation via an efficient algorithm to decompose the operator by exploiting its low-rank tensor structure. Numerical experiments demonstrate that the proposed method provides high accuracy with significantly reduced computational effort. The results highlight the potential of separated representation for handling high-dimensional systems and advancing the applicability of Koopman operator-based techniques.
|
| |
| 18:00-18:15, Paper FrC12.7 | |
| Harnessing Topology and Causal Discovery for Dynamic Analysis and Particulate Gel Control |
|
| Smith, Alexander | University of Minnesota |
| Daoutidis, Prodromos | Univ. of Minnesota |
Keywords: Computational methods, Modeling, Distributed parameter systems
Abstract: Particulate gels, characterized by multi-scale structures and dynamic behaviors, present significant challenges for quantitative analysis and control due to their evolving complexity under external stimuli. In this work, we integrate topological data analysis (TDA) and causal discovery methodologies to analyze particulate gels subjected to cyclic shear deformation. Utilizing persistence homology, we extract meaningful topological features across various scales, capturing critical structural transitions. These features are further processed through the Automatic Topologically-Oriented Learning (ATOL) algorithm, enabling their representation in a low-dimensional vector space suitable for causal analysis. Employing a VARLiNGAM causal discovery framework, we uncover multi-scale causal relationships, illustrating both top-down and bottom-up flows of structural information within the gel network. Our approach offers insights into the hierarchical organization and dynamic evolution of particulate gels, paving the way for enhanced predictive modeling and control strategies applicable to advanced soft material systems.
|
| |
| 18:15-18:30, Paper FrC12.8 | |
| Geometric Integrators for Mechanical Systems on Lie Groups |
|
| Vivek, Viyom | IIT Bombay |
| Martin de Diego, David | High Council for Scientific Research-CSIC |
| Banavar, Ravi N. | Indian Institute of Technology |
Keywords: Algebraic/geometric methods, Numerical algorithms
Abstract: Retraction and discretization maps form the seed for many numerical integrators, and hence provide a general framework for discretization methods on manifolds. This approach has been extended to carry out discretizations on both the tangent and cotangent bundle leading to structure preserving integrators for mechanical systems. We explore the particular case when the configuration space happens to be a Lie group and the mechanical system exhibits certain symmetries. This case is especially interesting since it appears, for instance, on the equations of the rigid body, heavy top and ideal fluids as some special cases. In such a scenario, the discretization framework simplifies owing to the symmetries and the fact that Lie groups along with their tangent and cotangent bundles are parallelizable. The geometric integrator thus obtained can be used to discretize the Lie-Poisson-type equations that govern the motion of many mechanical systems, and more importantly, easily extend to systems with forces and optimal control problems where the configuration space is a Lie group.
|
| |
| FrC15 |
Capri II |
| Constrained Control II |
Regular Session |
| Chair: Fornaro, Pedro | Centre for Ocean Energy Research - Maynooth University, Ireland |
| Co-Chair: Dabbene, Fabrizio | CNR-IEIIT |
| |
| 16:30-16:45, Paper FrC15.1 | |
| Global Adaptive Performance Control with Input Saturation: A Low-Complexity Approach |
|
| Lai, Wenxin | Shanghai Jiao Tong University |
| Li, Yuanlong | Shanghai Jiao Tong University |
| Lin, Zongli | University of Virginia |
Keywords: Nonlinear systems, Constrained control, Adaptive control
Abstract: In this paper, we propose a global adaptive performance control strategy for the output tracking problem of unknown nonlinear MIMO systems subject to input saturation and external disturbances. We begin by proposing a new type of global performance functions with an initial value tending to +∞, by which the constraint on initial condition dependence typically found in existing prescribed performance control methods can be eliminated. Then, with the help of a saturation-driven auxiliary system, we modify the proposed global performance function adaptively and present a global adaptive performance function, by which the conflict between input saturation and output performance constraint can be resolved. Based on these, we design a low-complexity controller to solve the tracking problem without the need for approximation structures. Our development starts with case where output consists of the full state and then, inspired by sliding mode control, extends to higher-order MIMO systems. Finally, two numerical examples demonstrate the effectiveness of the proposed strategy.
|
| |
| 16:45-17:00, Paper FrC15.2 | |
| Approximate Optimal Control for Nonlinear System with State Constraints and Partially Unknown Dynamics |
|
| Geng, Fan | Tongji University |
| Dong, Yi | Tongji University |
| Zhou, Liubin | Wuhan 2nd Ship Design and Research Institute |
| Li, Rongyan | Tongji University |
Keywords: Nonlinear systems, Constrained control, Reinforcement learning
Abstract: This paper considers the approximate optimal control problem for a nonlinear system under state constraints and with partially unknown dynamics. A penalty term utilizing barrier functions is first introduced to handle asymmetric constraints. Then a neural network identifier incorporating concurrent learning is developed for the estimation of unknown dynamics, relaxing the persistent excitation condition and enabling policy optimization via model-based reinforcement learning. Our design guarantees that the system achieves approximate optimal performance while satisfying the state constraints. Simulation results validate the constraint satisfaction and near-optimal performance.
|
| |
| 17:00-17:15, Paper FrC15.3 | |
| Quadrotor Trajectory Tracking: An Almost Global Full-State Solution |
|
| Madeiras, João | Instituto Superior Técnico |
| Cardeira, Carlos | IDMEC/Instituto Superior Tecnico |
| Oliveira, Paulo | Instituto Superior Técnico |
Keywords: Stability of nonlinear systems, Optimal control, Constrained control
Abstract: This paper presents a novel control strategy for underactuated quadrotors, unifying position and attitude dynamics into a single full-state feedback framework. Departing from conventional hierarchical methods, the approach redefines the complete system as a triple integrator with dual inputs, linearly coupled to the attitude error, while angular velocity and thrust serve as direct control inputs. A nonlinear transformation renders the complete system dynamics linear time-invariant (LTI), enabling the application of Linear Quadratic Regulator (LQR) optimization for exponential stability within a prescribed linear region. Key to this formulation is a prescribed bounded attitude error definition that ensures almost global convergence while avoiding singularities inherent to mathbb{S}^2 representations. The resulting controller explicitly accounts for the feedback interconnection term, eliminating reliance on nested loops or backstepping. Asymptotic stabilization under input saturation is rigorously validated through numerical simulations, showcasing its potential and effectiveness.
|
| |
| 17:15-17:30, Paper FrC15.4 | |
| Control Barrier Function Synthesis for Nonlinear Systems with Dual Relative Degree |
|
| Bahati, Gilbert | California Institute of Technology |
| Cosner, Ryan | California Institute of Technology |
| Cohen, Max | North Carolina State University |
| Bena, Ryan | California Institute of Technology |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Lyapunov methods, Nonlinear systems, Constrained control
Abstract: Control barrier functions (CBFs) are a powerful tool for synthesizing safe control actions; however, constructing CBFs remains difficult for general nonlinear systems. In this work, we provide a constructive framework for synthesizing CBFs for systems with dual relative degree—where different inputs influence the outputs at two different orders of differentiation; this is common in systems with orientation-based actuation, such as unicycles and quadrotors. In particular, we propose dual relative degree CBFs (DRD-CBFs) and show that these DRD-CBFs can be constructively synthesized and used to guarantee system safety. Our method constructs DRD-CBFs by leveraging the dual relative degree property—combining a CBF for an integrator chain with a Lyapunov function certifying the tracking of safe inputs generated for this linear system. We apply these results to dual relative degree systems, both in simulation and experimentally on hardware using quadruped and quadrotor robotic platforms.
|
| |
| 17:30-17:45, Paper FrC15.5 | |
| A Causal Approach to Hard Constrained Control of Wave Energy Systems Based on Implicit Gaussian Differential Equation |
|
| Anderson-Azzano, Jorge Luis | UNLP |
| Fornaro, Pedro | Centre for Ocean Energy Research - Maynooth University, Ireland |
| Puleston, Paul Frederick | Universidad Nacional De La Plata |
| Ringwood, John V. | Maynooth University, Ireland |
Keywords: Control applications, Constrained control, Variable-structure/sliding-mode control
Abstract: This paper introduces a novel method to causally satisfy hard position and velocity constraints in wave energy systems. The constraint mechanism is simple to implement, computationally efficient, and does not require complex tuning or optimisation techniques. The proposed strategy handles system constraints by modulating a velocity reference with a Gaussian-like envelope function that depends on both position and velocity, which results in nonlinear closed-loop dynamics. In this context, this paper focuses on the stability of the constrained closed-loop dynamics, and it is proven that, for a set of initial conditions within the constraint region, the system trajectories remain within the prescribed limits. Finally, in-silico evaluations demonstrate that the Gaussian-like function effectively guarantees compliance with system constraints and is broadly applicable to wave energy systems.
|
| |
| 17:45-18:00, Paper FrC15.6 | |
| Backstepping Reach-Avoid Controller Synthesis for Multi-Input Multi-Output Systems with Mixed Relative Degrees |
|
| Ding, Jianqiang | Aalto University |
| Yuan, Dingran | Aalto University |
| Deka, Shankar | School of Electrical Engineering, Aalto University |
Keywords: Constrained control, Feedback linearization, Backstepping
Abstract: Designing controllers with provable formal guarantees has become an urgent requirement for cyber-physical systems in safety critical scenarios. In this paper, by leveraging feedback linearization and backstepping techniques, we present a novel framework for constructing provable reach-avoid formal certificates tailored to multi-input multi-output systems. Based on this, we developed a systematic synthesis approach for controllers with reach-avoid guarantees, which ensures that the outputs of the system eventually enter the predefined target set while staying within the required safe set. Finally, we validate our method in numerical simulations.
|
| |
| 18:00-18:15, Paper FrC15.7 | |
| Robustness Results for Systems with Discontinuous Right-Hand Sides: Applications to Nonsmooth Control Barrier Functions |
|
| Jimenez Cortes, Carmen | Georgia Institute of Technology |
| Watson, Tyler | Mercer University |
| Salgarkar, Chirayu | Rochester Institute of Technology |
| Thitsa, Makhin | Mercer University |
Keywords: Lyapunov methods, Constrained control, Optimization algorithms
Abstract: This paper derives sufficient conditions for local and global asymptotic stability of the superzero level set of a nonsmooth function with respect to systems with discontinuous right-hand sides. If the set is compact and has a continuously differentiable C^1 boundary, we propose simple to test conditions to prove its local asymptotic stability. When the set is compact and convex, we also provide simplified conditions to prove its global asymptotic stability. These latter results do not require the boundary of the set to be C^1. Lastly, we show the practicality of these easier to evaluate conditions through multiple numerical examples.
|
| |
| 18:15-18:30, Paper FrC15.8 | |
| Innovation Diffusion Dynamics Toward Long-Term Behavioral Shifts |
|
| Piccinin, Lisa | Politecnico Di Milano |
| Breschi, Valentina | Eindhoven University of Technology |
| Ravazzi, Chiara | National Research Council of Italy (CNR) |
| Dabbene, Fabrizio | CNR-IEIIT |
| Tanelli, Mara | Politecnico Di Milano |
Keywords: Emerging control applications, Optimal control, Constrained control
Abstract: Sustainable technologies and services can play a pivotal role in the transition to “greener” habits. Their widespread adoption is thus crucial, and understanding how to foster this phenomenon in a systematic way could have a major impact on our future. With this in mind, in this work we propose an extension of the Friedkin-Johnsen opinion dynamics model toward characterizing the long-term impact of (structural) fostering policies. We then propose alternative nudging strategies that target a trade-off between widespread adoption and investments under budget constraints, showing the impact of our modeling and design choices on inclination shifts over a set of numerical tests.
|
| |
| FrC16 |
Capri III |
| Observers for Nonlinear Systems II |
Regular Session |
| Chair: Khajenejad, Mohammad | The University of Tulsa |
| Co-Chair: Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
| |
| 16:30-16:45, Paper FrC16.1 | |
| Systematic Observer Design with Robust Global Convergence Guarantees for a Large Class of Lithium-Ion Battery Models |
|
| Khalil, Mira | CRAN, Université De Lorraine |
| Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
| Raël, Stéphane | Université De Lorraine |
Keywords: Observers for nonlinear systems, Lyapunov methods, Energy systems
Abstract: We present a robust nonlinear Luenberger-like observer for a class of systems that captures many of the finite-dimensional lithium-ion cell models found in the literature including electrical equivalent circuit models and electrochemical models. The observer gain is designed by solving a linear matrix inequality (LMI), which guarantees the robust, global convergence of the estimated state to the true state. We then present the main result, that is, conditions ensuring the LMI is feasible and allowing for the explicit construction of a class of suitable observer gains. These conditions are shown to be satisfied by various existing lithium-ion models thereby making the proposed observer design applicable for these models.
|
| |
| 16:45-17:00, Paper FrC16.2 | |
| A Robust Learning-Based KKL Observer for Nonlinear Systems |
|
| Zhao, Yangyi | Harbin Institute of Technology, Shenzhen |
| Zhao, Ruixuan | University College London |
| Chen, Boli | University College London |
| Wang, Rui | Harbin Institute of Technology, Shenzhen |
| Li, Peng | Harbin Institute of Technology, Shenzhen |
Keywords: Observers for nonlinear systems, Learning, Estimation
Abstract: This paper proposes a robust observer for nonlinear systems. A transformation is designed to convert the nonlinear system into a linear one, utilizing the Volterra integral operator to mitigate the effects of initialization errors. Neural network techniques are then applied to learn both the transformation and its inverse. As a result, the state variables of the nonlinear system can be accurately tracked using the estimated transformation, providing enhanced robustness against measurement noise and model uncertainty. Specifically, the proposed approach reduces the trade-off between convergence speed and noise immunity, which is often influenced by the choice of observer eigenvalue. The paper provides comprehensive error analysis and validation through simulations.
|
| |
| 17:00-17:15, Paper FrC16.3 | |
| Improved Interval Observer Design for Nonlinear Systems with State Lifting |
|
| Hokmi, Sadredin | Sharif University of Technology |
| Khajenejad, Mohammad | The University of Tulsa |
Keywords: Estimation, Observers for nonlinear systems, Nonlinear systems
Abstract: This paper presents a novel interval observer design for discrete-time (DT) and continuous-time (CT) nonlinear systems subject to additive bounded uncertainties. Utilizing the Cayley-Hamilton theorem, our approach reduces the reliance on time-varying or time-invariant coordinate transformations by introducing additional degrees of freedom through a state lifting technique. We further show that the interval observers designed for the lifted system achieve performance at least as good as their counterparts for the original, non-lifted system, with only a minimal increase in computational complexity. Moreover, the increased flexibility provided by the lifting method extends the applicability of the proposed observer to a broader class of systems. The interval observer error for the lifted system is inherently positive/cooperative, obviating the need for additional positivity constraints, and is designed to be input-to-state stable while minimizing the system gain.
|
| |
| 17:15-17:30, Paper FrC16.4 | |
| Guaranteed Privacy-Preserving mathcal{H}_{infty}-Optimal Interval Observer Design for Nonlinear Discrete-Time Systems |
|
| Khajenejad, Mohammad | The University of Tulsa |
Keywords: Estimation, Observers for nonlinear systems, Uncertain systems
Abstract: We propose a novel guaranteed privacy-preserving interval observer design for perturbed nonlinear discrete-time bounded-error systems. Unlike stochastic differential privacy, guaranteed privacy offers strict bounds on the proximity between the ranges of two sets of estimated data. Our approach relies on synthesizing an interval observer for a perturbed nonlinear bounded-error system. The design procedure incor- porates a bounded noise perturbation factor computation and observer gains synthesis based on solving tractable semi-definite programs. Consequently, the observer simultaneously provides guaranteed private and stable interval-valued estimates for the desired variable. We demonstrate the optimality of our design by minimizing the mathcal{H}_{infty} norm of the observer error system. Furthermore, we assess the accuracy of our proposed mechanism by quantifying the loss incurred when considering guaranteed privacy specifications. Simulations illustrate the out-performance of the proposed approach to differential privacy.
|
| |
| 17:30-17:45, Paper FrC16.5 | |
| Interval Observers for Uncertain Hybrid Dynamical Systems with Known Jump Times |
|
| Pati, Tarun | Northeastern University |
| Sanfelice, Ricardo G. | University of California at Santa Cruz |
| Yong, Sze Zheng | Northeastern University |
Keywords: Hybrid systems, Observers for nonlinear systems, Embedded systems
Abstract: This paper presents a novel interval observer design for uncertain hybrid systems with nonlinear dynamics and measurements, assuming known jump times. Leveraging mixed-monotone decompositions, we construct an interval observer that provably bounds the true system state. Global uniform ultimate boundedness of the observer error is guaranteed through a non-expansive observer design, achieved by computing gain matrices using linear Lyapunov-based analysis and 1-norm supply rates. We also introduce a linear transformation to enhance design flexibility. The effectiveness of our approach is validated through simulations of a bouncing ball system.
|
| |
| 17:45-18:00, Paper FrC16.6 | |
| Distributed Resilient Interval Observer Synthesis for Nonlinear Discrete-Time Systems |
|
| Khajenejad, Mohammad | The University of Tulsa |
| Brown, Scott | University of California, San Diego |
| Martinez, Sonia | University of California at San Diego |
Keywords: Fault detection, Observers for nonlinear systems, Resilient Control Systems
Abstract: This article introduces a novel recursive distributed estimation algorithm aimed at synthesizing input and state interval observers for nonlinear bounded-error discrete-time multi-agent systems. The considered systems have sensors and actuators that are susceptible to unknown or adversarial inputs. To solve this problem, we first identify conditions that allow agents to obtain nonlinear bounded-error equations characterizing the input. Then, we propose a distributed interval-valued observer that is guaranteed to contain the disturbance and system states. To do this, we first detail a gain design procedure that uses global problem data to minimize an upper bound on the signal norm of the observer error. We then propose a gain design approach that does not require global information, using only values that are local to each agent. The second method improves on the computational tractability of the first, at the expense of some added conservatism. Further, we discuss some possible ways of extending the results to a broader class of systems. We conclude by demonstrating our observer on two examples. The first is a unicycle system, for which we apply the first gain design method. The second is a 145-bus power system, which showcases the benefits of the second method, due to the first approach being intractable for systems with high dimensional state spaces.
|
| |
| 18:00-18:15, Paper FrC16.7 | |
| Observer Design for Parameter-Varying Persidskii Systems Using Parameter-Independent Lyapunov Functions |
|
| Efimov, Denis | Inria |
| Combastel, Christophe | University of Bordeaux |
| Zolghadri, Ali | Universite Bordeaux |
Keywords: Linear parameter-varying systems, Observers for nonlinear systems, Stability of nonlinear systems
Abstract: This paper addresses the state observer design for parameter-varying Persidskii systems. These systems exhibit dynamics that depend both on time-varying parameters, as in the Linear Parameter Varying (LPV) framework, and sector nonlinearities of the state, as in classical Persidskii systems. A state observer design is proposed, and its stability is analyzed using parameter-independent Lyapunov functions. The conditions for tuning the observer gains are derived within the input-to-output stability framework. They are first expressed as parameterized matrix inequalities, and then further reduced to linear ones under additional mild assumptions. The effectiveness of the proposed design is shown through an academic example.
|
| |
| 18:15-18:30, Paper FrC16.8 | |
| Distributed Prescribed-Time Observer for Nonlinear Systems in Block-Triangular Form |
|
| de Heij, Vincent | University of Groningen |
| Niazi, M. Umar B. | Massachusetts Institute of Technology |
| Johansson, Karl H. | KTH Royal Institute of Technology |
| Ahmed, Saeed | University of Groningen |
Keywords: Observers for nonlinear systems, Networked control systems, Sensor networks
Abstract: This paper proposes a design of a distributed prescribed-time observer for nonlinear systems representable in a block-triangular observable canonical form. Using a weighted average of neighbor estimates exchanged over a strongly connected digraph, each observer estimates the system state despite the limited observability of local sensor measurements. The proposed design guarantees that distributed state estimation errors converge to zero at a user-specified convergence time, irrespective of observers' initial conditions. To achieve this prescribed-time convergence, distributed observers implement time-varying local output injection gains that monotonically increase and approach infinity at the prescribed time. The theoretical convergence is rigorously proven and validated through numerical simulations, where some implementation issues due to increasing gains have also been clarified.
|
| |
| FrC17 |
Capri IV |
| Lyapunov Methods and Applications |
Regular Session |
| Chair: Peixoto, Marcia Luciana da Costa | Université Polytechnique Hauts-De-France |
| Co-Chair: Hamberg, Johan | Swedish Defence Research Agency |
| |
| 16:30-16:45, Paper FrC17.1 | |
| On Modeling, Stabilization and Control of Underwater Vehicles |
|
| Hamberg, Johan | Swedish Defence Research Agency |
Keywords: Modeling, Lyapunov methods, Robotics
Abstract: A derivation of Kirchhoff's equations is given, which is not relying on infinite dimensional reduction theory. The relative equilibria are determined, and a universal stabilizing controller for these is constructed and analyzed. A short description of how these results can be used to control a flexible vehicle is presented.
|
| |
| 16:45-17:00, Paper FrC17.2 | |
| Periodic Convergence for a Class of Nonlinear Time-Delay Systems |
|
| Aleksandrov, Alexander | St. Petersburg State University |
| Efimov, Denis | Inria |
| Ping, Xubin | Xidian University |
Keywords: Delay systems, Lyapunov methods
Abstract: The new existence conditions for periodic steady-state solution in time-delay convergent systems are presented. The main advantage of this result is that highly nonlinear (without meaningful linear approximation) dynamics are allowed for analysis. These conditions are developed for Persidskii and Lotka-Volterra time-delay systems. The efficiency of the approach is demonstrated on academic examples of these models.
|
| |
| 17:00-17:15, Paper FrC17.3 | |
| Is There a Closed-Loop Lagrangian for Hierarchical Motion Control? |
|
| Mishra, Hrishik | German Aerospace Center (DLR) |
| De Stefano, Marco | German Aerospace Center (DLR) |
| Ott, Christian | TU Wien |
Keywords: Robotics, Hierarchical control, Lyapunov methods
Abstract: Yes. In this paper, we prove that the closed-loop Lagrangian in hierarchical motion control is always an Euler-Lagrange system with symmetry, which additionally has geodesic invariance along the symmetry. The hierarchy in motion is imposed as follows: the primary task is a shape potential and the secondary task is a symmetry-breaking potential. The proposed theory enables a hierarchical Passivity-Based Control synthesis, in which the energetic behaviour is prescribed in advance. Using this approach, we derive a hierarchical control for the Floating-base Robotic Mechanisms, which have an inherent Lagrangian symmetry.
|
| |
| 17:15-17:30, Paper FrC17.4 | |
| Ordering and Refining Path-Complete Lyapunov Functions through Composition Lifts |
|
| Jongeneel, Wouter | UCLouvain |
| Jungers, Raphaël M. | University of Louvain |
Keywords: Switched systems, Lyapunov methods, Stability of hybrid systems
Abstract: A fruitful approach to study stability of switched systems is to look for multiple Lyapunov functions. However, in general, we do not yet understand the interplay between the desired stability certificate, the template of the Lyapunov functions and their mutual relationships to accommodate switching. In this work we elaborate on path-complete Lyapunov functions: a graphical framework that aims to elucidate this interplay. In particular, previously, several preorders were introduced to compare multiple Lyapunov functions. These preorders are initially algorithmically intractable due to the algebraic nature of Lyapunov inequalities, yet, lifting techniques were proposed to turn some preorders purely combinatorial and thereby eventually tractable. In this note we show that a conjecture in this area regarding the so-called composition lift, that was believed to be true, is false. This refutal, however, points us to a beneficial structural feature of the composition lift that we exploit to iteratively refine path-complete graphs, plus, it points us to a favourable adaptation of the composition lift.
|
| |
| 17:30-17:45, Paper FrC17.5 | |
| A Novel Unknown Input Observer Design for Nonlinear LPV Systems |
|
| Arango Restrepo, Juan Pablo | IMT Nord Europe CERI |
| Puig, Vicenc | Universitat Politècnica De Catalunya |
| Etienne, Lucien | IMT Lille-Douai |
| Segovia, Pablo | Universitat Politècnica De Catalunya |
| Duviella, Eric | IMT Lille Douai |
| Langueh, Kokou Anani Agbessi | Imt Nord Europe |
Keywords: LMIs, Linear parameter-varying systems, Lyapunov methods
Abstract: This paper presents the design of an unknown input observer (UIO) for linear parameter-varying (LPV) systems, including nonlinearities that are assumed to fulfill one-sided Lipschitz quadratically inner-bounded (OSL-QIB) conditions. The proposed approach introduces a novel extension of conventional LPV frameworks by directly incorporating nonlinear terms, aiming to improve observer performance and reduce the modeling errors typically introduced during the transformation of a nonlinear system into its LPV counterpart. A key contribution of this work is the development of a UIO design that avoids the state transformation step, which is often highly complex and only valid under restrictive assumptions such as a constant unknown input matrix D. By eliminating this constraint, the proposed observer design significantly enhances scalability and applicability to a broader class of systems. The performance and effectiveness of the approach are demonstrated through both a numerical example and a well-established open-channel flow benchmark: the Corning channel in California, USA.
|
| |
| 17:45-18:00, Paper FrC17.6 | |
| Resilient Tracking Control for Leader-Follower Multi-Agent Systems against Sinusoidal Sensor Attacks: An LMI-Based Framework |
|
| Hwang, Sounghwan | Purdue University |
| Cho, Minhyun | Purdue University |
| Wu, Guanlin | Purdue University |
| Hwang, Inseok | Purdue University |
Keywords: Robust control, Lyapunov methods, LMIs
Abstract: This paper proposes a novel resilient control framework for leader-follower multi-agent systems (MASs) when agents’ sensors are compromised by sinusoid false-data-injection (FDI) attacks. Generally, MASs present various attack surfaces in their communication channels and malicious impact by such persistent attacks can propagate through inter-agent communication, potentially disrupting MAS consensus. We propose an observer-based resilient controller by devising an FDI attack estimator to tackle this issue. The attack estimates are then directly integrated into the agent’s state observer, enabling the MAS to maintain resilience under FDI attacks. We present sufficient conditions for the stability of the proposed resilient control framework via Lyapunov stability with the H infinity control criterion and its corresponding linear matrix inequalities (LMIs). Finally, we provide an illustrative MAS example to demonstrate the efficacy of the proposed attack mitigation strategy.
|
| |
| 18:00-18:15, Paper FrC17.7 | |
| Stabilization of Nonlinear Systems Via Subset-Based Membership Functions and Exact Polytopic Representation |
|
| Guerra, Thierry Marie | Polytechnic University Hauts-De-France |
| Silva, Rafael | Université Polytechnique Hauts-De-France |
| Diouf, Mohamed | Université Polytechnique Hauts-De-France |
| Peixoto, Marcia Luciana da Costa | Université Polytechnique Hauts-De-France |
Keywords: Fuzzy systems, LMIs, Lyapunov methods
Abstract: This paper introduces a novel approach for ensuring the asymptotic stabilization of nonlinear systems by constructing an exact polytopic representation of the system in a region of interest using a trapezoidal partition. Unlike traditional methods, such as gain scheduling or piecewise approximations, the proposed approach does not rely on such approximations and ensures that the derived stabilization conditions are directly applicable to the original nonlinear system. A systematic iterative design algorithm is introduced to compute the control gains, guaranteeing asymptotic stabilization for the nonlinear system. The effectiveness of the proposed approach is demonstrated through numerical examples, highlighting its advantages over traditional methods.
|
| |
| FrC18 |
Aruba I+II+III |
| Stability of Linear Systems II |
Regular Session |
| Chair: Papadopoulos, Alessandro Vittorio | Mälardalen University |
| Co-Chair: Egorov, Alexey | SPbSU |
| |
| 16:30-16:45, Paper FrC18.1 | |
| A New Representation of the Lyapunov Matrix for the One-Dimensional Heat Equation with Delay |
|
| Makoveeva, Polina | Saint Petersburg State University |
| Egorov, Alexey | SPbSU |
Keywords: Delay systems, Stability of linear systems, Lyapunov methods
Abstract: The Lyapunov–Krasovskii functional method is a well-known tool for analyzing stability and robust stability of time-delay systems. An important element of functionals with a prescribed derivative along the solutions of system is the Lyapunov matrix. This paper investigates the one-dimensional heat equation with delay. An explicit form of the Lyapunov–Krasovskii functional with a prescribed derivative is presented. A new representation of the Lyapunov matrix is introduced via an auxiliary function with fewer arguments, along with an explicit formula for its computation.
|
| |
| 16:45-17:00, Paper FrC18.2 | |
| Iterated Integral Representation of the Characteristic Function of Time-Delay Systems with Spectral Values of Maximal Multiplicity |
|
| Boussaada, Islam | Universite Paris Saclay, CNRS-CentraleSupelec-Inria |
| Mazanti, Guilherme | Inria, Université Paris-Saclay, CentraleSupélec, CNRS |
| Niculescu, Silviu-Iulian | University Paris-Saclay, CNRS, CentraleSupelec, Inria |
Keywords: Delay systems, Stability of linear systems
Abstract: Motivated by recently obtained factorizations of characteristic functions of some classes of time-delay systems admitting a root with the largest possible multiplicity in terms of Kummer confluent hypergeometric functions, this paper provides a new representation of some Kummer functions with integer coefficients in terms of iterated integrals of an exponential kernel. As a consequence of the existing links between Kummer, Whittaker, and modified Bessel functions, the latter classes of special functions also take advantage of such an iterated integral representation. We also express characteristic functions of the aforementioned classes of time-delay systems in terms of iterated integrals, and illustrate how such an iterated integral representation allow us to obtain information on the location of the spectrum of the system, at least in some low-order cases.
|
| |
| 17:00-17:15, Paper FrC18.3 | |
| Complete Type Lyapunov-Krasovskii Functionals for the Scalar Case of General Linear Delay Systems |
|
| Rychkov, Andrey | Saint-Petersburg State University |
| Egorov, Alexey | SPbSU |
Keywords: Delay systems, Stability of linear systems, Lyapunov methods
Abstract: This paper presents an expression for the Lyapunov matrix in the case of general linear delay systems. Then this expression is used in the scalar case for obtaining the complete type Lyapunov-Krasovskii functional, which admits a quadratic lower bound. We show that the proposed functional can be used for proving the inverse Lyapunov-Krasovskii theorem and derive exponential estimates of the solutions of the system.
|
| |
| 17:15-17:30, Paper FrC18.4 | |
| Existence Issue for the Delay Lyapunov Matrix for Periodic Systems |
|
| Egorov, Alexey | SPbSU |
Keywords: Delay systems, Stability of linear systems, Time-varying systems
Abstract: The Lyapunov matrix and the Lyapunov-Krasovskii functionals of complete type based on the matrix are used to analyze the stability and robust stability of systems with time delays. For systems with constant delay and periodic coefficients, this theory is still at an early stage. In this paper, a new formula for the Lyapunov matrix is presented, which guarantees the existence of the Lyapunov matrix for systems of the considered class and can be easily generalized to wider classes of systems. It is also shown how the new formula can be useful for deriving an exponential stability criterion based on the Lyapunov matrix, similar to that obtained for LTI systems previously.
|
| |
| 17:30-17:45, Paper FrC18.5 | |
| Conditions for Mean and First-Moment Stability of Positive Markov Jump Linear Systems with Time-Varying Subsystems |
|
| De Iuliis, Vittorio | University of L'Aquila |
| Kaheni, Mojtaba | Mälardalen University |
| Papadopoulos, Alessandro Vittorio | Mälardalen University |
| Manes, Costanzo | Universita' Dell'Aquila |
Keywords: Compartmental and Positive systems, Stability of linear systems, Time-varying systems
Abstract: This work studies mean stability and first-moment stability of discrete-time positive Markov Jump Linear Systems with time-varying discrete modes. We adopt an approach based on linear co-positive Lyapunov functions that produces two sets of non-equivalent sufficient conditions with guaranteed exponential decay rates. Due to the general time-varying nature of the subsystems, the conditions require infinitely many tests. Hence, we show how one of the two introduced conditions can be finitely tested in the special case where the subsystems take uncertain values within polytopes.
|
| |
| 17:45-18:00, Paper FrC18.6 | |
| Inflectional Instability of Linearized Incompressible Euler Equations Via Linear Partial Inequality Tests |
|
| Peet, Yulia | Arizona State University |
| Purra, Varshitha | Arizona State University |
Keywords: Fluid flow systems, Distributed parameter systems, Stability of linear systems
Abstract: In this paper, we consider inflectional instability of linearized incompressible Euler equations in a continuous time formulation. Inflectional instability is a linear inviscid instability mechanism that plays important role in dynamics of transitional and turbulent fluid flows. According to Rayleigh theorem, it occurs when an inflectional point develops in the mean velocity profile. Rayleigh theorem, as most of the studies concerning stability analysis of fluid flows, relies on eigenvalue decomposition that restricts analysis to a class of perturbations known as normal models (waves with time-dependent amplitude). In addition to the Rayleigh condition, a second, Fjortoft condition, has also been developed, likewise based on the normal mode assumption. Both these conditions represent necessary conditions for instability assuming a normal mode decomposition. The current paper revisits linear stability analysis of inviscid incompressible flows, but instead develops a generalized, continuous in time analysis framework that does not require a normal mode decomposition or any other assumption on the form of perturbations. The framework transforms a governing Partial Differential Equation (PDE) problem into a Partial Integral Equation (PIE) and then analyzes stability of a continuous in time PIE by invoking Lyapunov-based methods verified through Linear Partial Inequality (LPI) tests. The developed stability test provides a stricter condition for stability than the normal mode analysis since it explicitly searches for a Lyapunov function, which, if found, guarantees stability. The results of this paper show that, even when the Rayleigh and Fjortoft criteria for instability are not satisfied (i.e. a system has all stable eigenvalues), an appropriate Lyapunov function cannot always be found, meaning that these profiles can be unstable to generalized perturbations. This suggests that, in general, neither Rayleigh nor Fjortoft criteria can be used as sufficient conditions for stability when the Lyapunov stability of a continuous-time formulation is considered.
|
| |
| 18:00-18:15, Paper FrC18.7 | |
| Static Output Feedback Stabilization of Linear Systems with Multiple Delays |
|
| Braghini, Danilo | Arizona State University |
| Tognetti, Eduardo Stockler | University of Brasilia |
| Peet, Matthew M. | Arizona State University |
Keywords: LMIs, Delay systems, Distributed parameter systems
Abstract: This work proposes a new procedure for the stabilization of time-delay systems using Static Output Feedback (SOF) control. A previous convex optimization approach to SOF for Ordinary Differential Equations (ODEs) is extended to time-delay systems through the use of a proposed state-space representation. This approach is based on solving two convex optimization problems, which are extensions of Linear Matrix Inequalities (LMIs) to infinite-dimensional systems. The first problem is stabilization under state feedback control; the second problem takes advantage of the Projection Lemma, which is extended here from matrices to Partial Integral (PI) operators. Finally, the results are compared with other SOF solutions for systems with delay found in the literature, showing a significant reduction in conservatism.
|
| |
| 18:15-18:30, Paper FrC18.8 | |
| Understanding Collective Stability of ACC Systems: From Theory to Real-World Observations |
|
| Korbmacher, Raphael | University of Wuppertal |
| Khound, Parthib | Indian Institute of Technology Bombay |
| Tordeux, Antoine | Forschungszentrum Juelich |
Keywords: Adaptive control, Autonomous vehicles, Stability of linear systems
Abstract: Autonomated vehicles (AVs) are expected to have a profound impact on society, with high expectations for their potential benefits. One key anticipated benefit is the reduction of traffic congestion and stop-and-go waves, which negatively affect fuel efficiency, travel time, and environmental sustainability. This paper presents a comprehensive review that evaluates the impact of AVs on longitudinal collective stability in traffic flow. We focus on adaptive cruise control (ACC) systems, a widely used precursor technology to fully autonomous driving. ACC controllers have been studied extensively in both theoretical and practical contexts, making it a valuable starting point for analysis. Our study systematically differentiates the findings from models and simulations, controlled experiments, and empirical observations to provide a structured overview of existing research. Although some results in the literature are contradictory, three key insights emerge from our analysis: (i) String stability is highly dependent on the chosen time gap and reaction time. (ii) ACC systems currently implemented in commercial vehicles are not string stable, as manufacturers prioritise individual comfort and smooth driving, resulting in high system response times. (iii) Although cooperative ACC (CACC) systems are theoretically the most effective solution to ensure string stability, their widespread implementation in the near future remains uncertain. Instead, improvements in autonomous control algorithms should be considered to enhance system performance.
|
| |
| FrC20 |
Asia I+II+III+IV |
IEEE CSS TC on Smart Cities Tutorial Session: Challenges and Opportunities
for Control in Smart Cities |
Tutorial Session |
| Chair: Salazar, Mauro | Eindhoven University of Technology |
| Co-Chair: Malikopoulos, Andreas A. | Cornell University |
| Organizer: Salazar, Mauro | Eindhoven University of Technology |
| Organizer: Malikopoulos, Andreas A. | Cornell University |
| |
| 16:30-17:00, Paper FrC20.1 | |
| At the Intersection of Learning and Control for Emerging Mobility Systems (I) |
|
| Malikopoulos, Andreas A. | Cornell University |
Keywords: Adaptive control, Learning, Optimal control
Abstract: Emerging mobility systems [1], e.g., connected and automated vehicles (CAVs), and shared mobility, are typical cyber-physical systems (CPS) representing systems of subsystems with an informationally decentralized structure [2]. To derive optimal control strategies for such systems, we typically assume an ideal model [3]. Such model-based control approaches cannot effectively facilitate optimal solutions with performance guarantees due to the discrepancy between the model and the actual CPS. On the other hand, in most CPS there is a large volume of data of a dynamic nature, which is added to the system gradually in real time and not altogether in advance. Thus, traditional supervised learning approaches cannot always facilitate robust solutions using data derived offline. By contrast, applying reinforcement learning approaches directly to the actual CPS might impose significant implications on the safety and robust operation of the system. In this talk, I will discuss the challenges of supervised learning and model-based control approaches in several transportation-related applications, including self-learning powertrain control, power management control of hybrid electric vehicles, and optimal coordination of connected and automated vehicles. Then, I will present a theoretical framework founded at the intersection of control theory and learning that circumvents these challenges in deriving optimal strategies for CPS [4],[5]. In this framework, we aim to identify a sufficient information state for the CPS that takes values in a time-invariant space and use this information state to derive separated control strategies. Separated control strategies are related to the concept of separation between the estimation of the information state and control of the system. By establishing separated control strategies, we can derive offline the optimal control strategy of the system with respect to the information state, which might not be precisely known due to model uncertainties or the complexity of the system, and then use learning methods to learn the information state online while data are added gradually to the system in real time. This approach could effectively facilitate optimal solutions with performance guarantees in a wide range of CPS applications, such as emerging mobility systems, networked control systems, smart power grids, cooperative cyber-physical networks, cooperation of robots, and the IoT.
|
| |
| 17:00-17:30, Paper FrC20.2 | |
| Online Feedback Optimization for Power Systems Operation (I) |
|
| Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
| |
| 17:30-18:00, Paper FrC20.3 | |
| Smart Water Systems: Monitoring, Control and Resilience (I) |
|
| Polycarpou, Marios M. | University of Cyprus |
Keywords: Smart cities/houses, Control applications, Fault tolerant systems
Abstract: Considerable changes in freshwater resources have been occurring across the globe, indicating a future in which already limited water resources will become even more precious. According to the World Economic Forum, water crises is one of the top global risks in terms of impact. On the other hand, the continuous expansion of urban footprint means that an estimated 70% of the world’s population will live in urban areas by 2050. The dramatic increase in water demands resulting from this unprecedented urbanization, together with increasingly uncertain climate conditions, indicate the need for a holistic, intelligent decision-making framework for managing water infrastructures in the cities of the future. From a system engineering perspective, urban drinking water networks are complex, large-scale systems designed to supply clean water to industrial and domestic users. Some of the key water challenges include water losses, ensuring water quality, energy efficiency, and safety and security of water resources. Recent advances in information and communication technologies have facilitated the modernization of water systems with the installation of sensors, actuators, data processing units and wireless communications, which enables the collection of far more real-time data related to water systems. The objective of this presentation is to provide an overview of current advances in smart water systems from a systems and control perspective. Several results on monitoring, control and fault tolerance of water distribution networks will be presented and illustrated, and directions for future research will be discussed.
|
| |
| 18:00-18:30, Paper FrC20.4 | |
| On Justice, Wellbeing and the Engineer Trap: A Transdisciplinary Example in Mobility (I) |
|
| Salazar, Mauro | Eindhoven University of Technology |
Keywords: Smart cities/houses, Transportation networks, Traffic control
Abstract: Nowadays urban mobility systems are facing challenges ranging from environmental pollution to social injustice. The advent of cyber-physical technologies such as automated driving, connectivity and powertrain electrification (ACE), along with well- established innovations (+), might provide us with promising opportunities to face these challenges. At the same time, we are running the risk of falling (again) into the “engineer trap”, where we engineer technology-driven answers to the wrong question, focusing on technological advancements without considering the broader social and environmental context, and ultimately exacerbating these challenges rather than alleviating them. In this context, this talk will present our recent work on concepts, models and optimization to address research questions encompassing the individual-vehicle and the transportation- system level. Specifically, it will explore to what extent the application of different justice principles can enhance the fairness of a transport system, and focus on realizing such principles in the operation of transport systems rather than merely assessing a given system design. Using an intermodal Autonomous Mobility-on-Demand (AMoD) system as a case study, where a fleet of centrally controlled self-driving cars provides on-demand mobility synergistically with public transit and active modes (biking and walking), it investigates how its operation can improve the situation of users that do not own a car. I first formally define a set of justice metrics that differ in terms of distributive principle and the good of concern. The metrics include: minimization of average travel time for the car-less population (i.e., a population-specific application of utilitarianism); avoidance of unacceptably long travel times for the car-less population in line with a sufficientarian approach; and delivery of reasonable travel times to a sufficient set of destinations. I will showcase our framework in a real-world case-study in the city of Eindhoven, the Netherlands. Our results show that, compared to conventional utilitarian minimum-travel-time planning, it is possible to significantly improve the situation of the car-less users without affecting conventional performance metrics such as average travel time. Whilst the differences between the proposed sufficientarian deployment models are rather modest, they highlight intrinsic crucial trade-offs that require further consideration and analysis. Overall, these results underscore the importance of taking a transdisciplinary approach addressing planning problems from conceptualization to modeling and optimization in transport and mobility. Reference: M. Salazar, S. Betancur Giraldo, F. Paparella, L. Pedroso, K. Martens, “Mobilizing Transport Justice: A Sufficientarian Optimization Framework for Intermodal Mobility Systems”, NPJ Sustainable Mobility and Transport, 2025, In Press, DOI: https://doi.org/10.21203/rs.3.rs-6172438/v1
|
| |