| |
Last updated on October 9, 2025. This conference program is tentative and subject to change
Technical Program for Thursday December 11, 2025
|
ThA01 |
Galapagos I |
Advanced Design Principles for Gene Expression Regulation |
Invited Session |
Chair: Borri, Alessandro | CNR-IASI |
Co-Chair: Singh, Abhyudai | University of Delaware |
Organizer: Bellato, Massimo | Università Di Padova |
Organizer: Lugagne, Jean-Baptiste | University of Oxford |
Organizer: Cuba Samaniego, Christian | Carnegie Mellon University |
Organizer: Borri, Alessandro | CNR-IASI |
Organizer: Singh, Abhyudai | University of Delaware |
|
09:30-09:45, Paper ThA01.1 | |
The Incoherent Feedback Loop of the Nitrate Acquisition in Plant Roots |
|
Blanchini, Franco | Univ. Degli Studi Di Udine |
Casagrande, Daniele | University of Udine |
Tomasi, Nicola | University of Udine |
Zanin, Laura | University of Udine |
Keywords: Biomolecular systems, Systems biology, Biological systems
Abstract: In this paper we consider the high--affinity nitrate acquisition in plant roots. This process is governed by a mechanism including two loops of opposite signs we call Incoherent FeedBack (IFB) loop. A positive feedback loop acting in the short term, increases the intake when the roots are exposed to nitrate. An additional negative loop drastically reduces the acquisition in the long run by inhibiting the synthesis of transporters. We propose a model that fits nicely the experimental data. We investigate about the robustness of the model proving its structural stability, namely, stability for all possible parameters under some technical assumptions. We compare our model to the well known Incoherent FeedForward (IFF) loop which is a very popular network motif.
|
|
09:45-10:00, Paper ThA01.2 | |
Compact Attractors of an Antithetic Integral Feedback System Have a Simple Structure (I) |
|
Margaliot, Michael | Tel Aviv University |
Wu, Chengshuai | Xi'an Jiaotong University |
Sontag, Eduardo | Northeastern University |
Keywords: Systems biology, Stability of nonlinear systems, Biological systems
Abstract: Since its introduction by Briat, Gupta and Khammash, the antithetic feedback controller design has attracted considerable attention in both theoretical and experimental systems biology. The case in which the plant is a two-dimensional linear system (making the closed-loop system a four-dimensional nonlinear system) has been analyzed in much detail. This system has a unique equilibrium e but, depending on parameters, it may exhibit periodic orbits. An interesting question is for what parameter values periodic orbits exist. Another open question is whether other dynamical behaviors, such as chaotic attractors, might be possible for some parameter choices. We show that, for any parameter choices, every compact omega-limit set that does not include e is a periodic solution. We also show that if the Jacobian of the vector field at the equilibrium is unstable then a (non-trivial) periodic orbit exists, and provide an explicit open set of initial conditions that are attracted to a periodic orbit. The analysis is based on the theory of strongly 2-cooperative systems.
|
|
10:00-10:15, Paper ThA01.3 | |
Discrete Modeling of Bursty Gene Expression in Single Cells and Growing Populations (I) |
|
Poljovka, Jakub | Comenius University |
Zabaikina, Iryna | Comenius University in Bratislava |
Bokes, Pavol | Comenius University |
Singh, Abhyudai | University of Delaware |
Keywords: Genetic regulatory systems, Systems biology, Markov processes
Abstract: Bursty protein production is a key source of gene expression noise. In this study, we analyze a Markovian model of cellular protein dynamics, where proteins are produced in geometrically distributed bursts. Extending this model, we incorporate a feedback mechanism by assuming that higher protein levels reduce both cell growth and protein decay rates. We study both single-cell dynamics and an expanding cell population. Without feedback, the protein level follows a negative binomial distribution with the same parameters in both cases. With feedback, however, single-cell and population-level distributions differ, each expressible as a mixture of two negative binomial distributions with framework-dependent parameters. Using numerical integration of the master and population balance equations, we calculate the time-dependent distributions in both settings. This work extends previous continuous models and provides new insights into how population expansion influences intrinsic cellular heterogeneity.
|
|
10:15-10:30, Paper ThA01.4 | |
Consequences of Resource Constraints on Stochastic Gene Expression (I) |
|
Solanki, Utkarsh Singh | Indian Institute of Technology Kanpur |
Singh, Abhyudai | University of Delaware |
Patel, Abhilash | Indian Institute of Technology Kanpur |
Keywords: Biomolecular systems, Genetic regulatory systems, Systems biology
Abstract: In this contribution, we systematically investigate how intracellular constraints on resources impact stochastic gene expression. We first consider a model of a single gene with discrete integer-valued mRNA and protein copy numbers that evolved stochastically based on probability occurrences of biochemical reactions. The resource constraints are imposed by considering a finite number of ribosomes binding to mRNAs to form a translation complex, and the complex dissociates to give back a free ribosome and a protein molecule. Analytical analysis reveals that ribosomal constraints reduce the magnitude of stochastic fluctuations in protein copy numbers, and also lead to lower statistical single-cell concordance between mRNA and protein levels of the same gene. We also identify parameter regimes where copy-number fluctuations become sub-Poisson -- less variation than expected from a Poisson distribution. Considering fast ribosomal binding/unbinding to mRNAs, we also develop a reduced stochastic model that faithfully captures the statistical fluctuation of the system. The results help towards exploiting resource as design parameter to minimize noise in synthetic biology.
|
|
10:30-10:45, Paper ThA01.5 | |
Control of a Bi-Stable Genetic System Via Parallelized Reinforcement Learning (I) |
|
Henry, Robin | The University of Oxford |
Lugagne, Jean-Baptiste | University of Oxford |
Keywords: Biological systems, Reinforcement learning, Control applications
Abstract: Achieving real-time control of genetic systems is critical for improving the reliability, efficiency, and reproducibility of biological research and engineering. Yet the intrinsic stochasticity of these systems makes this goal difficult. Prior efforts have faced three recurring challenges: (a) predictive models of gene expression dynamics are often inaccurate or unavailable, (b) nonlinear dynamics and feedback in genetic circuits frequently lead to multi-stability, limiting the effectiveness of deterministic control strategies, and (c) slow biological response times make data collection for learning-based methods prohibitively time-consuming. Recent experimental advances now allow the parallel observation and manipulation of over a million individual cells, opening the door to model-free, data-driven control strategies. Here we investigate the use of Parallelized Q-Networks (PQN), a recently-developed reinforcement learning algorithm, to learn control policies for a simulated bi-stable gene regulatory network. We show that PQN can not only control this self-activating system more accurately than other model-free and model-based control methods previously used in the field, but also converges efficiently enough to be practical for experimental application. Our results suggest that parallelized experiments coupled with advances in reinforcement learning provide a viable path for real-time, model-free control of complex, multi-stable biological systems.
|
|
10:45-11:00, Paper ThA01.6 | |
Resilience of the Positive Gene Autoregulation Loop |
|
Proverbio, Daniele | University of Trento |
Giordano, Giulia | University of Trento |
Keywords: Genetic regulatory systems, Biological systems, Biomolecular systems
Abstract: Gene expression in response to stimuli is regulated by transcription factors (TFs) through feedback loop motifs, aimed at maintaining the desired TF concentration despite uncertainties and perturbations. In this work, we consider a stochastic model of the positive gene autoregulating feedback loop and we probabilistically quantify its resilience, i.e., its ability to preserve the equilibrium associated with a prescribed concentration of TFs, and the corresponding basin of attraction, in the presence of noise. We show that the formation of larger oligomers, corresponding to larger Hill coefficients of the regulation function, and thus to sharper non-linearities, improves the system resilience, even close to critical concentrations of TFs. We also explore a complementary definition of resilience that can be assessed within a stochastic formulation relying on the Fokker-Planck equation. Our formal results are accompanied by numerical simulations.
|
|
11:00-11:15, Paper ThA01.7 | |
Control with Practical Guarantees of Stationary Variance in Stochastic Chemical Reaction Networks |
|
M. Zand, Armin | ETH Zurich |
Gupta, Ankit | ETH Zürich |
Khammash, Mustafa H. | ETH Zurich |
Keywords: Genetic regulatory systems, Cellular dynamics, Biomolecular systems
Abstract: Biomolecular integral feedback controllers offer precise regulation of molecular species copy numbers, making them valuable for synthetic biology applications. Antithetic integral feedback controllers, in particular, can be effective in low-copy-number regimes with stochastic dynamics. In this work, we introduce a modified variant of this controller, called the antithetic dual-rein integral feedback motif, and analyze its performance from a stochastic perspective in the presence of intrinsic dynamic randomness. We demonstrate that our controller enables first-moment control while maintaining a tractable steady-state variance bound under specific parametric regimes. Notably, this variance bound is tunable, as it depends solely on the controller parameters. We derive these results using stochastic model-order reduction and validate them through numerical simulations. Our findings provide new insights into achieving both precise regulation and noise suppression in stochastic genetic circuits.
|
|
11:15-11:30, Paper ThA01.8 | |
On the Optimal Control of Birhythmic Oscillatory PWA Systems: An Application to the P53-Mdm2 Network |
|
Yabo, Agustín G. | INRAE |
Augier, Nicolas | CNRS |
Keywords: Genetic regulatory systems, Optimal control, Hybrid systems
Abstract: In this work, we tackle the problem of inducing optimal transfers between the two oscillatory regimes of a birhythmic genetic network, represented through a piecewise affine dynamical system. For that, we resort to an adaptation of Pontryagin's Maximum Principle to the hybrid setting, with a cost function that combines the transfer time and an L¹-control cost. We focus on a two-dimensional PWA model of the p53-Mdm2 network, a well-known tumor suppressor module that represents a key example of birhythmicity naturally found in mammalian cells. The resulting optimal control can be expressed in feedback form, and is able to remove an oscillatory mode of the system, allowing selection between low or high frequency oscillations of the bimodal genetic network.
|
|
ThA02 |
Oceania II |
Learning-Based Control IV: Safety Guarantees |
Invited Session |
Chair: Zeilinger, Melanie N. | ETH Zurich |
Co-Chair: Tu, Stephen | University of Southern California |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | Technical University of Munich & University of Toronto |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
09:30-09:45, Paper ThA02.1 | |
Learning Quasi-LPV Models and Robust Control Invariant Sets with Reduced Conservativeness |
|
Mulagaleti, Sampath Kumar | IMT School for Advanced Studies Lucca |
Bemporad, Alberto | IMT School for Advanced Studies Lucca |
Keywords: Identification for control, Linear parameter-varying systems, Uncertain systems
Abstract: We present an approach to identify a quasi Linear Parameter Varying (qLPV) model of a plant, with the qLPV model guaranteed to admit a robust control invariant (RCI) set. It builds upon the concurrent synthesis framework recently presented by the authors, in which the requirement of existence of an RCI set is modeled as a control-oriented regularization. Here, we reduce the conservativeness of the approach by bounding the qLPV system with an uncertain LTI system, which we derive using bound propagation approaches. The resulting regularization function is the optimal value of a nonlinear robust optimization problem that we solve via a differentiable algorithm. We numerically demonstrate the benefits of the proposed approach over two benchmark approaches.er synthesized from data.
|
|
09:45-10:00, Paper ThA02.2 | |
Towards Safe Control Parameter Tuning in Distributed Multi-Agent Systems (I) |
|
Tokmak, Abdullah | Aalto University |
Schön, Thomas (Bo) | Uppsala University |
Baumann, Dominik | Aalto University |
Keywords: Data driven control, Machine learning, Optimization algorithms
Abstract: Many safety-critical real-world problems, such as autonomous driving and collaborative robots, are of a distributed multi-agent nature. To optimize the performance of these systems while ensuring safety, we can cast them as distributed optimization problems, where each agent aims to optimize their parameters to maximize a coupled reward function subject to coupled constraints. Prior work either studies a centralized setting, does not consider safety, or struggles with sample efficiency. Since we require sample efficiency and work with unknown and nonconvex rewards and constraints, we solve this optimization problem using safe Bayesian optimization with Gaussian process regression. Moreover, we consider nearest-neighbor communication between the agents. To capture the behavior of non-neighboring agents, we reformulate the static global optimization problem as a time-varying local optimization problem for each agent, essentially introducing time as a latent variable. To this end, we propose a custom spatio-temporal kernel to integrate prior knowledge. We show the successful deployment of our algorithm in simulations.
|
|
10:00-10:15, Paper ThA02.3 | |
Latent Representations for Control Design with Provable Stability and Safety Guarantees (I) |
|
Lutkus, Paul | University of Southern California |
Wang, Kaiyuan | University of Southern California |
Lindemann, Lars | University of Southern California |
Tu, Stephen | University of Southern California |
Keywords: Reduced order modeling, Identification for control, Lyapunov methods
Abstract: We initiate a formal study on the use of low-dimensional latent representations of dynamical systems for verifiable control synthesis. Our main goal is to enable the application of verification techniques—such as Lyapunov or barrier functions—that might otherwise be computationally prohibitive when applied directly to the full state representation. Towards this goal, we first provide dynamics-aware, approximate conjugacy conditions which formalize the notion of reconstruction error necessary for systems analysis. We then utilize our conjugacy conditions to transfer the stability and invariance guarantees of a latent certificate function (e.g., a Lyapunov or barrier function) for a latent space controller back to the original system. Importantly, our analysis contains several important implications for learning latent spaces and dynamics, by highlighting the necessary geometric properties which need to be preserved by the latent space, in addition to providing concrete loss functions for dynamics reconstruction that are directly related to control design. We conclude by demonstrating the applicability of our theory to two case studies: (1) stabilization of a cartpole system, and (2) collision avoidance for a two-vehicle system.
|
|
10:15-10:30, Paper ThA02.4 | |
Learning High-Order CBFs Using Gaussian Processes for Systems in Brunovsky Canonical Form (I) |
|
Begzadić, Azra | University of California, San Diego |
Lederer, Armin | National University of Singapore |
Cortes, Jorge | UC San Diego |
Herbert, Sylvia | UC San Diego (UCSD) |
Keywords: Machine learning, Constrained control, Data driven control
Abstract: Learning control barrier functions (CBFs) offers a promising approach to enforcing safety but doing so with formal guarantees remains challenging. This is compounded by the complexity of the dynamics considered in this paper, given by systems in Brunovsk'y canonical form, which naturally require high-order CBFs (HOCBFs). The signed distance function (SDF) naturally encodes HOCBF properties but is often non-smooth, limiting its applicability in learning-based methods and safety-critical control. To address this, we propose a smoothing technique that ensures continuous differentiability with respect to the relative degree of the system dynamics while preserving key safety properties of the SDF. We then leverage Gaussian Process regression to learn an HOCBF candidate from noisy measurements, providing probabilistic safety guarantees through an inner approximation of the safe set. Additionally, we establish formal feasibility guarantees for the HOCBF-based controller and ensure the safety of the resulting closed-loop dynamics with high probability. Our approach enables online adaptation by efficiently updating the learned HOCBF with new data. We demonstrate our approach in simulation on a planar system and robotic manipulator with 2DOF.
|
|
10:30-10:45, Paper ThA02.5 | |
Data-Driven Hamiltonian for Direct Construction of Safe Set from Trajectory Data (I) |
|
Choi, Jason J. | University of California, Berkeley |
Strong, Christopher | University of California, Berkeley |
Sreenath, Koushil | University of California, Berkeley |
Cho, Namhoon | Seoul National University |
Tomlin, Claire J. | UC Berkeley |
Keywords: Optimal control, Data driven control, Formal Verification/Synthesis
Abstract: In continuous-time optimal control, evaluating the Hamiltonian requires solving a constrained optimization problem using the system's dynamics model. Hamilton-Jacobi reachability analysis for safety verification has demonstrated practical utility only when efficient evaluation of the Hamiltonian over a large state-time grid is possible. In this study, we introduce the concept of a data-driven Hamiltonian (DDH), which circumvents the need for an explicit dynamics model by relying only on mild prior knowledge (e.g., Lipschitz constants), thus enabling the construction of reachable sets directly from trajectory data. Recognizing that the Hamiltonian is the optimal inner product between a given costate and realizable state velocities, the DDH estimates the Hamiltonian using the worst-case realization of the velocity field based on the observed state trajectory data. This formulation ensures a conservative approximation of the true Hamiltonian for uncertain dynamics. The reachable set computed based on the DDH is also ensured to be a conservative approximation of the true reachable set. Next, we propose a data-efficient safe experiment framework for gradual expansion of safe sets using the DDH. This is achieved by iteratively conducting experiments within the computed data-driven safe set and updating the set using newly collected trajectory data. To demonstrate the capabilities of our approach, we showcase its effectiveness in safe flight envelope expansion for a tiltrotor vehicle transitioning from near-hover to forward flight.
|
|
10:45-11:00, Paper ThA02.6 | |
Model-Free Learning Reference Governor with Enhanced Data Collection for Safety-Critical Control Systems |
|
Merckaert, Kelly | Vrije Universiteit Brussel |
Convens, Bryan | Vrije Universiteit Brussel |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Constrained control, Data driven control, Control applications
Abstract: Ensuring constraint satisfaction in control systems without relying on accurate models is essential for real-world applications. Learning-based Reference Governors (LRGs) address this challenge by leveraging data-driven adaptation to improve constraint handling. However, existing safety-critical LRG methods often suffer from slow learning speeds and require full state measurements. This paper presents an enhanced safety-critical LRG framework that accelerates learning by redefining the peak deviation function and proposes an output-only measurement version that increases its practical applicability. A case study on a spacecraft with a flexible appendage demonstrates the effectiveness of the approach, showing improved learning speed and closed-loop convergence.
|
|
11:00-11:15, Paper ThA02.7 | |
Probabilistic Safety for Hard-To-Formalize Constraints Via Conformal Neural CBFs |
|
Hirano, Koki | The University of Tokyo |
Takeishi, Naoya | The University of Tokyo |
Yairi, Takehisa | The University of Tokyo |
Keywords: Data driven control, Nonlinear systems, Autonomous robots
Abstract: We present a data-driven safety filtering framework for nonlinear discrete-time systems that combines neural network-based control barrier functions (CBFs) with conformal prediction (CP) to provide probabilistic safety guarantees. This method utilizes CBFs learned via neural networks to determine whether the system can remain in a safe state. Furthermore, by incorporating CP, the method evaluates the error between the predicted and true values of the CBF, enabling reliable, probability-based safety certification. We applied the proposed method to the mobile robot path planning task combined with the potential field method, demonstrating its applicability even in hazardous scenarios such as deadlocks, which are difficult to formalize explicitly. Moreover, CP enables quantification of the likelihood that the system remains within the safe set. By combining data-driven control design with probabilistic safety guarantees, the proposed method contributes to improving the safety of control systems.
|
|
11:15-11:30, Paper ThA02.8 | |
Statistical Guarantees in Data-Driven Nonlinear Control: Conformal Robustness for Stability and Safety |
|
Hsu, Ting-Wei | University of Illinois Urbana-Champaign |
Tsukamoto, Hiroyasu | University of Illinois at Urbana-Champaign/NASA JPL |
Keywords: Data driven control, Uncertain systems, Stability of nonlinear systems
Abstract: We present a true-dynamics-agnostic, statistically rigorous framework for establishing exponential stability and safety guarantees of closed-loop, data-driven nonlinear control. Central to our approach is the novel concept of conformal robustness, which robustifies the Lyapunov and zeroing barrier certificates of data-driven dynamical systems against model prediction uncertainties using conformal prediction. It quantifies these uncertainties by leveraging rank statistics of prediction scores over system trajectories, without assuming any specific underlying structure of the prediction model or distribution of the uncertainties. With the quantified uncertainty information, we further construct the conformally robust control Lyapunov function (CR-CLF) and control barrier function (CR-CBF), data-driven counterparts of the CLF and CBF, for fully data-driven control with statistical guarantees of finite-horizon exponential stability and safety. The performance of the proposed concept is validated in numerical simulations with four benchmark nonlinear control problems.
|
|
ThA03 |
Oceania III |
Estimation and Control of Distributed Parameter Systems IV |
Invited Session |
Chair: Fridman, Emilia | Tel-Aviv Univ |
Co-Chair: Hu, Weiwei | University of Georgia |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
09:30-09:45, Paper ThA03.1 | |
A Dual Ensemble Kalman Filter Approach to Robust Control of Nonlinear Systems: An Application to Partial Differential Equations (I) |
|
Joshi, Anant A. | University of Illinois at Urbana Champaign |
Mowlavi, Saviz | Mitsubishi Electric Research Laboratories |
Benosman, Mouhacine | Amazon Robotics |
Keywords: Robust control, Optimal control, Data driven control
Abstract: This paper considers the problem of data-driven robust control design for nonlinear systems, for instance, obtained when discretizing nonlinear partial differential equations (PDEs). A robust learning control approach is developed for nonlinear affine in control systems based on Lyapunov redesign technique. The robust control is developed as a sum of an optimal learning control which stabilizes the system in absence of disturbances, and an additive Lyapunov-based robustification term which handles the effects of disturbances. The dual ensemble Kalman filter (dual EnKF) algorithm is utilized in the optimal control design methodology. A simulation study is done on the heat equation and Burgers partial differential equation.
|
|
09:45-10:00, Paper ThA03.2 | |
Constructive Method for Boundary Control of Singularly Perturbed Reaction-Diffusion Systems (I) |
|
Wang, Pengfei | Tel Aviv University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Distributed parameter systems, LMIs
Abstract: In recent years, the control of singularly perturbed reaction-diffusion systems has gained increasing attention. Differently from the existing qualitative methods with stability guaranteed for small enough value of the singular perturbation parameter, in this paper we give a quantitative upper bound on this parameter. We study finite-dimensional state-feedback control of singularly perturbed linear 1D reaction-diffusion systems with scalar slow and fast systems under Neumann actuation in each system. We employ modal decomposition, and the two actuators are efficient for unstable fast system and the non-standard singularly perturbed system. The controller gain is designed based on the slow (descriptor) system. We provide LMIs for finding a quantitative upper bound of the singular perturbation parameter along with the controller gain and decay rate. Numerical examples demonstrate the efficiency of the proposed method.
|
|
10:00-10:15, Paper ThA03.3 | |
DeepONet of Dynamic Event-Triggered Backstepping Boundary Control for Reaction-Diffusion PDEs (I) |
|
Yuan, Hongpeng | Xiamen University |
Wang, Ji | Xiamen University |
Diagne, Mamadou | University of California San Diego |
Keywords: Backstepping, Machine learning, Distributed parameter systems
Abstract: We present an event-triggered boundary control scheme for a class of reaction-diffusion PDEs using operator learning and the backstepping method. Our contribution aims to learn the backstepping kernels, which inherently induce the learning of the gains in the event trigger and the control law. The kernel functions in constructing the control law are approximated with neural operators (NOs) to improve the computational efficiency. Then, a dynamic event-triggering mechanism is designed, based on the plant and the continuous-time control law using kernels given by NOs, to determine the updating times of the actuation signal. In the resulting event-based closed-loop system, a strictly positive lower bound of the minimal dwell time is found, which is independent of initial conditions. As a result, the absence of a Zeno behavior is guaranteed. Besides, exponential convergence to zero of the L^2 norm of the reaction-diffusion PDE state and the dynamic variable in the event-triggering mechanism is proved via Lyapunov analysis. The effectiveness of the proposed method is illustrated by numerical simulation.
|
|
10:15-10:30, Paper ThA03.4 | |
Leader-Follower Density Control of Spatial Dynamics in Large-Scale Multi-Agent Systems |
|
Maffettone, Gian Carlo | Scuola Superiore Meridionale |
Boldini, Alain | New York Institute of Technology |
Porfiri, Maurizio | New York University Tandon School of Engineering |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Autonomous systems, Distributed parameter systems, Large-scale systems
Abstract: We address the problem of controlling the density of a large ensemble of follower agents by acting on a group of leader agents that interact with them. Using coupled partial integro-differential equations to describe leader and follower density dynamics, we establish feasibility conditions and develop two control architectures ensuring global stability. The first employs feed-forward control on the followers' and a feedback on the leaders' density. The second implements a dual feedback loop through a reference-governor that adapts the leaders' density based on both populations' measurements. Our methods, initially developed in a one-dimensional setting, are extended to multi-dimensional cases, and validated through numerical simulations for representative control applications, both for groups of infinite and finite size.
|
|
10:30-10:45, Paper ThA03.5 | |
Safe Stabilization of the Stefan Problem with a High-Order Moving Boundary Dynamics by PDE Backstepping |
|
Koga, Shumon | Kobe University |
Krstic, Miroslav | University of California, San Diego |
Keywords: Backstepping, Distributed parameter systems, Constrained control
Abstract: This paper presents a safe stabilization of the Stefan PDE model with a moving boundary governed by a high-order dynamics. We consider a parabolic PDE with a time-varying domain governed by a second-order response with respect to the Neumann boundary value of the PDE state at the moving boundary. The objective is to design a boundary heat flux control to stabilize the moving boundary at a desired setpoint, with satisfying the required conditions of the model on PDE state and the moving boundary. We apply a PDE backstepping method for the control design with considering a constraint on the control law. The PDE and moving boundary constraints are shown to be satisfied by applying the maximum principle for parabolic PDEs. Then the closed-loop system is shown to be globally exponentially stable by performing Lyapunov analysis. The proposed control is implemented in numerical simulation, which illustrates the desired performance in safety and stability. An outline of the extension to third-order moving boundary dynamics is also presented.
|
|
10:45-11:00, Paper ThA03.6 | |
Finite-Time Stabilization of a Class of Nonlinear Systems in Hilbert Space |
|
Fenza, Kamal | Sidi Mohamed Ben Abdellah University |
Labbadi, Moussa | Aix-Marseille University |
Ouzahra, Mohamed | University of Sidi Mohamed Ben Abdellah, ENS, Fes |
Keywords: Distributed parameter systems, Stability of nonlinear systems, Nonlinear systems
Abstract: This paper deals with the finite-time stabilization of a class of nonlinear infinite-dimensional systems. First, we consider a bounded matched perturbation in its linear form. It is shown that by using a set-valued function, both the convergence objective (finite-time) and the rejection of perturbations are achieved. Second, we consider a class of nonlinear systems and design a feedback control that ensures the closed-loop system is finite-time stable. All proofs presented in this paper regarding convergence are based on Lyapunov theory. The existence of solutions to the closed-loop system and its well-posedness are established using maximal monotone theory. To illustrate the applicability of the theoretical results, a heat equation is considered as an application of the main results.
|
|
11:00-11:15, Paper ThA03.7 | |
Sampled-Data and Event-Triggered Control of Globally Lipschitz Infinite-Dimensional Systems |
|
Katz, Rami | University of Trento |
Mironchenko, Andrii | University of Bayreuth |
Keywords: Distributed parameter systems, Stability of nonlinear systems, Sampled-data control
Abstract: We show that if a linear infinite-dimensional system is exponentially stabilizable by compact feedback, it is also stabilizable by means of a sampled-data feedback that is fed through a globally Lipschitz nonlinearity, provided that the sector bound for the nonlinearity and the sampling time is small enough. Next we develop a switching-based event-triggered control scheme stabilizing the system with a reduced number of switching events. We further show how the proposed event-triggering scheme can be adapted to handle uncertain input nonlinearities, where only the sector bound parameter and a bound on the Lipschitz constant of the nonlinearity are assumed to be known.
|
|
11:15-11:30, Paper ThA03.8 | |
Optimal Control of an Interconnected SDE - Parabolic PDE System |
|
Velho, Gabriel | Université Paris-Saclay, CentraleSupélec, Laboratoire Des Signau |
Auriol, Jean | CNRS |
Boussaada, Islam | Universite Paris Saclay, CNRS-CentraleSupelec-Inria |
Bonalli, Riccardo | Laboratoire Des Signaux Et Systèmes |
Keywords: Distributed parameter systems, Stochastic optimal control, Stochastic systems
Abstract: In this paper, we design a controller for an interconnected system where a linear Stochastic Differential Equation (SDE) is actuated through a linear parabolic heat equation. These dynamics arise in various applications, such as coupled heat transfer systems and chemical reaction processes that are subject to disturbances. Our goal is to develop a computational method for approximating the controller that minimizes a quadratic cost associated with the state of the SDE component. To achieve this, we first perform a change of variables to shift the actuation inside the PDE domain and reformulate the system as a linear Stochastic Partial Differential Equation (SPDE). We use a spectral approximation of the Laplacian operator to discretize the coupled dynamics into a finite-dimensional SDE and compute the optimal control for this approximated system. The resulting control serves as an approximation of the optimal control for the original system. We then establish the convergence of the approximated optimal control and the corresponding closed-loop dynamics to their infinite-dimensional counterparts. Numerical simulations are provided to illustrate the effectiveness of our approach.
|
|
ThA04 |
Oceania IV |
Control Architecture Theory (CAT) |
Invited Session |
Chair: Zardini, Gioele | Massachusetts Institute of Technology |
Co-Chair: Pappas, George J. | University of Pennsylvania |
Organizer: Zardini, Gioele | Massachusetts Institute of Technology |
Organizer: Matni, Nikolai | University of Pennsylvania |
Organizer: Ames, Aaron D. | California Institute of Technology |
Organizer: Pappas, George J. | University of Pennsylvania |
|
09:30-09:45, Paper ThA04.1 | |
Symbolic Control for Autonomous Docking of Marine Surface Vessels (I) |
|
Dietrich, Elizabeth | University of California, Berkeley |
Gezer, Emir Cem | Norwegian University of Science and Technology |
Zhong, Bingzhuo | The Hong Kong University of Science and Technology (Guangzhou) |
Arcak, Murat | University of California, Berkeley |
Zamani, Majid | University of Colorado Boulder |
Skjetne, Roger | Norwegian Univ of Science and Technology |
Sorensen, Asgeir Johan | Norwegian Univ of Sci and Technology |
Keywords: Maritime control, Hierarchical control, Control applications
Abstract: We develop a hierarchical control architecture for autonomous docking maneuvers of a dynamic positioning vessel and provide formal safety guarantees. At the upper-level, we treat the vessel’s desired surge, sway, and yaw velocities as control inputs and synthesize a symbolic controller in real-time. The desired velocities are then executed by the vessel's low-level velocity feedback control loop. We next investigate methods to optimize the performance of the proposed control scheme. The results are evaluated on a simulation model of a marine surface vessel in the presence of static obstacles and, for the first time, through physical experiments on a scale model vessel.
|
|
09:45-10:00, Paper ThA04.2 | |
Layered Multirate Control of Constrained Linear Systems (I) |
|
Stamouli, Charis | University of Pennsylvania |
Tsiamis, Anastasios | ETH Zurich |
Morari, Manfred | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Control system architecture, Hierarchical control
Abstract: Layered control architectures have been a standard paradigm for efficiently managing complex constrained systems. A typical architecture consists of: i) a higher layer, where a low-frequency planner controls a simple model of the system, and ii) a lower layer, where a high-frequency tracking controller guides a detailed model of the system toward the output of the higher-layer model. A fundamental problem in this layered architecture is the design of planners and tracking controllers that guarantee both higher- and lower-layer system constraints are satisfied. Toward addressing this problem, we introduce a principled approach for layered multirate control of linear systems subject to output and input constraints. Inspired by discrete-time simulation functions, we propose an efficient control design that guarantees the lower-layer system tracks the output of the higher-layer system with computable precision. Using this design, we derive conditions and present a method for propagating the constraints of the lower-layer system to the higher-layer system. The propagated constraints are integrated into the design of an arbitrary planner that can handle higher-layer system constraints. Our framework ensures that the output constraints of the lower-layer system are satisfied at all high-level time steps, while respecting its input constraints at all low-level time steps. We apply our approach in a scenario of motion planning, highlighting its critical role in ensuring collision avoidance.
|
|
10:00-10:15, Paper ThA04.3 | |
Learning Flatness-Preserving Residuals for Pure-Feedback Systems (I) |
|
Yang, Fengjun | University of Pennsylvania |
Welde, Jake | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Keywords: Learning, Feedback linearization, Nonlinear systems
Abstract: We study residual dynamics learning for differentially flat systems, where a nominal model is augmented with a learned correction term from data. A key challenge is that generic residual parameterizations may destroy flatness, limiting the applicability of flatness-based planning and control methods. To address this, we propose a framework for learning flatness-preserving residual dynamics in systems whose nominal model admits a pure-feedback form. We show that residuals with a lower-triangular structure preserve both the flatness of the system and the original flat outputs. Moreover, we provide a constructive procedure to recover the flatness diffeomorphism of the augmented system from that of the nominal model. Building on these insights, we introduce a parameterization of flatness-preserving residuals using smooth function approximators, making them learnable from trajectory data with conventional algorithms. Our approach is validated in simulation on a 2D quadrotor subject to unmodeled aerodynamic effects. We demonstrate that the resulting learned flat model achieves a tracking error 5times lower than the nominal flat model, while being 20times faster over a structure-agnostic alternative.
|
|
10:15-10:30, Paper ThA04.4 | |
Guaranteed Multistability in a microRNA-Based Genetic Network by Formal Methods (I) |
|
Nolan, Nicholas | Massachusetts Institute of Technology |
Peterman, Emma | Massachusetts Institute of Technology |
Galloway, Kate | Massachusetts Institute of Technology |
Sontag, Eduardo | Northeastern University |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Genetic regulatory systems, Biological systems, Systems biology
Abstract: The development of genetic memory devices in synthetic biology is a challenging process that requires extensive analysis and characterization. In mammalian systems, this complexity is compounded by the need for a small DNA payload for efficient delivery into the cell. Previous genetic memory devices have relied exclusively on protein-based regulation, which are limited by their large size; in this paper, we propose a microRNA-based multistable network, which effectively halves the payload size for more efficient delivery. We demonstrate that the system can be multistable, and use formal methods to characterize constraints on design parameters that guarantee multistability. Our results provide a new genetic network topology that can achieve multistability and demonstrate the use of formal methods in the design of sophisticated genetic network architectures against non-convex top-level specifications.
|
|
10:30-10:45, Paper ThA04.5 | |
On Composable and Parametric Uncertainty in Systems Co-Design (I) |
|
Huang, Yujun | Massachusetts Institute of Technology |
Furter, Marius | University of Zurich |
Zardini, Gioele | Massachusetts Institute of Technology |
Keywords: Autonomous systems, Autonomous robots, Formal Verification/Synthesis
Abstract: Optimizing the design of complex systems requires navigating interdependent decisions, heterogeneous components, and multiple objectives. Our monotone theory of co-design offers a compositional framework for addressing this challenge, modeling systems as design problems (DPs), representing trade-offs between functionalities and resources within partially ordered sets. While current approaches model uncertainty using intervals, capturing worst- and best-case bounds, they fail to express probabilistic notions such as risk and confidence. These limitations hinder the applicability of co-design in domains where uncertainty plays a critical role. In this paper, we introduce a unified framework for composable uncertainty in co-design, capturing intervals, distributions, and parametrized models. This extension enables reasoning about risk-performance trade-offs and supports advanced queries such as experiment design, learning, and multi-stage decision making. We demonstrate the expressiveness and utility of the framework via a numerical case study on the uncertainty-aware co-design of task-driven unmanned aerial vehicles (UAVs).
|
|
10:45-11:00, Paper ThA04.6 | |
Distributed Multi-Agent Coordination Over Cellular Sheaves (I) |
|
Hanks, Tyler | University of Florida |
Riess, Hans | Georgia Institute of Technology |
Cohen, Samuel | University of Florida |
Gross, Trevor | University of Florida |
Hale, Matthew | Georgia Institute of Technology |
Fairbanks, James | University of Florida |
Keywords: Cooperative control, Distributed control, Optimal control
Abstract: Techniques for coordination of multi-agent systems are vast and varied, often utilizing purpose-built solvers or controllers with tight coupling to the types of systems involved or the coordination goal. In this paper, we introduce a general unified framework for heterogeneous multi-agent coordination using the language of cellular sheaves and nonlinear sheaf Laplacians, which are generalizations of graphs and graph Laplacians. Specifically, we introduce the concept of a nonlinear homological program encompassing a choice of cellular sheaf on an undirected graph, nonlinear edge potential functions, and constrained convex node objectives, which constitutes a standard form for a wide class of coordination problems. We use the alternating direction method of multipliers to derive a distributed optimization algorithm for solving these nonlinear homological programs. To demonstrate the applicability of this framework, we show how heterogeneous coordination goals including combinations of consensus, formation, and flocking can be formulated as nonlinear homological programs and provide numerical simulations showing the efficacy of our distributed solution algorithm.
|
|
11:00-11:15, Paper ThA04.7 | |
A Layered Control Perspective on Legged Locomotion: Embedding Reduced Order Models Via Hybrid Zero Dynamics (I) |
|
Esteban, Sergio | California Institute of Technology |
Cohen, Max | North Carolina State University |
Ghansah, Adrian Boedtker | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Robotics, Stability of hybrid systems, Lyapunov methods
Abstract: Reduced-order models (ROMs) provide a powerful means of synthesizing dynamic walking gaits on legged robots. Yet this approach lacks the formal guarantees enjoyed by methods that utilize the full-order model (FOM) for gait synthesis, e.g., hybrid zero dynamics. This paper aims to unify these approaches through a layered control perspective. In particular, we establish conditions on when a ROM of locomotion yields stable walking on the full-order hybrid dynamics. To achieve this result, given an ROM we synthesize a zero dynamics manifold encoding the behavior of the ROM---controllers can be synthesized that drive the FOM to this surface, yielding hybrid zero dynamics. We prove that a stable periodic orbit in the ROM implies an input-to-state stable periodic orbit of the FOM's hybrid zero dynamics, and hence the FOM dynamics. This result is demonstrated in simulation on a linear inverted pendulum ROM and a 5-link planar walking FOM.
|
|
11:15-11:30, Paper ThA04.8 | |
Theoretical Foundations for Virtualization in Layered Control Architectures |
|
Bernat, Natalie | Caltech |
Keywords: Biologically-inspired methods, Control system architecture, Biological systems
Abstract: Motivated by the challenge of reverse-engineering biological control systems, this paper establishes a mathematical foundation for a central property of layered control design: virtualization. Despite its ubiquity in engineered systems, virtualization lacks a rigorous, comprehensive theoretical treatment. We begin to address this gap by proposing a formal definition of internal virtualization within reference tracking architectures-- a simple but foundational step toward a broader theory of virtualization in layered control architecture. Using minimal, standard tools from control theory, we present examples, counterexamples, and empirical explorations to illustrate key properties of internal virtualization and its relationship to classic control concepts such as feasible trajectory planning and differential flatness, as well as the more general concepts of modularity and predictive coding. Altogether, the rich insight demonstrated in such a simple, mathematically minimal treatment of virtualization in layered reference tracking reflects the urgent need for a more expansive, formal theory of Internal Virtualization in layered control architectures.
|
|
ThA05 |
Galapagos II |
Optimal and Learning-Based Control for Safe, Energy-Efficient, and
Autonomous Mobility Systems |
Invited Session |
Chair: Katriniok, Alexander | Eindhoven University of Technology |
Co-Chair: Bezzo, Nicola | University of Virginia |
Organizer: Katriniok, Alexander | Eindhoven University of Technology |
|
09:30-09:45, Paper ThA05.1 | |
Safe Adaptive Cruise Control under Perception Uncertainty: A Deep Ensemble and Conformal Tube Model Predictive Control Approach (I) |
|
Li, Xiao | University of Michigan, Ann Arbor |
Girard, Anouck | University of Michigan, Ann Arbor |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Autonomous vehicles, Vision-based control, Uncertain systems
Abstract: Autonomous driving systems heavily depend on deep neural network-based perception to interpret their environment and support decision-making. To enhance robustness in these safety-critical settings, this paper proposes a Deep Ensemble of neural network regressors integrated with Conformal Prediction to estimate both vehicle states and associated perception uncertainties. Within the Adaptive Cruise Control framework, the method uses RGB image inputs to jointly predict states and generate non-uniform, probabilistically calibrated uncertainty bounds. A Conformal Tube Model Predictive Control scheme incorporates these uncertainty estimates to ensure probabilistic safety guarantees under exchangeability assumptions. Evaluations in a high-fidelity driving simulator demonstrate the algorithm’s effectiveness in maintaining safe following distances and accurate speed tracking, including under Out-Of-Distribution conditions.
|
|
09:45-10:00, Paper ThA05.2 | |
Economic Nonlinear MPC for Conflicting Control Objectives: The Case of Adaptive Cruise Control (I) |
|
Calogero, Lorenzo | Politecnico Di Torino |
Pagone, Michele | Politecnico Di Torino |
Novara, Carlo | Politecnico Di Torino |
Rizzo, Alessandro | Politecnico Di Torino |
Keywords: Automotive control, Predictive control for nonlinear systems, Optimal control
Abstract: Optimizing the energy consumption of electric vehicles (EVs) during operation is a key factor in mitigating their overall environmental impact. Autonomous vehicle functions, such as Adaptive Cruise Control (ACC), typically disregard economic criteria such as energy optimization, being, in general, not trivial to conciliate tracking and economic control tasks. Within the domain of optimal control, Economic Nonlinear MPC (E-NMPC) is designed to deliver an economically optimal control action, optimizing the economic profit of the plant. However, E-NMPC does not allow us to include additional adversarial tasks, such as tracking, and its closed-loop stability is not easy to guarantee. In this work, we propose a novel E-NMPC formulation for conflicting control objectives - such as tracking and economic tasks - that attains the optimal trade-off between them. Furthermore, we propose a constructive procedure to design stabilizing terms for E-NMPC, ensuring its closed-loop stability with minimal impact on economic performance. We apply the proposed E-NMPC strategy to the ACC case study, proving its effectiveness in simulation: the E-NMPC-based ACC proficiently attains the conflicting tasks, delivering a higher economic profit than standard NMPC, while ensuring closed-loop stability.
|
|
10:00-10:15, Paper ThA05.3 | |
Stochastic Model Predictive Control of Charging Energy Hubs with Conformal Prediction (I) |
|
Fernández Zapico, Diego | Eindhoven University of Technology |
Hofman, Theo | Technische Universiteit Eindhoven |
Salazar, Mauro | Eindhoven University of Technology |
Keywords: Automotive control, Stochastic optimal control, Machine learning
Abstract: This paper presents an online energy management system for an energy hub where electric vehicles are charged combining on-site photovoltaic generation and battery energy storage with the power grid, with the objective to decide on the battery (dis)charging to minimize the costs of operation. To this end, we devise a scenario-based stochastic model predictive control (MPC) scheme that leverages probabilistic 24-hour-ahead forecasts of charging load, solar generation and day-ahead electricity prices to achieve a cost-optimal operation of the energy hub. The probabilistic forecasts leverage conformal prediction providing calibrated distribution-free confidence intervals starting from a machine learning model that generates no uncertainty quantification. We showcase our controller by running it over a 280-day evaluation in a closed-loop simulated environment to compare the observed cost of two scenario-based MPCs with two deterministic alternatives: a version with point forecast and a version with perfect forecast. Our results indicate that, compared to the perfect forecast implementation, our proposed scenario-based MPCs are 13% more expensive, and 1% better than their deterministic point-forecast counterpart.
|
|
10:15-10:30, Paper ThA05.4 | |
Observer-Based Environment Robust Control Barrier Functions for Safety-Critical Control with Dynamic Obstacles |
|
Quan, Yingshuai | Chalmers University of Technology |
Zhou, Jian | Linköping University |
Frisk, Erik | Linkoping Univ |
Chung, Chung Choo | Hanyang University |
Keywords: Autonomous vehicles, Robotics, Robust control
Abstract: This paper proposes a safety-critical controller for dynamic and uncertain environments, leveraging a robust environment control barrier function (ECBF) to improve robustness against the uncertainties associated with moving obstacles. The approach reduces conservatism, compared with a worst-case uncertainty approach, by incorporating a state observer for obstacles into the ECBF design. The safety-guaranteed controller is achieved by efficiently solving a quadratic programming problem. The proposed method's effectiveness is demonstrated via a dynamic obstacle-avoidance problem for an autonomous vehicle, including comparisons with established baseline approaches.
|
|
10:30-10:45, Paper ThA05.5 | |
Corridor-Based Adaptive Control Barrier & Lyapunov Functions for Safe Mobile Robot Navigation |
|
Mohammad, Nicholas | University of Virginia |
Bezzo, Nicola | University of Virginia |
Keywords: Autonomous robots, Optimal control, Reinforcement learning
Abstract: Safe navigation in unknown and cluttered environments remains a challenging problem in robotics. Model Predictive Contour Control (MPCC) has shown promise for performant obstacle avoidance by enabling precise and agile trajectory tracking, however, existing methods lack formal safety assurances. To address this issue, we propose a general Control Lyapunov Function (CLF) and Control Barrier Function (CBF) enabled MPCC framework that enforces safety constraints derived from a free-space corridor around the planned trajectory. To enhance feasibility, we dynamically adapt the CBF parameters at runtime using a Soft Actor-Critic (SAC) policy. The approach is validated with extensive simulations and an experiment on mobile robot navigation in unknown cluttered environments.
|
|
10:45-11:00, Paper ThA05.6 | |
Dynamic Log-Gaussian Process Control Barrier Function for Safe Robotic Navigation in Dynamic Environments |
|
Yin, Xin | Harbin Institute of Technology, Shenzhen |
Liang, Chenyang | Harbin Institute of Technology, Shenzhen |
Guo, Yanning | Harbin Institute of Technology |
Mei, Jie | Harbin Institute of Technology, Shenzhen |
Keywords: Autonomous robots, Autonomous systems, Control applications
Abstract: Control Barrier Functions (CBFs) have emerged as efficient tools to address the safe navigation problem for robot applications. However, synthesizing informative and obstacle motion-aware CBFs online using real-time sensor data remains challenging, particularly in unknown and dynamic scenarios. Motived by this challenge, this paper aims to propose a novel Gaussian Process-based formulation of CBF, termed the Dynamic Log Gaussian Process Control Barrier Function (DLGP-CBF), to enable real-time construction of CBF which are both spatially informative and responsive to obstacle motion. Firstly, the DLGP-CBF leverages a logarithmic transformation of GP regression to generate smooth and informative barrier values and gradients, even in sparse-data regions. Secondly, by explicitly modeling the DLGP-CBF as a function of obstacle positions, the derived safety constraint integrates predicted obstacle velocities, allowing the controller to proactively respond to dynamic obstacles’ motion. Simulation results demonstrate significant improvements in obstacle avoidance performance, including increased safety margins, smoother trajectories, and enhanced responsiveness compared to baseline methods.
|
|
11:00-11:15, Paper ThA05.7 | |
Hierarchical Policy-Gradient Reinforcement Learning for Multi-Agent Shepherding Control of Non-Cohesive Targets |
|
Covone, Stefano | Scuola Superiore Meridionale |
Napolitano, Italo | Scuola Superiore Meridionale |
De Lellis, Francesco | University of Naples Federico II |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Autonomous systems, Reinforcement learning, Distributed control
Abstract: We propose a decentralized reinforcement learning solution for multi-agent shepherding of non-cohesive targets using policy-gradient methods. Our architecture integrates target-selection with target-driving through Proximal Policy Optimization, enabling continuous action spaces and smoother agent trajectories compared to discrete-action approaches. This model-free framework effectively solves the shepherding problem while exhibiting better performance than model-based solutions previously presented in the literature. Experiments demonstrate our method's effectiveness and scalability with increased target numbers and limited sensing capabilities.
|
|
11:15-11:30, Paper ThA05.8 | |
Using Control Barrier Functions for Constrained Reinforcement Learning in Backward Chained Behavior Trees |
|
Kartasev, Mart | KTH Royal Institute of Technology |
Wagner, Jannik | KTH Royal Institute of Technology |
Ogren, Petter | KTH Royal Institute of Technology |
Keywords: Autonomous systems, Constrained control, Reinforcement learning
Abstract: In this paper we combine Reinforcement Learning (RL), Behavior Trees (BTs), and Control Barrier Functions (CBFs) to create controllers for complex tasks. Pairwise, these tools have been combined in several earlier works. RL and BTs have been combined to break down tasks into subtasks that are solved by RL. BTs and CBFs have been combined to avoid undoing already achieved subtasks, and CBFs have been used to provide safety guarantees for RL. By combining all three, we are able to break down tasks into subtasks and learn controllers for those tasks that are both safe and avoid undoing previously achieved subtasks. We provide experimental results that show that the proposed approach leads to more efficient training and shorter completion times of missions.
|
|
ThA06 |
Oceania I |
Optimal Transportation Methods for Estimation and Control II |
Invited Session |
Chair: Georgiou, Tryphon T. | University of California, Irvine |
Co-Chair: Rantzer, Anders | Lund University |
Organizer: Chen, Yongxin | Georgia Institute of Technology |
Organizer: Haasler, Isabel | Uppsala University |
Organizer: Karlsson, Johan | KTH Royal Institute of Technology |
Organizer: Ringh, Axel | Chalmers University of Technology and the University of Gothenburg |
Organizer: Taghvaei, Amirhossein | University of Washington Seattle |
|
09:30-09:45, Paper ThA06.1 | |
Collective Steering: Tracer-Informed Dynamics (I) |
|
Eldesoukey, Asmaa | University of California at Irvine |
Abdelgalil, Mahmoud | University of California, San Diego |
Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Optimal control, Constrained control, Algebraic/geometric methods
Abstract: We consider control and inference problems where control protocols and internal dynamics are informed by two types of constraints. Our data consist of i) statistics on the ensemble and ii) trajectories or final disposition of selected tracer particles embedded in the flow. Our aim is i’) to specify a control protocol to realize a flow that meets such constraints or ii’) to recover the internal dynamics that are consistent with such a data set. We analyze these problems in the setting of linear flows and Gaussian distributions. The control cost is taken to be a suitable action integral constrained by either the trajectories of tracer particles or their terminal placements.
|
|
09:45-10:00, Paper ThA06.2 | |
Nonlinear Dynamical Unbalanced Optimal Transport: Relaxation and Duality (I) |
|
Wu, Dongjun | Lund University |
Rantzer, Anders | Lund University |
Keywords: Nonlinear systems, Optimal control, Optimization
Abstract: In this paper, we introduce a generalized dynamical unbalanced optimal transport framework by incorporating limited control input and mass dissipation, addressing limitations in conventional optimal transport for control applications. We derive a convex dual of the problem using dual optimal control techniques developed before and during the 1990s,transforming the non-convex optimization into a more tractable form. At the core of this formulation is the smooth sub-solutions to an HJB equation. A first-order algorithm based on the dual formulation is proposed to solve the problem numerically.
|
|
10:00-10:15, Paper ThA06.3 | |
The LQR-Schrodinger Bridge (I) |
|
Lambert, Marc | Ecole Normale Superieure |
Keywords: Variational methods, Markov processes, Information theory and control
Abstract: We consider the Schrodinger bridge problem in discrete time, where the pathwise cost is replaced by a sum of quadratic functions, taking the form of a linear quadratic regulator (LQR) cost. This cost comprises potential terms that act as attractors and kinetic terms that control the diffusion of the process. When the two boundary marginals are Gaussian, we show that the LQR-Schrodinger bridge problem can be solved in closed form. We follow the dynamic programming principle, interpreting the Kantorovich potentials as cost-to-go functions. Under the LQR-Gaussian assumption, these potentials can be propagated exactly in a backward and forward passes, leading to a system of dual Riccati equations, well known in estimation and control. This system converges rapidly in practice. We then show that the optimal process is Markovian and compute its transition kernel in closed form as well as the Gaussian marginals. Through numerical experiments, we demonstrate that this approach can be used to construct complex, non-homogeneous Gaussian processes with acceleration and loops, given well-chosen attractive potentials. Moreover, this approach allows extending the Bures transport between Gaussian distributions to more complex geometries with negative curvature.
|
|
10:15-10:30, Paper ThA06.4 | |
Incompressible Optimal Transport and Applications in Fluid Mixing (I) |
|
Emerick, Max | University of California Santa Barbara |
Bamieh, Bassam | Univ. of California at Santa Barbara |
Keywords: Fluid flow systems, Optimal control, Algebraic/geometric methods
Abstract: The problem of incompressible fluid mixing arises in numerous engineering applications and has been well-studied over the years, yet many open questions remain. This paper aims to address the question “what do efficient flow fields for mixing look like, and how do they behave?” We approach this question by developing a framework which is inspired by the dynamic and geometric approach to optimal mass transport. Specifically, we formulate the fluid mixing problem as an optimal control problem where the dynamics are given by the continuity equation together with an incompressibility constraint. We show that within this framework, the set of reachable fluid configurations can formally be endowed with the structure of an infinite-dimensional Riemannian manifold, with a metric which is induced by the control effort, and that flow fields which are maximally efficient at mixing correspond to geodesics in this Riemannian space.
|
|
10:30-10:45, Paper ThA06.5 | |
The Ground Cost for Optimal Transport of Angular Velocity (I) |
|
Elamvazhuthi, Karthik | Los Alamos National Laboratory |
Halder, Abhishek | Iowa State University |
Keywords: Stochastic systems, Variational methods, Stochastic optimal control
Abstract: We revisit the optimal transport problem over angular velocity dynamics given by the controlled Euler equation. The solution of this problem enables stochastic guidance of spin states of a rigid body (e.g., spacecraft) over a hard deadline constraint by transferring a given initial state statistics to a desired terminal state statistics. This is an instance of generalized optimal transport over a nonlinear dynamical system. While prior work has reported existence-uniqueness and numerical solution of this dynamical optimal transport problem, here we present structural results about the equivalent Kantorovich a.k.a. optimal coupling formulation. Specifically, we focus on deriving the ground cost for the associated Kantorovich optimal coupling formulation. The ground cost is equal to the cost of transporting unit amount of mass from a specific realization of the initial or source joint probability measure to a realization of the terminal or target joint probability measure, and determines the Kantorovich formulation. Finding the ground cost leads to solving a structured deterministic nonlinear optimal control problem, which is shown to be amenable to an analysis technique pioneered by Athans et al. We show that such techniques have broader applicability in determining the ground cost (thus Kantorovich formulation) for a class of generalized optimal mass transport problems involving nonlinear dynamics with translated norm-invariant drift.
|
|
10:45-11:00, Paper ThA06.6 | |
Multi-Robot Path Planning and Scheduling Via Model Predictive Optimal Transport (MPC-OT) |
|
Khan, Usman A. | Boston College |
Benosman, Mouhacine | Mitsubishi Electric Research Laboratories |
Liu, Wenliang | Boston University |
Pecora, Federico | Amazon Robotics |
Durham, Joseph W. | Kiva Systems |
Keywords: Autonomous robots, Optimization algorithms, Networked control systems
Abstract: In this paper, we propose a novel methodology for path planning and scheduling for multi-robot navigation that is based on optimal transport theory and model predictive control. We consider a setup where~N robots are tasked to navigate to~M targets in a common space with obstacles. Mapping robots to targets first and then planning paths can result in overlapping paths that lead to deadlocks. We derive a strategy based on optimal transport that not only provides minimum cost paths from robots to targets but also guarantees non-overlapping trajectories. We achieve this by discretizing the space of interest into~K cells and by imposing a~{Ktimes K} cost structure that describes the cost of transitioning from one cell to another. Optimal transport then provides textit{optimal and non-overlapping} cell transitions for the robots to reach the targets that can be readily deployed without any scheduling considerations. The proposed solution requires~mathcal O(K^3log K) computations in the worst-case and~mathcal O(K^2log K) for well-behaved problems. To further accommodate potentially overlapping trajectories (unavoidable in certain situations) as well as robot dynamics, we show that a temporal structure can be integrated into optimal transport with the help of textit{replans} and textit{model predictive control}.
|
|
11:00-11:15, Paper ThA06.7 | |
Feedback-Evolving Mean-Field Games |
|
Robbins, Sam | University of Birmingham |
Stella, Leonardo | University of Birmingham |
Giacobbe, Mirco | University of Birmingham |
Keywords: Mean field games, Game theory, Stochastic optimal control
Abstract: A natural assumption in games is to consider static payoffs. Yet, this is not true when the environment changes independently or as a result of players' interactions, e.g., geopolitical decisions in the global financial market, or weather conditions in autonomous driving. Indeed, these environmental aspects have a significant impact on the strategic interactions between players and vice versa. With the growing interest in machine learning approaches, disregarding these environmental changes leads to nonstationarity and instability of the corresponding algorithms. Motivated by this issue, we develop a novel framework for continuous-time finite-state feedback-evolving mean-field games (FEMFG) where the population dynamics are paired with an environmental resource which determines the payoffs and in turn evolves according to the population distribution across the underlying Markov chain. We derive the corresponding initial-terminal value problem and show the conditions for the existence of a feedback-evolving mean-field Nash equilibrium as the solution to the FEMFG, namely, when the population dynamics given by the Kolmogorov equation and the value function obtained via the Hamilton-Jacobi-Bellman equation do not change over time.
|
|
ThA08 |
Oceania V |
Data Driven Control IV |
Regular Session |
Chair: Coulson, Jeremy | University of Wisconsin-Madison |
Co-Chair: Kaneko, Osamu | The University of Electro-Communications |
|
09:30-09:45, Paper ThA08.1 | |
A System Parameterization for Direct Data-Driven Estimator Synthesis |
|
Brändle, Felix | University Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Data driven control
Abstract: This paper introduces a novel parameterization to characterize unknown linear time-invariant systems using noisy data. The presented parameterization describes exactly the set of all systems consistent with the available data. We then derive verifiable conditions when the consistency constraint reduces the set to the true system and when it does not have any impact. Furthermore, we demonstrate how to use this parameterization to perform a direct data-driven estimator synthesis with guarantees on the H∞-norm. Lastly, we conduct numerical experiments to compare our approach to existing methods.
|
|
09:45-10:00, Paper ThA08.2 | |
QSID-MPC: Model Predictive Control with System Identification from Quantized Data |
|
Ataei, Shahab | Ohio State University |
Maity, Dipankar | University of North Carolina at Charlotte |
Goswami, Debdipta | The Ohio State University |
Keywords: Data driven control, Predictive control for linear systems, Quantized systems
Abstract: Cloud-assisted system identification and control have emerged as practical solutions for low-power, resource-constrained control systems such as micro-UAVs. In a typical cloud-assisted setting,state and input data are transmitted from local agents to a central computer over low-bandwidth wireless links, leading to quantization. This letter investigates the impact of state and input data quantization on system identification and subsequent Model Predictive Controller (MPC). We establish a fundamental relationship between the quantization resolution and the resulting model error, and analyze how this error propagates to affect the stability and boundedness of the MPC tracking error. In particular, we show that, given a sufficiently rich dataset, the model error is bounded as a function of the quantization resolution, and the MPC tracking error is likewise ultimately bounded.
|
|
10:00-10:15, Paper ThA08.3 | |
All Data-Driven LQR Algorithms Require at Least As Much Data As System Identification |
|
Song, Christopher | University of Waterloo |
Liu, Jun | University of Waterloo |
Keywords: Data driven control
Abstract: We show that algorithms for solving continuous-time infinite-horizon LQR problems using input and state data on intervals require at least as much data as system identification. Using this result, we show that the map from interval data to the optimal gain defined by these algorithms is continuous. We then obtain a convergence criterion that allows us to approximate the optimal gain by using sampled data in place of interval data. In doing so, we uncover a connection with the theory of numerical integration. We corroborate our theoretical results with some numerical experiments, which show how judicious selection of sample points can significantly improve the accuracy of the approximation.
|
|
10:15-10:30, Paper ThA08.4 | |
Algebraic Generalization of Controllability in Data Informativity Approach |
|
Tanaka, Yuki | The University of Electro-Communications |
Kaneko, Osamu | The University of Electro-Communications |
Keywords: Data driven control, Algebraic/geometric methods, Linear systems
Abstract: In this study, we investigate the generalization of controllability in the Data Informativity approach to ensure that it is independent of data space. Our proposed approach is independent of the data space and captures linear space theory from the perspective of abstract algebra, as seen in algebraic systems theory and geometric approaches in model-based control. Subsequently, we use an explicit data representation for deriving the informativity of controllability. Our results naturally derive a series of results on controllability using the Hautus test in model-based control in the context of data-driven control, which includes results of the conventional informativity of controllability.
|
|
10:30-10:45, Paper ThA08.5 | |
Data-Driven Controllability and Observability Tests for Descriptor Systems |
|
Wang, Yu | Beijing Institute of Technology |
Zhang, Yuan | School of Automation, Beijing Institute of Technology |
Xia, Yuanqing | Beijing Institute of Technology |
Keywords: Data driven control, Predictive control for linear systems, Linear systems
Abstract: This paper proposes rank-based criteria for testing R-controllability and C-controllability, as well as R-observability and C-observability of the discrete-time descriptor (singular) systems using purely input-output data matrices. To address the non-causality-induced challenges in C-controllability analysis, forward and backward data matrices are constructed. Furthermore, Willems' fundamental lemma is extended to incompletely controllable descriptor systems, demonstrating that finite-length trajectories with initial states in specific subspaces can be linearly represented by measured trajectories. Numerical examples validate the effectiveness of the proposed criteria and show that Data-enabled Predictive Control (DeePC) achieves output tracking even under incomplete system controllability for descriptor systems.
|
|
10:45-11:00, Paper ThA08.6 | |
Distances between Finite-Horizon Linear Behaviors |
|
Padoan, Alberto | University of British Columbia |
Coulson, Jeremy | University of Wisconsin-Madison |
Keywords: Subspace methods, Linear systems, Data driven control
Abstract: The paper introduces a class of distances for linear behaviors over finite time horizons. These distances allow for comparisons between finite-horizon linear behaviors represented by matrices of possibly different dimensions. They remain invariant under coordinate changes, rotations, and permutations, ensuring independence from input-output partitions. Moreover, they naturally encode complexity-misfit trade-offs for Linear Time-Invariant (LTI) behaviors, providing a principled solution to a longstanding puzzle in behavioral systems theory. The resulting framework characterizes modeling as a minimum distance problem, identifying the Most Powerful Unfalsified Model (MPUM) as optimal among all systems unfalsified by a given dataset.
|
|
11:00-11:15, Paper ThA08.7 | |
Data Informativity for Output Controllability Gramians and Its Duality |
|
Banno, Ikumi | Kyoto University |
Keywords: Estimation, Data driven control, Control of networks
Abstract: Controllability evaluation is one of fundamental topics in the analysis and design of network systems. However, the necessary and sufficient condition for the possibility of estimating controllability Gramians by using output measurement data have never been addressed. Therefore, this paper addresses data informativity for this task. First, we characterize the data informativity for computing the output controllability Gramian, where the one-step controllability subspace plays an crucial role. Second, we present data-driven computation methods for computing the output controllability Gramians. Finally, we characterize data informativity for the observability Gramian and provide computation methods for it, based on the duality to the data informativity for computing the controllability Gramian.
|
|
11:15-11:30, Paper ThA08.8 | |
On the Convergence of Re-Centered Chen-Fliess Series |
|
Boudaghi, Farnaz | University of Vermont |
Gray, W. Steven | Old Dominion University |
Duffaut Espinosa, Luis Augusto | University of Vermont |
Keywords: Algebraic/geometric methods, Modeling, Data driven control
Abstract: Chen-Fliess functional series provide a representation for a large class of nonlinear input-output systems. Like any infinite series, however, their applicability is limited by their radii of convergence. The goal of this paper is to present a computationally feasible method to re-center a Chen-Fliess series in order to expand its time horizon. It extends existing results in two ways. First, it takes a simpler combinatorial approach to the re-centering formula that draws directly on the analogous re-centering problem for Taylor series. Second, a convergence analysis is presented for the re-centered series. This information can be used to compute a lower bound on the radius of convergence for the output function and an estimate of the series truncation error. The method is demonstrated by simulation on a steering problem for a car-trailer steering system.
|
|
ThA09 |
Oceania VIII |
Identification IV |
Regular Session |
Chair: Breschi, Valentina | Eindhoven University of Technology |
Co-Chair: Arcak, Murat | University of California, Berkeley |
|
09:30-09:45, Paper ThA09.1 | |
A Newton Interior-Point Method for ℓ0 Factor Analysis |
|
Wang, Linyang | Sun Yat-Sen University |
Liu, Wanquan | Sun Yat-Sen University |
Zhu, Bin | Sun Yat-Sen University |
Keywords: Identification, Optimization algorithms
Abstract: Factor Analysis is an effective way of dimensionality reduction achieved by revealing the low-rank plus sparse structure of the data covariance matrix. The corresponding model identification task is often formulated as an optimization problem with suitable regularizations. In particular, we use the nonconvex discontinuous L0 norm in order to induce the sparsity of the covariance matrix of the idiosyncratic noise. This paper shows that such a challenging optimization problem can be approached via an interior-point method with inner-loop Newton iterations. To this end, we first characterize the solutions to the unconstrained L0 regularized optimization problem through the L0 proximal operator, and demonstrate that local optimality is equivalent to the solution of a stationary-point equation. The latter equation can then be solved using standard Newton's method, and the procedure is integrated into an interior-point algorithm so that inequality constraints of positive semidefiniteness can be handled. Finally, numerical examples validate the effectiveness of our algorithm.
|
|
09:45-10:00, Paper ThA09.2 | |
State-Space Kolmogorov Arnold Networks for Interpretable Nonlinear System Identification |
|
Granjal Cruz, Gonçalo | Vrije Universiteit Brussel |
Renczes, Balazs | Budapest University of Technology and Economics, Department of M |
Runacres, Mark C | Vrije Universiteit Brussel |
Decuyper, Jan | Vrije Universiteit Brussel |
Keywords: Nonlinear systems identification, Grey-box modeling, Machine learning
Abstract: While accurate, black-box system identification models lack interpretability of the underlying system dynamics. This letter proposes State-Space Kolmogorov- Arnold Networks (SS-KAN) to address this challenge by integrating Kolmogorov-Arnold Networks within a state-space framework. The proposed model is validated on two benchmark systems: the Silverbox and the Wiener-Hammerstein benchmarks. Results show that SS-KAN provides enhanced interpretability due to sparsitypromoting regularization and the direct visualization of its learned univariate functions, which reveal system nonlinearities at the cost of accuracy when compared to state-of-the-art black-box models, highlighting SS-KAN as a promising approach for interpretable nonlinear system identification, balancing accuracy and interpretability of nonlinear system dynamics.
|
|
10:00-10:15, Paper ThA09.3 | |
Modeling, Observability, and Inertial Parameter Estimation of a Planar Multi-Link System with Thrusters |
|
Andrews, Nicholas B. | University of Washington |
Morgansen, Kristi A. | University of Washington |
Keywords: Nonlinear systems identification, Modeling, Robotics
Abstract: This research provides a theoretical foundation for modeling and real-time estimation of both the pose and inertial parameters of a free-floating multi-link system with link thrusters, which are essential for safe and effective controller design and performance. First, we adapt a planar nonlinear multi-link snake robot model to represent a planar chain of bioinspired salp robots by removing joint actuators, introducing link thrusters, and allowing for non-uniform link lengths, masses, and moments of inertia. Second, we conduct a nonlinear observability analysis of the multi-link system with link thrusters, proving that the link angles, angular velocities, masses, and moments of inertia are locally observable when equipped with inertial measurement units and operating under specific thruster conditions. The analytical results are demonstrated in simulation with a three-link system.
|
|
10:15-10:30, Paper ThA09.4 | |
Distributionally Robust Minimization in Meta-Learning for System Identification |
|
Rufolo, Matteo | USI-SUPSI |
Piga, Dario | University of Applied Sciences and Arts of Southern Switzerland |
Forgione, Marco | IDSIA USI-SUPSI |
Keywords: Nonlinear systems identification, Neural networks, Optimization
Abstract: Meta learning aims at learning how to solve tasks, and thus it allows to estimate models that can be quickly adapted to new scenarios. This work explores distributionally robust minimization in meta learning for system identification. Standard meta learning approaches optimize the expected loss, overlooking task variability. We use an alternative approach, adopting a distributionally robust optimization paradigm that prioritizes high-loss tasks, enhancing performance in worst-case scenarios. Evaluated on a meta model trained on a class of synthetic dynamical systems and tested in both in-distribution and out-of-distribution settings, the proposed approach allows to reduce failures in safety-critical applications.
|
|
10:30-10:45, Paper ThA09.5 | |
Evaluating Methods to Calculate Lithium Battery Impedance from Physics-Based PDAE Models |
|
Sun, Juan-Jie | University of Colorado Colorado Springs |
Hileman, Wesley Allen | University of Colorado Colorado Springs |
Trimboli, Michael | University of Colorado, Colorado Springs |
Plett, Gregory L. | University of Colorado Colorado Springs |
Keywords: Energy systems, Modeling, Identification
Abstract: Electrochemical impedance contains a wealth of information about the physical parameters and state of lithium battery cells. As such, efficient ways to predict impedance from partial differential algebraic equation (PDAE) models are valuable for white-box system identification and health estimation. This paper reviews several approaches to calculate impedance from PDAE cell models found dispersed in the literature: direct time-domain simulation, frequency-domain linear perturbation analysis, and transfer function analysis. We construct a PDAE model of a lithium-ion battery cell and compute the model’s impedance using each approach. We cross-validate the approaches by matching impedance results and evaluate their differences in terms of computational workload, model flexibility, and functionality. We provide MATLAB and Python code to compute PDAE model impedance with each approach—including time-domain and linear perturbation analyses with COMSOL and PyBaMM solvers—useful for practical application.
|
|
10:45-11:00, Paper ThA09.6 | |
AutoLIME and PWA-LIME: Towards Robust Explanations of Deep Dynamical Models |
|
Porcari, Federico | Politecnico Di Milano |
Breschi, Valentina | Eindhoven University of Technology |
Formentin, Simone | Politecnico Di Milano |
Keywords: Emerging control applications, Learning, Identification
Abstract: The increasing complexity of machine learning models highlights the need for interpretability, especially in critical domains requiring trust and transparency. Local Interpretable Model-agnostic Explanations (LIME) is a popular eXplainable AI (XAI) method that provides localized, instance-specific explanations using an interpretable surrogate model. However, its effectiveness is limited by the lack of systematic guidelines for tuning its hyperparameters. This paper addresses this limitation by proposing Automatic LIME (AutoLIME), a bi-level optimization framework to tune LIME’s kernel width. Additionally, we introduce PieceWise Affine LIME (PWALIME), a clustering-based extension of LIME for multi-instance explanations, particularly useful for interpreting black-box models of dynamical systems. Preliminary numerical results validate the potential of these methods in explaining opaque dynamical models.
|
|
11:00-11:15, Paper ThA09.7 | |
Ensemble Learning of Dynamical Systems with Multiple Operating Conditions Via Statistical Process Control |
|
Boca de Giuli, Laura | Politecnico Di Milano |
La Bella, Alessio | Politecnico Di Milano |
Scattolini, Riccardo | Politecnico Di Milano |
Keywords: Identification, Statistical learning, Energy systems
Abstract: This paper addresses the adaptation and performance monitoring of ensemble data-based models over time. Once a model of a system is identified using a specific training dataset, it may fail to accurately represent system dynamics under varying operating conditions not included in the original training dataset. To continuously adapt a data-based model to evolving operating conditions, we propose an ensemble learning framework characterized by (i) a combination rule that weights different models based on the statistical proximity of their training dataset to the current operating condition, and (ii) a monitoring algorithm leveraging statistical control charts to supervise the ensemble model's reliability and trigger the identification and integration of a new model when a new operating condition is encountered. The proposed methodology is tested on an energy system referenced in the literature, which exhibits multiple operating conditions, showing promising results from both adaptation and monitoring perspectives.
|
|
11:15-11:30, Paper ThA09.8 | |
STL-Based Optimization of Biomolecular Neural Networks for Regression and Control |
|
Palanques Tost, Eric | Boston University |
Krasowski, Hanna | University of California, Berkeley |
Arcak, Murat | University of California, Berkeley |
Weiss, Ron | MIT |
Belta, Calin | University of Maryland |
Keywords: Biological systems, Learning, Modeling
Abstract: Biomolecular Neural Networks (BNNs), artificial neural networks with biologically synthesizable architectures, achieve universal function approximations beyond simple biological circuits. However, training BNNs remains challenging due to the lack of target data. To address this, we propose leveraging Signal Temporal Logic (STL) specifications to define training objectives for BNNs. We build on the quantitative semantics of STL, enabling gradient-based optimization of the BNN weights, and introduce a learning algorithm that enables BNNs to perform regression and control tasks in biological systems. Specifically, we investigate two regression problems in which we train BNNs to act as reporters of dysregulated states, and a feedback control problem in which we train the BNN in closed loop with a chronic disease model, learning to reduce inflammation while avoiding adverse responses to external infections. Our numerical experiments demonstrate that STL-based learning can solve the investigated regression and control tasks efficiently.
|
|
ThA10 |
Oceania VII |
Distributed and Decentralized Control I |
Regular Session |
Chair: Charalambous, Themistoklis | University of Cyprus |
Co-Chair: Lall, Sanjay | Stanford University |
|
09:30-09:45, Paper ThA10.1 | |
Quantized Average Consensus with a Plateau Escaping Strategy in Undirected Graphs |
|
Oliva, Gabriele | University Campus Bio-Medico of Rome |
Fioravanti, Camilla | University Campus Bio-Medico of Rome |
Makridis, Evagoras | University of Cyprus |
Charalambous, Themistoklis | University of Cyprus |
Keywords: Distributed control, Decentralized control, Quantized systems
Abstract: In this paper, the average consensus problem has been considered for undirected networks under finite bit-rate communication. While other algorithms reach approximate average consensus or require global information about the network for reaching the exact average consensus, we propose a fully distributed consensus algorithm that incorporates an adaptive quantization scheme and achieves convergence to the exact average while only requiring knowledge of an upper bound of the network diameter. Using Lyapunov stability analysis, we characterize the convergence properties of the resulting nonlinear quantized system. Moreover, we provide a fully distributed strategy to escape plateaux, i.e., situations where the Lyapunov function stops descending. Simulation results justify the performance of our proposed algorithm.
|
|
09:45-10:00, Paper ThA10.2 | |
Distributed Safety-Critical MPC for Multi-Agent Formation Control and Obstacle Avoidance |
|
Wang, Chao | Beihang University |
Zhang, Shuyuan | UCLouvain |
Wang, Lei | Beihang University |
Keywords: Distributed control, Predictive control for nonlinear systems, Constrained control
Abstract: For nonlinear multi-agent systems with high relative degrees, achieving formation control and obstacle avoidance in a distributed manner remains a significant challenge. To address this issue, we propose a novel distributed safety-critical model predictive control (DSMPC) algorithm that incorporates discrete-time high-order control barrier functions (DHCBFs) to enforce safety constraints, alongside discrete-time control Lyapunov functions (DCLFs) to establish terminal constraints. To facilitate distributed implementation, we develop estimated neighbor states for formulating DHCBFs and DCLFs, while also devising a compatibility constraint to limit estimation errors and ensure convergence. Additionally, we provide theoretical guarantees regarding the feasibility and stability of the proposed DSMPC algorithm based on a mild assumption. The effectiveness of the proposed method is evidenced by the simulation results, demonstrating improved performance and reduced computation time compared to existing approaches.
|
|
10:00-10:15, Paper ThA10.3 | |
Dynamical Leaderless Consensus of Third Order Uncertain Multi-Agent Systems with Only Relative Position Measurements |
|
Tian, Kaixin | Harbin Institute of Technology, Shenzhen |
Mei, Jie | Harbin Institute of Technology, Shenzhen |
Gong, Youmin | Harbin Institute of Technology, Shenzhen |
Li, Chuanjiang | Harbin Institute of Technology |
Ma, Guangfu | Harbin Institute of Technology, Shenzhen |
Keywords: Distributed control, Adaptive control, Cooperative control
Abstract: This paper focuses on the leaderless consensus problem of third order multi-agent systems with parametric uncertainties over a directed graph, where the agents reach consensus on their positions and high order derivatives. We design a novel extended state reference model using only relative position information as the input. Then an adaptive tracking algorithm is proposed to track the reference model in the presence of parametric uncertainties. We focus on the dynamical consensus, and prove the prerequisite of consensus is that the directed graph has a directed spanning tree and the eigenvalues of the transformed system matrix needs to have negative real parts except for three zero eigenvalues. Numerical simulation results are given to verify the effectiveness of proposed control algorithms.
|
|
10:15-10:30, Paper ThA10.4 | |
Graph Conditions and Distributed Control for 3-D Similar Formation with Shared Z-Axis Alignment |
|
Wang, Lili | Southern University of Science and Technology |
Lin, Zhiyun | Southern University of Science and Technology |
Cai, Kai | Osaka Metropolitan University |
Pan, Wenda | Southern University of Science and Technology |
Keywords: Distributed control, Large-scale systems, Cooperative control
Abstract: This paper addresses the formation control problem for multi-agent systems operating in three-dimensional space by introducing the novel concept of z-similar formation. Unlike traditional rigid formations, z-similar formations ensure geometric invariance specifically under translation, scaling, and rotations around the vertical (z) axis. Our method features significantly relaxed graphical conditions, requiring only a 2-rooted sensing graph and ensuring each follower node has at least three neighbors for both realizability and stability. Moreover, by leveraging local coordinate systems and relative position measurements with alignment solely in the z-axis direction, we uniquely define formations, substantially reducing reliance on global references. We provide rigorous theoretical analyses to establish conditions for formation realizability and stability, which are validated through comprehensive simulations.
|
|
10:30-10:45, Paper ThA10.5 | |
Optimal Control in Human-Robotic Agent Teams for Cooperative Manipulation |
|
Ganie, Irfan Ahmad | Missouri University of Science and Technology Rolla MO 65401 |
Jagannathan, Sarangapani | Missouri Univ of Science & Tech |
Keywords: Human-in-the-loop control, Distributed control, Learning
Abstract: This paper introduces a distributed deep neural networks (DNNs) adaptive observer-based optimal control framework for cooperative manipulation. At the high level, a multilayer NN-based observer estimates human intent-based desired trajectories using limited local information, ensuring real-time tracking consistency. Novel NN weight update laws, derived via singular value decomposition, enhance adaptive estimation performance. At the low level, a cooperative game-theoretic DNNs controller ensures optimal robotic agent coordination through neighborhood optimization, considering the effects of neighboring agents, and adaptive dynamic programming (ADP), enabling efficient decision-making and coordination. Safety constraints are enforced using neighbor-dependent barrier Lyapunov functions (BLFs) that encode individual robot tracking constraints and inter-agent safety requirements. Unlike prior methods, the constrained optimization problem is transformed using enhanced Karush-Kuhn-Tucker (KKT) conditions, where Lagrange multipliers, defined as functions of neighboring states, dynamically enforce safety constraints without incorporating them directly into the cost function. Simulation results validate the framework, showing zero safety violations during cooperative manipulation tasks and a 45% reduction in cost compared to recent methods.
|
|
10:45-11:00, Paper ThA10.6 | |
Formation Control of Nonholonomic Agents by Discrete-Valued Inputs and Multi-Step Movements |
|
Izumi, Shinsaku | Kochi University of Technology |
Nakayama, Takeru | Kochi University of Technology |
Keywords: Distributed control, Quantized systems, Control of networks
Abstract: This paper considers formation control of nonholonomic agents capable of straight, lateral, and rotational movements, subject to discrete constraints on the control inputs. In particular, we aim to extend existing formation controllers so that each agent can perform the multi-step movement, i.e., achieving the desired movement in multiple steps. The multi-step movements allow us to reduce the performance degradation due to the input constraints while eliminating complicated components from the existing controllers. We present controllers to achieve the desired formation through the multi-step movements. We then present theoretical results on the performance of our controllers and the behavior of the resulting feedback system.
|
|
11:00-11:15, Paper ThA10.7 | |
Integrating Cooperative Influence and Memory Dynamics: An Adaptive Framework for Distributed Coordination |
|
Maldonado Andrade, Diego Javier | Escuela Politécnica Nacional |
Obando Martínez, Camila Alejandra | Universidad Politécnica De Cataluña · Barcelona Tech - UPC |
Cruz, Patricio J. | Escuela Politécnica Nacional |
Cepeda, Jaime | Escuela Politecnica Nacional |
Keywords: Adaptive control, Biologically-inspired methods, Distributed control
Abstract: This paper presents an adaptive distributed coordination framework designed for synchronizing networks of oscillators by introducing two novel virtual dynamic states. These states facilitate cooperative behavior and enable memory-based adaptation within the network. Specifically, the Adaptive Virtual Cooperative Influence (AVCI) dynamically adjusts the cooperative interactions based on local frequency deviations, while the Virtual Influence Memory (VIM) accumulates historical interaction data to regulate the influence distribution and mitigate persistent dependency effects. A rigorous Lyapunov-based analysis is provided, proving the mathcal{D}-global asymptotic stability of the resulting closed-loop system. Numerical simulations demonstrate that the interplay between AVCI and VIM enhances coordination, counteracting the individualities of the oscillators, even in the event of improperly parameterized local controllers.
|
|
11:15-11:30, Paper ThA10.8 | |
Buffer Centering for Bittide Synchronization Via Frame Rotation |
|
Lall, Sanjay | Stanford University |
Spalink, Tammo | Google |
Keywords: Distributed control, Large-scale systems, Information technology systems
Abstract: Maintaining consistent time in distributed systems is a fundamental challenge. The bittide system addresses this by providing logical synchronization through a decentralized control mechanism that observes local buffer occupancies and controls the frequency of an oscillator at each node. A critical aspect of bittide's stability and performance is ensuring that these elastic buffers operate around a desired equilibrium point, preventing data loss due to overflow or underflow. This paper introduces a novel method for centering buffer occupancies in a bittide network using a technique we term frame rotation. We propose a control strategy utilizing a directed spanning tree of the network graph. By adjusting the frequencies of nodes in a specific order dictated by this tree, and employing a pulsed feedback controller that targets the buffer occupancy of edges within the spanning tree, we prove that all elastic buffers in the network can be driven to their desired equilibrium. This ordered adjustment approach ensures that prior centering efforts are not disrupted, providing a robust mechanism for managing buffer occupancy in bittide synchronized systems.
|
|
ThA11 |
Oceania VI |
Networked Control Systems IV |
Regular Session |
Chair: Peters, Andres A. | Universidad Adolfo Ibáñez |
Co-Chair: Wang, Miaomiao | Hong Kong University of Science and Technology |
|
09:30-09:45, Paper ThA11.1 | |
Safety Controller Synthesis for Stochastic Networked Systems under Communication Constraints |
|
Akbarzadeh, Omid | Newcastle University |
Mamduhi, Mohammad H. | University of Birmingham |
Lavaei, Abolfazl | Newcastle University |
Keywords: Networked control systems, Control over communications, Stochastic systems
Abstract: This paper develops a framework for synthesizing safety controllers for discrete-time stochastic linear control systems (dt-SLS) operating under communication imperfections. The control unit is remote and communicates with the sensor and actuator through an imperfect wireless network. We consider a constant delay in the sensor-to-controller channel (uplink), and data loss in both sensor-to-controller and controller-to-actuator (downlink) channels. In our proposed scheme, data loss in each channel is modeled as an independent Bernoulli-distributed random process. To systematically handle the uplink delay, we first introduce an augmented discrete-time stochastic linear system (dt-ASLS) by concatenating all states and control inputs that sufficiently represent the state-input evolution of the original dt-SLS under the delay and packet loss constraints. We then leverage control barrier certificates for dt-ASLS to synthesize a controller that ensures the stochastic safety of dt-SLS, guaranteeing that all trajectories remain outside unsafe regions with a quantified probabilistic bound. Our approach translates safety constraints into matrix inequalities, leading to an optimization problem that eventually quantifies the probability of satisfying the safety specification in the presence of communication imperfections. We validate our results on an RLC circuit subject to both constant delay and probabilistic data loss.
|
|
09:45-10:00, Paper ThA11.2 | |
One-Bit Consensus Control of Multi-Agent Systems with Packet Loss |
|
An, Ru | Academy of Mathematics and Systems Science, Chinese Academy of S |
Wang, Ying | Chinese Academy of Sciences |
Zhao, Yanlong | Academy of Mathematics and Systems Science, Chinese Academyof Sci |
Zhang, Ji-Feng | Chinese Academy of Sciences |
Keywords: Networked control systems, Cooperative control, Identification for control
Abstract: This paper investigates the one-bit consensus control of multi-agent systems (MASs) with independent and identically distributed (i.i.d.) and Markovian packet loss. To explore the impact of packet loss on one-bit communication, this paper first quantitatively characterizes the information loss of one-bit communications caused by packet loss, which provides the proportional relationship between one-bit data with and without packet loss in the sense of expectation.Based on quantitative characterizations, a one-bit packet loss onsensus algorithm with a packet loss proportional coefficient is proposed to compensate for the information loss, where the coefficient is designed as the reciprocal of the information loss proportion.Furthermore, this paper demonstrates that the proposed algorithm enables the MAS to achieve one-bit consensus in the mean square sense at a rate of O(1/t) with packet loss. Two simulation examples are given to validate the algorithm.
|
|
10:00-10:15, Paper ThA11.3 | |
A Distributed Observer Accommodating a Broad Range of Intermittent Communication Scenarios |
|
Koo, Sunghyun | Seoul National University |
Lee, Jin Gyu | Seoul National University |
Shim, Hyungbo | Seoul National University |
Keywords: Networked control systems, Observers for Linear systems, Large-scale systems
Abstract: This paper proposes a distributed observer that utilizes intermittent communication and accommodates a broad range of scenarios, including asynchronous operation, unidirectional communication, packet loss, and communication delays. An analysis over jointly connected switching topology supports this and provides a condition for exponential convergence of the estimation error. This condition can be used to determine a communication rate. The analysis is valid when the unobservable subspace of each agent admits an invariant orthogonal complement. This is a property that is always achievable via a coordinate transformation when the system matrix is diagonalizable over the complex field.
|
|
10:15-10:30, Paper ThA11.4 | |
Wireless Control with Channel State Detection and Message Dropout Compensation |
|
Zacchia Lun, Yuriy | Università Degli Studi Dell’Aquila |
Santucci, Fortunato | University of L'Aquila |
D'Innocenzo, Alessandro | University of L'Aquila |
Keywords: Control over communications, Markov processes
Abstract: This letter presents a framework for designing optimal state-feedback control that uses a wireless actuation link with imperfect channel state information to transfer the current and future control inputs that actuators can apply if future control messages are lost. The dropout compensation strategy supports scaling inputs to actuators when necessary. We analytically solve finite- and infinite-horizon control problems and present a necessary and sufficient stability condition for any given infinite-horizon state-feedback control law. We validate the results using an illustrative example.
|
|
10:30-10:45, Paper ThA11.5 | |
Estimator-Based Encoder-Decoder for Reducing Communications Demands in Event-Triggered Networked Control Systems |
|
Villamil, Andres | TU Dresden |
Casas, Jonathan | Dresden University of Technology |
Fettweis, Gerhard | Technische Universität Dresden |
Keywords: Control over communications, Networked control systems, Autonomous vehicles
Abstract: Wireless networks are vital for implementing flexible Networked Controlled Systems (NCS) in distributed applications, yet they introduce sampling errors, delays, and packet losses that can compromise control performance. While emerging communication services such as Ultra-Reliable Low Latency Communications (URLLC) can mitigate these issues, they consume more shared network resources and may not be efficient if the NCS does not manage its transmissions. Event Triggered Control (ETC) addresses this challenge by determining when an update is needed, thereby specifying a Minimum Inter-Event Time (MIET) and Maximum Allowable Delay (MAD) to ensure a prescribed L2 norm condition or robust stability criterion. This letter proposes an Encoder-Decoder (E/D) architecture for NCS that requires that a control signal is transmitted over a wireless link. Instead of sending the original control signal whenever a trigger occurs, this method transmits an error signal produced by the comparison between the original control signal and a locally estimated signal. This estimated signal is assumed to be locally available at the transmitter and receiver to be used as the encoder and decoder, respectively. Assuming that the estimated signal is correlated to the original control signal, the transmitted error has a lower magnitude than the original transmitted signal. As a result, the NCS can guarantee its robust stability criterion while increasing the achievable MIET, thus reducing network resource usage. This approach is validated in a Cooperative Adaptive Cruise Control (CACC) setup, demonstrating an at least 20% improvement in MIET compared to conventional ETC, while maintaining L2 (string) stability and robust performance with fewer transmissions.
|
|
10:45-11:00, Paper ThA11.6 | |
The L_{infty}/L_{2}-Gain Analysis for Sampled-Data Periodic Event-Triggered Control Systems: Discretization Method with Convergence Rate Analysis |
|
Kang, Oe Ryung | POSTECH |
Kim, Jung Hoon | Pohang Univeristy of Science and Technology |
Keywords: Sampled-data control, Networked control systems
Abstract: This paper is concerned with the L_{infty}/L_{2}-gain analysis of periodic event-triggered control (PETC) systems, in which the feedback connection between a continuous-time plant and a discrete-time controller is intermittently activated. An operator-based description of PETC systems is first presented by applying the lifting technique to take into account their hybrid continuous/discrete-time behavior. On top of the fast-lifted treatment of PETC systems, in which the sampling interval [0,h) is divided into N subintervals of equal width, the piecewise constant approximation (PCA) is developed for the output operator. This PCA allows us to convert the PETC system into an equivalent discrete-time event-triggered control (ETC) system. It is also shown that the l_infty/l_2-gain of the discretized ETC system converges to the gain-gain of the original PETC system at a rate of 1/N. Based on this fact, we further introduce a method for estimating the L_{infty}/L_{2}-gain of PETC systems through the linear matrix inequality (LMI)-based approach. Finally, a numerical example is given to verify the effectiveness of the arguments developed in this paper.
|
|
11:00-11:15, Paper ThA11.7 | |
String Stability for Predecessor-Leader Following Platoons with Additive Noise Channels |
|
Sanhueza, Fernando | Universidad Técnica Federico Santa Maria |
Gordon, Marco A. | Universidad Técnica Federico Santa María |
Wang, Miaomiao | Hong Kong University of Science and Technology |
Chen, Jie | City University of Hong Kong |
Peters, Andres A. | Universidad Adolfo Ibáñez |
Vargas, Francisco J. | Universidad Técnica Federico Santa María |
Keywords: Autonomous vehicles, Networked control systems, Cooperative control
Abstract: This article studies vehicle platoons whose communication is affected by additive noise. A predecessor-leader topology is considered, with additive noises affecting both communication channels. A control law is implemented that weights the importance of leader-following and predecessor-following tasks through a weighting parameter. Given this setup, a characterization of the second order statistics of the platoon is obtained, and also conditions are derived for Lp-mean Lq-variance string stability and also for mean-square string stability. Numerical examples are also provided to illustrate our findings.
|
|
11:15-11:30, Paper ThA11.8 | |
Trains Virtual Coupling under Unreliable Communication |
|
Terlizzi, Mario | University of Sannio |
Glielmo, Luigi | Università Di Napoli Federico II |
Liuzza, Davide | Università Del Sannio |
Keywords: Cooperative control, Optimal control, Autonomous vehicles
Abstract: The contribution of this work is a control system architecture enabling a safe trains Virtual Coupling in realistic railway communication. Specifically, the proposed architecture addresses unreliability and variability in data communication, ensuring safety through incorporating a safety control barrier function to guarantee operational safety and through a real-time collision prediction block. Further, a real-time cruise distance from the leader train is continuously computed and updated to allow for a smooth follower motion and avoid unnecessary braking. Further, the proposed architecture supports diverse possible control laws and leaves the control designer free to design it (in the paper we propose a switched MPC strategy but other choices are possible). A railway simulation tool for VC scenarios, developed in collaboration with the Italian railway company, validates the effectiveness and potential of the proposed system in advancing safe virtual coupling in railway systems.
|
|
ThA12 |
Oceania X |
Optimization IV |
Regular Session |
Chair: Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Co-Chair: Pasqualetti, Fabio | University of California, Irvine |
|
09:30-09:45, Paper ThA12.1 | |
Nonlinear Robust Optimization for Planning and Control |
|
Abdul, Arshiya Taj | Georgia Institute of Technology |
Saravanos, Augustinos D. | Georgia Institute of Technology |
Theodorou, Evangelos | Georgia Institute of Technology |
Keywords: Optimization, Uncertain systems, Constrained control
Abstract: This paper presents a novel robust trajectory optimization method for constrained nonlinear dynamical systems subject to unknown bounded disturbances. In particular, we seek optimal control policies that remain robustly feasible with respect to all possible realizations of the disturbances within prescribed uncertainty sets. To address this problem, we introduce a bi-level optimization algorithm. The outer level employs a trust-region successive convexification approach which relies on linearizing the nonlinear dynamics and robust constraints. The inner level involves solving the resulting linearized robust optimization problems, for which we derive tractable convex reformulations and present an Augmented Lagrangian method for efficiently solving them. To further enhance the robustness of our methodology on nonlinear systems, we also illustrate that potential linearization errors can be effectively modeled as unknown disturbances as well. Simulation results verify the applicability of our approach in controlling nonlinear systems in a robust manner under unknown disturbances. The promise of effectively handling approximation errors in such successive linearization schemes from a robust optimization perspective is also highlighted.
|
|
09:45-10:00, Paper ThA12.2 | |
Online Optimization with Unknown Time-Varying Parameters |
|
Tripathi, Shivanshu | University of California, Riverside |
Al Makdah, Abed AlRahman | Arizona State University |
Pasqualetti, Fabio | University of California, Irvine |
Keywords: Optimization, Identification for control, Data driven control
Abstract: In this letter we study optimization problems where the cost function contains time-varying parameters that are unmeasurable and evolve according to linear, yet unknown, dynamics. We propose a solution that leverages control theoretic tools to identify the dynamics of the parameters, predict their evolution, and ultimately compute a solution to the optimization problem. The identification of the dynamics of the time-varying parameters is done online using measurements of the gradient of the cost function. This system identification problem is not standard, since the output matrix is known and the dynamics of the parameters must be estimated in the original coordinates without similarity transformations. Interestingly, our analysis shows that, under mild conditions that we characterize, the identification of the parameter dynamics and, consequently, the computation of a time-varying solution to the optimization problem, requires only a finite number of measurements of the gradient of the cost function. We illustrate the effectiveness of our algorithm on a series of numerical examples.
|
|
10:00-10:15, Paper ThA12.3 | |
Anytime Trajectory Optimization for MultI-Drone Systems with Guaranteed Collision Avoidance |
|
Rubinacci, Roberto | Politecnico Di Milano |
Nazzari, Alessandro | Politecnico Di Milano |
Lovera, Marco | Politecnico Di Milano |
Keywords: Optimization, Decentralized control, Autonomous vehicles
Abstract: We present ATOMICA, Anytime Trajectory Optimization for MultI-drone systems with guaranteed Collision Avoidance, a novel algorithm designed to generate guaranteed collision-free trajectories for multi-UAV systems. Each UAV communicates with the others and treats them as dynamic obstacles within a receding-horizon guidance framework. Recursive feasibility is ensured by maintaining a safe backup trajectory at all times. The time-dependent collision avoidance constraints are efficiently handled using positivity certificates, eliminating the need for potentially unsafe time discretizations while enabling fast collision checking. The non convex optimization problem is solved using the convex concave procedure, which provides ATOMICA with anytime capability, allowing users to predefine the duration of each replanning step. We evaluate the algorithm through simulations, demonstrating a 22% reduction in mission duration compared to state-of-the-art methods. Additionally, we validate its real-time capabilities through real-world experiments.
|
|
10:15-10:30, Paper ThA12.4 | |
Bi-Level Route Optimization and Path Planning with Hazard Exploration |
|
Choi, Jimin | University of Michigan |
Stagg, Grant | Brigham Young University |
Peterson, Cameron | Brigham Young University |
Li, Max | University of Michigan |
Keywords: Autonomous systems, Optimization, Uncertain systems
Abstract: Effective risk monitoring in dynamic environments such as disaster zones requires an adaptive exploration strategy to detect hidden threats. We propose a bi-level unmanned aerial vehicle (UAV) monitoring strategy that efficiently integrates high-level route optimization with low-level path planning for known and unknown hazards. At the high level, we formulate the route optimization as a vehicle routing problem (VRP) to determine the optimal sequence for visiting known hazard locations. To strategically incorporate exploration efficiency, we introduce an edge-based centroidal Voronoi tessellation (CVT), which refines baseline routes using pseudo-nodes and allocates path budgets based on the UAV's battery capacity using a line segment Voronoi diagram. At the low level, path planning maximizes information gain within the allocated path budget by generating kinematically feasible B-spline trajectories. Bayesian inference is applied to dynamically update hazard probabilities, enabling the UAVs to prioritize unexplored regions. Simulation results demonstrate that edge-based CVT improves spatial coverage and route uniformity compared to the node-based method. Additionally, our optimized path planning consistently outperforms baselines in hazard discovery rates across a diverse set of scenarios.
|
|
10:30-10:45, Paper ThA12.5 | |
Sharp Hybrid Zonotopes: Set Operations and the Reformulation-Linearization Technique |
|
Glunt, Jonah | The Pennsylvania State University |
Robbins, Joshua | The Pennsylvania State University |
Siefert, Jacob | Pennsylvania State University |
Silvestre, Daniel | NOVA University of Lisbon |
Pangborn, Herschel | The Pennsylvania State University |
Keywords: Hybrid systems, Optimization
Abstract: Mixed integer set representations, and specifically hybrid zonotopes, have enabled new techniques for reachability and verification of nonlinear and hybrid systems. Mixed-integer sets which have the property that their convex relaxation is equal to their convex hull are said to be sharp. This property allows the convex hull to be computed with minimal overhead, and is known to be important for improving the convergence rates of mixed-integer optimization algorithms that rely on convex relaxations. This paper examines methods for formulating sharp hybrid zonotopes and provides sharpness-preserving methods for performing several key set operations. The paper then shows how the reformulation-linearization technique can be applied to create a sharp realization of a hybrid zonotope that is initially not sharp. A numerical example applies this technique to find the convex hull of a level set of a feedforward ReLU neural network.
|
|
10:45-11:00, Paper ThA12.6 | |
Online Feedback Optimization for Monotone Systems without Timescale Separation |
|
Bianchi, Mattia | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Optimization algorithms, Stability of nonlinear systems, Optimization
Abstract: Online Feedback Optimization steers a dynamical plant to a cost-efficient steady-state, only relying on input-output sensitivity information, rather than on a full plant model. Unlike traditional feedforward approaches, OFO leverages real-time measurements from the plant, thereby inheriting the robustness and adaptability of feedback control. Unfortunately, existing theoretical guarantees for OFO assume that the controller operates on a slower timescale than the plant, which can affect responsiveness and transient performance. In this paper, we focus on relaxing this ``timescale separation'' assumption. Specifically, we consider the class of monotone systems, and we prove that OFO can achieve an optimal operating point, regardless of the time constants of controller and plant. By leveraging a small gain theorem for monotone systems, we derive several sufficient conditions for global convergence. Notably, these conditions depend only on the steady-state behavior of the plant, and are entirely independent of its transient dynamics.
|
|
11:00-11:15, Paper ThA12.7 | |
Probabilistic Reachability-Driven Robust Trajectory Optimization for a Multirotor in Uncertain Environments |
|
Zhu, Yutong | Northwestern Polytechnical University |
Zhang, Ye | Northwestern Polytechnical University |
Keywords: Uncertain systems, Hybrid systems, Optimization
Abstract: This paper presents a unified framework for robust trajectory planning and optimization of multirotor systems in uncertain environments, addressing challenges posed by stochastic disturbances, model inaccuracies, and safety-critical constraints. Traditional deterministic methods often yield overly conservative solutions, while existing probabilistic approaches struggle with computational complexity and scalability. To bridge these gaps, we integrate probabilistic reachability analysis with scenario-based convex program. The framework is validated through numerical examples demonstrating collision-free navigation in cluttered environments with stochastic obstacles, outperforming baseline methods in both safety and computational efficiency. This work advances robust trajectory planning by harmonizing probabilistic safety guarantees with tractable optimization under uncertainty.
|
|
11:15-11:30, Paper ThA12.8 | |
Optimization Outperforms Unscented Techniques for Nonlinear Smoothing |
|
Howell, Payton | University of Washington |
Aravkin, Aleksandr | Dept. Applied Mathematics, University of Washington |
Keywords: Kalman filtering, Optimization, Nonlinear systems
Abstract: We review optimization-based approaches to smoothing nonlinear dynamical systems. These approaches leverage the fact that the Extended Kalman Filter and corresponding smoother can be framed as the Gauss-Newton method for a nonlinear least squares maximum a posteriori loss, and stabilized with standard globalization techniques. We compare the performance of the Optimized Kalman Smoother (OKS) to Unscented Kalman smoothing techniques, and show that they achieve significant improvement for highly nonlinear systems, particularly in noisy settings. The comparison is performed across standard parameter choices (such as the trade-off between process and measurement terms). To our knowledge, this is the first comparison of these methods in the literature.
|
|
ThA13 |
Oceania IX |
Game Theory I |
Regular Session |
Chair: Casbeer, David W. | Air Force Research Laboratory |
Co-Chair: Trivedi, Ashutosh | University of Colorado Boulder |
|
09:30-09:45, Paper ThA13.1 | |
On the Convergence of Gradient Descent in Scalar Two-Agent Infinite-Horizon LQ Games |
|
Salizzoni, Giulio | EPFL |
Kamgarpour, Maryam | EPFL |
Keywords: Game theory, Optimization algorithms, Linear systems
Abstract: In the context of infinite-horizon general-sum linear quadratic (LQ) games, the convergence of gradient descent remains a significant yet not completely understood issue. While the convergence in the finite-horizon setting has already been proved in [1], the extension to infinite setting is challenging. We focus on a specific instance, the two-agent scalar game, and prove the algorithm's convergence in this simplified scenario. Using this example as a foundation, we demonstrate that even in the presence of equilibria that are saddle point of the gradient descent dynamics, gradient descent can still converge, and that the inclusion of noise is unnecessary.
|
|
09:45-10:00, Paper ThA13.2 | |
Decision-Making on Timing and Route Selection: A Game-Theoretic Approach |
|
Wang, Chenlan | University of Michigan, Ann Arbor |
Liu, Mingyan | University of Michigan |
Keywords: Game theory, Modeling
Abstract: We present a Stackelberg game model to investigate how individuals make their decisions on timing and route selection. Group formation can naturally result from these decisions, but only when individuals arrive at the same time and choose the same route. Although motivated by bird migration, our model applies to scenarios such as traffic planning, disaster evacuation, and other animal movements. Early arrivals secure better territories, while traveling together enhances navigation accuracy, foraging efficiency, and energy efficiency. Longer or more difficult migration routes reduce predation risks but increase travel costs, such as higher elevations and scarce food resources. Our analysis reveals a richer set of subgame perfect equilibria (SPEs) and heightened competition, compared to earlier models focused only on timing. By incorporating individual differences in travel costs, our model introduces a ``neutrality" state in addition to ``cooperation" and ``competition."
|
|
10:00-10:15, Paper ThA13.3 | |
Objective Improvement Algorithm for Controller Synthesis in Uncertain Environments |
|
Dell’Erba, Daniele | University of Liverpool |
Schewe, Sven | The University of Liverpool |
Trivedi, Ashutosh | University of Colorado Boulder |
Keywords: Game theory, Uncertain systems, Decentralized control
Abstract: Stochastic games provide a powerful framework for controller synthesis in multi-agent systems where cooperation between agents cannot be assumed. They also serve as a core model for modular and decentralized control, where interactions between components can be captured using assume-guarantee contracts. In this setting, synthesis for an individual module reduces to computing a policy robust to the behavior of the environment, modeled as a stochastic two-player game. Many control objectives reduce to reachability objectives, leading to the study of simple stochastic games, a well-known class whose exact computational complexity remains unresolved. A classic result of Condon reformulates the value and policy computation in such games as a quadratic program—a linear program with a quadratic objective. Motivated by their “almost linear” structure, we ask whether efficient linear programming solvers can be leveraged by iterating over a sequence of linear objectives. We introduce the Objective Improvement Algorithm, which iteratively solves linear programs to compute the optimal value and policy. Unlike strategy improvement, our method treats both players symmetrically, and unlike value iteration, it terminates with the optimal value in finitely many steps. We prove convergence and correctness and present experimental results demonstrating practical effectiveness.
|
|
10:15-10:30, Paper ThA13.4 | |
More Information Is Not Always Better: Connections between Zero-Sum Local Nash Equilibria in Feedback and Open-Loop Information Patterns |
|
Gupta, Kushagra | The University of Texas at Austin |
Allen, Ross | MITLL |
Fridovich-Keil, David | The University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Game theory, Optimization
Abstract: Noncooperative dynamic game theory provides a principled approach to modeling sequential decision-making among multiple noncommunicative agents. A key focus is on finding Nash equilibria in two-agent zero-sum dynamic games under various information structures. A well-known result states that in linear-quadratic games, unique Nash equilibria under feedback and open-loop information structures yield identical trajectories. Motivated by two key perspectives---(i) real-world problems extend beyond linear-quadratic settings and lack unique equilibria, making only local Nash equilibria computable, and (ii) local open-loop Nash equilibria (OLNE) are easier to compute than local feedback Nash equilibria (FBNE)---it is natural to ask whether a similar result holds for local equilibria in zero-sum games. To this end, we establish that for a broad class of zero-sum games with potentially nonconvex-nonconcave objectives and nonlinear dynamics: (i) the state/control trajectory of a local FBNE satisfies local OLNE first-order optimality conditions, and vice versa, (ii) a local FBNE trajectory satisfies local OLNE second-order necessary conditions, (iii) a local FBNE trajectory satisfying feedback sufficiency conditions also constitutes a local OLNE, and (iv) with additional hard constraints on agents' actuations, a local FBNE where strict complementarity holds satisfies local OLNE first-order optimality conditions, and vice versa.
|
|
10:30-10:45, Paper ThA13.5 | |
Continuity and Approximability of Competitive Spectral Radii |
|
Akian, Marianne | INRIA and CMAP, Ecole Polytechnique CNRS |
Gaubert, Stephane | INRIA and Ecole Polytechnique |
Marchesini, Loic | CMAP, Ecole Polytechnique, Inria, Institu Polytechnique De Paris |
Morris, Ian | Queen Mary, University of London |
Keywords: Switched systems, Game theory, Optimal control
Abstract: The competitive spectral radius extends the notion of joint spectral radius to the two-player case: two players alternatively select matrices in prescribed compact sets, resulting in an infinite matrix product; one player wishes to maximize the growth rate of this product, whereas the other player wishes to minimize it. We show that when the matrices represent linear operators preserving a cone and satisfying a “strict positivity” assumption, the competitive spectral radius depends continuously — and even in a Lipschitz-continuous way — on the matrix sets. Moreover, we show that the competive spectral radius can be approximated up to any accuracy. This relies on the solution of a discretized infinite dimensional non-linear eigenproblem. We illustrate the approach with an example of age-structured population dynamics.
|
|
10:45-11:00, Paper ThA13.6 | |
Multi-Topic Projected Opinion Dynamics for Resource Allocation |
|
Wankhede, Prashil | Indian Institute of Science |
Mandal, Nirabhra | University of California San Diego |
Martinez, Sonia | University of California at San Diego |
Tallapragada, Pavankumar | Indian Institute of Science |
Keywords: Switched systems, Network analysis and control, Game theory
Abstract: We propose a model of opinion formation on resource allocation among multiple topics by multiple agents, who are subject to hard budget constraints. We define a utility function for each agent and then derive a projected dynamical system model of opinion evolution assuming that each agent myopically seeks to maximize its utility subject to its constraints. Inter-agent coupling arises from an undirected social network, while inter-topic coupling arises from resource constraints. We show that opinions always converge to the equilibrium set. For special networks with very weak antagonistic relations, the opinions converge to a unique equilibrium point. We further show that the underlying opinion formation game is a potential game. We relate the equilibria of the dynamics and the Nash equilibria of the game and characterize the unique Nash equilibrium for networks with no antagonistic relations. Finally, simulations illustrate our findings.
|
|
11:00-11:15, Paper ThA13.7 | |
Deception in Asymmetric Information Homicidal Chauffeur Game |
|
Mahapatra, Shreesh | Indian Institute of Technology Kharagpur |
Jha, Bhargav | Indian Institute of Technology Kharagpur |
Dorothy, Michael | US Army Research Laboratory |
Bopardikar, Shaunak D. | Michigan State University |
Keywords: Game theory, Optimal control, Autonomous systems
Abstract: The classic Homicidal Chauffeur game is a pursuit-evasion game played in an unbounded planar environment between a pursuer constrained to move with fixed speed on curves with bounded curvature, and a slower evader with fixed speed but with simple kinematics. We introduce a new variant of this game with asymmetric information in which the evader has the ability to choose its speed among a finite set of choices that is unknown to the pursuer a priori. Therefore the pursuer is required to estimate the evader's maximum speed based on the observations so far. This formulation leads to the question of whether the evader can exploit this asymmetry by moving deceptively by first picking a slower speed to move with and then switching to a faster speed when a specified relative configuration is attained to increase the capture time as compared to moving with the maximum speed at all times. Our contributions are as follows. First, we derive optimal feedback Nash equilibrium strategies for the complete information case of this game in which the evader is allowed to vary its speed in a given interval. Second, for the version with asymmetric information, we characterize regions of initial player locations in the game space from which the evader does not have any advantage in using deceptive strategies. Finally, we provide numerical evidence of regions in the game space from which the evader can increase the capture time by moving deceptively.
|
|
11:15-11:30, Paper ThA13.8 | |
SIS Epidemic Propagation under Virus Mutation and Game-Theoretic Protection |
|
Maitra, Urmee | Indian Institute of Technology, Kharagpur |
Hota, Ashish R. | Indian Institute of Technology (IIT), Kharagpur |
Srivastava, Vaibhav | Michigan State University |
Keywords: Biological systems, Game theory
Abstract: We study a bi-virus epidemiological model where individuals can either be susceptible or infected by one of two virus strains. We account for mutations that lead to transitions between these two strains. In this work, we primarily focus on uni-directional mutation, and analyze the existence and stability of equilibrium points when mutation is permissible from the strain with a larger reproduction number and infection rate to the other strain. The novelty of our work lies in framing the mutation model within a game-theoretic context and examining the impact of strategic protection adoption on the survival of different virus strains. In this setting, each susceptible individual acts as a player, choosing an action (either adopting protection or remaining unprotected) to maximize its instantaneous payoff. We completely characterize the stationary Nash equilibrium (SNE) of the setting in which both strains coexist, and investigate how mutation rate affects the protection adoption and infection prevalence at the SNE. Finally, we present numerical results to illustrate the effects of mutation rate and cost of protection adoption on the infection prevalence of different strains.
|
|
ThA14 |
Galapagos III |
Control of Uncertain Systems |
Regular Session |
Chair: Lacerda, Marcio J. | London Metropolitan University |
Co-Chair: Kerrigan, Eric C. | Imperial College London |
|
09:30-09:45, Paper ThA14.1 | |
Update-Aware Robust Optimal Model Predictive Control for Nonlinear Systems |
|
Wehbeh, Jad | Imperial College |
Kerrigan, Eric C. | Imperial College London |
Keywords: Robust control, Optimal control, Uncertain systems
Abstract: Robust optimal or min-max model predictive control (MPC) approaches aim to guarantee constraint satisfaction over a known, bounded uncertainty set while minimizing a worst-case performance bound. Traditionally, these methods compute a trajectory that meets the desired properties over a fixed prediction horizon, apply a portion of the resulting input, and then re-solve the MPC problem using newly obtained measurements at the next time step. However, this approach fails to account for the fact that the control trajectory will be updated in the future, potentially leading to conservative designs. In this paper, we present a novel update-aware robust optimal MPC algorithm for decreasing horizon problems on nonlinear systems that explicitly accounts for future control trajectory updates. This additional insight allows our method to provably expand the feasible solution set and guarantee improved worst-case performance bounds compared to existing techniques. Our approach formulates the trajectory generation problem as a sequence of nested existence-constrained semi-infinite programs (SIPs), which can be efficiently solved using local reduction techniques. To demonstrate its effectiveness, we evaluate our approach on a planar quadrotor problem, where it clearly outperforms an equivalent method that does not account for future updates at the cost of increased computation time.
|
|
09:45-10:00, Paper ThA14.2 | |
Optimistic vs Pessimistic Uncertainty Model Unfalsification |
|
Hühnerbein, Jannes | Technical University of Munich |
Wehbeh, Jad | Imperial College |
Kerrigan, Eric C. | Imperial College London |
Keywords: Model Validation, Uncertain systems, Optimization
Abstract: We present a novel, input-output data-driven approach to uncertainty model identification. As the true bounds and distributions of system uncertainties ultimately remain unknown, we depart from the goal of identifying the uncertainty model and instead look for minimal concrete statements that can be made based on an uncertain system model and available input-output data. We refer to this as unfalsifying an uncertainty model. Two different unfalsification approaches are taken. The optimistic approach determines the smallest uncertainties that could explain the given data, while the pessimistic approach finds the largest possible uncertainties suggested by the data. The pessimistic problem is revealed to be a semi-infinite program, which is solved using the local reduction algorithm. It is also shown that the optimistic and pessimistic approaches to uncertainty model unfalsification are mathematical duals. Finally, both approaches are tested using an uncertain linear model with data from a simulated nonlinear system.
|
|
10:00-10:15, Paper ThA14.3 | |
Conformal Contraction for Robust Nonlinear Control with Distribution-Free Uncertainty Quantification |
|
Wei, Sihang | University of Illinois Urbana-Champaign |
Ornik, Melkior | University of Illinois Urbana-Champaign |
Tsukamoto, Hiroyasu | University of Illinois at Urbana-Champaign/NASA JPL |
Keywords: Uncertain systems, Lyapunov methods, Stability of nonlinear systems
Abstract: We present a novel robust control framework for continuous-time, perturbed nonlinear dynamical systems with uncertainty that depends nonlinearly on both the state and control inputs. Unlike conventional approaches that impose structural assumptions on the uncertainty, our framework enhances contraction-based robust control with data-driven uncertainty prediction, remaining agnostic to the models of the uncertainty and predictor. We statistically quantify how reliably the contraction conditions are satisfied under dynamics with uncertainty via conformal prediction, thereby obtaining a distribution-free and finite-time probabilistic guarantee for exponential boundedness of the trajectory tracking error. We further propose the probabilistically robust control invariant (PRCI) tube for distributionally robust motion planning, within which the perturbed system trajectories are guaranteed to stay with a finite probability, without explicit knowledge of the uncertainty model. Numerical simulations validate the effectiveness of the proposed robust control framework and the performance of the PRCI tube.
|
|
10:15-10:30, Paper ThA14.4 | |
Memory Switching Control for Discrete-Time Switched Uncertain Linear Systems |
|
Souza, Andressa M. | University of Campinas |
Oliveira, Ricardo C. L. F. | University of Campinas - UNICAMP |
Peres, Pedro L. D. | University of Campinas |
Keywords: Switched systems, Uncertain systems, LMIs
Abstract: This paper presents a new strategy that incorporates past state measurements to enhance the design of stabilizing switching laws for discrete-time switched uncertain linear systems. Inspired by techniques used in uncertain and time-varying linear systems, conditions that extend the Lyapunov-Metzler inequalities by embedding past states into the switching rule are proposed. The objective is to achieve improved performance in terms of less conservative bounds on the decay rate of the trajectories of the system. The proposed approach eliminates the need for grid-based strategies commonly required in conventional methods, resulting in more efficient numerical procedures. A numerical example illustrates the effectiveness of the approach, highlighting its advantages over existing methods.
|
|
10:30-10:45, Paper ThA14.5 | |
Output-Feedback Model Predictive Control under Dynamic Uncertainties Using Integral Quadratic Constraints |
|
Schwenkel, Lukas | University of Stuttgart |
Köhler, Johannes | ETH Zurich |
Müller, Matthias A. | Leibniz University Hannover |
Allgöwer, Frank | University of Stuttgart |
Keywords: Predictive control for linear systems, Robust control
Abstract: In this work, we propose an output-feedback tube-based model predictive control (MPC) scheme for linear systems under dynamic uncertainties that are described via integral quadratic constraints (IQC). By leveraging IQCs, a large class of nonlinear and dynamic uncertainties can be addressed. We leverage recent IQC synthesis tools to design a dynamic controller and an observer that are robust to these uncertainties and minimize the size of the resulting constraint tightening in the MPC. Thereby, we show that the robust estimation problem using IQCs with peak-to-peak performance can be convexified. We guarantee recursive feasibility, robust constraint satisfaction, and input-to-state stability of the resulting MPC scheme.
|
|
10:45-11:00, Paper ThA14.6 | |
Safe Control Design for Uncertain Linear Systems under Input Saturation Using Lyapunov Barrier Functions |
|
Lacerda, Marcio J. | London Metropolitan University |
Silva, Felipe Augusto | Federal University of Sao Joao Del-Rei |
Keywords: Resilient Control Systems, LMIs, Uncertain systems
Abstract: This paper proposes conditions for the design of safe, robust state-feedback controllers and barrier certificates for uncertain polytopic continuous-time linear systems subject to input saturation. The generalized sector condition is employed to assess the presence of saturation and the level set of the Lyapunov function is utilized to define the barrier function. Numerical experiments are used to illustrate the features of the proposed method.
|
|
11:00-11:15, Paper ThA14.7 | |
A Model-Free Approach to Control Barrier Functions Using Funnel Control |
|
Lanza, Lukas | Technische Universität Ilmenau |
Köhler, Johannes | ETH Zurich |
Dennstädt, Dario | Universität Paderborn |
Berger, Thomas | Universität Paderborn |
Worthmann, Karl | Technische Universität Ilmenau |
Keywords: Nonlinear output feedback, Constrained control, Uncertain systems
Abstract: Control barrier functions (CBFs) are a popular approach to design feedback laws that achieve safety guarantees for nonlinear systems. The CBF-based controller design relies on the availability of a model to select feasible inputs from the set of CBF-based controls. In this paper, we develop a model-free approach to design CBF-based control laws, eliminating the need for knowledge of system dynamics or parameters. Specifically, we address safety requirements characterized by a time-varying distance to a reference trajectory in the output space and construct a CBF that depends only on the measured output. Utilizing this particular CBF, we determine a subset of CBF-based controls without relying on a model of the dynamics by using techniques from funnel control. The latter is a model-free high-gain adaptive control methodology, which achieves tracking guarantees via reactive feedback. In this paper, we discover and establish a connection between the modular controller synthesis via zeroing CBFs and model-free reactive feedback. The theoretical results are illustrated by a numerical simulation.
|
|
11:15-11:30, Paper ThA14.8 | |
How Partial Knowledge Affects Decision Support Process: A Multi-Criteria Decision-Making Approach to City Selection for Quality of Life |
|
Więckowski, Jakub | National Institute of Telecommunications |
Salabun, Wojciech | National Institute of Telecommunications |
Keywords: Computational methods, Uncertain systems, Modeling
Abstract: Decision-making in complex scenarios often involves multiple, conflicting criteria, making it difficult for decision-makers to precisely determine the relative importance of each one. Traditional Multi-Criteria Decision Analysis (MCDA) methods, which rely on crisp data, typically assume complete knowledge of criteria weights, an assumption that is rarely met in practice. This study addresses this limitation by incorporating partial knowledge of decision-makers into the weighting process without relying on fuzzy logic. Unlike standard approaches that rely on a single predefined weight vector, the proposed method generates diverse weight distributions spanning the entire space of feasible criteria weights. The Partial Knowledge Weighting (PKW) is integrated with the Stable Preference Ordering Towards Ideal Solution (SPOTIS) method and tested in the context of city selection for quality of life, where decision-makers often cannot rank all criteria precisely. The methodology progressively incorporates known ranking relationships while generating consistent distributions for the remaining weights. Results show that increasing the number of known ranking relationships reduces uncertainty in weight assignments and leads to more refined decision outcomes. The study demonstrates that accommodating partial knowledge significantly enhances the robustness and applicability of MCDA, supporting more reliable decision support systems in real-world contexts characterized by incomplete information.
|
|
ThA15 |
Capri II |
Stochastic Optimal Control I |
Regular Session |
Chair: Oguri, Kenshiro | Purdue University |
Co-Chair: Lestas, Ioannis | University of Cambridge |
|
09:30-09:45, Paper ThA15.1 | |
Discrete-Time Mean-Field-Type Control Problems with Higher-Order Costs |
|
Barreiro-Gomez, Julian | Khalifa University |
Duncan, Tyrone E. | Univ. of Kansas |
Pasik-Duncan, Bozenna | Univ. of Kansas |
Tembine, Hamidou | NYU |
Keywords: Stochastic optimal control, Optimal control
Abstract: Traditional solvable optimal control theory mainly addresses quadratic costs due to its analytical tractability. Nevertheless, quadratic costs are not appropriate to model critical non-linearities found in many real systems such as water, energy, agriculture, financial networks, among many others. In this paper, we present a unified framework for solving discrete-time optimal control problems with higher-order state and control costs. To this end, we rely on convex-completion techniques, and derive semi-explicit solutions. Key contributions include variance-aware solutions under additive and multiplicative noise. We show that higher-order costs induce less aggressive control policies compared to quadratic formulations, a finding that is validated through numerical analyses.
|
|
09:45-10:00, Paper ThA15.2 | |
Soft-Constrained Stochastic MPC of Markov Jump Linear Systems: Application to Real-Time Control with Deadline Overruns |
|
Gallant, Melanie | Robert Bosch GmbH |
Mark, Christoph | Robert Bosch GmbH |
Pazzaglia, Paolo | Robert Bosch GmbH |
von Keler, Johannes | Robert Bosch GmbH |
Beermann, Laura | Bosch |
Schmidt, Kevin | Robert Bosch GmbH |
Maggio, Martina | Saarland University |
Keywords: Stochastic optimal control, Predictive control for linear systems, Markov processes
Abstract: Modern real-time control systems can sporadically exceed the computation deadlines, which may lead to a deterioration in performance or even instability if not actively accounted for. This letter proposes a stochastic model predictive control approach that incorporates deadline miss probabilities of subsequent control task executions in a scenario tree. To account for the effect of missed deadlines, we utilize Markov jump linear systems that allow us to prove mean-square stability and recursive feasibility under hard input and mixed hard/soft state constraints. The proposed stochastic controller is benchmarked using a Furuta pendulum, demonstrating improved performance and an increased feasible region compared to a nominal and a hard-constrained stochastic controller, respectively.
|
|
10:00-10:15, Paper ThA15.3 | |
Operator Splitting Covariance Steering for Safe Stochastic Nonlinear Control |
|
Ratheesh Babu, Akash | Georgia Institute of Technology |
Pacelli, Vincent | Georgia Institute of Technology |
Saravanos, Augustinos D. | Georgia Institute of Technology |
Theodorou, Evangelos A. | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Optimization algorithms, Constrained control
Abstract: This paper presents a novel algorithm for solving distribution steering problems featuring nonlinear dynamics and chance constraints. Covariance steering (CS) is an emerging methodology in stochastic optimal control that poses constraints on the first two moments of the state distribution — thereby being more tractable than full distributional control. Nevertheless, a significant limitation of current approaches for solving nonlinear CS problems, such as sequential convex programming (SCP), is that they often generate infeasible or poor results due to the large number of constraints. In this paper, we address these challenges, by proposing an operator splitting CS approach that temporarily decouples the full problem into subproblems that can be solved in parallel. This relaxation does not require intermediate iterates to satisfy all constraints simultaneously prior to convergence, which enhances exploration and improves feasibility in such non-convex settings. Simulation results across a variety of robotics applications verify the ability of the proposed method to find better solutions even under stricter safety constraints than standard SCP. Finally, the applicability of our framework on real systems is also confirmed through hardware demonstrations.
|
|
10:15-10:30, Paper ThA15.4 | |
Hands-Off Covariance Steering: Inducing Feedback Sparsity Via Iteratively Reweighted ell_{1, P} Regularization |
|
Kumagai, Naoya | Purdue University |
Oguri, Kenshiro | Purdue University |
Keywords: Stochastic optimal control, Stochastic systems, Optimal control
Abstract: We consider the problem of optimally steering the state covariance matrix of a discrete-time linear stochastic system to a desired terminal covariance matrix, while inducing the control input to be zero over many time intervals. We propose to induce sparsity in the feedback gain matrices by using a sum-of-norms version of the iteratively reweighted ell_1-norm minimization. We show that the lossless convexification property holds even with the regularization term. Numerical simulations show that the proposed method produces a Pareto front of transient cost and sparsity that is not achievable by a simple ell_1-norm minimization and closely approximates the ell_0-norm minimization obtained from brute-force search.
|
|
10:30-10:45, Paper ThA15.5 | |
Optimal Control of Stochastic Networks of M/M/∞ Queues with Linear Costs |
|
Pugliese Carratelli, Giovanni | University of Cambridge |
Lestas, Ioannis | University of Cambridge |
Keywords: Queueing systems, Stochastic optimal control, Optimal control
Abstract: We consider an arbitrary network of M/M/∞ queues with controlled transitions between queues. We consider optimal control problems where the costs are linear functions of the state and inputs over a finite horizon or infinite. We provide in both cases an explicit characterization of the optimal control policies. We also show that these do not involve state feedback, but they depend on the network topology and system parameters. The results are also illustrated with various examples.
|
|
10:45-11:00, Paper ThA15.6 | |
Piecewise Control Barrier Functions for Safe Control of Stochastic Systems |
|
Mazouz, Rayan | University of Colorado Boulder |
Laurenti, Luca | TU Delft |
Lahijanian, Morteza | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis, Stochastic systems, Lyapunov methods
Abstract: This paper presents a method for the simultaneous synthesis of a barrier certificate and a safe controller for discrete-time nonlinear stochastic systems. Our approach, based on piecewise stochastic control barrier functions, reduces the synthesis problem to a minimax optimization, which we solve exactly using a dual linear program with zero gap. This enables the joint optimization of the barrier certificate and safe controller within a single formulation. The method accommodates stochastic dynamics with additive noise and a bounded continuous control set. The synthesized controllers and barrier certificates provide a formally guaranteed lower bound on probabilistic safety. Case studies on linear and nonlinear stochastic systems validate the effectiveness and scalability of our approach.
|
|
11:00-11:15, Paper ThA15.7 | |
On the Risk Levels of Distributionally Robust Chance Constrained Problems |
|
Heinlein, Moritz | TU Dortmund University |
Alamo, Teodoro | Universidad De Sevilla |
Lucia, Sergio | TU Dortmund University |
Keywords: Uncertain systems, Randomized algorithms, Stochastic optimal control
Abstract: In this paper, we discuss the utilization of perturbed risk levels (PRLs) for the solution of chance-constrained problems via sampling-based approaches. PRLs allow the consideration of distributional ambiguity by rescaling the risk level of the nominal chance constraint. Explicit expressions of the PRL exist for some discrepancy-based ambiguity sets. We propose a discrepancy functional not included in previous comparisons of different PRLs based on the likelihood ratio, which we term ,,relative variation distance" (RVD). If the ambiguity set can be described by the RVD, the rescaling of the risk level with the PRL is in contrast to other discrepancy functionals possible even for very low risk levels. We derive distributionally robust one- and two-level guarantees for the solution of chance-constrained problems with randomized methods. We demonstrate the viability of the derived guarantees for a randomized MPC under distributional ambiguity.
|
|
11:15-11:30, Paper ThA15.8 | |
Parameter Invariance Analysis of Moment Equations Using Dulmage-Mendelsohn Decomposition |
|
Igarashi, Akito | Keio University |
Hori, Yutaka | Keio University |
Keywords: Biomolecular systems, Stochastic systems, Genetic regulatory systems
Abstract: Living organisms maintain stable functioning amid environmental fluctuations through homeostasis, a property that preserves a system's behavior despite changes in environmental conditions. To elucidate homeostasis in stochastic biochemical reactions, theoretical tools for assessing population level invariance under parameter perturbations are crucial. In this paper, we propose a systematic method for identifying the stationary moments that remain invariant under parameter perturbations by leveraging the structural properties of the stationary moment equations. A key step in this development is addressing the underdetermined nature of moment equations, which has traditionally made it difficult to characterize how stationary moments depend on system parameters. To overcome this, we utilize the Dulmage-Mendelsohn (DM) decomposition of the coefficient matrix to extract welldetermined subequations and reveal their hierarchical structure. Leveraging this struc ture, we identify stationary moments whose partial derivatives with respect to parameters are structurally zero, facilitating the exploration of fundamental constraints that govern homeostatic behavior in stochastic biochemical systems.
|
|
ThA16 |
Capri III |
Nonlinear Systems Control IV |
Regular Session |
Chair: Como, Giacomo | Politecnico Di Torino |
Co-Chair: Sandberg, Henrik | KTH Royal Institute of Technology |
|
09:30-09:45, Paper ThA16.1 | |
Terrain-Following Guidance for Underwater Vehicles |
|
Basso, Erlend Andreas | Norwegian University of Science and Technology |
Schmidt-Didlaukies, Henrik M. | Norwegian University of Science and Technology |
Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Keywords: Maritime control, Robotics, Nonlinear systems
Abstract: This paper presents a novel terrain-following guidance scheme for underwater vehicles utilizing full orientation actuation. Unlike existing approaches that primarily rely on pitch control, the proposed method exploits both roll and pitch to enable more accurate adaptation to complex terrain. The proposed control law simultaneously accomplishes three objectives: (i) it aligns the vehicle's body-fixed downward axis with the seafloor's normal vector, (ii) ensures convergence to a horizontal trajectory, and (iii) maintains a specified altitude. We prove that the proposed control law ensures global asymptotic stability of a desired altitude and horizontal path, and that the body-fixed downward axis is aligned with the seafloor's normal vector at this equilibrium. The effectiveness of the proposed guidance approach is demonstrated through numerical simulations of a six-degree-of-freedom dynamic model of the REMUS 100 AUV.
|
|
09:45-10:00, Paper ThA16.2 | |
On Phase in Scaled Graphs |
|
van den Eijnden, Sebastiaan | Eindhoven University of Technology |
Chen, Chao | The University of Manchester |
Scheres, Koen | Eindhoven University of Technology |
Chaffey, Thomas | University of Sydney |
Lanzon, Alexander | University of Manchester |
Keywords: Stability of nonlinear systems, Nonlinear systems
Abstract: The scaled graph has been introduced recently as a nonlinear extension of the classical Nyquist plot for linear time-invariant systems. In this paper, we introduce a modified definition for the scaled graph, termed the signed scaled graph (SSG), in which the phase component is characterized by making use of the Hilbert transform. Whereas the original definition of the scaled graph uses unsigned phase angles, the new definition has signed phase angles which ensures the possibility to differentiate between phase-lead and phase-lag properties in a system. Making such distinction is important from both an analysis and a synthesis perspective, and helps in providing tighter stability estimates of feedback interconnections. We show how the proposed SSG leads to intuitive characterizations of positive real and negative imaginary nonlinear systems, and present various interconnection results. We showcase the effectiveness of our results through a motivating example.
|
|
10:00-10:15, Paper ThA16.3 | |
Iterative Approximations of Periodic Trajectories for Nonlinear Systems with Discontinuous Inputs |
|
Zuyev, Alexander | Max Planck Institute for Dynamics of Complex Systems |
Benner, Peter | Max Planck Institute for Dynamics of Complex TechnicalSystems |
Keywords: Algebraic/geometric methods, Time-varying systems, Computational methods
Abstract: Nonlinear control-affine systems described by ordinary differential equations with bounded measurable input functions are considered. The solvability of a broad class of boundary value problems for these systems is formulated in the sense of Carathéodory solutions. It is shown that, under the dominant linearization assumption, the considered class of boundary value problems admits a unique solution for any admissible control. These solutions can be obtained as the limit of the proposed simple iterative scheme and, in the case of periodic boundary conditions, via the developed Newton-type schemes. Under additional technical assumptions, sufficient contraction conditions of the corresponding generating operators are derived analytically. The proposed iterative approach is applied to compute periodic solutions of a realistic chemical reaction model with discontinuous control inputs.
|
|
10:15-10:30, Paper ThA16.4 | |
A Fast Discrete-Time Disturbance Observer for a Rotating Rigid Body |
|
Wang, Ningshan | University of Michigan |
Sanyal, Amit | Syracuse University |
Keywords: Aerospace, Stability of nonlinear systems, Flight control
Abstract: This article presents a disturbance torque estimation scheme for a multi-rotor vehicle modeled as a rotating rigid body in the presence of unknown disturbance torque and measurement uncertainties. The proposed estimation scheme depends on the knowledge of rotational inertia, control inputs, and measurements from a combination of inertial measurement units, consisting of a three-axis gyroscope placed at the center of the vehicle and a set of three-axis accelerometers placed around the gyroscope. The angular velocity and angular acceleration vectors are estimated using a simple linear Kalman filter. Thereafter, these estimated motion states are sent to the disturbance observer to provide disturbance torque estimation. A Lyapunov stability analysis proves that the proposed estimation scheme is discrete-time finite-time stable. The estimation scheme is discretized as a geometric integrator for numerical simulations and practical implementations. Numerical simulations involve the modeling of industrial-grade inertial measurement units to demonstrate the feasibility, stability, and robustness properties of the proposed scheme.
|
|
10:30-10:45, Paper ThA16.5 | |
On Resilience Guarantees by Finite-Time Robust Control Barrier Functions with Application to Power Inverter Networks |
|
Hassan, Kamil | KTH Royal Institute of Technology, Sweden |
Selvaratnam, Daniel | KTH Royal Institute of Technology |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Resilient Control Systems, Nonlinear systems, Power systems
Abstract: In this study, a control theoretic description of resilience is provided to quantify the characteristics of a resilient system. The aim is to establish a paradigm for resilient control design based on tangible control objectives that yield desirable attributes for safety-critical systems. In that regard, durability and recoverability properties are identified as key components of the proposed resilience framework and, to offer a methodology to enforce these attributes, the notion of finite-time robust control barrier function (FR-CBF) is introduced. Furthermore, to offer a comprehensive treatment of the problem, resilient control design is investigated for both continuous and sampled-data systems. To that end, FR-CBF-based design conditions for both continuous and piece-wise constant zero-order hold (ZOH) control inputs are included. Moreover, to provide a concrete example of how the proposed framework could be adopted for safety-critical control applications, in this study we also investigate the voltage regulation problem for inverter-interfaced radial power distribution networks subject to adversarial injections. In that regard, sufficient conditions for both the continuous and sampled-data ZOH control are derived to guarantee finite-time recovery and safe operation of the distribution grid in accordance with the proposed resilience framework. Finally, the efficacy of the proposed results is advocated using a simulation study showing resilient grid performance in the presence of the ‘worst-case’ power injection attack, as reported in (Lindström et al. 2021).
|
|
10:45-11:00, Paper ThA16.6 | |
Behavioral-Feedback SIR Epidemic Model: Analysis and Control |
|
Alutto, Martina | Politecnico Di Torino |
Cianfanelli, Leonardo | Politecnico Di Torino |
Como, Giacomo | Politecnico Di Torino |
Fagnani, Fabio | Politecnico Di Torino |
Parise, Francesca | Cornell University |
Keywords: Compartmental and Positive systems, Nonlinear systems
Abstract: This paper investigates a behavioral-feedback SIR model in which the infection rate adapts dynamically based on the fractions of susceptible and infected individuals. We introduce an invariant of motion and we characterize the peak of infection. We further examine the system under a threshold constraint on the infection level. Based on this analysis, we formulate an optimal control problem to keep the infection curve below a healthcare capacity threshold while minimizing the economic cost. For this problem, we study a feasible strategy that involves applying the minimal necessary restrictions to meet the capacity constraint and characterize the corresponding cost.
|
|
11:00-11:15, Paper ThA16.7 | |
Robustly Stabilizing Lyapunov-Based Control for a Multi-Input DC-DC Converter with Output Load Estimation |
|
Merchán Riveros, María Camila | Universidad De Sevilla |
Sferlazza, Antonino | University of Palermo |
Garraffa, Giovanni | University of Palermo |
Zaccarian, Luca | LAAS-CNRS |
Albea, Carolina | University of Seville, Spain |
Keywords: Nonlinear output feedback, Lyapunov methods, Power electronics
Abstract: We propose a robustly stabilizing Lyapunov-based control scheme for a Multi-Input Converter using a Nonlinear Disturbance Observer for the load current estimation. The closed-loop system ensures an output voltage regulation and eliminates the requirement of knowing the current load, thus mitigating the impact of current fluctuations, without relying on typically inaccessible or impractical measurements. Robust asymptotic stability is guaranteed by Lyapunov theory. The main result is validated by simulations and experiments.
|
|
11:15-11:30, Paper ThA16.8 | |
Parameter-Dependent Control Lyapunov Functions for Stabilizing Nonlinear Parameter-Varying Systems |
|
Zhao, Pan | University of Alabama |
Keywords: Stability of nonlinear systems, Robust control, Linear parameter-varying systems
Abstract: This paper introduces the concept of parameter-dependent (PD) control Lyapunov functions (CLFs) for gain-scheduled stabilization of nonlinear parameter-varying (NPV) systems. It shows that given a PD-CLF, a min-norm control law can be constructed by solving a robust quadratic program. For polynomial control-affine NPV systems, it provides convex conditions, based on the sum of squares programming, to jointly synthesize a PD-CLF and a PD controller, while maximizing the PD region of stabilization. Simulation results validate the efficacy of the proposed method.
|
|
ThA17 |
Capri IV |
Robust Control IV |
Regular Session |
Chair: Turner, Matthew C. | University of Southampton |
Co-Chair: Peixoto, Marcia Luciana da Costa | Université Polytechnique Hauts-De-France |
|
09:30-09:45, Paper ThA17.1 | |
Gain-Scheduled Symbiotic Control of Dynamical Systems with Nonparametric Uncertainties |
|
Naranjo, Cristian | University of South Florida |
Yucelen, Tansel | University of South Florida |
Kurtoglu, Deniz | University of South Florida |
Hrynuk, John | DEVCOM Army Research Lab |
Keywords: Robust adaptive control, LMIs
Abstract: This paper extends the recently introduced symbiotic control framework, which combines fixed-gain and adaptive control methods to suppress the uncertainties in a more predictable way and without the necessities for explicit uncertainty bounds. While the original framework was limited to linear systems, in this paper we generalize it to nonlinear behavior models across different operating conditions with the use of a gain-scheduling approach. The proposed method uses a family of controllers, each designed for a specific equilibrium point, to handle nonparametric uncertainties without requiring prior knowledge of uncertainty bounds. A numerical example is also provided to demonstrate the effectiveness of the proposed gain-scheduled symbiotic control approach.
|
|
09:45-10:00, Paper ThA17.2 | |
Complementary Tracking Control for Linear Systems Subject to External Disturbances and Stochastic Noise |
|
Xu, Jiapeng | University of Windsor |
Chen, Guanrong | City University of Hong Kong |
Chen, Xiang | University of Windsor |
Zhou, Kemin | Nanjing University |
Keywords: Robust control, Optimal control, Linear systems
Abstract: This paper is concerned with designing a multiobjective tracking controller for linear time-invariant systems subject to both unknown external disturbances and stochastic noise simultaneously. The proposed approach involves two controllers designed separately: a nominal tracking controller that achieves optimal tracking performance and a mixed H_2/H_infty controller that addresses the impacts of external disturbances and stochastic noise. The two controllers are then integrated together to ensure that the nominal optimal tracking performance and the mixed H_2/H_infty performance are complementary to each other, thereby overcoming the conservativeness exhibited in existing robust tracking control designs. Numerical comparisons demonstrate the advantages of the proposed controller over existing ones.
|
|
10:00-10:15, Paper ThA17.3 | |
L2 Gain for Ultimately Bounded Systems with Application to Quantized Input Systems |
|
Turner, Matthew C. | University of Southampton |
Richards, Christopher | University of Louisville |
Keywords: Robust control, Lyapunov methods, Constrained control
Abstract: This paper discusses the L2 gain of nonlinear systems which, in the absence of exogenous inputs, are uniformly ultimately bounded. Unlike for many systems which are asymptotically stable in the absence of exogenous inputs, it is not possible to infer standard L2 gain conditions for systems which are only known to be ultimately bounded. This paper gives another interpretation of L2 gain and shows that, under some assumptions, consideration of a typical L2 performance objective leads to satisfaction of this new interpretation of L2 gain - called the dist-L2 gain. This is harnessed for performance analysis of systems with quantized inputs and an approach to computing the dist L2 gain is presented.
|
|
10:15-10:30, Paper ThA17.4 | |
Robust Stabilizing Control of Semi-Markov Jump Linear Systems with Decay Rate Guarantees |
|
de Oliveira, André M. | Universidade Federal De São Paulo (UNIFESP) |
Costa, Oswaldo Luiz V. | Univ. of Sao Paulo |
Keywords: Robust control, Markov processes, LMIs
Abstract: This paper deals with the problem of robust mean-square stabilization of Semi-Markov jump linear systems with a guaranteed decay rate. It is considered that the jump times are modeled through phase-type (PH) distributions and that the system parameters are subject to polytopic uncertainties. The goal is to design state feedback controllers that robustly stabilize the closed-loop system while satisfying an exponential decay rate, thus having better control over the transient behavior of the system. The design problem is written in linear matrix inequalities (LMI) so that it can be readily implemented numerically. The paper concludes with a numerical example.
|
|
10:30-10:45, Paper ThA17.5 | |
Feedback Stability under Mixed Gain and Phase Uncertainty |
|
Liang, Jiajin | Hong Kong University of Science and Technology |
Zhao, Di | Nanjing University |
Qiu, Li | Hong Kong Univ. of Sci. & Tech |
Keywords: Stability of linear systems, Robust control, Uncertain systems
Abstract: In this work, we proposed and studied a matrix sectored-disk problem in which we wish to determine the invertibility of a matrix with mixed gain and phase uncertainty. The matrix sectored-disk problem can then be carried on to the robust feedback stability problem for multiple-input-multiple-output (MIMO) linear time-invariant (LTI) systems involving sectored-disk uncertainty, namely, dynamic uncertainty subject to simultaneous gain and phase constraints. This problem is thereby called an LTI system sectored-disk problem.
|
|
10:45-11:00, Paper ThA17.6 | |
On the Equivalence between Functionally Affine LPV State-Space Representations and LFT Models |
|
Petreczky, Mihaly | UMR CNRS 9189, Ecole Centrale De Lille |
Alkhoury, Ziad | University Od Stasbourg |
Mercère, Guillaume | University of Poitiers |
Keywords: Algebraic/geometric methods, Linear parameter-varying systems, Robust control
Abstract: In this paper, we propose a transformation algorithm for a class of Linear Parameter-Varying (LPV) systems with functional affine dependence on parameters, where the system matrices depend affinely on nonlinear functions of the scheduling variable, into Linear Fractional Transformation (LFT) systems. The transformation preserves input-output behavior and minimality. For input-output equivalent LPV systems, the resulting LFT representations are also input-output equivalent for all uncertainty blocks. This ensures consistency in system identification and control, as minimal and input-output equivalent LPV systems yield minimal and isomorphic LFT systems, maintaining consistent performance in controller synthesis.
|
|
11:00-11:15, Paper ThA17.7 | |
Sampled-Data Control of LPV Systems with Magnitude and Rate Saturating Actuators |
|
Oliveira, Lucas A. L. | CEFET-MG/Université De Reims Champagne-Ardenne |
Guelton, Kevin | Univ. De Reims Champagne-Ardenne |
Motchon, Koffi M. Djidula | Université De Reims Champagne Ardenne, CReSTIC EA 3804, 51097 Re |
Leite, Valter J. S. | Centro Federal De Educação Tecnológica De Minas Gerais |
Keywords: Sampled-data control, Linear parameter-varying systems, Constrained control
Abstract: This paper presents the parameter-dependent aperiodic sampled-data state feedback controller design for linear parameter varying (LPV) systems with actuators subject to magnitude and rate saturation, using Linear Matrix Inequalities (LMIs). The proposed method integrates the looped-functional approach and a parameter-dependent generalized sector condition. The local stabilization is verified through a new definite negativeness lemma for second-order matrix polynomials. The proposed conditions can be simplified to recover a robust controller design whenever the time-varying parameter is unavailable. Two numerical examples demonstrate the effectiveness of the proposed method, highlighting less conservative stability conditions compared to existing approaches.
|
|
11:15-11:30, Paper ThA17.8 | |
Fault Hiding of Nonlinear Parameter Varying Systems |
|
Bessa, Iury | Federal University of Amazonas |
Peixoto, Marcia Luciana da Costa | Université Polytechnique Hauts-De-France |
Coutinho, Pedro Henrique Silva | State University of Rio De Janeiro |
Puig, Vicenc | UPC |
Palhares, Reinaldo Martinez | Federal University of Minas Gerais |
Keywords: Fault tolerant systems, LMIs, Linear parameter-varying systems
Abstract: This paper addresses the problem of fault-tolerant control of nonlinear parameter-varying (N-LPV) systems. Specifically, it focuses on N-LPV models that effectively represent nonlinear systems by partially embedding nonlinearities while preserving sector-bounded terms. This representation simplifies control design by reducing the number of polytope vertices compared to traditional linear parameter-varying (LPV) approaches. The main contribution of this paper is to present novel fault-hiding approaches for fault-tolerant control of N-LPV models. The proposed approach can recover the stability of N-LPV systems under additive and multiplicative sensor and actuator faults by using a generic static reconfiguration block structure.
|
|
ThA18 |
Aruba I+II+III |
Linear Systems IV |
Regular Session |
Chair: Chen, Wei | Peking University |
Co-Chair: Steur, Erik | Eindhoven University of Technology |
|
09:30-09:45, Paper ThA18.1 | |
Sectored Real Lemma and Its Integration with Bounded Real Lemma |
|
Yang, Xiaokan | Peking University |
Zhang, Ding | The Hong Kong University of Science and Technology |
Chen, Wei | Peking University |
Hara, Shinji | Tokyo Institute of Technology |
Qiu, Li | Hong Kong Univ. of Sci. & Tech |
Keywords: Linear systems, LMIs
Abstract: In this paper, we study the state-space characterization of multi-input multi-output linear time-invariant systems. A sectored real lemma is developed for phase-bounded systems, serving as a counterpart of bounded real lemma and an extension of positive real lemma. Moreover, we propose a mixed bounded/sectored real lemma that integrates the gain and phase information, thereby enhancing its applicability to practical systems. All results are formulated in a novel phase-related terminology, which is equivalent to the LMI statement but provides a distinct conceptual perspective on phase.
|
|
09:45-10:00, Paper ThA18.2 | |
Remote State Estimation with Discounted Multi-Armed Bandits for Non-Stationary Channel Selection |
|
Zhang, Jiuzhou | Hong Kong University of Science and Technology |
Huo, Wei | HKUST |
Chen, Xiaomeng | Hong Kong University of Science and Technology |
Quevedo, Daniel E. | The University of Sydney |
Shi, Ling | Hong Kong University of Science and Technology |
Keywords: Linear systems, Estimation, Kalman filtering
Abstract: This paper addresses the problem of optimal channel selection for remote state estimation in cyber-physical systems, where a sensor transmits measurements over multiple time-varying wireless channels. We model the packet arrival probability of each channel as a non-stationary Bernoulli process and propose two discounted Multi-Armed Bandit (MAB) algorithms-Discounted Upper Confidence Bound (D-UCB) and Discounted Thompson Sampling (D-TS) to select channels with the highest expected packet arrival rates adaptively. The estimation error covariance is analyzed using Kalman filtering, and the cumulative estimation regret is defined as the excess trace of the estimation error covariance compared to an optimal policy. Theoretical analysis shows the algorithms achieve a regret that grows gradually over time, and numerical simulations validate its effectiveness under non-stationary conditions.
|
|
10:00-10:15, Paper ThA18.3 | |
On Sample-Based Functional Observability of Linear Systems |
|
Krauss, Isabelle | Leibniz University Hannover |
Lopez, Victor G. | Leibniz University Hannover |
Müller, Matthias A. | Leibniz University Hannover |
Keywords: Linear systems, Observers for Linear systems, Estimation
Abstract: Sample-based observability characterizes the ability to reconstruct the internal state of a dynamical system by using limited output information, i.e., when measurements are only infrequently and/or irregularly available. In this work, we investigate the concept of functional observability, which refers to the ability to infer a function of the system state from the outputs, within a sample-based framework. Here, we give necessary and sufficient conditions for a system to be sample-based functionally observable, and formulate conditions on the sampling schemes such that these are satisfied. Furthermore, we provide a numerical example, where we demonstrate the applicability of the obtained results.
|
|
10:15-10:30, Paper ThA18.4 | |
Distributed State Estimation for Discrete-Time Observable Linear Time-Invariant Systems with Unknown Exogenous Inputs |
|
Torchiaro, Franco Angelo | University of Calabria |
Gagliardi, Gianfranco | Università Degli Studi Della Calabria |
Tedesco, Francesco | Università Della Calabria |
Casavola, Alessandro | Universita' Della Calabria |
Keywords: Observers for Linear systems, Sensor networks, LMIs
Abstract: This paper presents a Distributed Unknown Input Observer (D-UIO) design methodology based on node-wise observability decomposition to estimate the state of discrete-time linear time-invariant (LTI) systems affected by measurement noise and unknown inputs. The framework models sensors as integral components of logical units, which together constitute the nodes of a distributed observer network. Each node of the network has only partial access to system information—restricted to local measurements—and can exchange data solely with its direct neighbors. The observer design challenge is therefore split into two complementary tasks: (i) determining local output injection gains to intelligently use locally available data, and (ii) defining diffusive coupling gains that, through a consensus mechanism, compensate for incomplete information. The proposed synthesis is cast as a set of linear matrix inequalities (LMI) conditions, which can be efficiently solved via semidefinite programming, yielding a tractable design procedure. The effectiveness of the distributed observer is finally demonstrated through a numerical simulation example.
|
|
10:30-10:45, Paper ThA18.5 | |
Distributed Reduced-Order Observers for Networked LTI Systems: A Fully Decentralized Design Approach with Guaranteed Performance |
|
Li, Yaodong | Eindhoven University of Technology |
Michiels, Wim | KU Leuven |
Van De Wouw, Nathan | Eindhoven University of Technology |
Steur, Erik | Eindhoven University of Technology |
Keywords: Observers for Linear systems, Sensor networks, Distributed control
Abstract: Distributed estimation is the collaborative asymptotic estimation of the state of (usually large-scale) linear time-invariant (LTI) systems by multiple observers (or sensors). Existing distributed observer designs are scalable and provide a guaranteed rate of convergence, but often these designs require global network information, restricting flexibility. This paper proposes a fully decentralized, reduced-order distributed observer that leverages a local observability decomposition. As usual, a Luenberger observer is designed for observable states, but instead of adopting consensus dynamics, we determine the unobservable states using an algebraic equation. This approach is plug-and-play, reduces computational complexity, allows tunable convergence rates, and is implementable without knowing the global network topology. Numerical simulations validate its efficiency. We also discuss an extension of our approach for distributed estimation in a finite, prescribed time.
|
|
10:45-11:00, Paper ThA18.6 | |
Learning a Mixture of Experts Approximation of a Model Predictive Controller with Guarantees |
|
Ahrazoglu, Mehmet Akif | University of Michigan, Ann-Arbor |
Keywords: Predictive control for linear systems, Linear systems
Abstract: In this paper, we investigate the integration of a Mixture of Experts (MoE) architecture into Model Predictive Control (MPC) frameworks. The proposed approach enhances the real-time applicability and computational efficiency of MPC while guaranteeing closed-loop stability and constraint satisfaction. The MoE architecture provides a flexible, modular strategy for representing complex control policies by partitioning the input space into regions, each managed by a specialized expert model coordinated via a gating network. We evaluate the architecture's effectiveness in alleviating the computational burden of traditional MPC and leveraging its universal function approximation capabilities. This involves developing a learning-based control policy through approximations that characterize MPC behavior with stability guarantees. The practicality of the approach is demonstrated via simulations, highlighting its potential for robust, real-time control in diverse dynamic systems. By presenting both theoretical foundations and practical implementations, this paper advances control strategies that adaptively handle computational constraints while maintaining high performance in safety-critical applications.
|
|
11:00-11:15, Paper ThA18.7 | |
Symmetric Kullback Leibler Divergence Based Robust Sensor Placement Design for Linear Dynamical System Subject to Bounded Uncertainties |
|
Kumar, Brijesh | Indian Institute of Technology, Bombay |
Patel, Garima | Indian Institute of Technology Bombay |
Bhushan, Mani | Indian Instiute of Technology Bombay |
Keywords: Sensor networks, Kalman filtering, Linear systems
Abstract: In this work, we propose a Symmetric Kullback Leibler Divergence (SKLD) based approach for Optimal Sensor Placement Design (OSPD) for linear dynamical systems subjected to the presence of bounded modelling uncertainty. Use of SKLD as an optimality criterion over conventional alphabetical optimality criteria facilitates incorporation of the end-user specified target performance of the estimates in the problem formulation. The proposed SKLD based SPD formulation is a Robust Sensor Placement Design (R-SPD) that guarantees robustness by accounting for uncertainties in process dynamics. This is achieved by choosing sensors which minimize the worst case SKLD value. Thus, the resulting SKLD value provides an upper bound on SKLD for all admissible uncertainties. The proposed sensor placement design formulation is a Mixed Integer Non-Linear Programming (MINLP) problem and is computationally intractable. In this work, we also provide a computationally tractable reformulation of MINLP problem to a Mixed Integer Semidefinite Programming (MISDP) formulation. Utility of the approach is demonstrated on Tennessee Eastman challenge problem.
|
|
11:15-11:30, Paper ThA18.8 | |
First and Second Order Optimal mathcal{H}_2 Model Reduction for Linear Continuous-Time Systems |
|
Zhu, Wenshan | Imperial College London |
Jaimoukha, Imad M. | Imperial College London |
Keywords: Model/Controller reduction, Reduced order modeling, LMIs
Abstract: In this paper, we investigate the optimal mathcal{H}_2 model reduction problem for single-input single-output (SISO) continuous-time linear time-invariant (LTI) systems. A semi-definite relaxation (SDR) approach is proposed to determine globally optimal interpolation points, providing an effective way to compute the reduced-order models via Krylov projection-based methods. In contrast to iterative approaches, we use the controllability Gramian and the moment-matching conditions to recast the model reduction problem as a convex optimization by introducing an upper bound gamma to minimize the mathcal{H}_2 norm of the model reduction error system. We also prove that the relaxation is exact for first order reduced models and demonstrate, through examples, that it is exact for second order reduced models. We compare the performance of our proposed method with other iterative approaches and shift-selection methods on examples. Importantly, our approach also provides a means to verify the global optimality of known locally convergent methods.
|
|
ThA19 |
Ibiza IV |
Optimal Control IV |
Regular Session |
Chair: Aronna, María Soledad | Fundação Getulio Vargas |
Co-Chair: Arcak, Murat | University of California, Berkeley |
|
09:30-09:45, Paper ThA19.1 | |
Time-Optimal Control for High-Order Chain-Of-Integrators Systems with Full State Constraints and Arbitrary Terminal States |
|
Wang, Yunan | Tsinghua University |
Hu, Chuxiong | Tsinghua University |
Li, Zeyang | Massachusetts Institute of Technology |
Lin, Shize | Tsinghua University |
He, Suqin | Tsinghua University |
Zhu, Yu | Tsinghua University |
Keywords: Optimal control, Linear systems, Variational methods
Abstract: Time-optimal control for high-order chain-of-integrator systems with full state constraints and arbitrarily given terminal states remains a challenging problem in the optimal control theory domain, yet to be resolved. To enhance further comprehension of the problem, a novel notation system and theoretical framework is established, providing the switching manifold for high-order problems in the form of augmented switching laws (ASL). Guided by the ASL theory, a trajectory planning method named the manifold-intercept method (MIM) is developed. MIM can plan near-optimal non-chattering higher-order trajectories with full state constraints, achieving strict time-optimality for problems of order n≤3. Experiments indicate that MIM outperforms baselines regarding computational time, computational accuracy, and trajectory quality by a large gap.
|
|
09:45-10:00, Paper ThA19.2 | |
Value of Information-Based Deceptive Path Planning under Adversarial Interventions |
|
Suttle, Wesley | U.S. Army Research Laboratory |
Milzman, Jesse | DEVCOM Army Research Laboratory |
Karabag, Mustafa O. | The University of Texas at Austin |
Sadler, Brian | Army Research Laboratory |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Optimal control, Markov processes, Optimization
Abstract: Existing methods for deceptive path planning (DPP) address the problem of designing paths that conceal their true goal from a passive, external observer. Such methods do not apply to problems where the observer has the ability to perform adversarial interventions to impede the path planning agent. In this paper, we propose a novel Markov decision process (MDP)-based model for the DPP problem under adversarial interventions and develop new value of information (VoI) objectives to guide the design of DPP policies. Using the VoI objectives we propose, path planning agents deceive the adversarial observer into choosing suboptimal interventions by selecting trajectories that are of low informational value to the observer. Leveraging connections to the linear programming theory for MDPs, we derive computationally efficient solution methods for synthesizing policies for performing DPP under adversarial interventions. In our experiments, we illustrate the effectiveness of the proposed solution method in achieving deceptiveness under adversarial interventions and demonstrate the superior performance of our approach to both existing DPP methods and conservative path planning approaches on illustrative gridworld problems.
|
|
10:00-10:15, Paper ThA19.3 | |
L1-Optimal Controls for Driftless Affine Control Systems |
|
Cavaré, Pierre | University of Lorraine |
Jungers, Marc | CNRS - Université De Lorraine |
Loheac, Jerome | CNRS, Universite De Lorraine |
Keywords: Optimal control, Nonholonomic systems
Abstract: In this paper, we search for control of minimal L1-norm steering the solution of a non-linear driftless affine control system from an initial state to a prescribed target state in a prescribed time T>0. This study indicates that minimal L1-norm controls are not unique and in particular, among others, there always exist purely impulsive controls and controls in L1. As an outcome of this result, we can also say that there are controls of minimal L1-norm which are sparse in the sense that their support is of null Lebesgue measure. To tackle this problem, we use the graph-completion method introduced by Bressan and Rampazzo in 1988.
|
|
10:15-10:30, Paper ThA19.4 | |
On Exact Solutions to the Linear Bellman Equation |
|
Ohlin, David | Lund University |
Pates, Richard | Lund University |
Arcak, Murat | University of California, Berkeley |
Keywords: Optimal control, Markov processes, Linear systems
Abstract: This paper presents sufficient conditions for optimal control of systems with dynamics given by a linear operator, in order to obtain an explicit solution to the Bellman equation that can be calculated in a distributed fashion. Further, the class of Linearly Solvable MDP is reformulated as a continuous-state optimal control problem. It is shown that this class naturally satisfies the conditions for explicit solution of the Bellman equation, motivating the extension of previous results to semilinear dynamics to account for input nonlinearities. The applicability of the given conditions is illustrated in scenarios with linear and quadratic cost, corresponding to the Stochastic Shortest Path and Linear-Quadratic Regulator problems.
|
|
10:30-10:45, Paper ThA19.5 | |
Singular Arcs in Average Optimal Control-Affine Problems |
|
Aronna, María Soledad | Fundação Getulio Vargas |
de Lima Monteiro, Gabriel | Fundação Getulio Vargas |
Sierra Fonseca, Oscar | Fundação Getulio Vargas |
Keywords: Optimal control, Variational methods, Stochastic systems
Abstract: In this paper, we address optimal control problems in which the system parameters follow a probability distribution, and the optimization is based on average performance. These problems, known as Riemann-Stieltjes optimal control or optimal ensemble control problems, involve uncertainties that influence system dynamics. Focusing on control-affine systems, we apply the Pontryagin Maximum Principle to characterize singular arcs in a feedback form. To demonstrate the practical relevance of our approach, we show an example of a model for the sterile insect technique, which is a biological pest control method. Numerical simulations confirm the effectiveness of our framework in addressing control problems under uncertainty.
|
|
10:45-11:00, Paper ThA19.6 | |
Further Results on Exact Penalization for Linear Quadratic Optimal Control Problems |
|
Grimaldi, Riccardo Alessandro | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Optimal control, Linear systems, Optimization
Abstract: A systematic procedure for solving general linear quadratic optimal control problems with linear equality con- straints is presented. We exploit the theory of exact penalization, in the spirit of [1], to transform a general linear quadratic problem with a set of linear equality constraints on the state into an equivalent penalized unconstrained problem. Both the finite and the infinite horizon cases are discussed, and the results are given under minimal feasibility assumptions on the constrained problem, which are expressed in a coordinate free form. Two simple examples illustrate the theory.
|
|
11:00-11:15, Paper ThA19.7 | |
Bridging Continuous-Time LQR and Reinforcement Learning Via Gradient Flow of the Bellman Error |
|
Gießler, Armin | Karlsruhe Institute of Technology |
Malan, Albertus J. | Karlsruhe Institute of Technology |
Hohmann, Soeren | KIT |
Keywords: Optimal control, Reinforcement learning, Linear systems
Abstract: In this paper, we present a novel method for computing the optimal feedback gain of the infinite-horizon Linear Quadratic Regulator (LQR) problem via an ordinary differential equation. We introduce a novel continuous-time Bellman error, derived from the Hamilton-Jacobi-Bellman (HJB) equation, which quantifies the suboptimality of stabilizing policies and is parametrized in terms of the feedback gain. We analyze its properties, including its effective domain, smoothness, and coerciveness, and show the existence of a unique stationary point within the stability region. Furthermore, we derive a closed-form gradient expression of the Bellman error that induces a gradient flow. This converges to the optimal feedback and generates a unique trajectory that exclusively comprises stabilizing feedback policies. Additionally, this work advances interesting connections between LQR theory and Reinforcement Learning (RL) by redefining suboptimality of the Algebraic Riccati Equation (ARE) as a Bellman error, adapting a state-independent formulation, and leveraging Lyapunov equations to overcome the infinite-horizon challenge. We validate our method in a simulation and compare it to the state of the art.
|
|
11:15-11:30, Paper ThA19.8 | |
Pulse Control of Affine Systems with Applications to Quantum Control |
|
Beschastnyi, Ivan | INRIA |
Dell'Elce, Lamberto | Inria |
Pomet, Jean-Baptiste | INRIA |
Sacchelli, Ludovic | Inria |
Tinoco, David | INRIA |
Keywords: Optimal control, Quantum information and control, Nonlinear systems
Abstract: Motivated by problems in quantum control, we introduce a novel method for studying time-optimal control problems of affine systems with unbounded controls. Intuitively, we construct a relaxation of our problem as follows: for each control of finite amplitude, we appropriately rescale the time variable so that, in the limit, an instantaneous jump in the state of the system becomes a trajectory containing both the state before and after the jump. The main advantage of this method is the direct applicability of the classical Pontryagin maximum principle. Several examples, including the generation of noiseless and noisy qubit gates, as well as population transfer in open two-level systems, are discussed.
|
|
ThA20 |
Asia I+II+III+IV |
Sampling-Based Methods for Optimal Control: Theory, Algorithms, and
Applications |
Tutorial Session |
Chair: Qu, Guannan | Carnegie Mellon University |
Co-Chair: Yi, Zeji | Georgia Institute of Technology |
Organizer: Qu, Guannan | Carnegie Mellon University |
Organizer: Shi, Guanya | Carnegie Mellon University |
Organizer: Yi, Zeji | Georgia Institute of Technology |
Organizer: Pan, Chaoyi | Carnegie Mellon University |
|
09:30-11:30, Paper ThA20.1 | |
Sampling-Based Methods for Optimal Control: Theory, Algorithms, and Applications (I) |
|
Pan, Chaoyi | Carnegie Mellon University |
Yi, Zeji | Carnegie Mellon University |
Shi, Guanya | Carnegie Mellon University |
Qu, Guannan | Carnegie Mellon University |
Keywords: Optimization, Optimal control, Robotics
Abstract: This tutorial will address the challenges of optimal control for complex robotic systems, particularly those with rich contact dynamics, where traditional methods struggle. While Sampling-Based Optimal Control (SBOC) techniques like Model Predictive Path Integral Control (MPPI) offer empirical success due to their handling of nonlinearities and parallelizability, they lack theoretical guarantees regarding convergence and stability. This tutorial will summarize recent results that advance the theoretical understanding and algorithmic capabilities of SBOC by analyzing convergence, leading to new algorithms named CoVo-MPC [50] with optimal covariance design. Furthermore, by establishing and exploiting a novel connection to generative diffusion models, this work introduces approaches like MBD for trajectory optimization and DIAL-MPC, an online framework with diffusion-inspired annealing. The efficacy of these theoretical insights and algorithms is validated through simulations and experiments, ultimately providing a stronger theoretical foundation for SBOC, yielding more reliable and efficient algorithms for complex robotics, and bridging optimal control with generative modeling.
|
|
ThB01 |
Galapagos I |
Analysis and Control of Complex Systems in the Social and Life Sciences |
Invited Session |
Chair: Zino, Lorenzo | Politecnico Di Torino |
Co-Chair: Bizyaeva, Anastasia | Cornell University |
Organizer: Ye, Mengbin | Curtin University |
Organizer: Zino, Lorenzo | Politecnico Di Torino |
Organizer: Cao, Ming | University of Groningen |
Organizer: Leonard, Naomi Ehrich | Princeton University |
|
14:00-14:15, Paper ThB01.1 | |
Adaptive-Gain Control for Equilibrium Selection in the Logit Dynamics (I) |
|
Gavin, Rory | FSE, Rijksuniversiteit Groningen |
Paarporn, Keith | University of Colorado, Colorado Springs |
Ye, Mengbin | Curtin University |
Zino, Lorenzo | Politecnico Di Torino |
Cao, Ming | University of Groningen |
Keywords: Adaptive control, Nonlinear systems, Game theory
Abstract: We study the problem of controlling evolutionary game-theoretic dynamics when agents follow sophisticated learning rules, in particular the logit protocol. Much previous work focused on settings where agents are less sophisticated learners following imitative protocols that leads to the well-known replicator dynamic. Here, we consider adaptive control schemes for the logit dynamics with the objective of steering the population to a desired equilibrium by modifying the agents' payoff functions in a 2-action coordination game. Through the analysis of the controlled dynamics, we establish sufficient conditions for global convergence to the desired equilibrium. We find that the conditions to control the logit system have fewer requirements than those to control the replicator equation: Adaptive-gain controllers that are successful in performing their task in the logit system may fail in the replicator system. We then provide numerical simulations to illustrate and compare the amount of control effort needed to achieve the objective in the logit system versus the replicator system.
|
|
14:15-14:30, Paper ThB01.2 | |
Wisdom of Crowds in Signed Opinion Dynamics Models (I) |
|
Razaq, Muhammad Ahsan | Linkoping University |
Altafini, Claudio | Linkoping University |
|
|
14:30-14:45, Paper ThB01.3 | |
Mixed-Feedback Oscillations in the Foraging Dynamics of Arboreal Turtle Ants (I) |
|
Valentine, Alia | Cornell University |
Godron, Deborah | Stanford University |
Bizyaeva, Anastasia | Cornell University |
Keywords: Adaptive systems, Biological systems, Nonlinear systems
Abstract: We propose and analyze a model for the dynamics of the flow into and out of a nest for the arboreal turtle ant Cephalotes goniodontus during foraging to investigate a possible mechanism for the emergence of oscillations. In our model, there is mixed dynamic feedback between the flow of ants between different behavioral compartments and the amount of pheromone along trails. On one hand, the ants deposit pheromone along the trail, which provides a positive feedback by concentrating the flow of ants along specific trail segments and further increasing the rate of deposition. On the other hand, pheromone evaporation is a source of negative feedback, as it depletes the pheromone and inhibits the return rate to the nest. We prove that the model is globally asymptotically stable in the absence of pheromone feedback. Then we show that pheromone feedback can lead to a loss of stability of the equilibrium and onset of sustained oscillations in the flow in and out of the nest via a Hopf bifurcation. This analysis sheds light on a potential key mechanism that enables arboreal turtle ants to effectively change their trail networks to minimize traveled path lengths and eliminate cycles.
|
|
14:45-15:00, Paper ThB01.4 | |
A Parsimonious Opinion Dynamics Model Based on Multi-Objective Game Explaining the Emergence of Pluralistic Ignorance (I) |
|
Luo, Yuheng | Peking University |
Zhang, Chuanzhe | Peking University |
Feng, Yilong | Peking University |
Liu, Qingsong | Wuhan University of Science and Technology |
Mei, Wenjun | Peking University |
Keywords: Network analysis and control, Agents-based systems, Game theory
Abstract: Opinion dynamics studies how interpersonal influence and social network structures shape the evolution of public opinions. Recently, various models of opinion dynamics have been proposed within a game-theoretic framework, where interpersonal influence mechanisms are captured by players’ cost functions, reflecting their motivations. Conventionally, when players have multiple motivations, an aggregated cost function is constructed by summing individual cost functions corresponding to different motivations. However, whether these “costs” in people’s minds are interchangeable remains a subject of debate. In this paper, we propose an opinion dynamics model based on a multi-objective game framework. In our model, individuals experience two distinct costs: social pressure from disagreeing with others and cognitive dissonance from deviating from the truth. Opinion updates are modeled as Pareto improvements between these two cost functions. This approach provides a parsimonious explanation for the emergence of pluralistic ignorance—where individuals may “agree” on something untrue, even though they all know the underlying truth. We conduct a theoretical analysis of the proposed model, establishing conditions for the almost-sure convergence and for the prevalence of truth.
|
|
15:00-15:15, Paper ThB01.5 | |
A Quantum-Compliant Formulation for Network Epidemic Control |
|
Zino, Lorenzo | Politecnico Di Torino |
Boggio, Mattia | Politecnico Di Torino |
Volpe, Deborah | National Institute of Geophysics and Vulcanology |
Orlandi, Giacomo | Politecnico Di Torino |
Turvani, Giovanna | Politecnico Di Torino |
Novara, Carlo | Politecnico Di Torino |
Keywords: Control of networks, Optimization, Control applications
Abstract: We deal with controlling the spread of an epidemic disease on a network by isolating one or multiple locations by banning people from leaving them. To this aim, we build on the susceptible-infected-susceptible and the susceptible-infected-removed discrete-time network models, encapsulating a control action that captures mobility bans via removing links from the network. Then, we formulate the problem of optimally devising a control policy based on mobility bans that trades-off the burden on the healthcare system and the social and economic costs associated with interventions. The binary nature of mobility bans hampers the possibility to solve the control problem with standard optimization methods, yielding a NP-hard problem. Here, this is tackled by deriving a Quadratic Unconstrained Binary Optimization (QUBO) formulation of the control problem, and leveraging the growing potentialities of quantum computing to efficiently solve it.
|
|
15:15-15:30, Paper ThB01.6 | |
Graph and Hypergraph Topologies in Decentralized Coalition Consensus Control for Financial and Economic Networks |
|
Papastaikoudis, Ioannis | University of Cambridge |
Watson, Jeremy | University of Canterbury |
Lestas, Ioannis | University of Cambridge |
Keywords: Network analysis and control, Decentralized control
Abstract: This work explores network coalition-based models using dynamic average consensus protocols, where agents in coalitions interact to reach global agreement. We employ hypergraphs to model communication structures and compare their convergence rates with clique expansion graphs. Our results show that hypergraph-based models achieve faster convergence for the case of continuous consensus dynamical systems and also in discrete time for coalitions with an equal number of agents. Our findings suggest that hypergraphs offer a scalable, decentralized approach to improving consensus algorithms in generalized tree like information structures, with significant potential for enhancing performance in applications like finance and economics.
|
|
15:30-15:45, Paper ThB01.7 | |
Bifurcation Analysis of an Opinion Dynamics Model Coupled with an Environmental Dynamics |
|
Couthures, Anthony | University of Lorraine |
Bizyaeva, Anastasia | Cornell University |
Satheeskumar Varma, Vineeth | CNRS |
Franci, Alessio | University of Liege |
Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Keywords: Autonomous systems, Control applications, Network analysis and control
Abstract: We consider an opinion dynamics model coupled with an environmental dynamics. Based on a forward invariance argument, we can simplify the analysis of the asymptotic behavior to the case when all the opinions in the social network are synchronized. Our goal is to emphasize the role of the trust given to the environmental signal in the asymptotic behavior of the opinion dynamics and implicitly of the coupled system. To do that, we conduct a bifurcation analysis of the system around the origin when the trust parameter is varying. Specific conditions are presented for both pitchfork and Hopf bifurcation. Numerical illustration completes the theoretical findings.
|
|
15:45-16:00, Paper ThB01.8 | |
Containment Control Approach for Steering Opinion in a Social Network |
|
Rastgoftar, Hossein | University of Arizona |
Keywords: Control of networks, Control over communications, Networked control systems
Abstract: The paper studies the problem of steering multidimensional opinion in a social network. Assuming the society of desire consists of stubborn and regular agents, stubborn agents are considered as leaders who specify the desired opinion distribution as a distributed reward or utility function. In this context, each regular agent is seen as a follower, updating its bias on the initial opinion and influence weights by averaging their observations of the rewards their influencers have received. Assuming random graphs with reducible and irreducible topology specify the influences on regular agents, opinion evolution is represented as a containment control problem in which stability and convergence to the final opinion are proven.
|
|
ThB02 |
Oceania II |
Safe, Secure and Learning-Based Control I |
Invited Session |
Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Co-Chair: Panagou, Dimitra | University of Michigan, Ann Arbor |
Organizer: Doan, Thinh T. | University of Texas at Austin |
Organizer: Jha, Mayank Shekhar | University of Lorraine |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
|
14:00-14:15, Paper ThB02.1 | |
Robot Learning Optimal Control Via an Adaptive Critic Reservoir (I) |
|
Chen, Anthony Siming | University of Nottingham |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Optimal control, Adaptive control, Robotics
Abstract: In this paper we develop a reservoir-based adaptive critic framework for optimal real-time control of nonlinear robotic systems. Unlike traditional neural network (NN) based critics, deterministic reservoir computing significantly reduces computational complexity by requiring only the training of output weights. Using deterministic reservoir computing, the method efficiently approximates the value function and computes optimal control laws, achieving rapid error convergence and computational efficiency. In addition, a momentum-enhanced adaptation law is proposed to accelerate convergence rates. The framework also contributes to robot learning by enabling adaptive behavior through iterative policy improvement, allowing robots to autonomously refine their control strategies by self-learning. It is theoretically validated under the persistency of excitation condition and demonstrated through simulations of trajectory tracking compared against traditional NN parameterization.
|
|
14:15-14:30, Paper ThB02.2 | |
Neural Ordinary Differential Equations Based System Identification for Reinforcement Learning with Provable Guarantees (I) |
|
Rutschke, Théo | Université De Lorraine, CNR |
Jha, Mayank Shekhar | University of Lorraine |
Garnier, Hugues | University of Lorraine |
Keywords: Identification for control, Reinforcement learning, Nonlinear systems identification
Abstract: This paper investigates nonlinear identification for control learning with provable guarantees. A novel approach is proposed for Model-Based Reinforcement Learning (MBRL) where Neural Ordinary Differential Equation (NODE) based nonlinear system identification in continuous time is integrated within Policy Iteration (PI) based Reinforcement Learning. To that end, first, a continuous-time NODE model is identified from measured data, which is then leveraged to learn an optimal controller using off-policy PI. Rigorous proofs are developed to guarantee boundedness of parameter as well as prediction errors. The identified NODE model enables admissible initialization of the PI algorithm through a Quadratic Program (QP) under NODE-based Control Lyapunov Function (CLF) constraints, leading to guaranteed admissibility and stability at the initialization phase of control learning. During the exploration phase, closed-loop stability is maintained by enforcing NODE-based Input-to-State Stable CLF (ISS-CLF) constraints. The resulting controller achieves closed-loop stability and optimality with respect to the identified model, providing guarantees throughout the MBRL process. Simulation results assess the effectiveness of the proposed approach.
|
|
14:30-14:45, Paper ThB02.3 | |
Distributed Reconstruction of Sensor Cyber-Attacks in Cyber-Physical Networks (I) |
|
Bonagura, Valeria | Roma Tre University |
Kasis, Andreas | University of Cyprus |
Polycarpou, Marios M. | University of Cyprus |
Pascucci, Federica | University of Roma TRE |
Panzieri, Stefano | Univ. "Roma Tre" |
Keywords: Cyber-Physical Security, Large-scale systems, Fault detection
Abstract: Critical infrastructures, such as power and transportation, are frequently modeled as interconnected Cyber-Physical Systems (CPS). This interconnectivity makes CPS vulnerable to cyber-attacks, threatening system stability and operational continuity. Notably, many of these textcolor{black}{malicious attacks involve manipulating} sensor measurements. In this context, accurately reconstructing the attack signal is crucial for recovering true measurements, distinguishing cyber-attacks from natural faults, and enabling appropriate countermeasures. This work presents a distributed scheme that leverages sliding mode observers to reconstruct attack signals targeting sensors. We provide suitable bounds on the reconstruction error that depend on the disturbance characteristics of the system. Additionally, we present an isolation mechanism and analytically show that it accurately determines the attacked nodes. The applicability of the proposed approach is demonstrated through numerical simulations conducted on a six-subsystem interconnected system, which validate our analytic results.
|
|
14:45-15:00, Paper ThB02.4 | |
Sub-Optimality of the Separation Principle for Quadratic Control from Bilinear Observations (I) |
|
Sattar, Yahya | Cornell University |
Choi, Sunmook | Cornell University |
Jedra, Yassir | MIT |
Fazel, Maryam | University of Washington |
Dean, Sarah | Cornell |
Keywords: Stochastic optimal control, Nonlinear output feedback, Optimal control
Abstract: We consider the problem minimizing a quadratic cost of controlling linear dynamical systems from bilinear observations. Despite the similarity of this problem to the standard Linear-Quadratic-Gaussian (LQG) control, we show that when the observation model is bilinear, neither does the Separation Principle hold, nor is the optimal controller affine in the estimated state. Moreover, the problem of finding the optimal controller is non-convex. Hence, finding an analytical expression for the optimal feedback controller is difficult in general. Under certain settings, we show that the standard LQG controller locally maximizes the cost instead of minimizing, whereas, the optimal controllers (derived analytically) are nonlinear in the estimated state. We also introduce a notion of input dependent observability and derive conditions under which the Kalman filtering covariance remains bounded. We verify our theoretical results through extensive numerical experiments with synthetic data as well as data generated from a double integrator system.
|
|
15:00-15:15, Paper ThB02.5 | |
Distributed Resilience-Aware Control in Multi-Robot Networks (I) |
|
Lee, Haejoon | University of Michigan |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Resilient Control Systems, Networked control systems, Control of networks
Abstract: Ensuring resilient consensus in multi-robot systems with misbehaving agents remains a challenge, as many existing network resilience properties are inherently combinatorial and globally defined. While previous works have proposed control laws to enhance or preserve resilience in multi-robot networks, they often assume a fixed topology with known resilience properties, or require global state knowledge. These assumptions may be impractical in physically-constrained environments, where safety and resilience requirements are conflicting, or when misbehaving agents share inaccurate state information. In this work, we propose a distributed control law that enables each robot to guarantee resilient consensus and safety during its navigation without fixed topologies using only locally available information. To this end, we establish a sufficient condition for resilient consensus in time-varying networks based on the degree of non-misbehaving or normal agents. Using this condition, we design a Control Barrier Function (CBF)-based controller that guarantees resilient consensus and collision avoidance without requiring estimates of global state and/or control actions of all other robots. Finally, we validate our method through simulations.
|
|
15:15-15:30, Paper ThB02.6 | |
DR-PETS: Learning-Based Control with Planning in Adversarial Environments |
|
Jesawada, Hozefa Zuzer | University of Sannio |
Acernese, Antonio | Università Degli Studi Del Sannio |
Del Vecchio, Davide | Independent Researcher |
Russo, Giovanni | University of Salerno |
Del Vecchio, Carmen | Università Del Sannio |
Keywords: Robust control, Reinforcement learning, Uncertain systems
Abstract: The probabilistic ensembles with trajectory sampling (PETS) algorithm is a recognized baseline among model-based reinforcement learning (MBRL) methods. PETS incorporates planning and handles uncertainty using ensemble-based probabilistic models. However, no formal robustness guarantees against epistemic uncertainty exist for PETS. Providing such guarantees is a key enabler for reliable real-world deployment. To address this gap, we propose a distributionally robust extension of PETS, called DR-PETS. We formalize model uncertainty using a distributional ambiguity set and optimize the worst-case expected return. We derive a tractable convex approximation of the resulting min-max planning problem, which integrates seamlessly into PETS’s planning loop as a regularized objective. Experiments on pendulum and cart-pole environments show that DR-PETS certifies robustness against adversarial parameter perturbations, achieving consistent performance in worst-case scenarios where PETS deteriorates.
|
|
15:30-15:45, Paper ThB02.7 | |
Data-Driven Security Control for CPSs under Aperiodic DoS Attacks: A Switched System Approach |
|
Zhang, Ruifeng | Shandong University |
Yang, Rongni | Shandong University |
Zhu, Yanzheng | Shandong University of Science and Technology |
Shi, Peng | University of Adelaide |
Keywords: Data driven control, Switched systems
Abstract: This work studies a data-driven security control problem for unknown discrete-time cyber-physical systems (CPSs) under aperiodic denial-of-service (DoS) attacks based on the switched system approach. Particularly, the concerned DoS attacks are characterized by the aperiodicity with the constraints of minimum silent interval and maximum active interval. Compared with the existing data-driven methods, a novel data-driven security control approach via descriptor method and auxiliary matrices is developed for the considered CPSs with the advantage of lower computational complexity and less input-state data information. First, by utilizing the collected data and switching strategy, the considered CPSs under attacks are transformed into a direct data-driven parametrization of the corresponding switched systems. Next, a data-driven control approach via Willems' fundamental lemma is provided. Meanwhile, through introducing auxiliary matrices to construct the connection between data information and stability conditions, a different data-driven control approach via descriptor method is also presented for the considered unknown CPSs. Finally, the efficiency and advantage of the proposed method are validated by the comparative experiment of the two-wheeled robot.
|
|
15:45-16:00, Paper ThB02.8 | |
A Data-Driven Approach to Safe Control of Linear Systems |
|
Ghiasi, Niyousha | Michigan State University |
Kiumarsi, Bahare | Michigan State University |
Keywords: Data driven control, Uncertain systems, Optimization
Abstract: A data-driven approach is proposed, formulated as a quadratically constrained quadratic program, to guarantee the safety and stability of safety-critical control systems relying solely on input-output data. Control Lyapunov functions (CLFs) and control barrier functions (CBFs) are utilized to design control inputs that ensure both stability and safety without requiring explicit knowledge of the system model. To achieve this, an augmented system based on historical input-output measurements is constructed, enabling the data-driven formulation of CLFs and CBFs. The efficiency of the proposed method is evaluated in terms of computational complexity, and its performance is validated through a numerical case study with a mobile robot, underscoring its practical utility and potential for real-world implementation.
|
|
ThB03 |
Oceania III |
Safe Planning and Control with Uncertainty Quantification I |
Invited Session |
Chair: Vertovec, Nikolaus | University of Oxford |
Co-Chair: Lindemann, Lars | University of Southern California |
Organizer: Gao, Yulong | Imperial College London |
Organizer: Lindemann, Lars | ETH Zürich |
Organizer: Vertovec, Nikolaus | University of Oxford |
Organizer: Yu, Pian | University College London |
|
14:00-14:15, Paper ThB03.1 | |
Quadratic Truncated Random Return in Distributional LQR: Positive Definiteness, Density, and Log-Concavity (I) |
|
Teng, Ruyi | Imperial College London |
Wang, Dan | Nanjing University |
Chen, Wei | Peking University |
Gao, Yulong | Imperial College London |
Keywords: Linear systems, Uncertain systems, Stochastic optimal control
Abstract: Distributional linear quadratic regulator (LQR) is a new framework that integrates the distributional reinforcement learning and classical LQR, which offers a new way to study the random return instead of the expected cost. Unlike iterative approximation using dynamic programming in the DRL, a closed-form expression for the random return can be exactly characterized in the distributional LQR, which is defined over infinitely many random variables. In recent work, it has been shown that this random return can be well approximated by a finite number of random variables, which we call truncated random return. In this paper, we study the truncated random return in the distributional LQR. We show that the truncated random return can be naturally expressed in the quadratic form. We develop a sufficient condition for the positive definiteness of the block symmetric matrix in the quadratic form and provide the lower and upper bounds on the eigenvalues of this matrix. We further show that in this case, the truncated random return follows a positively weighted non-central chi-square distribution if the random disturbances is Gaussian, and its cumulative density distribution is log-concave if the probability density distribution of the random disturbances is log-concave.
|
|
14:15-14:30, Paper ThB03.2 | |
Integral Input-To-State Safe Barrier Functions (I) |
|
Lyu, Ziliang | Tongji University |
Fang, Xu | Dalian University of Technology |
Yuan, Heling | Nanyang Technological University |
Li, Xiuxian | Tongji University |
Hong, Yiguang | Tongji University |
Xie, Lihua | Nanyang Tech. Univ |
Keywords: Nonlinear systems, Stability of nonlinear systems, Lyapunov methods
Abstract: Understanding the effect of inputs on system safety is one of the most important issues in the study of safety-critical systems. Integral input-to-state safety (iISSf) is a concept that can describe the dependence of safety on the integral of external inputs. This paper studies the characterization of iISSf properties from a barrier function perspective. We introduce iISSf barrier functions (iISSf-BFs) as a tool to verify iISSf, and establish that the existence of an iISSf-BF is a sufficient condition for iISSf. With iISSf control barrier functions (iISSf-CBFs) and quadratic programs (QPs), we construct a safety-critical controller to enforce iISSf with respect to a prescribed iISSf gain. Finally, under an additional assumption of integral input-to-state stability, we show that iISSfs-BFs are also necessary for iISSf.
|
|
14:30-14:45, Paper ThB03.3 | |
Certified Approximate Reachability (CARe): Formal Error Bounds on Deep Learning of Reachable Sets (I) |
|
Solanki, Prashant | Delft University of Technology (TU Delft) |
Vertovec, Nikolaus | University of Oxford |
Schnitzer, Yannik | University of Oxford |
van Beers, Jasper | Delft University of Technology |
de Visser, Coen | Delft University of Technology |
Abate, Alessandro | University of Oxford |
Keywords: Neural networks, Optimal control, Formal Verification/Synthesis
Abstract: Recent approaches to leveraging deep learning for computing reachable sets of continuous-time dynamical systems have gained popularity over traditional level-set methods, as they overcome the curse of dimensionality. However, as with level-set methods, considerable care needs to be taken in limiting approximation errors, particularly since no guarantees are provided during training on the accuracy of the learned reachable set. To address this limitation, we introduce an epsilon-approximate Hamilton-Jacobi partial differential equation (HJ-PDE), which establishes a relationship between training loss and accuracy of the true reachable set. To formally certify this approximation, we leverage Satisfiability Modulo Theories (SMT) solvers to bound the residual error of the HJ-based loss function across the domain of interest. Leveraging Counter Example Guided Inductive Synthesis (CEGIS), we close the loop around learning and verification, by fine-tuning the neural network on counterexamples found by the SMT solver, thus improving the accuracy of the learned reachable set. To the best of our knowledge, Certified Approximate Reachability (CARe) is the first approach to provide soundness guarantees on learned reachable sets of continuous dynamical systems.
|
|
14:45-15:00, Paper ThB03.4 | |
Data-Driven Safety Verification Using Barrier Certificates and Matrix Zonotopes (I) |
|
Oumer, Mohammed Adib | University of Colorado Boulder |
Alanwar, Amr | Technical University of Munich |
Zamani, Majid | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis, Uncertain systems, Linear systems
Abstract: Ensuring safety in cyber-physical systems (CPSs) is a critical challenge, especially when system models are difficult to obtain or cannot be fully trusted due to uncertainty, modeling errors, or environmental disturbances. Traditional model-based approaches rely on precise system dynamics, which may not be available in real-world scenarios. To address this, we propose a data-driven safety verification framework that leverages matrix zonotopes and barrier certificates to verify system safety directly from noisy data. Instead of trusting a single unreliable model, we construct a set of models that capture all possible system dynamics that align with the observed data, ensuring that the true system model is always contained within this set. This model set is compactly represented using matrix zonotopes, enabling efficient computation and propagation of uncertainty. By integrating this representation into a barrier certificate framework, we establish rigorous safety guarantees without requiring an explicit system model. Numerical experiments demonstrate the effectiveness of our approach in verifying safety for dynamical systems with unknown models, showcasing its potential for real-world CPS applications.
|
|
15:00-15:15, Paper ThB03.5 | |
Probabilistic Alternating Simulations for Policy Synthesis in Uncertain Stochastic Dynamical Systems (I) |
|
Badings, Thom | University of Oxford |
Abate, Alessandro | University of Oxford |
Keywords: Stochastic systems, Markov processes, Uncertain systems
Abstract: A classical approach to formal policy synthesis in stochastic dynamical systems is to construct a finite-state abstraction, often represented as a Markov decision process (MDP). The correctness of these approaches hinges on a behavioural relation between the dynamical system and its abstraction, such as a probabilistic simulation relation. However, probabilistic simulation relations do not suffice when the system dynamics are, next to being stochastic, also subject to nondeterministic (i.e., set-valued) disturbances. In this work, we extend probabilistic simulation relations to systems with both stochastic and nondeterministic disturbances. Our relation, which is inspired by a notion of alternating simulation, generalises existing relations used for verification and policy synthesis used in several works. Intuitively, our relation allows reasoning probabilistically over stochastic uncertainty, while reasoning robustly (i.e., adversarially) over nondeterministic disturbances. We experimentally demonstrate the applicability of our relations for policy synthesis in a 4D-state Dubins vehicle.
|
|
15:15-15:30, Paper ThB03.6 | |
Data-Driven Reachability with Scenario Optimization and the Holdout Method (I) |
|
Dietrich, Elizabeth | University of California, Berkeley |
Devonport, Rosalyn Alice | University of New Mexico |
Tu, Stephen | University of California, Berkeley |
Arcak, Murat | University of California, Berkeley |
Keywords: Statistical learning, Formal Verification/Synthesis, Uncertain systems
Abstract: Reachability analysis is an important method in providing safety guarantees for systems with unknown or uncertain dynamics. Due to the computational intractability of exact reachability analysis for general nonlinear, high-dimensional systems, recent work has focused on the use of probabilistic methods for computing approximate reachable sets. In this work, we advocate for the use of a general purpose, practical, and sharp method for data-driven reachability: the holdout method. Despite the simplicity of the holdout method, we show---on several numerical examples including scenario-based reach tubes---that the resulting probabilistic bounds are substantially sharper and require fewer samples than existing methods for data-driven reachability. Furthermore, we complement our work with a discussion on the necessity of probabilistic reachability bounds. We argue that any method that attempts to de-randomize the bounds, by converting the guarantees to hold deterministically, requires (a) an exponential in state-dimension amount of samples to achieve non-vacuous guarantees, and (b) extra assumptions on the dynamics.
|
|
15:30-15:45, Paper ThB03.7 | |
Unraveling Tensor Structures in Correct-By-Design Controller Synthesis (I) |
|
Wang, Ruohan | Technische Universiteit Eindhoven |
Sun, Zhiyong | Peking University (PKU) |
Haesaert, Sofie | Eindhoven University of Technology |
Keywords: Formal Verification/Synthesis, Markov processes, Stochastic systems
Abstract: Formal safety guarantees on the synthesis of controllers for stochastic systems can be obtained using correct-by-design approaches. These approaches often use abstractions to finite-state Markov Decision Processes. As the state space of these MDPs grows, the curse of dimensionality makes the computational and memory cost of the probabilistic guarantees, quantified with dynamic programming, scale exponentially. In this work, we leverage decoupled dynamics and unravel, via dynamic programming operations, a tree structure in the Canonical Polyadic Decomposition (CPD) of the value functions. For discrete-time stochastic systems with syntactically co-safe linear temporal logic (scLTL) specifications, we provide provable probabilistic safety guarantees and significantly alleviate the computational burden. We provide an initial validation of the theoretical results on several typical case studies and showcase that the uncovered tree structure enables efficient reductions in the computational burden.
|
|
15:45-16:00, Paper ThB03.8 | |
Fair Control of Uncertain Dynamical Systems under LTL Specifications (I) |
|
Zhou, Can | Imperial College London |
Yu, Pian | University College London |
Parisini, Thomas | Imperial C., Aalborg U. & Univ. of Trieste |
Abate, Alessandro | University of Oxford |
Gao, Yulong | Imperial College London |
Keywords: Formal Verification/Synthesis, Stochastic systems, Autonomous systems
Abstract: Uncertainties are inevitable for control of dynamical systems of practical relevance. The conventional robust control methods assume persistent worst-case realisations of uncertainties, which leads to conservative solutions. To relax this assumption, we exploit the fairness notion in formal verification to qualitatively capture realistic epistemic disturbance behaviours. We study the fair control problem, that is, to maximise the probability of satisfying an Linear Temporal Logic subject to an action-based fairness constraint for uncertain dynamical systems. The action-based fairness of epistemic disturbances can be defined flexibly: by their observed behaviour and/or by their interaction with control input, enabling both input-dependent and independent disturbance modelling. We develop a sound and complete methodology to perform correct-by-design synthesis. We show that this optimal control problem is equivalent to a reachability problem in the fairness-realisable sub-product Markov decision process. We validate our results over a case study of room temperature regulation.
|
|
ThB04 |
Oceania IV |
Control Theory for Algorithm Analysis and Design |
Invited Session |
Chair: Martin, Andrea | KTH Royal Institute of Technology |
Co-Chair: Bastianello, Nicola | KTH Royal Institute of Technology |
Organizer: Furieri, Luca | University of Oxford |
Organizer: Martin, Andrea | KTH Royal Institute of Technology |
Organizer: Bastianello, Nicola | KTH Royal Institute of Technology |
Organizer: Carnevale, Guido | University of Bologna |
|
14:00-14:15, Paper ThB04.1 | |
The Fastest Known Globally Convergent First-Order Method for Minimizing Locally Quadratic Smooth Strongly Convex Functions |
|
Van Scoy, Bryan | Miami University |
Lessard, Laurent | Northeastern University |
Keywords: Optimization algorithms, Optimization, Robust control
Abstract: We consider iterative gradient-based optimization algorithms applied to functions that are smooth and strongly convex. The fastest globally convergent algorithm for this class of functions is the Triple Momentum (TM) method. We show that if the objective function is also twice continuously differentiable, a new, faster algorithm emerges, which we call C2-Momentum (C2M). We prove that C2M is globally convergent and that its worst-case convergence rate is strictly faster than that of TM, with no additional computational cost. We validate our theoretical findings with numerical examples, demonstrating that C2M outperforms TM when the objective function is twice continuously differentiable.
|
|
14:15-14:30, Paper ThB04.2 | |
Modular Distributed Nonconvex Learning with Error Feedback |
|
Carnevale, Guido | University of Bologna |
Bastianello, Nicola | KTH Royal Institute of Technology |
Keywords: Optimization algorithms, Stochastic systems, Network analysis and control
Abstract: In this paper, we design a novel distributed learning algorithm using stochastic compressed communications. In detail, we pursue a modular approach, merging ADMM and a gradient-based approach, benefiting from the robustness of the former and the computational efficiency of the latter. Additionally, we integrate a stochastic integral action (error feedback) enabling almost sure rejection of the compression error. We analyze the resulting method in nonconvex scenarios and guarantee almost sure asymptotic convergence to the set of stationary points of the problem. This result is obtained using system-theoretic tools based on stochastic timescale separation. We corroborate our findings with numerical simulations in nonconvex classification.
|
|
14:30-14:45, Paper ThB04.3 | |
The Discrete-Time Internal Model Principle of Time-Varying Optimization (I) |
|
Bianchin, Gianluca | University of Louvain |
Van Scoy, Bryan | Miami University |
Keywords: Optimization algorithms, Output regulation
Abstract: Time-varying optimization problems arise in a variety of engineering applications. The available information about how the problem changes in time dictates the types of algorithms that are applicable to a particular problem as well as the types of convergence guarantees that may be proven. In this paper, we explore the fundamental properties shared by the entire class of gradient-based optimization algorithms for time-varying optimization. By casting the design of such algorithms as an output regulation problem for dynamical systems, we provide necessary and sufficient conditions for the existence of an algorithm that asymptotically tracks an optimizer of the problem of interest. When these conditions hold, we provide a design procedure to construct such an algorithm. As a fundamental limitation, we show that any algorithm that achieves exact tracking needs to incorporate an internal model of the temporal variation, which we refer to as the internal model principle of time-varying optimization.
|
|
14:45-15:00, Paper ThB04.4 | |
Semidefinite Programming Duality in Infinite-Horizon Linear Quadratic Differential Games (I) |
|
Watanabe, Yuto | University of California, San Diego |
Pai, Chih-Fan Rich | University of California San Diego |
Zheng, Yang | University of California San Diego |
Keywords: LMIs, Optimal control, Game theory
Abstract: Semidefinite programs (SDPs) play a crucial role in control theory, traditionally as a computational tool. Beyond computation, the duality theory in convex optimization also provides valuable analytical insights and new proofs of classical results in control. In this work, we extend this analytical use of SDPs to study the infinite-horizon linear-quadratic (LQ) differential game in continuous time. Under standard assumptions, we introduce a new SDP-based primal-dual approach to establish the saddle point characterized by linear static policies in LQ games. For this, we leverage the Gramian representation technique, which elegantly transforms linear quadratic control problems into tractable convex programs. We also extend this duality-based proof to the H∞ suboptimal control problem. To our knowledge, this work provides the first primal-dual analysis using Gramian representations for the LQ game and H∞ control beyond LQ optimal control and Hinf analysis.
|
|
15:00-15:15, Paper ThB04.5 | |
Robust Feedback Optimization with Model Uncertainty: A Regularization Approach (I) |
|
Chan, Winnie | ETH Zurich |
He, Zhiyu | ETH Zurich |
Moffat, Keith | ETH Zurich |
Bolognani, Saverio | ETH Zurich |
Muehlebach, Michael | Max Planck Institute for Intelligent Systems |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Optimization algorithms, Optimization, Power systems
Abstract: Feedback optimization optimizes the steady state of a dynamical system by implementing optimization iterations in closed loop with the plant. It relies on online measurements and limited model information, namely, the input-output sensitivity. In practice, various issues, including inaccurate modeling, lack of observation, or changing conditions, can lead to sensitivity mismatches, causing closed-loop sub-optimality or even instability. To handle such uncertainties, we pursue robust feedback optimization, where we optimize the closed-loop performance against all possible sensitivities lying in specific uncertainty sets. We provide tractable reformulations for the corresponding min-max problems via regularizations and characterize the online closed-loop performance through the tracking error in case of time-varying optimal solutions. Simulations on a distribution grid illustrate the effectiveness of our robust feedback optimization controller in addressing sensitivity mismatches in a non-stationary environment.
|
|
15:15-15:30, Paper ThB04.6 | |
Automated Algorithm Design for Convex Optimization Problems with Linear Equality Constraints (I) |
|
Ozaslan, Ibrahim Kurban | University of Southern California |
Wu, Wuwei | City University of Hong Kong |
Chen, Jie | City University of Hong Kong |
Georgiou, Tryphon T. | University of California, Irvine |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Optimization algorithms, Optimization, Robust control
Abstract: Synthesis of optimization algorithms typically follows a design-then-analyze approach, which can obscure fundamental performance limits and hinder the systematic development of algorithms that operate near these limits. Recently, a framework grounded in robust control theory has emerged as a powerful tool for automating algorithm synthesis. By integrating design and analysis stages, fundamental performance bounds are revealed and synthesis of algorithms that achieve them is enabled. In this paper, we apply this framework to design algorithms for solving strongly convex optimization problems with linear equality constraints. Our approach yields a single-loop, gradient-based algorithm whose convergence rate is independent of the condition number of the constraint matrix. This improves upon the best known rate within the same algorithm class, which depends on the product of the condition numbers of the objective function and the constraint matrix.
|
|
15:30-15:45, Paper ThB04.7 | |
Passivity-Based Interpretation of the Tracking-ADMM Algorithm for Distributed Constraint-Coupled Optimization |
|
Notarnicola, Ivano | University of Bologna |
Falsone, Alessandro | Politecnico Di Milano |
Keywords: Optimization algorithms, Distributed control, Optimization
Abstract: We propose a system-theoretic perspective on Tracking-ADMM, a recently introduced distributed optimization algorithm for constraint-coupled optimization problems that combines ADMM and tracking over networks. Through a judicious coordinate transformation, we formulate the algorithm as a dynamical linear component in closed-loop with a static nonlinearity resulting from an optimization step. We show, by direct calculation, that the linear component is discrete-positive real, while the static nonlinearity is monotone. As both components are passive systems, their feedback interconnection is passive as well, thus ensuring the stability of Tracking-ADMM.
|
|
15:45-16:00, Paper ThB04.8 | |
Stochastic Gradient Descent for Constrained Optimization Based on Adaptive Relaxed Barrier Functions |
|
Dimitrieski, Naum | RWTH Aachen University |
Cao, Jing | RWTH Aachen |
Ebenbauer, Christian | RWTH Aachen University |
Keywords: Optimization algorithms, Stochastic systems, Optimization
Abstract: This paper presents a novel stochastic gradient descent algorithm for constrained optimization. The proposed algorithm randomly samples constraints and components of the finite sum objective function and relies on a relaxed logarithmic barrier function that is appropriately adapted in each optimization iteration. For a strongly convex objective function and affine inequality constraints, step-size rules and barrier adaptation rules are established that guarantee asymptotic convergence with probability one. The theoretical results in the paper are complemented by numerical studies which highlight potential advantages of the proposed algorithm for optimization problems with a large number of constraints.
|
|
ThB05 |
Galapagos II |
Modelling, Control and Optimization of Electromobility: Synergies between
Transportation, Energy, and Markets |
Invited Session |
Chair: Delle Monache, Maria Laura | University of California, Berkeley |
Co-Chair: Cicic, Mladen | University of California, Berkeley |
Organizer: Cicic, Mladen | CentraleSupélec |
Organizer: Fierro Ulloa, Joel Ignacio | Centre Inria De l'Université Grenoble Alpes |
Organizer: Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
Organizer: Delle Monache, Maria Laura | University of California, Berkeley |
|
14:00-14:15, Paper ThB05.1 | |
Two-Stage Mechanism Design for Electric Vehicle Charging with Day-Ahead Reservations (I) |
|
Su, Pan-Yang | University of California, Berkeley |
Ju, Yi | University of California, Berkeley |
Moura, Scott | University of California, Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Energy systems, Transportation networks, Agents-based systems
Abstract: We analyze the economic (mechanism design) aspect of incorporating flexibility in electric vehicle (EV) charging demand management. We propose a general two-period model, where EVs can reserve charging sessions in the day-ahead market and swap them in the real-time market. Under the model, we explore several candidate mechanisms for running the two markets, compared using several normative properties such as incentive compatibility, efficiency, reservation awareness, and budget balance. Specifically, reservation awareness is the only property coupling the two markets and dictates that an EV will not get a lower utility by joining the real-time market. Focusing on the real-time market, we show that the classical Vickrey-Clarke-Groves (VCG) mechanism that treats the day-ahead allocations as endowments is not budget-balanced. Moreover, we show that no mechanism satisfies some combinations of the properties. Then, we propose using a posted-price mechanism to resolve the issue, which turns out to be the dynamic pricing mechanism adopted in many real-world systems. The proposed mechanism has no efficiency guarantee but satisfies all the other properties. To improve efficiency, we propose using a VCG auction in the day-ahead market that guides the reserve prices in the real-time market. When EVs' valuations in the two markets are close, the proposed approach is approximately efficient.
|
|
14:15-14:30, Paper ThB05.2 | |
Optimizing Electrical Vehicle Charging Infrastructure: A Congestion Game Approach to Pricing and Placement (I) |
|
Gasnier, Guillaume | GIPSA-Lab, CNRS |
Arcak, Murat | University of California, Berkeley |
Poolla, Kameshwar | Univ. of California at Berkeley |
Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
Keywords: Traffic control, Nonlinear systems, Optimization
Abstract: We propose an optimal pricing method for multiple charging stations within a congestion game framework. We compute the equilibrium flows for each pricing strategy and select the prices that maximize the operator’s revenue. The demand at each station is influenced by travel times and incentives to charge. Vehicle types and behaviors (thermal vs. electric, must/may/not charge) are considered. This results in a bi-level optimization problem, which is solved using a Branch-and-Bound approach enhanced with pruning techniques for improved efficiency. Our experiment integrates three levels of optimization: maximizing revenue by optimizing the placement of stations when maximizing their pricing strategies, while minimizing the demand derived from the congestion game model. We examine the total travel time and maximizing revenue does not increase congestion.
|
|
14:30-14:45, Paper ThB05.3 | |
No-Regret Learning in Stackelberg Games with an Application to Electric Ride-Hailing (I) |
|
Maddux, Anna | EPFL Lausanne |
Maljkovic, Marko | Ecole Polytechnique Fédérale De Lausanne (EPFL) |
Geroliminis, Nikolas | Urban Transport Systems Laboratory, EPFL |
Kamgarpour, Maryam | EPFL |
Keywords: Game theory, Learning, Transportation networks
Abstract: We consider the problem of efficiently learning to play single-leader multi-follower Stackelberg games when the leader lacks knowledge of the lower-level game. Such games arise in hierarchical decision-making problems involving self-interested agents. For example, in electric ride-hailing markets, a central authority aims to learn optimal charging prices to shape fleet distributions and charging patterns of ride-hailing companies. Existing works typically apply gradient-based methods to find the leader's optimal strategy. Such methods are impractical as they require that the followers share private utility information with the leader. Instead, we treat the lower-level game as a black box, assuming only that the followers' interactions approximate a Nash equilibrium while the leader observes the realized cost of the resulting approximation. Under kernel-based regularity assumptions on the leader's cost function, we develop a no-regret algorithm that converges to an epsilon-Stackelberg equilibrium in O(sqrt{T}) rounds. Finally, we validate our approach through a numerical case study on optimal pricing in electric ride-hailing markets.
|
|
14:45-15:00, Paper ThB05.4 | |
Tensor Completion Via Integer Optimization (I) |
|
Chen, Xin | Stanford University |
Kudva, Sukanya | UC Berkeley |
Dai, Yongzheng | The Ohio State University |
Aswani, Anil | UC Berkeley |
Chen, Chen | Ohio State University |
Keywords: Computational methods, Optimization algorithms, Smart grid
Abstract: The tensor completion problem is to fill-in unobserved entries of a partially observed tensor. However, past approaches to tensor completion either achieved the information-theoretic rate but lacked practical algorithms, or proposed polynomial-time algorithms that require an exponentially-larger number of samples for low estimation error. In this paper, we develop a novel tensor completion algorithm to tackle this challenge by achieving both provable convergence (in numerical tolerance) in a linear number of oracle steps and the information-theoretic rate. We formulate tensor completion as a convex optimization problem constrained using a gauge-based tensor norm and provide proofs of properties of this norm including its computational complexity and tensor rank surrogacy. We formulate this norm such that linear separation problems over the gauge unit-ball can be solved using integer optimization. This enables the use of Frank-Wolfe variant to build our algorithm. We demonstrate effectiveness of our method using experiments with simulated data and with an application towards providing low-computation predictions of battery storage flows that may be beneficial in billion-device-scale integration of electromobility systems with the grid.
|
|
15:00-15:15, Paper ThB05.5 | |
Macroscopic Modeling and Hierarchical Control of Battery Swapping Stations (I) |
|
Wang, Ruiting | University of California, Berkeley |
Cicic, Mladen | University of California, Berkeley |
Moura, Scott | University of California, Berkeley |
Delle Monache, Maria Laura | University of California, Berkeley |
Keywords: Control applications, Optimal control, Robust control
Abstract: Battery swapping offers a compelling alternative to fast charging for large EV fleets. By decoupling charging from vehicle dwell time, battery swapping stations (BSS) can charge batteries slower, reducing grid strain and extending battery life, while enabling quick vehicle turnaround. In this work, we present a hierarchical control architecture for large-scale BSS that addresses the computational limits of conventional integer programming approaches. By adopting a macroscopic model that represents battery states as a continuous distribution, our method captures nonlinear battery dynamics without sacrificing tractability. In this framework, the upper level optimizes station power procurement in response to market prices, while the lower level enforces realistic charging constraints across hundreds of batteries. This design enables robust operation under stochastic customer arrivals, ensures high service quality, and ultimately maximizes BSS profit, offering a practically scalable solution for heavy-duty EV fleets.
|
|
15:15-15:30, Paper ThB05.6 | |
Coordinating Distributed Energy Resources with Nodal Pricing in Distribution Networks: A Game-Theoretic Approach |
|
Brock, Eli | University of California Berkeley |
Li, Jingqi | University of California, Berkeley |
Lavaei, Javad | UC Berkeley |
Sojoudi, Somayeh | UC Berkeley |
Keywords: Smart grid, Game theory, Power systems
Abstract: We propose a real-time nodal pricing mechanism for cost minimization and voltage control in distribution networks with autonomous distributed energy resources. Unlike existing methods, the proposed pricing scheme does not require device-aware centralized coordination or communication between prosumers. The resulting market is naturally represented as a stochastic game where prosumers learn feedback control policies to optimize their individual rewards. By developing new sufficient conditions under which a stochastic game is a Markov potential game, we show that the problem of computing an equilibrium for the proposed model is equivalent to solving a single-agent Markov decision process. These new conditions are general and may apply to other applications. An equilibrium is computed for an IEEE test system to empirically demonstrate the near-optimal efficiency of the pricing policy.
|
|
15:30-15:45, Paper ThB05.7 | |
MPC for Self-Powered Systems with Distributed Energy Storage |
|
Veurink, Madelyn | University of Michigan |
Scruggs, Jeff | University of Michigan |
Keywords: Energy systems, Predictive control for linear systems, Constrained control
Abstract: Self-powered control technologies derive all the energy needed to implement control by harvesting energy from disturbances. These systems are applicable in cases of vibration suppression, or robotic systems, where the controller cannot rely on external energy sources. Constraints on energy storage impose limitations on control actions, ensuring that stored energy is neither depleted nor exceeded. In this paper the energy constraints are enforced while an integral-quadratic objective is minimized through the use of Model Predictive Control. The presence of multiple storage elements and upper limits on the capacity of the stored energy introduce non-convexities into the optimization problem. To enable real-time implementation, convex-concave techniques are utilized to conservatively bound the energy constraints and determine the optimal control solution. This method is applied to a simple mechanical system composed of three interconnected linkages.
|
|
15:45-16:00, Paper ThB05.8 | |
Adaptive Pricing for Optimal Coordination in Networked Energy Systems with Nonsmooth Cost Functions |
|
Li, Jiayi | University of Washington, Seattle |
Wei, Jiale | School of Science and Engineering , the Chinese University of Ho |
Motoki, Matthew | University of Washington |
Jiang, Yan | The Chinese University of Hong Kong, Shenzhen |
Zhang, Baosen | University of Washington |
Keywords: Power systems, Stability of nonlinear systems
Abstract: Incentive-based coordination mechanisms for distributed energy consumption have shown promise in aligning individual user objectives with social welfare, especially under privacy constraints. Our prior work proposed a two-timescale adaptive pricing framework, where users respond to prices by minimizing their local costs, and the system operator iteratively updates the prices based on aggregate user responses. A key assumption was that the system cost need to smoothly depend on the aggregate of the user demands. In this paper, we relax this assumption by considering the more realistic model in which the system cost is determined by solving a Direct Current Optimal Power Flow (DCOPF) problem with constraints. We present a generalization of the pricing update rule that leverages the generalized gradients of the system cost function, which may be nonsmooth due to the structure of DCOPF. We prove that the resulting dynamic system converges to a unique equilibrium, which solves the social welfare optimization problem. Our theoretical results provide guarantees on convergence and stability using tools from nonsmooth analysis and Lyapunov theory. Numerical simulations on networked energy systems illustrate the effectiveness of the proposed scheme.
|
|
ThB06 |
Oceania I |
Security, Safety, and Resiliency in Cyber-Physical Systems I - Privacy and
Security |
Invited Session |
Chair: Soudjani, Sadegh | Max Planck Institute for Software Systems |
Co-Chair: Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Organizer: Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Organizer: Sadabadi, Mahdieh S. | The University of Manchester |
Organizer: Lucia, Walter | Concordia University |
Organizer: Murguia, Carlos | Eindhoven University of Technology |
Organizer: Selvi, Daniela | Università Di Pisa |
Organizer: Soudjani, Sadegh | Max Planck Institute for Software Systems |
|
14:00-14:15, Paper ThB06.1 | |
Current State Estimation of Timed Labeled Synchronized Petri Nets |
|
Gaouar, Mouna | Aix Marseille Université |
Ammour, Rabah | Aix-Marseille Univ |
Demongodin, Isabel | Aix-Marseille University |
Lefebvre, Dimitri | University Le Havre |
Keywords: Discrete event systems, Petri nets, Estimation
Abstract: In this paper, we address current-state estimation in Timed Labeled Synchronized Petri Nets (TLSPNs), a subclass of Timed Output Synchronized Petri Nets. We introduce the Synchronized State Class Graph to model the state space and propose its transformation into a State Class Interval Automaton (SCIA), a finite state automaton where continuous time is discretized into time tick events. In this framework, a state observer for SCIA is proposed, allowing for current state estimation. The proposed state estimation method is illustrated on an example with secret states, where it is used to infer the net’s current state and compromise confidentiality.
|
|
14:15-14:30, Paper ThB06.2 | |
Anti-Spoofing Aided Solutions for Urban Air Mobility: Ground Command Authentication (I) |
|
Shahkar, Shaham | Concordia University |
Khorasani, Khashayar | Concordia University |
Keywords: Cyber-Physical Security, Autonomous systems, Air traffic management
Abstract: The Cyber-Physical System of Urban Air Mobility (UAM) is among the important pivots of the future smart cities that aim at efficient, safe, and sustainable air transportation of people and goods. UAM technology is characterized by integration and the process of private and state-owned information through wireless tele-communication that exchange important messages, including traffic control commands and geo-fencing rules, public safety announcements, and flight path, etc. Therefore, authentication of transmitted messages is among crucial tasks that require integration to the existing navigation systems, in order to protect the airspace against catastrophic consequences of spoofing cyberattacks. This article aims at introducing an intelligent authentication solution for aerial vehicles, to distinguish legitimate Ground Control Centres (GCCs) from adversaries and intruders by means of behavioural analytics, through consistency examination of the transmitted flight plan with prior waypoint trajectories and flight dynamics of the vehicle. In particular, the proposed authentication technology monitors remotely transmitted flight plans to ensure that firstly, there exists a coherent and consistent path with respect to prior waypoints, and secondly, a deliberate dynamic policy that is consistent with optimal energy conservation practices, both of which require access to information that are rarely available to intruders and adversaries. Consequently, flight plans are only obliged if the likelihood of threats or violations to predetermined constraints and traffic rules are acceptable. Numeric simulations of the results have been provided to validate the developed concepts.
|
|
14:30-14:45, Paper ThB06.3 | |
Privacy Preservation for Statistical Input in Dynamical Systems (I) |
|
Liu, Le | University of Groningen |
Kawano, Yu | Hiroshima University |
Cao, Ming | University of Groningen |
Keywords: Control Systems Privacy, Networked control systems, Linear systems
Abstract: This paper addresses the challenge of privacy preservation for statistical inputs in dynamical systems. Motivated by an autonomous building application, we formulate a privacy preservation problem for statistical inputs in linear time-invariant systems. What makes this problem widely applicable is that the inputs, rather than being assumed to be deterministic, follow a probability distribution, inherently embedding privacy-sensitive information that requires protection. This formulation also presents a technical challenge as conventional differential privacy mechanisms are not directly applicable. Through rigorous analysis, we develop strategy to achieve (0, delta) differential privacy through adding noise. Finally, the effectiveness of our methods is demonstrated by revisiting the autonomous building application.
|
|
14:45-15:00, Paper ThB06.4 | |
On the Interplay of Privacy, Persuasion and Quantization (I) |
|
Anand, Anju | Binghamton University |
Akyol, Emrah | SUNY Binghamton |
Keywords: Game theory, Control Systems Privacy, Quantized systems
Abstract: We develop a communication-theoretic framework for privacy-aware and resilient decision making in cyber-physical systems under emph{misaligned} objectives between the encoder and the decoder. The encoder observes two correlated signals (X,theta) and transmits a finite-rate message Z to aid a legitimate controller (the decoder) in estimating X+theta, while an eavesdropper intercepts Z to infer the private parameter theta. Unlike conventional setups where encoder and decoder share a common MSE objective, here the encoder minimizes a Lagrangian that balances legitimate control fidelity emph{and} the privacy leakage about theta. In contrast, the decoder’s goal is purely to minimize its own estimation error without regard for privacy. We analyze fully, partially, and non-revealing strategies that arise from this conflict, and characterize optimal linear encoders when the rate constraints are lifted. For finite-rate channels, we employ gradient-based methods to compute the optimal controllers. Numerical experiments illustrate how tuning the privacy parameter shapes the trade-off between control performance and resilience against unauthorized inferences.
|
|
15:00-15:15, Paper ThB06.5 | |
Influential Pipes in Water Networks |
|
Bahavarnia, MirSaleh | Vanderbilt University |
Elsherif, Salma M. | Vanderbilt University |
Taha, Ahmad | Vanderbilt University |
Keywords: Network analysis and control, Networked control systems, Cyber-Physical Security
Abstract: Water distribution networks (WDNs) are aging infrastructure that susceptible to various failures including human errors, malicious cyber-attacks, pipe bursts and leaks. This necessitates vulnerability analysis of WDN topologies and hydraulic models that capture the network physics. Graph-theoretic centrality measures—as a primary class of centrality measures in network science—aim to rank the network components solely based on their influence on the WDN topology in the case of changes, while overlooking the WDN dynamics. To overcome such a limitation, this paper uses a control-theoretic centrality measure to identify the network’s most and least influential pipes in WDNs, by simultaneously incorporating the dynamics and topology of the WDN. We introduce a node centrality-based measure, namely vulnerability vector (VV), to rank the network pipes based on their influence on the dynamics and topology of the WDN in the case of changes. This approach enables water system operators to better understand the network’s vulnerabilities and effectively prioritize the maintenance and operational efforts on the most influential pipes.
|
|
15:15-15:30, Paper ThB06.6 | |
Decentralized Attack Detection and Localization for Finite State Machines |
|
Bushra, Bushra | Politecnico Di Bari |
De Santis, Elena | University of L'Aquila |
Pola, Giordano | University of L'Aquila |
Keywords: Attack Detection, Cyber-Physical Security, Discrete event systems
Abstract: This work aims to analyze different aspects of security for a network of two agents in a decentralized framework. Mainly, it deals with two concerns: attack detection and localization. Agents and Attacker are modeled by finite state machines. Agents are interconnected via output feedback composition and share their output information through vulnerable communication channels. Attacker can act in the communication channels between the agents. Using the composed model, necessary and sufficient conditions are developed to ensure the detectability and localizability of the attacked channel. A numerical example is also included at the end of the paper.
|
|
15:30-15:45, Paper ThB06.7 | |
Robust Decentralized Control for Local Detection of Covert Cyberattacks in Interconnected Systems |
|
Ansari Rad, Saeed | University of British Columbia |
Al-Dabbagh, Ahmad | University of British Columbia |
Bin, Michelangelo | University of Bologna |
Keywords: Attack Detection, Cyber-Physical Security, Resilient Control Systems
Abstract: Interconnected systems consist of multiple subsystems coupled through physical and cyber connections. Within a subsystem, a covert cyberattack manipulates actuator commands to achieve a malicious objective while simultaneously manipulating sensor measurements to mimic nominal behavior, thereby remain undetected. Traditionally in the literature, approaches are proposed to detect and isolate such cyberattacks by using banks of observers, irrespective of any control scheme. In addition, the detection of a covert cyberattack within a subsystem as well as its isolation is feasible only by its neighbouring subsystems. In this paper, we propose an alternative approach, by incorporating a decentralized controller and a decentralized state-disturbance observer within each subsystem, which makes detection of covert cyberattacks feasible locally within each subsystem (i.e., local detection), thereby eliminating the reliance on neighbouring subsystems as well as the need for additional design complexity for isolation. The proposed approach is validated through numerical simulations.
|
|
15:45-16:00, Paper ThB06.8 | |
Synthesizing Grid Data with Cyber Resilience and Privacy Guarantees |
|
Wu, Shengyang | University of Michigan |
Dvorkin, Vladimir | University of Michigan |
Keywords: Power systems, Optimization, Smart grid
Abstract: Differential privacy (DP) provides a principled approach to synthesizing data (e.g., loads) from real-world power systems while limiting the exposure of sensitive information. However, adversaries may exploit synthetic data to calibrate cyberattacks on the source grids. To control these risks, we propose new DP algorithms for synthesizing data that provide the source grids with both cyber resilience and privacy guarantees. The algorithms incorporate both normal operation and attack optimization models to balance the fidelity of synthesized data and cyber resilience. The resulting post-processing optimization is reformulated as a robust optimization problem, which is compatible with the exponential mechanism of DP to moderate its computational burden.
|
|
ThB07 |
Capri I |
Positive and Monotone Systems As Unifying Perspectives on Control |
Invited Session |
Chair: Rantzer, Anders | Lund University |
Co-Chair: Kawano, Yu | Hiroshima University |
Organizer: Pates, Richard | Lund University |
Organizer: Rantzer, Anders | Lund University |
|
14:00-14:15, Paper ThB07.1 | |
Performance Analysis for Cone-Preserving Switched Systems with Constrained Switching (I) |
|
Seidel, Marc | University of Stuttgart |
Pates, Richard | Lund University |
Allgöwer, Frank | University of Stuttgart |
Keywords: Switched systems, Compartmental and Positive systems
Abstract: This paper studies cone-preserving linear discrete-time switched systems whose switching is governed by an automaton. For this general system class, we present performance analysis conditions for a broadly usable performance measure. In doing so, we generalize several known results for performance and stability analysis for switched and positive switched systems, providing a unifying perspective. We also arrive at novel ℓ1-performance analysis conditions for positive switched systems with constrained switching, for which we present an application-motivated numerical example. Further, the cone-preserving perspective provides insights into appropriate Lyapunov function selection.
|
|
14:15-14:30, Paper ThB07.2 | |
A Study of Altruistic Behaviour from a Control Theory Perspective (I) |
|
Blanchini, Franco | Univ. Degli Studi Di Udine |
Casagrande, Daniele | University of Udine |
Colaneri, Patrizio | Politecnico Di Milano |
Keywords: Cooperative control, Optimal control, Modeling
Abstract: The paper shows that the altruistic behaviour can be interpreted as the solution of an optimal control problem. To this purpose, two discrete-time dynamic models for the interaction between individuals’ fitnesses are described. For the first model, which is linear with constant coefficients, it is possible to prove that the optimal control input is a classic bang-bang function with one switching (at maximum) that can be explicitly located in time. To show that a solution exists also for the second model, which is a non-linear extension of the first one, we adapt to the discrete-time case a result available in the literature and concerning the implicit expression of the solution of optimal control problems for continuous-time positive systems. Numerical examples are also added to highlight the results. Finally, we show that the models can also fit other real problems, among which we consider, in particular, the optimal investment of a capital.
|
|
14:30-14:45, Paper ThB07.3 | |
Linear Time-And-Space-Invariant Relaxation Systems (I) |
|
Donchev, Tihol Ivanov | KU Leuven |
Shali, Brayan M. | KU Leuven |
Sepulchre, Rodolphe | University of Cambridge |
Keywords: Distributed parameter systems, Linear systems
Abstract: This paper generalizes the physical property of relaxation from linear time-invariant (LTI) to linear time-and-space-invariant (LTSI) systems. It is shown that the defining features of relaxation---complete monotonicity, passivity, and memory-based storage---carry over seamlessly to the spatio-temporal domain. An LTSI system is shown to be of relaxation type if and only if its associated spatio-temporal Hankel operator is cyclically monotone. This implies the existence of an intrinsic quadratic storage functional defined uniquely by past inputs, independently of any state-space realization. As in the LTI case, LTSI relaxation systems are shown to be those systems for which the state-space concept of storage coincides with the input-output concept of fading memory functional.
|
|
14:45-15:00, Paper ThB07.4 | |
Characterization of Discrete-Time Periodically Monotone Systems (I) |
|
Grussler, Christian | Technion - Israel Institute of Technology |
Keywords: Linear systems, Nonlinear systems, PID control
Abstract: Systems that preserve the property of periodic monotonicity, i.e., period-wise unimodality are studied. Leveraging total positivity theory and its geometric interpretations, we derive tractable characterizations of such systems by linking them to our tractable description of sequentially convex contours. Since common static nonlinearities (e.g., ideal relay, saturation, etc.) preserve periodic monotonicity, our findings also apply to discrete-time Lur’e feedback systems. This lays the way for signal-based fixed-point theorems that aid in predicting self-sustained oscillations. Our examples demonstrate the utility of periodic monotonicity preservation in relay feedback systems.
|
|
15:00-15:15, Paper ThB07.5 | |
Self-Sustained Oscillations in Discrete-Time Relay Feedback Systems (I) |
|
Tong, Kang | Technion – Israel Institute of Technology |
Grussler, Christian | Technion - Israel Institute of Technology |
Chong, Michelle | Eindhoven University of Technology |
Keywords: Nonlinear systems, PID control
Abstract: We study the problem of determining self-sustained oscillations in discrete-time linear time-invariant relay feedback systems. Concretely, we are interested in predicting when such a system admits unimodal oscillations, i.e., when the output has a single-peaked period. Under the assumption that the linear system is stable and has an impulse response that is strictly monotonically decreasing on its infinite support, we take a novel approach in using the framework of total positivity to address our main question. It is shown that unimodal self-oscillations can only exist if the number of positive and negative elements in a period coincides. Based on this result, we derive conditions for the existence of such oscillations, determine bounds on their periods, and address the question of uniqueness.
|
|
15:15-15:30, Paper ThB07.6 | |
The KYP Lemma As a Positive Extension Problem (I) |
|
Bamieh, Bassam | Univ. of California at Santa Barbara |
Keywords: Linear systems, Behavioural systems, Robust control
Abstract: The well-known KYP lemma has many interpretations. It generally involves a (possibly indefinite) quadratic form defined jointly on the state and input. The KYP lemma gives a necessary and sufficient Linear Matrix Inequality (LMI) condition for this form being non-negative when constrained to state/input pairs obeying the system dynamics. Through a covariance representation, this problem is converted to whether a linear functional on a subspace is positive. For certain functional/subspace pairs, this problem is known to be equivalent to whether a positive linear functional on a subspace can be positively extended to the entire space. This is a ``conic'' version of the Hahn-Banach theorem, but such extensions may or may not exist depending on the relation between the subspace and the linear form. Using this formalism, we provide a new insight into the KYP Lemma in which the LMI is seen as the condition for the existence of a positive extension, thus certifying the original question of the positivity of the functional on the subspace.
|
|
15:30-15:45, Paper ThB07.7 | |
Minimax Optimal Adaptive Control for Systems on Cones (I) |
|
Rantzer, Anders | Lund University |
Keywords: Optimal control, Adaptive control, Robust adaptive control
Abstract: The theory of optimal control on positive cones has recently identified several new problem classes where the Bellman equation can be solved explicitly, in analogy with classical linear quadratic control. In this paper, the idea is extended to adaptive control, yielding exact solutions to instances of the Bellman equation for dual control. In particular, this allows for optimization of the fundamental tradeoff between exploration and exploitation.
|
|
15:45-16:00, Paper ThB07.8 | |
Multi-Stable Monotone Networks |
|
Watanabe, Rintaro | Hiroshima University |
Kawano, Yu | Hiroshima University |
Wada, Nobutaka | Hiroshima University |
Keywords: Compartmental and Positive systems, Stability of nonlinear systems, Decentralized control
Abstract: Almost every bounded trajectory of a strongly monotone system converges to an equilibrium. Relying on this, we derive a component-wise sufficient condition for multi-stability of monotone networks. We first show that a network is strongly monotone if each monotone subsystem is excitable and transparent, and every interconnection preserves monotonicity in a strong sense. Next, all trajectories of a network are bounded if each subsystem is input-to-state stable (ISS), and every interconnection function is bounded; we derive a sufficient condition for ISS tailored to monotone systems. Finally, we present a component-wise instability condition of an equilibrium for a network based on linearization, where instability guarantees the existence of multiple attracting equilibria.
|
|
ThB08 |
Oceania V |
Reinforcement Learning I |
Regular Session |
Chair: Panaganti, Kishan | Caltech |
Co-Chair: Anderson, James | Columbia University |
|
14:00-14:15, Paper ThB08.1 | |
Optimal Control of Probabilistic Dynamics Models Via Mean Hamiltonian Minimization |
|
Leeftink, David | Radboud University, Nijmegen |
Yildiz, Cagatay | University of Tubingen |
Ridderbusch, Steffen | University of Oxford |
Hinne, Max | Radboud University |
van Gerven, Marcel | Radboud University |
Keywords: Reinforcement learning, Optimal control, Predictive control for nonlinear systems
Abstract: Without exact knowledge of the true system dynamics, optimal control of non-linear continuous-time systems requires careful treatment under epistemic uncertainty. In this work, we translate a probabilistic interpretation of the Pontryagin maximum principle to the challenge of optimal control with learned probabilistic dynamics models. Our framework provides a principled treatment of epistemic uncertainty by minimizing the mean Hamiltonian with respect to a posterior distribution over the system dynamics. We propose a multiple shooting numerical method that leverages mean Hamiltonian minimization and is scalable to large-scale probabilistic dynamics models, including ensemble neural ordinary differential equations. Comparisons against other baselines in online and offline model-based reinforcement learning tasks show that our probabilistic Hamiltonian approach leads to reduced trial costs in offline settings and achieves competitive performance in online scenarios. By bridging optimal control and reinforcement learning, our approach offers a principled and practical framework for controlling uncertain systems with learned dynamics.
|
|
14:15-14:30, Paper ThB08.2 | |
Beyond Expected Value: Geometric Mean Optimization for Long-Term Policy Performance in Reinforcement Learning |
|
Sheng, Xinyi | Aalto University |
Baumann, Dominik | Aalto University |
Keywords: Reinforcement learning, Machine learning, Optimization algorithms
Abstract: Reinforcement learning (RL) algorithms typically optimize the expected cumulative reward, i.e., the expected value of the sum of scalar rewards an agent receives over the course of a trajectory. The expected value averages the performance over an infinite number of trajectories. However, when deploying the agent in the real world, this ensemble average may be uninformative for the performance of individual trajectories. Thus, in many applications, optimizing the long-term performance of individual trajectories might be more desirable. In this work, we propose a novel RL algorithm that combines the standard ensemble average with the time-average growth rate, a measure for the long-term performance of individual trajectories. We first define the Bellman operator for the time-average growth rate. We then show that, under multiplicative reward dynamics, the geometric mean aligns with the time-average growth rate. To address more general and unknown reward dynamics, we propose a modified geometric mean with N-sliding window that captures the path-dependency as an estimator for the time-average growth rate. This estimator is embedded as a regularizer into the objective, forming a practical algorithm and enabling the policy to benefit from ensemble average and time-average simultaneously. We evaluate our algorithm in challenging simulations, where it outperforms conventional RL methods.
|
|
14:30-14:45, Paper ThB08.3 | |
Teaching Precommitted Agents: Model-Free Policy Evaluation and Control in Quasi-Hyperbolic Discounted MDPs |
|
Sure Reddappa Setty, Eshwar | Indian Institute of Science |
Keywords: Reinforcement learning, Stochastic optimal control, Iterative learning control
Abstract: Time-inconsistent preferences, where agents favor smaller-sooner over larger-later rewards, are a key feature of human and animal decision-making. Quasi-Hyperbolic (QH) discounting provides a simple yet powerful model for this behavior, but its integration into the reinforcement learning (RL) framework has been limited. This paper addresses key theoretical and algorithmic gaps for precommitted agents with QH preferences. We make two primary contributions: (i) we formally characterize the structure of the optimal policy, proving for the first time that it reduces to a simple one-step non-stationary form; and (ii) we design the first practical, model-free algorithms for both policy evaluation and Q-learning in this setting, both with provable convergence guarantees. Our results provide foundational insights for incorporating QH preferences in RL.
|
|
14:45-15:00, Paper ThB08.4 | |
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning |
|
Zhan, Donglin | Columbia University |
Toso, Leonardo Felipe | Columbia University |
Anderson, James | Columbia University |
Keywords: Reinforcement learning, Optimal control, Learning
Abstract: We study how task selection can enhance sample efficiency in model-agnostic meta-reinforcement learning. Traditional meta-RL typically assumes that all available tasks are equally important, which can lead to task redundancy when they share significant similarities. To address this, we propose a coreset-based task selection approach that selects a weighted subset of tasks based on how diverse they are in gradient space, prioritizing the most informative and diverse tasks. Such task selection reduces the number of samples needed to find an epsilon-close stationary solution by a factor of O(1/epsilon). Consequently, it guarantees a faster adaptation to unseen tasks while focusing training on the most relevant tasks. As a case study, we incorporate task selection to MAML-LQR, and prove a sample complexity reduction proportional to O(log(1/epsilon)) when task-specific costs also satisfy the gradient dominance property. Our theoretical guarantees underscore task selection as a key component for scalable and sample-efficient meta-RL. We numerically validate this trend across multiple RL benchmark problems, illustrating the benefits of task selection beyond the LQR baseline.
|
|
15:00-15:15, Paper ThB08.5 | |
Policy Gradient for LQR with Domain Randomization |
|
Fujinami, Tesshu | University of Pennsylvania |
Lee, Bruce | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Reinforcement learning, Learning, Machine learning
Abstract: Domain randomization (DR) enables sim-to-real transfer by training controllers on a distribution of simulated environments, with the goal of achieving robust performance in the real world. Although DR is widely used in practice and is often solved using simple policy gradient (PG) methods, understanding of its theoretical guarantees remains limited. Toward addressing this gap, we provide the first convergence analysis of PG methods for domain-randomized linear quadratic regulation (LQR). We show that PG converges globally to the minimizer of a finite-sample approximation of the DR objective under suitable bounds on the heterogeneity of the sampled systems. We also quantify the sample-complexity associated with achieving a small performance gap between the sample-average and population-level objectives. Additionally, we propose and analyze a discount-factor annealing algorithm that obviates the need for an initial jointly stabilizing controller, which may be challenging to find. Empirical results support our theoretical findings and highlight promising directions for future work, including risk-sensitive DR formulations and stochastic PG algorithms.
|
|
15:15-15:30, Paper ThB08.6 | |
Plan for the Worst with Advice: Advice-Augmented Robust Markov Decision Processes |
|
Handina, Tinashe | California Institute of Technology |
Panaganti, Kishan | Caltech |
Mazumdar, Eric | California Institute of Technology |
Wierman, Adam | California Institute of Technology |
Keywords: Markov processes, Uncertain systems, Reinforcement learning
Abstract: We consider the integration of advice into Robust Markov Decision Processes (RMDPs). While the RMDP formulation aids in modeling ambiguity with respect to transition dynamics, it is overly conservative due to its focus on worst-case instances. To move beyond the worst-case framework, we propose an advice-augmented setting in which the decision maker has access to advice in the form of a predicted transition kernel they seek to leverage to obtain better guarantees. The decision maker in this setting cares about finding a policy that performs well for both the worst case and advice transition dynamics. Thus, we define robustness and consistency as metrics the decision maker optimizes and propose a family of optimization problems whose solutions are Pareto-optimal with respect to robustness and consistency. Under standard assumptions on the ambiguity set, the optimal solutions are deterministic, Markovian, and stationary. Given a set of Pareto-optimal policies, we then provide a policy selection algorithm that achieves max-min optimality across robustness and consistency. Finally, we provide empirical evidence that this algorithm achieves robust and consistent performance in planning problems even under significant ambiguity in the underlying dynamics.
|
|
15:30-15:45, Paper ThB08.7 | |
Efficient Inverse Reinforcement Learning for Unknown Discrete-Time Systems |
|
Huang, Longyang | Shanghai Jiao Tong University |
Liu, Ruonan | The Department of Automation, Shanghai Jiao Tong University |
Jia, Zehua | Hainan University |
Zhang, Weidong | Shanghai Jiaotong Univ |
Keywords: Reinforcement learning, Linear systems, Optimal control
Abstract: This paper develops a data-driven inverse Q-learning algorithm for unknown discrete-time linear time-invariant systems. A novel data representation method is designed to construct the iteration of the cost kernel, on the basis of which a sample-efficient inverse Q-learning algorithm is proposed to learn the cost function. The proposed algorithm only utilizes expert demonstration trajectories and does not require system dynamics information and stabilizing control gains. The convergence of the algorithm is proven. Moreover, it is also demonstrated that the stability of the closed-loop system under each iterative gain is guaranteed. Compared with existing data-driven inverse reinforcement learning (RL) methods, the proposed algorithm has significant advantages in terms of sample efficiency. The substantial reduction in sample complexity significantly improves efficiency and scalability. The effectiveness of the algorithm is verified through simulations.
|
|
15:45-16:00, Paper ThB08.8 | |
Task-Oriented Energy Storage Management for Solar-Powered UAVs: An Enhanced Multi-Objective Deep Reinforcement Learning Approach |
|
Du, Yunhao | Tianjin University |
Zuo, Zhiqiang | Tianjin University |
Zhang, Zhicheng | Tianjin University |
Li, Peng | Tianjin University |
Wang, Yijing | Tianjin University |
Zhao, Rui | Tianjin University |
Li, Zheng | Tianjin University |
Keywords: Energy systems, Reinforcement learning, Autonomous systems
Abstract: The energy storage system of solar-powered unmanned aerial vehicles (SUAVs) faces several important tasks during the long-endurance continuous flight process: maximizing energy utilization, enhancing energy system health and addressing complex flight conditions. To tackle the challenges posed by those tasks, this work presents a multi-objective deep reinforcement learning-based energy management strategy that takes into account both the energy storage state and gravitational energy. First, a comprehensive model of the flight environment and SUAVs energy system is developed, capturing the interactions among solar energy input, energy storage, and flight dynamics. Next, a multi-objective Markov decision process is formulated, incorporating the state of charge, battery health, and altitude. Additionally, an enhanced n-step twin delayed deep deterministic policy gradient (TD3) reinforcement learning algorithm is proposed to optimize long-term energy management. The simulation results demonstrate that the proposed algorithm effectively manages power distribution between the battery and load, with a notable performance improvement compared to the conventional ones. Finally, the practicality of the proposed framework is validated through a hardware-inthe- loop (HIL) experimental platform, confirming its feasibility in real-world scenarios.
|
|
ThB09 |
Oceania VIII |
Nonlinear System Identification I |
Regular Session |
Chair: Mattsson, Per | Uppsala University |
Co-Chair: Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
|
14:00-14:15, Paper ThB09.1 | |
Neural Identification of Feedback-Stabilized Nonlinear Systems |
|
Ghoddousi Boroujeni, Mahrokh | École Polytechnique Fédérale De Lausanne |
Meroi, Laura | EPFL |
Massai, Leonardo | EPFL |
Galimberti, Clara Lucía | Scuola Universitaria Professionale Della Svizzera Italiana |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Closed-loop identification, Neural networks, Nonlinear systems
Abstract: Neural networks have demonstrated remarkable success in modeling nonlinear dynamical systems. However, identifying these systems from closed-loop experimental data remains a challenge due to the correlations induced by the feedback loop. Traditional nonlinear closed-loop system identification methods struggle with reliance on precise noise models, robustness to data variations, or computational feasibility. Additionally, it is essential to ensure that the identified model is stabilized by the same controller used during data collection, ensuring alignment with the true system’s closed-loop behavior. The dual Youla parameterization provides a promising solution for linear systems, offering statistical guarantees and closed-loop stability. However, extending this approach to nonlinear systems presents additional complexities. In this work, we propose a computationally tractable framework for identifying complex, potentially unstable systems while ensuring closed-loop stability using a complete parameterization of systems stabilized by a given controller. We establish asymptotic consistency in the linear case and validate our method through numerical comparisons, demonstrating superior accuracy over direct identification baselines and compatibility with the true system in stability properties.
|
|
14:15-14:30, Paper ThB09.2 | |
Identification of Saturated Networked Systems |
|
Annabi, Adel Malik | Inria Center of University Côte D'Azur |
Pomet, Jean-Baptiste | INRIA |
Prandi, Dario | Université Paris-Saclay, CentraleSupélec, CNRS |
Sacchelli, Ludovic | Inria |
Keywords: Nonlinear systems identification, Biological systems, Closed-loop identification
Abstract: We address the challenge of parameter identification in networks of neural mass models. We consider a system of n interconnected populations, structured in successive layers, where measurements are available from only one. Traditional offline methods struggle with parameter ambiguity and time-dependent variability. We propose an online approach that exploits the system's nonlinear characteristics, particularly saturation functions, to enable effective parameter recovery. By designing specific control inputs applied to targeted nodes, it becomes possible to isolate and retrieve parameters effectively. Crucially, we show that this process must be conducted online to achieve full identification, as control inputs must adapt to previously unknown states and parameters.
|
|
14:30-14:45, Paper ThB09.3 | |
A Hybrid Framework for Efficient Koopman Operator Learning |
|
Jung, Leonard | Northeastern University |
Spiro, Alenna | Northeastern University |
Estornell, Alexander | Northeastern University |
Everett, Michael | Northeastern University |
Sznaier, Mario | Northeastern University |
Keywords: Nonlinear systems identification, Nonlinear systems, Neural networks
Abstract: Koopman analysis of a general dynamics system provides a linear Koopman operator and an embedded eigenfunction space, enabling the application of standard techniques from linear analysis. However, in practice, deriving exact operators and mappings for the observable space is intractable, and deriving an approximation or expressive subset of these functions is challenging. Programmatic methods often rely on system-specific parameters and may scale poorly in both time and space, while learning-based approaches depend heavily on difficult-to-know hyperparameters, such as the dimension of the observable space. To address the limitations of both methods, we propose a hybrid framework that uses semidefinite programming to find a representation of the linear operator, then learns an approximate mapping into and out of the space that the operator propagates. This approach enables efficient learning of the operator and explicit mappings while reducing the need for specifying the unknown structure ahead of time.
|
|
14:45-15:00, Paper ThB09.4 | |
System Identification for Virtual Sensor-Based Model Predictive Control: Application to a 2-DoF Direct-Drive Robotic Arm |
|
Tsuji, Kosei | Kyoto University |
Maruta, Ichiro | Kyoto University |
Fujimoto, Kenji | Kyoto University |
Maeda, Tomoyuki | Process Tech. Rsrch Lab |
Tamase, Yoshihisa | Kobe Steel, Ltd |
Shinohara, Tsukasa | Kobe Steel, Ltd |
Keywords: Nonlinear systems identification, Identification for control, Neural networks
Abstract: Nonlinear Model Predictive Control (NMPC) offers a powerful approach for controlling complex nonlinear systems, yet faces two key challenges. First, accurately modeling nonlinear dynamics remains difficult. Second, variables directly related to control objectives often cannot be directly measured during operation. Although high-cost sensors can measure these variables during model development, their use in practical deployment is typically infeasible. To overcome these limitations, we propose a Predictive Virtual Sensor Identification (PVSID) framework that leverages temporary high-cost sensors during the modeling phase to create virtual sensors for NMPC implementation. We validate PVSID on a Two-Degree-of-Freedom (2-DoF) direct-drive robotic arm with complex joint interactions. The tip position is measured using a motion capture system during modeling, while encoders and an Inertial Measurement Unit (IMU) are utilized in NMPC. Experimental results show that our NMPC with identified virtual sensors achieves precise tip trajectory tracking without requiring the motion capture system during operation. PVSID offers a practical solution for implementing optimal control in nonlinear systems where the measurement of key variables is constrained by cost or operational limitations.
|
|
15:00-15:15, Paper ThB09.5 | |
Convergence in On-Line Learning of Static and Dynamic Systems |
|
Wigren, Torbjorn | Uppsala University |
Zhang, Ruoqi | Uppsala Univeristy |
Mattsson, Per | Uppsala University |
Keywords: Nonlinear systems identification, Neural networks, Learning
Abstract: The paper derives analytical expressions for the asymptotic average updating direction of the adaptive moment generation (ADAM) algorithm when applied to recursive identification of nonlinear systems. It is proved that the standard hyper-parameter setting results in the same asymptotic average updating direction as a diagonally power normalized stochastic gradient algorithm. With the internal filtering turned off, the asymptotic average updating direction is instead equivalent to that of a sign-sign stochastic gradient algorithm. Global convergence to an invariant set follows, where a subset of parameters contain those that give a correct input-output description of the system. The paper also exploits a nonlinear dynamic model to embed structure in recurrent neural networks. A Monte-Carlo simulation study validates the results.
|
|
15:15-15:30, Paper ThB09.6 | |
Nonlinear Mode and Koopman Participation Factor Analysis for Inverter Dominated Power Systems |
|
Kar, Jishnudeep | North Carolina State University |
Chakrabortty, Aranya | North Carolina State University |
Bai, He | Oklahoma State University |
Keywords: Power systems, Nonlinear systems identification, Identification
Abstract: This paper presents a novel framework for identifying dominant nonlinear modes and their interactions in electric power systems with high penetration of inverter-based resources (IBRs), leveraging a Carleman linearization-based Koopman participation factor analysis. By capturing higher-order nonlinear dynamics and exposing the complex interactions between the states of the IBRs and those of the power system, this approach helps in characterizing the stability of the nonlinear model of the grid. The proposed methodology is validated on the IEEE-68 bus system, demonstrating its efficiency in quantifying the impact of IBRs on grid dynamics and stability, and providing actionable insights for the design of advanced control strategies to mitigate such nonlinear interactions.
|
|
15:30-15:45, Paper ThB09.7 | |
Inference and Learning of Nonlinear LFR State-Space Models |
|
Floren, Merijn | KU Leuven |
Noël, Jean-Philippe | TU Eindhoven |
Swevers, Jan | KU Leuven |
Keywords: Nonlinear systems identification, Identification for control, Optimization
Abstract: Estimating the parameters of nonlinear block-oriented state-space models from input-output data typically involves solving a highly non-convex optimization problem, making it susceptible to poor local minima and slow convergence. This paper presents a computationally efficient initialization method for fully parametrizing nonlinear linear fractional representation (NL-LFR) models using periodic data. The approach first infers the latent variables and then estimates the model parameters, yielding initial estimates that serve as a starting point for further nonlinear optimization. The proposed method shows robustness against poor local minima, and achieves a twofold error reduction compared to the state-of-the-art on a challenging benchmark dataset.
|
|
15:45-16:00, Paper ThB09.8 | |
Set-Valued Transformer Network for High-Emission Mobile Source Identification |
|
Cao, Yunning | University of Science and Technology of China |
Pei, Lihong | Academy of Mathematics and Systems Sciences, Chinese Academy Of |
Guo, Jian | Academy of Mathematics and Systems Science, Chinese Academy of S |
Cao, Yang | University of Science and Technology of China |
Kang, Yu | University of Science and Technology of China |
Zhao, Yanlong | Academy of Mathematics and Systems Science, Chinese Academyof Sci |
Keywords: Intelligent systems, Identification
Abstract: Identifying high-emission vehicles is a crucial step in regulating urban pollution levels and formulating traffic emission reduction strategies. However, in practical monitoring data, the proportion of high-emission state data is significantly lower compared to normal emission states. This characteristic long-tailed distribution severely impedes the extraction of discriminative features for emission state identification during data mining. Furthermore, the highly nonlinear nature of vehicle emission states and the lack of relevant prior knowledge also pose significant challenges to the construction of identification models.To address the aforementioned issues, we propose a Set-Valued Transformer Network (SVTN) to achieve comprehensive learning of discriminative features from high-emission samples, thereby enhancing detection accuracy. Specifically, this model first employs the transformer to measure the temporal similarity of micro-trip condition variations, thus constructing a mapping rule that projects the original high-dimensional emission data into a low-dimensional feature space. Next, a set-valued identification algorithm is used to probabilistically model the relationship between the generated feature vectors and their labels, providing an accurate metric criterion for the classification algorithm. To validate the effectiveness of our proposed approach, we conducted extensive experiments on the diesel vehicle monitoring data of Hefei city in 2020. The results demonstrate that our method achieves a 9.5% reduction in the missed detection rate for high-emission vehicles compared to the transformer-based baseline, highlighting its superior capability in accurately identifying high-emission mobile pollution sources.
|
|
ThB10 |
Oceania VII |
Distributed and Decentralized Control II |
Regular Session |
Chair: Tegling, Emma | Lund University |
Co-Chair: Masero, Eva | Politecnico Di Milano |
|
14:00-14:15, Paper ThB10.1 | |
Distributed Consensus of Second-Order Multi-Agent Systems with Full-State Constraints under Switching Directed Graphs |
|
Sun, Yue | Harbin Institute of Technology, Shenzhen |
Gong, Youmin | Harbin Institute of Technology, Shenzhen |
Mei, Jie | Harbin Institute of Technology, Shenzhen |
Ma, Guangfu | Harbin Institute of Technology, Shenzhen |
Guo, Yanning | Harbin Institute of Technology |
Wu, Weiren | Harbin Institute of Technology, Shenzhen |
Keywords: Distributed control, Constrained control, Adaptive control
Abstract: This paper presents a distributed control approach for second-order multi-agent systems, considering full-state constraints and external disturbances. The first-order state variables are transformed into unconstrained forms, upon which a linear reference model is constructed to ensure consensus under switching directed graphs. The second-order state constraints are reformulated as constraints on the sliding surface, and a distributed consensus protocol is developed to guarantee the satisfaction of full-state constraints. Finally, an adaptive control scheme is further integrated to estimate and compensate for external disturbances.
|
|
14:15-14:30, Paper ThB10.2 | |
Coalitional Control: Clustering Based on the H2 Norm |
|
Baldivieso Monasterios, Pablo Rodolfo | The University of Sheffield |
Masero, Eva | Politecnico Di Milano |
Keywords: Distributed control, Networked control systems, Control of networks
Abstract: This work presents an original analysis of the coupling sources in a network of dynamical systems. A coalitional controller exploits switching between different groupings of network members to achieve its regulation goals. In this work, we rigorously analyse a key assumption in robust distributed control methods: the weak coupling assumption. We present an LMI-based analysis to assess this assumption and to design RCI sets for all possible network partitions. We also exploit our coupling analysis to propose a novel partition selection optimisation based on the system's H2 norm. This strategy simplifies the partition selection process, as demonstrated through illustrative examples, leading to enhanced computational efficiency while rigorously preserving the stability and robustness essential for distributed control system applications.
|
|
14:30-14:45, Paper ThB10.3 | |
Distributed Team-Based Coverage Control with Aerial Sensing and Ground Execution |
|
Zhang, Hang | Zhejiang University |
Zheng, Ronghao | Zhejiang University, ZJU |
Zhang, Senlin | Zhejiang University |
Liu, Meiqin | Zhejiang University |
Keywords: Distributed control, Networked control systems, Sensor networks
Abstract: In many real-world applications, such as covering a potential forest fire site, homogeneous ground robots are insufficient to respond to the fire. Moreover, when the forest is dense, the sensing capabilities of ground robots are severely limited, resulting in poor coverage. To address these challenges, this paper introduces an air-ground team-based coverage control scheme, where each team consists of one aerial robot that acts as the “eye” to assist several sensing-limited ground robots in coverage. Within this scheme, two weight settings are introduced to design diverse forms of coverage cost functions, meeting diverse needs. Based on these functions, distributed coverage control laws are developed for aerial and ground robots to achieve optimal coverage collaboratively. Simulations are conducted to validate the effectiveness of the control laws.
|
|
14:45-15:00, Paper ThB10.4 | |
GMM-Based Time-Varying Coverage Control |
|
Zamani, Behzad | University of Melbourne |
Kennedy, James | Defence Science and Technology Group |
Chapman, Airlie | University of Melbourne |
Dower, Peter M. | University of Melbourne |
Manzie, Chris | The University of Melbourne |
Crase, Simon | The Defence Science and Technology Group |
Keywords: Distributed control, Autonomous robots, Cooperative control
Abstract: In coverage control problems that involve time-varying density functions, the coverage control law depends on spatial integrals of time evolution of the density function. The latter is often neglected, replaced with an upper bound or calculated as numerical approximation of the spatial integrals involved. In this paper, we consider a special case of time-varying density functions modeled as Gaussian Mixture Models (GMMs) that evolve with time via a set of time-varying sources (with known corresponding velocities). By imposing this structure, we obtain an efficient time-varying coverage controller that fully incorporates the time evolution of the density function. We show that the induced trajectories under our control law minimise the overall coverage cost. We elicit the structure of the proposed controller in contrast to a classical time-varying coverage controller against which we benchmark the coverage performance of the proposed method in simulation. Furthermore, we highlight that the computationally efficient and distributed nature of the proposed control law makes it ideal for multi-vehicle robotic applications involving time-varying coverage control problems. We employ our method in plume monitoring using a swarm of drones. In an experimental field trial we show that drones guided by the proposed controller are able to track a simulated time-varying chemical plume in a distributed manner.
|
|
15:00-15:15, Paper ThB10.5 | |
Decentralized Continuification Control of Multi-Agent Systems Via Distributed Density Estimation |
|
Di Lorenzo, Beniamino | Scuola Superiore Meridionale |
Maffettone, Gian Carlo | Scuola Superiore Meridionale |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Autonomous systems, Distributed parameter systems, Distributed control
Abstract: This letter introduces a novel decentralized implementation of a continuification-based strategy to control the density of large-scale multi-agent systems. While continuification methods effectively address micro-to-macro control problems by reformulating ordinary/stochastic differential equations (ODEs/SDEs) agent-based models into more tractable partial differential equations (PDEs), they traditionally require centralized knowledge of macroscopic state observables. We overcome this limitation by developing a distributed density estimation framework that combines kernel density estimation with PI consensus dynamics. Our approach enables agents to compute local density estimates and derive local control actions using only information from neighboring agents in a communication network. Numerical validations across multiple scenarios—including regulation, tracking, and time-varying communication topologies—confirm the effectiveness of the proposed approach. They also convincingly demonstrate that our decentralized implementation achieves performance comparable to centralized approaches while enhancing reliability and practical applicability.
|
|
15:15-15:30, Paper ThB10.6 | |
Continuous-Time Distributed Learning for Collective Wisdom Maximization |
|
Baković, Luka | Lund University |
Como, Giacomo | Politecnico Di Torino |
Fagnani, Fabio | Politecnico Di Torino |
Proskurnikov, Anton V. | Politecnico Di Torino |
Tegling, Emma | Lund University |
Keywords: Network analysis and control, Distributed control, Nonlinear systems
Abstract: Motivated by the well established idea that collective wisdom is greater than that of an individual, we propose a novel learning dynamics as a sort of companion to the Abelson model of opinion dynamics. Agents are assumed to make independent guesses about the true state of the world after which they engage in opinion exchange leading to consensus. We investigate the problem of finding the optimal parameters for this exchange, e.g. those that minimize the variance of the consensus value. Specifically, the parameter we examine is susceptibility to opinion change. We propose a dynamics for distributed learning of the optimal parameters and analytically show that it converges for all relevant initial conditions by linking to well established results from consensus theory. Lastly, a numerical example provides intuition on both system behavior and our proof methods.
|
|
15:30-15:45, Paper ThB10.7 | |
A Distributed Method for Identifying Articulation Points in Undirected Networks |
|
Xie, Xinye | Zhejiang University |
Zheng, Ronghao | Zhejiang University, ZJU |
Zhang, Senlin | Zhejiang University |
Liu, Meiqin | Zhejiang University |
Keywords: Distributed control, Network analysis and control
Abstract: Identifying articulation points is crucial for evaluating network robustness, as these are nodes whose removal disconnects the associated undirected network. In this work, we propose a distributed algorithm that leverages the maximum consensus protocol to enable individual nodes to locally determine their status as articulation points and identify the biconnected components to which their neighbors belong. The proposed distributed algorithm operates without requiring global topological information while preserving the privacy of the overall network topology. Formal correctness guarantees and time complexity are provided for the algorithm. In addition to articulation point identification, this method enables distributed assessment of network biconnectivity. Experimental results demonstrate the applicability of the algorithm in Barabasi-Albert networks.
|
|
15:45-16:00, Paper ThB10.8 | |
A Security Masking Protocol for Nonlinear and Incrementally Passive Average Consensus Algorithms |
|
Baldoma-Mitjans, Pol | Universitat Politècnica De Catalunya |
Cecilia, Andreu | Universitat Politècnica De Catalunya |
Casadei, Giacomo | Université Grenoble Alpes |
Astolfi, Daniele | Cnrs - Lagepp |
Puig, Vicenc | Universitat Politècnica De Catalunya |
Keywords: Fault detection, Distributed control, Stability of nonlinear systems
Abstract: This work introduces a masking protocol to enhance the security of a consensus protocol for nonlinear multi-agent systems. The proposed approach involves adding a masking signal to each agent's output and applying a de-masking filter at the receiving agent. We establish sufficient conditions to ensure that the proposed security protocol preserves output consensus. Furthermore, numerical simulations demonstrate its effectiveness in preventing eavesdropping and false data injection attacks.
|
|
ThB11 |
Oceania VI |
Control of Networks I |
Regular Session |
Chair: Fabris, Marco | University of Padua |
Co-Chair: Massai, Leonardo | EPFL |
|
14:00-14:15, Paper ThB11.1 | |
Zone Allocation and Preservation for Disturbed Nonholonomic Mobile Robots |
|
Kurtoglu, Deniz | University of South Florida |
Yucelen, Tansel | University of South Florida |
Tran, Dzung | AFRL |
Casbeer, David W. | Air Force Research Laboratory |
Garcia, Eloy | Air Force Research Laboratory |
Keywords: Constrained control, Nonlinear systems, Uncertain systems
Abstract: In this paper, we study the zone allocation and preservation problem for multiagent systems composed of nonholonomic mobile robots subject to exogenous disturbances. Specifically, we propose a novel control design framework that guarantees each robot remains within its designated zone at all times and follows a command, accessible only to the leader agent(s), when that command is within its zone. To this end, first, feedback linearization is employed to represent each robot with double integrator dynamics. Second, a diffeomorphic map is applied to transform its constrained dynamics into an unconstrained form. Third, a new distributed adaptive control protocol is designed using the unconstrained robot dynamics and it is rigorously established from a system-theoretic perspective, that this protocol ensures the zone allocation and preservation problem for disturbed nonholonomic mobile robots. Finally, a numerical example demonstrates the efficacy of the proposed framework.
|
|
14:00-14:15, Paper ThB11.1 | |
Low-Dimensional Solutions for Optimal Control of Subsystems Coupled Over a Directed Network |
|
Azzouz, Mohamed-Amine | McGill University |
Gao, Shuang | Polytechnique Montreal |
Mahajan, Aditya | McGill University |
Keywords: Control of networks, Stochastic optimal control, Linear systems
Abstract: In this paper, we investigate optimal control of network-coupled subsystems, where the coupling between the dynamics of the subsystems is represented by the adjacency or Laplacian matrix of a directed graph. Under the assumption that the coupling matrix is normal and the cost coupling is compatible with the dynamics coupling, we use the spectral decomposition of the coupling matrix to decompose the overall system into at most n systems with noise coupled dynamics and decoupled cost, where n is the size of the network. Furthermore, the optimal control input at each subsystem can be computed by solving n1 decoupled Riccati equations where n_1 (n_1 ≤ n) denotes the number of distinct eigenvalues of the coupling matrix, where complex conjugate pairs are not double-counted. A salient feature of the result is that the solution complexity depends on the number of distinct eigenvalues of the coupling matrix rather than the size of the network. Therefore, the proposed solution framework provides a scalable method for synthesizing and implementing optimal control laws for large-scale network-coupled subsystems.
|
|
14:15-14:30, Paper ThB11.2 | |
How Complex Is a Complex Network? Insights from Linear Systems Theory |
|
Baggio, Giacomo | University of Padova, Italy |
Fabris, Marco | University of Padua |
Keywords: Network analysis and control, Linear systems
Abstract: This paper leverages linear systems theory to propose a principled measure of complexity for network systems. We focus on a network of first-order scalar linear systems interconnected through a directed graph. By locally filtering out the effect of nodal dynamics in the interconnected system, we propose a new quantitative index of network complexity rooted in the notion of McMillan degree of a linear system. First, we show that network systems with the same interconnection pattern share the same complexity index for almost all choices of their interconnection weights. Then, we investigate the dependence of the proposed index on the topology of the network and the degree of heterogeneity of the nodal dynamics. Specifically, we find that the index depends on the matching number of subgraphs identified by nodal dynamics of different nature, highlighting the joint impact of network architecture and component diversity on overall system complexity.
|
|
14:30-14:45, Paper ThB11.3 | |
Data-Driven Pattern Formation in Oscillator Networks Using Partial Observations |
|
Shih, Yi-Hsuan | Washington University in St. Louis |
Singhal, Bharat | Washington University in St. Louis |
Li, Jr-Shin | Washington University in St. Louis |
Keywords: Large-scale systems, Data driven control, Control of networks
Abstract: Effective control of oscillator networks is a fundamental challenge with applications across neuroscience, circadian biology, and engineering. The absence of accurate dynamical models has driven a shift toward data-driven control approaches. However, these methods often rely on measuring individual network elements and can only achieve simple binary patterns, such as synchronization and desynchronization, limiting their practical applicability. In this paper, we overcome these limitations and propose a data-driven control framework that can attain any synchronization patterns in oscillator populations without requiring measurement from all network elements. Our principal idea is to characterize a network synchronization pattern as a sequence of order parameters and formulate the control task as a stochastic optimization problem, which is solved using stochastic gradient descent. Through a range of numerical simulations, we show the effectiveness of our approach in forming various synchronization patterns in both simplified phase models and biophysical neuron models.
|
|
14:45-15:00, Paper ThB11.4 | |
Duality between Controllability and Observability for Target Control and Estimation in Networks |
|
Montanari, Arthur | Northwestern University |
Duan, Chao | Xi’an Jiaotong University |
Motter, Adilson E. | Northwestern University |
Keywords: Network analysis and control, Control of networks
Abstract: Output controllability and functional observability are properties that enable respectively the control and estimation of part of the state vector. These notions are of utmost importance in applications to high-dimensional systems, such as large-scale networks, in which only a target subset of variables (nodes) is sought to be controlled or estimated. Although the duality between controllability and observability is well established, the characterization of the duality between their generalized counterparts remains an outstanding problem. Here, we establish both the weak and the strong duality between output controllability and functional observability. Specifically, we show that functional observability of a system implies output controllability of a dual system (weak duality), and that under a certain condition the converse holds (strong duality). As an application of the strong duality, we derive a necessary and sufficient condition for target control via static feedback. This allows us to establish a separation principle between the design of target controllers and functional observers in closed-loop systems. These results generalize the classical duality and separation principles in modern control theory.
|
|
15:00-15:15, Paper ThB11.5 | |
Consensus-Based Stability Analysis of Multi-Agent Networks |
|
Jang, Ingyu | Duke University |
LoCicero, Ethan | Duke University |
Bridgeman, Leila J. | Duke University |
Keywords: Large-scale systems, Network analysis and control, Robust control
Abstract: The emergence of large-scale multi-agent systems has led to controller synthesis methods for sparse communication between agents. However, most sparse controller synthesis algorithms remain centralized, requiring information exchange and high computational costs. This underscores the need for distributed algorithms that design controllers using only local dynamics information from each agent. This paper presents a consensus-based distributed stability analysis. The proposed stability analysis algorithms leverage Vidyasagar's Network Dissipativity Theorem and the alternating direction methods of multipliers to perform general stability analysis. Numerical examples involving a 2D swarm of unmanned aerial vehicles demonstrate the convergence of the proposed algorithms.
|
|
15:15-15:30, Paper ThB11.6 | |
On Decentralized Stability Conditions Using Scaled Relative Graphs |
|
Baron Prada, Eder David | Austrian Institute of Technology |
Anta, Adolfo | Austrian Institute of Technology |
Dorfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Stability of linear systems, Decentralized control, Network analysis and control
Abstract: Traditional centralized methods for stability analysis in linear multi-agent systems face significant challenges, including limited scalability, lack of modularity, and difficulties in distributed implementation. Various decentralized approaches have been developed to overcome these issues, such as passivity-based techniques and methods combining small gain and phase theorems. Although these approaches improve scalability, they often yield conservative results. In this paper, we introduce a novel set of decentralized stability conditions based on the Scaled Relative Graphs (SRG) framework, providing an efficient and effective tool for analyzing large-scale systems.
|
|
15:30-15:45, Paper ThB11.7 | |
U-Centrality: A Network Centrality Measure Based on Minimum Energy Control for Laplacian Dynamics |
|
Zheng, Xinran | University of Illinois Urbana-Champaign |
Massai, Leonardo | EPFL |
Franceschetti, Massimo | UCSD |
Touri, Behrouz | University of Illinois at Urbana Champaign |
Keywords: Control of networks, Linear systems, Optimal control
Abstract: Network centrality is a foundational concept for quantifying the importance of nodes within a network. Many traditional centrality measures—such as degree and betweenness centrality—are purely structural and often overlook the dynamics that unfold across the network. However, the notion of a node’s importance is inherently context-dependent and must reflect both the system’s dynamics and the specific objectives guiding its operation. Motivated by this perspective, we propose a dynamic, task-aware centrality framework rooted in optimal control theory. By formulating a problem on minimum energy control of average opinion based on Laplacian dynamics and focusing on the variance of terminal state, we introduce a novel centrality measure—termed U-centrality—that quantifies a node’s ability to unify the agents' state. We demonstrate that U-centrality interpolates between known measures: it aligns with degree centrality in the short-time horizon and converges to a new centrality over longer time scales which is closely related to current-flow closeness centrality. This work bridges structural and dynamical approaches to centrality, offering a principled, versatile tool for network analysis in dynamic environments.
|
|
ThB12 |
Oceania X |
Optimization Algorithms I |
Regular Session |
Chair: Zheng, Ronghao | Zhejiang University, ZJU |
Co-Chair: Rikos, Apostolos I. | The Hong Kong University of Science and Technology (Guangzhou) |
|
14:00-14:15, Paper ThB12.1 | |
An Optimistic Gradient Tracking Method for Distributed Minimax Optimization |
|
Huang, Yan | KTH - Kungliga Tekniska Högskolan |
Xu, Jinming | Zhejiang University |
Chen, Jiming | Zhejiang University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Optimization algorithms, Cooperative control, Agents-based systems
Abstract: This paper studies the distributed minimax optimization problem over networks. To enhance convergence performance, we propose a distributed optimistic gradient tracking method, termed DOGT, which solves a surrogate function that captures the similarity between local objective functions to approximate a centralized optimistic approach locally. Leveraging a Lyapunov-based analysis, we prove that DOGT achieves linear convergence to the optimal solution for strongly convex-strongly concave objective functions while remaining robust to the heterogeneity among them. Moreover, by integrating an accelerated consensus protocol, the accelerated DOGT (ADOGT) algorithm achieves an optimal convergence rate of mathcal{O} left( kappa log left( epsilon ^{-1} right) right) and communication complexity of mathcal{O} left( kappa log left( epsilon ^{-1} right) /sqrt{1-sqrt{rho _W}} right) for a suboptimality level of epsilon>0, where kappa is the condition number of the objective function and rho_W is the spectrum gap of the network. Numerical experiments illustrate the effectiveness of the proposed algorithms.
|
|
14:15-14:30, Paper ThB12.2 | |
A Fully Distributed Algorithm for the Nonconvex Constrained Optimization Problem |
|
Shi, Xiasheng | Jiangnan University |
Xu, Lei | Northeastern University |
Yang, Tao | Northeastern University |
Lin, Zhiyun | Southern University of Science and Technology |
Su, Chun-Yi | Concordia Univ |
Keywords: Optimization algorithms, Distributed control, Power systems
Abstract: This paper addresses the nonconvex constrained optimization problem over multi-agent systems, where each agent possesses a nonconvex local objective function and a bounded convex constraint set. A fully distributed primal-dual algorithm is proposed to resolve this problem without relying on prior knowledge of network connectivity or problem-specific parameters. The key innovations include (i) the integration of a differential projection operator to handle local convex constraints, and (ii) the introduction of node-based adaptive control parameters to eliminate dependency on global information such as Lipschitz constants or Laplacian eigenvalues. By leveraging Lyapunov stability theory, we rigorously prove that the proposed algorithm asymptotically converges to a local optimal solution of the nonconvex problem. Furthermore, the algorithm’s effectiveness is validated through two numerical simulations. Comparative results demonstrate superior convergence and robustness against existing methods.
|
|
14:30-14:45, Paper ThB12.3 | |
Decentralized Optimization Via RC-ALADIN with Efficient Quantized Communication |
|
Du, Xu | The Hong Kong University of Science and Technology (Guangzhou) |
Johansson, Karl H. | KTH Royal Institute of Technology |
Rikos, Apostolos I. | The Hong Kong University of Science and Technology (Gz) |
Keywords: Optimization algorithms, Large-scale systems, Networked control systems
Abstract: In this paper, we investigate the problem of decentralized consensus optimization over directed graphs with limited communication bandwidth. We introduce a novel decentralized optimization algorithm that combines the Reduced Consensus Augmented Lagrangian Alternating Direction Inexact Newton (RC-ALADIN) method with a finite time quantized coordination protocol, enabling quantized information exchange among nodes. Assuming the nodes' local objective functions are mu-strongly convex and simply smooth, we establish global convergence at a linear rate to a neighborhood of the optimal solution, with the neighborhood size determined by the quantization level. Additionally, we show that the same convergence result also holds for the case where the local objective functions are convex and L-smooth. Numerical experiments demonstrate that our proposed algorithm compares favorably against algorithms in the current literature while exhibiting communication efficient operation.
|
|
14:45-15:00, Paper ThB12.4 | |
Compressed Zeroth-Order Algorithm for Stochastic Distributed Nonconvex Optimization |
|
Wang, Haonan | Tongji University |
Yi, Xinlei | College of Electronics and Information Engineering, Tongji Unive |
Hong, Yiguang | Tongji University |
Keywords: Networked control systems, Optimization algorithms, Agents-based systems
Abstract: This paper studies the stochastic distributed nonconvex optimization problem over a network of agents, where agents only access stochastic zeroth-order information about their local cost functions and collaboratively optimize the global objective over bandwidth-limited communication networks. To mitigate communication overhead and handle the unavailability of explicit gradient information, we propose a communication compressed zeroth-order stochastic distributed (CZSD) algorithm. By integrating a generalized contractive compressor and a stochastic two-point zeroth-order oracle, CZSD achieves convergence rates comparable to its exact communication counterpart while reducing both communication overhead and sampling complexity. Specifically, to the best of our knowledge, CZSD is the first compressed zeroth-order algorithm achieving linear speedup, with convergence rates of mathcal{O}(sqrt{p}/sqrt{nT}) and mathcal{O}(p/(nT)) under general nonconvex settings and the Polyak--{L}ojasiewicz condition, respectively. Numerical experiments validate the algorithm's effectiveness and communication efficiency.
|
|
15:00-15:15, Paper ThB12.5 | |
Distributed Optimization for MASs Subject to Disturbances: An Event-Triggered Derivative Feedback Approach with Minimum Inter-Event Time |
|
Wang, Dandan | Yangtze Delta Region Academy of Beijing Institute of Technology |
Keywords: Distributed control, Optimization algorithms
Abstract: The distributed optimization for multi-agent systems (MASs) is investigated in this paper. In the system, the unknown time-varying disturbances affecting each agent are taken into account. A distributed event-triggered controller is designed for each agent to solve the optimization problem, meeting several demands, including eliminating the effects of disturbances, discrete communication and controller updates, avoiding using agents' continuous-time actual states, and ensuring the existence of a positive minimum inter-event time (MIET) between consecutive events. Existing approaches in the literature only satisfy one or two of these demands. The proposed event-triggered controller not only conserves communication and computational resources but also reduces the frequency of controller updates for agents. With mild assumptions on local cost functions, the communication network among agents and disturbances, sufficient conditions for the convergence of agents' states are derived. It is also proven that a positive MIET exists. Finally, simulation examples are provided to validate the performance of the proposed controller.
|
|
15:15-15:30, Paper ThB12.6 | |
Friedkin-Johnsen Model Is Distributed Gradient Descent |
|
Akgün, Orhan Eren | Harvard University |
Vékássy, Áron | Harvard University |
Ballotta, Luca | Delft University of Technology |
Yemini, Michal | Bar Ilan University |
Gil, Stephanie | Harvard University |
Keywords: Optimization, Communication networks, Cooperative control
Abstract: The Friedkin-Johnsen (FJ) model describes how agents adjust their opinions through repeated inter actions while accounting for the influence of agents who are partially stubborn. In this letter, we demonstrate that the FJ model is stepwise equivalent to solving the average consensus problem via distributed gradient descent. This perspective provides a unifying framework that bridges opinion dynamics and optimization, enabling the application of well-established results from the optimization literature. To illustrate this, we examine the recently proposed FJ model with diminishing stubbornness and extend prior results that were concerned with fixed communication graphs to time-varying and jointly connected communication graphs. We derive convergence guaran tees and analyze convergence rates under these relaxed assumptions. Finally, we present numerical experiments on random graphs to showcase the impact of diminishing stubbornness dynamics on convergence in both static and time-varying settings.
|
|
15:30-15:45, Paper ThB12.7 | |
Decentralized Riemannian Quasi-Newton Method on Compact Submanifolds |
|
Zhao, Yixian | Zhejiang University |
Huang, Yan | KTH - Kungliga Tekniska Högskolan |
Xu, Jinming | Zhejiang University |
Keywords: Optimization algorithms, Cooperative control, Agents-based systems
Abstract: In this paper, we propose a decentralized Riemannian quasi-Newton method (DRQN) for solving nonconvex optimization problems on compact submanifolds over networks, where each local agent’s objective function is smooth yet non-convex. Different from conventional Riemannian quasi-Newton techniques, DRQN circumvents global geometric operations by locally estimating the global quasi-Newton direction via consensus protocols and gradient tracking schemes. This approach enables efficient convergence under standard geometric conditions while significantly reducing communication overhead. To our knowledge, DRQN is the first decentralized Riemannian quasi-Newton algorithm that is both retraction- and vector transport-free, achieving global convergence under standard assumptions for distributed Riemannian optimization. We validate our theoretical findings through numerical experiments on distributed eigenvalue computation, demonstrating the superior performance of DRQN over state-of-the-art Riemannian gradient tracking and conjugate gradient methods.
|
|
15:45-16:00, Paper ThB12.8 | |
Distributed Optimization and Learning for Automated Stepsize Selection with Finite Time Coordination |
|
Rikos, Apostolos I. | The Hong Kong University of Science and Technology (Gz) |
Bastianello, Nicola | KTH Royal Institute of Technology |
Charalambous, Themistoklis | University of Cyprus |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Optimization algorithms, Large-scale systems, Networked control systems
Abstract: Distributed optimization and learning algorithms are designed to operate over large scale networks enabling processing of vast amounts of data effectively and efficiently. One of the main challenges for ensuring a smooth learning process in gradient-based methods is the appropriate selection of a learning stepsize. Most current distributed approaches let individual nodes adapt their stepsizes locally. However, this may introduce stepsize heterogeneity in the network, thus disrupting the learning process and potentially leading to divergence. In this paper, we propose a distributed learning algorithm that incorporates a novel mechanism for automating stepsize selection among nodes. Our main idea relies on implementing a finite time coordination algorithm for eliminating stepsize heterogeneity among nodes. We analyze the operation of our algorithm and we establish its convergence to the optimal solution. We conclude our paper with numerical simulations for a linear regression problem, showcasing that eliminating stepsize heterogeneity enhances convergence speed and accuracy against current approaches.
|
|
ThB13 |
Oceania IX |
Game Theory II |
Regular Session |
Chair: Li, Max | University of Michigan |
Co-Chair: Brown, Philip N. | University of Colorado Colorado Springs |
|
14:00-14:15, Paper ThB13.1 | |
A Soft Inducement Framework for Incentive-Aided Steering of No-Regret Players |
|
Yorulmaz, Asrin Efe | University of Illinois Urbana-Champaign |
Velicheti, Raj Kiriti | University of Illinois at Urbana Champaign |
Bastopcu, Melih | Bilkent University |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Learning, Agents-based systems
Abstract: In this work, we investigate a steering problem in a mediator-augmented two-player normal-form game, where the mediator aims to guide players toward a specific action profile through information and incentive design. We first characterize the games for which successful steering is possible. Moreover, we establish that steering players to any desired action profile is not always achievable with information design alone, nor when accompanied with sublinear payment schemes. Consequently, we derive a lower bound on the constant payments required per round to achieve this goal. To address these limitations incurred with information design, we introduce an augmented approach that involves a one-shot information design phase before the start of the repeated game, transforming the prior interaction into a Stackelberg game. Finally, we theoretically demonstrate that this approach improves the convergence rate of players' action profiles to the target point by a constant factor with high probability, and support it with empirical results.
|
|
14:15-14:30, Paper ThB13.2 | |
A Convex Formulation of Game-Theoretic Hierarchical Routing |
|
Lee, Dong Ho | The University of Texas at Austin |
Donnel, Kaitlyn | The University of Texas at Austin |
Li, Max | University of Michigan |
Fridovich-Keil, David | The University of Texas at Austin |
Keywords: Game theory, Autonomous systems, Optimization
Abstract: Hierarchical decision-making is a natural paradigm for coordinating multi-agent systems in complex environments such as air traffic management. In this paper, we present a bilevel framework for game-theoretic hierarchical routing, where a high-level router assigns discrete routes to multiple vehicles who seek to optimize potentially noncooperative objectives that depend upon the assigned routes. To address computational challenges, we propose a reformulation that preserves the convexity of each agent's feasible set. This convex reformulation enables a solution to be identified efficiently via a customized branch-and-bound algorithm. Our approach ensures global optimality while capturing strategic interactions between agents at the lower level. We demonstrate the solution concept of our framework in two-vehicle and three-vehicle routing scenarios.
|
|
14:30-14:45, Paper ThB13.3 | |
Worst-Case Equilibria in Networked Resource Allocation Games Rest on a Knife-Edge |
|
Singh, Vartika | University of Colorado Colorado Springs |
Brown, Philip N. | University of Colorado Colorado Springs |
Keywords: Game theory, Networked control systems
Abstract: In networked resource allocation games, the performance of the overall system depends finely on the emergent behavior of individual agents. Past game-theoretic work has frequently focused on characterizing worst-case bounds on the performance of suboptimal Nash equilibria; these bounds indicate that Nash equilibria can be quite poor compared to centrally-computed optima. However, in this paper we show that worst-case Nash equilibria always exhibit a knife-edge phenomenon: at a worst-case Nash equilibrium, every agent is indifferent between its equilibrium choice and its choice in the system-optimal outcome. Furthermore, we introduce tools which allow us to study the relationship between a measure of the agents' aggregate "satisfaction" at equilibrium and the system-level performance of Nash equilibria; our results indicate that a Nash equilibrium can have very poor system-level performance or the agents can be highly satisfied with the equilibrium, but both cannot be true simultaneously. This extends recent observations of the same phenomenon in other classes of games, and gives further credence to the emerging notion that worst-case equilibrium guarantees in games are overly pessimistic.
|
|
14:45-15:00, Paper ThB13.4 | |
Distributed Task Allocation for Self-Interested Agents with Partially Unknown Rewards |
|
Mandal, Nirabhra | University of California San Diego |
Khajenejad, Mohammad | The University of Tulsa |
Martinez, Sonia | University of California at San Diego |
Keywords: Game theory, Optimization
Abstract: This paper provides a novel solution to a task allocation problem, by which a group of agents assigns a discrete set of tasks in a distributed manner. In this setting, heterogeneous agents have individual preferences and associated rewards for doing each task; however, these rewards are only known asymptotically. The assignment problem is formulated by means of a combinatorial partition game for known rewards, with no constraints on the number of tasks per agent. We relax this into a weight game, which together with the former, are shown to contain the optimal task allocation in the corresponding set of Nash Equilibria (NE). We then propose a projected, best-response, ascending gradient dynamics (PBRAG) that converges to a NE in finite time. This forms the basis of a distributed online version that can deal with a converging sequence of rewards by means of an agreement sub-routine. We present simulations that support our results.
|
|
15:00-15:15, Paper ThB13.5 | |
A Colonel Blotto Approach to Deterrence |
|
Grimsman, David | Brigham Young University |
Paarporn, Keith | University of Colorado, Colorado Springs |
Keywords: Game theory, Agents-based systems
Abstract: Deterrence games represent strategic interactions where a defender seeks to prevent an attacker from taking an undesired action by threatening credible retaliation or consequences. In this work, we propose a hierarchical Colonel Blotto game where the defender and attacker have soldiers that they can deploy to various battlefields. The novelty is that once deployed, the attacker soldiers act as decision-making entities who have structured incentives to either attack or defect. The incentives are modeled via a subgame, and are based on a reward for attacking, a penalty, and a probability of being caught. We analyze both levels of strategic interaction, deriving the equilibria for both the lower-level subgames and the higher-level Blotto game. We show under what circumstances the defender can exploit these dynamics through its allocation strategy in order to effectively deter the attacking soldiers from cooperating.
|
|
15:15-15:30, Paper ThB13.6 | |
The Impact of Social Value Orientation on Nash Equilibria of Two Player Quadratic Games |
|
Calderone, Daniel J. | University of New Mexico |
Oishi, Meeko | University of New Mexico |
Keywords: Game theory, Cooperative control, Human-in-the-loop control
Abstract: We consider two player quadratic games in a cooperative framework based in social value orientation, motivated by the need to account for complex interactions between humans and autonomous agents in dynamical systems. Social value orientation posits that each player incorporates the other player's cost into their own objective. Each player's ``orientation'' determines how the player weights their own cost relative to the other player's cost. We derive explicit formulas for the Nash equilibria as a function of social value orientation. We characterize conditions under which equilibria become unbounded and the asymptotes that these unbounded solutions follow. We analyze linear quadratic games with open-loop control, with application to trajectory planning with integrator dynamics.
|
|
15:30-15:45, Paper ThB13.7 | |
How Irrationality Affects Nash Equilibria: A Prospect-Theoretic Perspective |
|
Komalan Sindhu, Ashok Krishnan | Inria, Paris |
Le Cadre, Helene | Inria Lille-Nord Europe |
Busic, Ana | Inria |
Keywords: Game theory, Optimization, Smart grid
Abstract: Noncooperative games with uncertain payoffs have been classically studied under the expected-utility theory framework, which relies on the strong assumption that agents behave rationally. However, simple experiments on human decision makers found them to be not fully rational, due to their subjective risk perception. Prospect theory was proposed as an empirically-grounded model to incorporate irrational behaviours into game-theoretic models. But, how prospect theory shapes the set of Nash equilibria when considering irrational agents, is still poorly understood. To this end, we study how prospect theoretic transformations may generate new equilibria while eliminating existing ones. Focusing on aggregative games, we show that capturing users' irrationality can preserve symmetric equilibria while causing the vanishing of asymmetric equilibria. Further, there exist value functions which map uncountable sets of equilibria in the expected-utility maximization framework to finite sets. This last result may shape some equilibrium selection theories for human-in-the-loop systems where computing a single equilibrium is insufficient and comparison of equilibria is needed.
|
|
15:45-16:00, Paper ThB13.8 | |
Blotto on the Ballot: A Ballot Stuffing Blotto Game |
|
Shah, Harsh | Indian Institute of Technology Bombay |
Nair, Jayakrishnan | IIT Bombay |
Manjunath, D | IIT Bombay, India |
Mandayam, Narayan | Rutgers |
Keywords: Game theory, Optimization
Abstract: We consider the following Colonel Blotto game between parties P1 and PA. P1 deploys a non negative number of troops across J battlefields, while PA chooses K, K
|
|
ThB14 |
Galapagos III |
Robotics and Autonomous Systems I |
Regular Session |
Chair: Malikopoulos, Andreas A. | Cornell University |
Co-Chair: Otte, Michael | University of Maryland College Park |
|
14:00-14:15, Paper ThB14.1 | |
Combining Graph Attention Networks and Distributed Optimization for Multi-Robot Mixed-Integer Convex Programming |
|
Le, Viet-Anh | University of Delaware |
Kounatidis, Panagiotis | Cornell University |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Robotics, Autonomous robots
Abstract: In this paper, we develop a fast mixed-integer convex programming (MICP) framework for multi-robot navigation by combining graph attention networks and distributed optimization. We formulate a mixed-integer optimization problem for receding horizon motion planning of a multi-robot system, taking into account the surrounding obstacles. To address the resulting multi-agent MICP problem in real time, we propose a framework that utilizes heterogeneous graph attention networks to learn the latent mapping from problem parameters to optimal binary solutions. Furthermore, we apply a distributed proximal alternating direction method of multipliers algorithm for solving the convex continuous optimization problem. We demonstrate the effectiveness of our proposed framework through experiments conducted on a robotic testbed.
|
|
14:15-14:30, Paper ThB14.2 | |
Human-State-Aware Non-Linear Control Framework for Physical Collaborative Aerial Transportation |
|
Prajapati, Pratik | Indian Institute of Technology Gandhinagar |
Banavar, Ravi N. | Indian Institute of Technology Bombay |
Vashista, Vineet | Indian Institute of Technology Gandhinagar |
Keywords: Control applications, Robotics, Human-in-the-loop control
Abstract: Recent advancements in mechanical design and robust control modalities have significantly enhanced the ability of aerial robots to interact with humans in various physical tasks. A critical challenge remains to ensure safe, cooperative, and intuitive collaboration between humans and aerial robots during such interactions. In this context, this work introduces a human-state-aware nonlinear control framework specifically designed for collaborative object transportation involving a human operator and a quadcopter. A system is considered in which a rigid object is lifted by a human using a custom-built sensor device, the Human Handle Device. At the same time, the other end is suspended from the quadcopter using a cable. The proposed control modality is designed to improve the quadcopter’s tracking capabilities and is proven exponentially stable. The controller's performance is justified by conducting experiments with and without feed forwarding the human states, particularly the human acceleration, in the quadcopter's controller. The experimental results validate the proposed approach, demonstrating that incorporating human states allows the quadcopter to respond effectively to human actions. This leads to smoother and more stable tracking performance, enhancing overall cooperation and interaction in shared-object transportation tasks.
|
|
14:30-14:45, Paper ThB14.3 | |
Dissipative Avoidance Feedback for Reactive Navigation under Second-Order Dynamics |
|
Smaili, Lyes | Université Du Québec En Outaouais |
Tang, Zhiqi | KTH Royal Institute of Technology |
Berkane, Soulaimane | University of Quebec in Outaouais |
Hamel, Tarek | I3S-CNRS-UCA |
Keywords: Autonomous robots
Abstract: This paper addresses the problem of autonomous robot navigation in unknown, obstacle-filled environments with second-order dynamics by proposing a Dissipative Avoidance Feedback (DAF). Compared to the Artificial Potential Field (APF), which primarily uses repulsive forces based on position, DAF employs a dissipative feedback mechanism that accounts for both position and velocity, contributing to smoother and more natural obstacle avoidance. The proposed continuously differentiable controller solves the motion-to-goal problem while guaranteeing collision-free navigation by using the robot's state and local obstacle distance information. We show that the controller guarantees safe navigation in generic n-dimensional environments and achieves Almost Global Asymptotic Stability (AGAS) under certain curvature conditions. Designed for real-time implementation, DAF requires only locally measured data from limited-range sensors (e.g., LiDAR, depth cameras), making it particularly effective for robots navigating unknown workspaces. Simulations in 2D and 3D environments are conducted to validate the theoretical results and showcase the effectiveness of our approach.
|
|
14:45-15:00, Paper ThB14.4 | |
High-Performance Tracking MPC for Quadcopters with Coupled Time-Varying Constraints and Stability Proofs |
|
Izadi Najafabadi, Maedeh | Eindhoven University of Technology |
Cobbenhagen, A.T.J.R. | Eindhoven University of Technology |
Sommer, Ruben | Avular Mobile Robotics |
Andrien, Alex | Eindhoven University of Technology |
Lefeber, Erjen | Eindhoven University of Technology |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Keywords: Predictive control for linear systems, Stability of linear systems, Robotics
Abstract: In this paper, we present a cascade control structure to address the trajectory tracking problem for quadcopters, ensuring uniform global asymptotic stability of the state tracking error dynamics. An MPC strategy based on a 12-dimensional prediction model is proposed for the outer loop, explicitly accounting for time-varying “coupled” constraints, where multiple variables are interdependent and need to be handled together. The outer-loop controller generates an acceleration reference, which is then converted into attitude and angular velocity references, later tracked by a nonlinear inner-loop controller. Numerical simulations validate the approach, demonstrating enhanced performance in precise and fast tracking by imposing less conservative constraints than existing approaches, while still guaranteeing stability.
|
|
15:00-15:15, Paper ThB14.5 | |
Stability Governor-Guided RLMPC for Robot Manipulators |
|
Dai, Yufan | University of Victoria |
Bellinger, Colin | National Research Council Canada |
Wang, Yunli | National Research Council Canada |
Drummond, Chris | National Research Council Canada |
Shi, Yang | University of Victoria |
Keywords: Predictive control for linear systems, Reinforcement learning, Robotics
Abstract: Multi-joint manipulators hold significant potential across various applications; however, achieving optimized performance while ensuring constraint satisfaction remains challenging. To address this, a reinforcement learning-based model predictive control (RLMPC) framework is employed to optimize the manipulator's motion while simultaneously tuning the terminal weighting. To reduce the computational burden and meet real-time requirements, the terminal constraint is removed from the optimization problem. However, the absence of a terminal constraint in conventional RLMPC frameworks necessitates a sufficiently large prediction horizon for convergence, since a longer horizon helps approximate the long-term cost and guides the system toward stability. Meanwhile, efficiently obtaining feasible samples in the state space remains challenging for manipulators. To overcome these limitations, a stability governor is introduced to generate a reference target at each time step, which enhances sampling efficiency and guides the RLMPC optimization toward a feasible solution that balances path efficiency and control performance. The proposed framework is validated through comparison simulations using a numerical model of the UR10e robot manipulator, demonstrating improved tracking performance, reduced computational complexity, and enhanced constraint satisfaction, showing its potential for real-world applications.
|
|
15:15-15:30, Paper ThB14.6 | |
Unified Hierarchical MPC in Task Trajectory Executing for Modular Manipulators across Diverse Morphologies |
|
Lei, Maolin | Italian Institute of Technology |
Romiti, Edoardo | IIT |
Laurenzi, Arturo | Italian Institute of Technology |
Zhou, Cheng | Tencent |
Xing, Wanli | The University of Hong Kong |
Lu, Liang | University of Hong Kong |
Tsagarakis, Nikos | Italian Institute of Technology |
Keywords: Robotics, Autonomous robots
Abstract: —This work proposes a unified Hierarchical Model Predictive Control (H-MPC) framework for modular manipulator across various morphologies, as the controller can adapt to different configurations for execute the given task without extensive parameter tuning in the controller. The H-MPC framework divides the control process into two levels: a high-level MPC and a low-level MPC. The high-level MPC predicts future states and provides trajectory information, while the low-level MPC refines control actions by updating the predictive model based on this high-level information. This hierarchical structure allows for the integration of kinematic constraints and ensures smooth joint-space trajectories, even near singular configurations. Moreover, the low-level MPC incorporates secondary linearization by leveraging predictive information from the high-level MPC, effectively capturing the second-order Taylor expansion information of the kinematic model while still maintaining a linearized model formulation. This approach not only preserves the simplicity of a linear control model but also enhances the accuracy of the kinematic representation, thereby improving overall control precision and reliability. To validate the effectiveness of the proposed framework, we conduct extensive evaluations across different manipulator morphologies and demonstrate execution of pick-and-place tasks in real-world scenarios.
|
|
15:30-15:45, Paper ThB14.7 | |
Control Barrier Functions Via Minkowski Operations for Safe Navigation among Polytopic Sets |
|
Chen, Yi-Hsuan | University of Maryland, College Park |
Liu, Shuo | Boston University |
Xiao, Wei | WPI |
Belta, Calin | University of Maryland |
Otte, Michael | University of Maryland College Park |
Keywords: Robotics, Optimization, Control applications
Abstract: Safely navigating around obstacles while respecting the dynamics, control, and geometry of the underlying system is a key challenge in robotics. Control Barrier Functions (CBFs) generate safe control policies by considering system dynamics and geometry when calculating safe forward-invariant sets. Existing CBF-based methods often rely on conservative shape approximations, like spheres or ellipsoids, which have explicit and differentiable distance functions. In this paper, we propose an optimization-defined CBF that directly considers the exact Signed Distance Function (SDF) between a polytopic robot and polytopic obstacles. Inspired by the Gilbert-Johnson-Keerthi (GJK) algorithm, we formulate both (i) minimum distance and (ii) penetration depth between polytopic sets as convex optimization problems in the space of Minkowski difference operations (the MD-space). Convenient geometric properties of the MD-space enable the derivatives of implicit SDF between two polytopes to be computed via differentiable optimization. We demonstrate the proposed framework in three scenarios including pure translation, initialization inside an unsafe set, and multi-obstacle avoidance. These three scenarios highlight the generation of a non-conservative maneuver, a recovery after starting in collision, and the consideration of multiple obstacles via pairwise CBF constraint, respectively.
|
|
15:45-16:00, Paper ThB14.8 | |
Sub-Shortest Smooth 3-D Path Planning Generation for Underwater Gliding Robots Considering Oil Bladder Offset |
|
Jing, Anyan | Hunan University |
Liu, Xiaofeng | Hohai University |
Yang, Chenguang | University of the West of England |
Keywords: Optimal control, Identification for control, Autonomous robots
Abstract: This paper proposes a sub-shortest three-dimensional (3-D) path smoothing generation method, referred to as the left-right asymmetric Dubins-Helix path smoothing method, which takes into account the asymmetric left-right turning radii of an underwater gliding robot (UGR) caused by oil bladder offset. Based on the established UGR roll model, the forgetting factor recursive least squares algorithm is employed to identify the center of gravity offset, and this identified offset is used to calculate the left-right turn radius. An algorithm is designed to generate a sub-shortest Dubins-Helix path, meeting constraints on real-time computation, curvature radius, and pitch angle. Numerical simulation results demonstrate that the proposed method generates shorter paths compared to the real-time dynamic Dubins-Helix method, extending the applicability of 3-D path smoothing methods. Notably, the proposed method is applicable to all Dubins vehicles, not limited to UGRs alone.
|
|
ThB15 |
Capri II |
Stochastic Optimal Control II |
Regular Session |
Chair: Stavrou, Photios A. | Eurecom |
Co-Chair: Maity, Dipankar | University of North Carolina at Charlotte |
|
14:00-14:15, Paper ThB15.1 | |
Rollout-Based Approximate Dynamic Programming for MDPs with Information-Theoretic Constraints |
|
He, Zixuan | EURECOM |
Charalambous, Charalambos D. | University of Cyprus |
Stavrou, Photios A. | Eurecom |
Keywords: Information theory and control, Stochastic optimal control, Reinforcement learning
Abstract: This paper studies a finite-horizon Markov decision problem with information-theoretic constraints, where the goal is to minimize directed information from the controlled source process to the control process, subject to stage-wise cost constraints, aiming for an optimal control policy. We propose a new way of approximating a solution for this problem, which is known to be formulated as an unconstrained MDP with a continuous information-state using Q-factors. To avoid the computational complexity of discretizing the continuous information-state space, we propose a truncated rollout-based backward-forward approximate dynamic programming (ADP) framework. Our approach consists of two phases: an offline base policy approximation over a shorter time horizon, followed by an online rollout lookahead minimization, both supported by provable convergence guarantees. We supplement our theoretical results with a numerical example where we demonstrate the cost improvement of the rollout method compared to a previously proposed policy approximation method, and the computational complexity observed in executing the offline and online phases for the two methods.
|
|
14:15-14:30, Paper ThB15.2 | |
Low-Power Optimal Strategy for Witsenhausen Counterexample |
|
Zhao, Mengyuan | KTH Royal Institute of Technology |
Le Treust, Mael | CNRS |
Oechtering, Tobias J. | Royal Institute of Technology (KTH) |
Keywords: Information theory and control, Control over communications, Stochastic optimal control
Abstract: We discuss the Witsenhausen counterexample from the perspective of varying power budgets and propose a low-power estimation (LoPE) strategy. Specifically, our LoPE approach designs the first decision-maker (DM) a quantization step function of the Gaussian source state, making the target system state a piecewise linear function of the source with slope one. This approach contrasts with Witsenhausen’s original two-point strategy, which instead designs the system state itself to be a binary step. While the two-point strategy can outperform the linear strategy in estimation cost, it, along with its multi-step extensions, typically requires a substantial power budget. Analogous to Binary Phase Shift Keying (BPSK) communication for Gaussian channels, we show that the binary LoPE strategy attains first-order optimality in the low-power regime, matching the performance of the linear strategy as the power budget increases from zero. Our analysis also provides an interpretation of the previously observed near-optimal sloped step function ("sawtooth") structure to the Witsenhausen counterexample: In the low-power regime, power saving is prioritized, in which case the LoPE strategy dominates, making the system state a piecewise linear function with slope close to one. Conversely, in the high-power regime, setting the system state as a step function with the slope approaching zero facilitates accurate estimation. Hence, the sawtooth solution can be seen as a combination of both strategies.
|
|
14:30-14:45, Paper ThB15.3 | |
Quadratic Solution for the Data-Based LQ Regulator with Non-Gaussian Noise |
|
Borri, Alessandro | CNR-IASI |
Cacace, Filippo | Università Campus Biomedico Di Roma |
Cusimano, Valerio | CNR-IASI, Italian National Research Council - Institute for Syst |
d'Angelo, Massimiliano | Università Mercatorum |
De Gaetano, Andrea | CNR |
Germani, Alfredo | Universita' Dell'Aquila |
Palombo, Giovanni | IASI-CNR |
Panunzi, Simona | Consiglio Nazionale Delle Ricerche |
Keywords: Stochastic optimal control, Data driven control, Kalman filtering
Abstract: Control theory has made significant progress over the years, yet the challenge of selecting an accurate model for controller design remains a critical issue. Even small mismatches between the assumed model and true system dynamics can lead to performance degradation or instability, particularly in Linear Quadratic Gaussian (LQG) control, which relies on precise noise and system modeling. While data-driven approaches have been explored to mitigate model inaccuracies, conventional LQG methods still struggle in non-Gaussian noise environments. To address this limitation, we propose a novel hybrid framework that integrates polynomial-based methods with a data-based Markov parameter approach. By leveraging higher-order statistical information, our method improves the control performance under non-Gaussian noise, extending the applicability of LQG control in real-world scenarios. Simulations highlight the potential of the proposed approach.
|
|
14:45-15:00, Paper ThB15.4 | |
Robustness of Optimal Controlled Diffusions with Near-Brownian Noise to Brownian Idealization Via Rough Paths Theory |
|
Pradhan, Somnath | Indian Institute of Science Education and Research Bhopal |
Yuksel, Serdar | Queen's University |
Selk, Zachary | Queen's University |
Keywords: Stochastic optimal control, Stochastic systems, Robust control
Abstract: In this article we present a robustness theorem for controlled stochastic differential equations driven by approximations of Brownian motion, where the approximations are those that converge to the Brownian under the rough paths topology along sample paths. These approximations include the Wong-Zakai, Karhunen-Lo`eve, mollified Brownian and fractional Brownian processes, which can be interpreted to be more physical than the Brownian idealization of the driving noise process. We establish robustness using rough paths theory. To this end, in particular, we show that within the class of Lipschitz continuous control policies, an optimal solution for the Brownian idealized model is near optimal for a true system driven by a non-Brownian (but near-Brownian) noise.
|
|
15:00-15:15, Paper ThB15.5 | |
InterQ: A DQN Framework for Optimal Intermittent Control |
|
Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Maity, Dipankar | University of North Carolina at Charlotte |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Control over communications, Stochastic optimal control, Reinforcement learning
Abstract: In this paper, we explore the communication-control co-design of discrete-time stochastic linear systems through reinforcement learning. Specifically, we examine a closed-loop system involving two sequential decision-makers: a scheduler and a controller. The scheduler continuously monitors the system’s state but transmits it to the controller intermittently to balance the communication cost and control performance. The controller, in turn, determines the control input based on the intermittently received information. Given the partially nested information structure, we show that the optimal control policy follows a certainty-equivalence form. Subsequently, we analyze the qualitative behavior of the scheduling policy. To develop the optimal scheduling policy, we propose InterQ, a deep reinforcement learning algorithm which uses a deep neural network to approximate the Q–function. Through extensive numerical evaluations, we analyze the scheduling landscape and further compare our approach against two baseline strategies: (a) a multi-period periodic scheduling policy, and (b) an event-triggered policy. The results demonstrate that our proposed method outperforms both baselines.
|
|
15:15-15:30, Paper ThB15.6 | |
Optimal Information Design for Incentivizing Strategies in Dynamic Systems |
|
Sun, Renyan | University of Southern California |
Nayyar, Ashutosh | University of Southern California |
Keywords: Stochastic optimal control, Game theory, Markov processes
Abstract: We study a finite-horizon discrete-time dynamic system that is jointly controlled by a designer and an agent. The designer can influence the agent's behavior by selectively revealing some information to the agent. Specifically, at each time step, the designer sends a message to the agent based on its private information. The agent uses the received message (and its own information) to choose its action. Both the actions of the designer and the agent influence the evolution of the underlying dynamic system and their reward structures. We are interested in the setting where the designer would like to send messages in a way that incentivizes the agent to play a specific strategy. Under certain assumptions on the information structure of the designer and the agent, we provide an algorithm to find an optimal messaging strategy for the designer that incentivizes the agent to play the desired strategy. Our algorithm requires solving a family of linear programs in a backward inductive manner.
|
|
15:30-15:45, Paper ThB15.7 | |
Trajectory Optimization of Stochastic Systems under Chance Constraints Via Set Erosion |
|
Liu, Zishun | Georgia Institute of Technology |
Ma, Liqian | Georgia Institute of Technology |
Chen, Yongxin | Georgia Institute of Technology |
Keywords: Stochastic systems, Stochastic optimal control, Formal Verification/Synthesis
Abstract: We study the trajectory optimization problem under chance constraints for continuous-time stochastic systems. To address chance constraints imposed on the entire stochastic trajectory, we propose a framework based on the set erosion strategy, which converts the chance constraints into safety constraints on an eroded subset of the safe set along the corresponding deterministic trajectory. The depth of erosion is captured by the probabilistic bound on the distance between the stochastic trajectory and its deterministic counterpart, for which we utilize a novel and sharp probabilistic bound developed recently. By adopting this framework, a deterministic control input sequence can be obtained, whose feasibility and performance are demonstrated through theoretical analysis. Our framework is compatible with various deterministic optimal control techniques, offering great flexibility and computational efficiency in a wide range of scenarios. To the best of our knowledge, our method provides the first scalable trajectory optimization scheme for high-dimensional stochastic systems under trajectory level chance constraints. We validate the proposed method through two numerical experiments.
|
|
15:45-16:00, Paper ThB15.8 | |
Integrating Sequential Hypothesis Testing into Adversarial Games: A Sun Zi-Inspired Framework |
|
Zhou, Haosheng | University of California, Santa Barbara |
Ralston, Daniel | University of California, Santa Barbara |
Yang, Xu | University of California, Santa Barbara |
Hu, Ruimeng | University of California, Santa Barbara |
Keywords: Stochastic optimal control, Game theory
Abstract: This paper investigates the interplay between sequential hypothesis testing (SHT) and adversarial decision-making in partially observable games, focusing on the deceptive strategies of red and blue teams. Inspired by Sun Zi's The Art of War and its emphasis on deception, we develop a novel framework to both deceive adversaries and counter their deceptive tactics. We model this interaction as a Stackelberg game where the blue team, as the follower, optimizes its controls to achieve its goals while misleading the red team into forming incorrect beliefs on its intentions. The red team, as the leader, strategically constructs and instills false beliefs through the blue team's envisioned SHT to manipulate the blue team’s behavior and reveal its true objectives. The blue team’s optimization problem balances the fulfillment of its primary objectives and the level of misdirection, while the red team coaxes the blue team into behaving consistently with its actual intentions. We derive a semi-explicit solution for the blue team’s control problem within a linear-quadratic framework, and illustrate how the red team leverages leaked information from the blue team to counteract deception. Numerical experiments validate the model, showcasing the effectiveness of deception-driven strategies in adversarial systems. These findings integrate ancient strategic insights with modern control and game theory, providing a foundation for further exploration in adversarial decision-making, such as cybersecurity, autonomous systems, and financial markets.
|
|
ThB16 |
Capri III |
Predictive Control for Nonlinear Systems I |
Regular Session |
Chair: Patrinos, Panagiotis | KU Leuven |
Co-Chair: Allgöwer, Frank | University of Stuttgart |
|
14:00-14:15, Paper ThB16.1 | |
Probabilistic Reachable Set Estimation for Saturated Systems with Unbounded Additive Disturbances |
|
Karam, Carlo | Institut Polytechnique De Grenoble, GIPSA-Lab |
Tacchi, Matteo | Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-Lab |
Fiacchini, Mirko | CNRS, Univ. Grenoble Alpes |
Keywords: Predictive control for nonlinear systems, Stochastic optimal control, LMIs
Abstract: In this paper, we present an analytical approach for the synthesis of ellipsoidal probabilistic reachable sets of saturated systems subject to unbounded additive noise. Using convex optimization methods, we compute a contraction factor of the saturated error dynamics that allows us to tightly bound its evolution and therefore construct accurate reachable sets. The proposed approach is applicable to independent, zero mean disturbances with a known covariance. A numerical example illustrates the applicability and effectiveness of the proposed design.
|
|
14:15-14:30, Paper ThB16.2 | |
Constrained Path Following Control of AUVs with Model Predictive Guidance: A Periodic Dynamic Event-Triggered Approach |
|
Min, Boxu | Northwestern Polytechnical University |
Gao, Jian | Northwestern Polytechnical University |
Jing, Anyan | Hunan University |
Chen, Yimin | Northwestern Polytechnical University |
Keywords: Maritime control, Predictive control for nonlinear systems, Hybrid systems
Abstract: This paper presents a Lyapunov-based model predictive guidance (LBMPG) approach for constrained path-following control of underactuated autonomous underwater vehicles (AUVs). The proposed method generates optimal guidance signals, subject to velocity and heading error constraints, using an online Lyapunov-based model predictive control (MPC) framework with a stability-enforcing contractive constraint. To reduce the computational burden of frequent optimization, a novel periodic dynamic event-triggered mechanism (PDETM) is introduced, which activates optimization based on a triggering condition tied to prediction accuracy and stability preservation. The resulting closed-loop system, characterized by mixed continuous and discrete dynamics, is analyzed within a hybrid system framework. Sufficient conditions are derived to ensure stability by jointly constraining the event-detection period, triggering functions, and related parameters. Simulations validate that the approach has effective path-following performance with significantly reduced computational demands, aligning with practical AUV requirements.
|
|
14:30-14:45, Paper ThB16.3 | |
On Model Predictive Funnel Control with Equilibrium Endpoint Constraints |
|
Göbel, Jens | Fraunhofer Institute for Industrial Mathematics ITWM |
Dennstädt, Dario | Universität Paderborn |
Lanza, Lukas | Technische Universität Ilmenau |
Worthmann, Karl | Technische Universität Ilmenau |
Berger, Thomas | Universität Paderborn |
Damm, Tobias | University of Kaiserslautern |
Keywords: Predictive control for nonlinear systems, Adaptive control, Nonlinear output feedback
Abstract: We propose model predictive funnel control, a novel model predictive control (MPC) scheme building upon recent results in funnel control. The latter is a high-gain feedback methodology that achieves evolution of the measured output within predefined error margins. The proposed method dynamically optimizes a parameter-dependent error boundary in a receding-horizon manner, thereby combining prescribed error guarantees from funnel control with the predictive advantages of MPC. This approach promises faster optimization times due to a reduced number of decision variables, whose number does not depend on the horizon length, as well as improved robustness due to a continuous feedback law to deal with the inter-sampling behavior. In this paper, we focus on proving stability by leveraging results from MPC stability theory with terminal equality constraints. Moreover, we rigorously show initial and recursive feasibility.
|
|
14:45-15:00, Paper ThB16.4 | |
Transient Performance of MPC for Tracking without Terminal Constraints |
|
Ehmann, Nadine | University of Stuttgart |
Koehler, Matthias | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Predictive control for nonlinear systems
Abstract: Model predictive control (MPC) for tracking is a recently introduced approach, which extends standard MPC formulations by incorporating an artificial reference as an additional optimization variable, in order to track external and potentially time-varying references. In this work, we analyze the performance of such an MPC for tracking scheme without a terminal cost and terminal constraints. We derive a transient performance estimate, i.e. a bound on the closed-loop performance over an arbitrary time interval, yielding insights on how to select the scheme's parameters for performance. Furthermore, we show that in the asymptotic case, where the prediction horizon and observed time interval tend to infinity, the closed-loop solution of MPC for tracking recovers the infinite horizon optimal solution.
|
|
15:00-15:15, Paper ThB16.5 | |
Risk-Sensitive Model Predictive Control for Interaction-Aware Planning---A Sequential Convexification Algorithm |
|
Wang, Renzi | KU Leuven |
Schuurmans, Mathijs | KU Leuven |
Patrinos, Panagiotis | KU Leuven |
Keywords: Predictive control for nonlinear systems, Optimization algorithms, Stochastic optimal control
Abstract: This paper considers risk-sensitive model predictive control for stochastic systems with a decision-dependent distribution. This class of systems is commonly found in human-robot interaction scenarios. We derive computationally tractable convex upper bounds to both the objective function, and to frequently used penalty terms for collision avoidance, allowing us to efficiently solve the generally nonconvex optimal control problem as a sequence of convex problems. Simulations of a robot navigating a corridor demonstrate the effectiveness and the computational advantage of the proposed approach.
|
|
15:15-15:30, Paper ThB16.6 | |
A Hierarchical Surrogate Model for Efficient Multi-Task Parameter Learning in Closed-Loop Control |
|
Hirt, Sebastian | TU Darmstadt |
Theiner, Lukas | TU Darmstadt |
Pfefferkorn, Maik | Technical University of Darmstadt |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for nonlinear systems, Learning, Statistical learning
Abstract: Many control problems require repeated tuning and adaptation of controllers across distinct closed-loop tasks, where data efficiency and adaptability are critical. We propose a hierarchical Bayesian optimization framework that is tailored to efficient controller parameter learning in sequential decision-making and control scenarios for distinct tasks. Instead of treating the closed-loop cost as a black-box, our method exploits structural knowledge of the underlying problem, consisting of a dynamical system, a control law, and an associated closed-loop cost function. We construct a hierarchical surrogate model using Gaussian processes that capture the closed-loop state evolution under different parameterizations, while the task-specific weighting and accumulation into the closed-loop cost are computed exactly via known closed-form expressions. This allows knowledge transfer and enhanced data efficiency between different closed-loop tasks. The proposed framework retains sublinear regret guarantees on par with standard black-box Bayesian optimization, while enabling multi-task or transfer learning. Simulation experiments with model predictive control demonstrate substantial benefits in both sample efficiency and adaptability when compared to purely black-box Bayesian optimization approaches.
|
|
15:30-15:45, Paper ThB16.7 | |
An Economic Nonlinear Model Predictive Control Approach for Mitigating Epidemic Spreading on Networks |
|
Calogero, Lorenzo | Politecnico Di Torino |
Pagone, Michele | Politecnico Di Torino |
Zino, Lorenzo | Politecnico Di Torino |
Rizzo, Alessandro | Politecnico Di Torino |
Keywords: Control applications, Predictive control for nonlinear systems, Control of networks
Abstract: We consider a discrete-time susceptible-infected-susceptible epidemic model on a network, in which we incorporate two control actions: vaccination of part of the population and implementation of non-pharmaceutical interventions. Then, we formulate the problem of devising an optimal control strategy for the epidemic disease using the two actions, with a tradeoff between public healthcare impact of the disease and social and economic costs associated with interventions. The control problem is solved by leveraging an economic nonlinear model predictive control scheme, for which the closed-loop stability holds from dissipativity arguments. Finally, we demonstrate our approach on a case study with realistic data.
|
|
15:45-16:00, Paper ThB16.8 | |
Distributed MPC for Dynamic Cooperation without Terminal Constraints |
|
Koehler, Matthias | University of Stuttgart |
Müller, Matthias A. | Leibniz University Hannover |
Allgöwer, Frank | University of Stuttgart |
Keywords: Predictive control for nonlinear systems, Cooperative control
Abstract: We propose a distributed model predictive control framework without terminal costs or constraints for cooperative control of multi-agent systems comprising heterogeneous, nonlinear agents subject to individual and coupling constraints. The cooperative task is encoded by an energy-like objective function, such that minimising this function leads towards its fulfilment. The agents trade off tracking an artificial reference against the need for that reference to minimise the cooperative task's objective function. The scheme guarantees asymptotic fulfilment of the cooperative task. Importantly, the eventual solution is not specified a priori, but emerges from the agents' decisions.
|
|
ThB17 |
Capri IV |
Stability of Nonlinear Systems I |
Regular Session |
Chair: Deaecto, Grace S. | FEM/UNICAMP |
Co-Chair: Labbadi, Moussa | Aix-Marseille University |
|
14:00-14:15, Paper ThB17.1 | |
Control Design for Reducing Vulnerability of Nonlinear Systems to Large Disturbances Using Modes of Instability |
|
Wang, Jinghan | University of Waterloo |
Fisher, Michael W | University of Waterloo |
Keywords: Nonlinear systems, Stability of nonlinear systems, Robust control
Abstract: Engineered systems naturally experience large disturbances which have the potential to disrupt operational stability, especially if the system fails to recover to a desired stable equilibrium point. It is valuable to design controllers capable of mitigating these disturbances to enhance system resilience and ensure reliable performance. Consider a particular finite-time disturbance applied to a nonlinear system which possesses a stable equilibrium point representing desired behavior. The system is able to recover from the disturbance if its post-disturbance initial condition lies inside the region of attraction of the desired stable equilibrium point. In cases where the system fails to recover, the nonlinear mode of instability for the disturbance represents the subset of system dynamics most responsible for this failure to recover. In prior work, the mode of instability has been defined to be the unstable eigenvector of the Jacobian at a particular unstable equilibrium point on the region of attraction boundary that plays a key role in the failure to recover from the disturbance, and points outwards from the region of attraction boundary. Efficient methods were developed to numerically compute the mode of instability. This paper develops a novel approach to control design for reducing disturbance vulnerability of nonlinear systems using knowledge of the mode of instability. The main idea is to tune controller parameter values so as to drive the post-disturbance initial condition further inwards away from the region of attraction boundary by driving it in the direction opposite to the mode of instability. To achieve this, the problem is formulated as a nonconvex optimization problem, and an efficient algorithm is developed to solve it. Local convergence guarantees are provided for this method. Numerical examples illustrate the successful application of the method to reduce the vulnerability of power systems subject to temporary short circuits.
|
|
14:15-14:30, Paper ThB17.2 | |
Regional Stability Analysis of Discrete-Time Piecewise Affine Systems |
|
Cabral, Leonardo | Universidade Federal Do Rio Grande Do Sul |
Valmorbida, Giorgio | L2S, CentraleSupelec |
Gomes da Silva Jr, Joao Manoel | Universidade Federal Do Rio Grande Do Sul |
Keywords: Stability of nonlinear systems, Lyapunov methods, LMIs
Abstract: This paper studies the regional stability of discrete-time Piecewise Affine (PWA) systems. The proposed method for the stability analysis uses an implicit representation of PWA systems based on ramp functions, and it builds upon Linear Matrix Inequalities to verify the nonnegativity of Piecewise Quadratic (PWQ) functions in a given set. Verifying the nonnegativity of PWQ functions allows us to solve Lyapunov inequalities yielding a PWQ Lyapunov function of which a level set gives an estimate of the Region of Attraction of the Origin (RAO). Numerical results illustrate the effectiveness of the proposed method.
|
|
14:30-14:45, Paper ThB17.3 | |
On Hyperexponential Stabilization of a Class of Nonlinear Systems |
|
Labbadi, Moussa | Aix-Marseille University |
Efimov, Denis | Inria |
Keywords: Stability of nonlinear systems, Time-varying systems, Numerical algorithms
Abstract: This article studies the problem of hyperexponential stabilization for a class of nonlinear systems subject to perturbations, where the nonlinear functions are unknown but satisfy a linear growth condition. Both continuous-time and discrete-time control designs are proposed. In the continuous-time setting, we employ parametric Lyapunov equations incorporating a time-varying function that diverges to infinity as ( t to infty ). A linear time-varying controller is designed, and the closed-loop system stability is established using Lyapunov quadratic functions. In the discrete-time setting, we utilize a semi-implicit Euler method that preserves the properties of the continuous-time dynamics.
|
|
14:45-15:00, Paper ThB17.4 | |
Stability of Switched Nonlinear Systems under Persistent Dwell-Time Constraints |
|
He, Liting | Imperial College London |
Deaecto, Grace S. | FEM/UNICAMP |
Keywords: Switched systems, Nonlinear systems, Stability of nonlinear systems
Abstract: This paper studies the stability of discrete-time switched nonlinear systems subject to persistent dwell-time (PDT) constraints, which are modeled by using a recent concept called dictionary, where a suitable combination of index sequences is able to represent precisely any PDT switching function. These constraints are characterized by non-uniform time-dependent switching functions that alternate time intervals subject to dwell-time constraints and arbitrary switching. The proposed stability conditions are based on a time-varying Lyapunov function that ensures both asymptotic stability of the zero equilibrium and a guaranteed performance cost for the overall system. Global and local stability are considered and a two-step procedure is proposed to estimate a region of attraction in the second case. Academic examples are used to validate the theoretical results and compare them with the existing literature.
|
|
15:00-15:15, Paper ThB17.5 | |
Controller Design for Bilinear Neural Feedback Loops |
|
Shah, Dhruv | University of California, San Diego |
Cortes, Jorge | UC San Diego |
Keywords: LMIs, Neural networks, Stability of nonlinear systems
Abstract: This paper considers a class of bilinear systems with a neural network in the loop. These arise naturally when employing machine learning techniques to approximate general, non-affine in the input, control systems. We propose a controller design framework that combines linear fractional representations and tools from linear parameter varying control to guarantee local exponential stability of a desired equilibrium. The controller is obtained from the solution of linear matrix inequalities, which can be solved offline, making the approach suitable for online applications. The proposed methodology offers tools for stability and robustness analysis of deep neural networks interconnected with dynamical systems.
|
|
15:15-15:30, Paper ThB17.6 | |
Nonlinear Static Output Feedback Design for Polynomial Systems with Non-Symmetric Input Saturation Bounds |
|
Madeira, Diego de S. | Federal University of Ceará (UFC) |
Napoleão Silva, João Gabriel | Federal University of Ceará |
Machado, Gabriel Freitas | The University of Sheffield |
Papachristodoulou, Antonis | University of Oxford |
Keywords: Nonlinear output feedback, Stability of nonlinear systems, Lyapunov methods
Abstract: We apply dissipativity theory to the design of nonlinear static output feedback controllers for polynomial systems whose input signals are subject to possibly non-symmetric saturation bounds. In a new approach that extends recent results in literature, a two-step procedure based on sum-of-squares programming is employed for solving this problem, which had not been addressed thus far in literature. A numerical example demonstrates the applicability of the strategy.
|
|
15:30-15:45, Paper ThB17.7 | |
Detecting Destabilizing Nonlinearities in Absolute Stability Analysis with Static O'Shea-Zames-Falb Multipliers |
|
Gyotoku, Hibiki | Kyushu University |
Yuno, Tsuyoshi | Kyushu Univ |
Ebihara, Yoshio | Kyushu University |
Peaucelle, Dimitri | LAAS-CNRS, Université De Toulouse |
Tarbouriech, Sophie | LAAS-CNRS |
Keywords: LMIs, Stability of nonlinear systems, Robust control
Abstract: This work addresses the absolute stability analysis of feedback systems incorporating slope-restricted and repeated nonlinearities. Under the integral quadratic constraint (IQC) framework, one can employ static O'Shea-Zames-Falb (OZF) multipliers to obtain an LMI-based criterion that ensures absolute stability. However, this criterion serves only as a sufficient condition. In other words, if the ``primal" LMI is infeasible, no definitive conclusion can be drawn about the system's absolute stability. To overcome this limitation, we consider the corresponding dual LMI, which becomes feasible if and only if the primal LMI is infeasible. Our main result establishes that if the dual solution satisfies a certain rank condition, it is possible to construct both a destabilizing slope-restricted nonlinearity and a non-zero equilibrium state, thereby proving that the system of interest is not absolutely stable. Numerical examples are provided to illustrate the effectiveness of these findings.
|
|
15:45-16:00, Paper ThB17.8 | |
Runtime Assurance with Stability Regulation |
|
Kurtoglu, Deniz | University of South Florida |
Yucelen, Tansel | University of South Florida |
Tran, Dzung | AFRL |
Casbeer, David W. | Air Force Research Laboratory |
Garcia, Eloy | Air Force Research Laboratory |
Keywords: Lyapunov methods, Human-in-the-loop control, Reinforcement learning
Abstract: This paper introduces a novel runtime assurance approach that departs from existing methods, which typically monitor system trajectories and switch from complex or nondeterministic control laws to safe ones when necessary. In particular, a general runtime assurance with stability regulation framework is proposed, in which a complex or nondeterministic control law is permitted to drive the dynamical system when a predefined stability metric is satisfied; otherwise, control is transferred to a safe law that inherently guarantees this metric. The key feature of the proposed framework is its flexibility in selecting the stability metric, which can be based on Lyapunov stability, asymptotic stability, exponential stability, or boundedness of system trajectories. Two illustrative numerical examples are further provided to demonstrate the efficacy of the proposed framework.
|
|
ThB18 |
Aruba I+II+III |
Predictive Control for Linear Systems |
Regular Session |
Chair: Kerrigan, Eric C. | Imperial College London |
Co-Chair: Findeisen, Rolf | TU Darmstadt |
|
14:00-14:15, Paper ThB18.1 | |
Reducing Conservatism in Robust Data-Driven MPC Via the S-Variable Method and Time-Varying Lyapunov Functions |
|
Nguyen, Hoang Hai | TU Darmstadt |
Gramlich, Dennis | RWTH Aachen |
Ebenbauer, Christian | RWTH Aachen University |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for linear systems, Robust control, Optimization
Abstract: Predictive control relies on a system model to forecast future behavior. In scenarios where a nominal model is unavailable, data-driven model predictive control techniques can compute control inputs directly from past measured trajectories. When the system is subject to noise and disturbances, the collected data becomes corrupted, potentially degrading control performance. By integrating robust control techniques with data-driven MPC, it is possible to ensure stability and robust constraint satisfaction in the presence of such uncertainty. Recent methods based on Linear Matrix Inequalities (LMIs) have shown promise in this direction, but often lead to conservative solutions due to structural limitations in the formulation. In this work, we propose a less conservative data-driven MPC scheme by incorporating a sequence of time-varying Lyapunov functions together with the S-variable approach. This enables a relaxation of the LMI conditions, decouples the controller design from the Lyapunov matrix, and improves feasibility and performance. We show that the resulting controller guarantees asymptotic stability and robust constraint satisfaction of the closed-loop system. A numerical example illustrates the effectiveness of the proposed approach compared to existing methods.
|
|
14:15-14:30, Paper ThB18.2 | |
Robust Output Feedback MPC for Constrained Linear Systems Based on Zonotopic Kalman Filter |
|
Zhang, Jingyu | Dalian University of Technology |
Tang, Wentao | Dalian University of Technology |
Wu, Yuhu | Dalian University of Technology |
Sun, Xi-Ming | Dalian University of Technology |
Keywords: Predictive control for linear systems, Observers for Linear systems, Uncertain systems
Abstract: This paper presents a robust output feedback model predictive control (MPC) method integrating a zonotopic Kalman filter (ZKF) for constrained linear systems under disturbances. Firstly, A ZKF is used to increase estimation accuracy and its optimal gain is obtained via solving a discrete-time algebraic Riccati equation. Secondly, a robust output MPC strategy is proposed, where an efficient constraint-handling strategy utilizing zonotope properties to replace linear programming (LP) with matrix operations, significantly reducing computational load. The approach ensures recursive feasibility and exponential stability while maintaining computational tractability. Finally, two simulation examples demonstrate the effectiveness and superiority of the proposed method.
|
|
14:30-14:45, Paper ThB18.3 | |
The Bidirectional Mapping between Linear Model Predictive Control Policies and ReLU Neural Networks |
|
Li, Xingchen | Tsinghua University |
You, Keyou | Tsinghua University |
Keywords: Predictive control for linear systems, Neural networks
Abstract: Recently, there has been a resurgence of interest in using neural networks (NNs) to approximate model predictive control (MPC) policies, as NNs show promising computational efficiency in practice. However, the relation between linear MPC (LMPC) and ReLU NNs has not been fully studied. This work establishes a textit{bidirectional mapping} between LMPC and ReLU NNs: any LMPC policy can be represented by a ReLU NN and, any ReLU NN can be represented by an LMPC policy. We analyze the computational complexity required for ReLU NNs to exactly represent LMPC policies, demonstrating that a depth of lceillog_2(n_x+1)rceil is sufficient while at least a depth of 2 is necessary, where n_x is the state dimension of the system. Conversely, through explicit constructive proofs, we show that LMPC policies can represent any continuous piecewise linear function, including ReLU NNs. A numerical example is provided to illustrate the construction of LMPC policies that can represent ReLU NNs.
|
|
14:45-15:00, Paper ThB18.4 | |
Bayesian Optimization-Based Tunable Explicit MPC on a Pocket-Sized Embedded Platform |
|
Peter, Bakaráč | Slovak University of Technology in Bratislava |
Pavlovičová, Erika | Slovak University of Technology in Bratislava |
Klauco, Martin | Czech Technical University |
Oravec, Juraj | Slovak University of Technology in Bratislava |
Keywords: Predictive control for linear systems, Embedded systems, Optimization algorithms
Abstract: The paper presents a pocket-sized embedded platform designed for the validation of advanced control methods. The platform is based on the ESP32-S3 microcontroller and is equipped with a miniaturized heat exchange device, making it suitable for temperature control experiments. The platform enables the implementation of a real-time tunable explicit Model Predictive Control (MPC) algorithm, which allows for online tuning of the weighting parameter in the MPC cost function. The paper also introduces a novel application of a method based on Bayesian optimization, which efficiently explores the parameter space to find an optimal performance metric value. The performance metric is a weighted overall parameter of the controller's performance, in which actuator power consumption and control signal fluctuation are evaluated. Experimental results demonstrate the effectiveness of the proposed approach, while the exploration and exploitation approaches have been tested. Both the explicit MPC controller and the Bayesian optimization method are implemented on the embedded platform, showcasing its capabilities for real-time control applications. The results highlight the potential of the platform for further research and development in advanced control strategies.
|
|
15:00-15:15, Paper ThB18.5 | |
Using Ramp Functions to Solve LP-Based MPC |
|
Hovd, Morten | Norwegian Univ of Sci & Tech |
Valmorbida, Giorgio | L2S, CentraleSupelec |
Keywords: Optimization, Predictive control for linear systems, Numerical algorithms
Abstract: A new solution strategy, utilizing ramp functions, is proposed for solving LP problems arising in Model Predictive Control. The solution strategy can take a general LP formulation, and avoids the increase in optimization variables resulting from converting the LP problem into standard form. Unlike the simplex method, there is no need to find an initial feasible solution. Numerical experiments demonstrate a significant increase in solution speed compared to the previously proposed ramp-based Simplex, and compares favorably to a state-of-the-art commercial solver like CPLEX for small to moderately sized problems.
|
|
15:15-15:30, Paper ThB18.6 | |
Exploiting Multistage Optimization Structure in Proximal Solvers |
|
Schwan, Roland | EPFL |
Kuhn, Daniel | EPFL |
Jones, Colin N. | EPFL |
Keywords: Optimization, Numerical algorithms, Predictive control for linear systems
Abstract: This paper presents an efficient structure-exploiting algorithm for multistage optimization problems. The proposed method extends existing approaches by supporting full coupling between stages and global decision variables in the cost, as well as equality and inequality constraints. The algorithm is implemented as a new backend in the PIQP solver and leverages a specialized block-tri-diagonal-arrow Cholesky factorization within a proximal interior-point framework to handle the underlying problem structure efficiently. The implementation features automatic structure detection and seamless integration with existing interfaces. Numerical experiments demonstrate significant performance improvements, achieving up to 13x speed-up compared to a generic sparse backend and matching/exceeding the performance of the state-of-the-art specialized solver HPIPM. The solver is particularly effective for applications such as model predictive control, robust scenario optimization, and periodic optimization problems.
|
|
15:30-15:45, Paper ThB18.7 | |
A Scenario-Based Approach for Stochastic Economic Model Predictive Control with an Expected Shortfall Constraint |
|
Arastou, Alireza | University of Melbourne |
Care', Algo | University of Brescia |
Campi, M. C. | University of Brescia |
Wang, Ye | The University of Melbourne |
Weyer, Erik | Univ. of Melbourne |
Keywords: Control applications, Predictive control for linear systems, Randomized algorithms
Abstract: This paper presents a novel approach to stochastic economic model predictive control (SEMPC) that minimizes average economic cost while satisfying an empirical expected shortfall (EES) constraint to manage risk. A new scenario-based problem formulation ensuring controlled risk with high confidence while minimizing the average cost is introduced. The probabilistic guarantees is dependent on the number of support elements over the entire input domain, which is difficult to find for high-dimensional systems. A heuristic algorithm is proposed to find the number of support elements. Finally, an efficient method is presented to reduce the computational complexity of the SEMPC problem with an EES constraint. The approach is validated on a water distribution network, showing its effectiveness in balancing performance and risk.
|
|
15:45-16:00, Paper ThB18.8 | |
A Robust Predictive Control Method for Pump Scheduling in Water Distribution Networks |
|
Urkmez, Mirhan | Aalborg University |
Kallesøe, Carsten Skovmose | Aalborg University |
Bendtsen, Jan Dimon | Aalborg University |
Kerrigan, Eric C. | Imperial College London |
Leth, John | Aalborg University |
Keywords: Control of networks, Control applications, Predictive control for linear systems
Abstract: This paper proposes a Robust Model Predictive Control (RMPC) method for energy-efficient and reliable pump scheduling in Water Distribution Networks (WDNs), accounting for model uncertainties and demand forecast errors. Building on a previous robust control approach, this extended method uses a linear model with bounded disturbances and optimizes affine disturbance-based pump schedules over a receding horizon. The optimization complexity is reduced from O(N^6) to O(N^3) via a sparse reformulation. When applied to the Randers WDN in Denmark, the method surpasses traditional MPC variants in meeting constraints while maintaining comparable economic performance.
|
|
ThB19 |
Ibiza IV |
Optimal Control V |
Regular Session |
Chair: Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Co-Chair: Zhan, Siyuan | Trinity College Dublin |
|
14:00-14:15, Paper ThB19.1 | |
Serial-Correlation-Driven Disturbance Utilization in Indefinite Linear Quadratic Optimal Control |
|
Zhan, Siyuan | Trinity College Dublin |
Bertolin, Ariádne de Lourdes Justi | University of Bath |
Keywords: Optimal control, Stochastic optimal control
Abstract: This paper investigates the discrete-time indefinite Linear Quadratic Optimal Control (iLQOC) problem subject to additive disturbances with time-correlation features. Such problems arise in applications such as marine renewable energy systems, where disturbances (e.g., wave excitations) can enhance performance if strategically utilised. This novel application paradigm challenges conventional approaches to disturbed linear optimal control problems, which typically treat disturbances as unknown or purely stochastic and to be attenuated. However, these disturbance-attenuation approaches cannot leverage the beneficial impact of disturbances in this type of ‘goal-oriented’ control application. In this paper, we revisit the control problem from a non-causal perspective. The problem is initially formulated as a deterministic non-causal optimal control problem, assuming that future information on the disturbance is available. This results in a linear optimal control policy, consisting of a causal state feedback part and an acausal feedforward of future disturbances. Then, following a control-theoretic approach, we reveal the connection between the optimal control performance index and the optimal disturbance preview problem. This leads to the second part, which involves the development of a theoretically optimal predictor within the framework of a linear multivariate Gaussian distribution. The numerical examples provided demonstrate that by strategically leveraging disturbances rather than simply mitigating them, this method offers a novel perspective on sustainable control strategies, enhancing both performance and resilience in evolving operational environments.
|
|
14:15-14:30, Paper ThB19.2 | |
Exact Time-Varying Turnpikes for Dynamic Operation of District Heating Networks |
|
Rose, Max | Fraunhofer IEG |
Gernandt, Hannes | University of Wuppertal |
Faulwasser, Timm | Hamburg University of Technology |
Schiffer, Johannes | Brandenburg University of Technology |
Keywords: Optimal control, Predictive control for linear systems, Energy systems
Abstract: District heating networks (DHNs) are crucial for decarbonizing the heating sector. Yet, their efficient and reliable operation is complex and requires the coordination of multiple heat producers. Model predictive control (MPC) is commonly used to address this task, but existing stability analyses have overlooked the network's time-varying properties. Since the turnpike phenomenon can serve as a basis for MPC analysis, in this paper we examine its role in DHN optimization by analyzing the underlying optimal control problem with time-varying prices and demands. That is, we derive conditions for the existence of a unique time-varying singular arc and provide its closed-form expression. Additionally, we extend the link between dissipativity and the turnpike property to the exact time-varying case. A numerical example illustrates our findings.
|
|
14:30-14:45, Paper ThB19.3 | |
Data-Driven Distributionally Robust Control Based on Sinkhorn Ambiguity Sets |
|
Cescon, Riccardo | Ecole Polytechnique Fédérale De Lausanne |
Martin, Andrea | KTH Royal Institute of Technology |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Optimal control, Stochastic systems, Robust control
Abstract: As the complexity of modern control systems increases, it becomes challenging to derive an accurate model of the uncertainty that affects their dynamics. Wasserstein Distributionally Robust Optimization (DRO) provides a powerful framework for decision-making under distributional uncertainty only using noise samples. However, while the resulting policies inherit strong probabilistic guarantees when the number of samples is sufficiently high, their performance may significantly degrade when only a few data are available. Inspired by recent results from the machine learning community, we introduce an entropic regularization to penalize deviations from a given reference distribution and study data-driven DR control over Sinkhorn ambiguity sets. We show that for finite-horizon control problems, the optimal DR linear policy can be computed via convex programming. By analyzing the relation between the ambiguity set defined in terms of Wasserstein and Sinkhorn discrepancies, we reveal that, as the regularization parameter increases, this optimal policy interpolates between the solution of the Wasserstein DR problem and that of the stochastic problem under the reference distribution. We validate our theoretical findings and the effectiveness of our approach when only scarce data are available on a numerical example.
|
|
14:45-15:00, Paper ThB19.4 | |
Discounted LQR: Stabilizing (near-)optimal State-Feedback Laws |
|
de Brusse, Jonathan | University of Lorraine |
Daafouz, Jamal | Université De Lorraine, CRAN, CNRS |
Granzotto, Mathieu | University of Melbourne |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Nesic, Dragan | University of Melbourne |
Keywords: Optimal control, Stability of linear systems, LMIs
Abstract: We study deterministic, discrete linear time-invariant systems with infinite-horizon discounted quadratic cost. It is well-known that standard stabilizability and detectability properties are not enough in general to conclude stability properties for the system in closed-loop with the optimal controller when the discount factor is small. In this context, we first review some of the stability conditions based on the optimal value function found in the learning and control literature and highlight their conservatism. We then propose novel (necessary and) sufficient conditions, still based on the optimal value function, under which stability of the origin for the optimal closed-loop system is guaranteed. Afterwards, we focus on the scenario where the optimal feedback law is not stabilizing because of the discount factor and the goal is to design an alternative stabilizing near-optimal static state-feedback law. We present both linear matrix inequality-based conditions and a variant of policy iteration to construct such stabilizing near-optimal controllers. The methods are illustrated via numerical examples.
|
|
15:00-15:15, Paper ThB19.5 | |
A Differential Linear Matrix Inequality-Based Approach to the Worst-Timing-Type H_2 Control of Sampled-Data Systems |
|
Park, Hae Yeon | Pohang University of Science & Technology |
Kim, Junghoon | Pohang University of Science and Technology |
Choi, Hyung Tae | Chung-Ang University |
Kim, Jung Hoon | Pohang Univeristy of Science and Technology |
Keywords: LMIs, Sampled-data control, Optimal control
Abstract: This paper addresses the problem of the worst-timing-type~(WT-type) H_2 control of linear sampled-data systems through the differential linear matrix inequality (DLMI)-based approach. The WT-type H_2 norm is defined as the supremum of the L_2 norms of all the tau-dependent outputs for the impulse inputs occurring at the instant tau in the sampling interval [0,h). To tackle this problem, we first describe the linear sampled-data system by a hybrid linear system (HLS) and establish a performance analysis framework based on DLMIs and linear matrix inequalities (LMIs). The analysis framework is further extended to an optimal controller synthesis procedure for minimizing the WT-type H_2 norm. Finally, a numerical simulation is given to validate the theoretical results and demonstrate the effectiveness of the proposed approach.
|
|
15:15-15:30, Paper ThB19.6 | |
Optimal Control for Minimizing Inescapable Ellipsoids in Linear Periodically Time-Varying Systems under Bounded Disturbances |
|
Peregudin, Alexey | University of Sheffield |
Dogadin, Egor | ITMO University |
Keywords: Time-varying systems, Optimal control, Output regulation
Abstract: This paper addresses optimal controller design for periodic linear time-varying systems under unknown-but-bounded disturbances. We introduce differential Lyapunov-type equations to describe time-varying inescapable ellipsoids and define an integral-based measure of their size. To minimize this measure, we develop a differential Riccati equation-based approach that provides exact solutions for state-feedback, observer synthesis, and output-feedback control. A key component is a systematic procedure for determining the optimal time-varying parameter, reducing an infinite-dimensional optimization to a simple iterative process. A numerical example validates the method's effectiveness.
|
|
15:30-15:45, Paper ThB19.7 | |
A Discretization for Sampled-Data Controller Synthesis of Minimizing the L1-Induced Norm |
|
Kim, Junghoon | Pohang University of Science and Technology |
Kim, Jung Hoon | Pohang Univeristy of Science and Technology |
Hagiwara, Tomomichi | Kyoto Univ |
Keywords: Sampled-data control, Optimal control
Abstract: This article presents a novel approach for designing an optimal controller that minimizes the L_1-induced norm in sampled-data systems. First, a fast-lifting approach is integrated with the lifting technique, subdividing the sampling interval [0,h) into N equal parts. Next, an averaging operator and a holding operator are introduced to approximate the output and input signals as constant over each subinterval. This procedure yields an approximate discretization of the continuous-time plant, effectively converting the optimal control problem from the sampled-data domain into a discrete-time framework. The theoretical validity of this approximation is demonstrated by showing that the L_1-induced norm of the sampled-data system, which includes both the continuous-time plant and the discrete-time controller derived from the discretization, approaches the best attainable L_1-induced norm at a rate proportional to 1/N as the approximation parameter N increases.
|
|
15:45-16:00, Paper ThB19.8 | |
Optimal Control of Heat Pumps with Thermal Storage under Time-Of-Use Tariffs |
|
Fleming, James M. | Loughborough University |
Barbour, Edward | University of Birmingham |
Andrew, Urquhart | Loughborough University |
Keywords: Optimal control, Energy systems, Smart cities/houses
Abstract: For countries with cold winters, air and ground source heat pumps have the potential to heat buildings without the use of fossil fuels. However they increase peak electrical loads for the building, and if time-of-use electricity tariffs are used, potentially lead to high costs for the end user if heating is required at peak times. Motivated by recent work showing that thermal energy storage (TES) may provide useful energy flexibility to address these issues, we consider the problem of minimising operating cost for a heat pump equipped with TES. We apply Pontryagin's principle to analyse the problem and characterise its solutions before developing practically applicable controllers via costate estimation. Simulation based on real-world demand data from a UK house with historical data on a time-of-use electricity tariff indicates that this controller could achieve a 20.1% reduction in running cost over one year of operation. A globally optimal controller with perfect knowledge of the future would achieve a 22.8% reduction in the same scenario, as calculated by dynamic programming.
|
|
ThB20 |
Asia I+II+III+IV |
Autonomous Multi-Agent Systems in Transportation: Control, Learning, and
Optimization Methods |
Tutorial Session |
Chair: Cassandras, Christos G. | Boston University |
Co-Chair: Johansson, Karl H. | KTH Royal Institute of Technology |
Organizer: Cassandras, Christos G. | Boston University |
Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
Organizer: Malikopoulos, Andreas A. | Cornell University |
|
14:00-16:00, Paper ThB20.1 | |
Control, Learning, and Optimization Methods for Autonomous Multi-Agent Systems in Transportation (I) |
|
Cassandras, Christos G. | Boston University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Autonomous vehicles, Autonomous systems, Cooperative control
Abstract: Emerging mobility systems are an example of Cyber-Physical Systems (CPSs) in which multiple autonomous agents (vehicles) interact with each other as well as with the infrastructure resources (road side units, traffic lights, etc). Control-theoretic and optimization methods provide a rich framework for managing these complex mixed-traffic socioeconomic multi-agent systems. Given the complexity involved and the abundance of data now available, it is essential to integrate learning-based methods not only to design optimal controllers with safety guarantees, but to also gain an understanding of human driving behavior, as well as user preferences for the mobility options that intelligent transportation systems provide. The three objectives of this tutorial paper are: (1) Set the stage for emerging mobility systems consisting of both autonomous and human-driven vehicles in a mixed traffic environment by formulating basic optimal control problems for autonomous vehicles that seek to jointly optimize travel time, energy, and comfort while ensuring that safety constraints are always satisfied. (2) Present methods for solving the formulated problems using a combination of optimization techniques and Control Barrier Functions (CBFs) that provide safety guarantees, as well as state of the art learning-based methods to design effective controllers for mixed traffic transportation systems. (3) Address the societal issues accompanying emerging mobility systems, including new metrics that incorporate accessibility and fairness in a transportation network consisting of both autonomous and human-driven vehicles.
|
|
ThC01 |
Galapagos I |
Cell Population Dynamics |
Invited Session |
Chair: Palumbo, Pasquale | University of Milano-Bicocca |
Co-Chair: Singh, Abhyudai | University of Delaware |
Organizer: Bellato, Massimo | Università Di Padova |
Organizer: Borri, Alessandro | CNR-IASI |
Organizer: Palumbo, Pasquale | University of Milano-Bicocca |
Organizer: Singh, Abhyudai | University of Delaware |
|
16:30-16:45, Paper ThC01.1 | |
Estimating Eradication Time and Probability in Stochastic Tumour Models |
|
Borri, Alessandro | CNR-IASI |
Papa, Federico | IASI-CNR |
Palumbo, Pasquale | University of Milano-Bicocca |
Keywords: Stochastic systems, Biological systems, Markov processes
Abstract: Cancer is a leading global cause of death, and biomedical research has long investigated carcinogenesis mechanisms and therapeutic strategies, often leveraging mathematical models. Low-dimensional models facilitate the description of tumour growth and the optimization of drug administration. Recently, Chemical Reaction Networks (CRNs) have enabled the integration of stochastic noise into tumour dynamics, offering an alternative to classical deterministic models. One intrinsic limitation of the deterministic approach is that tumour eradication can only occur asymptotically, i.e., over an infinite time horizon. Conversely, when the number of tumour cells becomes relatively low, the stochastic approach provides a more accurate description of the system and allows for the quantification of the eradication probability. In this note, we extend and refine existing methods for estimating the eradication probability and mean eradication time in stochastic tumour models. We then apply the results to a model of interest, for which stochastic simulations numerically confirm and validate the theoretical results.
|
|
16:45-17:00, Paper ThC01.2 | |
Adaptive Observers for Calibrating Immune Digital Twins in Viral Infections (I) |
|
Rodríguez, Angel | Universidad Autónoma De Nuevo León |
Sereno, Juan E. | University of Idaho |
Quiroz, Griselda | Universidad Autónoma De Nuevo León |
Hernandez-Vargas, Esteban Abelardo | University of Idaho |
Keywords: Biological systems, Cellular dynamics, Systems biology
Abstract: Software duplicates, ``Digital twins", are promising tools to merge known immunological mechanisms with real-time patient-specific clinical data to develop predictive computer simulations of viral infection and immune response. During respiratory infections, host immune cells and viral dynamics are critical clinical markers that could help practitioners decide how to treat an infected patient. This paper describes the problem of estimating immune responses and model parameters, which is fundamental for developing immune digital twins. We derived observability and identifiability properties that were the basis of the proposed observation scheme. Different simulation scenarios show which parameters can be estimated during a respiratory infection
|
|
17:00-17:15, Paper ThC01.3 | |
Modeling Drug Tolerance Via Fluctuation Tests with Reversible Phenotypic Switching (I) |
|
Hlubinová, Anna | Comenius University |
Bokes, Pavol | Comenius University |
Singh, Abhyudai | University of Delaware |
Keywords: Cellular dynamics, Biological systems, Markov processes
Abstract: This paper examines a structurally symmetric fluctuation test experiment in which cell populations grow from a single cell to a set size before undergoing treatment. During growth, cells may acquire tolerance to treatment through probabilistic events, passed to progeny. Motivated by recent research on drug tolerance in microbial and cancer cells, the model also allows tolerant cells to revert to a sensitive state, reflecting dynamic phenotypic switching. The master equation governing the probability distribution of tolerant cells is solved via the generating function method and the quasi-powers approximation. Depending on model parameters, the distribution may be approximated by a stable distribution (or its special case, the normal distribution) or through large deviations theory. In the regime of frequent switching, the large deviations approach provides better agreement with numerical solutions, particularly at distribution tails. Conversely, in the regime of infrequent switching, the general stable distributions offer improved accuracy over the Landau distribution, which is a limiting distribution in case of unidirectional switching.
|
|
17:15-17:30, Paper ThC01.4 | |
A Hybrid Model for Tumor Growth and Dormancy with the Combination of Deterministic and Stochastic Dynamics (I) |
|
Drexler, Dániel András | Obuda University |
Füredi, András | Research Center for Natural Sciences |
Gombos, Balázs | Research Center for Natural Sciences |
Szakács, Gergely | Medical University of Vienna |
Kovács, Levente | Obuda University |
Keywords: Systems biology, Biomedical, Biological systems
Abstract: We present a hybrid tumor growth model that integrates deterministic and stochastic dynamics to capture remission and relapse driven by cellular dormancy. Ordinary differential equations describe large tumor populations, while a Gillespie-based simulation accounts for stochastic fluctuations at low cell numbers, with dynamic switching between regimes. Dormant cells are modeled as either pre-existing or therapy-induced, and their reactivation drives recurrence. The model is validated against experimental data from triple-negative breast cancer in mice treated with pegylated liposomal doxorubicin. Our results demonstrate that even small dormant cell populations critically influence relapse timing. The proposed framework balances biological interpretability with computational efficiency, providing a tractable basis for therapy optimization and control-oriented applications.
|
|
17:30-17:45, Paper ThC01.5 | |
The Role of Cheater Cells in Quorum Sensing Bacterial Cultures (I) |
|
Cimolato, Chiara | Università Di Padova - Dipartimento Di Ingegneria Dell'Informaiz |
Schenato, Luca | University of Padova |
Giordano, Giulia | University of Trento |
Bellato, Massimo | Università Di Padova |
Keywords: Biological systems, Cellular dynamics, Systems biology
Abstract: Cooperation is essential for the survival of microbial communities, as it enables access to resources and enhances fitness. Social cheating, where some individuals exploit the benefits of cooperation without contributing, can destabilize these communities. While traditionally seen as harmful, the frequent occurrence of cheaters in environmental bacterial populations suggests they may confer an evolutionary advantage. We investigate the role of cheater cells in bacterial populations governed by Quorum sensing (QS), which regulates the production of public goods, such as enzymes and protective biofilms, through cooperative behaviors. We model the dynamics of a bacterial population where cooperative cells can mutate into cheaters, exploiting QS-controlled public goods without incurring the associated metabolic cost. By leveraging time-scale separation and analyzing the system’s equilibria, we demonstrate the existence of a stable equilibrium where both cooperative and cheater cells can coexist. We also extend our model to competing bacterial populations and show that cheaters can transiently confer a selective advantage by accelerating niche colonization. Our findings confirm that cheaters may contribute to increasing population resilience, but only if they are a limited fraction of the entire population.
|
|
17:45-18:00, Paper ThC01.6 | |
Optimal Control for Cancer Chemotherapy Using Hybrid Quantum Particle Swarm Optimization |
|
Kidane, Bereket Sitotaw | The University of Texas at Arlington |
Motayed, Md Samiul Haque | The University of Texas at Arlington |
Wang, Shuo | University of Texas at Arlington |
Keywords: Optimal control, Biological systems, Nonlinear systems
Abstract: Optimal control in cancer chemotherapy is challenged by tumor heterogeneity and mutations, which complicate the effectiveness of treatment. Traditional methods, such as Pontryagin's maximum principle (PMP), are often hindered by their reliance on an initial guess for the costate equation, affecting accuracy and convergence. To address these limitations, this work introduces a hybrid Quantum Particle Swarm Optimization (QPSO) method based on regularization. QPSO is employed for global exploration to approximate the optimal control trajectory, followed by a regularization-based refinement to ensure smoothness and consistency with optimality conditions. The Hamiltonian function is used for first- and second-order optimality checks, verifying solution quality. Numerical case studies explore various drug effectiveness functions, demonstrating the role of periodic and localized drug delivery in achieving robust tumor suppression and providing insights into the impact of different drug combinations on optimal chemotherapy strategies.
|
|
18:00-18:15, Paper ThC01.7 | |
Nearly Optimal Chaotic Desynchronization of Neural Oscillators |
|
Moehlis, Jeff | University of California, Santa Barbara |
Zimet, Michael | University of California, Santa Barbara |
Rajabi, Faranak | University of California, Santa Barbara |
Keywords: Biological systems, Optimal control, Control applications
Abstract: Motivated by deep brain stimulation treatment of neural disorders such as Parkinson's disease, it has been proposed that desynchronization of neural oscillators can be achieved by maximizing the Lyapunov exponent of the phase difference between pairs of oscillators. Here we consider two approximations to optimal stimuli for chaotic desynchronization of neural oscillators. These approximations are based on the oscillators' phase response curve, and unlike previous approaches do not require numerical solution of a two-point boundary value problem. It is shown that these approximations can achieve nearly optimal desynchronization, and can be used with an event-based control scheme to desynchronize populations of noisy, coupled neurons.
|
|
18:15-18:30, Paper ThC01.8 | |
Feedback Control in Cellular Mechanoregulation: A Model of Caveolar Dynamics |
|
Kazemi, Mohammadreza | Florida International University |
Gal, Ciprian | Florida International University |
Baum, Taylor Elise | Massachusetts Institute of Technology |
Hutcheson, Joshua | Florida International University |
Keywords: Genetic regulatory systems, Cellular dynamics, Systems biology
Abstract: Cells live in a mechanically rich environment. Mechanical stimuli from outside the cell (e.g., shear stress due to uid ow, stretching due to forces from neighboring cells, etc.) have a profound effect on cells’ internal state. This is potentially because cells need to adapt to their mechanical environment to ensure the integrity of the plasma membrane (PM). As such, at the heart of the cellular mechanoregulation problem, there lies an optimal control problem: how cells can observe and control their mechanical environment. Previous experimental work has shown that cells can observe the stretching of the PM through specialized PM domains named caveolae (cave-like invaginations on the PM composed of two main proteins: Cavins and Caveolins). Caveolae play a dual role in cellular mechanoregulation. First, they can act as observers of PM’s mechanical state by attening in response to forces, triggering downstream signaling pathways. Second, they can act as actuators by buffering PM’s tension as a result of their attening in response to forces. Despite experimental evidence indicating implication of caveolae in cellular mechanoregula- tion, to the best of our knowledge, no control-theoretic analysis of caveolar dynamics in the context of mechanoregulation has been done. As such, we aim to model and analyze the role of feedback in caveolae-dependent mechanoregulation. We construct a novel model of caveolae-dependent signaling highlighting the role of feedback in mechanoregulation. We then demonstrate robustness properties of our model to stochastic perturbations and external mechanical stimuli. Further, we demonstrate through simulations how cells can adapt to their mechanical environment emphasizing the role of caveolae. We also provide an explanation for how our model can explain previous experimental results on the role of caveolins in mechanoregulatory signaling pathways. Overall, we present a novel model of cellular caveolae-dependent mechnoregulation and provide a detailed control-theoretic analysis of how our model can explain experimental data.
|
|
ThC02 |
Oceania II |
Safe, Secure and Learning-Based Control II |
Invited Session |
Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Co-Chair: Jha, Mayank Shekhar | University of Lorraine |
Organizer: Doan, Thinh T. | University of Texas at Austin |
Organizer: Jha, Mayank Shekhar | University of Lorraine |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
|
16:30-16:45, Paper ThC02.1 | |
Zero-Sum Turn Games Using Q-Learning: Finite Computation with Security Guarantees (I) |
|
Anderson, Sean | University of California Santa Barbara |
Darken, Christian | Naval Postgraduate School |
Hespanha, Joao P. | Univ. of California, Santa Barbara |
Keywords: Reinforcement learning, Game theory, Agents-based systems
Abstract: This paper addresses zero-sum ``turn'' games, in which only one player can make decisions at each state. We show that pure saddle-point state-feedback policies for turn games can be constructed from dynamic programming fixed-point equations for a single value function or Q-function. These fixed-points can be constructed using a suitable form of Q-learning. For discounted costs, convergence of this form of Q-learning can be established using classical techniques. For undiscounted costs, we provide a convergence result that applies to finite-time deterministic games, which we use to illustrate our results. For complex games, the Q-learning iteration must be terminated before exploring the full-state, which can lead to policies that cannot guarantee the security levels implied by the final Q-function. To mitigate this, we propose an ``opponent-informed'' exploration policy for selecting the Q-learning samples. This form of exploration can guarantee that the final Q-function provides security levels that hold, at least, against a given set of policies. A numerical demonstration for a multi-agent game, Atlatl, indicates the effectiveness of these methods.
|
|
16:45-17:00, Paper ThC02.2 | |
On-Policy Safe Reinforcement Learning under Input Saturation and State Constraints for Nonlinear Discrete Time Systems (I) |
|
Marthi, Satya Vinay Chavan | CRAN |
Jha, Mayank Shekhar | University of Lorraine |
Kanso, Soha | Université De Lorraine |
Ponsart, Jean-Christophe | Université De Lorraine |
Theilliol, Didier | Universite De Lorraine |
Keywords: Reinforcement learning, Neural networks, Lyapunov methods
Abstract: This paper proposes an on-policy Policy Iteration (PI) safe reinforcement learning (RL) framework for control of nonlinear discrete-time control-affine systems with input saturation constraints, ensuring safety throughout exploration and learning phases. This work emphasizes the importance of exploration noise while also addressing the challenges resulting from the conflict between control input saturation and exploration. Against the typically adopted off-policy PI based approaches for safe RL, our approach proposes on-policy PI algorithm under input saturation to iteratively learn the optimal control policy. To ensure safety during the exploration phase, where the high magnitude of exploration noise could compromise system safety, a so called fall-back policy is proposed and incorporated. This paper also develops novel theoretical guarantees on the convergence to optimality under safety restrictions and evaluates the effectiveness through simulation studies, comparing its performance against the traditional Quadratic Programming based Control Lyapunov and Control Barrier function (QP-CLF-CBF) approach.
|
|
17:00-17:15, Paper ThC02.3 | |
Control Theory-Informed Machine Learning Based Control and Performance Monitoring of Nonlinear Dynamic Systems (I) |
|
Cheng, Wei | University of Duisburg-Essen, Institute for Automatic Control An |
Liang, Ketian | University of Duisburg-Essen |
Cuturic, Danijel | University of Duisburg-Essen, Institute for Automatic Control An |
Li, Linlin | University of Science and Technology Beijing |
Louen, Chris | University of Duisburg-Essen |
Ding, Steven X. | University of Duisburg-Essen |
Keywords: Observers for nonlinear systems, Fault detection, Machine learning
Abstract: This paper explores control and performance monitoring of nonlinear dynamic systems with uncertainties and possible faults, focusing on scenarios where only partial model knowledge is available. We begin by developing optimal control and performance monitoring schemes from a nonlinear control theoretic framework, employing normalized stable kernel and image representation structures. These schemes form the basis for a unified design of the observer and feedback controller, which respectively serve as a state estimator/residual generator and a stabilizing control law. Moreover, performance indicators based on the corresponding Hamilton Jacobi Equations (HJEs), value functions, and residuals are extracted from the observer and controller structures. Then, we explore a data-driven implementation of the above framework that integrates control theory knowledge with machine learning, referred to as control theory-informed machine learning. Specifically, neural networks approximate the HJE solutions while preserving the lossless property, enabling a stable, data-driven realization of the proposed schemes. Finally, the proposed scheme is demonstrated through a case study.
|
|
17:15-17:30, Paper ThC02.4 | |
Semi-Supervised Safe Visuomotor Policy Synthesis Using Barrier Certificates |
|
Tayal, Manan | Indian Institute of Science, Bengaluru |
Singh, Aditya | Indian Institute of Technology, Patna |
Jagtap, Pushpak | Indian Institute of Science |
Nadubettu Yadukumar, Shishir | Indian Institute of Science |
Keywords: Data driven control, Vision-based control, Autonomous robots
Abstract: In modern robotics, addressing the lack of accurate state space information in real-world scenarios has led to a significant focus on utilizing visuomotor observation to provide safety assurances. Although supervised learning methods, such as imitation learning, have demonstrated potential in synthesizing control policies based on visuomotor observations, they require ground truth safety labels for the complete dataset and do not provide formal safety assurances. On the other hand, traditional control-theoretic methods like Control Barrier Functions (CBFs) and Hamilton-Jacobi (HJ) Reachability provide formal safety guarantees but depend on accurate knowledge of system dynamics, which is often unavailable for high-dimensional visuomotor data. To overcome these limitations, we propose a novel approach to synthesize a semi-supervised safe visuomotor policy using barrier certificates that integrate the strengths of model-free supervised learning and model-based control methods. This framework synthesizes a provably safe controller without requiring safety labels for the complete dataset and ensures completeness guarantees for both the barrier certificate and the policy. We validate our approach through distinct case studies: an inverted pendulum system and the obstacle avoidance of an autonomous mobile robot.
|
|
17:30-17:45, Paper ThC02.5 | |
Initial Distribution Sensitivity of Constrained Markov Decision Processes (I) |
|
Tercan, Alperen | University of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Uncertain systems, Constrained control, Reinforcement learning
Abstract: Constrained Markov Decision Processes (CMDPs) are notably more complex to solve than standard MDPs due to the absence of universally optimal policies across all initial state distributions. This necessitates re-solving the CMDP whenever the initial distribution changes. In this work, we analyze how the optimal value of CMDPs varies with different initial distributions, deriving bounds on these variations using duality analysis of CMDPs and perturbation analysis in linear programming. Moreover, we show how such bounds can be used to analyze the regret of a given policy due to unknown variations of the initial distribution.
|
|
17:45-18:00, Paper ThC02.6 | |
Wasserstein Distributionally Robust Adaptive Covariance Steering (I) |
|
Gahlawat, Aditya | University of Illinois at Urbana-Champaign |
Khatana, Vivek | University of Illinois at Urbana-Champaign |
Wang, Duo | NORTHEASTERN UNIVERSITY |
Karumanchi, Sambhu Harimanas | University of Illinois, Urbana-Champaign |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Voulgaris, Petros G. | Univ of Nevada, Reno |
Keywords: Adaptive control, Stochastic systems, Predictive control for nonlinear systems
Abstract: We present a methodology for predictable and safe covariance steering control of uncertain nonlinear stochastic processes.The systems under consideration are subject to general uncertainties, which include unbounded random disturbances (aleatoric uncertainties) and incomplete model knowledge (state-dependent epistemic uncertainties). These general uncertainties lead to temporally evolving state distributions that are entirely unknown, can have arbitrary shapes, and may diverge unquantifiably from expected behaviors, leading to unpredictable and unsafe behaviors. Our method relies on an L1-adaptive control architecture that ensures robust control of uncertain stochastic processes while providing Wasserstein metric certificates in the space of probability measures.We show how these distributional certificates can be incorporated into the high-level covariance control steering to guarantee safe control. Unlike existing distributionally robust planning and control methodologies, our approach avoids difficult-to-verify requirements like the availability of finite samples from the true underlying distribution or an a priori knowledge of time-varying ambiguity sets to which the state distributions are assumed to belong.
|
|
18:00-18:15, Paper ThC02.7 | |
A Guided Retraining Strategy for Safe ReLU Neural Network Controllers of Linear Systems |
|
Zago, João Gabriel | Federal University of Santa Catarina |
Camponogara, Eduardo | Federal University of Santa Catarina |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Iterative learning control, Machine learning, Predictive control for linear systems
Abstract: This paper considers a ReLU neural network approximation for an optimization-based controller of a discrete Linear Time Invariant (LTI) system. If trained in a standard way, a data-driven controller based on neural networks may violate safety constraints or require many training samples to avoid violation. We propose a novel sampling approach, based on a proper reachability analysis for ReLU networks, that results in an efficient retraining of the neural network and guarantees constraint satisfaction. We show the effectiveness of the proposed strategy by providing numerical results on a neural network trained to approximate a Model Predictive Controller (MPC). Compared to a random sampling, our approach achieves safety with a relatively small number of samples thanks to an ad-hoc resampling in suitable regions of the state space.
|
|
18:15-18:30, Paper ThC02.8 | |
Reachability Interval-Based Online Safe Optimal Control Using Reinforcement Learning |
|
Xue, Wenqian | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Optimal control, Reinforcement learning, Adaptive control
Abstract: This paper explores safe and optimal control for the trajectories of linear systems from an initial state set, addressing cases with imprecise initial conditions. To avoid relying on every precise state and reduce computational burden, we develop two novel reachability-based approaches integrating interval analysis and neural network-based off-policy reinforcement learning. An interval system encompassing all trajectories is first developed. A safe optimal controller is then designed using a cost function for stability and a control barrier function for safety. An online off-policy RL algorithm is developed to compute the control policy with guaranteed safety and stability of the interval system. The approach maintains the included trajectories within a safe region; however, our results highlight a trade-off between inclusion capability and controllability.
|
|
ThC03 |
Oceania III |
Safe Planning and Control with Uncertainty Quantification II |
Invited Session |
Chair: Lindemann, Lars | University of Southern California |
Co-Chair: Vertovec, Nikolaus | University of Oxford |
Organizer: Gao, Yulong | Imperial College London |
Organizer: Lindemann, Lars | ETH Zürich |
Organizer: Vertovec, Nikolaus | University of Oxford |
Organizer: Yu, Pian | University College London |
|
16:30-16:45, Paper ThC03.1 | |
Distributionally Robust Equilibria Over the Wasserstein Distance for Generalized Nash Game (I) |
|
Wen, Yixun | University College London |
Gao, Yulong | Imperial College London |
Chen, Boli | University College London |
Keywords: Game theory, Optimization algorithms, Uncertain systems
Abstract: Generalized Nash equilibrium problem (GNEP) is fundamental for practical applications where multiple self-interested agents work together to make optimal decisions. In this work, we study GNEP with shared distributionally robust chance constraints (DRCCs) for incorporating inevitable uncertainties. The DRCCs are defined over the Wasserstein ball, which can be explicitly characterized even with limited sample data. To determine the equilibrium of the GNEP, we propose an exact approach to transform the original computationally intractable problem into a deterministic formulation using the Nikaido-Isoda function. Specifically, we show that when all agents' objectives are quadratic in their respective variables, the equilibrium can be obtained by solving a typical mixed-integer nonlinear programming (MINLP) problem, where the integer and continuous variables are decoupled in both the objective function and the constraints. This structure significantly improves computational tractability, as demonstrated through a case study on the charging station pricing problem.
|
|
16:45-17:00, Paper ThC03.2 | |
Conformal Prediction in the Loop: Risk-Aware Control Barrier Functions for Stochastic Systems with Data-Driven State Estimators |
|
Zhang, Junhui | Nanjing University |
Hoxha, Bardh | Toyota Motor North America |
Fainekos, Georgios | Toyota NA-R&D |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Stochastic systems, Robust control, Estimation
Abstract: This paper proposes a sampled-data, measurement-robust, risk-aware control barrier function (RA-CBF) framework for stochastic systems with measurement uncertainty. In this framework, what is available for control design are measurements of the system states, which are subject to unknown noise. First, in order to estimate the system states from these measurements, an offline-trained neural network is employed as a state estimator. Next, to quantify the performance of the state estimator, the state space is discretized, and calibration datasets are sampled from the grid points. Conformal prediction is then implemented, providing the estimation error bound with user-defined probability. In addition, we leverage the estimation error bound into sampled data robust RA-CBF design, such that the probability that the state of the system enters the unsafe set during a finite time horizon is bounded by a desired threshold. Various case studies demonstrate the effectiveness of the proposed method.
|
|
17:00-17:15, Paper ThC03.3 | |
Risk-Aware Adaptive Control Barrier Functions for Safe Control of Nonlinear Systems under Stochastic Uncertainty (I) |
|
Liu, Shuo | Boston University |
Belta, Calin | University of Maryland |
Keywords: Constrained control, Lyapunov methods, Optimal control
Abstract: This paper addresses the challenge of ensuring safety in stochastic control systems with high-relative-degree constraints, while maintaining feasibility and mitigating conservatism in risk evaluation. Control Barrier Functions (CBFs) provide an effective framework for enforcing safety constraints in nonlinear systems. However, existing methods struggle with feasibility issues and multi-step uncertainties. To address these challenges, we introduce Risk-aware Adaptive CBFs (RACBFs), which integrate Discrete-time Auxiliary-Variable adaptive CBFs (DAVCBFs) with coherent risk measures. DAVCBFs introduce auxiliary variables to improve the feasibility of the optimal control problem, while RACBFs incorporate risk-aware formulations to balance safety and risk evaluation performance. By extending discrete-time high-order CBF constraints over multiple steps, RACBFs effectively handle multi-step uncertainties that propagate through the system dynamics. We demonstrate the effectiveness of our approach on a stochastic unicycle system, showing that RACBFs maintain safety and feasibility while reducing unnecessary conservatism compared to standard robust formulations of discrete-time CBF methods.
|
|
17:15-17:30, Paper ThC03.4 | |
Data-Driven Nonconvex Reachability Analysis Using Exact Multiplication (I) |
|
Zhang, Zhen | Technical University of Munich |
Niazi, M. Umar B. | Massachusetts Institute of Technology |
Chong, Michelle | Eindhoven University of Technology |
Johansson, Karl H. | KTH Royal Institute of Technology |
Alanwar, Amr | Technical University of Munich |
Keywords: Cyber-Physical Security
Abstract: This paper addresses a fundamental challenge in data-driven reachability analysis: accurately representing and propagating non-convex reachable sets. We propose a novel approach using constrained polynomial zonotopes to describe reachable sets for unknown LTI systems. Unlike constrained zonotopes commonly used in existing literature, constrained polynomial zonotopes are closed under multiplication with constrained matrix zonotopes. We leverage this property to develop an exact multiplication method that preserves the non-convex geometry of reachable sets without resorting to approximations. We demonstrate that our approach provides tighter over-approximations of reachable sets for LTI systems compared to conventional methods.
|
|
17:30-17:45, Paper ThC03.5 | |
Safety-Aware Reinforcement Learning for Control Via Risk-Sensitive Action-Value Iteration and Quantile Regression (I) |
|
Enwerem, Clinton | Department of Electrical & Computer Engineering and the Institut |
Puranic, Aniruddh | University of Maryland, College Park |
Baras, John S. | University of Maryland |
Belta, Calin | University of Maryland |
Keywords: Reinforcement learning, Estimation, Data driven control
Abstract: Mainstream approximate action-value iteration reinforcement learning (RL) algorithms suffer from overestimation bias, leading to suboptimal policies in high-variance stochastic environments. Quantile-based action-value iteration methods reduce this bias by learning a distribution of the expected cost-to-go using quantile regression. However, ensuring that the learned policy satisfies safety constraints remains a challenge when these constraints are not explicitly integrated into the RL framework. Existing methods often require complex neural architectures or manual tradeoffs due to combined cost functions. To address this, we propose a risk-regularized quantile-based algorithm integrating Conditional Value-at-Risk (CVaR) to enforce safety without complex architectures. We also provide theoretical guarantees on the contraction properties of the risk-sensitive distributional Bellman operator in Wasserstein space, ensuring convergence to a unique cost distribution. Simulations of a mobile robot in a dynamic reach-avoid task show that our approach leads to more goal successes, fewer collisions, and better safety-performance trade-offs than risk-neutral methods.
|
|
17:45-18:00, Paper ThC03.6 | |
Formal Uncertainty Propagation for Stochastic Dynamical Systems with Additive Noise (I) |
|
Adams, Steven | TU Delft |
Figueiredo, Eduardo | TU Delft |
Laurenti, Luca | TU Delft |
Keywords: Stochastic systems, Formal Verification/Synthesis, Nonlinear systems
Abstract: In this paper, we consider discrete-time non-linear stochastic dynamical systems with additive process noise in which both the initial state and noise distributions are uncertain. Our goal is to quantify how the uncertainty in these distributions is propagated by the system dynamics for possibly infinite time steps. In particular, we model the uncertainty over input and noise as ambiguity sets of probability distributions close in the rho-Wasserstein distance and aim to quantify how these sets evolve over time. Our approach relies on results from quantization theory, optimal transport, and stochastic optimization to construct ambiguity sets of distributions centered at mixture of Gaussian distributions that are guaranteed to contain the true sets for both finite and infinite prediction time horizons. We empirically evaluate the effectiveness of our framework in various benchmarks from the control and machine learning literature, showing how our approach can efficiently and formally quantify the uncertainty in linear and non-linear stochastic dynamical systems.
|
|
18:00-18:15, Paper ThC03.7 | |
Distributionally Robust Cascading Risk Quantification in Multi-Agent Rendezvous: Effects of Time Delay and Network Connectivity (I) |
|
Pandey, Vivek | Lehigh University |
Siami, Milad | Northeastern University |
Motee, Nader | Lehigh University |
Keywords: Networked control systems, Stochastic systems, Agents-based systems
Abstract: Achieving safety in autonomous multi-agent systems is a critical challenge. In this paper, we propose a distributionally robust risk framework for analyzing cascading failures in multi-agent rendezvous. To capture the complex interactions between network connectivity, system dynamics, and communication delays, we use a time-delayed network model as a benchmark. We introduce a conditional distributionally robust functional to quantify cascading effects between agents, utilizing a bi-variate normal distribution. In our formulation, distributional ambiguity stems from emph{parameter uncertainty} in the underlying noise statistics and system dynamics, ensuring robustness against model misspecification. Our approach yields closed-form risk expressions that reveal the impact of time delay, parameter uncertainty, communication topology, and failure modes on rendezvous risk. The insights derived inform the design of resilient networks that mitigate the risk of cascading failures.
|
|
18:15-18:30, Paper ThC03.8 | |
Egocentric Conformal Prediction for Safe and Efficient Navigation in Dynamic Cluttered Environments (I) |
|
Shin, Jaeuk | Seoul National University |
Lee, Jungjin | Seoul National University |
Yang, Insoon | Seoul National University |
Keywords: Autonomous robots, Autonomous systems
Abstract: Conformal prediction (CP) has emerged as a powerful tool in robotics and control, thanks to its ability to calibrate complex, data-driven models with formal guarantees. However, in robot navigation tasks, existing CP-based methods often decouple prediction from control, evaluating models without considering whether prediction errors actually compromise safety. Consequently, ego-vehicles may become overly conservative or even immobilized when all potential trajectories appear infeasible. To address this issue, we propose a novel CP-based navigation framework that responds exclusively to safety-critical prediction errors. Our approach introduces egocentric score functions that quantify how much closer obstacles are to a candidate vehicle position than anticipated. These score functions are then integrated into a model predictive control scheme, wherein each candidate state is individually evaluated for safety. Combined with an adaptive CP mechanism, our framework dynamically adjusts to changes in obstacle motion without resorting to unnecessary conservatism. Theoretical analyses indicate that our method outperforms existing CP-based approaches in terms of cost-efficiency while maintaining the desired safety levels, as further validated through experiments on real-world datasets featuring densely populated pedestrian environments.
|
|
ThC04 |
Oceania IV |
Dynamics and Learning in Games |
Invited Session |
Chair: Martins, Nuno C. | University of Maryland |
Co-Chair: Ferguson, Bryce L. | Dartmouth College |
Organizer: Martins, Nuno C. | University of Maryland |
|
16:30-16:45, Paper ThC04.1 | |
Nash Equilibrium Learning in Large Populations with First Order Payoff Modifications |
|
Hankins, Matthew | University of Maryland |
Certorio, Jair | University of Maryland |
Jeng, Tzuyu | University of Maryland, College Park |
Martins, Nuno C. | University of Maryland |
Keywords: Game theory, Stability of nonlinear systems, Compartmental and Positive systems
Abstract: We establish Nash equilibrium learning—convergence of the population state to a suitably defined Nash equilibria set—for a class of payoff dynamical mechanism with a first order modification. The first order payoff modification can model aspects of the agents’ bounded rationality, anticipatory or averaging terms in the payoff mechanism, or first order Pade approximations of delays. To obtain our main results, we apply a combination of two nonstandard system-theoretic passivity notions.
|
|
16:45-17:00, Paper ThC04.2 | |
Hierarchical Decision-Making in Population Games (I) |
|
Chen, Yu-Wen | University of California, Berkeley |
Martins, Nuno C. | University of Maryland |
Arcak, Murat | University of California, Berkeley |
Keywords: Game theory, Compartmental and Positive systems, Stability of nonlinear systems
Abstract: This paper introduces a hierarchical framework for population games, where individuals delegate decision-making to proxies that act within their own strategic interests. This framework extends classical population games, where individuals are assumed to make decisions directly, to capture various real- world scenarios involving multiple decision layers. We establish equilibrium properties and provide convergence results for the proposed hierarchical structure. Additionally, based on these results, we develop a systematic approach to analyze population games with general convex constraints, without requiring individuals to have full knowledge of the constraints as in existing methods. We present a navigation application with capacity constraints as a case study.
|
|
17:00-17:15, Paper ThC04.3 | |
Robust Accelerated Dynamics for Subnetwork Bilinear Zero-Sum Games with Distributed Restarting (I) |
|
Li, Weijian | University of Toronto |
Pavel, Lacra | University of Toronto |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Game theory, Robust adaptive control, Distributed parameter systems
Abstract: In this paper, we investigate distributed Nash equilibrium seeking for a class of two-subnetwork zero-sum games characterized by bilinear coupling. We present a distributed primal-dual accelerated mirror-descent algorithm with convergence guarantees. However, we demonstrate that this time-varying algorithm is not robust, as it fails to converge under arbitrarily small disturbances. To address this limitation, we introduce a distributed accelerated algorithm that incorporates a coordinated restarting mechanism. We model this new algorithm as a hybrid dynamical system and establish its structural robustness.
|
|
17:15-17:30, Paper ThC04.4 | |
Trial-And-Error Learning in Decentralized Matching Markets (I) |
|
Shah, Vade | University of California, Santa Barbara |
Ferguson, Bryce L. | Dartmouth College |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Agents-based systems, Cooperative control
Abstract: Two-sided matching markets, environments in which two disjoint groups of agents seek to partner with one another, arise in several contexts. In static, centralized markets where agents know their preferences, standard algorithms can yield a stable matching. However, in dynamic, decentralized markets where agents must learn their preferences through interaction, such algorithms cannot be used. The goal in this work is to identify achievable stability guarantees in decentralized matching markets where (i) agents have limited information about their preferences and (ii) no central entity determines the match. Surprisingly, our results demonstrate that these constraints do not preclude stability—simple ``trial and error" learning policies guarantee convergence to stable matchings, showing that stability can be achieved even in the absence of information and centralization.
|
|
17:30-17:45, Paper ThC04.5 | |
Aggregate Fictitious Play for Learning in Anonymous Polymatrix Games (I) |
|
Kara, Semih | University of Illinois at Urbana-Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Learning, Reinforcement learning
Abstract: Fictitious play (FP) is a well-studied algorithm that enables agents to learn Nash equilibrium in games with certain reward structures. However, when agents have no prior knowledge of the reward functions, FP faces a major challenge: the joint action space grows exponentially with the number of agents, which slows down reward exploration. Anonymous games offer a structure that mitigates this issue. In these games, the rewards depend only on the actions taken; not on who is taking which action. Under this structure, we introduce aggregate fictitious play (agg-FP), a variant of FP in which each agent tracks the frequency of the number of other agents playing each action, rather than the action frequencies of individual agents. We show that in anonymous polymatrix games, agg-FP converges to a Nash equilibrium under the same conditions as classical FP. In essence, by aggregating the agents' actions, we reduce the action space without losing the convergence guarantees. Using simulations, we provide empirical evidence on how this reduction accelerates convergence.
|
|
17:45-18:00, Paper ThC04.6 | |
Passivity, No-Regret, and Convergent Learning in Contractive Games (I) |
|
Abdelraouf, Hassan | University of Illinois at Urbana Champaign |
Piliouras, Georgios | Singapore University of Technology and Design |
Shamma, Jeff S. | University of Illinois at Urbana-Champaign |
Keywords: Learning, Game theory, Stability of nonlinear systems
Abstract: We investigate the interplay between passivity, no-regret, and convergence in contractive games for various learning dynamic models and their higher-order variants. Our setting is continuous time. Building on prior work for replicator dynamics, we show that if learning dynamics satisfy a passivity condition between the payoff vector and the difference between its evolving strategy and any fixed strategy, then it achieves finite regret. We then establish that the passivity condition holds for various learning dynamics and their higher-order variants. Consequentially, the higher-order variants can achieve convergence to Nash equilibrium in cases where their standard order counterparts cannot, while maintaining a finite regret property. We provide numerical examples to illustrate the lack of finite regret of different evolutionary dynamic models that violate the passivity property. We also examine the fragility of the finite regret property in the case of perturbed learning dynamics. Continuing with passivity, we establish another connection between finite regret and passivity, but with the related equilibrium-independent passivity property. Finally, we present a passivity-based classification of dynamic models according to the various passivity notions they satisfy, namely, incremental passivity, delta-passivity, and equilibrium-independent passivity. This passivity-based classification provides a framework to analyze the convergence of learning dynamic models in contractive games.
|
|
18:00-18:15, Paper ThC04.7 | |
Opinion Dynamics for Utility Maximizing Agents: Exploring the Impact of the Resource Penalty |
|
Wankhede, Prashil | Indian Institute of Science |
Mandal, Nirabhra | University of California San Diego |
Martinez, Sonia | University of California at San Diego |
Tallapragada, Pavankumar | Indian Institute of Science |
Keywords: Game theory, Agents-based systems, Nonlinear systems
Abstract: We propose a continuous-time nonlinear model of opinion dynamics with utility-maximizing agents connected via a social influence network. A distinguishing feature of the proposed model is the inclusion of an opinion-dependent resource-penalty term in the utilities, which limits the agents from holding opinions of large magnitude. This model is applicable in scenarios where the opinions pertain to the usage of resources, such as money, time, computational resources, etc. Each agent myopically seeks to maximize its utility by revising its opinion in the gradient ascent direction of its utility function, thus leading to the proposed opinion dynamics. We show that for any arbitrary social influence network, opinions are ultimately bounded. For networks with weak antagonistic relations, we show that there exists a globally exponentially stable equilibrium using contraction theory. We establish conditions for the existence of consensus equilibrium and analyze the relative dominance of the agents at consensus. We also conduct a game-theoretic analysis of the underlying opinion formation game, including on Nash equilibria and on prices of anarchy in terms of satisfaction ratios. In addition, we also investigate the oscillatory behavior of opinions in a two-agent scenario. Finally, simulations illustrate our findings.
|
|
18:15-18:30, Paper ThC04.8 | |
Epidemic Population Games and Perturbed Best Response Dynamics |
|
Park, Shinkyu | KAUST |
Certorio, Jair | University of Maryland |
Martins, Nuno C. | University of Maryland |
La, Richard J. | University of Maryland, College Park |
Keywords: Stability of nonlinear systems, Game theory, Compartmental and Positive systems
Abstract: This paper proposes an approach to mitigate epidemic spread in a population of strategic agents by encouraging safer behaviors through carefully designed rewards. These rewards, which adapt to the evolving state of the epidemic, are ascribed by a dynamic payoff mechanism we seek to design. We use a modified SIRS model to track how the epidemic progresses in response to the agents' strategic choices. By employing perturbed best response evolutionary dynamics to model the population's strategic behavior, we extend previous related work so as to allow for noise in the agents' perceptions of the rewards and intrinsic costs of the available strategies. Central to our approach is the use of system-theoretic methods and passivity concepts to obtain a Lyapunov function, ensuring the global asymptotic stability of an endemic equilibrium with minimized infection prevalence under budget constraints. We leverage the Lyapunov function to analyze how the epidemic's spread rate is influenced by the time scale of the payoff mechanism's dynamics. Additionally, we derive anytime upper bounds on both the infectious fraction of the population and the instantaneous cost a social planner must incur to control the spread, allowing us to quantify the trade-off between peak infection prevalence and the corresponding cost. For a class of one-parameter perturbed best response models, we propose a method to learn the model's parameter from data.
|
|
ThC05 |
Galapagos II |
Analysis and Optimization of Urban Transportation Networks for Green
Mobility |
Invited Session |
Chair: Laurini, Mattia | University of Parma |
Co-Chair: Panayiotou, Christos | University of Cyprus |
Organizer: Laurini, Mattia | University of Parma |
Organizer: Ardizzoni, Stefano | University of Parma |
Organizer: Consolini, Luca | Università Di Parma |
|
16:30-16:45, Paper ThC05.1 | |
On Transport Justice and Safety in Bicycle Network Design Optimization (I) |
|
Campero Jurado, Manuel | Institut National De Recherche En Sciences Et Technologies Du Nu |
Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
De Nunzio, Giovanni | IFP Energies Nouvelles |
Salazar, Mauro | Eindhoven University of Technology |
Keywords: Transportation networks, Modeling, Network analysis and control
Abstract: This paper presents a bi-level optimization framework to decide on urban cycling infrastructure safety upgrades, increasing the degree of separation between bike lanes and motorized traffic, in line with principles of justice. Specifically, we first instantiate the lower level modeling cyclist flows with an all-or-nothing traffic assignment. Second, the upper level allocates upgrades within a fixed budget and in line with two principles of justice: conventional utilitarianism (i.e., the minimization of the overall exposure to traffic on average) and sufficientarianism (i.e., ensuring that the highest number of cyclists achieves a sufficient safety level). Third, we propose two heuristic algorithms to efficiently solve the bi-level problem. We showcase our framework for the Grenoble cycling network. Our results reveal that following a sufficientarian paradigm can considerably improve the situation of the most vulnerable users without significantly affecting overall utilitarian safety levels.
|
|
16:45-17:00, Paper ThC05.2 | |
Green-Pressure – a Weighted Queue-Length Approach towards Sustainable Intersection Management (I) |
|
Riehl, Kevin | ETH Zürich5 |
Kouvelas, Anastasios | ETH Zurich |
Makridis, Michail | ETH Zurich |
Keywords: Traffic control, Intelligent systems, Simulation
Abstract: Urban transportation networks increasingly suffer from congestion. Negative externalities resulting from noise and pollution, affect public health, quality of life, and the economy. The major traffic bottlenecks in cities are conflicts at intersections, leading to this pressing issue. Intelligent transportation systems leverage sensors to optimize traffic flows, mainly by control of traffic lights. Green-Pressure is an extension of the Max-Pressure algorithm, that leverages vehicle category information from loop-detectors for a weighted queue-length approach, to reduce emissions at signalized intersections. A multi-modal, case study of a real-world artery network with seven intersections, and 96 traffic signals, demonstrates the feasibility of the proposed method using a calibrated microsimulation model. Interestingly, the differentiation of vehicle categories at traffic lights not only enables reductions in emissions up to 9% but also improves traffic efficiency significantly (5% reduction of total travel time) when compared with the (unweighted) Max-Pressure controller. This is achieved by systematic prioritization of transporters, trucks, and buses, at the cost of slightly larger delays for passenger cars and motorcycles. Ultimately, the proposed method has the potential to achieve more sustainable road traffic leveraging existing sensor infrastructure. Code & Resources: https://github.com/DerKevinRiehl/green_pressure_control
|
|
17:00-17:15, Paper ThC05.3 | |
EQ-ALINEA – Equitable Ramp Metering for Sustainable Metropolitan Highways (I) |
|
Riehl, Kevin | ETH Zürich5 |
Zhan, Yangle | ETH Zürich |
Kouvelas, Anastasios | ETH Zurich |
Makridis, Michail | ETH Zurich |
Keywords: Traffic control, Intelligent systems, Simulation
Abstract: Highway congestion leads to urban traffic diversion, increased emissions, and extended travel times. Even though ramp metering systems effectively reduce congestion, they often do face public opposition and lack acceptance due to inequitable delay distribution and ramp access among users. EQ-ALINEA, an extension to the ALINEA algorithm, balances both the fairness and efficiency aspects of ramp metering. EQ-ALINEA implements Utilitarian (total travel time), Rawlsian (maximum waiting times at on-ramps), and Egalitarian (dispersion of delays) fairness. Three boundary conditions prevent queue spill-backs, unacceptably long maximum waiting times, and ensure sufficient time for ramp dequeueing. A microsimulation-based case study on Barcelona's metropolitan highway ring-road Ronda de Dalt showcases how EQ-ALINEA can effectively improve efficiency and fairness of highway traffic. The results show that more equitable transportation does not have to come at the cost of losses in efficiency or environmental impact. Besides democratizing the delay distribution over the user population (50% smaller Gini-coefficient) and significantly reducing maximum waiting times (by 40%), EQ-ALINEA redistributes highway accessibility to create more equal opportunities for all ramp users. Ultimately, the contribution of this work is to gain public acceptance for ramp metering by integrating fairness into the traffic control strategy. Code & Resources: https://github.com/DerKevinRiehl/eq_alinea_control
|
|
17:15-17:30, Paper ThC05.4 | |
Dynamics of Cycling Adoption: A Model with Social Influence (I) |
|
Rodriguez Canales, Eduardo Steve | Inria, Univ. Grenoble Alpes |
Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
Kibangou, Alain | Univ. Grenoble Alpes |
Keywords: Compartmental and Positive systems, Nonlinear systems, Stability of nonlinear systems
Abstract: To address the consequences of climate change, policies promoting green transportation, particularly cycling, are gaining importance. To address this need, this paper introduces a novel compartmental model to analyze the dynamics of bicycle adoption. We demonstrate the existence and global asymptotic stability of a single equilibrium point using order- preserving monotonic systems theory. Furthermore, we establish the system’s identifiability, ensuring unique parameter estimation from observed trajectories. A case study of Stockholm, Sweden, showcases the model’s ability to accurately characterize cycling adoption dynamics, highlighting its potential for informing sustainable transportation strategies.
|
|
17:30-17:45, Paper ThC05.5 | |
Optimization Methods to Improve the Quality of a Cycling Network under Budget Constraints (I) |
|
Praxedes, Rafael | Università Degli Studi Di Parma |
Subramanian, Anand | Universidade Federal Da Paraíba |
Ardizzoni, Stefano | University of Parma |
Consolini, Luca | Università Di Parma |
Laurini, Mattia | University of Parma |
Locatelli, Marco | University of Parma |
Keywords: Transportation networks, Optimization
Abstract: A well-designed cycling network is fundamental to stimulating bicycle use. The evaluation of the network quality is a complex process, that involves multiple criteria, such as the connectivity of cycling paths, road safety, and other factors. An ideal cycling network provides cycle paths along all roads, completely separated from car traffic, and built with high-quality materials. However, in practice, achieving this is very difficult due to budget constraints. In this paper, we propose an optimization strategy to select the best interventions to be applied to achieve the highest possible improvement in cycling network quality, as perceived by users, taking into account budget constraints. A Mixed-Integer Linear Programming (MILP) formulation and three solution methods are proposed. The first method consists of enumerating feasible solutions with respect to budget constraints, where each solution is associated with a subset of interventions. Although this method is more efficient in terms of memory usage than the commercial CPLEX solver to solve the MILP problem, its computation times increase exponentially with the number of interventions. Therefore, to improve computational time, we propose two heuristics based on the solution of knapsack problems, which are solved using dynamic programming. Although optimality cannot be guaranteed, these approximations are capable of providing high-quality solutions while being efficient in both computational time and memory usage.
|
|
17:45-18:00, Paper ThC05.6 | |
Optimal Safe Sequencing and Motion Control for Mixed Traffic Roundabout |
|
Chen, Yingqing | Boston University |
Cassandras, Christos G. | Boston University |
Keywords: Autonomous vehicles, Transportation networks, Traffic control
Abstract: This paper develops a control framework that jointly optimizes vehicle sequencing and motion control in a mixed traffic roundabout (where both CAVs and Human-Driven Vehicles (HDVs) coexist) to minimize travel time, energy consumption, and discomfort while ensuring speed-dependent safety guarantees and adhering to velocity and acceleration constraints. This is achieved by integrating (a) a Safe Sequencing (SS) policy that ensures merging safety without requiring any knowledge of HDV behavior, and (b) Model Predictive Control with Control Lyapunov Barrier Functions (MPC-CLBF), which optimizes CAV motion control while mitigating infeasibility and myopic control issues common in the use of Control Barrier Functions (CBFs) to provide safety guarantees. Simulation results across various traffic demands, CAV penetration rates, and control patterns demonstrate the framework's effectiveness and stability.
|
|
18:00-18:15, Paper ThC05.7 | |
A Successive Convexification-Based Approach for Efficient School Scheduling in Multi-Region Urban Networks |
|
Georgantas, Antonios | University of Cyprus |
Timotheou, Stelios | University of Cyprus |
Panayiotou, Christos | University of Cyprus |
Keywords: Transportation networks, Traffic control, Optimization algorithms
Abstract: In urban traffic networks, morning commuters exhibit diverse travel patterns, with some needing to make intermediate stops, such as dropping off children at school, before reaching their destination. When schools have synchronized start times, this induces high peak demand, exacerbating congestion. To address this challenge, we consider the problem of regulating the start times of schools in a multi-region urban network characterized by well-defined Macroscopic Fundamental Diagrams in different regions. We formulate the problem as a bi-objective mixed-integer nonlinear program aiming to jointly minimize (i) the total time spent of all vehicles, and (ii) the overall deviation from current school start times. The problem is challenging due to its large-scale and combinatorial nature, along with the nonconvexity present in the traffic dynamics across multiple interconnected urban regions. To address these challenges, we introduce a successive convexification algorithm that iteratively tightens traffic density bounds and convexifies constraints, enabling the acquisition of feasible and efficient solutions with respect to the optimization problem. Numerical experiments demonstrate that our approach yields near-optimal results, significantly mitigating congestion and improving overall traffic efficiency.
|
|
18:15-18:30, Paper ThC05.8 | |
Strategic Pricing and Routing to Maximize Profit in Congested Roads Considering Interactions with Travelers |
|
Kim, Youngseo | UCLA |
Duan, Ning | Cornell University |
Zardini, Gioele | Massachusetts Institute of Technology |
Samaranayake, Samitha | Cornell University |
Wischik, Damon | UCL |
Keywords: Transportation networks, Optimization, Network analysis and control
Abstract: We introduce an innovative approach for analyzing strategic interactions in transportation networks featuring Mobility-on-Demand (MoD) services. This study focuses on achieving company-traveler equilibria, whereby a single company optimizes pricing and routing decisions to maximize profitability while considering travelers’ mode choices, modeled via a multinomial logit model (MNL). Although profit maximization problems have been extensively studied in the field of revenue management across various domains, their application to transportation networks poses unique challenges, such as the influence of network topology and additional constraints (e.g., flow conservation, rebalancing, etc.). To address the inherent non-linear relationship arising from endogenous travel demand, we shift our domain space from price to market share. Subsequently, we derive prices using a direct one-to-one correspondence within the MNL. This work is the first effort in leveraging such novel techniques in the context of transportation network analysis. Remarkably, the proposed reformulation results in an equivalent problem exhibiting convexity, offering computational efficiency and interpretability. By solving the KKT conditions, we characterize user equilibrium with the generalized route cost, which incorporates the operating cost by rebalancing and travelers’ disutility caused by congestion. Our approach is empirically validated through a numerical analysis conducted on the widely recognized Sioux Falls network. The results underscore the effectiveness and practical applicability of our method in analyzing transportation networks featuring MoD services, and open the stage for important future investigations.
|
|
ThC06 |
Oceania I |
Security, Safety, and Resiliency in Cyber-Physical Systems II |
Invited Session |
Chair: Soudjani, Sadegh | Max Planck Institute for Software Systems |
Co-Chair: Sandberg, Henrik | KTH Royal Institute of Technology |
Organizer: Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Organizer: Sadabadi, Mahdieh S. | The University of Manchester |
Organizer: Lucia, Walter | Concordia University |
Organizer: Murguia, Carlos | Eindhoven University of Technology |
Organizer: Selvi, Daniela | Università Di Pisa |
Organizer: Soudjani, Sadegh | Max Planck Institute for Software Systems |
|
16:30-16:45, Paper ThC06.1 | |
Conditions for Effective Mitigation of Attack Impact Via Randomized Detector Tuning (I) |
|
Coimbatore Anand, Sribalaji | KTH Royal Institute of Technology |
Hassan, Kamil | KTH Royal Institute of Technology, Sweden |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Networked control systems, Fault detection, Robust control
Abstract: This paper considers the problem of detector tuning against false data injection attacks. In particular, we consider an adversary injecting false sensor data to maximize the state deviation of the plant, referred to as impact, whilst being stealthy. To minimize the impact of stealthy attacks, inspired by moving target defense, the operator randomly switches the detector thresholds. In this paper, we theoretically derive the sufficient (and in some cases necessary) conditions under which the impact of stealthy attacks can be made smaller with randomized switching of detector thresholds compared to static thresholds. We establish the conditions for the stateless (chi^2) and the stateful (CUSUM) detectors. The results are illustrated through numerical examples.
|
|
16:45-17:00, Paper ThC06.2 | |
Set-Theoretic Control and Moving Horizon Estimation for Compensation of False Data Injections in Linear Cyber Physical Systems (I) |
|
Alessandri, Angelo | University of Genoa |
Franze, Giuseppe | Universita' Della Calabria |
Keywords: Resilient Control Systems, Predictive control for linear systems, Estimation
Abstract: In this paper, a novel approach for compensating the effect of external actions on the normal operating mode of linear cyber physical systems is presented. The key idea consists in integrating into a unique framework two receding horizon-based schemes, namely the set-theoretic model predictive control and the moving horizon estimation. In particular, it is hypothesized that the intruder has model knowledge, disclosure and disruptions resources in charge to secretly tamper by false data injections the data exchanged between the controller-to-actuator and sensor-to-controller channels. The proposed architecture is capable to promptly detect the attack occurrence by exploiting set-membership properties of control unit, to accurately estimate the malicious perturbation via the moving horizon estimation procedure, and to formally prove that the resulting compensation is always feasible.
|
|
17:00-17:15, Paper ThC06.3 | |
A Distributed Model Predictive Control Architecture for Mitigating Cyber Attack Effects in Multi-Agent Leader-Follower Formations (I) |
|
Tedesco, Francesco | Università Della Calabria |
Venturino, Antonello | Università Della Calabria |
Famularo, Domenico | Università Degli Studi Della Calabria |
Franze, Giuseppe | Universita' Della Calabria |
Keywords: Resilient Control Systems, Attack Detection, Predictive control for linear systems
Abstract: This paper presents a resilient distributed reced- ing horizon control framework for leader-follower multi-agent systems subject to false data injection attacks. The approach integrates a set-membership-based detection mechanism and predictive consistency checks to identify attacks in finite time. Two mitigation strategies are introduced: (i) altruistic recovery, where agents apply precomputed control actions during an attack, and (ii) adaptive formation reconfiguration, enabling dynamic leader switching and sub-formation rejoining to iso- late compromised agents. The framework ensures constraint satisfaction, recursive feasibility, and closed-loop stability. Nu- merical simulations with unmanned aerial vehicles validate the method, demonstrating effective attack detection and recovery while maintaining formation integrity. Compared to existing approaches, this work uniquely combines model predictive control, anomaly detection, and topology adaptation for resilient multi-agent coordination.
|
|
17:15-17:30, Paper ThC06.4 | |
Cyber-Resilience Certification of Cyber-Physical Systems Subject to Impactful-Stealthy Cyber-Attacks (I) |
|
Khorasani, Khashayar | Concordia University |
Nematollahi, Mohammadreza | Concordia University |
Meskin, Nader | Qatar University |
Keywords: Cyber-Physical Security, Resilient Control Systems, Nonlinear systems
Abstract: Ensuring the cyber-resilience of control systems requires an analytical framework that systematically characterizes cyber-attacks, their stealthiness, and their impact on cyber-physical systems (CPS). In this work, we propose a novel control-theoretic framework for modeling impactful cyber-attacks that induce instability in closed-loop dynamics despite integrity and privacy safeguards. To this end, we leverage a time-reversed system formulation and recast the attack design problem as a robust stabilization problem, the solvability of which—depending on adversarial disruption and disclosure resources—certifies the system’s vulnerability to such cyber-attacks. Additionally, we introduce a barrier function-based methodology to incorporate stealth constraints, enabling a quantitative analysis of the trade-offs between attack impact and detectability. The proposed framework thus provides a rigorous foundation for cyber-resilience certification, security standardization, and the development of attack detection and mitigation strategies in industrial control systems and CPS.
|
|
17:30-17:45, Paper ThC06.5 | |
Detection and Isolation of Multiple Consecutive Faults in Nonlinear Uncertain Systems (I) |
|
Shahvali, Milad | University of Cyprus |
Kasis, Andreas | University of Cyprus |
Polycarpou, Marios M. | University of Cyprus |
Keywords: Fault diagnosis, Fault detection, Nonlinear systems
Abstract: This paper presents a robust dynamic fault detection and isolation scheme for nonlinear uncertain systems subject to multiple consecutive actuator and process faults. A model-based monitoring framework consisting of multiple submodules is developed, with the number of submodules dynamically adjusted based on the number of detected faults. Each newly designed submodule is activated to: (i) approximate the cumulative sum of all detected faults, (ii) isolate the last detected fault, either partially or fully, and (iii) detect the next fault occurrence. A key feature of the proposed approach is the utilization of the online learning-based approximation of the cumulative sum of previously detected faults to enable detection and isolation of subsequent consecutive faults in a computationally efficient manner. The boundedness of the system variables and the robustness of the proposed scheme are analytically shown. Finally, a numerical simulation validates the presented approach and demonstrates its effectiveness and applicability.
|
|
17:45-18:00, Paper ThC06.6 | |
Maximally Resilient Controllers under Temporal Logic Specifications (I) |
|
Ait Si, Youssef | Mohammed VI Polytechnic University |
Das, Ratnangshu | Indian Institute of Science, Bangalore |
Seyedmonir, Seyedehnegar | Newcastle University |
Soudjani, Sadegh | Max Planck Institute for Software Systems |
Jagtap, Pushpak | Indian Institute of Science |
Saoud, Adnane | University Mohammed VI Polytechnic |
Keywords: Robust control, Optimization, Optimal control
Abstract: In this paper, we consider the notion of resilience of a dynamical system, defined by the maximum disturbance a controlled dynamical system can withstand while satisfying a given temporal logic specifications. Indeed, given a dynamical system and a specification, the objective is to synthesize the controller that allows it to satisfy this specification while maximizing its resilience. The problem is formulated as a robust optimization program where the objective is to compute the maximum resilience while simultaneously synthesizing the corresponding controller parameters. For linear systems and linear controllers, exact solutions are provided for the class of time-varying polytopic specifications. For the case of nonlinear systems, nonlinear controllers and more general specifications, we leverage tools from scenario optimization approach, offering a probabilistic guarantee of the solution as well as computational feasibility. Different case studies are presented to illustrate the theoretical results.
|
|
18:00-18:15, Paper ThC06.7 | |
Safety-Aware Multi-Agent Reinforcement Learning for Dynamic Network Bridging (I) |
|
Galliera, Raffaele | The University of West Florida - Institute for Human and Machine |
Mitsopoulos, Konstantinos | Institute for Human and Machine Cognition |
Suri, Niranjan | The University of West Florida - Institute for Human and Machine |
Romagnoli, Raffaele | Duquesne University |
Keywords: Reinforcement learning, Cooperative control, Agents-based systems
Abstract: Addressing complex cooperative tasks in safety-critical environments poses significant challenges for multi-agent systems, especially under conditions of partial observability. We focus on a dynamic network bridging task, where agents must learn to maintain a communication path between two moving targets. To ensure safety during training and deployment, we integrate a control-theoretic safety filter that enforces collision avoidance through local setpoint updates. We develop and evaluate a multi-agent reinforcement learning (MARL) approach with safety-informed message passing. The results suggest that local safety enforcement and MARL can be effectively combined for distributed tasks.
|
|
18:15-18:30, Paper ThC06.8 | |
Resilient Distributed State Estimation for Nonlinear Cyber-Physical Systems with Sensor Networks under Cyberattacks |
|
Kazemi, Hamed | Concordia University |
Khorasani, Khashayar | Concordia University |
Keywords: Cyber-Physical Security, Sensor networks, Resilient Control Systems
Abstract: This paper introduces a resilient distributed state estimation framework for nonlinear cyber-physical systems (CPS) using sensor networks under cyberattacks. Each sensor node integrates a state estimator and a cyberattack detector to enhance system robustness. The proposed approach combines the Distributed Hybrid Information Fusion (DHIF) technique with graph-theoretic principles for multi-step state estimation. False Data Injection (FDI) attacks on communication links are detected and isolated, with a resilient estimation strategy that adapts to changes in network topologies, ensuring that the system can recover from attack impacts. We prove that distributed state estimation remains ultimately bounded, even with graph switching. A case study on a UAV navigating a sensor network with limited sensing coverage demonstrates the effectiveness of the proposed method in detecting cyberattacks, isolating compromised links, and recovering resilient state estimation.
|
|
ThC07 |
Capri I |
Analysis and Design of Input Redundant Systems |
Invited Session |
Chair: Galeani, Sergio | Università Di Roma Tor Vergata |
Co-Chair: Kreiss, Jérémie | Université De Lorraine, CRAN, ENSEM, |
Organizer: Valentim Viana, Valessa | Université De Lorraine |
Organizer: Kreiss, Jérémie | Université De Lorraine, CRAN, ENSEM, |
Organizer: Sassano, Mario | University of Rome, Tor Vergata |
Organizer: Galeani, Sergio | Università Di Roma Tor Vergata |
|
16:30-16:45, Paper ThC07.1 | |
Input Redundancy of Switched Systems Concerning the Switching Signal |
|
Valentim Viana, Valessa | Université De Lorraine |
Kreiss, Jérémie | Université De Lorraine, CRAN, ENSEM, |
Jungers, Marc | CNRS - Université De Lorraine |
Keywords: Switched systems, Linear systems
Abstract: In this paper, we investigate the concept of input redundancy of switched systems, where the switching signal is the control input. An input redundant system in this context describes a system in which the same output trajectory can be generated by non-equivalent switching signals. The concept of equivalence in switching signals means that two signals generate identical state trajectories under identical initial conditions. Unlike the linear time-invariant case where input redundancy is uniform in the initial condition, this property is absent in the considered study. For autonomous switched linear systems, we developed a strategy to verify the existence of initial conditions where non-equivalent switching signals produce the same output. A numerical example is presented to illustrate the proposed strategy, for which we also compute the entire set of initial conditions that leads to input redundancy.
|
|
16:45-17:00, Paper ThC07.2 | |
On the Construction of a Minimal Order Annihilator and Its Role in Dynamic Control Allocation (I) |
|
Valentim Viana, Valessa | Université De Lorraine |
Galeani, Sergio | Università Di Roma Tor Vergata |
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Linear systems, Optimal control
Abstract: This paper revisits the dynamic allocation problem for linear systems subject to periodic exogenous inputs. In previous works, the allocator is structured as a cascade of an optimizer and an annihilator, and incorporates a certain number of copies of the exosystem model. The annihilator can be designed using different methodologies. In this work, we compare two strategies for its computation with the objective of obtaining one of minimal order. Based on this analysis, we propose a revisited allocator design that reduces the overall dimension of its dynamics by embedding the exosystem modes directly into the annihilator.
|
|
17:00-17:15, Paper ThC07.3 | |
Necessary and Sufficient Condition for Solvability of Output Regulation Problem for Hybrid Linear Systems with Unpredictable Jumps (I) |
|
Gabrielli, Gianmatteo | University of Rome, Tor Vergata |
Galeani, Sergio | Università Di Roma Tor Vergata |
Menini, Laura | Univ. Rome Tor Vergata |
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Hybrid systems, Output regulation, Linear systems
Abstract: In this paper the output regulation problem for a class of hybrid linear systems in the presence of time- driven jumps and uncertain time domain is considered. More specifically, a generalized family of hybrid time domains, not known a priori to the controller, is analyzed, allowing for the possibility of multiple jumps occurring at the same time instant, namely simultaneous jumps. The geometric characterization of the relevant regulation manifolds is provided by means of necessary and sufficient conditions, within the full information setting. Finally, the theory is illustrated and validated through numerical examples.
|
|
17:15-17:30, Paper ThC07.4 | |
Sparse Control of Linear Continuous-Time Systems: A Geometric Approach (I) |
|
Safarika, Eleftheria | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Linear systems, Switched systems, Stability of linear systems
Abstract: This paper solves the sparse control problem for linear two-input systems. This is achieved by developing a canonical form for sparse control and exploiting the theory of switched systems to design a sparse feedback law achieving asymptotic stability of the zero equilibrium of the closed-loop system. The results are illustrated through a simple case study.
|
|
17:30-17:45, Paper ThC07.5 | |
A Sensitivity Approach to Periodic Control Allocation in Nonlinear Systems (I) |
|
Akbari, Shima | PhD Student at Italian National Program in Autonomous Systems |
Galeani, Sergio | Università Di Roma Tor Vergata |
Manca, Giorgio | Tor Vergata University of Rome |
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Optimal control, Output regulation, Nonlinear systems
Abstract: The objective of this paper is to propose a dynamic control allocation architecture for nonlinear systems that allow for the presence of periodic exogenous signals. The design is achieved by combining a gradient-based optimization strategy with the use of sensitivity dynamics for nonlinear systems that depend on parameters. The resulting strategy permits extending control optimization beyond the more standard point-wise (instantaneous) allocation schemes, as illustrated also by means of an academic example, while preserving the appealing output invisibility feature typical of the linear context.
|
|
17:45-18:00, Paper ThC07.6 | |
Set-Based and Dynamical Feedback-Augmented Hands-Off Control |
|
Sperila, Andrei | CentraleSupelec, Universite Paris-Saclay |
Olaru, Sorin | CentraleSupélec |
Drobot, Stéphane | RTE |
Keywords: Linear systems, Constrained control, Switched systems
Abstract: A novel set-theoretical approach to hands-off control is proposed, which focuses on spatial arguments for command limitation, rather than temporal ones. By employing dynamical feedback alongside invariant set-based constraints, actuation is employed only to drive the system’s state inside a “hands-off region” of its state-space, where the plant may freely evolve in open-loop configuration. A computationally-efficient procedure with strong theoretical guarantees is devised, and its effectiveness is showcased via an intuitive practical example.
|
|
18:00-18:15, Paper ThC07.7 | |
Maximum Hands-Off Hybrid Control for Discrete-Time Switched Linear Systems |
|
U, Darsana | Indian Institute of Technology, Kharagpur |
Kundu, Atreyee | Indian Institute of Technology Kharagpur |
Keywords: Switched systems, Optimization, Optimal control
Abstract: We study a maximum hands-off hybrid control paradigm for discrete-time switched linear systems. It is the sparsest element among all hybrid control sequences that steer a given initial state of the switched system to the origin in a given duration of time. Our contributions are twofold. First, we present an algorithm that designs sparse hybrid control sequences for discrete-time switched linear systems. Second, we present sufficient conditions on the subsystems dynamics, the initial state and the time horizon under which a hybrid control sequence obtained from our algorithm is a maximum hands-off hybrid control sequence. The key apparatuses for our analysis are mixed integer optimization and sparse optimization theory.
|
|
18:15-18:30, Paper ThC07.8 | |
Recursive Regulator for Systems with State and Input Delays and Parametric Uncertainties |
|
Almeida Dias Bueno, José Nuno | University of São Paulo at São Carlos |
Odorico, Elizandra Karla | University of Sao Paulo |
Terra, Marco Henrique | University of São Paulo at São Carlos |
Ribeiro, Eduardo Godinho | University of São Paulo |
Barbosa Marcos, Lucas | Federal University of São Carlos |
Keywords: Linear systems, Robust control, Uncertain systems
Abstract: This paper revisits the control problem of discrete-time linear systems subject to state and input delays, as well as norm-bounded parametric uncertainties. A robust recursive regulator based on an augmented system encompassing delays and uncertainties in a unified framework is proposed. A robust regularization approach with penalization is developed for a constrained optimization problem. Both delays are transformed in data under an umbrella of regularized least-squares problems. A recursive Riccati equation is obtained as a solution, providing standard conditions for proving convergence and stability a posteriori. Based on Gauss's arguments, updated to account for uncertainties, this approach has been proposed as an alternative to Lyapunov-type solutions, where the stability of the feedback control system is defined a priori. We present two numerical examples where we consider an industrial electric heater model to illustrate the performance of the robust regulator. A comparative study is performed with three guaranteed cost approaches.
|
|
ThC08 |
Oceania V |
Reinforcement Learning II |
Regular Session |
Chair: Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Co-Chair: van Hulst, Jilles | Eindhoven University of Technology |
|
16:30-16:45, Paper ThC08.1 | |
Efficient Reward Identification in Max Entropy Reinforcement Learning with Sparsity and Rank Priors |
|
Shehab, Mohamad Louai | University of Michigan Ann Arbor |
Tercan, Alperen | University of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Reinforcement learning, Identification, Optimization
Abstract: In this paper, we consider the problem of recovering time-varying reward functions from either optimal policies or demonstrations coming from a max entropy reinforcement learning problem. This problem is highly ill-posed when no additional structural properties of the underlying rewards are assumed. However, in many applications, the rewards are indeed parsimonious, and some prior information is available. We consider two such priors on the rewards: 1) rewards are mostly constant and they change infrequently, 2) rewards can be represented by a linear combination of a small number of feature functions. We first show that the reward identification problem with the former prior can be recast as a sparsification problem subject to linear constraints. Moreover, we give a polynomial-time algorithm that solves this sparsification problem exactly. Then, we show that identifying rewards representable with the minimum number of features can be recast as a rank minimization problem subject to linear constraints, for which convex relaxations of rank can be invoked. In both cases, these observations lead to efficient optimization-based reward identification algorithms. Several examples are given to demonstrate the accuracy of the recovered rewards as well as their generalizability.
|
|
16:45-17:00, Paper ThC08.2 | |
A Lagrangian Framework for Safe Cooperative Reinforcement Learning |
|
Das, Soham | The University of Tennessee, Knoxville |
Chamon, Luiz F. O. | École Polytechnique |
Paternain, Santiago | Rensselaer Polytechnic Institute |
Eksin, Ceyhun | Texas A&M University |
Keywords: Reinforcement learning, Distributed control, Cooperative control
Abstract: We consider the problem of safe cooperative multiagent reinforcement learning (MARL) within the framework of a constrained multiagent Markov decision process (MDP). Agents share a common value function and learn to coordinate their actions to maximize a joint objective while adhering to system-level constraints. These constraints can enforce safety, reliability, or additional regulatory requirements governing the evolution of the multiagent system. We propose a Lagrangian-based approach, where agents iteratively solve a relaxed Lagrangian MDP using a joint learning mechanism. During execution, agents independently follow their policies, accumulating constraint violations over an epoch, which are then used to update the Lagrange multipliers. We show that continuous execution of this primal-dual algorithm produces episodes which are feasible almost surely. Further, we prove that the sequence of policies generated by the algorithm yields a nonstationary approximately optimal solution for the safe cooperative MARL problem.
|
|
17:00-17:15, Paper ThC08.3 | |
Incentivized Lipschitz Bandits |
|
Chakraborty, Sourav | University of Colorado |
Rege, Amit Kiran | University of Colorado Boulder |
Monteleoni, Claire | University of Colorado Boulder |
Chen, Lijun | University of Colorado at Boulder |
Keywords: Reinforcement learning, Randomized algorithms, Machine learning
Abstract: We study incentivized exploration in multi-armed bandit (MAB) settings with infinitely many arms, modeled as elements of a continuous metric space. Unlike classical bandit models, we consider scenarios where the decision-maker (principal) incentivizes myopic agents to explore beyond their greedy choices through compensation, but with the complication of reward drift, i.e., biased feedback arising due to the incentives. We propose novel incentivized exploration algorithms that discretize the infinite arm space uniformly, demonstrating that these algorithms simultaneously achieve sublinear cumulative regret and sublinear total compensation in expectation. Specifically, we derive regret and compensation bounds of Tilde{O}(T^{d+1/d+2}), with d representing the covering dimension of the metric space. Furthermore, we generalize our results to contextual bandits and achieve comparable performance guarantees. We support our theoretical findings with numerical simulations.
|
|
17:15-17:30, Paper ThC08.4 | |
Generating Informative Benchmarks for Reinforcement Learning |
|
Yaremenko, Grigory | Skolkovo Institute of Science and Technology |
Ibrahim, Sinan | Skolkovo Institute for Science and Technology |
Moreno Mora, Francisco Javier | Technische Universität Chemnitz |
Osinenko, Pavel | Skoltech Institute of Science and Technology |
Streif, Stefan | Technische Universität Chemnitz |
Keywords: Reinforcement learning
Abstract: Reinforcement learning (RL) benchmarking has long relied on learning curves and cumulative reward tables, yet these metrics fail to capture critical design challenges, such as environment sensitivity, robustness, and reproducibility. This paper introduces a novel benchmarking paradigm that rigorously evaluates these understudied properties through a generative framework grounded in converse optimality. By constructing families of control problems with provably optimal solutions, our method enables statistical hypothesis testing of an algorithm’s response to environment perturbations both during training and post-deployment. Central to this approach is a problem generator that synthesizes parametric variations of dynamics and costs, paired with a sensitivity evaluation pipeline. To illustrate the framework’s utility, we apply it to compare environment sensitivity between Adam and RAdam optimizers -- a case study designed to clarify how exactly the methodology provides the means to evaluate reinforcement learning algorithms in an informative way. This work contributes to RL evaluation by shifting focus from sample efficiency alone to methodologies that assess algorithms under conditions reflective of practical engineering challenges.
|
|
17:30-17:45, Paper ThC08.5 | |
Smart Exploration in Reinforcement Learning Using Bounded Uncertainty Models |
|
van Hulst, Jilles | Eindhoven University of Technology |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Reinforcement learning, Stochastic optimal control, Uncertain systems
Abstract: Reinforcement learning (RL) is a powerful framework for decision-making in uncertain environments, but it often requires large amounts of data to learn an optimal policy. We address this challenge by incorporating prior model knowledge to guide exploration and accelerate the learning process. Specifically, we assume access to a model set that contains the true transition kernel and reward function. We optimize over this model set to obtain upper and lower bounds on the Q-function, which are then used to guide the exploration of the agent. We provide theoretical guarantees on the convergence of the Q-function to the optimal Q-function under the proposed class of exploring policies. Furthermore, we also introduce a data-driven regularized version of the model set optimization problem that ensures the convergence of the class of exploring policies to the optimal policy. Lastly, we show that when the model set has a specific structure, namely the bounded-parameter MDP (BMDP) framework, the regularized model set optimization problem becomes convex and simple to implement. In this setting, we also prove finite-time convergence to the optimal policy under mild assumptions. We demonstrate the effectiveness of the proposed exploration strategy, which we call BUMEX (Bounded Uncertainty Model-based Exploration), in a simulation study. The results indicate that the proposed method can significantly accelerate learning in benchmark examples. A toolbox is available at https://github.com/JvHulst/BUMEX.
|
|
17:45-18:00, Paper ThC08.6 | |
Coordinated Q-Functionals |
|
Findik, Yasin | University of Massachusetts Lowell |
Azadeh, Reza | University of Massachusetts Lowell |
Keywords: Reinforcement learning, Cooperative control
Abstract: Learning efficiently in continuous-action multi-agent environments remains a central challenge in reinforcement learning. While value-based algorithms offer strong sample efficiency in discrete domains, extending them to continuous spaces is non-trivial due to the difficulty of evaluating infinite action sets. Policy-based methods address this challenge by using critic networks to guide and stabilize the learning process, yet they often struggle with high variance in gradient estimates and convergence to local optima. Departing from the typical reliance on critics, we propose Coordinated Q-Functionals (CoQF) --- a novel value-based algorithm for learning in continuous multi-agent domains. CoQF represents each agent' state using basis functions, enabling efficient evaluation of densely sampled actions and coordinated mixing of value estimates across agents. This architecture supports both effective coordination and improved sample efficiency. Empirical results across six cooperative tasks show that CoQF consistently outperforms four Deep Deterministic Policy Gradient (DDPG) variants in both learning speed and final performance.
|
|
18:00-18:15, Paper ThC08.7 | |
Enabling Pareto-Stationarity Exploration in Multi-Objective Reinforcement Learning: A Multi-Objective Weighted-Chebyshev Actor-Critic Approach |
|
Hairi, Fnu | University of Wisconsin-Whitewater |
Yang, Jiao | Amazon |
Zhou, Tianchen | Amazon |
Yang, Haibo | Rochester Institute of Technology |
Dong, Chaosheng | Amazon |
Yang, Fan | Amazon |
Momma, Michinari | Amazon |
Gao, Yan | Amazon |
Liu, Jia | The Ohio State University |
Keywords: Reinforcement learning
Abstract: In many multi-objective reinforcement learning (MORL) applications, being able to systematically explore the Pareto-stationary solutions under multiple non-convex reward objectives with theoretical finite-time sample complexity guarantee is an important and yet under-explored problem. This motivates us to take the first step and fill the important gap in MORL. Specifically, in this paper, we propose a uline{M}ulti-uline{O}bjective weighted-uline{CH}ebyshev uline{A}ctor-critic (policyns) algorithm for MORL, which judiciously integrates the weighted-Chebychev (WC) and actor-critic framework to enable Pareto-stationarity exploration systematically with finite-time sample complexity guarantee. Sample complexity result of policy algorithm reveals an interesting dependency on p_{min} in finding an epsilon-Pareto-stationary solution, where p_{min} denotes the minimum entry of a given weight vector bm{p} in WC-scalarization. By carefully choosing learning rates, the sample complexity for each exploration can be tilde{mathcal{O}}(epsilon^{-2}). Furthermore, simulation studies on a large KuaiRand offline dataset, show that the performance of policy algorithm significantly outperforms other baseline MORL approaches.
|
|
ThC09 |
Oceania VIII |
Nonlinear System Identification II |
Regular Session |
Chair: Zeilinger, Melanie N. | ETH Zurich |
Co-Chair: Gluzman, Igal | Technion - Israel Institute of Technology |
|
16:30-16:45, Paper ThC09.1 | |
Zone Model Predictive Control with Active Learning and Application to Cerebrospinal Fluid Dynamics |
|
Flürenbrock, Fabian | ETH Zurich |
Köhler, Johannes | ETH Zurich |
Schmid Daners, Marianne | ETH Zurich |
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Closed-loop identification, Predictive control for nonlinear systems, Biomedical
Abstract: This paper presents a zone model predictive control (MPC) scheme with active dynamics learning for a class of nonlinear systems with time-varying and uncertain parameters. The goal of this MPC scheme is to provide a systematic trade-off between three competing objectives: regulation of the system state to a desired zone, identification of the system parameters up to a desired accuracy, and minimization of the rate of input changes. To address this challenge, we integrate the covariance propagation of a Kalman filter used for parameter estimation into a zone MPC scheme and control the accuracy of the parameter estimates to a desired level by introducing a suitable soft constraint on the predicted covariances. We highlight the potential of the active learning zone MPC for biomedical applications through simulations of cerebrospinal fluid dynamics, demonstrating how the proposed method can be used to improve the diagnosis and therapy of hydrocephalus.
|
|
16:45-17:00, Paper ThC09.2 | |
Adaptive Neuro-Fuzzy Approach for Identification of Multivariable Hammerstein Systems with Static Non-Smooth Nonlinearities |
|
Santos, Luís Henrique | Federal University of Minas Gerais |
Ricco, Rodrigo Augusto | Universidade Federal De Ouro Preto |
Teixeira, Bruno Otávio Soares | Federal University of Minas Gerais (UFMG) |
Keywords: Nonlinear systems identification, Subspace methods, Fuzzy systems
Abstract: This paper presents a methodology for the identification of multivariable systems with non-smooth nonlinearities using Hammerstein models in the state-space representation. The proposed approach extends existing methods in the literature based on autoregressive models, enabling the estimation of consistent models even in the presence of both white measurement and process noise. Additionally, an adaptive clustering strategy is introduced, which dynamically adjusts the distribution of clusters according to the variation of the steady-state gain curve, reducing the model complexity without loss of accuracy. The methodology is validated in a multivariable thermal system, demonstrating improved accuracy in representing nonlinearity with a reduced number of clusters.
|
|
17:00-17:15, Paper ThC09.3 | |
Safe Extrapolation of Autonomous Data-Driven Augmented Models |
|
Habboush, Abdullah | Eindhoven University of Technology |
Shakib, Fahim | Imperial College London |
Oomen, Tom | Eindhoven University of Technology |
Van De Wouw, Nathan | Eindhoven University of Technology |
Keywords: Nonlinear systems identification, Lyapunov methods, Stability of nonlinear systems
Abstract: Hybrid modeling augments physics-based models with data-driven components to improve accuracy on a given dataset while retaining physical insight. However, data-driven components are known to be unreliable when extrapolating beyond the training data, which can severely degrade a hybrid model's accuracy and jeopardize its stability properties. Given an existing hybrid model, we propose a novel model modification technique that 1) preserves the accuracy of the hybrid model within the training domain, 2) enforces alignment with the physics-based model when extrapolating beyond the training domain, and 3) guarantees a safe (i.e., stability preserving) extrapolation behavior outside the training domain. This ensures reliable extrapolation of the hybrid model consistent with prior physical knowledge. The effectiveness of the proposed approach is demonstrated through a simulation case study.
|
|
17:15-17:30, Paper ThC09.4 | |
An Actuator Pre-Filtering Approach to Control-Coherent Koopman Modeling: Extending Koopman Operators to Systems with Control |
|
Asada, H. Harry | Massachusetts Inst. of Tech |
Keywords: Modeling, Robotics, Nonlinear systems identification
Abstract: The original Koopman operator theory cannot be applied to non-autonomous systems with exogenous input. Linearizing a nonlinear state equation with respect to the control input causes significant error, misinforming the controller. Bilinear approximations produce better accuracy, but the resultant model is no longer linear. This paper presents an alternative method for constructing a Koopman model for non-autonomous systems. Control-Coherent Koopman (CCK) modeling allows us to obtain a linear model with a constant control matrix by incorporating the dynamics of actuators. Here, the principle of CCK modeling is extended to those systems where actuator dynamics do not meet the requirements for the CCK formulation. Physical actuator dynamics are replaced by virtual dynamics, which are analogous to actuator pre-filters. These virtual dynamics possess independent state variables and new input terms that appear linearly in the filter dynamics. Although the original CCK physical modeling approach suffers a stiff equation problem, the actuator pre-filter approach does not. Conditions for implementation and numerical examples are discussed at the end.
|
|
17:30-17:45, Paper ThC09.5 | |
Reduced Order Modeling Using Rational Approximations |
|
Singh, Rajiv | The MathWorks |
Sznaier, Mario | Northeastern University |
Ljung, Lennart | Linkoping Univ |
Keywords: Reduced order modeling, Linear parameter-varying systems, Nonlinear systems identification
Abstract: We consider the problem of the identification and reduction of high-fidelity models using test trajectories. We propose a multivariate rational approximation technique that is amenable to adaptive updates and scales well with the data size and state dimensions. Building upon the notion of metric interpolation, we develop algorithms for creating least-order rational approximants that select and interpolate a set of local models using certain noise-robust distance metrics. We develop a sum-of-squares (SOS) polynomial based method for determining the asymmetric region of influence of the local models. The estimation method is able to incorporate long-horizon fidelity requirements with noisy data. We make a connection to the Linear Parameter-Varying (LPV) models that reveals a way to determine the LPV form as well as its scheduling rules automatically from the data. We illustrate the efficacy of the proposed approach on a Gyroscope modeling problem.
|
|
17:45-18:00, Paper ThC09.6 | |
EXplainable AI for Data Driven Control: An Inverse Optimal Control Approach |
|
Porcari, Federico | Politecnico Di Milano |
Formentin, Simone | Politecnico Di Milano |
Materassi, Donatello | University of Minnesota |
Keywords: Data driven control, Optimal control, Learning
Abstract: Understanding the behavior of black-box data-driven controllers is a key challenge in modern control design. In this work, we propose an eXplainable AI (XAI) methodology based on Inverse Optimal Control (IOC) to obtain local explanations for the behavior of a controller operating around a given region. Specifically, we extract the weights assigned to tracking errors and control effort in the implicit cost function that a black-box controller is optimizing, offering a more transparent and interpretable representation of the controller’s underlying objectives. This approach presents connections with well-established XAI techniques, such as Local Interpretable Model-agnostic Explanations (LIME) since it is still based on a local approximation of the control policy. However, rather being limited to a standard sensitivity analysis, the explanation provided by our method relies on the solution of an inverse Linear Quadratic (LQ) problem, offering a structured and more control-relevant perspective. Numerical examples demonstrate that the inferred cost function consistently provides a deeper understanding of the controller’s decision-making process, shedding light on otherwise counterintuitive or unexpected phenomena.
|
|
18:00-18:15, Paper ThC09.7 | |
Bridging Abstraction-Based Hierarchical Control and Moment Matching: A Conceptual Unification |
|
Niu, Zirui | Imperial College London |
Shakib, Fahim | Imperial College London |
Scarciotti, Giordano | Imperial College London |
Keywords: Model/Controller reduction, Simulation, Reduced order modeling
Abstract: In this paper, we establish a relation between approximate-simulation-based hierarchical control (ASHC) and moment matching techniques, and build a conceptual bridge between these two frameworks. To this end, we study the two key requirements of the ASHC technique, namely the bounded output discrepancy and the M-relation, through the lens of moment matching. We show that, in the linear time-invariant case, both requirements can be interpreted in the moment matching perspective through certain system interconnection structures. Building this conceptual bridge provides a foundation for cross-pollination of ideas between these two frameworks.
|
|
18:15-18:30, Paper ThC09.8 | |
Choosing between Active and Passive Flow Control Via Input-Output Analysis: Application to Couette Flow |
|
Frank-Shapir, Ofek | Technion |
Gluzman, Igal | Technion - Israel Institute of Technology |
Keywords: Fluid flow systems, Modeling
Abstract: Input-output analysis is utilized to determine between passive or active flow control strategies in transitional wall-bounded shear flows for a given streamwise and spanwise wave number pair (k_x,k_z) and to quantify the optimal temporal frequency in active flow control with constant actuation frequency that yields the strongest response to external forcing. Applying our methodology to plane Couette base flow reveals that for most actuation geometries and Reynolds numbers Re, the optimal actuation frequency is zero, corresponding to passive control devices or actuators that impose continuous forcing. We find that the scenarios in which active actuation is preferred are concentrated on a thin strip on the logarithmic Re-k_x plane. Using our results, we derive two analytical equations: the first equation allows us to determine if active or passive actuation is preferred for a given streamwise and spanwise wave number pair and the Reynolds number. The second equation allows us to determine the optimal actuation frequency in the region where active actuation is preferred without the need for experimentation or high-fidelity simulations. Our analysis shows that this optimal actuation frequency is inversely proportional to the Reynolds number.
|
|
ThC10 |
Oceania VII |
Distributed and Decentralized Control III |
Regular Session |
Chair: Lesic, Vinko | University of Zagreb, Faculty of Electrical Engineering and Computing |
Co-Chair: Vasak, Mario | University of Zagreb Faculty of Electrical Engineering and Computing |
|
16:30-16:45, Paper ThC10.1 | |
Distributed Model Predictive Control of Hybrid Energy Storage System |
|
Vrbanc, Filip | University of Zagreb, Faculty of Electrical Engineering and Comp |
Car, Mateja | University of Zagreb, Faculty of Electrical Engineering and Comp |
Vasak, Mario | University of Zagreb Faculty of Electrical Engineering and Compu |
Lesic, Vinko | University of Zagreb, Faculty of Electrical Engineering and Comp |
Keywords: Distributed control, Smart grid, Predictive control for nonlinear systems
Abstract: This paper deals with a distributed heterogeneous storage microgrid, which consists of variable efficiency batteries and latent thermal energy storage. The nonlinear efficiency of battery charging and discharging is determined using data from converter datasheets. A corresponding nonlinear problem for each battery is solved using sequential linear programming. The nonlinearity of thermal energy storage stems from the state transition properties of phase change material and is addressed through mixed integer programming. Distributed control is applied to adjust individual solutions of energy storage units and improve overall microgrid performance while adhering to the joint constraint of maximum grid power exchange capacity. It is based on asymmetric projection algorithm that utilizes the gradient of cost function to steer the solution towards the global optimum. Furthermore, it enhances data privacy and improves economic gain. The algorithm is implemented in a realistic simulation that incorporates actual load consumption data and real price profile. Presented results show that distributed control improves monetary profit by 1.20% and 5.12% compared to the constrained decentralized control and conventional transactive control, respectively and over 4× compared to the no storage case.
|
|
16:45-17:00, Paper ThC10.2 | |
PredSLS: A System-Level Framework for Distributed Predictive Control |
|
Wu, Yifei | Chinese University of Hong Kong, Shenzhen |
Yu, Jing | University of Washington |
Li, Tongxin | The Chinese University of Hong Kong, Shenzhen |
Keywords: Distributed control, Predictive control for linear systems, Networked control systems
Abstract: Distributed control of large-scale systems is challenging due to the need for scalable and localized communication and computation. In this work, we introduce a Predictive System-Level Synthesis (PredSLS) framework that designs closed-loop controllers by integrating prediction information into an affine feedback structure. Rather than focusing on worst-case uncertainty, PredSLS leverages both current state feedback and future predictions to achieve effective control in distributed settings. In particular, PredSLS enables a unified system synthesis of the optimal kappa-truncated controller, therefore outperforms approaches with post hoc communication truncation, as was commonly seen in the literature. A core feature of the PredSLS is the temporal decaying property ensures that the influence of prediction errors diminishes exponentially over time, effectively localizing the effect of disturbances and enabling the use of finite impulse response approximations for efficient computation. Moreover, the PredSLS framework can be naturally decomposed into spatial and temporal components for scalable parallel computation across the network.
|
|
17:00-17:15, Paper ThC10.3 | |
Distributed Model Predictive Frequency Control in Inverter-Based Microgrids Based on ADMM with Virtual Subsystems |
|
Och, Alexander | RWTH Aachen |
Ulbig, Andreas | RWTH Aachen University |
Keywords: Power systems, Distributed control, Power electronics
Abstract: This paper presents a distributed Model Predictive Controller (dMPC) for frequency regulation in inverterbased microgrids, leveraging the Alternating Direction Method of Multipliers (ADMM) for decentralized optimization. The proposed approach addresses the scalability and communication challenges of centralized MPC by decomposing the global optimization problem into localized subproblems. The algorithm is validated using a microgrid scenario derived from the CIGRE medium-voltage benchmark network, adapted for islanded operation. The network configuration emphasizes local subsystem interactions, reflecting realistic operating conditions while focusing on the optimizer’s performance. Results demonstrate that the dMPC achieves near-identical performance to its centralized counterpart when sufficient iterations ensure convergence. Under limited computational resources, the controller reliably tracks optimal operating points when constraints are inactive but exhibits violations when constraints become active. Strategies such as improved initialization schemes and dynamically adjusted optimization parameters show promise in mitigating these limitations. The findings underscore effectiveness and robustness of ADMM in decentralizing complex control problems while ensuring coordination across subsystems.
|
|
17:15-17:30, Paper ThC10.4 | |
Lyapunov-Certified Resilient Secondary Defense Strategies of AC Microgrids under Exponentially Energy-Unbounded FDI Attacks |
|
Rajabinezhad, Mohamadamin | University of Connecticut (UCONN) |
Shams, Nesa | University of Connecticut |
Khan, Asad Ali | The University of Texas at San Antonio |
Beg, Omar | The University of Texas Permian Basin |
Zuo, Shan | University of Connecticut |
Keywords: Distributed control, Power systems, Hierarchical control
Abstract: This letter presents fully distributed Lyapunov-certified resilient secondary defense strategies for islanded inverter-based AC microgrids, designed to counter a broad spectrum of exponentially energy-unbounded false data injection (EEU-FDI) attacks, targeting control input channels. While distributed control improves scalability and reliability, it also increases susceptibility to cyber threats. The proposed defense strategies, supported by rigorous Lyapunov-based proofs, ensure uniformly ultimately bounded (UUB) convergence for frequency regulation, voltage containment, and power sharing, even under EEU-FDI attacks. The effectiveness of the proposed defense strategies has been demonstrated through case studies on a modified IEEE 34-bus system, leveraging simulations and real-time controller hardware-in-the-loop experiments using OPAL-RT.
|
|
17:30-17:45, Paper ThC10.5 | |
Distributed Source Seeking for an Uncertain Signal Source Using Adaptive Model Predictive Control |
|
Gao, Xinzhou | University of Alberta |
Shu, Zhan | University of Alberta |
Liu, Jason J. R. | The University of Hong Kong |
Keywords: Distributed control, Predictive control for linear systems, Sensor networks
Abstract: In this paper, we address the source seeking problem with a moving signal source, and propose an adaptive model predictive control (MPC) scheme. In particular, we model the signal field generated by the signal source as a time- varying linear combination of several convex basis functions. A distributed system is leveraged to estimate the signal field based on a distributed set-membership estimation procedure, and the estimation result is used as the output function of each agent in the system. To ensure the recursive feasibility of MPC while the signal source is moving unpredictably, an adaptive terminal set is designed. We also demonstrate that our closed- loop system is practically stable under the proposed adaptive MPC. Simulation results show the effectiveness of our method.
|
|
17:45-18:00, Paper ThC10.6 | |
PRIME: Fast Primal-Dual Feedback Optimization for Markets with Application to Optimal Power Flow |
|
Behr, Nicholas Julian | ETH Zuerich |
Bianchi, Mattia | ETH Zurich |
Moffat, Keith | ETH Zurich |
Bolognani, Saverio | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Distributed control, Smart grid, Control of networks
Abstract: Online Feedback Optimization (OFO) controllers iteratively drive a plant to an optimal operating point that satisfies input and output constraints, relying solely on the input-output sensitivity as model information. This paper introduces PRIME (PRoximal Iterative MarkEts), a novel OFO approach based on proximal-point iterations. Unlike existing OFO solutions, PRIME admits a market-based implementation, where self-interested actors are incentivized to make choices that result in safe and efficient operation, without communicating private costs or constraints. Furthermore, PRIME can handle non-smooth objective functions, achieve fast convergence rates and rapid constraint satisfaction, and effectively reject measurement noise. We demonstrate PRIME on an AC optimal power flow problem, obtaining an efficient real-time nonlinear local marginal pricing scheme.
|
|
18:00-18:15, Paper ThC10.7 | |
A Quadratic Programming Approach for Network Distributed L1 Optimal Control |
|
Zou, Yuanji | University of Minnesota |
Elia, Nicola | University of Minnesota |
Keywords: Distributed control, Optimal control, Optimization
Abstract: We present an approximation method for the distributed l1 output-feedback control problem under network-induced sparsity and delays, assuming quadratic invariance with the plant. We avoid structured doubly co-prime factorizations and instead start from the plant’s spectral factorization. We introduce a mixed l1/ l2 norm with arbitrary truncation and formulate a jointly convex, two-stage optimization. We establish tight upper and lower bounds through a primal-dual analysis. We cast the method as a quadratic program and illustrate its effectiveness with a numerical example.
|
|
18:15-18:30, Paper ThC10.8 | |
Safety-Oriented Vulnerability Assessment of Discrete-Time Interconnected Control Systems |
|
Lu, Limin | Zhejiang University |
Luo, Xiaoyu | Boston University |
Zhao, Chengcheng | Zhejiang University |
Keywords: Distributed control, Cyber-Physical Security, Computer/Network Security
Abstract: Safety is critical to interconnected control systems composed of multiple interacting subsystems, which have become major targets of cyberattacks. Understanding how attacks on one or more subsystems propagate through the network and threaten overall system safety is crucial for defense. In this paper, we assess the safety-oriented vulnerability of discrete-time interconnected control systems. Specifically, we aim to identify the set of compromised subsystems, referred to as the most vulnerable set, that drives the system state into the unsafe region within the minimum time under given false data injection (FDI) attack signals. We propose a Vulnerable Set Selection Algorithm (VSSA) for efficiently searching for the most vulnerable set. Specifically, based on the observation that consecutive violations of the discrete-time control barrier functions (DTCBF) condition can drive the state into an unsafe region, we formulate a new optimization problem that maximizes the cumulative violation of the DTCBF condition over a given time horizon. To connect this formulation with the original vulnerability assessment problem, we iteratively solve the cumulative violation maximization problem using moment relaxation under an increasing time horizon to derive the most vulnerable set. Furthermore, we prove the optimality of the proposed algorithm. Finally, case studies on vehicle platoons illustrate the effectiveness of the proposed method.
|
|
ThC11 |
Oceania VI |
Control of Networks II |
Regular Session |
Chair: Hendrickx, Julien M. | UCLouvain |
Co-Chair: Altafini, Claudio | Linkoping University |
|
, Paper ThC11.0 | |
Scalable Controller Design for Consensus with Performance Considerations: A Phase-Theoretical Approach |
|
Wang, Dan | Nanjing University |
Chen, Wei | Peking University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Qiu, Li | Hong Kong Univ. of Sci. & Tech |
Keywords: Control of networks, Agents-based systems, Linear systems
Abstract: In this paper, we study the problem of designing a uniform controller for heterogeneous multi-agent systems to achieve consensus, taking into account transient performance requirement characterized by convergence rate and damping. Two main issues are addressed: 1) Under what conditions the problem is solvable? 2) When the problem is solvable, how to design such a uniform controller? To answer these questions, we define a measure of diversity of the agents through simultaneous phase alignment of a set of matrices, and define a measure of interaction quality using the essential phase of the Laplacian matrix of a graph. The main finding of the paper is a critical trade-off among the diversity of the agents, the interaction quality among them, and the desired damping performance that constitutes the solvability condition. We also propose a method to design the controller when the condition is satisfied. The analysis of departure angles of multivariable root loci plays a useful role in our study.
|
|
16:30-16:45, Paper ThC11.1 | |
Optimal Disturbance Decoupling Over Networks Via State Feedback |
|
Lebon, Luca Claude Gino | Linköping University |
Altafini, Claudio | Linkoping University |
Keywords: Control of networks, Networked control systems, Optimization algorithms
Abstract: This paper addresses the problem of optimal disturbance decoupling over networks through state feedback control. We propose an algorithm that isolates and eliminates the impact of disturbance nodes on specific target nodes to be protected, leveraging well-known properties of controlled invariance and reformulating them for systems over networks. The optimal solution, intended as a minimal-cardinality set of control inputs needed to achieve the decoupling, can be characterized as a function of the maximal controlled invariant set contained in the null space of the target nodes, and can be computed by solving a modified min-cut/max-flow problem. A closed-form solution for the state feedback matrix that achieves disturbance decoupling is also provided, and its graphical meaning is explained.
|
|
17:00-17:15, Paper ThC11.3 | |
Learning-Based Control of the Consensus Value in Unknown Graphs |
|
Gogianu, Florin | Technical University of Cluj-Napoca |
Busoniu, Lucian | Technical University of Cluj-Napoca |
Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Keywords: Control of networks, Machine learning
Abstract: We consider the problem of optimal budget allocation for controlling the consensus value over unknown networks. The network is represented by an unknown directed graph whose vertices corresponds to dynamic agents influencing each other. On top of that, at discrete instants, the agents are influenced by an external entity that intends to sway the consensus value to a desired target using a given control budget. Between two external influence instants, the states of the agents evolve continuously due to the interaction within the network. It has been proven that, in a known network, the marketer must use a water-filling strategy that targets the most influential agents first in order to optimize budget allocation. In our approach to the unknown-network setup, the marketer uses the evolution between two influence instants to update a learned model of the interaction graph and identify the most influential nodes. Then, the control allocation at the next marketing instant is done according to the water-filling strategy applied to the current model of the graph, and the procedure repeats. Our main analytical contribution states that the sub-optimality of the budget allocation induced by the approximation of the graph is proportional to the error of the learning algorithm. We perform an extensive numerical analysis illustrating the performance of our method and we suggest a regularization scheme that improves it further.
|
|
17:15-17:30, Paper ThC11.4 | |
Optimizing Weighted Hodge Laplacian Flows on Simplicial Complexes |
|
Hudoba de Badyn, Mathias | University of Oslo |
Summers, Tyler H. | University of Texas at Dallas |
Keywords: Network analysis and control
Abstract: Simplicial complexes are generalizations of graphs that describe higher-order network interactions among nodes in the graph. Network dynamics described by graph Laplacian flows have been widely studied in network science and control systems, and these can be generalized to simplicial complexes using Hodge Laplacians. We study weighted Hodge Laplacian flows on simplicial complexes. In particular, we develop a framework for weighted consensus dynamics based on weighted Hodge Laplacian flows and show some decomposition results for weighted Hodge Laplacians. We then show that two key spectral functions of the weighted Hodge Laplacians, the trace of the pseudoinverse and the smallest non-zero eigenvalue, are jointly convex in upper and lower simplex weights and can be formulated as semidefinite programs. Thus, globally optimal weights can be efficiently determined to optimize flows in terms of these functions. Numerical experiments demonstrate that optimal weights can substantially improve these metrics compared to uniform weights.
|
|
17:30-17:45, Paper ThC11.5 | |
Consensus on Open Multi-Agent Systems Over Graphs Sampled from Graphons |
|
Vizuete, Renato | UCLouvain |
Hendrickx, Julien M. | UCLouvain |
Keywords: Network analysis and control, Agents-based systems
Abstract: We show how graphons can be used to model and analyze open multi-agent systems, which are multi-agent systems subject to arrivals and departures, in the specific case of linear consensus. First, we analyze the case of replacements, where under the assumption of a deterministic interval between two replacements, we derive an upper bound for the disagreement in expectation. Then, we study the case of arrivals and departures, where we define a process for the evolution of the number of agents that guarantees a minimum and a maximum number of agents. Next, we derive an upper bound for the disagreement in expectation, and we establish a link with the spectrum of the expected graph used to generate the graph topologies. Finally, for stochastic block model (SBM) graphons, we prove that the computation of the spectrum of the expected graph can be performed based on a matrix whose dimension depends only on the graphon and is independent of the number of agents.
|
|
17:45-18:00, Paper ThC11.6 | |
Stabilizing Populations of Well-Behaved Learning Agents with Exogenous Dynamics |
|
Certorio, Jair | University of Maryland |
Martins, Nuno C. | University of Maryland |
Keywords: Game theory, Stability of nonlinear systems, Compartmental and Positive systems
Abstract: Our work addresses the challenge of determining convergence in scenarios where the aggregate strategic decision of a large number of learning agents influences the dynamics of an exogenous system, potentially leading to oscillations. We obtain an incentive design method based on broad properties of bounded-rational learning rules, such that the particular learning method of the agents does not need to be fully known by the policymaker, and determine conditions on the exogenous systems that can be jointly stabilized with the population to a desirable set of equilibra. Our method includes design parameters that can be used to select the set of equilibra. We exemplify our results using an epidemic disease model coupled to a population model, a system known to exhibit oscillations due to the interplay of the two models, and we design a stabilizing incentive for the epidemic-population model. Unlike previous results, our method does not require canceling the intrinsic cost to each strategy.
|
|
18:00-18:15, Paper ThC11.7 | |
Minimum Clustering of Matrices Based on Phase Alignment |
|
Wu, Honghao | Southern University of Science and Technology |
Ding, Kemi | Southern University of Science and Technology |
Qiu, Li | Hong Kong Univ. of Sci. & Tech |
Keywords: Control of networks, Optimization algorithms, Linear systems
Abstract: Coordinating multi-agent systems requires balancing synchronization performance and controller implementation costs. To this end, we classify agents by their intrinsic properties, enabling each group to be controlled by a uniform controller and thus reducing the number of unique controller types required. Existing centralized control methods, despite their capability to achieve high synchronization performance with fewer types of controllers, suffer from critical drawbacks such as limited scalability and vulnerability to single points of failure. On the other hand, distributed control strategies, where controllers are typically agent-dependent, result in the type of required controllers increasing proportionally with the size of the system. This paper introduces a novel phase-alignment-based framework to minimize the type of controllers by strategically clustering agents with aligned synchronization behaviors. Leveraging the intrinsic phase properties of complex matrices, we formulate a constrained clustering problem and propose a hierarchical optimization method combining recursive exact searches for small-scale systems and scalable stochastic approximations for large-scale networks. This work bridges theoretical phase analysis with practical control synthesis, offering a cost-effective solution for large-scale multi-agent systems. The theoretical results applied for the analysis of a 50-agent network illustrate the effectiveness of the proposed algorithms.
|
|
ThC12 |
Oceania X |
Optimization Algorithms II |
Regular Session |
Chair: Patrinos, Panagiotis | KU Leuven |
Co-Chair: Pu, Ye | The University of Melbourne |
|
16:30-16:45, Paper ThC12.1 | |
Stochastic Gradient Descent with Strategic Querying |
|
Jiang, Nanfei | University of California, Santa Barbara |
Wai, Hoi-To | Chinese University of Hong Kong |
Alizadeh, Mahnoosh | University of California Santa Barbara |
Keywords: Optimization algorithms, Machine learning, Distributed control
Abstract: This paper considers a finite-sum optimization problem under first-order queries and investigates the benefits of strategic querying on stochastic gradient-based methods compared to uniform querying strategy. We first introduce Oracle Gradient Querying (OGQ), an idealized algorithm that selects one user's gradient yielding the largest possible expected improvement (EI) at each step. However, OGQ assumes oracle access to the gradients of all users to make such a selection, which is impractical in real-world scenarios. To address this limitation, we propose Strategic Gradient Querying (SGQ), a practical algorithm that achieves faster convergence than SGD while making only one query per iteration. For smooth objective functions satisfying the Polyak-Lojasiewicz condition, we show that under the assumption of EI heterogeneity, OGQ achieves faster transient-state convergence and reduces steady-state variance, while SGQ improves transient-state convergence over SGD. Our numerical experiments validate our findings in theory.
|
|
16:45-17:00, Paper ThC12.2 | |
Stochastic Online Feedback Optimization for Networks of Non-Compliant Agents |
|
Kalil Lauand, Caio | University of Florida |
Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Keywords: Optimization algorithms, Networked control systems, Power systems
Abstract: In several applications of online optimization to networked systems such as power grids and robotic networks, information about the system model and its disturbances is not generally available. Within the optimization community, increasing interest has been devoted to the framework of online feedback optimization (OFO), which aims to address these challenges by leveraging real-time input-output measurements to empower online optimization. We extend the OFO framework to a stochastic setting, allowing the subsystems comprising the network (the agents) to be non-compliant. This means that the actual control input implemented by the agents is a random variable that depending upon the control setpoint generated by the OFO algorithm. Mean-square error bounds are obtained for the general algorithm and the theory is illustrated in application to power systems.
|
|
17:00-17:15, Paper ThC12.3 | |
Two-Timescale EXTRA for Distributed Smooth Non-Convex Optimization |
|
Peng, Zeyu | The University of Melbourne |
Farokhi, Farhad | The University of Melbourne |
Pu, Ye | The University of Melbourne |
Keywords: Optimization algorithms, Optimization, Lyapunov methods
Abstract: In this paper, we study distributed optimization with smooth non-convex local objectives. We propose a novel variant of the well-known EXact firsT-ordeR Algorithm (EXTRA), called Two-timescale EXTRA, by introducing two distinct step-sizes. Leveraging the two-timescale strategy, we construct a Lyapunov function and establish the sub-linear convergence of Two-timescale EXTRA to a consensual first-order stationary point. Additionally, we introduce an off-line sequential method for algorithm parameter selection, and the numerical results support the theoretical guarantees.
|
|
17:15-17:30, Paper ThC12.4 | |
Blocked Cholesky Factorization Updates of the Riccati Recursion Using Hyperbolic Householder Transformations |
|
Pas, Pieter | KU Leuven |
Patrinos, Panagiotis | KU Leuven |
Keywords: Optimization algorithms, Optimal control
Abstract: Newton systems in quadratic programming (QP) methods are often solved using direct Cholesky or LDLT factorizations. When the linear systems in successive iterations differ by a low-rank modification (as is common in active set and augmented Lagrangian methods), updating the existing factorization can offer significant performance improvements over recomputing a full Cholesky factorization. We review the hyperbolic Householder transformation, and demonstrate its usefulness in describing low-rank Cholesky factorization updates. By applying this hyperbolic Householder-based framework to the well-known Riccati recursion for solving saddle-point problems with optimal control structure, we develop a novel algorithm for updating the factorizations used in optimization solvers for model predictive control (MPC). Specifically, the proposed method can be used to efficiently solve the semismooth Newton systems that are at the core of the QPALM-OCP solver. Optimized open-source implementations of the proposed factorization update routines are made available.
|
|
17:30-17:45, Paper ThC12.5 | |
ALADIN-beta: A Distributed Optimization Algorithm for Solving MPCC Problems |
|
Wang, Yifei | Shanghai Jiao Tong University |
Wu, Shuting | Henan Academy of Science |
Yang, Genke | Shanghai Jiao Tong University |
Chu, Jian | Shanghai Jiao Tong University |
Rikos, Apostolos I. | The Hong Kong University of Science and Technology (Gz) |
Du, Xu | The Hong Kong University of Science and Technology (Guangzhou) |
Keywords: Optimization algorithms, Boolean control networks and logic networks, Numerical algorithms
Abstract: Mathematical Programs with Complementarity Constraints (MPCC) are critical in various real-world applications but notoriously challenging due to non-smoothness and degeneracy from complementarity constraints. The ell_1-Exact Penalty-Barrier enhanced texttt{IPOPT} improves performance and robustness by introducing additional inequality constraints and decision variables. However, this comes at the cost of increased computational complexity due to the higher dimensionality and additional constraints introduced in the centralized formulation. To mitigate this, we propose a distributed structure-splitting reformulation that decomposes these inequality constraints and auxiliary variables into independent sub-problems. Furthermore, we introduce Augmented Lagrangian Alternating Direction Inexact Newton (ALADIN)-beta, a novel approach that integrates the ell_1-Exact Penalty-Barrier method with ALADIN to efficiently solve the distributed reformulation. Numerical experiments demonstrate that even without a globalization strategy, the proposed distributed approach achieves fast convergence while maintaining high precision.
|
|
17:45-18:00, Paper ThC12.6 | |
A Time Splitting Based Optimization Method for Nonlinear MHE |
|
Wu, Shuting | Henan Academy of Science |
Wang, Yifei | Shanghai Jiao Tong University |
Wang, Jingzhe | University of Pittsburgh |
Rikos, Apostolos I. | The Hong Kong University of Science and Technology (Gz) |
Du, Xu | The Hong Kong University of Science and Technology (Guangzhou) |
Keywords: Optimization algorithms, Estimation, Numerical algorithms
Abstract: This paper presents computationally efficient algorithms for solving nonlinear Moving Horizon Estimation (MHE) problems, which face challenges due to the textit{curse of dimensionality}. Specifically, we first introduce a distributed reformulation utilizing a time-splitting technique. Leveraging this, we develop the Efficient Gauss-Newton Augmented Lagrangian Alternating Direction Inexact Newton (ALADIN) algorithm to improve efficiency. To address limited computational power in some sub-problem solvers, we propose the Efficient Sensitivity Assisted ALADIN, allowing inexact solutions without compromising performance. Additionally, we propose a Distributed Sequential Quadratic Programming (SQP) method for scenarios with no computational resources for sub-problems. Numerical experiments on a differential drive robot MHE problem demonstrate that our algorithms achieve both high accuracy and computational efficiency, meeting real-time requirements.
|
|
18:00-18:15, Paper ThC12.7 | |
On the Perturbed Projection-Based Distributed Gradient-Descent Algorithm: A Fully-Distributed Adaptive Redesign |
|
Bazizi, Tarek | GIPSA-Lab, CNRS, Grenoble INP |
Maghenem, Mohamed Adlene | Gipsa Lab, CNRS, France |
Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
Loria, Antonio | CNRS |
Panteley, Elena | CNRS |
Keywords: Optimization algorithms, Adaptive systems, Lyapunov methods
Abstract: In this work, we revisit a classical distributed gradient-descent algorithm, introducing an interesting class of perturbed multi-agent systems. The state of each subsystem represents a local estimate of a solution to the global optimization problem. Thereby, the network is required to minimize local cost functions, while gathering the local estimates around a common value. Such a complex task suggests the interplay of consensus-based dynamics with gradient-descent dynamics. The latter descent dynamics involves the projection operator, which is assumed to provide corrupted projections of a specific form, reminiscent of existing (fast) projection algorithms. Hence, for the resulting class of perturbed networks, we are able to adaptively tune some gains in a fully distributed fashion, to approach the optimal consensus set up to arbitrary-desired precision.
|
|
18:15-18:30, Paper ThC12.8 | |
Convergence Rates of Lq Penalty Methods for Nonsmooth Nonconvex Optimization with Nonlinear Equality Constraints |
|
El Bourkhissi, Lahcen | University Polytechnic of Bucharest |
Necoara, Ion | Universitatea Nationala De Stiinta Si Tehnologie POLITEHNICA Buc |
Keywords: Optimization algorithms, Optimization, Predictive control for nonlinear systems
Abstract: In this paper, we consider nonsmooth nonconvex optimization problems with nonlinear equality constraints having the objective function in composite form. To solve this problem, we introduce a linearized lq penalty method, where q is the parameter defining the norm used in the construction of the penalty function. Our method involves linearizing the differentiable part of the penalty function and adding a quadratic regularization. This approach requires the computation of a proximal step, which becomes the next iterate. By using an adaptive choice of the regularization parameter, we establish that the iterates of our method converge to a first-order stationary point and derive explicit convergence rates. Finally, we put our theory into practice and evaluate the performance of the proposed algorithm on model predictive control for an inverted pendulum system.
|
|
ThC13 |
Oceania IX |
Game Theory III |
Regular Session |
Chair: Franci, Barbara | Politecnico Di Torino |
Co-Chair: Nax, Heinrich H. | ETHZ |
|
16:30-16:45, Paper ThC13.1 | |
The Limits of Fairness of the Variational Generalized Nash Equilibrium |
|
Hall, Sophie | ETH |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Nax, Heinrich H. | ETHZ |
Bolognani, Saverio | ETH Zurich |
Keywords: Game theory, Agents-based systems, Cooperative control
Abstract: Generalized Nash equilibrium (GNE) problems are commonly used to model strategic interactions between self-interested agents who are coupled in cost and constraints. Specifically, the variational GNE, a refinement of the GNE, is often selected as the solution concept due to its non-discriminatory treatment of agents by charging a uniform ``shadow price" for shared resources. We study the fairness concept of v-GNEs from a comparability perspective and show that it makes an implicit assumption of unit comparability of agent's cost functions, one of the strongest comparability notions. Further, we introduce a new solution concept, f-GNE in which a fairness metric is chosen a priori which is compatible with the comparability at hand. We introduce an electric vehicle charging game to demonstrate the fragility of v-GNE fairness and compare it to the f-GNE under various fairness metrics.
|
|
16:45-17:00, Paper ThC13.2 | |
A Set-Theoretic Robust Control Approach for Linear Quadratic Games with Unknown Counterparts |
|
Bianchin, Francesco | Technische Universität München |
Lefringhausen, Robert | Technical University of Munich |
Gaetan, Elisa | University of Modena and Reggio Emilia |
Tesfazgi, Samuel | Technical University of Munich |
Hirche, Sandra | Technische Universität München |
Keywords: Game theory, Robust control, Optimal control
Abstract: Ensuring robust decision-making in multi-agent systems is challenging when agents have distinct, possibly conflicting objectives and lack full knowledge of each other’s strategies. This is clear in safety-critical applications such as human-robot interaction and assisted driving, where uncertainty arises not only from unknown adversary strategies but also from external disturbances. To address this, the paper proposes a robust adaptive control approach based on linear quadratic differential games. The method allows a controlled agent to iteratively refine its belief about the adversary’s strategy and disturbances using a set-membership approach, while simultaneously adapting its policy to guarantee robustness against the uncertain adversary policy and improve performance over time. We formally derive theoretical guarantees on the robustness of the proposed control scheme and its convergence to ε-Nash strategies. The effectiveness of our approach is demonstrated in numerical simulations.
|
|
17:00-17:15, Paper ThC13.3 | |
Distributed Nash Equilibrium Seeking in Non-Monotone Games Over the Simplex |
|
Tatarenko, Tatiana | TU Darmstadt |
Etesami, Rasoul | University of Illinois at Urbana-Champaign |
Keywords: Game theory, Distributed control, Optimization
Abstract: In this work, we present a novel characterization of approximate Nash equilibria in a class of convex games over the simplex. To this end, we regularize the utility functions using a Shannon entropy term, establish a connection between the solutions of the regularized game and the set of Nash equilibria, and formulate a multi-objective optimization problem to solve the regularized game. Based on the properties of the stationary points of this optimization problem, we propose two distributed heuristic algorithms to compute an approximate Nash equilibrium of the original game.
|
|
17:15-17:30, Paper ThC13.4 | |
Smooth Games of Configuration in the Linear-Quadratic Setting |
|
Milzman, Jesse | DEVCOM Army Research Laboratory |
Mao, Jeffrey | NYU |
Loianno, Giuseppe | UC Berkeley |
Keywords: Game theory, Decentralized control, Linear parameter-varying systems
Abstract: Dynamic game theory offers a toolbox for formalizing and solving both cooperative and non-cooperative individual strategies in multi-agent scenarios. However, the optimal configuration of such games remains largely unexplored. While there is existing literature on the parametrization of dynamic games, little research examines this parametrization from a strategic perspective—where each agent's configuration choice is influenced by the decisions of others. In this work, we introduce the concept of a game of configuration, providing a framework for the strategic fine-tuning of differential games. We define a game of configuration as a two-stage game within the setting of finite-horizon, affine-quadratic (AQ) differential games. In the first stage, each player chooses their corresponding configuration parameter, which will impact their dynamics and costs in the second stage. We provide the subgame perfect solution concept and a method for computing first stage cost gradients over the configuration space. This then allows us to formulate a gradient-based method for searching for local solutions to the configuration game, as well as provide necessary conditions for equilibrium configurations over their downstream (second stage) trajectories. We conclude by demonstrating the effectiveness of our approach in example AQ systems, both zero-sum and general-sum.
|
|
17:30-17:45, Paper ThC13.5 | |
Price Equilibria with Positive Margins in Loyal-Strategic Markets with Discrete Prices |
|
Wadhwa, Gurkirat | IIT Bombay |
Verma, Akansh | IIT Bombay |
Veeraruna, Kavitha | IIT Bombay, India |
Sinha, Priyank | IIT Bombay |
Keywords: Game theory, Modeling
Abstract: In competitive supply chains (SCs), pricing decisions are crucial, as they directly impact market share and profitability. Traditional SC models often assume continuous pricing for mathematical convenience, overlooking the practical reality of discrete price increments driven by currency constraints. Additionally, customer behavior, influenced by loyalty and strategic considerations, plays a significant role in purchasing decisions. To address these gaps, this study examines an SC model involving one supplier and two manufacturers, incorporating realistic factors such as customer demand segmentation and discrete price setting. Our analysis shows that the Nash equilibria (NE) among manufacturers are not unique. It also reveals that low denomination factors can lead to instability as the corresponding game does not have pure NE. Numerical simulations demonstrate that even small changes in price increments significantly affect the competitive dynamics and market share distribution.
|
|
17:45-18:00, Paper ThC13.6 | |
When More Information Means Less: A Case Study in Asymmetric All-Pay Auctions |
|
Diaz-Garcia, Gilberto | University of California, Santa Barbara |
Paarporn, Keith | University of Colorado, Colorado Springs |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Agents-based systems
Abstract: Informational advantages represent a strategic opportunity in a large class of strategic games. In general, the level of uncertainty about the environment perceived by a competitor directly affects their performance. Therefore, having a clear understanding between uncertainty and performance has become a relevant research interest. Intuitively, it is expected that reducing uncertainty offers better guarantees in terms of the resulting performance. However, in this work, we provide a characterization that shows that reduced uncertainty does not guarantee better performance. Furthermore, it can be detrimental. We present such characterization in the context of asymmetric all-pay auctions, a standard model to study competitive interactions. By analyzing the resulting behavior of the competitors in terms of the Bayesian Nash equilibrium, we identify scenarios where acquiring better information not only does not affect the performance but also can be detrimental to it.
|
|
18:00-18:15, Paper ThC13.7 | |
Actively Learning Equilibria in Nash Games with Misleading Information |
|
Franci, Barbara | Politecnico Di Torino |
Fabiani, Filippo | IMT School for Advanced Studies Lucca |
Bemporad, Alberto | IMT School for Advanced Studies Lucca |
Keywords: Stochastic systems, Game theory, Optimization
Abstract: We develop a scheme based on active learning to compute equilibria in a generalized Nash equilibrium problem (GNEP). Specifically, an external observer (or entity), with little knowledge on the multi-agent process at hand, collects sensible data by probing the agents’ best-response (BR) mappings, which are then used to recursively update local parametric estimates of these mappings. Unlike [1], we consider the realistic case in which the agents share corrupted information with the external entity for, e.g., protecting their privacy. Inspired by a popular approach in stochastic optimization, we endow the external observer with an inexact proximal scheme for updating the local BR proxies. This technique will prove key to establishing the convergence of our scheme under standard assumptions, thereby enabling the external observer to predict an equilibrium strategy even when relying on masked information.
|
|
18:15-18:30, Paper ThC13.8 | |
Punitive Policies to Combat Misreporting in Dynamic Supply Chains |
|
Dhiman, Madhu | IIT Bombay |
Maurya, Atul | IIT Bombay |
Veeraruna, Kavitha | IIT Bombay, India |
Sinha, Priyank | IIT Bombay |
Keywords: Game theory, Stochastic systems, Time-varying systems
Abstract: Wholesale price contracts are known to be associated with double marginalization effects, which prevent supply chains from achieving their true market share. In a dynamic setting under information asymmetry, these inefficiencies manifest in the form of misreporting of the market potential by the manufacturer to the supplier, again leading to the loss of market share. We pose the dynamics of interaction between the supplier and manufacturer as the Stackelberg game and develop theoretical results for optimal punitive strategies that the supplier can implement to ensure that the manufacturer truthfully reveals the market potential in the single-stage setting. Later, we validate these results through the randomly generated, Monte-Carlo simulation based numerical examples.
|
|
ThC14 |
Galapagos III |
Robotics and Autonomous Systems II |
Regular Session |
Chair: Findeisen, Rolf | TU Darmstadt |
Co-Chair: Halder, Udit | University of South Florida |
|
16:30-16:45, Paper ThC14.1 | |
Synthesis of Dynamic Responses of Redundant Robot Manipulators |
|
Moreno Charco, Josue Raphael | Escuela Superior Politécnica Del Litoral |
Patiño Miñán, José Johil | Cardiff University |
Helguero, Carlos G. | Escuela Superior Politécnica Del Litoral |
Saldarriaga, Carlos | Escuela Superior Politécnica Del Litoral |
Keywords: Robotics, Control applications, Modeling
Abstract: In this paper, we present and validate a novel methodology that synthesizes the dynamic response of robotic manipulators performing Cartesian impedance-related tasks. By leveraging linear system theory and addressing the kinematic redundancies inherent in the system, we derive and directly solve equations to compute the damping parameters based on desired control criteria (damping ratios). The proposed equations correspond to the complex eigenvalues governing the system’s dynamics. Unlike existing approaches, this method bypasses complex optimization or iterative processes, providing direct solutions to damping or stiffness parameters. Our approach applies broadly, irrespective of specific redundancy conditions or Cartesian task coordinates. Using a 7-DoF robot, we demonstrate that multiple solutions can impose similar dynamic responses. We further show that the system’s natural frequencies must align with defined criteria, and while imposing damping ratios may suggest infinite possible frequency values, physically meaningful natural frequencies are calculated based on robot geometry, stiffness, and mass matrices. This ensures that these values are not arbitrary. This methodology contributes significantly to the field by simplifying implementation and improving system stability and predictability.
|
|
16:45-17:00, Paper ThC14.2 | |
Towards Real-Time Personalized Control in Wearable Robotics: A Hierarchical Architecture for Lower-Limb Assistance |
|
Ahmadi, Arjang | Technical University of Darmstadt |
Firouzi, Vahid | Technical University of Darmstadt |
Haufe, Dennis | Technical University of Darmstadt |
Hirt, Sebastian | TU Darmstadt |
Seyfarth, Andre | TU Darmstadt |
Sawicki, Gregory | Georgia Institute of Technology |
Findeisen, Rolf | TU Darmstadt |
Ahmad Sharbafi, Maziar | TU Darmstadt |
Keywords: Control applications, Human-in-the-loop control, Robotics
Abstract: Personalized and effective control is essential for the acceptance of wearable robotic systems such as lower-limb exoskeletons. This paper introduces the adaptation and control problem in these systems, outlines key challenges, and presents a hierarchical control framework considering the BiArticular Thigh EXosuit (BATEX) as an example. Lower-limb exoskeletons are wearable robots that assist walking by applying joint-level torques in coordination with the user. The proposed architecture includes a low-level hybrid controller for velocity tracking, a mid-level neuromechanical controller for stiffness modulation based on biomechanical feedback, and a high-level user-in-the-loop gain adaptation. To explore the role of predictive methods, a model predictive controller is implemented at the actuator level, improving force tracking, disturbance rejection, and constraint handling compared to conventional control, outlining the potential of a unified predictive control framework exploited at the different levels. Experiments with human subjects indicate enhanced gait performance, demonstrating the promise of hierarchical and predictive control for adaptive, user-centered assistance in wearable robotics.
|
|
17:00-17:15, Paper ThC14.3 | |
Full-Dynamics Analytical Modeling of Normal Forces for Skid-Steering Mobile Heavy-Duty Manipulators with Actively Articulated Suspension |
|
Paz, Alvaro | Tampere University |
Bahari, Mohammad | Tampere University |
Mattila, Jouni | Tampere University |
Keywords: Modeling, Robotics, Simulation
Abstract: This paper presents a geometric approach to calculating the ground reaction normal forces of a four-wheel heavy-duty parallel-serial mobile manipulator in an analytical manner. Such a skid-steering mobile platform is endowed with an actively articulated suspension, thus becoming a rigid multibody system itself. Our solution exploits the wheel's dynamics decomposition to project the wheels' internal dynamics and friction forces, preserving all their mechanical effects without hard assumptions. Developing a closed-kinematic chain dynamical analysis, our approach embraces the parallel-serial manipulator and the articulated suspension in a unique screw-theory formulation. Additionally, with respect to previous approaches, our normal forces solutions account for all nonlinear dynamics and accelerations in both the articulated suspension and the wheels. This results in an accuracy enhancement, as demonstrated by a reduction of the average error in normal forces computation by half while traversing uneven terrains when simulating a 7-DoF mobile manipulator with a 2-DoF suspension. We include a losing-contacts robust formulation.
|
|
17:15-17:30, Paper ThC14.4 | |
PROD: Palpative Reconstruction of Deformable Objects through Elastostatic Signed Distance Functions |
|
El-Kebir, Hamza | University of Illinois at Urbana-Champaign |
Keywords: Flexible structures, Robotics, Biologically-inspired methods
Abstract: We introduce PROD (Palpative Reconstruction of Deformables), a novel method for reconstructing the shape and mechanical properties of deformable objects using elastostatic signed distance functions (SDFs). Unlike traditional approaches that rely on purely geometric or visual data, PROD integrates palpative interaction---measured through force-controlled surface probing---to estimate both the static and dynamic response of soft materials. We model the deformation of an object as an elastostatic process and derive a governing Poisson equation for estimating its SDF from a sparse set of pose and force measurements. By incorporating steady-state elastodynamic assumptions, we show that the undeformed SDF can be recovered from deformed observations with provable convergence. Our approach also enables the estimation of material stiffness by analyzing displacement responses to varying force inputs. We demonstrate the robustness of PROD in handling pose errors, non-normal force application, and curvature errors in simulated soft body interactions. These capabilities make PROD a powerful tool for reconstructing deformable objects in applications ranging from robotic manipulation to medical imaging and haptic feedback systems.
|
|
17:30-17:45, Paper ThC14.5 | |
Global Obstacle Avoidance Using Synergistic Stream Functions |
|
Casau, Pedro | University of Aveiro, NIF 501 461 108, 29335 / 2025 |
Keywords: Hybrid systems, Robotics, Supervisory control
Abstract: Stream functions, derived from inviscid fluid flow theory, offer a natural method for obstacle avoidance in robotics. Like other continuous feedback methods, they introduce stagnation points where the velocity of a robot drops to zero. This paper addresses this limitation through a synergistic hybrid feedback approach, ensuring that the robot is able to reach a desired setpoint from any initial condition. Our results illustrate the behaviour of the closed-loop system from undesirable initial conditions.
|
|
17:45-18:00, Paper ThC14.6 | |
Statics of Continuum Planar Grasping |
|
Halder, Udit | University of South Florida |
Keywords: Robotics, Optimal control, Modeling
Abstract: Continuum robotic grasping, inspired by biological appendages such as octopus arms and elephant trunks, provides a versatile and adaptive approach to object manipulation. Unlike conventional rigid-body grasping, continuum robots leverage distributed compliance and whole-body contact to achieve robust and dexterous grasping. This paper presents a control-theoretic framework for analyzing the statics of continuous contact with a planar object. The governing equations of static equilibrium of the object are formulated as a linear control system, where the distributed contact forces act as control inputs. To optimize grasping performance, a constrained optimal control problem is posed to minimize contact forces required to achieve a static grasp, with solutions derived using the Pontryagin Maximum Principle. Furthermore, two optimization problems are introduced: (i) for assigning a measure to the quality of a particular grasp, which generalizes a (rigid-body) grasp quality measure in the continuum case, and (ii) for finding the best grasping configuration that maximizes the continuum grasp quality. Several numerical results are also provided to elucidate our methods.
|
|
18:00-18:15, Paper ThC14.7 | |
Modeling and Controls of Fluid-Structure Interactions (FSI) in Dynamic Morphing Flight |
|
Gupta, Bibek | Northeastern University |
Sihite, Eric | Northeastern University |
Ramezani, Alireza | Northeastern University |
Keywords: Biologically-inspired methods, Flight control, Robotics
Abstract: The primary aim of this study is to enhance the accuracy of our aerodynamic Fluid-Structure Interaction (FSI) model to support the controlled tracking of 3D flight trajectories by Aerobat, which is a dynamic morphing winged drone. Building upon our previously documented Unsteady Aerodynamic model rooted in horseshoe vortices, we introduce a new iteration of Aerobat, labeled as version beta, which is designed for attachment to a Kinova arm. Through a series of experiments, we gather force-moment data from the robotic arm attachment and utilize it to fine-tune our unsteady model for banking turn maneuvers. Subsequently, we employ the tuned FSI model alongside a collocation control strategy to accomplish 3D banking turns of Aerobat within simulation environments. The primary contribution lies in presenting a methodical approach to calibrate our FSI model to predict complex 3D maneuvers and successfully assessing the model's potential for closed-loop flight control of Aerobat using an optimization-based collocation method.
|
|
18:15-18:30, Paper ThC14.8 | |
Robust Signal Decompositions on the Circle |
|
Köse, Aral | Bogazici University |
Liberzon, Daniel | Univ of Illinois, Urbana-Champaign |
Keywords: Sensor fusion, Sensor networks, Robotics
Abstract: We consider the problem of decomposing a piecewise constant function on the circle into a sum of indicator functions of closed circular disks in the plane, whose number and location are not a priori known. This represents a situation where an agent moving on the circle is able to sense its proximity to some landmarks, and the goal is to estimate the number of these landmarks and their possible locations---which can in turn enable control tasks such as motion planning and obstacle avoidance. Moreover, the exact values of the function at its discontinuities (which correspond to disk boundaries for the individual indicator functions) are not assumed to be known to the agent. We introduce suitable notions of robustness and degrees of freedom to single out those decompositions that are more desirable, or more likely, given this non-precise data collected by the agent. We provide a characterization of robust decompositions and give a procedure for generating all such decompositions. We also compute the number of possible robust decompositions (when they exist).
|
|
ThC15 |
Capri II |
Stochastic Systems I |
Regular Session |
Chair: Aubin-Frankowski, Pierre-Cyril | ENPC, Institut Polytechnique De Paris, , France |
Co-Chair: Wisniewski, Rafal | Aalborg University |
|
16:30-16:45, Paper ThC15.1 | |
Stochastic Robust W-Infinity Optimal Control |
|
Cardoso, Daniel Neri | Federal University of Minas Gerais |
Raffo, Guilherme Vianna | Federal University of Minas Gerais |
Keywords: Stochastic optimal control, Optimal control, Robust control
Abstract: This paper introduces a novel robust W-infinity optimal control formulation for stochastic systems, aiming to achieve rapid disturbance mitigation under stochastic uncertainties. The proposed approach requires the cost variable to belong to a Sobolev space, which incorporates the system dynamics into the cost functional, introducing predictive features into the control design. The stochastic W-infinity optimal control problem is formulated via dynamic programming through the Hamilton-Jacobi-Bellman-Isaacs (HJBI) equation for a general class of stochastic systems. Thereafter, to address the challenges associated with solving the HJBI equation, we develop linear matrix inequality (LMI) conditions which transform the optimal control problem into a semidefinite programming (SDP) problem for linear systems affected by state-, input-, and disturbance-dependent noise. Comparative analysis with a stochastic H-infinity controller is performed to corroborate the mathematical developments.
|
|
16:45-17:00, Paper ThC15.2 | |
Solving LQ Stochastic Control and Defining the Controllability Gramian through Kernel Methods |
|
Aubin-Frankowski, Pierre-Cyril | ENPC, Institut Polytechnique De Paris, , France |
Bensoussan, Alain | UTD University of Texas at Dallas |
Keywords: Stochastic optimal control, Stochastic systems, Time-varying systems
Abstract: We introduce a reproducing kernel approach to the linear-quadratic (LQ) stochastic control problem, where control affects both drift and volatility. Unlike previous methods, our framework extends the controllability Gramian to general stochastic systems. Existing approaches, such as in (Liu and Pen, 2010), force to restrict to scalar noise and full control on volatility. Our method removes these limitations, establishing a direct link between deterministic and stochastic Gramians.
|
|
17:00-17:15, Paper ThC15.3 | |
Model Predictive Control of Semi-Markov Jump Systems Via Learning-Based Koopman Operator |
|
Ning, Zepeng | Nanyang Technological University |
Fang, Xu | Dalian University of Technology |
Xie, Lihua | Nanyang Tech. Univ |
Keywords: Stochastic systems, Switched systems, Markov processes
Abstract: This paper investigates model predictive control (MPC) for semi-Markov jump (SMJ) nonlinear systems based on learning-based Koopman operators (KOs). A multi-mode neural network is trained to obtain mode-dependent lifting functions, KOs, and reconstruction matrices, such that the original nonlinear SMJ system can be represented as a higher-dimensional SMJ linear system, where the approximation errors are treated as disturbances. Afterward, an MPC strategy is formulated for the KO-based SMJ linear system, which avoids the non-convex optimization arising from nonlinearity. The recursive feasibility of the MPC and the mean-square stability and the boundedness of the closed-loop SMJ system are theoretically guaranteed. Finally, a multi-mode robotic arm serves as a case study for validation.
|
|
17:15-17:30, Paper ThC15.4 | |
Stability Conditions for Discrete-Time Stochastic Systems Introduced on Left-Bounded Time Interval |
|
Hayashi, Daiki | Kyoto University |
Hosoe, Yohei | Kyoto University |
Kawano, Yu | Hiroshima University |
Hagiwara, Tomomichi | Kyoto Univ |
Keywords: Stochastic systems, Stability of linear systems, Time-varying systems
Abstract: Stability conditions for discrete-time stochastic systems characterized by general stochastic processes have been discussed in earlier articles, under the assumption that the systems are introduced on the unbounded time interval. This paper newly develops theoretical bridges between the frameworks of those stability conditions and the stochastic systems introduced on the left-bounded time interval, and explicitly shows necessary and sufficient stability conditions on the left-bounded time interval. The results in this paper are expected to be useful for ensuring the validity of the future developments of control theory on the unbounded time interval.
|
|
17:30-17:45, Paper ThC15.5 | |
Safety Robustness for Time-Inhomogeneous Markov Chains |
|
Bujorianu, Luminita Manuela | University College London |
Wisniewski, Rafal | Aalborg University |
Keywords: Stochastic systems, Markov processes, Robust control
Abstract: In this paper, we explore safety concepts for time-inhomogeneous Markov chains and characterize them using barrier certificates and the infinitesimal generator. We then examine safety robustness, analyzing the impact of perturbations on safety properties. Additionally, we establish conditions for ensuring safety under dynamic uncertainties and geometric ergodicity.
|
|
17:45-18:00, Paper ThC15.6 | |
Reverse-Time Diffusion Processes in Discrete Time |
|
Dasgupta, Soura | Univ. of Iowa |
Anderson, Brian D.O. | Australian National University |
Keywords: Markov processes, Stochastic systems, Nonlinear systems
Abstract: Generative AI relies on finding reverse models for linear discrete time forward diffusions with non-Gaussian initial states, but uses indirect methods for reversal as there is no theory to effect a direct reversal in discrete time. We provide sufficient conditions that guarantee the existence of a reverse time diffusion and give a method of meeting them. We also give a necessary and sufficient condition for the reverse model to be input-affine.
|
|
18:00-18:15, Paper ThC15.7 | |
An Exploration-Free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System |
|
Gornet, Jonathan | Washington University in Saint Louis |
Mo, Yilin | Tsinghua University |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Learning, Kalman filtering, Stochastic systems
Abstract: In stochastic multi-armed bandits, a major problem the learner faces is the trade-off between exploration and exploitation. Recently, exploration-free methods---methods that commit to the action predicted to return the highest reward---have been studied from the perspective of linear bandits. In this paper, we introduce a linear bandit setting where the reward is the output of a linear Gaussian dynamical system. Motivated by a problem encountered in hyperparameter optimization for reinforcement learning, where the number of actions is much higher than the number of training iterations, we propose Kalman filter Observability Dependent Exploration (KODE), an exploration-free method that utilizes the Kalman filter predictions to select actions. Our major contribution of this work is our discovery that the performance of the proposed method is dependent on the observability properties of the underlying linear Gaussian dynamical system. We evaluate KODE via two different metrics: regret, which is the cumulative expected difference between the highest possible reward and the reward sampled by KODE, and action alignment, which measures how closely KODE's chosen action aligns with the linear Gaussian dynamical system's state variable. To provide intuition on the performance, we prove that KODE implicitly encourages the learner to explore actions depending on the observability of the linear Gaussian dynamical system. This method is compared to several well-known stochastic multi-armed bandit algorithms to validate our theoretical results.
|
|
18:15-18:30, Paper ThC15.8 | |
Finite-Approximate Controllability of Impulsive Stochastic Functional Evolution Equations |
|
Shukla, Nidhi | Indian Institute of Technology Roorkee |
Dabas, Jaydev | IIT Roorkee |
Keywords: Stochastic systems, Nonlinear systems, Delay systems
Abstract: This paper investigates the finite-approximate controllability (F-AC) of semilinear impulsive stochastic functional evolution equations in a Hilbert space. First, we establish the existence and uniqueness of a mild solution under suitable conditions. Then, we derive the F-AC results for the considered system. The nonlinear functions adhere to Caratheodory conditions, which offer broader applicability. The Picard iterations, fixed-point principles, and the resolvent-like operator technique are used to derive our results. Finally, an example is presented to validate the abstract theory.
|
|
ThC16 |
Capri III |
Predictive Control for Nonlinear Systems II |
Regular Session |
Chair: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Co-Chair: Bastos, Guaraci | Federal University of Pernambuco |
|
16:30-16:45, Paper ThC16.1 | |
Value Function Approximation for Nonlinear MPC: Learning a Terminal Cost Function with a Descent Property |
|
Baltussen, Tren M.J.T. | Eindhoven University of Technology |
Orrico, Christopher Anthony | Eindhoven University of Technology |
Katriniok, Alexander | Eindhoven University of Technology |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Krishnamoorthy, Dinesh | Norwegian University of Science and Technology (NTNU) |
Keywords: Predictive control for nonlinear systems, Randomized algorithms, Stability of nonlinear systems
Abstract: We present a novel method to synthesize a terminal cost function for a nonlinear model predictive controller (MPC) through value function approximation using supervised learning. Existing methods enforce a descent property on the terminal cost function by construction, thereby restricting the class of terminal cost functions, which in turn can limit the performance and applicability of the MPC. We present a method to approximate the true cost-to-go with a general function approximator that is convex in its parameters, and impose the descent condition on a finite number of states. Through the scenario approach, we provide probabilistic guarantees on the descent condition of the terminal cost function over the continuous state space. We demonstrate and empirically verify our method in a numerical example. By learning a terminal cost function, the prediction horizon of the MPC can be significantly reduced, resulting in reduced online computational complexity while maintaining good closed-loop performance.
|
|
16:45-17:00, Paper ThC16.2 | |
Guaranteed-Safe MPPI through Composite Control Barrier Functions for Efficient Sampling in Multi-Constrained Robotic Systems |
|
Rabiee, Pedram | University of Kentucky |
Hoagg, Jesse B. | University of Kentucky |
Keywords: Predictive control for nonlinear systems, Constrained control, Autonomous systems
Abstract: We present a guaranteed-safe model predictive path integral (GS-MPPI) control algorithm that enhances sample efficiency in nonlinear systems with multiple safety constraints. The approach uses a composite control barrier function (CBF) along with MPPI to ensure all sampled trajectories are provably safe. We construct a single CBF constraint from multiple safety constraints with potentially differing relative degrees, yielding a closed-form safe control law. Integrating this into system dynamics enables MPPI to optimize exclusively over safe trajectories. The method improves computational efficiency while addressing CBFs' myopic behavior by incorporating long-term performance considerations. Simulations of a nonholonomic ground robot with position and speed constraints demonstrate the algorithm's effectiveness.
|
|
17:00-17:15, Paper ThC16.3 | |
A Contingency Model Predictive Control Framework for Safe Learning |
|
Geurts, Merlijne | Eindhoven, University of Technology |
Baltussen, Tren M.J.T. | Eindhoven University of Technology |
Katriniok, Alexander | Eindhoven University of Technology |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Keywords: Predictive control for nonlinear systems, Autonomous systems, Uncertain systems
Abstract: This research introduces a multi-horizon contingency model predictive control (CMPC) framework in which classes of robust MPC (RMPC) algorithms are combined with classes of learning-based MPC (LB-MPC) algorithms to enable safe learning. We prove that the CMPC framework inherits the robust recursive feasibility properties of the underlying RMPC scheme, thereby ensuring safety of the CMPC in the sense of constraint satisfaction. The CMPC leverages the LB-MPC to safely learn the unmodeled dynamics to reduce conservatism and improve performance compared to standalone RMPC schemes, which are conservative in nature. In addition, we present an implementation of the CMPC framework that combines a particular RMPC and a Gaussian Process MPC scheme. A simulation study on automated lane merging demonstrates the advantages of our general CMPC framework.
|
|
17:15-17:30, Paper ThC16.4 | |
Fault-Tolerant Model Predictive Control for Spacecraft |
|
Stöckner, Raphael | Kungliga Tekniska Högskolan |
Roque, Pedro | KTH Royal Institute of Technology |
Charitidou, Maria | University of Maryland |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Predictive control for nonlinear systems, Aerospace, Nonholonomic systems
Abstract: Given the cost and critical functions of satellite constellations, ensuring mission longevity and safe decommissioning is essential for space sustainability. This article presents a Model Predictive Control for spacecraft trajectory and setpoint stabilization under multiple actuation failures. The proposed solution allows us to efficiently control the faulty spacecraft enabling safe navigation towards servicing or collision-free trajectories. The proposed scheme ensures closed-loop asymptotic stability and is shown to be recursively feasible. We demonstrate its efficacy through open-source numerical results and realistic experiments using the ATMOS platform.
|
|
17:30-17:45, Paper ThC16.5 | |
An Analytical Reference Compensator for Dynamic Set-Point Tracking with qLPV NMPCs |
|
Menezes Morato, Marcelo | UFSC |
Santos, Tito Luís Maia | Federal University of Bahia |
Keywords: Predictive control for nonlinear systems, Linear parameter-varying systems, Optimization
Abstract: In the literature, recent works have systematically shown that the use of quasi-Linear Parameter Varying (qLPV) embeddings in the place of nonlinear models can significantly enhance the numerical performances of Nonlinear Model Predictive Control (NMPC) algorithms. However, the corresponding available formulations for the reference tracking problem typically enable offset-free steady-state tracking only for the case of piece-wise constant or fixed set-points. In order to extend these algorithms for the more generic case of time-varying reference signals, we propose an analytical target modification scheme that can be directly integrated to the prior. In particular, the proposed compensator scheme has low numerical complexity, being based on the unconstrained optimisation solution of the NMPC. We also provide an explicit input-state reference trajectory representation and demonstrate that qLPV NMPC schemes coupled to the proposed scheme maintain input-to-state-practical-stability (ISpS) and recursive feasibility. A cart-spring benchmark nonlinear simulation is presented to demonstrate the effectiveness of the proposed target modification block, in comparison to the standard setting.
|
|
17:45-18:00, Paper ThC16.6 | |
Dynamic Tube-MPC for Underactuated Mechanical Systems with Matched and Unmatched Disturbances |
|
Bastos, Guaraci | Federal University of Pernambuco |
Franco, Enrico | Imperial College London |
Keywords: Control applications, Uncertain systems, Predictive control for nonlinear systems
Abstract: This work investigates the trajectory tracking problem for a class of underactuated mechanical systems subject to matched and unmatched time-varying disturbances. To this end, a new dynamic tube-Model Predictive Control is proposed which includes a new tube dynamics and a new ancillary control law constructed analytically with an energy-shaping approach. Simulation results on a two-links model representative of a soft manipulator with disturbances demonstrate the effectiveness of the new controller.
|
|
18:00-18:15, Paper ThC16.7 | |
On Sampling Time and Invariance |
|
Schutz, Spencer | University of California, Berkeley |
Vallon, Charlott | University of California, Berkeley |
Recht, Benjamin | University of California, Berkeley |
Borrelli, Francesco | Unversity of California at Berkeley |
Keywords: Constrained control, Predictive control for linear systems, Predictive control for nonlinear systems
Abstract: Invariant sets define regions of the state space where system constraints are always satisfied. The majority of numerical techniques for computing invariant sets have been developed for discrete-time systems with a fixed sampling time. Understanding how invariant sets change with sampling time is critical for designing adaptive-sampling control schemes that ensure constraint satisfaction. We introduce M-step hold control invariance, a generalization of traditional control invariance, and show its practical use to assess the link between control sampling frequency and constraint satisfaction. We robustify M-step hold control invariance against model mismatches and discretization errors, paving the way for adaptive-sampling control strategies.
|
|
18:15-18:30, Paper ThC16.8 | |
A Cascaded MPC Framework for Airborne Wind Energy Systems |
|
Heydarnia, Omid | Ghent Univeristy |
Wauters, Jolan | Ghent University |
Lefebvre, Tom | Ghent University |
Crevecoeur, Guillaume | Ghent University |
Keywords: Predictive control for nonlinear systems, Adaptive control, Flight control
Abstract: Airborne Wind Energy is an innovative approach to generate electricity from high-altitude winds. Despite advances in AWES control, controlling the aircraft in the face of turbulence, unsteady aerodynamics, and rapid wind variations remains a challenge. Model Predictive Control (MPC) shows promising results, but relies on precise models. To improve MPC performance in the presence of uncertainty, a cascaded control strategy is proposed, integrating MPC at its first stage. This study investigates the effectiveness of Incremental Nonlinear Dynamic Inversion (INDI) and L1 adaptive control for the second stage, assessing their ability to manage rotational dynamics uncertainties and mitigate disturbances. The controller performance is validated in a realistic simulation that incorporates Gaussian Process (GP) models trained on Computational Fluid Dynamics (CFD) data.
|
|
ThC17 |
Capri IV |
Stability of Nonlinear Systems II |
Regular Session |
Chair: Efimov, Denis | Inria |
Co-Chair: Ito, Hiroshi | Kyushu Institute of Technology |
|
16:30-16:45, Paper ThC17.1 | |
Nonlinear Bandwidth and Bode Diagrams Based on Scaled Relative Graphs |
|
Krebbekx, Julius | Eindhoven University of Technology |
Tóth, Roland | Eindhoven University of Technology |
Das, Amritam | Eindhoven University of Technology |
Keywords: Nonlinear systems, Mechatronics, Stability of nonlinear systems
Abstract: Scaled Relative Graphs (SRGs) provide a novel graphical frequency-domain method for the analysis of Nonlinear (NL) systems. In this paper, we restrict the SRG to particular input spaces to compute frequency-dependent incremental gain bounds for nonlinear systems. This leads to a NL generalization of the Bode diagram, where the sinusoidal, harmonic, and subharmonic inputs are considered separately. When applied to the analysis of the NL loop transfer and sensitivity, we define a notion of bandwidth for both the open-loop and closed-loop, compatible with the Linear Time-Invariant (LTI) definitions. We illustrate the power of our method on the analysis of a DC motor with a parasitic nonlinearity and verify our results in simulations.
|
|
16:45-17:00, Paper ThC17.2 | |
Global Asymptotic Stability Is Uniform Even on Non-Closed Bounded Sets of Input Values |
|
Chaillet, Antoine | CentraleSupélec |
Mason, Paolo | CNRS, Laboratoire Des Signaux Et Systèmes |
Wang, Yuan | Florida Atlantic Univ |
Keywords: Stability of nonlinear systems, Uncertain systems, Nonlinear systems
Abstract: A fundamental result for the robust stability analysis of dynamical systems states that, if inputs take values in a compact set, then global asymptotic stability of the origin is necessarily uniform, in the sense that transient overshoots are uniformly bounded and the decay rate is uniform over any bounded set of initial states and over all considered inputs. We show here that the compactness of the input values set can be relaxed to mere boundedness, thus eliminating the closedness assumption. Through a counterexample, we also show that the boundedness requirement cannot be relaxed.
|
|
17:00-17:15, Paper ThC17.3 | |
On the Dynamics of Theta-Invariant Systems and Normed Actions |
|
R. Lima, Danilo | Inria |
Ushirobira, Rosane | Inria |
Efimov, Denis | Inria |
Keywords: Stability of nonlinear systems, Nonlinear systems, Autonomous systems
Abstract: This paper expands the dynamical analysis of theta-invariant systems, with focus on autonomous systems containing equilibrium points. In particular, we present dynamical behavior that can occur only with theta-invariant systems but not G-invariant systems. Those include finite-time stability and the existence of isolated periodic orbits. Furthermore, we show that many of the good dynamical properties present in homogeneous systems can be obtained in the more general context of invariance. This is done through the introduction of the concept of normed actions.
|
|
17:15-17:30, Paper ThC17.4 | |
Lyapunov-Based Positivizing and Stabilizing Controller Design for Nonlinear Compartmental Systems with Prescribed Positive Equilibria |
|
Ito, Hiroshi | Kyushu Institute of Technology |
Keywords: Lyapunov methods, Stability of nonlinear systems, Nonlinear systems
Abstract: This paper considers the design of non-traditional compartmental systems whose state variables and their target stationary values are positive. Metzler matrices and monotonicity with respect to positive orthants are neither necessary nor sufficient for state variables positivity when its equilibrium is not at the origin. Shifting the stationary point of a compartmental system often violates the positivity. It means that the corresponding stationary offset is not physically implementable. This paper designs a controller that renders the specified stationary point realizable. It is referred to as the positivizing and stabilizing controller design. For nonlinear systems with manipulatable transportation flows between compartments, this paper pursues an approach in which a single Lyapunov function establishes positivity and asymptotic stability simultaneously. Its benefit is demonstrated by local matching and the freedom of controller gains tunning the convergence speed.
|
|
17:30-17:45, Paper ThC17.5 | |
On a Construction of Lyapunov Functions Based on Neural Networks and Homogeneous Approximations |
|
Zenkin, Artemii | ITMO |
Ushirobira, Rosane | Inria |
Efimov, Denis | Inria |
Bobtsov, Alexey | ITMO University |
Keywords: Lyapunov methods, Nonlinear systems, Numerical algorithms
Abstract: This paper applies an artificial neural network approach to design a local Lyapunov function for a general class of nonlinear systems. To avoid the singularity at the origin in computing the Lyapunov function, it is complemented by using a homogeneous approximation at the origin.
|
|
17:45-18:00, Paper ThC17.6 | |
Virtual Mass Tuning of Overhead Cranes |
|
Asani, Zemerart | Vrije University Brussels |
Nicotra, Marco M | University of Colorado Boulder |
Garone, Emanuele | Université Libre De Bruxelles |
Keywords: Lyapunov methods, Stability of nonlinear systems, Robotics
Abstract: This paper introduces a novel control paradigm for improving the performance of existing control laws for overhead cranes. The proposed approach functionally replaces the suspended mass of the physical system with a suitably tuned virtual mass. The motivating principle behind this control paradigm is that the performance of many energy-based control laws for overhead cranes strongly depends on the mass of the suspended load. Using a simple tuning procedure to optimize the virtual mass, it is shown that the proposed approach can significantly boost the performance of energy-based control laws while preserving the underlying stability guarantees.
|
|
18:00-18:15, Paper ThC17.7 | |
Cascaded Event-Triggered Control for Nonlinear Asynchronous Interconnected Systems |
|
Wang, Xiaoyu | North China Electric Power University |
Xiao, Feng | North China Electric Power University |
Liu, Pin | North China Electric Power University |
Keywords: Sampled-data control, Stability of nonlinear systems, Distributed control
Abstract: This paper investigates the stability of asynchronous interconnected systems with inherent nonlinearities. A novel concept of cascaded event-triggered control (CETC) is proposed. In CETC, a cascaded event-triggered scheme with a hierarchical event-detection structure is developed. It includes a sequence of event-triggering conditions at different layers, which are detected in order. In every event-detecting loop, the event condition at the first layer is started, the CETC proceeds to detect the conditions at the subsequent layers, and a sampling instant is generated once the condition at the last layer is satisfied. To guarantee a positive lower bound of the inter-sampling intervals, the cascaded event-triggering conditions are required to include at least one time-based condition. Under a specialized two-layer event-triggered scheme, referred to as the primary-secondary event-triggered scheme, an integral approach is developed to analyze the stability. Finally, the effectiveness of the proposed method is validated through simulation examples.
|
|
18:15-18:30, Paper ThC17.8 | |
Multivariable Feedback Control for Multi-Constraint Optimization in Online Advertising |
|
Karlsson, Niklas | Amazon |
Keywords: Emerging control applications, Stability of nonlinear systems, Control applications
Abstract: Online advertising is typically implemented via real-time bidding, and advertising campaigns are then defined as extremely high-dimensional optimization problems. To solve these problems in light of large scale and significant uncertainties, the optimization problems are modularized in a manner that makes feedback control a critical component of the solution. The control problem, however, is challenging due to plant uncertainties, nonlinearities, time-variance, and noise. Multi-constraint optimization problems are especially difficult to solve via feedback control because of the dynamic interaction across feedback loops. This paper demonstrates how one particular multi-constraint problem can be solved using a cascade feedback controller. The inner loop is managed by a linear time-periodic feedforward controller combined with a linear time-invariant feedback controller. Meanwhile, the outer loop is managed by a linear time-invariant feedforward-feedback controller. This paper is concerned with the outer loop controller and derives sufficient conditions for stability of the nonlinear closed loop system by expressing it as a Lure' system and by engaging the circle criterion. The solution is evaluated in a simulated environment based on artificial~data.
|
|
ThC18 |
Aruba I+II+III |
Observers for Linear Systems |
Regular Session |
Chair: Lessard, Laurent | Northeastern University |
Co-Chair: Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
|
16:30-16:45, Paper ThC18.1 | |
Moving-Horizon Estimation for Linear Systems with Random Packet Losses: Suboptimal Arrival Cost Improves Robustness |
|
Li, Xinda | University of Sheffield |
Su, Lanlan | University of Manchester |
Trodden, Paul | University of Sheffield |
Keywords: Observers for Linear systems, Sensor networks, Linear systems
Abstract: This paper addresses the problem of estimating the state of a linear system whose output measurements are transmitted over a communication channel subject to random packet losses. For such systems, it is well established that mean-square stability of an optimal linear estimator requires the packet arrival probability to exceed a critical threshold. Although this fundamental limit cannot be circumvented, we show, both theoretically and through examples, that a moving-horizon estimator (MHE) with a carefully designed suboptimal arrival cost can achieve mean stability for arbitrarily low arrival probability. The result decouples mean stability from the more stringent mean-square condition, offering a robust estimation strategy for channels with low data rates.
|
|
16:45-17:00, Paper ThC18.2 | |
State Estimation for Linear Systems with Non-Gaussian Measurement Noise Via Dynamic Programming |
|
Yoosefian Nooshabadi, Mohammad Hussein | Northeastern University |
Lessard, Laurent | Northeastern University |
Keywords: Observers for Linear systems, Estimation, Filtering
Abstract: We propose a new recursive estimator for linear dynamical systems under Gaussian process noise and non-Gaussian measurement noise. Specifically, we develop an approximate maximum a posteriori (MAP) estimator using dynamic programming and tools from convex analysis. Our approach does not rely on restrictive noise assumptions and employs a Bellman-like update instead of a Bayesian update. Our proposed estimator is computationally efficient, with only modest overhead compared to a standard Kalman filter. Simulations demonstrate that our estimator achieves lower root mean squared error (RMSE) than the Kalman filter and has comparable performance to state-of-the-art estimators, while requiring significantly less computational power.
|
|
17:00-17:15, Paper ThC18.3 | |
Exponentially Stable Stubborn Observers for Discrete-Time Linear Systems |
|
Zambotti, Beatrice | Université Claude Bernard Lyon 1 |
Andrieu, Vincent | Université De Lyon |
Astolfi, Daniele | Cnrs - Lagepp |
Bako, Laurent | Ecole Centrale De Lyon |
Nadri, Madiha | Universite Claude Bernard Lyon 1 |
Zaccarian, Luca | LAAS-CNRS |
Keywords: Observers for Linear systems, Stability of nonlinear systems, Lyapunov methods
Abstract: State estimation is crucial for control and monitoring of dynamical systems, but sporadic disturbances (outliers) can severely degrade the estimator performance. This paper studies the stability of a "stubborn" observer, designed to mitigate the effects of outliers, for discrete-time linear systems. We prove that the detectability of the system is both necessary and sufficient for global exponential stability of the observer, providing an improvement over existing results that rely on potentially infeasible LMI conditions. The proof is constructive, leading to practical guidelines for selecting the observer parameters. Numerical simulations demonstrate the observer's effectiveness in mitigating outliers and its superior performance compared to a classical Luenberger observer.
|
|
17:15-17:30, Paper ThC18.4 | |
Self-Triggered Interval Observer Design for Multisensor Systems under Delayed Measurements |
|
Tagne Mogue, Ruth Line | Univ. Orleans |
Becis-Aubry, Yasmina | Univ. of Orléans |
Courtial, Estelle | Laboratory PRISME, University of Orleans |
Meslem, Nacim | GIPSA-LAB, CNRS |
Ramdani, Nacim | University of Orléans |
Keywords: Observers for Linear systems, Estimation, Stability of hybrid systems
Abstract: A framework for set-based state estimation in multi-sensor cyber-physical systems with delayed measurements is presented, integrating defense-in-depth strategies while ensuring optimal system resource utilization. An interval impulsive observer is designed, combining a moving target defense strategy with self-triggered sampling. The finite-gain L1 stability of the estimation errors is analyzed using hybrid system tools. By leveraging positive realization and interval analysis, sufficient conditions for synthesis are formulated as an optimization problem under algebraic inequalities. The approach is illustrated through a simple example.
|
|
17:30-17:45, Paper ThC18.5 | |
Low-Dimensional Observer Design for Stable Linear Systems by Model Reduction |
|
Shakib, Fahim | Imperial College London |
Khalil, Mira | CRAN, Université De Lorraine |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Keywords: Observers for Linear systems, Model/Controller reduction, Large-scale systems
Abstract: This paper presents a low-dimensional observer design for stable, single-input single-output, continuous-time linear time-invariant (LTI) systems. Leveraging the model reduction by moment matching technique, we approximate the system with a reduced-order model. Based on this reduced-order model, we design a low-dimensional observer that estimates the states of the original system. We show that this observer establishes exact asymptotic state reconstruction for a given class of inputs tied to the observer's dimension. Furthermore, we establish an exponential input-to-state stability property for generic inputs, ensuring a bounded estimation error. Numerical simulations confirm the effectiveness of the approach for a benchmark model reduction problem.
|
|
17:45-18:00, Paper ThC18.6 | |
Bridging Centralized and Distributed Frameworks in Unknown Input Observer Design |
|
Zhao, Ruixuan | University College London |
Yang, Guitao | Imperial College London |
Li, Peng | Harbin Institute of Technology, Shenzhen |
Chen, Boli | University College London |
Keywords: Observers for Linear systems, Networked control systems, Sensor networks
Abstract: State estimation for linear time-invariant systems with unknown inputs is a fundamental problem in various research domains. In this article, we establish conditions for the design of unknown input observers (UIOs) from a geometric approach perspective. Specifically, we derive a necessary and sufficient geometric condition for the existence of a centralized UIO. Compared to existing results, our condition offers a more general design framework, allowing designers the flexibility to estimate partial information of the system state. Furthermore, we extend the centralized UIO design to distributed settings. In contrast to existing distributed UIO approaches, which require each local node to satisfy the rank condition regarding the unknown input and output matrices, our method accommodates cases where a subset of nodes does not meet this requirement. This relaxation significantly broadens the range of practical applications. Simulation results are provided to demonstrate the effectiveness of the proposed design.
|
|
18:00-18:15, Paper ThC18.7 | |
A Distributed Kalman-Like Observer with Dynamic Inversion-Based Correction for Multi-Agent Estimation |
|
De Carli, Nicola | KTH Royal Institute of Technology |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Kalman filtering, Distributed control, Observers for Linear systems
Abstract: We present a novel distributed Kalman-like observer for cooperative state estimation in multi-agent systems. Unlike conventional Kalman filters, our approach replaces the process covariance matrix with a forgetting factor, enabling the distributed propagation of the information matrix dynamics while preserving key stability properties. The observer’s correction term is computed by solving a linear equation dynamically in a distributed manner, circumventing the need for direct centralized matrix inversion. Unlike existing methods that discard cross-information to allow distributed computations, our approach preserves inter-agent coupling. The proposed observer requires only joint observability, allowing for flexible sensing configurations. Rigorous stability guarantees are provided, and numerical simulations in a cooperative localization scenario demonstrate the effectiveness of the approach in estimating agent states.
|
|
18:15-18:30, Paper ThC18.8 | |
Active Fault Diagnosis for Spacecraft Attitude Control Systems Using Set-Theoretic Unknown Input Observers |
|
Wang, Songtao | Shanghai Jiao Tong University |
Shen, Qiang | Shanghai Jiao Tong University |
Li, Huihui | Shanghai Jiao Tong University |
Keywords: Fault diagnosis, Observers for Linear systems, Identification
Abstract: This paper proposes an active fault diagnosis method for spacecraft attitude control systems with distur- bances by utilizing the unknown input observer. Firstly, the spacecraft attitude control system is formulated as a linear time-invariant model under diverse faulty conditions. A general unknown input observer framework is constructed to decouple process disturbances by designing optimal gain matrices, which are analytically obtained by solving an unconstrained optimiza- tion problem. Then, zonotopes are utilized to characterize the dynamics of state estimation error, and a set-theoretic unknown input observer is built. To achieve the guaranteed active fault diagnosis, a quadratic program and a mixed-integer quadratic optimization program are formulated to obtain the separating input that can maximize the separation degree of all zonotopes under fault isolation and input saturation constraints. Finally, an illustrative example is provided to verify the effectiveness of the proposed active fault diagnosis method.
|
|
ThC19 |
Ibiza IV |
Optimal Control VI |
Regular Session |
Chair: Notomista, Gennaro | University of Waterloo |
Co-Chair: Mallick, Samuel | Delft University of Technology |
|
16:30-16:45, Paper ThC19.1 | |
Pointwise Optimal Feedback Laws for Hybrid Inclusions Using Multiple Control Barrier Functions |
|
Montenegro Gonzalez, Carlos | University of California, Santa Cruz |
Sweatland, Hannah | University of Florida |
Currier, Keith | University of Florida |
Dixon, Warren E. | University of Florida |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Hybrid systems, Optimal control, Stability of hybrid systems
Abstract: This paper studies the problem of designing optimization-based controllers, with desired regularity properties, encoding the satisfaction of multiple state constraints via barrier functions for hybrid systems, modeled as hybrid inclusions. Sufficient conditions are given to guarantee forward invariance or asymptotic stability of a closed set K for a hybrid closed-loop system, even with discontinuous feedback laws. However, robustness of such properties is not necessarily guaranteed. Thus, we present sufficient conditions for the continuity of the optimization-based feedback laws that render the hybrid closed-loop system well-posed. A numerical simulation of a one-degree-of-freedom juggling system illustrating the main result of the paper is presented.
|
|
16:45-17:00, Paper ThC19.2 | |
Control Disturbance Rejection in Neural ODEs |
|
Bayram, Erkan | University of Illinois Urbana-Champaign |
Belabbas, Mohamed Ali | University of Illinois at Urbana-Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Optimal control, Robust control, Neural networks
Abstract: In this paper, we propose an iterative training algorithm for Neural ODEs that provides models resilient to control (parameter) disturbances. The method builds on our earlier work Tuning without Forgetting--and similarly introduces training points sequentially, and updates the parameters on new data within the space of parameters that do not decrease performance on the previously learned training points--with the key difference that, inspired by the concept of flat minima, we solve a minimax problem for a non-convex non-concave functional over an infinite-dimensional control space. We develop a projected gradient descent algorithm on the space of parameters that admits the structure of an infinite-dimensional Banach subspace. We show through simulations that this formulation enables the model to effectively learn new data points and gain robustness against control disturbance.
|
|
17:00-17:15, Paper ThC19.3 | |
Learning-Based MPC for Fuel Efficient Control of Autonomous Vehicles with Discrete Gear Selection |
|
Mallick, Samuel | Delft University of Technology |
Battocletti, Gianpietro | Delft University of Technology |
Dong, Qizhang | Delft University of Technology |
Dabiri, Azita | Delft University of Technology |
De Schutter, Bart | Delft University of Technology |
Keywords: Optimal control, Hybrid systems, Autonomous vehicles
Abstract: Co-optimization of both vehicle speed and gear position via model predictive control (MPC) has been shown to offer benefits for fuel-efficient autonomous driving. However, optimizing both the vehicle's continuous dynamics and discrete gear positions may be too computationally intensive for a real-time implementation. This work proposes a learning-based MPC scheme to address this issue. A policy is trained to select and fix the gear positions across the prediction horizon of the MPC controller, leaving a significantly simpler continuous optimization problem to be solved online. In simulation, the proposed approach is shown to have a significantly lower computation burden and a comparable performance, with respect to pure MPC-based co-optimization.
|
|
17:15-17:30, Paper ThC19.4 | |
Boundary Control for Stability and Invariance of Traffic Flow Dynamics: A Convex Optimization Approach |
|
Chiri, Maria Teresa | Penn State University |
Guglielmi, Roberto | University of Waterloo |
Notomista, Gennaro | University of Waterloo |
Keywords: Optimal control, Traffic control, Distributed parameter systems
Abstract: In this letter we propose an optimization-based boundary controller for traffic flow dynamics capable of achieving both stability and invariance conditions. The approach is based on the definition of Boundary Control Barrier Functionals, from which sets of invariance-preserving boundary controllers are derived. In combination with sets of stabilizing controllers, we reformulate the problem as a convex optimization program solved at each point in time to synthesize the boundary control inputs. We derive sufficient conditions for the existence of optimal controllers that ensure both stability and invariance.
|
|
17:30-17:45, Paper ThC19.5 | |
Optimal Robust Containment Control for Human-Quadrotor Formation Via Critic Neural Network Learning with Relaxed PE Conditions |
|
Zhang, XingYu | University of Electronic Science and Technology of China |
Chen, Chen | University of Electronic Science and Technology of China |
Xia, Zhuo | University of Electronic Science and Technology of China |
Luo, Rui | University of Electronic Science and Technology of China |
Peng, Zhinan | University of Electronic Science and Technology of China |
Cheng, Hong | University of Electronic Science and Technology of China |
Ghosh, Bijoy | Texas Tech University |
Keywords: Optimal control, Reinforcement learning, Neural networks
Abstract: This paper proposes a novel optimal robust containment control framework for a group of quadrotors with human-operated leaders. Different from the conventional containment task, external disturbances are considered in the dynamic model of followers and human instructions are modeled as the control input for the dynamics of leaders to enhance the maneuverability of the quadrotor team. This framework consists of two parts. First, a distributed trajectory generator is proposed to derive containment state reference for each follower using only observable leaders' data. Based on the distributed trajectory generator, the containment control problem is transformed into an optimal robust tracking control problem. Furthermore, a zero-sum differential game-based position containment controller is proposed to solve the optimization. To obtain the game's optimal solution, a critic neural network (NN) is employed with a novel weight adaption law using dynamic regression extension and mixing (DREM) technology. Unlike conventional gradient-based methods, this law achieves weight convergence under relaxed persistent excitation (PE) conditions while enhancing the transient performance for the training of the critic NN. Finally, theoretical analysis is given to prove the stability of the closed-loop system and the efficacy of the proposed control method is demonstrated through simulation.
|
|
17:45-18:00, Paper ThC19.6 | |
Optimal Control of Endemic Epidemic Diseases with Behavioral Response |
|
Parino, Francesco | INSERM, Sorbonne Université |
Zino, Lorenzo | Politecnico Di Torino |
Rizzo, Alessandro | Politecnico Di Torino |
Keywords: Control applications, Nonlinear systems, Optimal control
Abstract: Behavioral factors play a crucial role in the emergence, spread, and containment of human diseases, significantly influencing the effectiveness of intervention measures. However, the integration of such factors into epidemic models is still limited, hindering the possibility of understanding how to optimally design interventions to mitigate epidemic outbreaks in real life. This paper aims to fill in this gap. In particular, we propose a parsimonious model that couples an epidemic compartmental model with a population game that captures the behavioral response, obtaining a nonlinear system of ordinary differential equations. Grounded on prevalence-elastic behavior ---the empirically proven assumption that the disease prevalence affects the adherence to self-protective behavior--- we consider a nontrivial negative feedback between contagions and adoption of self-protective behavior. We characterize the asymptotic behavior of the system, establishing conditions under which the disease is quickly eradicated or a global convergence to an endemic equilibrium is attained. In addition, we elucidate how the behavioral response affects the endemic equilibrium. Then, we formulate and solve an optimal control problem to plan cost-effective interventions for the model, accounting for their healthcare and social-economical implications. Numerical simulations on a case study calibrated on sexually transmitted diseases demonstrate and validate our findings.
|
|
18:00-18:15, Paper ThC19.7 | |
Dubins Path with Terminal Range and Field-Of-View Constraints |
|
Manyam, Satyanarayana Gupta | DCS Corp., Air Force Research Labs |
Casbeer, David W. | Air Force Research Laboratory |
Von Moll, Alexander | Air Force Research Laboratory |
Weintraub, Isaac | Air Force Research Laboratory |
Keywords: Autonomous vehicles, Nonholonomic systems, Optimal control
Abstract: This paper addresses a path planning problem for a turn-constrained vehicle equipped with a sensor that has a limited range and field of view (FOV). The vehicle is modeled using Dubins kinematics, and the objective is to determine the shortest path to point where that the target is detectable, i.e., within the sensor's range and FOV. The limited sensor range imposes a constraint requiring the vehicle to reach a circle centered at the target's position, with a radius equal to the sensor range. The field-of-view constraints are expressed as coupled conditions on the terminal position and heading. Pontryagin's minimum principle restricts the optimal path to straight lines and arcs of minimum turn radius. Further analysis shows that the paths are limited to at most two segments. Straight-line segments occur exclusively when the final position lies on the boundary of the target circle. Furthermore, we present an analytical approach for computing optimal paths in scenarios where the optimality conditions result in complex geometric configurations.
|
|
18:15-18:30, Paper ThC19.8 | |
Decoupling Collision Avoidance in and for Optimal Control Using Least-Squares Support Vector Machines |
|
Dirckx, Dries | KU Leuven |
Swevers, Jan | KU Leuven |
Decre, Wilm | KU Leuven |
Keywords: Optimal control, Optimization, Robotics
Abstract: This paper details an approach to linearise differentiable but non-convex collision avoidance constraints tailored to convex shapes. It revisits introducing differential collision avoidance constraints for convex objects into an optimal control problem (OCP) using the separating hyperplane theorem. By framing this theorem as a classification problem, the hyperplanes are eliminated as optimisation variables from the OCP. This effectively transforms non-convex constraints into linear constraints. A bi-level algorithm computes the hyperplanes between the iterations of an optimisation solver and subsequently embeds them as parameters into the OCP. Experiments demonstrate the approach’s favourable scalability towards cluttered environments and its applicability to various motion planning approaches. It decreases trajectory computation times between 50% and 90% compared to a state-of-the-art approach that directly includes the hyperplanes as variables in the optimal control problem.
|
|
ThC20 |
Asia I+II+III+IV |
Contraction Theory in Control, Optimization, and Learning |
Tutorial Session |
Chair: Bullo, Francesco | Univ of California at Santa Barbara |
Co-Chair: Manchester, Ian R. | University of Sydney |
Organizer: Bullo, Francesco | Univ of California at Santa Barbara |
|
16:30-18:30, Paper ThC20.1 | |
Advances in Contraction Theory for Robust Optimization, Control, and Neural Computation (I) |
|
Bullo, Francesco | Univ of California at Santa Barbara |
Coogan, Samuel | Georgia Institute of Technology |
Dall'Anese, Emiliano | Boston University |
Manchester, Ian R. | University of Sydney |
Russo, Giovanni | University of Salerno |
Keywords: Nonlinear systems, Stability of nonlinear systems, Optimization
Abstract: This tutorial provides an overview of recent developments in contraction theory, highlighting theoretical advances, practical applications, and emerging extensions. We explore topics including time-varying convex optimization through equilibrium tracking, biologically plausible optimization in neural networks, and the analysis of interconnected and sampled-data systems. Additional focus is given to linear differential inclusions, reachability analysis, and the integration of contraction theory with robust, control-oriented machine learning.
|
| |