|
MoA01 |
Auditorium |
Data-Driven Learning of Stable Control Laws for Adaptive Robot Planning: A
Dynamical Systems Perspective |
Tutorial Session |
Chair: Billard, Aude | EPFL |
Co-Chair: Fichera, Bernardo | EPFL |
Organizer: Billard, Aude | EPFL |
Organizer: Fichera, Bernardo | EPFL |
Organizer: Nayak, Aradhana | Huawei Technologies GmbH |
|
10:00-10:40, Paper MoA01.1 | |
Learning Provably Stable Nonlinear Dynamical Systems (I) |
|
Billard, Aude | EPFL |
Keywords: Adaptive systems, Autonomous robots, Learning
Abstract: The opening lecture of this tutorial will offer a comprehensive introduction to the field of DS-based control. It will lay the theoretical groundwork for learning stable DS from demonstrations and illustrate how these learned policies can be applied to control robotic systems, ensuring both robustness and responsiveness.
|
|
10:40-11:20, Paper MoA01.2 | |
Modulating Dynamical Systems for Real Time Avoidance of Concave and Moving Obstacles (I) |
|
Nayak, Aradhana | Huawei Technologies GmbH |
Keywords: Adaptive systems, Autonomous robots, Learning
Abstract: The second lecture of this tutorial will concentrate on modulating DSs in scenarios in- volving obstacle avoidance, presenting data-driven approaches to learning the modulation. Beginning with an overview of DS modulation theory, the lecture will trace the field’s progression towards handling increasingly complex situations without compromising guarantees of non-penetrability.
|
|
11:20-12:00, Paper MoA01.3 | |
Dynamical Systems and Differential Geometry (I) |
|
Fichera, Bernardo | EPFL |
Keywords: Adaptive systems, Autonomous robots, Learning
Abstract: The concluding lecture of this tutorial will delve into the emerging area of DSs enhanced by differential geometry, known as Geometrical DSs (GDS), within both the learning and control spheres. It will begin with a broad discussion on the motivations and objectives behind integrating differential geometry tools into DS-based control systems. Following this, the lecture will lay out the theoretical underpinnings necessary for constructing GDSs. The session will then highlight various applications and scenarios where GDSs are employed, outlining their benefits and limitations.
|
|
MoA02 |
Amber 1 |
Estimation, Control and Learning of Quantum Systems |
Invited Session |
Chair: Wang, Yuanlong | Chinese Academy of Sciences |
Co-Chair: Dong, Daoyi | Australian National University |
Organizer: Qi, Bo | CAS |
Organizer: Dong, Daoyi | Australian National University |
Organizer: Jonckheere, Edmond | University of Southern California |
Organizer: Wang, Yuanlong | Chinese Academy of Sciences |
|
10:00-10:20, Paper MoA02.1 | |
>Admissible Optimal Control for Parameter Estimation in Quantum Systems |
|
Clouatre, Maison | Massachusetts Institute of Technology (MIT) |
Marano, Stefano | University of Salerno |
Falb, Peter | Brown University |
Win, Moe Z. | Massachusetts Institute of Technology (MIT) |
Keywords: Quantum information and control
Abstract: This letter investigates parameter estimation in quantum systems that undergo dynamical evolution. Optimal control problems are formulated to maximize the information, about an unknown parameter, extracted by a given quantum measurement apparatus. This letter introduces the concept of ``admissible controls''---control laws that do not depend on the unknown parameter they elicit. For scalar parameter estimation in unital quantum systems interrogated by binary measurements, this letter derives a necessary and sufficient condition on quantum measurement operators so that an information maximizing control law is admissible. When the admissibility condition is satisfied, it is shown that the resulting optimal control problem may be solved using well-established techniques.
|
|
10:20-10:40, Paper MoA02.2 | |
>On Poles and Zeros of Linear Quantum Systems (I) |
|
Dong, Zhiyuan | Harbin Institute of Technology, Shenzhen |
Zhang, Guofeng | The Hong Kong Polytechnic University |
Lee, Heung Wing Joseph | The Hong Kong Polytechnic University |
Keywords: Quantum information and control
Abstract: The non-commutative nature of quantum mechanics imposes fundamental constraints on system dynamics, which in the linear realm are manifested by the physical realizability conditions on system matrices. These restrictions endow system matrices with special structure. The purpose of this paper is to study such structure by investigating zeros and poses of linear quantum systems. In particular, we show that -s_0^ast is a transmission zero if and only if s_0 is a pole, and which is further generalized to the relationship between system eigenvalues and invariant zeros. Additionally, we study left-invertibility and fundamental tradeoff for linear quantum systems in terms of their zeros and poles.
|
|
10:40-11:00, Paper MoA02.3 | |
>On the Role of Controllability in Pulse-Based Quantum Machine Learning Models (I) |
|
Tao, Hanxiao | Tsinghua University |
Wu, Re-Bing | Tsinghua University |
Keywords: Quantum information and control
Abstract: Pulse-based quantum machine learning (QML) models possess full expressivity when they are ensemble controllable. However, the accompanied barren plateaus render training intractable for high-dimensional systems. In this paper, we show that the trade-off can be understood via controllability analysis. We first apply the Fliess-series expansion to pulse-based QML models to investigate the effect of control system structure on model expressivity, which leads to a universal criterion for assessing the expressivity of generic QML models. We then show that uncontrollable pulse-based models that evolve on low-dimensional manifolds can achieve better balance between expressivity and trainability. Numerical experiments verify the proposed criterion and our analysis, which further demonstrate that the expressivity can be enhanced by increasing dimensionality, while barren plateaus can be avoided by reducing the controllability. Our approach provides a promising path for designing pulse-based QML models that are both highly expressive and trainable.
|
|
11:00-11:20, Paper MoA02.4 | |
>Simultaneous Quantum State and Detector Tomography through Multiple Hamiltonians (I) |
|
Xiao, Shuixin | University of New South Wales |
Liang, Weichao | University of New South Wales Canberra |
Wang, Yuanlong | Chinese Academy of Sciences |
Dong, Daoyi | Australian National University |
Petersen, Ian R. | Australian National University |
Ugrinovskii, Valery | University of New South Wales |
Keywords: Quantum information and control
Abstract: The estimation of all the parameters in an unknown quantum state/measurement device, commonly referred to as quantum state/detector tomography (QST/QDT), plays a pivotal role in characterization and control of quantum systems. In this paper, we introduce a novel method to concurrently identify a quantum state and a detector using multiple Hamiltonians. We then develop two solutions, i.e., a closed-form algorithm and the sum of squares (SOS) optimization. The numerical examples demonstrate that the closed-form solution offers computational efficiency, while the SOS optimization method ensures accuracy.
|
|
11:20-11:40, Paper MoA02.5 | |
>Model Predictive Control of Two-Level Open Quantum Systems (I) |
|
Lee, Yunyan | Australian National University |
Petersen, Ian R. | Australian National University |
Dong, Daoyi | Australian National University |
Keywords: Quantum information and control, Predictive control for nonlinear systems, Robust control
Abstract: This paper presents a robust control approach for a two-level open quantum system subject to bounded uncertainties through the application of model predictive control. We demonstrate two common cases: depolarizing decoherence and phase-damping decoherence in open quantum systems. We develop a model predictive control method using quantum measurements and provide a lower bound on the probability of obtaining the predicted state. Numerical results illustrate the effectiveness of this model predictive control approach in achieving stability and robustness in open quantum systems.
|
|
11:40-12:00, Paper MoA02.6 | |
>Parameter Estimation for Quantum Stochastic Systems (I) |
|
Liang, Weichao | University of New South Wales Canberra |
Xiao, Shuixin | University of New South Wales |
Keywords: Identification for control, Estimation, Quantum information and control
Abstract: This paper addresses the problem of parameter estimation for quantum systems undergoing homodyne detection. By generalizing the stability of quantum filters under quantum non-demolition measurements to cases with mismatched parameters, we propose an estimation approach that extends beyond finite parameter sets. Leveraging maximum likelihood estimation, our method offers improved reliability, flexibility, and potential computational savings compared to existing techniques. The asymptotic behavior of the estimator is rigorously analyzed, and simulations on spin-1/2 systems demonstrate the effectiveness and robustness of the proposed approach.
|
|
MoA03 |
Amber 2 |
Multi-Agent Systems: Awareness, Learning, and Formal Methods I |
Invited Session |
Chair: Vlahakis, Eleftherios | KTH Royal Institute of Technology |
Co-Chair: Ghosh, Arabinda | Max Planck Institute for Software Systems |
Organizer: van Huijgevoort, Birgit | Eindhoven University of Technology |
Organizer: Ghosh, Arabinda | Max Planck Institute for Software Systems |
Organizer: Vlahakis, Eleftherios | KTH Royal Institute of Technology |
Organizer: Haesaert, Sofie | Eindhoven University of Technology |
Organizer: Soudjani, Sadegh | Max Planck Institute for Software Systems |
Organizer: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
|
10:00-10:20, Paper MoA03.1 | |
>Multi-Agent Clarity-Aware Dynamic Coverage with Gaussian Processes (I) |
|
Agrawal, Devansh Ramgopal | University of Michigan |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Information theory and control, Data driven control, Cooperative control
Abstract: This paper presents two algorithms for multi-agent dynamic coverage in spatiotemporal environments, where the coverage algorithms are informed by the method of data assimilation. In particular, we show that by explicitly modeling the environment using a Gaussian Process (GP) model, and considering the sensing capabilities and the dynamics of a team of robots, we can design an estimation algorithm and multi-agent coverage controller that explores and estimates the state of the spatiotemporal environment. The uncertainty of the estimate is quantified using clarity, an information-theoretic metric, where higher clarity corresponds to lower uncertainty. By exploiting the relationship between GPs and Stochastic Differential Equations (SDEs) we quantify the increase in clarity of the estimated state at any position due to a measurement taken from any other position. We use this relationship to design two new coverage controllers, both of which scale well with the number of agents exploring the domain, assuming the robots can share the map of the clarity over the spatial domain via communication. We demonstrate the algorithms through a realistic simulation of a team of robots collecting wind data over a region in Austria.
|
|
10:20-10:40, Paper MoA03.2 | |
>Sub-Optimal Decentralized Navigation of Multiple Holonomic Agents in Simply-Connected Workspaces (I) |
|
Kotsinis, Dimitrios | Athina RC, University of Patras |
Bechlioulis, Charalampos P. | University of Patras |
Keywords: Cooperative control, Robotics, Agents-based systems
Abstract: In this paper, we propose a sub-optimal approach for the multi-agent navigation problem in simply-connected workspaces. We design a decentralized control law exhibiting the following three properties: (1) navigation of each agent with the optimal policy towards its destination, (2) avoidance of collision with other nearby agents and the workspace boundary, and (3) knowledge about the current position and not the destination of nearby agents. Moreover, we refer to the sub-optimal approach because the computational complexity and time needed to calculate the global optimal solution become unrealistic as the number of agents increases. In our case, each agent has a predetermined optimal policy calculated by a novel off-policy iterative method, to go towards its destination and it deviates from it in order to avoid collisions. The purpose of the simulation study is to examine how much the sub-optimal greedy trajectory of each agent deviates from the optimal one.
|
|
10:40-11:00, Paper MoA03.3 | |
>Asymptotic Consensus of Multi-Agent Systems with Unknown Nonlinear Dynamics Via Smooth Barrier Integral Control (I) |
|
Verginis, Christos | Uppsala University |
Keywords: Cooperative control, Adaptive control, Uncertain systems
Abstract: We consider the consensus problem for 2nd-order MIMO multi-agent systems with unknown nonlinear terms. We propose a novel control algorithm based on the Barrier Integral Control (BRIC) that combines reciprocal barrier functions with integral terms of the multi-agent disagreement errors and guarantees asymptotic consensus despite the unknown dynamics. The control algorithm is distributed, in the sense that each agent calculates its own control signal based on local information from its neighbouring agents, and does not use any a priori information from the agents' dynamics. Furthermore and unlike previous works, it does not rely on boundedness assumptions or approximation of the dynamic terms and constitutes smooth feedback of the multi-agent states. Finally, simulation results verify the theoretical findings.
|
|
11:00-11:20, Paper MoA03.4 | |
>Resilient Source Seeking with Robot Swarms (I) |
|
Acuaviva, Antonio | Lancaster University, Department of Mathematics |
Bautista, Jesús | University of Granada |
Yao, Weijia | Hunan University |
Jimenez Castellanos, Juan | Universidad Complutense De Madrid |
Garcia de Marina, Hector | Universidad De Granada |
Keywords: Autonomous robots, Distributed control, Large-scale systems
Abstract: We present a solution for locating the source, or maximum, of an unknown scalar field using a swarm of mobile robots. Unlike relying on the traditional gradient information, the swarm determines an ascending direction to approach the source with arbitrary precision. The ascending direction is calculated from field strength measurements at the robot locations and their relative positions concerning the swarm centroid. Rather than focusing on individual robots, we focus the analysis on the density of robots per unit area to guarantee a more resilient swarm, i.e., the functionality remains even if individuals go missing or are misplaced during the mission. We reinforce the algorithm's robustness by providing sufficient conditions for the swarm shape so that the ascending direction is almost parallel to the gradient. The swarm can respond to an unexpected environment by morphing its shape and exploiting the existence of multiple ascending directions. Finally, we validate our approach numerically with hundreds of robots. The fact that a large number of robots with a generic formation always calculate an ascending direction compensates for the potential loss of individuals.
|
|
11:20-11:40, Paper MoA03.5 | |
>Generalized Singularity-Free Controller Design for Distance-Based Multi-Agent Formations (I) |
|
Liu, Zhiyuan | University of Groningen |
Sun, Zhiyong | Eindhoven University of Technology (TU/e) |
Chen, Jin | University of Groningen |
Cao, Ming | University of Groningen |
Keywords: Stability of nonlinear systems, Lyapunov methods, Agents-based systems
Abstract: This paper aims to provide a generalized method of constructing distributed controllers for distance-based multi-agent formations; it also looks into possible degenerate cases when agents' positions are co-located. We first extend the definition of the distance error into a general function of the inter-agent distance, based on which a family of generalized controllers is derived. We then discuss singularities caused by co-located agents and amend some mistakes and gaps in the literature on distance-based formation control. These enable us to propose formation control methods of constructing singularity-free controllers while preserving the stability properties of the distance error system.
|
|
11:40-12:00, Paper MoA03.6 | |
>Range-Only Distributed Formation Control and Network Localization Based on Distributed Contracting Bearing Estimators (I) |
|
Chen, Jin | University of Groningen |
Marcantoni, Matteo | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Keywords: Cooperative control, Estimation, Agents-based systems
Abstract: This paper investigates the distributed formation control problem solely based on range-only measurement, without the availability of bearing or relative bearing information. Without bearing information, local relative position necessary for maintaining rigid formation control cannot be obtained only based on range measurement. Correspondingly, range-only bearing estimator for a single landmark is proposed that can be directly combined with standard distributed rigid formation control law. Asymptotic stability analysis of the desired formation shape is presented and the proposed distributed approach is validated via numerical simulations.
|
|
MoA04 |
Amber 3 |
Modeling, Control and Decisions for Natural Gas Systems and Components |
Invited Session |
Chair: Chertkov, Michael | University of Arizona |
Co-Chair: Zlotnik, Anatoly | Los Alamos National Laboratory |
Organizer: Chertkov, Michael | University of Arizona |
|
10:00-10:20, Paper MoA04.1 | |
>Mathematical Modeling of Chattering and the Optimal Design of a Valve (I) |
|
Andrea, Corli | University of Ferrara |
Razafison, Ulrich | University of Franche-Comté |
Rosini, Massimiliano Daniele | University of Chieti-Pescara |
Keywords: Nonlinear systems, Stability of nonlinear systems, Modeling
Abstract: We consider an isothermal flow through two pipes; at the junction, the flow is modified by some devices, for instance a valve. We first provide a general mathematical framework to model the coupling conditions for the flow at both sides of the junction. A key feature in the modeling is the {em coherence}; it is related to the chattering, i.e., the rapid switch on and off of a valve, which in turn is linked to the stability of the numerical schemes to find the solutions. We discuss the coherence of some models (for instance, for control valves), we provide numerical simulations showing the chattering, and finally give a procedure to eliminate it from a theoretical point of view.
|
|
10:20-10:40, Paper MoA04.2 | |
>Stochastic Finite Volume Method for Uncertainty Management in Gas Pipeline Network Flows (I) |
|
Kazi, Saif R. | Los Alamos National Laboratory |
Misra, Sidhant | Los Alamos National Laboratory |
Tokareva, Svetlana | Los Alamos National Laboratory |
Sundar, Kaarthik | Los Alamos National Laboratory |
Zlotnik, Anatoly | Los Alamos National Laboratory |
Keywords: Optimization, Uncertain systems, Energy systems
Abstract: Natural gas consumption by users of pipeline networks is subject to increasing uncertainty that originates from the intermittent nature of electric power loads serviced by gas-fired generators. To enable computationally efficient optimization of gas network flows subject to uncertainty, we develop a finite volume representation of stochastic solutions of hyperbolic partial differential equation (PDE) systems on graph-connected domains with nodal coupling and boundary conditions. The representation is used to express the physical constraints in stochastic optimization problems for gas flow allocation subject to uncertain parameters. The method is based on the stochastic finite volume approach that was recently developed for uncertainty quantification in transient flows represented by hyperbolic PDEs on graphs. In this study, we develop optimization formulations for steady-state gas flow over actuated transport networks subject to probabilistic constraints. In addition to the distributions for the physical solutions, we examine the dual variables that are produced by way of the optimization, and interpret them as price distributions that quantify the financial volatility that arises through demand uncertainty modeled in an optimization-driven gas market mechanism. We demonstrate the computation and distributional analysis using a single-pipe example and a small test network.
|
|
10:40-11:00, Paper MoA04.3 | |
>System-Wide Emergency Policy for Transitioning from Main to Secondary Fuel (I) |
|
Pagnier, Laurent | University of Arizona |
Hyett, Criston | University of Arizona |
Ferrando, Robert | University of Arizona |
Goldshtein, Igal | Noga, the Israel Independent System Operator |
Alisse, Jean | Noga, the Israel Independent System Operator |
Saban, Lilah | Noga, the Israel Independent System Operator |
Chertkov, Michael | University of Arizona |
Keywords: Power systems, Markov processes, Simulation
Abstract: Faced with the complexities of managing natural gas-dependent power system amid the surge of renewable integration and load unpredictability, this study explores strategies for navigating emergency transitions to costlier secondary fuels. Our aim is to develop decision-support tools for operators during such exigencies. We approach the problem through a Markov Decision Process (MDP) framework, accounting for multiple uncertainties. These include the potential for dual-fuel generator failures and operator response during high-pressure situations. Additionally, we consider the finite reserves of primary fuel, governed by gas-flow partial differential equations (PDEs) and constrained by nodal pressure. Other factors include the variability in power forecasts due to renewable generation and the economic impact of compulsory load shedding. For tractability, we address the MDP in a simplified context, replacing it by Markov Processes evaluated against a selection of policies and scenarios for comparison. Our study considers two models for the natural gas system: an over-simplified model tracking linepack and a more nuanced model that accounts for gas flow network heterogeneity. The efficacy of our methods is demonstrated using a realistic model replicating Israel's power-gas infrastructure.
|
|
11:00-11:20, Paper MoA04.4 | |
>Differentiable Simulator for Dynamic & Stochastic Optimal Gas & Power Flows (I) |
|
Hyett, Criston | University of Arizona |
Pagnier, Laurent | University of Arizona |
Alisse, Jean | Noga, the Israel Independent System Operator |
Goldshtein, Igal | Noga, the Israel Independent System Operator |
Saban, Lilah | Noga, the Israel Independent System Operator |
Ferrando, Robert | University of Arizona |
Chertkov, Michael | University of Arizona |
Keywords: Computational methods, Energy systems, Optimization
Abstract: In many power systems, particularly those isolated from larger intercontinental grids, reliance on natural gas is crucial. This dependence becomes particularly critical during periods of volatility or scarcity in renewable energy sources, further complicated by unpredictable consumption trends. To ensure the uninterrupted operation of these isolated gas-grid systems, innovative and efficient management strategies are essential. This paper investigates the complexities of achieving synchronized, dynamic, and stochastic optimization for autonomous transmission-level gas-grid infrastructures. We introduce a novel methodology grounded in differentiable programming, which synergizes symbolic programming, a conservative numerical method for solving gas-flow partial differential equations, and automated sensitivity analysis powered by SciML/Julia. Our methodology refines the simulation and co-optimization landscape for gas-grid systems by grounding gas dynamics in physics-adherent simulation. We demonstrate efficiency and precision of the methodology by solving a stochastic optimal gas flow problem, phrased on an open source model of Israel's gas grid model.
|
|
11:20-11:40, Paper MoA04.5 | |
>Modelling, Optimisation and Control of Full-Scale Co-Digestion Biomethane Plants |
|
Carecci, Davide | Politecnico Di Milano |
Catenacci, Arianna | Politecnico Di Milano |
Ficara, Elena | Politecnico Di Milano |
Ferretti, Gianni | Politecnico Di Milano |
Leva, Alberto | Politecnico Di Milano |
Keywords: Energy systems, Modeling, Optimization
Abstract: To match the growing demand for biomethane production, anaerobic digestors need an optimal management of the input diet, and the said diet must not be constrained to a single substrate — that is, co-digestion is required. Co-digestion is far more complicated to govern than single- substrate digestion, and constitutes a very hard challenge for the instrumentation and control equipment typically installed aboard full-scale plants. We propose a solution based on offline trajectory optimisation with the aid of a complex first-principle digestor model extended to embrace the co-digestion of complex substrates, followed by online control with a purpose-specific scheme to mitigate the risk of inhibition. We show simulation results and some preliminary tests on a pilot plant.
|
|
11:40-12:00, Paper MoA04.6 | |
>Robust Control of PEM Electrolyzer in a Renewable Energy Context |
|
Zasadzinski, Michel | Université De Lorraine & CRAN |
Ait Ziane, Meziane | GREEN, University of Lorraine |
Rafaralahy, Hugues | Université De Lorraine |
Keywords: Control applications, Energy systems, Robust control
Abstract: In this work, the current control of a proton exchange membrane electrolyzer (PEMEL) associated with a DC/DC converter is studied in a renewable energy sources context. Five realistic scenarii are considered including both supplied energy variations (smooth or abrupt) due to intermittent nature of renewable energy sources (RES) and process parameters uncertainties. The model used for the proton exchange membrane electrolyzer coupled to a DC/DC converter that is proposed in a paper and obtained from measurements made on an experimental test bench. To maximize hydrogen production, a two-degree of freedom H-infinity robust controller with integral action on the current is designed by using the normalized coprime factorization approach with loop-shaping. This robust H-infinity controller is compared with the PID designed in the above-mentioned paper. For the five scenarii above-mentioned, the robust H-infinity controller provides satisfactory results, but this is not the case for the PID controller. Thus, it is interesting to employ a more complex control law than the widely-used PID to take into account the problems generated by the use of renewable energy sources to reduce the greenhouse gas emissions.
|
|
MoA05 |
Amber 4 |
Data-Based Model Predictive Control for Nonlinear Systems |
Invited Session |
Chair: Magni, Lalo | Univ. of Pavia |
Co-Chair: Schimperna, Irene | University of Pavia |
Organizer: Magni, Lalo | Univ. of Pavia |
Organizer: Schimperna, Irene | University of Pavia |
|
10:00-10:20, Paper MoA05.1 | |
>Estimation and MPC Control Based on Gated Recurrent Unit Neural Networks with Unknown Disturbances (I) |
|
Masero Rubio, Eva | Politecnico Di Milano |
Bonassi, Fabio | Uppsala University |
La Bella, Alessio | Politecnico Di Milano |
Scattolini, Riccardo | Politecnico Di Milano |
Keywords: Machine learning, Predictive control for nonlinear systems, Learning
Abstract: This paper proposes a nonlinear model predictive control (NMPC) approach for incrementally input-to-state stable gated recurrent units (GRU) neural networks affected by state and output disturbances. In particular, a Luenberger-like observer is designed for state and disturbance estimation with guaranteed convergence properties. This paves the way for the design of an NMPC regulator capable of rejecting unknown piecewise-constant disturbances. The method is tested in simulation on a nonlinear benchmark system, i.e., a chemical reaction process, showing promising results.
|
|
10:20-10:40, Paper MoA05.2 | |
>Recurrent Neural Network Based MPC for Systems with Input and Incremental Input Constraints |
|
Schimperna, Irene | University of Pavia |
Galuppini, Giacomo | University of Pavia |
Magni, Lalo | Univ. of Pavia |
Keywords: Predictive control for nonlinear systems, Constrained control, Neural networks
Abstract: This paper proposes a stabilizing Model Predictive Control algorithm, specifically designed to handle systems learned by Incrementally Input-to-State Stable Recurrent Neural Networks, in presence of input and incremental input constraints. Closed-loop stability is proven by relying on the Incremental Input-to-State Stability property of the model, and on a terminal equality constraint involving the control sequence only. The Incremental Input-to-State Stability is also used to derive a suitable formulation of the Model Predictive Control terminal cost. The proposed control algorithm can be readily applied to a wide range of Recurrent Neural Networks, including Gated Recurrent Units, Echo State Networks, and Neural Nonlinear Autoregressive eXogenous models. Furthermore, this work specializes the approach to handle the particular case of Long Short-Term Memory Networks, and showcases its effectiveness on a four tanks process benchmark.
|
|
10:40-11:00, Paper MoA05.3 | |
>Phased LSTM-Based MPC for Modeling and Control of Nonlinear Systems Using Asynchronous and Delayed Measurement Data (I) |
|
Wu, Wanlu | National University of Singapore |
Wang, Yujia | National University of Singapore |
Zhang, Mingqing | Beijing University of Chemical Technology |
Chiu, Min-Sen | National University of Singapore |
Wu, Zhe | National University of Singapore |
Keywords: Machine learning, Predictive control for nonlinear systems, Chemical process control
Abstract: In this work, we develop novel machine learning modeling and predictive control techniques for nonlinear chemical systems with asynchronous and delayed measurements in both offline and online data collection. Specifically, Phased Long Short-Term Memory (PLSTM) network is used to learn the process dynamics amidst the irregularities in the data, during the offline training process. The generalization performance of PLSTM is theoretically studied on the basis of statistical machine learning theory to better understand the capabilities of PLSTM models. The PLSTM model is employed to forecast the evolution of states for a Lyapunov-based Model Predictive Controller (LMPC) that is designed to account for data loss and delays in real-time implementation. Finally, an application to a benchmark chemical process is adopted to show the effectiveness of PLSTM modeling and predictive control methods.
|
|
11:00-11:20, Paper MoA05.4 | |
>Koopman Data-Driven Predictive Control with Robust Stability and Recursive Feasibility Guarantees (I) |
|
de Jong, Thomas O. | Eindhoven University of Technology |
Breschi, Valentina | Eindhoven University of Technology |
Schoukens, Maarten | Eindhoven University of Technology |
Lazar, Mircea | Eindhoven University of Technology |
Keywords: Data driven control, Predictive control for nonlinear systems, Stability of nonlinear systems
Abstract: In this paper, we consider the design of data-driven predictive controllers for nonlinear systems from input-output data using linear-in-control input Koopman lifted models. Instead of identifying and simulating a Koopman model to predict future outputs, we design a subspace predictive controller in the Koopman space. This allows us to learn the observables minimizing the multi-step output prediction error, preventing the propagation of prediction errors. We compute a terminal cost and terminal set in the Koopman space, and we obtain recursive feasibility guarantees through an interpolated initial state. As a third contribution, we introduce a novel regularization cost yielding input-to-state stability guarantees with respect to the prediction error for the resulting closed-loop system. The performance is illustrated on a nonlinear benchmark example from the literature.
|
|
11:20-11:40, Paper MoA05.5 | |
>Data-Driven MPC with Terminal Conditions in the Koopman Framework (I) |
|
Worthmann, Karl | Technische Universität Ilmenau |
Strässer, Robin | University of Stuttgart |
Schaller, Manuel | Technische Universität Ilmenau |
Berberich, Julian | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Predictive control for nonlinear systems, Data driven control, Stability of nonlinear systems
Abstract: We investigate nonlinear model predictive control (MPC) with terminal conditions in the Koopman framework using extended dynamic mode decomposition (EDMD) to generate a data-based surrogate model for prediction and optimization. We rigorously show recursive feasibility and prove practical asymptotic stability w.r.t. the approximation accuracy. To this end, finite-data error bounds are employed. The construction of the terminal conditions is based on recently derived proportional error bounds to ensure the required Lyapunov decrease. Finally, we illustrate the effectiveness of the proposed data-driven predictive controller including the design procedure to construct the terminal region and controller.
|
|
11:40-12:00, Paper MoA05.6 | |
>Robust Data-Driven Energy Management for Pumped Thermal Electricity Storage with Unknown Nonlinear Dynamics (I) |
|
Shi, Yanyan | The University of Manchester |
Xu, Yiqiao | University of Manchester |
Wan, Quan | University of Manchester |
Parisio, Alessandra | The University of Manchester |
Keywords: Data driven control, Energy systems, Predictive control for nonlinear systems
Abstract: Energy storage solutions are becoming increasingly important due to their capability to mitigate the variability of renewable power generation. Among these, Pumped Thermal Electricity Storage (PTES) offers a grid-level, long-lasting, and environmentally friendly alternative to batteries. However, the dynamic modeling of PTES is challenging due to its complex system architecture, which includes many interdependent components that interact in nonlinear ways. In this paper, we propose a robust data-driven energy management strategy based on Zonotopic Predictive Control (ZPC) and MixedInteger Programming (MIP) for PTES with unknown nonlinear dynamics connected to a residential Multi-Energy System (MES). Data-driven reachable sets, derived from past input-output trajectory data, are constructed as a substitute for a dynamic model. To account for noisy data, a matrix zonotope recursion that over-approximates the reachable sets is employed to ensure robust constraint satisfaction. Simulation results demonstrate the effectiveness and robustness of the approach, showing costefficient storage of off-peak and excess renewable electricity as heat, which is released during peak hours. An average reduction in the renewable curtailment rate from 6.07% to 2.94% is observed due to the use of PTES.
|
|
MoA06 |
Amber 5 |
Control of Networks |
Regular Session |
Chair: Ishii, Hideaki | University of Tokyo |
Co-Chair: De Lellis, Pietro | University of Naples Federico II |
|
10:00-10:20, Paper MoA06.1 | |
>Pinning Control in Networks of Nonidentical Systems with Many-Body Interactions |
|
Rizzello, Roberto | University of Naples Federico II |
De Lellis, Pietro | University of Naples Federico II |
Keywords: Control of networks, Network analysis and control, Decentralized control
Abstract: We study the problem of controlling an ensemble of nonidentical dynamical units to a desired trajectory set by the pinner in the presence of multi-body interactions between the units. We provide necessary and sufficient conditions for local bounded convergence, and estimate the convergence bound as a function of the parameter mismatch between the units, and of the directed hypergraph describing their interacting topology. Numerical simulations on a network of Rössler oscillators are performed to illustrate the robustness of the approach.
|
|
10:20-10:40, Paper MoA06.2 | |
>Bipartite Synchronization of Markovian Jumping Networks with Dynamical Couplings and Cooperation-Competition Mechanism |
|
Zhang, Xiaomei | Nantong University |
Zhang, Zhenjuan | Nantong University |
Ren, Jie | Nantong University |
Zhao, Min | Nantong University |
Sheng, Suying | Nantong University, School of Electronics and Information |
Keywords: Control of networks, Lyapunov methods, Markov processes
Abstract: Bipartite synchronization control for Markovian jumping networks with signed graphs is investigated in this paper. The relationships between some nodes are cooperative, and the relationships between another nodes are competitive. In addition, the coupling connections in the considered network exhibit dynamic behavior. The bipartite synchronization problem is formulated as a stabilization problem for the Markovian jumping system, where the analysis based on the Lyapunov stability theory shows that the bipartite synchronization is achieved in the sense that the bipartite synchronization error system is asymptotically stable in the mean square. The effectiveness of the obtained result is shown in a numerical simulation using a cooperation-competition network of single-link robot arms.
|
|
10:40-11:00, Paper MoA06.3 | |
>Construction of the Sparsest Maximally R-Robust Graphs |
|
Lee, Haejoon | University of Michigan |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Control of networks, Network analysis and control, Networked control systems
Abstract: In recent years, the notion of r-robustness for the communication graph of the network has been introduced to address the challenge of achieving consensus in the presence of misbehaving agents. Higher r-robustness typically implies higher tolerance to malicious information towards achieving resilient consensus, but it also implies more edges for the communication graph. This in turn conflicts with the need to minimize communication due to limited resources in real-world applications (e.g., multi-robot networks). In this paper, our contributions are twofold. (a) We provide the necessary subgraph structures and tight lower bounds on the number of edges required for graphs with a given number of nodes to achieve maximum robustness. (b) We then use the results of (a) to introduce two classes of graphs that maintain maximum robustness with the least number of edges. Our work is validated through a series of simulations.
|
|
11:00-11:20, Paper MoA06.4 | |
>Algebraic Riccati Equation Approach for Network Distributed Optimal (mathcal{H}_2) Synthesis |
|
Zou, Yuanji | University of Minnesota |
Elia, Nicola | University of Minnesota |
Keywords: Control of networks, Optimization algorithms
Abstract: In this paper, we solve the optimal H2 control problem distributed over networks with an arbitrary graph, with the assumption that the controller network has no delays but only sparsity constraints. Our solution extends the previous approach valid for strongly connected graphs. Under the quadratic invariance structure assumption, we follow the Youla parameterization framework for the distributed control synthesis setup. We show that the optimal solution is finite-dimensional and can be computed by solving three algebraic Riccati equations: two are standard for centralized H2 control, while the third emerges from the sparsity constraints imposed by the network. We present a 4-car platoon example for method validation.
|
|
11:20-11:40, Paper MoA06.5 | |
>Cluster Synchronization of Kuramoto Oscillators Via Pacemakers and Mean-Phase Feedback Control |
|
Kokubo, Ryota | Tokyo Institute of Technology |
Matsubara, Mitsuaki | Tokyo Institute of Technology |
Kato, Rui | Tokyo Institute of Technology |
Ishii, Hideaki | University of Tokyo |
Keywords: Control of networks, Optimization
Abstract: Brain networks typically exhibit specific synchronization patterns where several synchronized clusters coexist. On the other hand, neurological disorders are considered to be related to pathological synchronization such as excessive synchronization of large populations of neurons. Motivated by these facts, this paper presents two approaches to control the cluster synchronization of Kuramoto oscillators. One is based on the use of pacemakers to the clusters, and the other is based on feeding back the mean phases to the clusters. We first show conditions on the pacemaker weights and the feedback gains for the network to achieve cluster synchronization. Then, we propose a method to find optimal feedback gains through convex optimization. A numerical example demonstrates the effectiveness of the proposed methods.
|
|
11:40-12:00, Paper MoA06.6 | |
>Minimal Input Structural Modifications for Strongly Structural Controllability |
|
Joseph, Geethu | TU Delft |
Moothedath, Shana | Iowa State University |
Lin, Jiabin | Iowa State University |
Keywords: Control of networks, Uncertain systems, Randomized algorithms
Abstract: This paper studies the problem of modifying the input matrix of a structured system to make the system strongly structurally controllable. We focus on the generalized structured systems that rely on zero/nonzero/arbitrary structure, i.e., some entries of system matrices are zeros, some are nonzero, and the remaining entries can be zero or nonzero (arbitrary). We derive the feasibility conditions of the problem, and if it is feasible, we reformulate it into another equivalent problem. This new formulation leads to a greedy heuristic algorithm. However, we also show that the greedy algorithm can give arbitrarily poor solutions for some special systems. Our alternative approach is a randomized Markov chain Monte Carlo-based algorithm. Unlike the greedy algorithm, this algorithm is guaranteed to converge to an optimal solution with high probability. Finally, we numerically evaluate the algorithms on random graphs to show that the algorithms perform well.
|
|
MoA07 |
Amber 6 |
Game Theory I |
Regular Session |
Chair: Dotoli, Mariagrazia | Politecnico Di Bari |
Co-Chair: Park, Shinkyu | KAUST |
|
10:00-10:20, Paper MoA07.1 | |
>FlexNet: An Open-Formation Configuration for Cooperative Herding in Pursuit-Evasion Games with Field of View Interactions |
|
Zhu, Yutong | Northwestern Polytechnical University |
Zhang, Ye | Northwestern Polytechnical University |
Yuan, Yuan | Beihang University |
Keywords: Game theory, Agents-based systems, Algebraic/geometric methods
Abstract: This paper presents a method for herding and capturing a swarm of adversarial agents. An open formation, FlexNet, is formed, which can achieve effective capture by adjusting the size and shape of the formation based on the distance between the pursuers and evaders. A capture strategy is given through rigorous analytical proof under a novel interaction model. We demonstrate that the conditions for successful capture and the proposed strategy allow the construction of a continuous dynamics structure to hold. Also, we introduce the FOV model to each pursuer and extend the single-axis motion to the whole 2D environment. Simulations are provided to demonstrate the efficacy of the proposed approach.
|
|
10:20-10:40, Paper MoA07.2 | |
>Learning Equilibrium with Estimated Payoffs in Population Games |
|
Park, Shinkyu | KAUST |
Keywords: Game theory, Agents-based systems, Distributed control
Abstract: We study a multi-agent decision problem in population games, where agents select from multiple available strategies and continually revise their selections based on the payoffs associated with these strategies. Unlike conventional population game formulations, we consider a scenario where agents must estimate the payoffs through local measurements and communication with their neighbors. By employing task allocation games -- dynamic extensions of conventional population games -- we examine how errors in payoff estimation by individual agents affect the convergence of the strategy revision process. Our main contribution is an analysis of how estimation errors impact the convergence of the agents' strategy profile to equilibrium. Based on the analytical results, we propose a design for a time-varying strategy revision rate to guarantee convergence. Simulation studies illustrate how the proposed method for updating the revision rate facilitates convergence to equilibrium.
|
|
10:40-11:00, Paper MoA07.3 | |
>Equilibrium Seeking in Learning-Based Noncooperative Nash Games |
|
Scarabaggio, Paolo | Politecnico Di Bari |
Mignoni, Nicola | Politecnico Di Bari |
Carli, Raffaele | Polytechnic of Bari |
Dotoli, Mariagrazia | Politecnico Di Bari |
Keywords: Game theory, Agents-based systems, Learning
Abstract: Traditionally based on convexity, multi-agent decision-making models can hardly handle scenarios where agents' cost functions defy this assumption, which is specifically required to ensure the existence of several equilibrium concepts. More recently, the advent of machine learning (ML), with its inherent non-convexity, has changed the conventional approach of pursuing convexity at all costs. This paper explores and integrates the robustness of game theoretic frameworks in managing conflicts among agents with the capacity of ML approaches, such as deep neural networks (DNNs), to capture complex agent behaviors. Specifically, we employ feed-forward DNNs to characterize agents' best response actions rather than modeling their goals with convex functions. We introduce a technical assumption on the weight of the DNN to establish the existence and uniqueness of Nash equilibria and present two distributed algorithms based on fixed-point iterations for their computation. Finally, we demonstrate the practical application of our framework to a noncooperative community of smart energy users under a dynamic time-of-use energy pricing scheme.
|
|
11:00-11:20, Paper MoA07.4 | |
>Maximizing Revenue from Selfish Agents in Crowd Tasks: Indirect Incentive Strategies |
|
Montazeri, Mina | Empa |
Gatti, Nicola | Politecnico Di Milano |
Kebriaei, Hamed | University of Tehran |
Castiglioni, Matteo | Politecnico Di Milano |
Romano, Giulia | Politecnico Di Milano |
Keywords: Game theory, Agents-based systems, Optimization
Abstract: We study mechanisms incentivizing the contribution of selfish agents addressing crowd tasks in a Bayesian setting with externalities due to network effects. A notable example is crowdsensing, where a large group of individuals, with mobile devices capable of sensing and computing collectively, share data. The central problem we investigate is the relationship between direct and indirect mechanisms. Indeed, while direct mechanisms represent tools to address implementation problems optimally, the requirement that the contribution level of every agent when addressing the task is chosen by the mechanism makes these mechanisms hardly used in practice. On the other hand, while indirect mechanisms allow every agent to be free to choose their contribution level, these mechanisms may be highly inefficient. Our desideratum is to design indirect mechanisms that closely match the performance of optimal direct mechanisms. We design an indirect mechanism without network effects such that the Price of Anarchy over the revenue is one. We also extend our results to the case with network effects, providing an upper bound to the Price of Stability and showing that the inefficiency is low in practice. Our results suggest that indirect revelation mechanisms can be an excellent option in real-world applications.
|
|
11:20-11:40, Paper MoA07.5 | |
>Follower Agnostic Learning in Stackelberg Games |
|
Maheshwari, Chinmay | University of California Berkeley |
Cheng, James | University of California Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Ratliff, Lillian J. | University of Washington |
Mazumdar, Eric | California Institute of Technology |
Keywords: Game theory, Agents-based systems, Optimization algorithms
Abstract: In this paper, we present an efficient algorithm to solve online Stackelberg games, featuring multiple followers, in a follower-agnostic manner. Unlike previous works, our approach works even when leader has no knowledge about the followers' utility functions, strategy space or learning algorithm. Our algorithm introduces a unique gradient estimator, leveraging specially designed strategies to probe followers. In a departure from traditional assumptions of optimal play, we model followers' responses using a convergent adaptation rule, allowing for realistic and dynamic interactions. The leader constructs the gradient estimator solely based on observations of followers' actions. We provide both non-asymptotic convergence rates to stationary points of the leader's objective and demonstrate asymptotic convergence to a emph{local Stackelberg equilibrium}. To validate the effectiveness of our algorithm, we use this algorithm to solve the problem of incentive design on a large-scale transportation network, showcasing its robustness even when the leader lacks access to followers' demand information.
|
|
11:40-12:00, Paper MoA07.6 | |
>Coordination in Noncooperative Multiplayer Matrix Games Via Reduced Rank Correlated Equilibria |
|
Im, Jaehan | University of Texas at Austin |
Yu, Yue | University of Minnesota |
Fridovich-Keil, David | The University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Game theory, Air traffic management, Agents-based systems
Abstract: Coordination in multiplayer games enables players to avoid the lose-lose outcome that often arises at Nash equilibria. However, designing a coordination mechanism typically requires the consideration of the joint actions of all players, which becomes intractable in large-scale games. We develop a novel coordination mechanism, termed reduced rank correlated equilibria. The idea is to approximate the set of all joint actions with the actions used in a set of pre-computed Nash equilibria via a convex hull operation. In a game with n players and each player having m actions, the proposed mechanism reduces the number of joint actions considered from O(m^n) to O(mn) and thereby mitigates computational complexity. We demonstrate the application of the proposed mechanism to an air traffic queue management problem. Compared with the correlated equilibrium-a popular benchmark coordination mechanism-the proposed approach is capable of solving a queue management problem involving four thousand times more joint actions while yielding similar or better performance in terms of a fairness indicator and showing a maximum optimality gap of 0.066% in terms of the average delay cost. In the meantime, it yields a solution that shows a 58.5% to 99.5% improvement in a fairness indicator and a 1.8% to 50.4% reduction in average delay cost compared to the Nash solution, which does not involve coordination.
|
|
MoA08 |
Amber 7 |
Optimal Control I |
Regular Session |
Chair: Di Giamberardino, Paolo | Sapienza, Universita' Di Roma |
Co-Chair: Goreac, Dan | Université Paris-Est, LAMA (UMR 8050), UPEMLV, UPEC, CNRS, Marne-La-Vallée, France |
|
10:00-10:20, Paper MoA08.1 | |
>Modified Legendre-Gauss Collocation Method for Optimal Control Problems with Nonsmooth Solutions |
|
Abadia-Doyle, Gabriela | University of Florida |
Rao, Anil V. | University of Florida |
Keywords: Optimal control, Computational methods, Variational methods
Abstract: A modified form of Legendre-Gauss orthogonal direct collocation is developed for solving optimal control problems whose solutions are nonsmooth due to control discontinuities. This new method adds switch time variables, control variables, and collocation conditions at both endpoints of a mesh interval, whereas these new variables and collocation conditions are not included in standard Legendre-Gauss orthogonal collocation. The modified Legendre-Gauss collocation method alters the search space of the resulting nonlinear programming problem and optimizes the switch point of the control solution. The transformed adjoint system of the modified Legendre-Gauss collocation method is then derived and shown to satisfy the necessary conditions for optimality. Finally, an example is provided where the optimal control is bang-bang and contains multiple switches. This method is shown to be capable of solving complex optimal control problems with nonsmooth solutions.
|
|
10:20-10:40, Paper MoA08.2 | |
>Properties of Singular Solutions in Optimal Control Problems under Input Dynamic Extension |
|
Di Giamberardino, Paolo | Sapienza, Universita' Di Roma |
Iacoviello, Daniela | University of Rome "La Sapienza" |
Keywords: Optimal control, Constrained control, Variational methods
Abstract: The paper addresses the problem of optimal control design in presence of singular solutions for single input dynamics. The dynamical extension for systems obtained adding an integrator on the input is addressed and analyzed. The possibility of computing the optimal control for dynamically extended systems from the solution of the initial ones is investigated, as well as the inverse procedure. These relationships are well evidenced for the singular solutions, showing the possibility of simplifying the optimal control computation. An example is introduced to better highlight the presented results.
|
|
10:40-11:00, Paper MoA08.3 | |
>Optimal Control under Action Duration Constraint for Non-Convex Dynamics |
|
Goreac, Dan | Université Laval |
Rapaport, Alain | INRAE & Univ. Montpellier |
Keywords: Optimal control, Nonlinear systems, Constrained control
Abstract: The motivation of this paper stems from a family of optimal control problems wherein the control’s active duration is constrained within a predefined limit. The duration constraint can be perceived as an additional variable in the dynamics, and the relaxation of the naturally associated control problem is equivalent to a an L1-constraint. The paper provides a generalization of a preliminary work by the authors to encompass scenarios with non-convex dynamics. The relaxed problems are formulated through Linear Programming techniques and their qualitative properties, alongside the inter-relations between different formulations, are investigated.
|
|
11:00-11:20, Paper MoA08.4 | |
>Finite-Frequency Dynamic Output-Feedback Synthesis for Lur'e-Type Control Systems |
|
Shakib, Fahim | Imperial College London |
Huang, Guilin | Imperial College London |
Keywords: Optimal control, Nonlinear systems, Stability of nonlinear systems
Abstract: This paper introduces a finite-frequency dynamic output-feedback control synthesis approach for nonlinear Lur'e-type control systems, focusing on finite-frequency disturbances. The method addresses the challenge of handling the infinite number of higher harmonics resulting from nonlinearities. Despite the infinite number of higher harmonics, we present a performance bound for the closed-loop nonlinear control system analogues to the L2-gain for linear time-invariant systems. The synthesis problem then aims to minimise this performance bound while ensuring closed-loop stability through the notion of global asymptotic convergence. A numerical example on a nonlinear mechanical system demonstrates improved performance compared to traditional full-frequency output-feedback control.
|
|
11:20-11:40, Paper MoA08.5 | |
>Convexity in Optimal Control Problems |
|
Abhijeet, Fnu | Texas A&M University |
Gul Mohamed, Mohamed Naveed | Texas A&M University |
Sharma, Aayushman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Optimal control, Nonlinear systems, Numerical algorithms
Abstract: This paper investigates the central role played by the Hamiltonian in continuous-time nonlinear optimal control problems. We show that the strict convexity of the Hamiltonian in the control variable is a sufficient condition for the existence of a unique optimal trajectory, and the nonlinearity/non-convexity of the dynamics and the cost are immaterial. The analysis is extended to discrete-time problems, revealing that discretization destroys the convex Hamiltonian structure, leading to multiple spurious optima, unless the time discretization is sufficiently small. We present simulated results comparing the ``indirect" Iterative Linear Quadratic Regulator (iLQR) and the ``direct" Sequential Quadratic Programming (SQP) approach for solving the optimal control problem for the cartpole and pendulum models to validate the theoretical analysis. Results show that the ILQR always converges to the ``globally" optimum solution while the SQP approach gets stuck in spurious minima given multiple random initial guesses for a time discretization that is insufficiently small, while both converge to the same unique solution if the discretization is sufficiently small.
|
|
11:40-12:00, Paper MoA08.6 | |
>Computational Reduction for Systems with Low-Dimensional Nonlinearities Via Staticization-Based Duality |
|
McEneaney, William M. | Univ. California San Diego |
Dower, Peter M. | University of Melbourne |
Zheng, Yifei | University of California San Diego |
McEneaney, William | University of California, San Diego |
Keywords: Optimal control, Nonlinear systems
Abstract: A finite-horizon nonlinear optimal control problem is considered. Stat-quad duality is used to generate an equivalent problem with linear dynamics and a modification term in the running cost and two auxiliary controls processes. This problem form is used to obtain a representation of the value function as a staticization problem over a set of quadratic functions, where the coefficients of the quadratics consists of the solution to a differential Riccati equation, a linear ODE and an integral. This representation allows the value function to be evaluated independently at any time and any point in the state space. A specialized numerical method is proposed for solving the resulting staticization problem, which is able to leverage the low dimensionality of nonlinearity. A numerical example with five-dimensional state space is included.
|
|
MoA09 |
Amber 8 |
Predictive Control for Linear Systems I |
Regular Session |
Chair: Limon, Daniel | Universidad De Sevilla |
Co-Chair: Arnström, Daniel | Uppsala University |
|
10:00-10:20, Paper MoA09.1 | |
>Variant Predictor-Corrector Method for Linear Predictive Control Using Modified Uzawa Algorithm |
|
Liu, Tianchen | University of Maryland, College Park |
Chakrabarti, Kushal | Tata Consultancy Services Research |
Chopra, Nikhil | University of Maryland, College Park |
Keywords: Predictive control for linear systems
Abstract: In this paper, a new variant of the predictor-corrector interior point method (IPM) pipeline is proposed for model predictive control (MPC) problems for linear time-invariant systems, which can be reformulated as quadratic programming (QP) problems. At each iteration in the IPM, finding the search direction via solving a linear system of equations is usually the step with the highest computational cost. A modified Uzawa algorithm is developed to improve the performance in the proposed IPM, which can address the ill-conditioning issue at the late iterations and reduce computational cost. Results of an MPC problem example are presented to show the performance of the proposed pipeline.
|
|
10:20-10:40, Paper MoA09.2 | |
>Economic Model Predictive Control for Periodic Operation: A Quadratic Programming Approach |
|
Borja-Conde, Jose A. | University of Seville |
Moreno Nadales, Juan | Universidad De Sevilla |
Fele, Filiberto | University of Seville |
Limon, Daniel | Universidad De Sevilla |
Keywords: Predictive control for linear systems, Optimal control, Time-varying systems
Abstract: Periodic dynamical systems, distinguished by their repetitive behavior over time, are prevalent across various engineering disciplines. In numerous applications, particularly within industrial contexts, the implementation of model predictive control (MPC) schemes tailored to optimize specific economic criteria was shown to offer substantial advantages. However, the real-time implementation of these schemes is often infeasible due to limited computational resources. To tackle this problem, we propose a resource-efficient economic model predictive control scheme for periodic systems, leveraging existing single-layer MPC techniques. Our method relies on a single quadratic optimization problem, which ensures high computational efficiency for real-time control in dynamic settings. We prove feasibility, stability and convergence to optimum of the proposed approach, and validate the effectiveness through numerical experiments.
|
|
10:40-11:00, Paper MoA09.3 | |
>Harnessing Data for Accelerating Model Predictive Control by Constraint Removal |
|
Hou, Zhinan | Tsinghua University |
Zhao, Feiran | Tsinghua University |
You, Keyou | Tsinghua University |
Keywords: Predictive control for linear systems, Optimization, Linear systems
Abstract: Model predictive control (MPC) solves a receding-horizon optimization problem in real-time, which can be computationally demanding when there are thousands of constraints. To accelerate online computation of MPC, we utilize data to adaptively remove the constraints while maintaining the MPC policy unchanged. Specifically, we design the removal rule based on the Lipschitz continuity of the MPC policy. This removal rule can use the information of historical data according to the Lipschitz constant and the distance between the current state and historical states. In particular, we provide the explicit expression for calculating the Lipschitz constant by the model parameters. Finally, simulations are performed to validate the effectiveness of the proposed method.
|
|
11:00-11:20, Paper MoA09.4 | |
>Implementation of Soft-Constrained MPC for Tracking Using Its Semi-Banded Problem Structure |
|
Gracia, Victor | Universidad De Sevilla |
Krupa, Pablo | Gran Sasso Science Institute |
Limon, Daniel | Universidad De Sevilla |
Alamo, Teodoro | Universidad De Sevilla |
Keywords: Predictive control for linear systems, Optimization algorithms, Embedded systems
Abstract: Model Predictive Control (MPC) is a popular control approach due to its ability to consider constraints, including input and state restrictions, while minimizing a cost function. However, in practice, these constraints can result in feasibility issues, either because the system model is not accurate or due to the existence of external disturbances. To mitigate this problem, a solution adopted by the MPC community is the use of soft constraints. In this article, we consider a not-so-typical methodology to encode soft constraints in a particular MPC formulation known as MPC for Tracking (MPCT), which has several advantages when compared to standard MPC formulations. The motivation behind the proposed encoding is to maintain the semi-banded structure of the ingredients of a recently proposed solver for the considered MPCT formulation, thus providing an efficient and fast solver when compared to alternative approaches from the literature. We show numerical results highlighting the benefits of the formulation and the computational efficiency of the solver.
|
|
11:20-11:40, Paper MoA09.5 | |
>Recursive Feasibility Guarantees in Multi-Horizon MPC |
|
Behrunani, Varsha Naresh | ETH Zürich |
Heer, Philipp | Empa |
Smith, Roy S. | ETH Zurich |
Lygeros, John | ETH Zurich |
Keywords: Predictive control for linear systems, Optimization, Linear systems
Abstract: Multi-Horizon model predictive control (MPC) is a method that uses time coarsening to increase the prediction horizon by using several models, each with a different sampling time that gradually increases later in the horizon. This facilitates having a longer prediction interval without significantly impacting the computational load or compromising the response time. However, the use of models with different granularity make guaranteeing recursive feasibility challenging as conventional approaches cannot be applied directly. This work proposes a constraint tightening strategy to enforce recursive feasibility in time-invariant multi-horizon MPC schemes. The state constraint in the optimization is replaced by adaptive state and input constraint at each time step that depend on the sampling time to ensure that the trajectory remains feasible between two sampling points even as the sampling time increases. An extensive numerical study illustrates the effectiveness and scalability of our approach and compares its performance to standard MPC and multi-horizon MPC controllers without any constraint tightening.
|
|
11:40-12:00, Paper MoA09.6 | |
>A High-Performant Multi-Parametric Quadratic Programming Solver |
|
Arnström, Daniel | Uppsala University |
Axehill, Daniel | Linköping University |
Keywords: Predictive control for linear systems, Optimization algorithms, Optimization
Abstract: We propose a combinatorial method for computing explicit solutions to multi-parametric quadratic programs, which can be used to compute explicit control laws for linear model predictive control. In contrast to classical methods, which are based on geometrical adjacency, the proposed method is based on combinatorial adjacency. After introducing the notion of combinatorial adjacency, we show that the explicit solution forms a connected graph in terms of it. We then leverage this connectedness to propose an algorithm that computes the explicit solution. The purely combinatorial nature of the algorithm leads to computational advantages since it enables demanding geometrical operations (such as computing facets of polytopes) to be avoided. Compared with classical combinatorial methods, the proposed method requires fewer combinations to be considered by exploiting combinatorial connectedness. We show that an implementation of the proposed method can yield a speedup of about two orders of magnitude compared with state-of-the-art software packages such as MPT and POP.
|
|
MoA10 |
Brown 1 |
Safe Planning and Control with Uncertainty Quantification I |
Invited Session |
Chair: Aolaritei, Liviu | ETH Zurich |
Co-Chair: Haesaert, Sofie | Eindhoven University of Technology |
Organizer: Aolaritei, Liviu | UC Berkeley |
Organizer: Yu, Pian | School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology |
Organizer: Gao, Yulong | Imperial College London |
Organizer: Lindemann, Lars | University of Southern California |
Organizer: Haesaert, Sofie | Eindhoven University of Technology |
|
10:00-10:20, Paper MoA10.1 | |
>Reachability Analysis Using Constrained Polynomial Logical Zonotopes |
|
Hafez, Ahmad | Technical University of Munich; TUM School of Computation, Infor |
Jiang, Frank J. | Royal Institute of Technology |
Johansson, Karl H. | KTH Royal Institute of Technology |
Alanwar, Amr | Technical University of Munich |
Keywords: Boolean control networks and logic networks, Uncertain systems
Abstract: In this paper, we propose reachability analysis using constrained polynomial logical zonotopes. We perform reachability analysis to compute the set of states that could be reached. To do this, we utilize a recently introduced set representation called polynomial logical zonotopes for performing computationally efficient and exact reachability analysis on logical systems. Notably, polynomial logical zonotopes address the "curse of dimensionality" when analyzing the reachability of logical systems since the set representation can represent 2^h binary vectors using h generators. After finishing the reachability analysis, the formal verification involves verifying whether the intersection of the calculated reachable set and the unsafe set is empty or not. Polynomial logical zonotopes lack closure under intersections, prompting the formulation of constrained polynomial logical zonotopes, which preserve the computational efficiency and exactness of polynomial logical zonotopes for reachability analysis while enabling exact intersections. Additionally, an extensive empirical study is presented to demonstrate and validate the advantages of constrained polynomial logical zonotopes.
|
|
10:20-10:40, Paper MoA10.2 | |
>Robust STL Control Synthesis under Maximal Disturbance Sets (I) |
|
Verhagen, Joris | KTH Royal Institute of Technology |
Lindemann, Lars | University of Southern California |
Tumova, Jana | KTH Royal Institute of Technology |
Keywords: Formal Verification/Synthesis, Autonomous robots
Abstract: This work addresses maximally robust control synthesis under unknown disturbances. We consider a nonlinear system, subject to a Signal Temporal Logic (STL) specification and jointly synthesize the maximal possible disturbance bounds and the corresponding controllers that ensure the STL specification is satisfied under these bounds. Many works have considered STL satisfaction under given bounded disturbances yet, to the authors' best knowledge, this is the first work that aims to maximize the permissible disturbance set and find corresponding maximally robust controllers. We, therefore, introduce disturbance robustness as a model-based robustness metric for STL planning and control synthesis. We extend the notion of disturbance-robust semantics for STL, which is a property of a specification, dynamical system, and controller, and provide an algorithm for maximally robust controllers satisfying an STL specification using Hamilton-Jacobi reachability. We show its soundness and provide a simulation example with an Autonomous Underwater Vehicle (AUV).
|
|
10:40-11:00, Paper MoA10.3 | |
>Learning-Based Efficient Approximation of Data-Enabled Predictive Control (I) |
|
Zhou, Yihan | Tsinghua University |
Lu, Yiwen | Tsinghua University |
Li, Zishuo | Tsinghua University |
Yan, Jiaqi | ETH Zurich |
Mo, Yilin | Tsinghua University |
Keywords: Data driven control, Learning, Model/Controller reduction
Abstract: Data-Enabled Predictive Control (DeePC) bypasses the need for system identification by directly leveraging raw data to formulate optimal control policies. However, the size of the optimization problem in DeePC grows linearly with respect to the data size, which prohibits its application to resource-constrained systems due to high computational costs. In this paper, we propose an efficient approximation of DeePC, whose size is invariant with respect to the amount of data collected, via differentiable convex programming. Specifically, the optimization problem in DeePC is decomposed into two parts: a control objective and a scoring function that evaluates the likelihood of a guessed I/O sequence, the latter of which is approximated with a size-invariant learned optimization problem. The proposed method is validated through numerical simulations on a quadruple tank system, illustrating that the learned controller can reduce the computational time of DeePC by a factor of 5 while maintaining its control performance.
|
|
11:00-11:20, Paper MoA10.4 | |
>Uncertainty Propagation in Stochastic Systems Via Mixture Models with Error Quantification (I) |
|
Figueiredo, Eduardo | TU Delft |
Patane, Andrea | University of Oxford |
Lahijanian, Morteza | University of Colorado Boulder |
Laurenti, Luca | TU Delft |
Keywords: Stochastic systems, Formal Verification/Synthesis, Nonlinear systems
Abstract: Uncertainty propagation in non-linear dynamical systems has become a key problem in various fields including control theory and machine learning. In this work, we focus on discrete-time non-linear stochastic dynamical systems. We present a novel approach to approximate the distribution of the system over a given finite time horizon with a mixture of distributions. The key novelty of our approach is that it not only provides tractable approximations for the distribution of a non-linear stochastic system but also comes with formal guarantees of correctness. In particular, we consider the Total Variation (TV) distance to quantify the distance between two distributions and derive an upper bound on the TV between the distribution of the original system and the approximating mixture distribution derived from our framework. We show that in various cases of interest, including in the case of Gaussian noise, the resulting bound can be efficiently computed in closed form. This allows us to quantify the correctness of the approximation and to optimize the parameters of the resulting mixture distribution to minimize such distance. The effectiveness of our approach is illustrated on several benchmarks from the control community.
|
|
11:20-11:40, Paper MoA10.5 | |
>Safety of Linear Systems under Severe Sensor Attacks (I) |
|
Tan, Xiao | California Institute of Technology |
Ong, Pio | California Institute of Technology |
Tabuada, Paulo | University of California at Los Angeles |
Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Cyber-Physical Security, Uncertain systems
Abstract: Cyber-physical systems can be subject to sensor attacks, e.g., sensor spoofing, leading to unsafe behaviors. This paper addresses this problem in the context of linear systems when an omniscient attacker can spoof several system sensors at will. In this adversarial environment, existing results have derived necessary and sufficient conditions under which the state estimation problem has a unique solution. In this work, we consider a severe attacking scenario when such conditions do not hold. To deal with potential state estimation uncertainty, we derive an exact characterization of the set of all possible state estimates. Using the framework of control barrier functions, we propose design principles for system safety in offline and online phases. For the offline phase, we derive conditions on safe sets for all possible sensor attacks that may be encountered during system deployment. For the online phase, with past system measurements collected, a quadratic program-based safety filter is proposed to enforce system safety. A 2D-vehicle example is used to illustrate the theoretical results.
|
|
11:40-12:00, Paper MoA10.6 | |
>SDP Synthesis of Maximum Coverage Trees for Probabilistic Planning under Control Constraints |
|
Aggarwal, Naman | Massachusetts Institute of Technology |
How, Jonathan P. | MIT |
Keywords: Autonomous systems, Stochastic optimal control, Control applications
Abstract: The paper presents Maximal Covariance Backward Reachable Trees (operatorname{MAXCOVAR} BRT), which is a multi-query algorithm for planning of dynamic systems under stochastic motion uncertainty and constraints on the control input with explicit coverage guarantees. In contrast to existing roadmap-based probabilistic planning methods that sample belief nodes randomly and draw edges between them cite{csbrm_tro2024}, under control constraints, the reachability of belief nodes needs to be explicitly established and is determined by checking the feasibility of a non-convex program. Moreover, there is no explicit consideration of coverage of the roadmap while adding nodes and edges during the construction procedure for the existing methods. Our contribution is a novel optimization formulation to add nodes and construct the corresponding edge controllers such that the generated roadmap results in provably maximal coverage under control constraints as compared to any other method of adding nodes and edges. We characterize formally the notion of coverage of a roadmap in this stochastic domain via introduction of the h-operatorname{BRS} (Backward Reachable Set of Distributions) of a tree of distributions under control constraints, and also support our method with extensive simulations on a 6 DoF model.
|
|
MoA11 |
Brown 2 |
Data Driven Control I |
Regular Session |
Chair: Rapisarda, Paolo | Univ. of Southampton |
Co-Chair: Jokic, Andrej | University of Zagreb |
|
10:00-10:20, Paper MoA11.1 | |
>Data-Driven Output-Feedback Controller Synthesis for Dissipativity: A Dualization-Based Approach |
|
Kristović, Pietro | The University of Zagreb, Faculty of Mechanical Engineering And |
Jokic, Andrej | University of Zagreb |
Keywords: Data driven control, Linear systems, LMIs
Abstract: In this paper we propose a non-conservative dynamic output-feedback controller synthesis method for a class of discrete-time linear time-invariant systems. The synthesis goal is to render the closed-loop system dissipative with respect to a given generic unstructured quadratic supply rate. The model of the plant is assumed to be unknown, but instead we require knowledge of trajectories in the control channel, i.e., of the controlled input and measured output available for control. It is assumed that the trajectories are corrupted by bounded noise. The controller synthesis method is based on a convexification procedure which relies on the dualization lemma.
|
|
10:20-10:40, Paper MoA11.2 | |
>A Sampling Linear Functional Framework for Data-Driven Analysis and Control of Continuous-Time Systems |
|
Ohta, Yoshito | Kyoto University |
Rapisarda, Paolo | Univ. of Southampton |
Keywords: Data driven control, Linear systems, Sampled-data control
Abstract: We set up a continuous-time data-driven control framework based on sampling linear functionals. Under some recently established sufficient conditions for informativity for system identification, we give data-based solutions to stabilization and optimal quadratic regulation problems.
|
|
10:40-11:00, Paper MoA11.3 | |
>Data-Driven Analysis and Control of 2D Fornasini-Marchesini Models |
|
Chu, Bing | University of Southampton |
Rapisarda, Paolo | Univ. of Southampton |
Rocha, Paula | University of Oporto |
Keywords: Data driven control, Linear systems, Stability of linear systems
Abstract: We study data-driven analysis and control of 2D Fornasini-Marchesini second models assuming that the input and state variable are directly measurable. We give necessary and sufficient conditions for the data to be informative for identification and a 2D version of the “fundamental lemma”. We show how to obtain a system representation from sufficiently informative data, and we propose a data-driven approach for stability verification and state-feedback stabilization via linear matrix inequalities.
|
|
11:00-11:20, Paper MoA11.4 | |
>Data-Driven Model Predictive Control for Continuous-Time Systems |
|
Wolski, Aleksander Joseph | University of Southampton |
Chu, Bing | University of Southampton |
Rapisarda, Paolo | Univ. of Southampton |
Keywords: Data driven control, Predictive control for linear systems, Computational methods
Abstract: We present some preliminary ideas on a data-driven Model Predictive Control framework for continuous-time systems. We use Chebyshev polynomial orthogonal bases to represent system trajectories and subsequently develop a data-driven continuous-time version of the classical Model Predictive Control algorithm. We investigate the effects of the parameters in our framework with two numerical examples and draw comparison to model-driven MPC schemes.
|
|
11:20-11:40, Paper MoA11.5 | |
>Robust Data-Driven Predictive Control for Linear Time-Varying Systems |
|
Hu, Kaijian | The University of Hong Kong, and HKU Shenzhen Institute of Resea |
Liu, Tao | The University of Hong Kong |
Keywords: Data driven control, Predictive control for linear systems, Linear parameter-varying systems
Abstract: This paper presents a new robust data-driven predictive control scheme for linear time-varying (LTV) systems with unknown nominal system models. To tackle the challenges arising from the unknown nominal model and the time-varying nature of the system, a data-dependent optimization problem is formulated using input-state-output data. It calculates an upper bound on the objective function and, at the same time, designs a state feedback controller to minimize the bound. Moreover, two significant concerns, namely the feasibility of the optimization problem and the stability of the closed-loop system under the designed controller, are thoroughly investigated. Compared with the existing data-enabled predictive control method for LTV systems, the proposed control scheme does not require the collected data to satisfy the persistently exciting (PE) condition and uniformly exponentially stabilizes the system. Finally, a numerical example is provided to demonstrate the effectiveness of the proposed method.
|
|
11:40-12:00, Paper MoA11.6 | |
>Distributionally Robust Stochastic Data-Driven Predictive Control with Optimized Feedback Gain |
|
Li, Ruiqi | University of Waterloo |
Simpson-Porco, John W. | University of Toronto |
Smith, Stephen L. | University of Waterloo |
Keywords: Data driven control, Predictive control for linear systems, Stochastic optimal control
Abstract: We consider the problem of direct data-driven predictive control for unknown stochastic linear time-invariant (LTI) systems with partial state observation. Building upon our previous research on data-driven stochastic control, this paper (i) relaxes the assumption of Gaussian process and measurement noise, and (ii) enables optimization of the gain matrix within the affine feedback policy. Output safety constraints are modelled using conditional value-at-risk, and enforced in a distributionally robust sense. Under idealized assumptions, we prove that our proposed data-driven control method yields control inputs identical to those produced by an equivalent model-based stochastic predictive controller. A simulation study illustrates the enhanced performance of our approach over previous designs.
|
|
MoA12 |
Brown 3 |
Reinforcement Learning Control |
Regular Session |
Chair: Uribe, Cesar A. | Rice University |
Co-Chair: Cyranka, Jacek | University of Warsaw |
|
10:00-10:20, Paper MoA12.1 | |
>Federated TD Learning in Heterogeneous Environments with Average Rewards: A Two-Timescale Approach with Polyak-Ruppert Averaging |
|
Naskar, Ankur | Indian Institute of Science |
Thoppe, Gugan Chandrashekhar Mallika | Indian Institute of Science |
Koochakzadeh, Abbasali | University of Minnesota |
Gupta, Vijay | Purdue University |
Keywords: Reinforcement learning, Agents-based systems, Numerical algorithms
Abstract: Federated Reinforcement Learning (FRL) provides a promising way to speedup training in reinforcement learning using multiple edge devices that can operate in parallel. Recently, it has been shown that even when these edge devices have access to different dynamic models, an optimal convergence rate that has a linear speedup proportional to the number of devices is achievable. However, this result requires that the stepsize in the algorithm be chosen in a manner dependent on the unknown model parameters. Also, it applies only to a discounted setting, which has been argued to fit episodic tasks better than continuing control tasks. In this paper, we obtain finite-time bounds for heterogeneous FRL with average rewards. We show that the optimal convergence rate with a linear speedup is possible even with a universal stepsize choice, independent of the underlying dynamics. To achieve our result, we modify the existing one-timescale FRL method to a novel two-timescale variant that additionally incorporates iterate averaging.
|
|
10:20-10:40, Paper MoA12.2 | |
>Collaborative Adaptation for Recovery from Unforeseen Malfunctions in Discrete and Continuous MARL Domains |
|
Findik, Yasin | University of Massachusetts Lowell |
Hasenfus, Hunter | University of Massachusetts Lowell |
Azadeh, Reza | University of Massachusetts Lowell |
Keywords: Reinforcement learning, Cooperative control, Autonomous robots
Abstract: Cooperative multi-agent learning plays a crucial role for developing effective strategies to achieve individual or shared objectives in multi-agent teams. In real-world settings, agents may face unexpected failures, such as a robot's leg malfunctioning or a teammate's battery running out. These malfunctions decrease the team's ability to accomplish assigned task(s), especially if they occur after the learning algorithms have already converged onto a collaborative strategy. Current leading approaches in Multi-Agent Reinforcement Learning (MARL) often recover slowly -- if at all -- from such malfunctions. To overcome this limitation, we present the Collaborative Adaptation (CA) framework, highlighting its unique capability to operate in both continuous and discrete domains. Our framework enhances the adaptability of agents to unexpected failures by integrating inter-agent relationships into their learning processes, thereby accelerating the recovery from malfunctions. We evaluated our framework's performance through experiments in both discrete and continuous environments. Empirical results reveal that in scenarios involving unforeseen malfunction, although state-of-the-art algorithms often converge on sub-optimal solutions, the proposed CA framework mitigates and recovers more effectively.
|
|
10:40-11:00, Paper MoA12.3 | |
>Reinforcement Learning-Based Receding Horizon Control Using Adaptive Control Barrier Functions for Safety-Critical Systems |
|
Sabouni, Ehsan | Boston University |
Ahmad, H M Sabbir | Boston University |
Giammarino, Vittorio | Boston University |
Cassandras, Christos G. | Boston University |
Paschalidis, Ioannis Ch. | Boston University |
Li, Wenchao | Boston University |
Keywords: Reinforcement learning, Cooperative control, Transportation networks
Abstract: Optimal control methods provide solutions to safety-critical problems but easily become intractable. Control Barrier Functions (CBFs) have emerged as a popular technique that facilitates their solution by provably guaranteeing safety, through their forward invariance property, at the expense of some performance loss. This approach involves defining a performance objective alongside CBF-based safety constraints that must always be enforced with both performance and solution feasibility significantly impacted by two key factors: (i) the selection of the cost function and associated parameters, and (ii) the calibration of parameters within the CBF-based constraints, which capture the trade-off between performance and conservativeness. %as well as infeasibility. To address these challenges, we propose a Reinforcement Learning (RL)-based Receding Horizon Control (RHC) approach leveraging Model Predictive Control (MPC) with CBFs (MPC-CBF). In particular, we parameterize our controller and use bilevel optimization, where RL is used to learn the optimal parameters while MPC computes the optimal control input. We validate our method by applying it to the challenging automated merging control problem for Connected and Automated Vehicles (CAVs) at conflicting roadways. Results demonstrate improved performance and a significant reduction in the number of infeasible cases compared to traditional heuristic approaches used for tuning CBF-based controllers, showcasing the effectiveness of the proposed method. In order to guarantee reproducibility, our code is provided here: https://github.com/EhsanSabouni/CDC2024_RL_adpative_MPC_CBF/tree/main
|
|
11:00-11:20, Paper MoA12.4 | |
>State Planning Policies Online Reinforcement Learning |
|
Cyranka, Jacek | University of Warsaw |
Bilinski, Piotr | University of Warsaw |
Keywords: Reinforcement learning, Data driven control, Neural networks
Abstract: We introduce State Planning Policy Reinforcement Learning (SPP-RL), an online RL approach, where the actor plans for the next state given the current state. To communicate the actor output to the environment, we incorporate an inverse dynamics control model and train it using supervised learning. SPP-RL introduces a novel way of ensuring reachability of planned target states by employing constrained optimization and the Lagrange multiplier method. We demonstrate the versatility of the SPP-RL by implementing variants of 3 standard RL algorithms: DDPG, TD3, and SAC. We conduct a thorough evaluation across 7 benchmarks, including Safety-Gym Level 0, AntPush and MuCoJo. Our results consistently showcase that SPP algorithms outperform their vanilla counterparts in terms of the average return in a systematic and significant manner. Moreover, we present SPP-RL convergence proof within a finite setting. Furthermore, we share our source code and trained models.
|
|
11:20-11:40, Paper MoA12.5 | |
>A Moreau Envelope Approach for LQR Meta-Policy Estimation (I) |
|
Aravind, Ashwin | Indian Institute of Technology Bombay |
Toghani, Mohammad Taha | Rice University |
Uribe, Cesar A. | Rice University |
Keywords: Reinforcement learning, Linear systems, Uncertain systems
Abstract: We study the problem of policy estimation for the Linear Quadratic Regulator (LQR) in discrete-time linear time-invariant uncertain dynamical systems. We propose a Moreau Envelope-based surrogate LQR cost, built from a finite set of realizations of the uncertain system, to define a meta-policy efficiently adjustable to new realizations. Moreover, we design an algorithm to find an approximate first-order stationary point of the meta-LQR cost function. Numerical results show that the proposed approach outperforms naive averaging of controllers on new realizations of the linear system. We also provide empirical evidence that our method has better sample complexity than Model-Agnostic Meta-Learning (MAML) approaches.
|
|
11:40-12:00, Paper MoA12.6 | |
>Development and Deployment of Reinforcement Learning Based Control for Large Scale Decision-Making in Manufacturing: A Case Study |
|
Maske, Harshal | Ford Motor Company |
Upadhyay, Devesh | Saab |
Long, Shuyu | University of Michigan |
Keywords: Reinforcement learning, Manufacturing systems and automation
Abstract: We investigate Deep Reinforcement Learning (DRL) for a large scale manufacturing problem of pallet loop control in an automotive assembly plant with multiple body types assembled in the same facility. The pallet loop comprises of three body construction lines that merges into a complex shared conveyor system and delivers units to the paint shop with empty pallets returning back to the start of construction lines. The decision problem is to dynamically control pallet selection at the various merge points to meet, often highly variable, daily delivery targets while being robust to unanticipated production downtime, part shortages and failures. We discuss the challenges associated with this problem and present solutions developed to successfully train and deploy a DRL control policy. Specifically, we introduce concepts of transient and steady-state reward functions required to address the problem of delayed rewards owing to spatially distributed and temporally shifted control locations. Simulation and real world deployment results show that the trained DRL policy outperforms human designed decision rules in that i) it discovered a control policy with several benefits over base human designed decisions, ii) it demonstrated robustness against uncertain downtime in body construction lines, and iii) it was resilient to variations in delivery targets. We present results from plant performance before and after the implementation of the proposed DRL control policy.
|
|
MoA13 |
Suite 1 |
Estimation and Control of Distributed Parameter Systems I |
Invited Session |
Chair: Hu, Weiwei | University of Georgia |
Co-Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
10:00-10:20, Paper MoA13.1 | |
>Output Synchronization of Networked Second Order Infinite Dimensional Systems Using Adaptive Functional Estimation (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems, Adaptive systems
Abstract: This paper proposes a synchronization scheme for identical structurally perturbed second order infinite dimensional systems. The structured perturbations modelled by nonlinear functions of the position and velocity outputs are estimated adaptively using a Reproducing Kernel Hilbert Space and their adaptive estimates are used in the synchronization controllers. The kernel space-based adaptive scheme enables the functional estimation of nonlinear perturbation terms without imposing any a priori assumptions on the parametrization of nonlinear perturbation terms. Adaptation of the consensus protocols used for synchronization enable the synchronization of the networked infinite dimensional systems and a model reference control component of the controller ensure each networked system follows an idealized leader.
|
|
10:20-10:40, Paper MoA13.2 | |
>Periodic Event-Triggered Boundary Control of Neuron Growth with Actuation at Soma (I) |
|
Demir, Cenk | University of California, San Diego |
Diagne, Mamadou | University of California San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Distributed parameter systems, Cellular dynamics, Backstepping
Abstract: Exploring novel strategies for regulating axon growth, we introduce a periodic event-triggered control (PETC) to enhance the practical implementation of the associated PDE backstepping control law. Neurological injuries may impair neuronal function, but therapies like Chondroitinase ABC (ChABC) have shown promise in improving axon elongation by influencing the extracellular matrix. This matrix, composed of macromolecules and minerals, regulates tubulin concentration, potentially aiding neuronal recovery. The concentration and spatial distribution of tubulin influence axon elongation dynamics. Recent research explores feedback control strategies for this model, leading to the development of an event-triggering control (CETC). In this approach, the control law updates when the triggering condition is met, reducing actuation resource consumption. Through redesigning the triggering mechanism, we introduce PETC, updating control inputs at intervals but evaluating the event trigger periodically, making it ideal for time-sliced actuators like ChABC. PETC is a step forward in designing feasible feedback laws for neuron growth. This strategy sets an upper bound on event triggers between periodic checks, ensuring convergence and preventing Zeno behavior. Through Lyapunov analysis, we demonstrate the local exponential convergence of the system with PETC in the L^2-norm. Numerical examples confirm the theoretical findings.
|
|
10:40-11:00, Paper MoA13.3 | |
>Stabilization of Integral Delay Equations by Solving Fredholm Equations |
|
Auriol, Jean | CNRS |
Keywords: Delay systems, Linear systems, Stability of linear systems
Abstract: In this paper, we design a stabilizing state-feedback control law for a system represented by a general class of integral delay equations subject to a pointwise and distributed input delay. The proposed controller is defined in terms of integrals of the state and input history over a fixed-length time window. We show that the closed-loop stability is guaranteed, provided the controller integral kernels are solutions to a set of Fredholm equations. The existence of solutions is guaranteed under an appropriate spectral controllability assumption, resulting in an implementable stabilizing control law. The proposed methodology appears simpler and more generic compared to existing results in the literature. In particular, under additional regularity assumptions, the proposed approach can be expanded to address the degenerate case where only a distributed control term is present.
|
|
11:00-11:20, Paper MoA13.4 | |
>Towards a MATLAB Toolbox to Compute Backstepping Kernels Using the Power Series Method (I) |
|
Lin, Xin | Harbin Institute of Technology |
Vazquez, Rafael | Universidad De Sevilla |
Krstic, Miroslav | University of California, San Diego |
Keywords: Backstepping, Distributed parameter systems, Numerical algorithms
Abstract: In this paper, we extend our previous work on the power series method for computing backstepping kernels. Our first contribution is the development of initial steps towards a MATLAB toolbox dedicated to backstepping kernel computation. This toolbox would exploit MATLAB’s linear algebra and sparse matrix manipulation features for enhanced efficiency; our initial findings show considerable improvements in computational speed with respect to the use of symbolical software without loss of precision at high orders. Additionally, we tackle limitations observed in our earlier work, such as slow convergence (due to oscillatory behaviors) and non-converging series (due to loss of analiticity at some singular points). To over- come these challenges, we introduce a technique that mitigates this behaviour by computing the expansion at different points, denoted as localized power series. This approach effectively navigates around singularities, and can also accelerates con- vergence by using more local approximations. Basic examples are provided to demonstrate these enhancements. Although this research is still ongoing, the significant potential and simplicity of the method already establish the power series approach as a viable and versatile solution for solving backstepping kernel equations, benefiting both novel and experienced practitioners in the field. We anticipate that these developments will be particularly beneficial in training the recently introduced neural operators that approximate backstepping kernels and gains.
|
|
11:20-11:40, Paper MoA13.5 | |
>A Discrete-Time Formulation of Nonlinear Distributed-Parameter Port-Hamiltonian Systems |
|
Macchelli, Alessandro | University of Bologna - Italy |
Keywords: Distributed parameter systems, Stability of nonlinear systems, Sampled-data control
Abstract: This paper introduces a new framework of nonlinear, discrete-time, boundary control systems (BCSs) in the port-Hamiltonian form. The contribution is twofold. We start with a discrete-time approximation of a nonlinear port-Hamiltonian BCS, i.e. a dynamical system modelled by a nonlinear partial differential equation with boundary actuation and sensing. The most important feature is that the discretisation is performed in time only so that the "distributed nature" of the state is preserved. By approximating the gradient of the Hamiltonian density with its discrete gradient, the obtained sampled dynamics inherit the passivity of the original one. Besides, we prove that, under mild conditions, it is well-posed, i.e. the "next" state always exists. The second contribution deals with control design. More precisely, we have determined sufficient conditions for the plant dynamics and a static output feedback gain to make the closed-loop system asymptotically stable.
|
|
11:40-12:00, Paper MoA13.6 | |
>Guaranteed Cost Boundary Control of the Semilinear Heat Equation |
|
Selivanov, Anton | The University of Sheffield |
Wang, Pengfei | Tel Aviv University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Distributed parameter systems, Lyapunov methods
Abstract: We consider a 1D semilinear reaction-diffusion system with controlled heat flux at one of the boundaries. We design a finite-dimensional state-feedback controller guaranteeing that a given quadratic cost does not exceed a prescribed value for all nonlinearities with a predefined Lipschitz constant. To this end, we perform modal decomposition and truncate the highly damped (residue) modes. To deal with the nonlinearity that couples the residue and dominating modes, we combine the direct Lyapunov approach with the S-procedure and Parseval's identity.} The truncation may lead to spillover: the ignored modes can deteriorate the overall system performance. {Our main contribution is spillover avoidance via the L2 separation of the residue. Namely, we calculate the L2 input-to-state gains for the residue modes and add them to the control weight in the quadratic cost used to design a controller for the dominating modes. A numerical example demonstrates that the proposed idea drastically improves both the admissible Lipschitz constants and guaranteed cost bound compared to the recently introduced direct Lyapunov method.
|
|
MoA14 |
Suite 2 |
Cyber-Physical Systems Security |
Regular Session |
Chair: Simonetto, Andrea | ENSTA Paris |
Co-Chair: Pajic, Miroslav | Duke University |
|
10:00-10:20, Paper MoA14.1 | |
>Attack Stealthiness and Detection of Multiagent Systems: A Zero-Sum Formulation |
|
Zhu, Shiyong | Southeast University |
Wang, Miaomiao | Chinese Academy of Sciences |
Chen, Jie | City University of Hong Kong |
Rantzer, Anders | Lund University |
Keywords: Cyber-Physical Security, Agents-based systems
Abstract: Cybersecurity has in recent years emerged as a paramount concern in the design and operation of industrial systems and civil infrastructures, due mainly to their susceptibility to malicious cyber attacks which take advantage of the vulnerability of communication networks and IT devices. In this paper, we investigate such an attack and counter attack scenario by considering multiagent systems, a somewhat basic prototype of cyberphysical systems. We study false data injection attacks launched on the agent sensors, and possible defense of such attacks at the agent actuators. The primary issue under consideration is the stealthiness of the attacks, while steering a multiagent system away from its consensual state. We propose a metric to quantify the stealthiness, and formulate the stealthiness problem as one of zero-sum games. We solve the problem explicitly, which gives rise to a fundamental bound on the stealthiness achievable, and as well optimal attack and defense strategies that achieve the optimal stealthiness, both of which can be obtained in terms of certain augmented controllability Gramians associated with the agents. The stealthiness bound is seen to depend on agent dynamics and network characteristics including a measure of connectivity.
|
|
10:20-10:40, Paper MoA14.2 | |
>Bayesian Methods for Trust in Collaborative Multi-Agent Autonomy |
|
Hallyburton, R. Spencer | Duke University |
Pajic, Miroslav | Duke University |
Keywords: Cyber-Physical Security, Attack Detection, Agents-based systems
Abstract: Multi-agent, collaborative sensor fusion is a vital component of a multi-national intelligence toolkit. In safety-critical and/or contested environments, adversaries may infiltrate and compromise a number of agents. We analyze state of the art multi-target tracking algorithms under this compromised agent threat model. We show that the track existence probability test ("track score") is significantly vulnerable to even small numbers of adversaries. To add security awareness, we design a trust estimation framework using hierarchical Bayesian updating. Our framework builds beliefs of trust on tracks and agents by mapping sensor measurements to trust pseudomeasurements (PSMs) and incorporating prior trust beliefs in a Bayesian context. In case studies, our trust estimation algorithm accurately estimates the trustworthiness of tracks/agents, subject to observability limitations.
|
|
10:40-11:00, Paper MoA14.3 | |
>Planning with Probabilistic Opacity and Transparency: A Computational Model of Opaque/Transparent Observations |
|
Udupa, Sumukha | University of Florida |
Fu, Jie | University of Florida |
Keywords: Cyber-Physical Security, Automata, Discrete event systems
Abstract: The qualitative opacity of a secret is a security property, which means that a system trajectory satisfying the secret is observation-equivalent to a trajectory violating the secret. In this paper, we study how to synthesize a control policy that maximizes the probability of a secret being made opaque against an eavesdropping attacker/observer, while subject to other task performance constraints. In contrast to the existing belief-based approach for opacity-enforcement, we develop an approach that uses the observation function, the secret, and the model of the dynamical systems to construct a so-called opaque-observations automaton that accepts the exact set of observations that enforce opacity. Leveraging this opaque-observations automaton, we can reduce the optimal planning in Markov decision processes(MDPs) for maximizing probabilistic opacity or its dual notion, transparency, subject to task constraints into a constrained planning problem over an augmented-state MDP. Finally, we illustrate the effectiveness of the developed methods in robot motion planning problems with opacity or transparency requirements.
|
|
11:00-11:20, Paper MoA14.4 | |
>Cyberattack Detection by Using a Discrete-Time Model-Based Unknown Input Observer |
|
Nguyen, Quang Huy | University of Lorraine |
Sadki, Osama | Université De Lille |
Rafaralahy, Hugues | Université De Lorraine |
Haddad, Madjid | SEGULA Technologie |
Zemouche, Ali | CRAN UMR CNRS 7039 & Université De Lorraine |
Keywords: Autonomous vehicles, LMIs, Observers for Linear systems
Abstract: This letter introduces a new solution for cyberattack detection within the context of cooperative adaptive cruise Control (CACC) system. Our approach involves a novel method that consists of shifting the original output to establish a new system that adheres to the observer matching condition (OMC). Additionally, we represent the system in descriptor form and design a delayed unknown input observer (UIO) to achieve an Input-to-State Stable (ISS) bound on both the cyberattack and the state of the CACC system. Leveraging Lyapunov stability theory, we propose sufficient conditions expressed in terms of linear matrix inequality. To demonstrate the effectiveness of our algorithm, we proposed two simulation scenarios utilizing both Matlab software and the Carla simulator.
|
|
11:20-11:40, Paper MoA14.5 | |
>An Optimization Approach to Current State Opacity Assessment |
|
Basile, Francesco | Universita' Degli Studi Di Salerno |
De Tommasi, Gianmaria | Università Degli Studi Di Napoli Federico II |
Dubbioso, Sara | Università Di Napoli Federico II |
Fiorenza, Federico | Università Degli Studi Di Napoli Federico II - Department of Ele |
Keywords: Petri nets, Optimization, Cyber-Physical Security
Abstract: This paper provides a necessary and sufficient condition to assess Current State Opacity (CSO) in discrete event systems modeled as bounded Petri nets. The provided condition requires testing a finite number of sequences, without the need to build any observer, neither complete nor reduced. By exploiting the proposed condition, it is then possible to implement an algorithm that requires the solution of a finite number of optimization problems to check whether a system is CSO or not. Therefore, the proposed condition enables to perform CSO assessment by using off-the-shelf commercial software, not suffering from the curse of dimensionality, contrary to the observed-based approaches proposed in literature to deal with opacity. The effectiveness of the proposed approach is shown by means of an example.
|
|
11:40-12:00, Paper MoA14.6 | |
>Flexible Optimization for Cyber-Physical and Human Systems |
|
Simonetto, Andrea | ENSTA Paris |
Keywords: Optimization, Optimization algorithms, Statistical learning
Abstract: We study how to construct optimization problems whose outcome are sets of feasible, close-to-optimal decisions for human users to pick from, instead of a single, hardly explainable ``optimal'' directive. In particular, we explore two complementary ways to render convex optimization problems stemming from cyber-physical applications ``flexible''. In doing so, the optimization outcome is a trade off between engineering best and flexibility for the users to decide to do something slightly different. The first method is based on robust optimization and convex reformulations. The second one is stochastic and inspired from stochastic optimization with decision-dependent distributions.
|
|
MoA15 |
Suite 3 |
Emerging Control Applications |
Regular Session |
Chair: Wasa, Yasuaki | Waseda University |
Co-Chair: Cavraro, Guido | National Renewable Energy Laboratory |
|
10:00-10:20, Paper MoA15.1 | |
>Advocating Feedback Control for Human-Earth System Applications |
|
Cavraro, Guido | National Renewable Energy Laboratory |
Keywords: Emerging control applications, Control applications, Optimization
Abstract: This paper proposes a feedback control perspective for Human-Earth Systems (HESs) which essentially are complex systems that capture the interactions between humans and nature. Recent attention in HES research has been directed towards devising strategies for climate change mitigation and adaptation, aimed at achieving environmental and societal objectives. However, existing approaches heavily rely on HES models, which inherently suffer from inaccuracies due to the complexity of the system. Moreover, overly detailed models often prove impractical for optimization tasks. We propose a framework inheriting from feedback control strategies the robustness against model errors, because inaccuracies are mitigated using measurements retrieved from the field. The framework comprises two nested control loops. The outer loop computes the optimal inputs to the HES, which are then implemented by actuators controlled in the inner loop. Potential fields of applications are also identified.
|
|
10:20-10:40, Paper MoA15.2 | |
>Control-Based Exploration of Bifurcation Diagrams of Dynamical Systems by Partial Reference Fourierization |
|
Rezaee, Hamed | Imperial College London |
Renson, Ludovic | Imperial College London |
Keywords: Emerging control applications, Control applications
Abstract: Control-based continuation (CBC) is a systematic method for exploring the dynamics of nonlinear systems and tracing bifurcation diagrams directly during experimental tests. To find a possible natural response of an underlying system under test, CBC iterates on a control reference signal (usually approximated using Fourier series) to make the controller noninvasive. The many Fourier modes required to accurately represent possible responses of the system lead to high testing times. The main contribution of this paper is to propose a control strategy that can asymptotically track the possible natural responses of a system in a noninvasive manner whilst using approximate reference signals described by short Fourier series. Our approach relies on a time-varying control strategy where noninvasiveness is guaranteed by design, and the mechanism of the controller compensates for the errors in the reference signals due to partial Fourierization. Accordingly, asymptotic tracking of the full Fourier series (i.e., the possible natural responses of the system) based on short Fourier series as reference signals is guaranteed. The proposed control strategy is validated by rigorous analysis and a simulation example.
|
|
10:40-11:00, Paper MoA15.3 | |
>A Dynamic Consensus Control Scheme for Heterogeneous DC-DC Buck Converter Systems Utilizing `mixed' Negative Imaginary Technique |
|
Paul, Biswanath | Indian Institute of Technology Guwahati |
Bhowmick, Parijat | Indian Institute of Technology Guwahati |
Bhawal, Chayan | Indian Institute of Technology Guwahati |
Keywords: Emerging control applications, Cooperative control, Simulation
Abstract: This paper introduces a new dynamic consensus control strategy for heterogeneous (or nonidentical) DC/DC power converters (e.g. buck converters) utilizing Negative Imaginary (NI) systems theory. The study has been motivated by the application of DC/DC buck converters in supplying an equal amount of current to the energizing coils used in the ship degaussing process. The main challenge for the buck converters is to supply the desired load current to the coils when load demand varies and in the presence of exogenous disturbances. This intrigues the need for designing a cooperative control scheme for a heterogeneous buck converter system to supply the same load current that maybe time-varying. As the average model (i.e. the large-signal model) of a DC/DC buck converter inherently satisfies the strictly NI (SNI) property for any combination of resistance (R), inductance (L), and capacitance (C), a distributed NI control scheme fits well to this problem. This paper has theoretically proved that a homogeneous NI (with a `mixed' property) cooperative control scheme can drive heterogeneous buck converter agents, connected via undirected graphs, supplying the same load current. The simulation studies demonstrate that a simple type-I second-order NI control scheme achieves the desired objectives subject to load variation and exogenous disturbances.
|
|
11:00-11:20, Paper MoA15.4 | |
>Equilibrium Analysis of MPC-Based Climate Change Policy in Dynamic Games between Developed and Developing Regions |
|
Kato, Namiki | Waseda University |
Wasa, Yasuaki | Waseda University |
Akao, Ken-Ichi | Waseda University |
Keywords: Emerging control applications, Optimal control, Game theory
Abstract: This paper analyzes an open-loop Nash equilibrium of climate change policies and long-term mitigation pathways based on Model Predictive Control (MPC) in noncooperative dynamic games between developed and developing regions. After introducing a globally-used mathematical framework for a well-known integrated assessment model (IAM), the Regional Integrated Climate and Economy (RICE) model, with economic evaluations in multi-regional heterogeneity, this paper presents a dynamic noncooperative game between developed and developing regions based on possible international agreements. In particular, through numerical analysis, we show that an MPC-based iterative decision process, selected from the permissible control range to support a prescribed rising surface temperature since the Industrial Revolution, is effective.
|
|
11:20-11:40, Paper MoA15.5 | |
>Mitigating Transient Bullwhip Effects under Imperfect Demand Forecasts |
|
Li, Hui Qing | ETH Zürich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Emerging control applications, Robust control, LMIs
Abstract: Motivated by how forecast errors exacerbate order fluctuations in supply chains, we leverage robust feedback controller synthesis to characterize, compute, and minimize the worst-case order fluctuation experienced by an individual supply chain vendor. Assuming bounded forecast errors and demand fluctuations, we model forecast error and demand fluctuations as inputs to linear inventory dynamics and use the ell_infty gain to define a transient Bullwhip measure. In contrast to the existing Bullwhip measure, the transient Bullwhip measure explicitly depends on the forecast error. This enables us to separately quantify the transient Bullwhip measure's sensitivity to forecast error and demand fluctuations. To compute the controller that minimizes the worst-case peak gain, we formulate an optimization problem with bilinear matrix inequalities and show that it is equivalent to minimizing a quasi-convex function on a bounded domain. We simulate our model for vendors with non-zero perishable rates and order backlogging rates, and prove that the transient Bullwhip measure can be bounded by a monotonic quasi-convex function whose dependency on the product backlog rate and perishing rate is verified in simulation.
|
|
11:40-12:00, Paper MoA15.6 | |
>Integral IDA-PBC for Underactuated Mechanical Systems Subject to Matched and Unmatched Disturbances |
|
Franco, Enrico | Imperial College London |
Arpenti, Pierluigi | Università Degli Studi Di Napoli Federico II |
Donaire, Alejandro | The University of Newcastle |
Ruggiero, Fabio | Università Di Napoli |
Keywords: Control applications
Abstract: This work presents a new formulation of the integral interconnection and damping-assignment passivity-based control methodology for underactuated mechanical systems subject to both matched and unmatched disturbances, either constant or position-dependent. The new controller is also applicable to systems with non-constant input matrix. Simulations results on two examples demonstrate its effectiveness.
|
|
MoA16 |
Suite 4 |
Fault Detection |
Regular Session |
Chair: Martin, Jasmin | University of Queensland |
Co-Chair: Qin, S. Joe | Lingnan University |
|
10:00-10:20, Paper MoA16.1 | |
>Dynamic Adaptation Gain Design and Tuning for Threat Discrimination |
|
Zhang, Kangkang | Imperial College London |
Chen, Kaiwen | Imperial College London |
Polycarpou, Marios M. | University of Cyprus |
Parisini, Thomas | Imperial C., Aalborg U. & Univ. of Trieste |
Keywords: Fault detection, Attack Detection, Fault diagnosis
Abstract: Considering potential threats to cyber-physical systems such as component faults and stealthy cyber-attacks, an adaptive observer-based threat discrimination method is proposed for identifying the occurring threat type. Typically, stealthy attacks have only weak effects easily obscured by disturbances on the system outputs. To solve this problem, a parameter adaptation algorithm based on a newly designed dynamic adaptive gain generator is proposed, aiming at improving the sensitivity of the adaptive threat discrimination scheme to potential threats. Only the strictly positive real condition of the proposed gain generator sufficiently ensures the stability of the adaptive observer error system. A moment-matching method is then developed to determine the proper parameters of the gain generator, allowing for the improvement of the sensitivity of the threat discriminators. A numerical example to demonstrate the effectiveness of the proposed methodology is presented.
|
|
10:20-10:40, Paper MoA16.2 | |
>On Insufficiently Informative Measurements in Bayesian Quickest Change Detection and Identification |
|
Ford, Jason John | Queensland Univeristy of Technology |
Martin, Jasmin | University of Queensland |
Molloy, Timothy L. | Australian National University |
Keywords: Fault detection, Estimation, Filtering
Abstract: In this paper, we describe an undesirable weak practical super-martingale hallucination phenomenon that can emerge in the Bayesian quickest detection and identification problem. We establish that when measurements are insufficiently informative, a situation described by a relative entropy condition on measurement densities, the Bayesian quickest detection and identification solution can (undesirably) become increasingly confident that a change has occurred, even when it has not. Finally, we illustrate the phenomenon in simulation studies and the vision-based aircraft detection application which illustrates the optimal rule can be unsuitable in the sense of hallucinating a change that has not occurred.
|
|
10:40-11:00, Paper MoA16.3 | |
>Aero-Engines Anomaly Detection Using an Unsupervised Fisher Autoencoder |
|
Sanami, Saba | Concordia University |
Aghdam, Amir G. | Concordia University |
Keywords: Fault detection, Fault diagnosis, Learning
Abstract: Reliable aero-engine anomaly detection is crucial for ensuring aircraft safety and operational efficiency. This research explores the application of the Fisher autoencoder as an unsupervised deep learning method for detecting anomalies in aero-engine multivariate sensor data, using a Gaussian mixture as the prior distribution of the latent space. The proposed method aims to minimize the Fisher divergence between the true and the modeled data distribution in order to train an autoencoder that can capture the normal patterns of aero-engine behavior. The Fisher divergence is robust to model uncertainty, meaning it can handle noisy or incomplete data. The Fisher autoencoder also has well-defined latent space regions, which makes it more generalizable and regularized for various types of aero-engines as well as facilitates diagnostic purposes. The proposed approach improves the accuracy of anomaly detection and reduces false alarms. Simulations using the CMAPSS dataset demonstrate the model's efficacy in achieving timely anomaly detection, even in the case of an unbalanced dataset.
|
|
11:00-11:20, Paper MoA16.4 | |
>A Framework for Bayesian Quickest Change Detection in General Dependent Stochastic Processes |
|
Martin, Jasmin | University of Queensland |
Ford, Jason John | Queensland Univeristy of Technology |
Molloy, Timothy L. | Australian National University |
Keywords: Fault detection, Filtering
Abstract: In this paper we present a novel framework for quickly detecting a change in a general dependent stochastic process. We propose that any general dependent Bayesian quickest change detection (QCD) problem can be converted into a hidden Markov model (HMM) QCD problem, provided that a suitable state process can be constructed. The optimal rule for HMM QCD is then a simple threshold test on the posterior probability of a change. We investigate case studies that can be considered structured generalisations of Bayesian HMM QCD problems including: quickly detecting changes in statistically periodic processes and quickest detection of a moving target in a sensor network. Using our framework we pose and establish the optimal rules for these case studies. We also illustrate the performance of our optimal rule on real air traffic data to verify its simplicity and effectiveness in detecting changes.
|
|
11:20-11:40, Paper MoA16.5 | |
>Kernel Latent Vector Autoregressive Model for Nonlinear Dynamic Data Modeling and Monitoring |
|
Yu, Jiaxin | City University of Hong Kong |
Dong, Yining | City University of Hong Kong |
Qin, S. Joe | Lingnan University |
Keywords: Fault detection, Nonlinear systems
Abstract: To handle complex non-linearity and latent dynamics in industrial processes,this paper proposes a kernel latent vector auto-regressive (K-LaVAR) algorithm for nonlinear dynamic process modeling and monitoring. By combining kernel mapping and the latent dynamic model, the K-LaVAR algorithm enables nonlinear dimension reduction and circumvents the excessively large dimension issue induced by the kernel mapping. In addition, dual monitoring indices are developed to discern normal variations from dynamic and static aspects with respective statistical control limits. A numerical simulation and the revamped Tennessee Eastman Process (TEP) simulation benchmark are adopted to demonstrate the advantages of the K-LaVAR model in dynamic latent variables extraction, overall monitoring performance improvement, and ensuring prompt detection of process disturbances.
|
|
11:40-12:00, Paper MoA16.6 | |
>A Novel Approach to the Finite Frequency H−/H∞ Fault Detection Observer Design Problem |
|
Huang, Guilin | Imperial College London |
Zhu, Wenshan | Imperial College London |
Jaimoukha, Imad M. | Imperial College London |
Keywords: Fault detection, LMIs, Optimization algorithms
Abstract: In this paper, an observer design problem for a Linear Time-Invariant (LTI) system is studied. An iterative algorithm is proposed as part of the design process,effectively enabling the system to detect fault signals in a finite frequency range. Both the H−-index and H∞-norm are introduced and combined in an observer-based design to generate a residual signal that is sensitive to the fault signal and insensitive to the disturbances over a specified finite frequency range. In particular, our approach enforces an upper bound on the H∞- norm of the disturbances to residual transfer matrix and a lower bound on the H−-index of the faults to residual transfer matrix to ensure that the system achieves optimal performance in detecting all fault signals while limiting the impact of disturbances on the residual within this frequency range. This approach is achieved through the generalized Kalman-Yakubovich-Popov (gKYP) Lemma and uses the Projection Lemma in a novel way to reformulate the problem as a linear matrix inequality (LMI) optimization problem. To address the challenge of finding the best multiplier from the Projection Lemma, an iterative process is designed to obtain a local optimum. The initial solution of this iterative process can be selected from any existing algorithms, leading to an improved version of the observer since each iteration gives a solution which is at least as effective as the previous solution. A numerical example is provided in the last section to illustrate the effectiveness of our approach.
|
|
MoA17 |
Suite 6 |
Biological Systems I |
Regular Session |
Chair: Djema, Walid | Centre INRIA d'Université Côte D'Azur |
Co-Chair: Medvedev, Alexander V. | Uppsala University |
|
10:00-10:20, Paper MoA17.1 | |
>A Minimal Dynamical System and Analog Circuit for Non-Associative Learning |
|
Smart, Matthew | Flatiron Institute |
Shvartsman, Stanislav Y. | Princeton University |
Monnigmann, Martin | Ruhr-Universität Bochum |
Keywords: Biological systems, Learning, Systems biology
Abstract: Learning in living organisms is typically associated with networks of neurons. The use of large numbers of adjustable units has also been a crucial factor in the continued success of artificial neural networks. In light of the complexity of both living and artificial neural networks, it is surprising to see that very simple organisms – even unicellular organisms that do not possess a nervous system – are capable of certain forms of learning. Since in these cases learning may be implemented with much simpler structures than neural networks, it is natural to ask how simple the building blocks required for basic forms of learning may be. The purpose of this study is to discuss the simplest dynamical systems that model a fundamental form of non-associative learning, habituation, and to elucidate technical implementations of such systems, which may be used to implement non-associative learning in neuromorphic computing and related applications.
|
|
10:20-10:40, Paper MoA17.2 | |
>Analytical Characterization of Epileptic Dynamics in a Bistable System (I) |
|
Qin, Yuzhen | Radboud University |
ElGazzar, Ahmed | Donders Institute for Brain, Cognition, and Behaviour | Radboud |
Bassett, Danielle | University of Pennsylvania |
Pasqualetti, Fabio | University of California, Riverside |
van Gerven, Marcel | Radboud University |
Keywords: Biological systems, Nonlinear systems, Network analysis and control
Abstract: Epilepsy is one of the most common neurological disorders globally, affecting millions of individuals. Despite significant advancements, the precise mechanisms underlying this condition remain largely unknown, making accurately predicting and preventing epileptic seizures challenging. In this paper, we employ a bistable model, where a stable equilibrium and a stable limit cycle coexist, to describe epileptic dynamics. The equilibrium captures normal steady-state neural activity, while the stable limit cycle signifies seizure-like oscillations. The noise-driven switch from the equilibrium to the limit cycle characterizes the onset of seizures. The differences in the regions of attraction of these two stable states distinguish epileptic brain dynamics from healthy ones. We analytically construct the regions of attraction for both states. Further, using the notion of input-to-state stability with respect to small inputs, we analytically show how the regions of attraction influence the robustness of the system subject to external perturbations. Generalizing the bistable system into coupled networks, we also find the role of network parameters in shaping the regions of attraction. Our findings shed light on the intricate interplay between brain networks and epileptic activity, offering mechanistic insights into potential avenues for more predictable treatments.
|
|
10:40-11:00, Paper MoA17.3 | |
>Optimization of Microalgae Biosynthesis Via Controlled Algal-Bacterial Symbiosis |
|
Asswad, Rand | Inria - Université Grenoble Alpes |
Djema, Walid | Centre INRIA d'Université Côte D'Azur |
Bernard, Olivier | Inria |
Gouze, Jean-Luc | INRIA |
Cinquemani, Eugenio | INRIA Grenoble - Rhone-Alpes |
Keywords: Biological systems, Optimal control, Nonlinear systems
Abstract: We investigate optimization of an algal-bacterial consortium, where an exogenous control input modulates bacterial resource allocation between growth and synthesis of a resource that is limiting for algal growth. Maximization of algal biomass synthesis is pursued in a continuous bioreactor, with dilution rate as an additional control variable. We formulate optimal control in the two variants of static and dynamic control problems, and address them by theoretical and numerical tools. We explore convexity of the static problem and uniqueness of its solution, and show that the dynamic problem displays a solution with bang-bang control actions and singular arcs that result in cyclic control actions. We finally discuss the relation among the two solutions and show the extent to which dynamic control can outperform static optimal solutions.
|
|
11:00-11:20, Paper MoA17.4 | |
>Time-Optimal Circadian Rhythm Entrainment Is Not Robust |
|
Tao, Zidi | Rensselaer Polytechnic Institute |
Julius, Agung | Rensselaer Polytechnic Institute |
Wen, John | Rensselaer Polytechnic Inst |
Keywords: Biological systems, Robust control, Data driven control
Abstract: This paper explores the continuity characteristics of value functions associated with optimal control in circadian rhythm entrainment problems. Our results demonstrate that when the optimal objective is to minimize the time required for entrainment, the corresponding value function is not Lips- chitz continuous, suggesting that the optimal cost and optimal trajectory are not robust under perturbation. As an alternative, we propose a new objective function that is based on the cumulative squared tracking error and show that the resulting value function is Lipschitz continuous. Through numerical simulations, we further establish that data-driven feedback control systems exhibit higher robustness to input perturbation when the data are collected from optimal control solutions that minimize the cumulative squared tracking error, as opposed to those that are time-optimal.
|
|
11:20-11:40, Paper MoA17.5 | |
>Stability Properties of the Impulsive Goodwin's Oscillator in 1-Cycle |
|
Proskurnikov, Anton V. | Politecnico Di Torino |
Medvedev, Alexander V. | Uppsala University |
Keywords: Biological systems, Stability of nonlinear systems, Stability of hybrid systems
Abstract: The Impulsive Goodwin's Oscillator (IGO) is a mathematical model of a hybrid closed-loop system. It arises by closing a special kind of continuous linear positive time-invariant system with impulsive feedback, which employs both amplitude and frequency pulse modulation. The structure of IGO precludes the existence of equilibria, and all its solutions are oscillatory. With its origin in mathematical biology, the IGO also presents a control paradigm useful in a wide range of applications, in particular dosing of chemicals and medicines. Since the pulse modulation feedback mechanism introduces significant nonlinearity and non-smoothness in the closed-loop dynamics, conventional controller design methods fail to apply. However, the hybrid dynamics of IGO reduce to a nonlinear, time-invariant discrete-time system, exhibiting a one-to-one correspondence between periodic solutions of the original IGO and those of the discrete-time system. The paper proposes a design approach that leverages the linearization of the equivalent discrete-time dynamics in the vicinity of a fixed point. A simple and efficient local stability condition of the 1-cycle in terms of the characteristics of the amplitude and frequency modulation functions is obtained.
|
|
11:40-12:00, Paper MoA17.6 | |
>Feedforward Regulation of Interneuronal Communication (I) |
|
Gambrell, Oliver | University of Delaware |
Vahdat, Zahra | University of Delaware |
Singh, Abhyudai | University of Delaware |
Keywords: Biological systems, Stochastic systems, Biomedical
Abstract: We formulate a mechanistic model capturing the dynamics of neurotransmitter release in a chemical synapse. The proposed modeling framework captures key aspects such as the random arrival of action potentials (AP) in the presynaptic (input) neuron, probabilistic docking and release of neurotransmitter-filled vesicles, and clearance of the released neurotransmitter from the synaptic cleft. Analysis of the model results in analytical expressions for the statistical moments of docked vesicle count and the number of neurotransmitters in the cleft as a function of model parameters, and the frequency of presynaptic APs. We next consider a postsynaptic (output) neuron that fires an AP based on integrating upstream neurotransmitter activity. Our results show the existence of an optimal presynaptic AP frequency that maximizes the probability of a postsynaptic AP firing. We extend these results to consider feedforward regulation where in addition to a direct excitatory synapse, the input neuron also impacts the output indirectly via an inhibitory interneuron, and identify parameter regimes where the feedforward neuronal networks result in band-pass filtering, i.e., the output neuron frequency is maximized at intermediate input neuron frequencies.
|
|
MoA18 |
Suite 7 |
Linear Systems I |
Regular Session |
Chair: Ossareh, Hamid | University of Vermont |
Co-Chair: Dorea, Carlos E.T. | Universidade Federal Do Rio Grande Do Norte |
|
10:00-10:20, Paper MoA18.1 | |
>Constructing Multidimensional Difference Equations from a State-Space Representation Using the Generalized Cayley-Hamilton Theorem |
|
Vanpoucke, Lukas | KU Leuven |
De Moor, Bart L.R. | Katholieke Universiteit Leuven |
Keywords: Linear systems, Autonomous systems
Abstract: We show that applying a generalization of the Cayley-Hamilton Theorem to a state-space representation of a single-output, multidimensional (mD), linear, shift-invariant, causal, autonomous system, with distinct eigentuples, results in an equivalent difference equation representation. The proposed method is also applicable to parameterize a set of mD shift-invariant difference equations in terms of a given set of eigenvalues. Lastly, a closed-form expression in terms of the eigenvalues of the system matrices is derived.
|
|
10:20-10:40, Paper MoA18.2 | |
>Lifted Time Stable Inversion Based High Precision Feedforward Control for Non-Minimum Phase Systems |
|
Zhu, Shaoqin | The Chinese University of Hong Kong, Shenzhen, Shenzhen 518172, |
Ji, Xiaoqiang | The Chinese University of Hong Kong, Shenzhen, Institute of Arti |
Longman, Richard W. | Columbia Univ. MS 4703 |
Xu, Yangsheng | Chinese Univ. of Hong Kong |
Keywords: Linear systems, Computational methods
Abstract: Model-based feedforward is essential to realize the high-precision tracking in advanced manufacturing. The crucial step is to solve the inverse problem, which has unstable solution for non-minimum phase systems. The existing stable inversion method necessitates fixed initial condition and final condition of the tracking task. However, the initial condition and final condition are assigned differently according to diverse requirements in practice. In this paper, a new stable inversion based feedforward control method is proposed. By reformulating the tracking problem in lifted time form and truncating the infinite dimensional inverse operator, the proposed method addresses the inverse problem for stable solution and permits arbitrary initial condition and final condition with rigorous stability analysis and robustness analysis. The implementation of the developed method is presented. The high precision tracking performance of the proposed method is illustrated by simulation.
|
|
10:40-11:00, Paper MoA18.3 | |
>Complexity Bounds for the Maximal Admissible Set of LTI Systems with Disturbances |
|
Ossareh, Hamid | University of Vermont |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Linear systems, Constrained control, Predictive control for linear systems
Abstract: The maximal admissible set (MAS) of a dynamical system characterizes the set of all initial conditions and constant inputs for which the ensuing response satisfies the specified state/output constraints for all time. For a discrete-time, linear time-invariant (LTI) system subject to polytopic constraints and unknown bounded disturbances, the MAS is known to be a polytope, which may not be finitely determined (i.e., it may not be defined by a finite number of inequalities). Thus, the steady-state constraint is usually tightened, which results in a finitely-determined inner approximation of the MAS. However, the complexity of this approximation is not known apriori from problem data. This paper presents and compares two computationally efficient methods, based on matrix power series and on quadratic ISS-Lyapunov functions, respectively, to upper bound the complexity of the MAS. The bounds may facilitate the online computation of the MAS and the implementation of robust reference governors and model predictive controllers.
|
|
11:00-11:20, Paper MoA18.4 | |
>A Data-Driven Stability Test for LTI Systems |
|
Guo, Taosha | University of California, Riverside |
Al Makdah, Abed AlRahman | University of California Riverside |
Tesi, Pietro | Università Degli Studi Di Firenze |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Linear systems, Data driven control, Stability of linear systems
Abstract: In this work, we derive a data-based test to assess the stability of discrete-time, linear, time-invariant systems directly from finite datasets of noisy input-output trajectories, without the need to estimate the system matrices or the noise statistics. We characterize the performance of our test and we show that, despite inaccuracies in the data due to the system noise, the test provides accurate results when the available data is sufficiently large yet finite (the required amount of data depends on the properties of the system, which we also characterize). These results complement the body of literature on data-driven control and finite-sample analysis, and they provide new ways to assess the stability of control systems that do not assume, nor require the estimation of, a model of the system and noise and do not rely on solving eigenvalue equations.
|
|
11:20-11:40, Paper MoA18.5 | |
>Set-Invariance for Discrete-Time Linear Systems with Time-Varying Delay: A Polyhedral Approach |
|
Dorea, Carlos E.T. | Universidade Federal Do Rio Grande Do Norte |
Olaru, Sorin | CentraleSupélec |
Niculescu, Silviu-Iulian | University Paris-Saclay, CNRS, CentraleSupelec, Inria |
Castelan, Eugenio B. | Univ. Federal De Santa Catarina |
Keywords: Linear systems, Delay systems, Switched systems
Abstract: In this paper, we address the positive invariance property in linear discrete-time systems in the presence of time-varying delays in the states. We build the study on a recently proposed approach based on an appropriate model transformation which allows the derivation of delay-dependent invariance conditions. For polyhedral sets, we establish necessary and sufficient invariance conditions with respect to the transformed model and prove that such conditions imply the confinement of the state trajectories of the original system in the set for arbitrary realizations of the varying delay, as long as the initial states belong to a set which is positively invariant with respect to an augmented switching system without delay, and can be computed in a finite number of steps known in advance.
|
|
11:40-12:00, Paper MoA18.6 | |
>Positive Stabilization and Observer Design for Positive Singular Systems |
|
Shafai, Bahram | Northeastern Univ |
Zarei, Fatemeh | Northeastern University |
Keywords: Linear systems, Differential-algebraic systems, Observers for Linear systems
Abstract: This paper considers the class of positive singular systems and provides a new approach to achieve stabilization and observer design with positivity constraints. First, the connection between singular systems and its equivalent input derivative systems is established and their positivity with stability properties are analyzed. Then, an algebraic transformation is introduced, which allows to eliminate the derivative inputs and to obtain an equivalent standard system. A careful mathematical derivation was performed to obtain closed-form expressions for the coefficient matrices of the resulting transformed system. Consequently, the design of positive stabilization of positive singular systems was made possible using the equivalent standard state space representation. Finally, the design of positive observer for positive singular system is provided. It is shown that a similar procedure is required to eliminate the input derivatives that appear in the output equation allowing the observer to be designed. Both positive stabilization and positive observer designs are formulated and solved in terms of LMIs.
|
|
MoA19 |
Suite 8 |
Stochastic Optimal Control I |
Regular Session |
Chair: Pakniyat, Ali | University of Alabama |
Co-Chair: Tsiotras, Panagiotis | Georgia Institute of Technology |
|
10:00-10:20, Paper MoA19.1 | |
>Chance-Constrained Information-Theoretic Stochastic Model Predictive Control with Safety Shielding |
|
Yin, Ji | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Berntorp, Karl | Mitsubishi Electric Research Labs |
Keywords: Stochastic optimal control, Autonomous systems, Robotics
Abstract: We introduce a nonlinear stochastic model pre- dictive control path integral (MPPI) method that considers chance constraints on system states. The proposed belief-space stochastic MPPI (BSS-MPPI) applies Monte-Carlo sampling to evaluate state distributions resulting from underlying systematic disturbances, and utilizes a Control Barrier Function (CBF) inspired heuristic in belief space to fulfill the specified chance constraints. Compared to several previous stochastic predictive control methods, our approach applies to general nonlinear dy- namics without requiring the computationally expensive system linearization step. Moreover, the BSS-MPPI controller can solve optimization problems without limiting the form of the objective function and chance constraints and is parallelizable. Results on a realistic race-car simulation study show significant reductions in constraint violation compared to some of the prior MPPI approaches, while being comparable in computation times.
|
|
10:20-10:40, Paper MoA19.2 | |
>Adaptive Dual Covariance Steering with Active Parameter Estimation |
|
Knaup, Jacob | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Closed-loop identification, Adaptive control
Abstract: This work examines the optimal covariance steering problem for systems subject to unknown parameters that enter multiplicatively with the state and control, in addition to additive disturbances. In contrast to existing works, the unknown parameters are modeled as random variables and are estimated online. A dual control problem is formulated in which the effect of the planned control policy on the parameter estimates is modeled and optimized for. The parameter estimates are then used to modify the pre-computed control policy online in an adaptive control fashion. Finally, the proposed approach is demonstrated in a vehicle control example with closed-loop parameter identification.
|
|
10:40-11:00, Paper MoA19.3 | |
>Distributionally Constrained Convex Duality Optimal Control (DC-CDOC) Subject to Different Forms of Constraining the Terminal State of Nonlinear Stochastic Systems |
|
Pakniyat, Ali | University of Alabama |
Keywords: Stochastic optimal control, Constrained control, Optimal control
Abstract: This article explores different methods of constraining the terminal state of nonlinear stochastic systems and presents the results of the Distributionally Constrained Convex Duality Optimal Control (DC-CDOC) for each case. Specifically, we consider: (a) almost surely equality constraints on the terminal state, (b) constraints on the expected value of the terminal state, and (c) constraints on the probability distributions of the terminal state, both (i) under the total probability measure and (ii) under all conditional probability measures. For each case, the associated optimal control problem is formulated as a convex linear program on the space of Radon measures, and by exploiting the duality relations between the space of measures and that of continuous functions, we derive the optimality conditions in the form of an optimization problem over the space of differentiable functions constrained to a Hamilton-Jacobi (HJ) inequality and, in some of these cases, a terminal value inequality. An iterative algorithm is also proposed for identifying the value function in the studied cases.
|
|
11:00-11:20, Paper MoA19.4 | |
>Risk-Aware Finite-Horizon Social Optimal Control of Mean-Field Coupled Linear-Quadratic Subsystems |
|
Patel, Dhairya | University of Toronto |
Chapman, Margaret P | University of Toronto |
Keywords: Stochastic optimal control, Cooperative control, Linear systems
Abstract: We formulate and solve a decentralized optimal control problem with cooperative, coupled linear-quadratic subsystems and additional risk-aware costs depending on the covariance and skew of the disturbance. In contrast to related work, this problem quantifies the variability of the subsystem state energy rather than merely its expectation. To tackle this challenge, we develop an alternative linear-algebraic approach illuminating a family of matrices with compelling properties and interpretation. Notably, this approach facilitates an efficient analytical treatment of coupled subsystems, suggesting its potential applicability to future problems in cooperative control.
|
|
11:20-11:40, Paper MoA19.5 | |
>Computationally Efficient Chance Constrained Covariance Control with Output Feedback |
|
Pilipovsky, Joshua | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Optimization, Filtering
Abstract: This paper studies the problem of developing computationally efficient solutions for steering the distribution of the state of a stochastic, linear dynamical system between two boundary Gaussian distributions in the presence of chance-constraints on the state and control input. It is assumed that the state is only partially available through a measurement model corrupted with noise. The filtered state is reconstructed with a Kalman filter, the chance constraints are reformulated as difference of convex (DC) constraints, and the resulting covariance control problem is reformulated as a DC program, which is solved using successive convexification. The efficiency of the proposed method is illustrated on a double integrator example with varying time horizons, and is compared to other state-of-the-art chance-constrained covariance control methods.
|
|
11:40-12:00, Paper MoA19.6 | |
>Minimum Energy Density Steering of Linear Systems with Gromov-Wasserstein Terminal Cost |
|
Morimoto, Kohei | Kyoto University |
Kashima, Kenji | Kyoto University |
Keywords: Stochastic optimal control, Optimization algorithms, Linear systems
Abstract: In this study, we address optimal control problems focused on steering the probabilistic distribution of state variables in linear dynamical systems. Specifically, we address the problem of controlling the structural properties of Gaussian state distributions to predefined targets at terminal times. This task is not yet explored in existing works that primarily aim to exactly match state distributions. By employing the Gromov-Wasserstein (GW) distance as the terminal cost, we formulate a problem that seeks to align the structure of the state density with that of a desired distribution. This approach allows us to extend the control objectives to capture the distribution's shape. We demonstrate that this complex problem can be reduced to a Difference of Convex (DC) programming, which is efficiently solvable through the DC algorithm. Through numerical experiments, we confirm that the terminal distribution indeed gets closer to the desired structural properties of the target distribution.
|
|
MoA20 |
Suite 9 |
Algebraic/Geometric Methods |
Regular Session |
Chair: Miller, Jared | ETH Zurich |
Co-Chair: van Goor, Pieter | Australian National University |
|
10:00-10:20, Paper MoA20.1 | |
>On the Calculation of Equilibrium Points of a Nonlinear System Via Exact Quadratization |
|
Carravetta, Francesco | IASI-CNR |
Keywords: Algebraic/geometric methods, Nonlinear systems, Computational methods
Abstract: We show that, for quadratizable dynamic nonlinear systems, all the equilibrium points satisfy an augmented system of nonlinear equations obtained by applying exact quadratization to a suitably modified dynamic system. For autonomous polynomial dynamic systems (i.e. with no input) this method can be viewed as a general method of solving systems of polynomial equations, which has a lower computational complexity with respect to the classical one consisting of the Buchberger’s algorithm (that searches for a Groebner basis) followed by variable elimination. As a matter, we show that the augmented system of polynomials obtained by quadratization is always a Groebner basis for the ideal associated to the originary problem. This allows skipping the computationally heavy Buchberger’s algorithm, and applying directly elimination theory in the solution-searching algorithm.
|
|
10:20-10:40, Paper MoA20.2 | |
>Efficient Reachable Sets on Lie Groups Using Lie Algebra Monotonicity and Tangent Intervals |
|
Harapanahalli, Akash | Georgia Institute of Technology |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Algebraic/geometric methods, Nonlinear systems, Computational methods
Abstract: In this paper, we efficiently compute overapproximating reachable sets for control systems evolving on Lie groups, building off results from monotone systems theory and geometric integration theory. We consider intervals in the tangent space, which describe real sets on the Lie group through the exponential map. A local equivalence between the original system and a system evolving on the Lie algebra allows existing interval reachability techniques to apply in the tangent space. Using interval bounds of the Baker-Campbell-Hausdorff formula, these reachable set estimates are extended to arbitrary time horizons in an efficient Runge-Kutta-Munthe-Kaas integration algorithm. The algorithm is demonstrated through consensus on a torus and attitude control on SO(3).
|
|
10:40-11:00, Paper MoA20.3 | |
>Peak Time-Windowed Mean Estimation Using Convex Optimization |
|
Miller, Jared | ETH Zurich |
Schmid, Niklas | ETH Zürich |
Tacchi, Matteo | Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-Lab |
Henrion, Didier | LAAS-CNRS |
Smith, Roy S. | ETH Zurich |
Keywords: Algebraic/geometric methods, Nonlinear systems, LMIs
Abstract: This paper presents an algorithmic approach towards bounding the peak time-windowed average value attained by a state function along trajectories of a dynamical system. An example includes the maximum average current flowing across a power line in any 5-minute window. The peak time-windowed mean estimation task may be posed as a finite-dimensional but nonconvex optimization problem in terms of an initial condition and stopping time. This problem can be lifted into an infinite-dimensional linear program in occupation measures, where no conservatism is introduced under compactness and dynamical regularity assumptions. The peak time-windowed mean estimation linear program is in turn truncated into a convergent sequence of semidefinite programs using the moment-Sum-of-Squares hierarchy. Bounds of the time-windowed mean are computed for example systems.
|
|
11:00-11:20, Paper MoA20.4 | |
>Spatial Group Error and Synchrony for Tracking Control of Left-Invariant Kinematic Systems on Lie Groups |
|
Hampsey, Matthew | Australian National University |
van Goor, Pieter | Australian National University |
Mahony, Robert | Australian National University, |
Keywords: Algebraic/geometric methods, Nonlinear systems, Lyapunov methods
Abstract: Trajectory tracking for underactuated systems has been heavily studied for several decades. When the system state-space is a Lie group, the group multiplication can be used to define a global error. In this paper, we consider the spatial, or right-invariant, group error in the design of a tracking controller for left-invariant systems. This choice of error is shown to exhibit synchrony; that is, the error kinematics depend linearly on the control input difference. In particular, if the actual system is driven by the desired control signal, then the error is constant. This property is used to propose a simple nonlinear tracking control scheme that is globally stable and locally exponentially stable for a class of persistently exciting trajectories for left-invariant systems on Lie-groups. We explore the example system of a mobile robot and show that in this particular case, the proposed control scheme is almost-globally asymptotically stable.
|
|
11:20-11:40, Paper MoA20.5 | |
>Controlled Invariant Sets for Polynomial Systems Defined by Non-Polynomial Equations |
|
Iori, Tomoyuki | Osaka University |
Keywords: Algebraic/geometric methods, Constrained control, Computational methods
Abstract: This study proposes a method for designing a feedback controller that makes a prescribed set invariant for a given polynomial system. Although the system is restricted to polynomial systems, the class of invariant sets is not limited to algebraic sets; it is the zero sets of nonlinear functions satisfying a specific type of partial differential equations (PDE) with coefficients in polynomials. Based on the algebraic relations between a nonlinear function and its derivatives derived from the PDE, a constructive sufficient condition for the existence of the desired controllers is provided. Using symbolic computation, the controllers can be computed exactly. Numerical examples are provided to illustrate the effectiveness of the proposed method.
|
|
11:40-12:00, Paper MoA20.6 | |
>Maximum Likelihood Estimation of the Extended Kalman Filter's Parameters with Natural Gradient |
|
Parellier, Colin | Mines ParisTech |
Chapdelaine, Camille | SAFRAN SA |
Barrau, Axel | Offroad |
Bonnabel, Silvere | Armines |
Keywords: Algebraic/geometric methods, Observers for nonlinear systems, Kalman filtering
Abstract: The extended Kalman filter (EKF) relies on noise parameters, notably the covariance matrix of the observation noise. To identify them using real data, the standard approach consists in maximizing the likelihood of the EKF’s estimates. To perform the optimization, we propose in this paper to use Amari’s natural gradient descent, in a way that preserves positive semi-definiteness of the covariance parameter. We derive the corresponding equations, and we bring the method to bear on a real-world experiment, where we identify the covariance matrix of a GNSS for a vehicle localization problem.
|
|
MoB01 |
Auditorium |
Hybrid Feedback Control Design |
Tutorial Session |
Chair: Ferrante, Francesco | Universita Degli Studi Di Perugia |
Co-Chair: Casau, Pedro | University of Aveiro |
Organizer: Sanfelice, Ricardo G. | University of California at Santa Cruz |
Organizer: Casau, Pedro | University of Aveiro |
Organizer: Ferrante, Francesco | Universita Degli Studi Di Perugia |
|
13:30-13:50, Paper MoB01.1 | |
>A Tutorial on Hybrid Feedback Control (I) |
|
Casau, Pedro | University of Aveiro |
Ferrante, Francesco | Universita Degli Studi Di Perugia |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Hybrid systems, Stability of hybrid systems, Supervisory control
Abstract: This tutorial paper introduces hybrid feedback control through a self-contained examination of hybrid control systems modeled by the combination of differential and difference equations with constraints. Using multiple examples, it illustrates the power of hybrid feedback control, which stems from the integration of continuous and discrete dynamics, where state variables update instantaneously at specific events while flowing continuously otherwise. The paper defines hybrid closed-loop systems as interconnected hybrid plants and controllers with designated inputs and outputs, and formalizes their solutions. It summarizes key properties of hybrid systems and reviews various control strategies, including supervisory control with logic variables to select feedback controllers, event- triggered control to minimize control input updates, and strategies using multiple Lyapunov-like functions for stabilization. Pointers to further reading and other strategies in the literature are provided.
|
|
13:50-14:10, Paper MoB01.2 | |
Introduction and Analysis Tools (I) |
|
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Ferrante, Francesco | Universita Degli Studi Di Perugia |
|
|
14:10-14:30, Paper MoB01.3 | |
Uniting and Supervisory Control (I) |
|
Ferrante, Francesco | Universita Degli Studi Di Perugia |
|
|
14:30-14:50, Paper MoB01.4 | |
Event-Triggered Control (I) |
|
Casau, Pedro | University of Aveiro |
|
|
14:50-15:10, Paper MoB01.5 | |
Synergistic Control (I) |
|
Casau, Pedro | University of Aveiro |
|
|
15:10-15:30, Paper MoB01.6 | |
Other Strategies and Open Problems (I) |
|
Sanfelice, Ricardo G. | University of California at Santa Cruz |
|
|
MoB02 |
Amber 1 |
New Tools for Estimation, Modeling and Control of Quantum Systems |
Invited Session |
Chair: Grigoletto, Tommaso | University of Padova |
Co-Chair: Ticozzi, Francesco | Università Di Padova |
Organizer: Grigoletto, Tommaso | University of Padova |
Organizer: Ticozzi, Francesco | Università Di Padova |
|
13:30-13:50, Paper MoB02.1 | |
>Exact Model Reduction for Discrete-Time Conditional Quantum Dynamics |
|
Grigoletto, Tommaso | University of Padova |
Ticozzi, Francesco | Università Di Padova |
Keywords: Quantum information and control, Model/Controller reduction, Filtering
Abstract: Leveraging an algebraic approach built on minimal realizations and conditional expectations in quantum probability, we propose a method to reduce the dimension of quantum filters in discrete-time, while maintaining the correct distributions on the measurement outcomes and the expectations of some relevant observable. The method is presented for general quantum systems whose dynamics depend on measurement outcomes, hinges on a system-theoretic observability analysis, and is tested on prototypical examples.
|
|
13:50-14:10, Paper MoB02.2 | |
>Explicit Formulas for Adiabatic Elimination with Fast Unitary Dynamics (I) |
|
Riva, Angela | Inria Paris |
Sarlette, Alain | Laboratoire De Physique De l’Ecole Normale Supérieure, Inria, C |
Rouchon, Pierre | Mines Paris PSL |
Keywords: Quantum information and control, Model/Controller reduction
Abstract: The so-called "adiabatic elimination" of fast decaying degrees of freedom in open quantum systems can be performed with a series expansion in the timescale separation. The associated computations are significantly more difficult when the remaining degrees of freedom (center manifold) follow fast unitary dynamics instead of just being slow. This paper highlights how a formulation with Sylvester's equation and with adjoint dynamics leads to systematic, explicit expressions at high orders for settings of physical interest.
|
|
14:10-14:30, Paper MoB02.3 | |
>Provably Time-Optimal Cooling of Markovian Quantum Systems (I) |
|
Malvetti, Emanuel | Technical University Munich |
Keywords: Quantum information and control, Optimal control, Nonlinear systems
Abstract: We address the problem of cooling a Markovian quantum system to a pure state in the shortest amount of time possible. Here, the system drift takes the form of a Lindblad master equation and we assume fast unitary control. This setting allows for a natural reduction of the control system to the eigenvalues of the state density matrix. We give a simple necessary and sufficient characterization of systems which are (asymptotically) coolable and present a powerful maximum principle, based on majorization, which allows to considerably simplify the search for optimal cooling solutions. With these tools at our disposal, we derive explicit, provably time-optimal cooling protocols for rank one qubit systems, inverted lambda-systems on a qutrit, and a certain system consisting of two coupled qubits.
|
|
14:30-14:50, Paper MoB02.4 | |
>Ensemble Quantum Control with a Scalar Input (I) |
|
Liang, Ruikang | Sorbonne University |
Boscain, Ugo V. | CNRS |
Sigalotti, Mario | INRIA Paris |
Keywords: Quantum information and control
Abstract: In this article, we discuss how a three-level closed quantum system with dispersed parameters can be steered between eigenstates via a scalar control. The technique exploits a dynamical decoupling of the control based on the rotating wave approximation, which works under suitable conditions on the spectral gaps of the system and on the bounds on the parameter dispersion. We test numerically the sharpness of the conditions on several examples.
|
|
14:50-15:10, Paper MoB02.5 | |
>Reconstructing Quantum States from Local Observation: A Dynamical Viewpoint |
|
Peruzzo, Marco | University of Padua |
Grigoletto, Tommaso | University of Padova |
Ticozzi, Francesco | Università Di Padova |
Keywords: Quantum information and control
Abstract: We analyze the problem of reconstructing an unknown quantum state of a multipartite system from repeated measurements of local observables. In particular, via a system-theoretic observability analysis, we show that, even when the initial state is not uniquely determined for a static system, this can be reconstructed if we leverage the system’s dynamics. The choice of dynamical generators and the effect of finite samples is discussed, along with an illustrative example.
|
|
15:10-15:30, Paper MoB02.6 | |
>Measuring Quantum Information Leakage under Detection Threat |
|
Farokhi, Farhad | The University of Melbourne |
Kim, Sejeong | The University of Melbourne |
Keywords: Quantum information and control, Cyber-Physical Security, Control Systems Privacy
Abstract: Gentle quantum leakage is proposed as a measure of information leakage to arbitrary eavesdroppers that aim to avoid detection. Gentle (also sometimes referred to as weak or non-demolition) measurements are used to encode the desire of the eavesdropper to evade detection. The gentle quantum leakage meets important axioms proposed for measures of information leakage including positivity, independence, and unitary invariance. Global depolarizing noise, an important family of physical noise in quantum devices, is shown to reduce gentle quantum leakage (and hence can be used as a mechanism to ensure privacy or security). A lower bound for the gentle quantum leakage based on asymmetric approximate cloning is presented. This lower bound relates information leakage to mutual incompatibility of quantum states. A numerical example, based on the encoding in the celebrated BB84 quantum key distribution algorithm, is used to demonstrate the results.
|
|
MoB03 |
Amber 2 |
Multi-Agent Systems: Awareness, Learning, and Formal Methods II |
Invited Session |
Chair: Ghosh, Arabinda | Max Planck Institute for Software Systems |
Co-Chair: Vlahakis, Eleftherios | KTH Royal Institute of Technology |
Organizer: van Huijgevoort, Birgit | Eindhoven University of Technology |
Organizer: Ghosh, Arabinda | Max Planck Institute for Software Systems |
Organizer: Vlahakis, Eleftherios | KTH Royal Institute of Technology |
Organizer: Haesaert, Sofie | Eindhoven University of Technology |
Organizer: Soudjani, Sadegh | Max Planck Institute for Software Systems |
Organizer: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
|
13:30-13:50, Paper MoB03.1 | |
>Learning Optimal Stable Matches in Decentralized Markets with Unknown Preferences (I) |
|
Shah, Vade | University of California, Santa Barbara |
Marden, Jason R. | University of California, Santa Barbara |
Ferguson, Bryce L. | University of California, Santa Barbara |
Keywords: Game theory, Learning, Agents-based systems
Abstract: Matching algorithms have demonstrated great success in several practical applications, but they often require centralized coordination and plentiful information. In many modern online marketplaces, agents must independently seek out and match with another using little to no information. For these kinds of settings, can we design decentralized, limited-information matching algorithms that preserve the desirable properties of standard centralized techniques? In this work, we constructively answer this question in the affirmative. We model a two-sided matching market as a game consisting of two disjoint sets of agents, referred to as proposers and acceptors, each of whom seeks to match with their most preferable partner on the opposite side of the market. However, each proposer has no knowledge of their own preferences, so they must learn their preferences while forming matches in the market. We present a simple online learning rule that guarantees a strong notion of probabilistic convergence to the welfare-maximizing equilibrium of the game, referred to as the proposer-optimal stable match. To the best of our knowledge, this represents the first completely decoupled, communication-free algorithm that guarantees probabilistic convergence to an optimal stable match, irrespective of the structure of the matching market.
|
|
13:50-14:10, Paper MoB03.2 | |
>Liquid-Graph Time-Constant Network for Multi-Agent Systems Control (I) |
|
Marino, Antonio | University of Rennes 1 |
Pacchierotti, Claudio | CNRS, Univ Rennes, Inria |
Robuffo Giordano, Paolo | Centre National De La Recherche Scientifique (CNRS) |
Keywords: Neural networks, Distributed control, Networked control systems
Abstract: In this paper, we propose the Liquid-Graph Time-constant (LGTC) network, a continuous graph neural network (GNN) model for multi-agent system control based on the recent Liquid Time Constant (LTC) network. We analyse its stability leveraging contraction analysis and propose a closed-form model that preserves the model contraction rate and does not require solving an ODE at each iteration. Compared to discrete models like Graph Gated Neural Networks (GGNNs), the higher expressivity of the proposed model guarantees remarkable performance while reducing the large amount of communicated variables normally required by GNNs. We evaluate our model on a distributed multi-agent control case study (flocking) taking into account variable communication range and scalability under non-instantaneous communication.
|
|
14:10-14:30, Paper MoB03.3 | |
>Asynchronous Heterogeneous Linear Quadratic Regulator Design (I) |
|
Toso, Leonardo Felipe | Columbia University |
Wang, Han | Columbia University |
Anderson, James | Columbia University |
Keywords: Optimal control, Learning, Linear systems
Abstract: We address the problem of designing an LQR controller in a distributed setting, where M similar but not identical systems share their locally computed policy gradient (PG) estimates with a server that aggregates the estimates and computes a controller that, on average, performs well on all systems. Learning in a distributed setting has the potential to offer statistical benefits -- multiple datasets can be leveraged simultaneously to produce more accurate policy gradient estimates. However, the interplay of heterogeneous trajectory data and varying levels of local computational power introduce bias to the aggregated PG descent direction, and prevents us from fully exploiting the parallelism in the distributed computation. The latter stems from synchronous aggregation, where straggler systems negatively impact the runtime. To address this, we propose an asynchronous policy gradient algorithm for LQR control design. By carefully controlling the "staleness" in the asynchronous aggregation, we show that the designed controller converges to each system's epsilon-near optimal controller up to a heterogeneity bias. Furthermore, we prove exact local convergence at a sub-linear rate.
|
|
14:30-14:50, Paper MoB03.4 | |
>Switching Control for Identification Deception in Cyber-Physical Systems (I) |
|
Mavridis, Christos | KTH Royal Institute of Technology |
Kanellopoulos, Aris | KTH Royal Institute of Technology |
Sandberg, Henrik | KTH Royal Institute of Technology |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Cyber-Physical Security, Switched systems, Identification
Abstract: We investigate the problem of deceiving a malicious agent employing an identification method to estimate the closed-loop dynamics of a cyber-physical system. In particular, we propose a moving target defense mechanism that utilizes stochastic switching between linear closed-loop dynamics to drive a linear system identification process of a potential adversary to sub-optimal solutions with non-vanishing error. We provide a statistical analysis of the induced identification error and show that it is not possible for any linear system identification method to reconstruct the average dynamics of a stochastic switched linear system. Finally, we utilize the theory of Markov jump linear systems to guarantee asymptotic stability of the switching system, and formulate the switching control problem as an optimization problem that guarantees stability while taking into account the trade-off between security and switching effort. Simulation results showcase the efficacy of the proposed approach in inducing identification error for the adversary using minimal switching.
|
|
14:50-15:10, Paper MoB03.5 | |
>Robust Optimal Network Topology Switching for Zero Dynamics Attacks (I) |
|
Tsukamoto, Hiroyasu | NASA JPL, Caltech |
Ibrahim, Joshua | California Institute of Technology |
Hajar, Joudi | Caltech |
Ragan, James | California Institute of Technology |
Chung, Soon-Jo | California Institute of Technology |
Hadaegh, Fred Y. | California Inst. of Tech |
Keywords: Attack Detection, Networked control systems, Robust control
Abstract: The intrinsic, sampling, and enforced zero dynamics attacks (ZDAs) are among the most detrimental stealthy attacks in robotics, aerospace, and cyber-physical systems. They exploit internal dynamics, discretization, redundancy/asynchronous actuation and sensing, to construct disruptive attacks that are completely stealthy in the measurement. They work even when the systems are both controllable and observable. This paper presents a novel framework to robustly and optimally detect and mitigate ZDAs for networked linear control systems. We utilize controllability, observability, robustness, and sensitivity metrics written explicitly in terms of the system topology, thereby proposing a robust and optimal switching topology formulation for resilient ZDA detection and mitigation. Our main contribution is the reformulation of this problem into an equivalent rank-constrained optimization problem (i.e., optimization with a convex objective function subject to convex constraints and rank constraints), which can be solved using convex rank minimization approaches. The effectiveness of our method is demonstrated using networked double integrators subject to ZDAs.
|
|
15:10-15:30, Paper MoB03.6 | |
>Optimal Cooperative Multiplayer Learning Bandits with Noisy Rewards and No Communication |
|
Chang, William | University of California, Los Angeles |
Lu, Yuanhao | Princeton University |
Keywords: Agents-based systems, Cooperative control, Machine learning
Abstract: We consider a cooperative multiplayer bandit learning problem where the players are only allowed to agree on a strategy beforehand, but cannot communicate during the learning process. In this problem, each player simultaneously selects an action. Based on the actions selected by all players, the team of players receives a reward. The actions of all the players are commonly observed. However, each player receives a noisy version of the reward which cannot be shared with other players. Since players receive potentially different rewards, there is an asymmetry in the information used to select their actions. In this paper, we provide an algorithm based on upper and lower confidence bounds that the players can use to select their optimal actions despite the asymmetry in the reward information. We show that this algorithm is asymptotically optimal in T. We also show that it performs empirically better than the current state-of-the-art algorithm for this environment.
|
|
MoB04 |
Amber 3 |
Incentives, Flexibility, and Human Factors in Large-Scale Distributed
Energy Resources Control |
Invited Session |
Chair: Hamilton, Dakota | The University of Vermont |
Co-Chair: Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Organizer: Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Organizer: Cavraro, Guido | National Renewable Energy Laboratory |
Organizer: Almassalkhi, Mads | University of Vermont |
Organizer: Hamilton, Dakota | The University of Vermont |
|
13:30-13:50, Paper MoB04.1 | |
>Feedback Optimization of Incentives for Distribution Grid Services |
|
Cavraro, Guido | National Renewable Energy Laboratory |
Comden, Joshua | National Renewable Energy Laboratory |
Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Keywords: Energy systems, Power systems, Optimization
Abstract: Energy prices and net power injection limitations regulate the operations in distribution grids and typically ensure that operational constraints are met. Nevertheless, unexpected or prolonged abnormal events could undermine the grid's functioning. During contingencies, customers could contribute effectively to sustaining the network by providing services. This paper proposes an incentive mechanism that promotes users' active participation by essentially altering the energy pricing rule. The incentives are modeled via a linear function whose parameters can be computed by the system operator (SO) by solving an optimization problem. Feedback-based optimization algorithms are then proposed to seek optimal incentives by leveraging measurements from the grid, even in the case when the SO does not have a full grid and customer information. Numerical simulations on a standard testbed validate the proposed approach.
|
|
13:50-14:10, Paper MoB04.2 | |
>Data-Driven Network-Aware Control of Thermostatically Controlled Loads with Unknown Dynamics (I) |
|
Jang, Sunho | University of Michigan - Ann Arbor |
Mathieu, Johanna L. | University of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Power systems, Data driven control, Switched systems
Abstract: This paper presents a novel data-driven control algorithm that coordinates a large aggregation of heterogeneous thermostatically controlled loads (TCLs) with unknown temperature dynamics and disturbance distributions to provide balancing services to the grid. Each TCL is subject to local constraints on their temperatures due to user comfort and global constraints on the number of TCLs in a mode, called mode-counting constraints, for the safe operation of the distribution network. We first develop a data-driven method to compute sets of modes at each discretized interval of the temperature state space in which the probability of moving outside of the temperature dead-band at the next step is bounded at a certain confidence level. Subsequently, we design a model predictive control (MPC) algorithm that instructs every TCL to switch to the modes within the obtained set while satisfying the mode-counting constraints. A case study compares the proposed control algorithm with a benchmark and verifies its effectiveness in maintaining end-user comfort and the safe operation of the distribution network.
|
|
14:10-14:30, Paper MoB04.3 | |
>Towards Energysheds: A Technical Definition and Cooperative Framework for Future Power System Operations (I) |
|
Hamilton, Dakota | The University of Vermont |
Chevalier, Samuel | MIT |
Pandey, Amritanshu | University of Vermont |
Almassalkhi, Mads | University of Vermont |
Keywords: Power systems, Optimization, Energy systems
Abstract: There is growing interest in understanding how interactions between system-wide objectives and local community decision-making will impact the clean energy transition. The concept of energysheds has gained traction in the areas of public policy and social science as a way to study these relationships. However, development of technical definitions of energysheds that permit system analysis are still largely missing. In this work, we propose a mathematical definition for energysheds, and introduce an analytical framework for studying energyshed concepts within the context of future electric power system operations. This framework is used to develop insights into the factors that impact a community’s ability to achieve energyshed policy incentives within a larger connected power grid, as well as the tradeoffs associated with different spatial policy requirements. We also propose an optimization-based energyshed policy design problem, and show that it can be solved to global optimality within arbitrary precision by employing concepts from quasi-convex optimization. Finally, we investigate how interconnected energysheds can cooperatively achieve their objectives in bulk power system operations.
|
|
14:30-14:50, Paper MoB04.4 | |
>Revealing Decision Conservativeness through Inverse Distributionally Robust Optimization |
|
Li, Qi | Johns Hopkins University |
Liang, Zhirui | Johns Hopkins University |
Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Dvorkin, Yury | Johns Hopkins University |
Keywords: Power systems, Optimization, Modeling
Abstract: This paper introduces Inverse Distributionally Robust Optimization (I-DRO) as a method to infer the conservativeness level of a decision-maker, represented by the size of a Wasserstein metric-based ambiguity set, from the optimal decisions made using Forward Distributionally Robust Optimization (F-DRO). By leveraging the Karush-Kuhn-Tucker (KKT) conditions of the convex F-DRO model, we formulate I-DRO as a bi-linear program, which can be solved using off-the-shelf optimization solvers. Additionally, this formulation exhibits several advantageous properties. We demonstrate that I-DRO not only guarantees the existence and uniqueness of an optimal solution but also establishes the necessary and sufficient conditions for this optimal solution to accurately match the actual conservativeness level in F-DRO. Furthermore, we identify three extreme scenarios that may impact I-DRO effectiveness. Our case study applies F-DRO for power system scheduling under uncertainty and employs I-DRO to recover the conservativeness level of system operators. Numerical experiments based on an IEEE 5-bus system and a realistic NYISO 11-zone system demonstrate I-DRO performance in both normal and extreme scenarios.
|
|
14:50-15:10, Paper MoB04.5 | |
>Network-Aware and Welfare-Maximizing Dynamic Pricing for Energy Sharing (I) |
|
Alahmed, Ahmed | Cornell University |
Cavraro, Guido | National Renewable Energy Laboratory |
Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Tong, Lang | Cornell University |
Keywords: Energy systems, Large-scale systems, Decentralized control
Abstract: The proliferation of behind-the-meter (BTM) distributed energy resources (DER) within the electrical distribution network presents significant supply and demand flexibilities, but also introduces operational challenges such as voltage spikes and reverse power flows. In response, this paper proposes a network-aware dynamic pricing framework tailored for energy-sharing coalitions that aggregate small, but ubiquitous, BTM DER downstream of a distribution system operator's (DSO) revenue meter that adopts a generic net energy metering (NEM) tariff. By formulating a Stackelberg game between the energy-sharing market leader and its prosumers, we show that the dynamic pricing policy induces the prosumers toward a network-safe operation and decentrally maximizes the energy-sharing social welfare. The dynamic pricing mechanism involves a combination of a locational ex-ante dynamic price and an ex-post allocation, both of which are functions of the energy sharing's BTM DER. The ex-post allocation is proportionate to the price differential between the DSO NEM price and the energy-sharing locational price. Simulation results using real DER data and the IEEE 13-bus test systems illustrate the dynamic nature of network-aware pricing at each bus, and its impact on voltage.
|
|
15:10-15:30, Paper MoB04.6 | |
>Learning with Adaptive Conservativeness for Distributionally Robust Optimization: Incentive Design for Voltage Regulation (I) |
|
Liang, Zhirui | Johns Hopkins University |
Li, Qi | Johns Hopkins University |
Comden, Joshua | National Renewable Energy Laboratory |
Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Dvorkin, Yury | Johns Hopkins University |
Keywords: Smart grid, Optimization, Statistical learning
Abstract: Information asymmetry between the Distribution System Operator (DSO) and Distributed Energy Resource Aggregators (DERAs) obstructs designing effective incentives for voltage regulation. To capture this effect, we employ a Stackelberg game-theoretic framework, where the DSO seeks to overcome the information asymmetry and refine its incentive strategies by learning from DERA behavior over multiple iterations. We introduce a model-based online learning algorithm for the DSO, aimed at inferring the relationship between incentives and DERA responses. Given the uncertain nature of these responses, we also propose a distributionally robust incentive design model to control the probability of voltage regulation failure and then reformulate it into a convex problem. This model allows the DSO to periodically revise distribution assumptions on uncertain parameters in the decision model of the DERA. Finally, we present a gradient-based method that permits the DSO to adaptively modify its conservativeness level, measured by the size of a Wasserstein metric-based ambiguity set, according to historical voltage regulation performance. The effectiveness of our proposed method is demonstrated through numerical experiments.
|
|
MoB05 |
Amber 4 |
Numerical Methods for Mixed-Integer and Nonsmooth Optimal Control I |
Invited Session |
Chair: Nurkanovic, Armin | University of Freiburg |
Co-Chair: Kronqvist, Jan | KTH Royal Insitute of Technology |
Organizer: Nurkanovic, Armin | University of Freiburg |
Organizer: Kronqvist, Jan | KTH Royal Insitute of Technology |
Organizer: Acary, Vincent | INRIA Centre De Recherche De L'université De Grenoble Alpes |
Organizer: Diehl, Moritz | University of Freiburg |
|
13:30-13:50, Paper MoB05.1 | |
>Finite Elements with Switch Detection for Numerical Optimal Control of Projected Dynamical Systems (I) |
|
Pozharskiy, Anton Edvinovich | University of Freiburg |
Nurkanovic, Armin | University of Freiburg |
Diehl, Moritz | University of Freiburg |
Keywords: Optimal control, Switched systems, Numerical algorithms
Abstract: Abstract— The Finite Elements with Switch Detection (FESD) method is a highly accurate direct transcription method for optimal control of several classes of nonsmooth dynamical systems. This paper extends the FESD method to Projected Dynamical Systems (PDS) and first-order sweeping processes with time-independent sets. FESD discretizes an equivalent dynamic complementarity system and exploits the structure of the discontinuities present in the continuous-time complementarities. This is achieved by allowing integration step sizes to be degrees of freedom, and introducing additional complementarity constraints, enabling the exact detection of nonsmooth events. In contrast to fixed step Runge-Kutta integrators, the guaranteed smoothness of the system on each step allows for the recovery of full-order integration accuracy and the correct computation of numerical sensitivities. Numerical examples illustrate the effectiveness of the proposed method in simulation and optimal control contexts. FESD for PDS and examples are implemented in the open-source software package nosnoc.
|
|
13:50-14:10, Paper MoB05.2 | |
>A Gap Penalty Method for Optimal Control of Linear Complementarity Systems (I) |
|
Lin, Kangyu | Kyoto University |
Ohtsuka, Toshiyuki | Kyoto Univ |
Keywords: Optimal control, Hybrid systems, Optimization algorithms
Abstract: In this study, we propose a novel penalty reformulation and numerical solution method for optimal control problems (OCPs) of linear complementarity systems. The proposed reformulation aims to construct a penalty term tailored to the complementarity constraints using the D-gap function. The proposed penalty term is nonconvex but exhibits a convexity structure that can be utilized by convexity-exploiting solution methods. To solve the reformulated OCP efficiently, we propose a solution method using the sequential convex quadratic programming framework. The convexity of subproblems is guaranteed by a pre-computed regularization matrix using the parameter of the D-gap function. The proposed method is globalized using a merit line search strategy. We confirmed the effectiveness of the proposed method using a benchmark test in comparison with several state-of-the-art methods.
|
|
14:10-14:30, Paper MoB05.3 | |
>Optimal Control of Nonsmooth Dynamical Systems Using Measure Relaxations (I) |
|
Chhatoi, Saroj Prasad | LAAS-CNRS |
Tanwani, Aneel | Laas -- Cnrs |
Henrion, Didier | LAAS-CNRS |
Keywords: Optimal control, Optimization, Constrained control
Abstract: We address the problem of optimal control of a nonsmooth dynamical system described by an evolution variational inequality. We consider both the discrete-time and continuous-time versions of the problem and we relax the problem in the space of measures. We show that there is no gap between the original finite-dimensional problem and the relaxed problem. We show the convergence of the relaxed discrete-time optimal control in measures to continuous-time optimal control in measures. This paves the way to a sound implementation of the moment sum-of-squares hierarchy to solve numerically the optimal control of nonsmooth dynamical systems.
|
|
14:30-14:50, Paper MoB05.4 | |
>Reachability Analysis for Piecewise Affine Systems with Neural Network-Based Controllers (I) |
|
Teichrib, Dieter | TU Dortmund University |
Schulze Darup, Moritz | TU Dortmund University |
Keywords: Neural networks, Optimal control, Switched systems
Abstract: Neural networks (NN) have been successfully applied to approximate various types of complex control laws, resulting in low-complexity NN-based controllers that are fast to evaluate. However, when approximating control laws using NN, performance and stability guarantees of the original controller may not be preserved. Recently, it has been shown that it is possible to provide such guarantees for linear systems with NN-based controllers by analyzing the approximation error with respect to a stabilizing base-line controller or by computing reachable sets of the closed-loop system. The latter has the advantage of not requiring a base-line controller. In this paper, we show that similar ideas can be used to analyze the closed-loop behavior of piecewise affine (PWA) systems with an NN-based controller. Our approach builds on computing over-approximations of reachable sets using mixed-integer linear programming, which allows to certify that the closed-loop system converges to a small set containing the origin while satisfying input and state constraints. We also show how to modify a given NN-based controller to ensure asymptotic stability for the controlled PWA system.
|
|
14:50-15:10, Paper MoB05.5 | |
>High Accuracy Numerical Optimal Control for Rigid Bodies with Patch Contacts through Equivalent Contact Points (I) |
|
Dietz, Christian | Siemens AG / University of Freiburg |
Nurkanovic, Armin | University of Freiburg |
Albrecht, Sebastian | Siemens AG |
Diehl, Moritz | University of Freiburg |
Keywords: Switched systems, Optimal control
Abstract: This paper extends the Finite Elements with Switch Detection and Jumps (FESD-J) [1] method to problems of rigid body dynamics involving patch contacts. The FESD-J method is a high accuracy discretization scheme suitable for use in direct optimal control of nonsmooth mechanical systems. It detects dynamic switches exactly in time and, thereby, maintains the integration order of the underlying Runge- Kutta (RK) method. This is in contrast to commonly used time-stepping methods which only achieve first-order accuracy. Considering rigid bodies with possible patch contacts results in nondifferentiable signed distance functions (SDF), which introduces additional nonsmoothness into the dynamical system. In this work, we utilize so-called equivalent contact points (ECP), which parameterize force and impulse distributions on contact patches by evaluation at single points. We embed a nondifferentiable SDF into a complementarity Lagrangian system (CLS) and show that the determined ECP are well-defined. We then extend the FESD-J discretization to the considered CLS such that its integration accuracy is maintained. The functionality of the method is illustrated for both a simulation and an optimal control example.
|
|
15:10-15:30, Paper MoB05.6 | |
>An Inverse Optimal Control Interpretation of Augmented Distributed Optimisation Algorithms |
|
Hallinan, Liam | University of Cambridge |
Lestas, Ioannis | University of Cambridge |
Keywords: Optimal control, Networked control systems, Optimization algorithms
Abstract: Distributed optimisation algorithms are used in a wide variety of problems where the goal is for a set of agents in a network to solve a network-wide optimisation problem via decentralised update rules. Here, we use inverse optimal control to show that a broad class of augmented primal-dual distributed optimisation algorithms can be viewed as the optimal solution to a network-wide optimal control problem, thus providing a performance metric for such methods. The result is shown when inequality constraints are also present, which is a setting that requires additional consideration due to the non-smooth nature of the algorithm dynamics.
|
|
MoB06 |
Amber 5 |
Networked Control Systems I |
Regular Session |
Chair: Altafini, Claudio | Linkoping University |
Co-Chair: Santini, Stefania | University of Naples Federico II Italy |
|
13:30-13:50, Paper MoB06.1 | |
>Connectivity and Synchronization in Bounded Confidence Kuramoto Oscillators |
|
Srivastava, Trisha | Università Degli Studi Del Sannio |
Bernardo, Carmela | University of Sannio |
Altafini, Claudio | Linkoping University |
Vasca, Francesco | University of Sannio |
Keywords: Networked control systems, Agents-based systems, Modeling
Abstract: Frequency synchronization of bounded confidence Kuramoto oscillators is analyzed. The dynamics of each oscillator is defined by the average of the phase differences with its neighbors, where any two oscillators are considered neighbors if their geodesic distance is less than a certain confidence threshold. A phase-dependent graph is defined whose nodes and edges represent the oscillators and their connections, respectively. It is studied how the connectivity of the graph influences steady-state behaviors of the oscillators. It is proved that the oscillators synchronize asymptotically if the subgraph of each partition, possibly not complete, eventually remains constant over time. Simulation results show the application of the theoretical findings also in the presence of oscillators having different natural frequencies.
|
|
13:50-14:10, Paper MoB06.2 | |
>Social Network-Based Epidemic Spread with Opinion-Dependent Vaccination |
|
Bhowmick, Sourav | Indian Institute of Space Science and Technology |
Narayanasamy, Selvaganesan | Indian Institute of Space Science and Technology |
Keywords: Networked control systems, Biomedical, Control applications
Abstract: In this letter, an epidemic dynamical model driven by perceived disease severity opinion in societies is investigated along with its various dynamical characteristics. More specifically, the epidemic model namely Susceptible-Infected-Recovered-Vaccinated (SIRV) is considered over a transmission network, while the opinion reflecting the perceived disease risk evolves over a social network. In particular, the global and the local stability conditions of the disease-free equilibrium (DFE), i.e., there is no disease in the network, have been investigated, wherein the local stability is revealed to be linked with the basic reproduction rate and the transverse (non-zero) eigenvalues of the Jacobian evaluated at the DFE points. Moreover, the local stability analysis of the endemic equilibrium (EE), i.e., where disease persists in the network, has been investigated. The simulation results verify the theoretical methods.
|
|
14:10-14:30, Paper MoB06.3 | |
>String Stability for Predecessor-Following Platoons Over Channels with Heterogeneous Reception Probabilities |
|
Maass, Alejandro I. | Pontificia Universidad Católica De Chile |
Vargas, Francisco J. | Universidad Técnica Federico Santa María |
Peters, Andres A. | Universidad Adolfo Ibáñez |
Keywords: Networked control systems, Control over communications, Autonomous vehicles
Abstract: We study predecessor-following platoons in which each vehicle-to-vehicle (V2V) communication is affected by a different probability of successful transmission. We model the overall platoon as a stochastic hybrid system, and analyse its stochastic L_2 string stability via a small-gain approach. We provide an explicit string stability condition that illustrates the interplay between the channel success probabilities, transmission rate, and time headway constant. We illustrate our findings through numerical simulations.
|
|
14:30-14:50, Paper MoB06.4 | |
>Area Coverage Using Multiple Aerial Robots with Coverage Redundancy and Collision Avoidance |
|
Kim, Soobum | Georgia Institute of Technology |
Lin, Ruoyu | University of California, Irvine |
Coogan, Samuel | Georgia Institute of Technology |
Egerstedt, Magnus | University of California, Irvine |
Keywords: Networked control systems, Decentralized control, Sensor networks
Abstract: This paper presents a coverage control strategy for a team of aerial robots equipped with downward facing cameras. Based on the observation that the resolution of a camera mounted on an aerial robot degrades with the altitude of the robot, we propose a decentralized gradient-based controller that allows each robot to trade off between the size of the area it monitors and the quality of sensing it performs over the area. Moreover, the proposed controller drives a team of robots to a configuration that maximizes the joint probability for detecting targets or events of interest in the coverage domain. To ensure inter-robot collision avoidance during deployment, we utilize control barrier functions to prevent the robots from getting closer to each other than a specified safety distance. The proposed controller is experimentally validated in simulations.
|
|
14:50-15:10, Paper MoB06.5 | |
>Event-Triggered Control of Nonlinear Systems under Deception and Denial-Of-Service Attacks |
|
Seifullaev, Ruslan | Uppsala University |
Teixeira, André M. H. | Uppsala University |
Ahlen, Anders | Uppsala University |
Keywords: Networked control systems, Delay systems, Attack Detection
Abstract: We address the problem of event-triggered networked control of nonlinear systems under simultaneous deception and Denial-of-Service (DoS) attacks. By DoS attacks, we refer to disruptions in the communication channel that prevent sensor measurements from reaching the controller. When the system undergoes a deception attack, the controller receives a modified output, deviating from the sensor's original measurement. We implement the input delay approach and the Lyapunov--Krasovskii technique to obtain sufficient conditions, expressed in terms of linear matrix inequalities (LMIs), that characterize the duration of the DoS interruptions under which input-to-state stability (ISS) of the closed-loop system is preserved. Furthermore, we explore scenarios involving simultaneous attacks, where the DoS is modeled as a stochastic Bernoulli process. The closed-loop system is then considered as a stochastic impulsive system. In a similar manner, we derive conditions to ensure mean-square ISS for this case. A numerical example illustrates the efficiency of the results.
|
|
15:10-15:30, Paper MoB06.6 | |
>From Piece-Wise Constant to Continuous Time-Varying Delays: Global Exponential Stability Preservation for Nonlinear Systems under Sampling |
|
Caiazzo, Bianca | University of Naples Federico II |
Leccese, Sara | University of Naples Federico II |
Pepe, Pierdomenico | University of L' Aquila |
Petrillo, Alberto | University of Naples Federico II |
Santini, Stefania | University of Naples Federico II Italy |
Keywords: Networked control systems, Delay systems, Sampled-data control
Abstract: The preservation of the global exponential stability property for the class of fully nonlinear retarded globally Lipschitz systems with time-varying delays under suitable fast sampling is proven through this manuscript. Two main classes of time-varying delays are considered: i) piece-wise constant state delays and ii) continuous-time Lipschitz state delays. Halanay's inequality along with the equivalence between the piece-wise constant delay global exponential stability and measurable delay global exponential stability properties are exploited to demonstrate these results.
|
|
MoB07 |
Amber 6 |
Game Theory II |
Regular Session |
Chair: Hayakawa, Tomohisa | Tokyo Institute of Technology |
Co-Chair: Tatarenko, Tatiana | TU Darmstadt |
|
13:30-13:50, Paper MoB07.1 | |
>Fair Artificial Currency Incentives in Repeated Weighted Congestion Games: Equity vs. Equality |
|
Pedroso, Leonardo | Eindhoven University of Technology |
Agazzi, Andrea | Università Di Pisa |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Salazar, Mauro | Eindhoven University of Technology |
Keywords: Game theory
Abstract: When users access shared resources in a selfish manner, the resulting societal cost and perceived users' cost is often higher than what would result from a centrally coordinated optimal allocation. While several contributions in mechanism design manage to steer the aggregate users choices to the desired optimum by using monetary tolls, such approaches bear the inherent drawback of discriminating against users with a lower income. More recently, incentive schemes based on artificial currencies have been studied with the goal of achieving a system-optimal resource allocation that is also fair. In this resource-sharing context, this paper focuses on repeated weighted congestion game with two resources, where users contribute to the congestion to different extents that are captured by individual weights. First, we address the broad concept of fairness by providing a rigorous mathematical characterization of the distinct societal metrics of equity and equality, i.e., the concepts of providing equal outcomes and equal opportunities, respectively. Second, we devise weight-dependent and time-invariant optimal pricing policies to maximize equity and equality, and prove convergence of the aggregate user choices to the system-optimum. In our framework it is always possible to achieve system-optimal allocations with perfect equity, while the maximum equality that can be reached may not be perfect, which is also shown via numerical simulations.
|
|
13:50-14:10, Paper MoB07.2 | |
>Pareto Improvement for Noncooperative Systems with Vector-Valued Payoff Functions |
|
Guo, Zehui | Tokyo Institute of Technology |
Hayakawa, Tomohisa | Tokyo Institute of Technology |
Keywords: Game theory
Abstract: Definition of weak Pareto improvement is given for noncooperative systems with vector-valued payoff functions. The region where a system trajectory is Pareto improving is characerized. Some necessary and sufficient conditions are given for when the system trajectory is (weak) Pareto improving in the entire state space. We provide several numerical examples to illustrate our results.
|
|
14:10-14:30, Paper MoB07.3 | |
>From Discrete to Continuous Imitation Dynamics |
|
Aghaeeyan, Azadeh | Brock University |
Ramazi, Pouria | Brock University |
Keywords: Game theory
Abstract: It has been previously shown that a finite well-mixed population of individuals imitating the highest earners in a binary game can undergo perpetual fluctuations. However, it remains unknown whether the fluctuations in the population proportions of the two strategies persist as population size grows. In this paper, we answer this question for an imitative population with diagonal anticoordination matrices. We show that the collection of Markov chains corresponding to the population dynamics is a family of generalized stochastic approximation process for a good upper semicontinuous differential inclusion. We additionally show that the differential inclusion always converges to an equilibrium. This convergence, based on the available results in the stochastic approximation theory, implies that the lengths of the fluctuations in the population proportions of the two strategies in a finite population of imitators with diagonal anticoordination payoff matrices vanish with probability one as population size grows. Furthermore, taking the same steps for a population of imitators with diagonal coordination payoff matrices results in a similar conclusion, which is consistent with the previously reported results for finite populations of imitators with coordination payoff matrices.
|
|
14:30-14:50, Paper MoB07.4 | |
>Learning Nash in Constrained Markov Games with an alpha-Potential |
|
Das, Soham | Texas A&M University |
Eksin, Ceyhun | Texas A&M University |
Keywords: Game theory, Optimization algorithms, Agents-based systems
Abstract: We develop a best-response algorithm for solving constrained Markov games assuming limited violations for the potential game property. The limited violations of the potential game property mean that changes in value function due to unilateral policy alterations can be measured by the potential function up to an error alpha. We show the existence of stationary epsilon-approximate constrained Nash policy whenever the set of feasible stationary policies is non-empty. Our setting has agents accessing an efficient probably approximately correct solver for a constrained Markov decision process which they use for generating best-response policies against the other agents' former policies. For an accuracy threshold epsilon>4alpha, the best-response dynamics generate provable convergence to epsilon-Nash policy in finite time with probability at least 1-delta at the expense of polynomial bounds on sample complexity that scales with the reciprocal of epsilon and delta.
|
|
14:50-15:10, Paper MoB07.5 | |
>Fully Distributed Nash Equilibrium Seeking in N-Cluster Games with Zero-Order Information |
|
Tatarenko, Tatiana | TU Darmstadt |
Keywords: Game theory, Optimization algorithms, Randomized algorithms
Abstract: This paper deals with distributed Nash equilib- rium seeking in n-cluster games with zero-order information. In such games agents have an access only to the values of their own cost functions and can communicate with their neighbors in the same cluster. The agents within each cluster are cooperative and intend to minimize the overall cluster’s cost. This cost depends, however, on actions of agents from other clusters. Thus, there is a game between the clusters. We present a fully distributed gradient play algorithm to solve this game. The algorithm does not require agents to have any knowledge about action sets, actions or cost functions of others and is based on the zero- order information in the system. We prove convergence of the proposed procedure and estimate its convergence rate which turns out to be optimal up to a logarithmic term in the class of problems under consideration.
|
|
15:10-15:30, Paper MoB07.6 | |
>On the Analysis of Distributed Splitting Methods with Variance Reduction for Stochastic Generalized Nash Equilibrium Problems |
|
Tao, Haochen | Beijing Institute of Technology |
Cui, Shisheng | Beijing Institute of Technology |
Sun, Jian | Beijing Institute of Technology |
Keywords: Game theory, Optimization algorithms, Stochastic systems
Abstract: We focus on problems of stochastic generalized Nash equilibrium seeking with joint constraints and expectation-valued operators. In this work, the stochastic variance-reduced gradient (SVRG) technique is modified to contend with infinite sample space and then, a stochastic forward-backward-forward splitting scheme with variance reduction (DVRSFBF) is proposed for resolving structured monotone inclusion problems. In DVRSFBF, the average gradient is computed periodically in the outer loop, while only cheap sampling is required in the frequently activated inner loop, thus achieving significant speedups when sampling costs cannot be overlooked. The algorithm is fully distributed and it guarantees almost sure convergence under appropriate batch size and strong monotonicity assumptions. Moreover, it exhibits a linear rate with possible biased estimators, which is relatively mild and adopted by many optimization schemes especially for those based on simulations. A numerical study on a class of networked Cournot games reflects the performance of DVRSFBF.
|
|
MoB08 |
Amber 7 |
Optimal Control II |
Regular Session |
Chair: Sassano, Mario | University of Rome, Tor Vergata |
Co-Chair: Jungers, Raphaël M. | University of Louvain |
|
13:30-13:50, Paper MoB08.1 | |
>Optimal Control on Positive Cones |
|
Rantzer, Anders | Lund University |
Pates, Richard | Lund University |
Keywords: Optimal control, Linear systems, Compartmental and Positive systems
Abstract: An optimal control problem on finite-dimensional positive cones is stated. Under a critical assumption on the cone, the corresponding Bellman equation is satisfied by a linear function, which can be computed by convex optimization. A separate theorem relates the assumption on the cone to the existence of minimal elements in certain subsets of the dual cone. Three special cases are derived as examples. The first one, where the positive cone is the set of positive semi-definite matrices, reduces to standard linear quadratic control. The second one, where the positive cone is a polyhedron, reduces to a recent result on optimal control of positive systems. The third special case corresponds to linear quadratic control with additional structure, such as spatial invariance.
|
|
13:50-14:10, Paper MoB08.2 | |
>Policy Algebraic Equation for the LQR and the H-Infinity Control Problems |
|
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Optimal control, Linear systems, Robust control
Abstract: The Linear Quadratic Regulator (LQR) and the H-infinity control problems for linear systems are revisited with the objective of deriving a novel algebraic (polynomial) equation alternative to the standard Algebraic Riccati Equation (ARE). Differently from the latter, the former is envisioned to involve the policy alone, in place of the value function as in the ARE. The resulting equation, referred to as the Policy Algebraic Equation, contains nm variables and equations, of order less than or equal to 2n, where n and m denote the dimension of the state and the input, respectively.
|
|
14:10-14:30, Paper MoB08.3 | |
>About the Bang-Bang Principle for Piecewise Affine Systems |
|
Chenevat, Ruben | Université De Montpellier, MISTEA - INRAE |
Cheviron, Bruno | G-EAU, Univ Montpellier, AgroParisTech, BRGM, INRAE, Institut Ag |
Rapaport, Alain | INRAE & Univ. Montpellier |
Roux, Sébastien | INRAE |
Keywords: Optimal control, Linear systems
Abstract: We show that for piecewise affine controlled systems, the bang-bang principle does not hold even when the dynamics is continuous with respect to the state variable. However, we show that there exist minimal time trajectories with extreme controls at the loci where the dynamics is differentiable. We give two examples which exhibit singular arcs exactly at the loci of non-differentiability.
|
|
14:30-14:50, Paper MoB08.4 | |
>On Moment Relaxations for Linear State Feedback Controller Synthesis with Non-Convex Quadratic Costs and Constraints |
|
Gramlich, Dennis | RWTH Aachen |
Zhang, Hao | Tongji University |
Gao, Sheng | Tongji University |
Scherer, Carsten W. | University of Stuttgart |
Ebenbauer, Christian | RWTH Aachen University |
Keywords: Optimal control, LMIs, Time-varying systems
Abstract: We present a simple and effective way to account for non-convex costs and constraints in~state feedback synthesis, and an interpretation for the variables in which state feedback synthesis is typically convex. We achieve this by deriving the controller design using moment matrices of state and input. It turns out that this approach allows the consideration of non-convex constraints by relaxing them as expectation constraints, and that the variables in which state feedback synthesis is typically convexified can be identified with blocks of these moment matrices.
|
|
14:50-15:10, Paper MoB08.5 | |
>Multi-Crop Scheduling of Sowings in Adaptive Vertical Farms with Model Predictive Control |
|
Bagnerini, Patrizia | University of Genoa |
Gaggero, Mauro | National Research Council of Italy |
Ghio, Marco | Space V Srl |
Keywords: Optimal control, Predictive control for linear systems, Optimization
Abstract: An approach based on model predictive control is presented for the scheduling of sowings in an adaptive vertical farm, that is, a pioneering vertical greenhouse where shelf spacing is adjusted automatically according to the growth of crops. We consider the case in which the greenhouse is used for the simultaneous cultivation of multiple types of crops, with plants that may deviate from the ideal growth curve due to suboptimal temperature or humidity (modeled as a system noise). Model predictive control is used to account for such possible deviations and determine the best time instants to perform sowings in the various shelves composing the greenhouse, with the aim of maximizing the production yield. Simulation results are presented in a case study involving the simultaneous cultivation of three different types of crops. They showcase the effectiveness of the proposed method in maximizing production yield while effectively using almost all the vertical space available in the greenhouse, with various control horizons and types of disturbances.
|
|
15:10-15:30, Paper MoB08.6 | |
>Smart Abstraction Based on Iterative Cover and Non-Uniform Cells |
|
Calbert, Julien | UCLouvain |
Egidio, Lucas N. | Université Catholique De Louvain |
Jungers, Raphaël M. | University of Louvain |
Keywords: Optimal control, Robust control, LMIs
Abstract: We propose a multi-scale approach for computing abstractions of dynamical systems, that incorporates both local and global optimal control to construct a goal-specific abstraction. For a local optimal control problem, we not only design the controller ensuring the transition between every two subsets (cells) of the state space but also incorporate the volume and shape of these cells into the optimization process. This integrated approach enables the design of non-uniform cells, effectively reducing the complexity of the abstraction. These local optimal controllers are then combined into a digraph, which is globally optimized to obtain the entire trajectory. The global optimizer attempts to lazily build the abstraction along the optimal trajectory, which is less affected by an increase in the number of dimensions. Since the optimal trajectory is generally unknown in practice, we propose a methodology based on the RRT* algorithm to determine it incrementally. Finally, we provide a tractable implementation of this algorithm for the optimal control of L-smooth nonlinear dynamical systems.
|
|
MoB09 |
Amber 8 |
Predictive Control for Linear Systems II |
Regular Session |
Chair: Olaru, Sorin | CentraleSupélec |
Co-Chair: Monnigmann, Martin | Ruhr-Universität Bochum |
|
13:30-13:50, Paper MoB09.1 | |
>Robust Tube MPC Using Gain-Scheduled Policies for a Class of LPV Systems |
|
Fleming, James M. | Loughborough University |
Hawari, Qusay | Loughborough University |
Cannon, Mark | University of Oxford |
Keywords: Predictive control for linear systems, Linear parameter-varying systems, Robust control
Abstract: This paper presents a method for robust model predictive control (MPC) of linear parameter varying (LPV) systems considering control policies that are affine functions of the parameter, which is possible when only the `A' and not the `B' matrix depends on the uncertain parameter (LPV-A systems). This is less conservative than formulations in which the policy is restricted to perturbations on a feedback law, as it includes such policies as a special case. State and input constraints are handled efficiently by bounding predicted states in a sequence of polyhedra (i.e. tube MPC), that are parameterised by variables in the online optimisation. The resulting controller can be implemented by online solution of a single quadratic programming problem and can exploit rate bounds on the LPV parameters, which requires a pre-processing step at each iteration. Recursive feasibility and exponential stability are proven and the approach is compared to existing methods in numerical examples drawn from other publications, showing reduced conservatism and improved regions of attraction.
|
|
13:50-14:10, Paper MoB09.2 | |
>Elastic Tube Model Predictive Control with Scaled Zonotopic Sets |
|
Diaconescu, Sabin | University “Politehnica” of Bucharest |
Stoican, Florin | Universitatea Nationala De Stiinta Si Tehnologie POLITEHNICA BUC |
Ciubotaru, Bogdan D. | Faculty of Automatic Control and Computers, Polytechnic Universi |
Olaru, Sorin | CentraleSupélec |
Keywords: Predictive control for linear systems, Linear systems, Algebraic/geometric methods
Abstract: A novel parameterization of the tube associated with the Robust Model Predictive Control law is proposed. The aim is to reduce the computational complexity by describing the tube as a sequence of elastically-scaled zonotopic sets. The developments demonstrate the efficacy within the context of constrained control of a system affected by additive and bounded exogenous disturbances. The underlying set containment conditions using zonotopic sets result in linear complexity.
|
|
14:10-14:30, Paper MoB09.3 | |
>Remote Tube-Based MPC for Tracking Over Lossy Networks |
|
Umsonst, David | Ericsson Research |
Barbosa, Fernando | Ericsson Research |
Keywords: Predictive control for linear systems, Networked control systems, Optimal control
Abstract: This paper addresses the problem of controlling constrained systems subject to disturbances in the case where controller and system are connected over a lossy network. To do so, we propose a novel framework that splits the concept of tube-based model predictive control into two parts. One runs locally on the system and is responsible for disturbance rejection, while the other runs remotely and provides optimal input trajectories that satisfy the system's state and input constraints. Key to our approach is the presence of a nominal model and an ancillary controller on the local system. Theoretical guarantees regarding the recursive feasibility and the tracking capabilities in the presence of disturbances and packet losses in both directions are provided. To test the efficacy of the proposed approach, we compare it to a state-of-the-art solution in the case of controlling a cartpole system. Extensive simulations are carried out with both linearized and nonlinear system dynamics, as well as different packet loss probabilities and disturbances. The code for this work is available at https://github.com/EricssonResearch/Robust-Tracking-MPC-ove r-Lossy-Networks
|
|
14:30-14:50, Paper MoB09.4 | |
>A Deconvolution-Based Model Predictive Control Scheme for Resilient and Maintenance Requirements in Cyber-Physical Systems |
|
Famularo, Domenico | Università Degli Studi Della Calabria |
Franze, Giuseppe | Universita' Della Calabria |
Tedesco, Francesco | Università Della Calabria |
Keywords: Predictive control for linear systems, Networked control systems, Resilient Control Systems
Abstract: In this paper, a constrained regulation problem for networked control systems, whose plants are described by polytopic linear descriptions with a partial state availability, is considered when the communication medium is unreliable. To address the issue, an ad-hoc state-estimate control architecture has been deployed via a robust model predictive control strategy that is resilient in achieving regulation goals. Additionally, the proposed scheme is designed to prevent communication disconnections in situations where reaching the origin goal is not viable. A final solid numerical example puts in light the effectiveness and the main benefits of the proposed solution.
|
|
14:50-15:10, Paper MoB09.5 | |
>Computing the Explicit MPC Solution in the Constrained Zonotope Case |
|
Stoican, Florin | Universitatea Nationala De Stiinta Si Tehnologie POLITEHNICA BUC |
Mihai, Sergiu-Stefan | Politehnica University of Bucharest |
Monnigmann, Martin | Ruhr-Universität Bochum |
Ciubotaru, Bogdan D. | Faculty of Automatic Control and Computers, Polytechnic Universi |
Keywords: Predictive control for linear systems, Algebraic/geometric methods
Abstract: Solving the explicit model predictive control (MPC) problem entails enumerating a list of critical regions and their ancillary feedback laws. Unfortunately, their number and the time required to compute them increase exponentially with the problem size (state-space model dimension and length of the prediction horizon). We show that when, as it is often the case, the problem's constraints take the form of boxes or zonotopes, the resulting feasible domain can be compactly described as a constrained zonotope. Subsequently, we investigate whether, and under which circumstances, the combinatorial structure of the constrained zonotope interpretation accelerates the computation of the explicit solution.
|
|
15:10-15:30, Paper MoB09.6 | |
>Computing the Maximal Positive Invariant Set for the Constrained Zonotopic Case |
|
Gheorghe, Bogdan | University Politehnica of Bucharest |
Ioan, Daniel-Mihail | UNSTPB |
Stoican, Florin | Universitatea Nationala De Stiinta Si Tehnologie POLITEHNICA BUC |
Prodan, Ionela | Grenoble Institute of Technology (Grenoble INP) - Esisar |
Keywords: Predictive control for linear systems, Constrained control, Algebraic/geometric methods
Abstract: The maximal positive invariant (MPI) set results from a finite set recurrence instantiated by the intersection of input and state bounds (e.g., the stage constraints of the linear model predictive control problem). When these constraints take the form of hyper-rectangles, zonotopes or constrained zonotopes, the resulting polyhedral MPI set may be succinctly described as a constrained zonotope, eliminating the need for explicit enumeration of its halfspaces. In this paper we discuss the various MPI computation algorithms (with both exact and sufficient stop conditions), recasted in the framework of constrained zonotopes. We analyze one of these variations over a dynamical system whose dimension can be arbitrarily increased in order to asses changes in computation time and storage requirements with respect to the polyhedral case (under the simplifying assumption of closed-loop invertibility of the state matrix).
|
|
MoB10 |
Brown 1 |
Safe Planning and Control with Uncertainty Quantification II |
Invited Session |
Chair: Lindemann, Lars | University of Southern California |
Co-Chair: Haesaert, Sofie | Eindhoven University of Technology |
Organizer: Aolaritei, Liviu | UC Berkeley |
Organizer: Yu, Pian | School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology |
Organizer: Gao, Yulong | Imperial College London |
Organizer: Lindemann, Lars | University of Southern California |
Organizer: Haesaert, Sofie | Eindhoven University of Technology |
|
13:30-13:50, Paper MoB10.1 | |
>Conformal Off-Policy Prediction for Multi-Agent Systems (I) |
|
Kuipers, Tom David | King’s College London |
Tumu, Renukanandan | University of Pennsylvania |
Yang, Shuo | University of Pennsylvania |
Kazemi Mehrabadi, Milad | King's College London |
Mangharam, Rahul | University of Pennsylvania |
Paoletti, Nicola | King's College London |
Keywords: Agents-based systems, Reinforcement learning, Machine learning
Abstract: Off-Policy Prediction (OPP), i.e., predicting the outcomes of a target policy using only data collected under a nominal (behavioural) policy, is a paramount problem in data-driven analysis of safety-critical systems where the deployment of a new policy may be unsafe. To achieve dependable off-policy predictions, recent work on Conformal Off-Policy Prediction (COPP) leverage the conformal prediction framework to derive prediction regions with probabilistic guarantees under the target process. Existing COPP methods can account for the distribution shifts induced by policy switching, but are limited to single-agent systems and scalar outcomes (e.g., rewards). In this work, we introduce MA-COPP, the first conformal prediction method to solve OPP problems involving multi-agent systems, deriving joint prediction regions for all agents' trajectories when one or more ego agents change their policies. Unlike the single-agent scenario, this setting introduces higher complexity as the distribution shifts affect predictions for all agents, not just the ego agents, and the prediction task involves full multi-dimensional trajectories, not just reward values. A key contribution of MA-COPP is to avoid enumeration or exhaustive search of the output space of agent trajectories, which is instead required by existing COPP methods to construct the prediction region. We achieve this by showing that an over-approximation of the true joint prediction region (JPR) can be constructed, without enumeration, from the maximum density ratio of the JPR trajectories. We evaluate the effectiveness of MA-COPP in multi-agent systems from the PettingZoo library and the F1TENTH autonomous racing environment, achieving nominal coverage in higher dimensions and various shift settings.
|
|
13:50-14:10, Paper MoB10.2 | |
>Optimization of Utility-Based Shortfall Risk: A Non-Asymptotic Viewpoint (I) |
|
Gupte, Sumedh Sunil | TCS Research |
L.A., Prashanth | Indian Institute of Technology Madras |
Bhat, Sanjay P. | Tata Consultancy Services Limited |
Keywords: Machine learning, Optimization algorithms, Stochastic systems
Abstract: We consider the problems of estimation and optimization of utility-based shortfall risk (UBSR), which is a popular risk measure in finance. In the context of UBSR estimation, we derive a non-asymptotic bound on the mean-squared error of the classical sample average approximation (SAA) of UBSR. Next, in the context of UBSR optimization, we derive an expression for the UBSR gradient under a smooth parameterization. This expression is a ratio of expectations, both of which involve the UBSR. We use SAA for the numerator as well as denominator in the UBSR gradient expression to arrive at a biased gradient estimator. We derive non-asymptotic bounds on the estimation error, which show that our gradient estimator is asymptotically unbiased. We incorporate the aforementioned gradient estimator into a stochastic gradient (SG) algorithm for UBSR optimization. Finally, we derive non-asymptotic bounds that quantify the rate of convergence of our SG algorithm for UBSR optimization.
|
|
14:10-14:30, Paper MoB10.3 | |
>Distributionally Robust Density Control with Wasserstein Ambiguity Sets (I) |
|
Pilipovsky, Joshua | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Robust control, Uncertain systems
Abstract: Precise control under uncertainty requires a good understanding and characterization of the noise affecting the system. This paper studies the problem of steering state distributions of dynamical systems subject to partially known uncertainties. We model the distributional uncertainty of the noise process in terms of Wasserstein ambiguity sets, which, based on recent results, have been shown to be an effective means of capturing and propagating uncertainty through stochastic LTI systems. To this end, we propagate the distributional uncertainty of the state through the dynamical system, and, using an affine feedback control law, we steer the ambiguity set of the state to a prescribed, terminal ambiguity set. We also enforce distributionally robust CVaR constraints for the transient motion of the state so as to reside within a prescribed constraint space. The resulting optimization problem is formulated as a semi-definite program, which can be solved efficiently using standard off-the-shelf solvers. We illustrate the proposed distributionally-robust framework on a path planning problem and compare with non-robust solutions.
|
|
14:30-14:50, Paper MoB10.4 | |
>Robust Stochastic Shortest-Path Planning Via Risk-Sensitive Incremental Sampling (I) |
|
Enwerem, Clinton | Department of Electrical & Computer Engineering and the Institut |
Noorani, Erfaun | University of Maryland College Park |
Baras, John S. | Univ. of Maryland |
Sadler, Brian | Army Research Laboratory |
Keywords: Randomized algorithms, Uncertain systems, Robotics
Abstract: With the pervasiveness of Stochastic Shortest-Path (SSP) problems in high-risk industries, such as last-mile autonomous delivery and supply chain management, robust planning algorithms are crucial for ensuring successful task completion while mitigating hazardous outcomes. Mainstream chance-constrained incremental sampling techniques for solving SSP problems tend to be overly conservative and typically do not consider the likelihood of undesirable tail events. We propose an alternative risk-aware approach inspired by the asymptotically-optimal Rapidly-Exploring Random Trees (RRT*) planning algorithm, which selects nodes along path segments with minimal Conditional Value-at-Risk (CVaR). Our motivation rests on the step-wise coherence of the CVaR risk measure and the optimal substructure of the SSP problem. Thus, optimizing with respect to the CVaR at each sampling iteration necessarily leads to an optimal path in the limit of the sample size. We validate our approach via numerical path planning experiments in a two-dimensional grid world with obstacles and stochastic path-segment lengths. Our simulation results show that incorporating risk into the tree growth process yields paths with lengths that are significantly less sensitive to variations in the noise parameter, or equivalently, paths that are more robust to environmental uncertainty. Algorithmic analyses reveal similar query time and memory space complexity to the baseline RRT* procedure, with only a marginal increase in processing time. This increase is offset by significantly lower noise sensitivity and reduced planner failure rates.
|
|
14:50-15:10, Paper MoB10.5 | |
>Uncertainty-Aware Decision-Making and Planning for Autonomous Forced Merging (I) |
|
Zhou, Jian | Linköping University |
Gao, Yulong | Imperial College London |
Olofsson, Bjorn | Lund University |
Frisk, Erik | Linkoping Univ |
Keywords: Autonomous vehicles, Optimal control, Uncertain systems
Abstract: In this paper, we develop an uncertainty-aware decision-making and motion-planning method for an autonomous ego vehicle in forced merging scenarios, considering the motion uncertainty of surrounding vehicles. The method dynamically captures the uncertainty of surrounding vehicles by online estimation of their acceleration bounds, enabling a reactive but rapid understanding of the uncertainty characteristics of the surrounding vehicles. By leveraging these estimated bounds, a non-conservative forward occupancy of surrounding vehicles is predicted over a horizon, which is incorporated in both the decision-making process and the motion-planning strategy, to enhance the resilience and safety of the planned reference trajectory. The method successfully fulfills the tasks in challenging forced merging scenarios, and the properties are illustrated by comparison with several alternative approaches.
|
|
15:10-15:30, Paper MoB10.6 | |
>System-Level Analysis of Module Uncertainty Quantification in the Autonomy Pipeline |
|
Deglurkar, Sampada | University of California, Berkeley |
Shen, Haotian | University of California, Berkeley |
Muthali, Anish | University of California, Berkeley |
Pavone, Marco | Stanford University |
Margineantu, Dragos | Boeing |
Karkus, Peter | NVIDIA |
Ivanovic, Boris | NVIDIA Research |
Tomlin, Claire J. | UC Berkeley |
Keywords: Autonomous systems, Learning, Uncertain systems
Abstract: We present a novel perspective on the design, use, and role of uncertainty measures for learned modules in an autonomous system. While in the current literature uncertainty measures are produced for standalone modules without considering the broader system context, in our work we explicitly consider the role of decision-making under uncertainty in illuminating how "good" an uncertainty measure is. Our insights are centered around quantifying the ways in which being uncertainty-aware makes a system more robust. Firstly, we use level set generation tools to produce a measure for system robustness and use this measure to compare system designs, thus placing uncertainty quantification in the context of system performance and evaluation metrics. Secondly, we use the concept of specification generation from systems theory to produce a formulation under which a designer can simultaneously constrain the properties of an uncertainty measure and analyze the efficacy of the decision-making-under-uncertainty algorithm used by the system. We apply our analyses to two real-world and complex autonomous systems, one for autonomous driving and another for aircraft runway incursion detection, helping to form a toolbox for an uncertainty-aware system designer to produce more effective and robust systems.
|
|
MoB11 |
Brown 2 |
Data Driven Control II |
Regular Session |
Chair: Shim, Hyungbo | Seoul National University |
Co-Chair: Zemanek, Jiri | Czech Technical University in Prague |
|
13:30-13:50, Paper MoB11.1 | |
>Path Generating Inverse Gaussian Process Regression for Data-Driven Ultimate Boundedness Control of Nonlinear Systems |
|
Jang, Yeongjun | Seoul National University |
Chang, Hamin | Seoul National University |
Park, Heein | Control Dynamic Systems Laboratory, Seoul National University |
Shim, Hyungbo | Seoul National University |
Keywords: Data driven control, Nonlinear systems identification, Identification for control
Abstract: A data-driven ultimate boundedness controller for a nonlinear system is proposed. The controller is designed based on the inverse model of the system identified by Gaussian process regression with state/input measurement data. In particular, the controller actively selects a reference point from the current state with the data used for the identification to achieve a desired ultimate bound. For this reason, the controller is named as the path generating inverse Gaussian process regression (PGIGP) controller. We provide a sufficient condition on the data under which the PGIGP controller guarantees the ultimate boundedness of the closed-loop system with a desired ultimate bound. It is shown that the condition can serve as a guide for data acquisition and, conversely, be employed to establish the best control performance achievable from a given data. The performance of the PGIGP controller is demonstrated through numerical simulations.
|
|
13:50-14:10, Paper MoB11.2 | |
>Data-Driven Safe Control of Discrete-Time Non-Linear Systems |
|
Zheng, Jian | Northeastern University |
Miller, Jared | ETH Zurich |
Sznaier, Mario | Northeastern University |
Keywords: Data driven control, Robust control, Lyapunov methods
Abstract: This paper proposes a framework to perform verifiably safe control of all discrete-time non-linear systems that are compatible with collected data. Most safety-maintaining control synthesis algorithms (e.g., control barrier functions, density functions) are limited to obtaining theoretical guarantees of safety in continuous-time, even while their implementation on real systems is typically in discrete-time. We first present a sum-of-squares based program to prove the existence of an (acausal) control policy that can safely stabilize all possible data-consistent systems. Causal control policies may be extracted by online optimization, and we provide sufficient conditions for the extraction of this control policy in general scenarios and address as a specific case when convexity assumptions are imposed on the candidate Lyapunov function and safety region descriptor. Discrete-time safe stabilization is demonstrated on two example systems.
|
|
14:10-14:30, Paper MoB11.3 | |
>Data-Driven Feedback Domination Control of a Class of Nonlinear Systems |
|
Li, Jinjiang | The University of Hong Kong; HKU Shenzhen Institute of Research |
Hu, Kaijian | The University of Hong Kong, and HKU Shenzhen Institute of Resea |
Liu, Tao | The University of Hong Kong |
Keywords: Data driven control, Sampled-data control
Abstract: This paper investigates the data-driven control (DDC) problem for a class of nonlinear systems satisfying the linear growth condition. The studied system has completely unknown model parameters and mismatched nonlinearity and input. A static linear state-feedback domination controller is proposed to make the closed-loop system globally exponentially stable, which is obtained by offline solving the data-based mixed integer programs (MIPs). Compared to the existing DDC methods, our approach can handle high-order nonlinear systems with nontriangular structures. In contrast to adaptive control methods that require introducing adaptive parameters or Nussbaum functions for handling unknown parameters in the control path, resulting in a dynamic nonlinear controller, our proposed approach only requires offline computation to achieve the desired control objectives using a static linear controller. Two numerical examples are given to illustrate the effectiveness of the proposed method.
|
|
14:30-14:50, Paper MoB11.4 | |
>Data-Driven Stabilization of Non-Zero Equilibrium for Polynomial Systems |
|
Liu, Yixuan | Delft University of Technology |
Guo, Meichen | Delft University of Technology |
Keywords: Data driven control, Stability of nonlinear systems, Lyapunov methods
Abstract: Most existing work on direct data-driven stabilization considers the equilibrium at the origin. When the desired equilibrium is not the origin, existing data-driven approaches often require performing coordinate transformation, or adding integrator action to the controller. As an alternative, this work addresses data-driven state feedback stabilization of any given assignable equilibrium via dissipativity theory. We show that for a polynomial system, if a data-driven stabilizer can be designed to render the origin globally asymptotically stable, then by modifying the stabilizer, we obtain a stabilizer for any given assignable equilibrium.
|
|
14:50-15:10, Paper MoB11.5 | |
>Setpoint Control of Bilinear Systems from Noisy Data |
|
Bisoffi, Andrea | Politecnico Di Milano |
Steeman, Dominiek | University of Groningen |
De Persis, Claudio | University of Groningen |
Keywords: Data driven control, Stability of nonlinear systems, Uncertain systems
Abstract: We consider the problem of designing a controller for an unknown bilinear system using only noisy input-states data points generated by it. The controller should in principle achieve regulation to a given state setpoint and provide a guaranteed basin of attraction. Determining the equilibrium input to achieve that setpoint is not trivial in a data-based setting and we propose the design of a controller in two scenarios. The design takes the form of linear matrix inequalities and is validated numerically for a Cuk converter.
|
|
15:10-15:30, Paper MoB11.6 | |
>Data-Driven Feedback Control of Lattice Structures with Localized Actuation and Sensing |
|
Fischer, Dominik | Czech Technical University in Prague |
Do, Loi | Czech Technical University, Prague |
Smith, Miana | Massachusetts Institute of Technology |
Zemanek, Jiri | Czech Technical University in Prague |
Keywords: Data driven control, Mechatronics, Flexible structures
Abstract: Assembling lattices from discrete building blocks enables the composition of large, heterogeneous, and easily reconfigurable objects with desirable mass-to-stiffness ratios. This type of building system may also be referred to as a digital material, as it is constituted from discrete, error-correcting components. Researchers have demonstrated various active structures and even robotic systems that take advantage of the reconfigurable, mass-efficient properties of discrete lattice structures. However, the existing literature has predominantly used open-loop control strategies, limiting the performance of the presented systems. In this paper, we present a novel approach to feedback control of digital lattice structures, leveraging real-time measurements of the system dynamics. We introduce an actuated voxel which constitutes a novel means for actuation of lattice structures. Our control method is based on the Extended Dynamical Mode Decomposition algorithm in conjunction with the Linear Quadratic Regulator and the Koopman Model Predictive Control. The key advantage of our approach lies in its purely data-driven nature, without the need for any prior knowledge of a system's structure. We illustrate the developed method via real experiments with custom-built flexible lattice beam, showing its ability to accomplish various tasks even with minimal sensing and actuation resources. In particular, we address two problems: stabilization together with disturbance attenuation, and reference tracking.
|
|
MoB12 |
Brown 3 |
Reinforcement Learning I |
Regular Session |
Chair: Paschalidis, Ioannis Ch. | Boston University |
Co-Chair: Russo, Alessio | KTH Royal Institute of Technology |
|
13:30-13:50, Paper MoB12.1 | |
>A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations |
|
Ozcan, Erhan Can | Boston University |
Giammarino, Vittorio | Boston University |
Queeney, James | Mitsubishi Electric Research Laboratories |
Paschalidis, Ioannis Ch. | Boston University |
Keywords: Reinforcement learning, Machine learning, Optimization
Abstract: This paper investigates how to incorporate expert observations (without explicit information on expert actions) into a deep reinforcement learning setting to improve sample efficiency. First, we formulate an augmented policy loss combining a maximum entropy reinforcement learning objective with a behavioral cloning loss that leverages a forward dynamics model. Then, we propose an algorithm that automatically adjusts the weights of each component in the augmented loss function. Experiments on a variety of continuous control tasks demonstrate that the proposed algorithm outperforms various benchmarks by effectively utilizing available expert observations.
|
|
13:50-14:10, Paper MoB12.2 | |
>Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation |
|
Lee, Donghwan | KAIST |
Keywords: Reinforcement learning, Learning, Machine learning
Abstract: This paper analyzes multi-step TD-learning algorithms within the “deadly triad” scenario, characterized by linear function approximation, off-policy learning, and bootstrapping. In particular, we prove that n-step TD-learning algorithms converge to a solution as the sampling horizon n increases sufficiently. The paper is divided into two parts. In the first part, we comprehensively examine the fundamental properties of their model-based deterministic counterparts, including projected value iteration, gradient descent algorithms, and the control theoretic approach, which can be viewed as prototype deterministic algorithms whose analysis plays a pivotal role in understanding and developing their model-free reinforcement learning counterparts. In particular, we prove that these algorithms converge to meaningful solutions when n is sufficiently large. Based on these findings, two n-step TD-learning algorithms are proposed and analyzed, which can be seen as the model-free reinforcement learning counterparts of the gradient and control theoretic algorithms.
|
|
14:10-14:30, Paper MoB12.3 | |
>Data-Efficient Quadratic Q-Learning Using LMIs |
|
van Hulst, Jilles | Eindhoven University of Technology |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Reinforcement learning, Learning, LMIs
Abstract: Reinforcement learning (RL) has seen significant research and application results but often requires large amounts of training data. This paper proposes two data-efficient off-policy RL methods that use parametrized Q-learning. In these methods, the Q-function is chosen to be linear in the parameters and quadratic in selected basis functions in the state and control deviations from a base policy. A cost penalizing the ell_1-norm of Bellman errors is minimized. We propose two methods: Linear Matrix Inequality Q-Learning (LMI-QL) and its iterative variant (LMI-QLi), which solve the resulting episodic optimization problem through convex optimization. LMI-QL relies on a convex relaxation that yields a semidefinite programming (SDP) problem with linear matrix inequalities (LMIs). LMI-QLi entails solving sequential iterations of an SDP problem. Both methods combine convex optimization with direct Q-function learning, significantly improving learning speed. A numerical case study demonstrates their advantages over existing parametrized Q-learning methods.
|
|
14:30-14:50, Paper MoB12.4 | |
>Non-Stationary Bandits with Habituation and Recovery Dynamics and Knapsack Constraints |
|
He, Qinyang | University of Wisconsin - Madison |
Mintz, Yonatan | University of Wisconsin Madison |
Keywords: Reinforcement learning, Machine learning
Abstract: Multi-armed bandit models have proven to be useful in modeling many real world problems in the areas of control and sequential decision making with partial information. However, in many scenarios, such as those prevalent in healthcare and operations management, the decision maker's expected reward will decrease if an action is selected too frequently while it may recover if they abstain from selecting this action. This scenario is further complicated when choosing a particular action also expends a random amount of a limited resource where the distribution is also initially unknown to the decision maker. In this paper we study a class of models that address this setting that we call reducing or gaining unknown efficacy bandits with stochastic knapsack constraints (ROGUEwK). We propose a combination upper confidence bound (UCB) and lower confidence bound (LCB) approximation algorithm for optimizing this model. Our algorithm chooses which action to play at each time point by solving a linear program (LP) with the UCB for the average rewards and LCB for the average costs as inputs. We show that the regret of our algorithm is sub-linear as a function of time and total constraint budget when compared to a dynamic oracle. We validate the performance of our algorithm against existing state of the art non-stationary and knapsack bandit approaches in a simulation study and show that our methods are able to on average achieve a 13% improvement in terms of total reward.
|
|
14:50-15:10, Paper MoB12.5 | |
>Fair Best Arm Identification with Fixed Confidence |
|
Russo, Alessio | KTH Royal Institute of Technology |
Vannella, Filippo | Ericsson Research |
Keywords: Reinforcement learning, Statistical learning, Learning
Abstract: In this work, we present a novel framework for Best Arm Identification (BAI) under fairness constraints, a setting that we refer to as F-BAI (fair BAI). Unlike traditional BAI, which solely focuses on identifying the optimal arm with minimal sample complexity, F-BAI also includes a set of fairness constraints. These constraints impose a lower limit on the selection rate of each arm and can be either model-agnostic or model-dependent. For this setting, we establish an instance-specific sample complexity lower bound and analyze the price of fairness, quantifying how fairness impacts sample complexity. Based on the sample complexity lower bound, we propose F-TaS, an algorithm provably matching the sample complexity lower bound, while ensuring that the fairness constraints are satisfied. Numerical results, conducted using both a synthetic model and a practical wireless scheduling application, show the efficiency of F-TaS in minimizing the sample complexity while achieving low fairness violations.
|
|
15:10-15:30, Paper MoB12.6 | |
>Robust Q-Learning under Corrupted Rewards |
|
Maity, Sreejeet | North Carolina State University, Raleigh |
Mitra, Aritra | North Carolina State University |
Keywords: Reinforcement learning, Statistical learning, Resilient Control Systems
Abstract: Recently, there has been a surge of interest in analyzing the non-asymptotic behavior of model-free reinforcement learning algorithms. However, the performance of such algorithms in non-ideal environments - such as in the presence of corrupted rewards - is poorly understood. Motivated by this gap, we investigate the robustness of the celebrated Q-learning algorithm to a strong-contamination attack model, where an adversary can arbitrarily perturb a small fraction of the observed rewards. We start by proving that such an attack can cause the vanilla Q-learning algorithm to incur arbitrarily large errors. We then develop a novel robust synchronous Q-learning algorithm that uses historical reward data to construct robust empirical Bellman operators at each time step. Finally, we prove a finite-time convergence rate for our algorithm that matches known state-of-the-art bounds (in the absence of attacks) up to a small inevitable error term that scales with the adversarial corruption fraction.
|
|
MoB13 |
Suite 1 |
Estimation and Control of Distributed Parameter Systems II |
Invited Session |
Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Co-Chair: Hu, Weiwei | University of Georgia |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
13:30-13:50, Paper MoB13.1 | |
>Stabilization of a Reaction-Diffusion Equation in H2-Norm with Application to Saturated Neumann Measurement (I) |
|
Lhachemi, Hugo | CentraleSupelec |
Prieur, Christophe | CNRS |
Keywords: Distributed parameter systems
Abstract: This paper is concerned with the boundary output stabilization of a reaction-diffusion equation in H^2-norm. Stabilizability of reaction-diffusion PDEs is most often studied in L^2-norm and sometimes in H^1-norm. The case of the H^2-norm is much less reported in the literature. In this paper, the study of the system trajectory in H^2-norm is motivated by the fact that such a regularity is required, from a mathematical perspective, to handle a saturated Neumann measurement. More precisely, combining a classical sector condition for saturation functions and spectral methods, we show how the study of the system in H^2-norm allows the local exponential stabilization of the PDE plant while estimating a subset of the domain of attraction.
|
|
13:50-14:10, Paper MoB13.2 | |
>Enhanced Stability Conditions Associated with Stabilization and Estimator Design for a Coupled Parabolic-Elliptic System (I) |
|
Alalabi, Ala' | University of Waterloo |
Morris, Kirsten | University of Waterloo |
Keywords: Backstepping, Distributed parameter systems, Linear systems
Abstract: Stabilization of coupled parabolic-elliptic equations presents a number of challenges. Previous work by the authors presented the design of a single control that stabilizes the dynamics of a coupled parabolic-elliptic system, subject to a criterion on the system's parameters. A similar condition was obtained for estimator design with a single measurement. This paper presents a weaker sufficient condition that guarantees the system's stability. Next, the estimator design for the same system with a single measurement is considered. A less strict condition than the previous one is obtained that ensures stable error dynamics. Numerical simulations illustrate the theoretical results.
|
|
14:10-14:30, Paper MoB13.3 | |
>Unbiased Extremum Seeking for PDEs (I) |
|
Yilmaz, Cemal Tugrul | UC San Diego |
Diagne, Mamadou | University of California San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Extremum seeking, Distributed parameter systems, Delay systems
Abstract: There have been recent efforts that combine seemingly disparate methods, extremum seeking (ES) optimization and partial differential equation (PDE) backstepping, to address the problem of model-free optimization with PDE actuator dynamics. In contrast to prior PDE-compensating ES designs, which only guarantee local stability around the extremum, we introduce unbiased ES that compensates for delay and diffusion PDE dynamics while ensuring exponential and unbiased convergence to the optimum. Our method leverages exponentially decaying/growing signals within the modulation/demodulation stages and carefully selected design parameters. The stability analysis of our designs relies on a state transformation, infinite-dimensional averaging, local exponential stability of the averaged system, local stability of the transformed system, and local exponential stability of the original system. Numerical simulations are presented to demonstrate the efficacy of the developed designs.
|
|
14:30-14:50, Paper MoB13.4 | |
>A Surrogate Prediction Model for Systems Governed by Partial Differential Equations (I) |
|
Kang, Wei | Naval Postgraduate School |
Xu, Liang | Naval Research Laboratory |
Zhou, Hong | Naval Postgraduate School |
Keywords: Estimation, Computational methods, Machine learning
Abstract: This work is dedicated to the real-time prediction of dynamical system trajectories using sensor data. Our approach introduces a learning-based surrogate prediction model tailored for forecasting the state of partial differential equations (PDEs) within a limited area. Different from existing learning based methods of solving PDEs, the prediction is made based on observation data, without the necessity of knowing the initial condition and precise lateral boundary conditions for the limited-area model, both in online and offline computations. The design of our surrogate prediction model hinges on two pivotal concepts: predictability and effective region. Predictability enables us to quantitatively assess whether the observation data is sufficient for accurate prediction. Concurrently, the effective region concept decreases the computational burden associated with determining predictability and generating training data. Compared to the conventional two-stage approach—first employing data assimilation followed by prediction through differential equation integration—commonly utilized in control systems and numerical weather prediction, our surrogate prediction model offers real-time forecasting in a single step, namely, evaluating a neural network.
|
|
14:50-15:10, Paper MoB13.5 | |
>Stabilization and Optimal Control of Interconnected SDE - Scalar PDE System |
|
Velho, Gabriel | Université Paris-Saclay, CentraleSupélec, Laboratoire Des Signau |
Auriol, Jean | CNRS |
Bonalli, Riccardo | Laboratoire Des Signaux Et Systèmes |
Boussaada, Islam | Universite Paris Saclay, CNRS-CentraleSupelec-Inria |
Keywords: Distributed parameter systems, Stochastic optimal control, Delay systems
Abstract: In this paper, we design a controller for an interconnected system consisting of a linear Stochastic Differential Equation (SDE) actuated through a linear hyperbolic Partial Differential Equation (PDE). Our approach aims to minimize the variance of the state of the SDE component. We leverage a backstepping technique to transform the original PDE into an uncoupled stochastic PDE. As such, we reformulate our initial problem as the control of a delayed SDE with a non-deterministic drift. Under standard controllability assumptions, we design a controller steering the mean of the states to zero while keeping its covariance bounded. As final step, we address the optimal control of the delayed SDE employing Artstein's transformation and Linear Quadratic stochastic control techniques.
|
|
15:10-15:30, Paper MoB13.6 | |
>Event-Triggered Gain Scheduling of 2 × 2 Linear Hyperbolic PDEs with Time and Space Varying Coupling Coefficients (I) |
|
Auriol, Jean | CNRS |
Espitia, Nicolas | University of Lille - CNRS - CRIStAL Lab |
Keywords: Distributed parameter systems, Backstepping, Time-varying systems
Abstract: In this paper, we address the problem of expo- nential stabilization of 2 × 2 hyperbolic PDEs systems with time- and space-varying in-domain coupling coefficients using event-triggered gain scheduling. More precisely, we sample the coupling terms according to a Lyapunov-based event- triggering condition. At each triggering time, we define the control input as the classical static backstepping control law that would stabilize the system, thereby scheduling the gains of the controller according to the triggering mechanism while solely considering the spatial variation of the coefficients. We prove that we avoid the Zeno phenomenon under the even- triggering policy, provided that the coupling coefficients are slowly time-varying. The closed-loop exponential stability is shown using a Lyapunov analysis. Unlike existing results in the literature, the proposed approach does not require solving time-varying backstepping kernel equations in real-time, which implies a smaller computational burden and better applicability.
|
|
MoB14 |
Suite 2 |
Large-Scale Systems |
Regular Session |
Chair: Yao, Wang | Beihang University |
Co-Chair: Cao, Ming | University of Groningen |
|
13:30-13:50, Paper MoB14.1 | |
>Approximate State Estimation with Large-Scale Sensor Networks under Event-Based Sensor Scheduling Strategy |
|
Liu, Xinhui | Beijing Institute of Technology |
Cheng, Meiqi | Beijing Institute of Technology |
Shi, Dawei | Beijing Institute of Technology |
Shi, Ling | Hong Kong University of Science and Technology |
Keywords: Large-scale systems, Estimation, Sensor networks
Abstract: In this paper, we investigate the issues of real-time sensor scheduling and state estimator design within large-scale sensor network systems. Specifically, data redundancy sometimes occurs in large-scale sensor arrays due to the excessive proximity of sensing units in the spatial domain, leading to high similarity in their measurement data. Currently, it is difficult to account for such redundancy in sensor scheduling algorithms found in existing literature, where the optimal subset of sensors is generally selected by optimizing objective functions formulated from certain performance criteria. To tackle this problem, we introduce an event-based sensor scheduling strategy, the triggering condition of which is designed founded on the similarity of sensor data, so as to identify the most informative subset of sensors for state estimation. To evaluate the impact of the sensor scheduling protocol on system observability, we propose a new notion of mathcal{E}(epsilon)-observability, based on which an observability criterion is derived. In addition, we have designed a set-valued state estimation algorithm, which takes into account the intricate measurement information structure inherent within the sensor selection mechanism. The performance enhancement of the proposed estimator is also investigated. Finally, numerical experiments are conducted to validate the effectiveness of the proposed estimation algorithm and to verify the performance improvement.
|
|
13:50-14:10, Paper MoB14.2 | |
>On the Compressibility of State Snapshots for String Systems |
|
Zhu, Chenyan | Texas A&M University |
Roy, Sandip | Washington State University |
Keywords: Large-scale systems, Linear systems, Sampled-data control
Abstract: Sparse representation of the states of strings or cascades of dynamical systems is examined. Specifically, a notion of compressibility for string system states is introduced, which captures whether and to what extent the state can be expressed sparsely in a fixed basis. For a highly simplified string system model (made up of scalar, linear, discrete-time objects), compressibility in the Laplacian spectrum and Gramian spectrum bases is characterized analytically. The main result of this analysis is that the energy in the state snapshot is captured in a diminishing fraction of the basis vectors, as the string is made long. Simulations are used to illustrate the formal results, and demonstrate state recovery from sparse, randomly-located samples.
|
|
14:10-14:30, Paper MoB14.3 | |
>A Global Optimal Task Allocation Model for Large-Scale Agents Based on Mean Field Game |
|
Huang, SanJin | Beihang University |
Yao, Wang | Beihang University |
Niu, Zijia | Beihang University |
Zhang, Xiao | Beihang University |
Keywords: Large-scale systems, Mean field games, Agents-based systems
Abstract: Nowadays, task allocation methods for large-scale agents have significant application value across various fields. However, as the scale grows, the interactions between agents will increase exponentially, resulting in a dimensionality explosion problem. Therefore, finding the global optimal solution to the task allocation problem of large-scale agents has become a challenge. Aiming at global social optimal, this paper proposes a task allocation method for large-scale agents based on mean field game (MFG). The game goal of the proposed method is to minimize the total cost of task allocation for all agents and the existence of Nash equilibrium is proved. In the numerical experiments, we compare the proposed method with the task allocation method based on the greedy algorithm and the brute-force search algorithm respectively, and verify the global optimality, effectiveness, and high efficiency of the proposed method.
|
|
14:30-14:50, Paper MoB14.4 | |
>Initial Error Affection and Strategy Modification in Multi-Population LQ Mean Field Games under Erroneous Initial Distribution Information |
|
Jin, Yuxin | Beihang University |
Ren, Lu | Beihang University |
Yao, Wang | Beihang University |
Zhang, Xiao | Beihang University |
Keywords: Mean field games, Large-scale systems, Optimal control
Abstract: In this paper, the initial error affection and strategy modification in multi-population linear quadratic mean field games (MPLQMFGs) under erroneous initial distribution information are investigated. First, a MPLQMFG model is developed where agents in different populations are coupled by dynamics and cost functions. Next, by studying the evolutionary of MPLQMFGs under erroneous initial distribution information, the predicted and the actual evolutionary of mean field states are given. Furthermore, assume that each agent maintains observations only of its own state and control as well as the mean field terms of its own population, and agents are allowed to modify their strategies at an intermediate moment, two sufficient conditions are provided, where the game will reach the Nash equilibrium under correct information. Besides, the affection of the initial error on the game is discussed. Finally, simulations on the opinion evolutions of two groups are performed to verify above conclusions.
|
|
14:50-15:10, Paper MoB14.5 | |
>Computational Moment Control of Ensemble Systems |
|
Kuan, Yuan-Hung | Washington University in St. Louis |
Zhang, Wei | Washington University in St. Louis |
Li, Jr-Shin | Washington University in St. Louis |
Keywords: Large-scale systems, Optimal control, Optimization algorithms
Abstract: Finely manipulation of a large population of structurally identical dynamical systems exhibiting different dynamics, referred to as an ensemble system, is a crucial task arising from various emerging applications across diverse disciplines. A significant challenge in controlling this class of systems is the inherent scalability issue, involving computational complexity and efficiency, due to the massive size. To overcome this bottleneck, in this paper, we introduce a moment transform that maps ensemble systems defined on the space of continuous functions to their associated moment systems defined on the space of moment sequences. This transformation enables the approximation of the dynamics of an ensemble system in terms of a finite-dimensional truncated moment system. We leverage this reduction to facilitate control design for ensemble systems by developing an iterative computational optimal control algorithm with convergence guarantees. The efficiency and performance of the proposed algorithm are further demonstrated through its application to practical ensemble control problems encountered in physics and robotics.
|
|
15:10-15:30, Paper MoB14.6 | |
>Hierarchical Optimization Framework for Network Resource Allocation under Uncertainty |
|
Liu, Yifan | University of Groningen |
Cherukuri, Ashish | University of Groningen |
Keywords: Large-scale systems, Optimization algorithms, Optimization
Abstract: This paper introduces hierarchical optimization algorithms to solve, in a data-driven manner, an uncertain resource allocation problem over a two-layered tree network. In this optimization problem, the root of the tree aims to minimize the cost of procuring a certain resource, the demand for which is uncertain and originates at the leaves of the network. The demand is to be met with a certain probability which is represented as chance-constraints. We assume that data regarding the uncertain demand is available at each leaf of the network and we design a general framework of hierarchical optimization procedures where the chance-constrained allocation problem is solved using the available data with the constraint that the data is not transferred to the root. Our hierarchical procedure, termed the abstraction-allocation framework, is adapted to three data-driven algorithms for solving the chance-constrained problem: the scenario method, sample average approximation, and distributionally robust optimization. In our framework, the middle layer of the network facilitates the abstraction and allocation when information flows from leaves to the root and the other way around, respectively.
|
|
MoB15 |
Suite 3 |
Control of Complex Systems |
Regular Session |
Chair: Serrani, Andrea | The Ohio State University |
Co-Chair: Panayiotou, Christos | University of Cyprus |
|
13:30-13:50, Paper MoB15.1 | |
>On Simultaneous Implementability Problem in the Behavioral Framework |
|
Ishii, Rei | The University of Electro-Comunication |
Kaneko, Osamu | The University of Electro-Communications |
Keywords: Behavioural systems, Linear systems
Abstract: The dynamic characteristics of a plant vary with related deterioration and external factors. Therefore, it is very useful to know how much change in the dynamic characteristics of the plant can be tolerated. This paper seeks a parametrization of a plant that satisfies the given control specification using the same controller within the behavioral framework. Since this is regarded as the problem whether a single controller can be achieve (implement) the given control specification for multiple plants, this problem is referred to as simultaneous implementability problem. As for this problem, we give parametrizations of a plant with respect to both of a controller and a specification. We also consider the stabilization case. Finally, we give some illustrative examples to show the validity of our results.
|
|
13:50-14:10, Paper MoB15.2 | |
>Assessing an Energy-Based Control for the Soft Inverted Pendulum in Hamiltonian Form |
|
Pagnanelli, Giulia | University of Pisa |
Pierallini, Michele | University of Pisa |
Angelini, Franco | Università Di Pisa |
Bicchi, Antonio | Universita' Di Pisa |
Keywords: Control applications, Robotics
Abstract: Thanks to their continuously deformable structure, continuum soft robots are suited for safe human-robot interactions. However, the executable tasks are still limited in complexity due to the high number of degrees of freedom, and the consequent under-actuation that characterizes these robots complicates the control problem. To develop a control strategy taking advantage of the main system properties this work investigates the use of Interconnection and Damping Assignment Passivity Based Control in the regulation of unstable equilibria of the underactuated template model soft inverted pendulum with affine curvature. We show that remarkably, the partial differential equations that arise from the application of this technique, have a closed-form solution for this system. We verify the effectiveness of the controller via simulations, and we compare the results achieved considering the swing-up problem against the ones obtained with baseline controllers, i.e., Proportional Derivative control and Feedback Linearization.
|
|
14:10-14:30, Paper MoB15.3 | |
>Deadlock Resolution of Connected Multi-Agent Systems Using Hierarchical Control |
|
Garg, Kunal | Massachusetts Institute of Technology |
Hamilton, Sera | MIT |
Fan, Chuchu | Massachusetts Institute of Technology |
Keywords: Hierarchical control, Distributed control, Control system architecture
Abstract: Multi-agent robotic systems often require control design for a multi-objective problem, such as maintaining a safe distance from other agents as well as obstacles, maintaining network connectivity for building team knowledge, and completing team objectives for performance. Such problems are intractable in the centralized framework for large-scale systems. Thus, a distributed framework is necessary where each agent only requires its neighbors' information while being able to contribute towards completing the team objective. However, a decentralized control framework often leads to a sub-optimal solution, resulting in the system getting stuck in local minima or a deadlock. This paper addresses the issue of deadlock resolution via a hierarchical control framework. We propose a high-level planner for temporary goal assignment and a low-level controller that drives the agents to their assigned goals. The proposed framework is distributed in nature, making it scalable to large-scale multi-agent systems. We perform extensive simulation and experimental case studies to demonstrate the efficacy and need for such a hierarchical control framework.
|
|
14:30-14:50, Paper MoB15.4 | |
>Distributed Robust Indirect Adaptive Control for Manipulation of a Passive Object by a Group of Robotic Agents |
|
Giordano, Jacopo | University of Padua |
Cenedese, Angelo | University of Padova |
Serrani, Andrea | The Ohio State University |
Keywords: Indirect adaptive control, Distributed control, Robotics
Abstract: In this work, a distributed indirect adaptive controller is designed for a group of robotic agents cooperatively manipulating a common payload. Uncertainty on the model of the manipulated object and limited actuation capabilities of the single agents can significantly impact the overall behavior of the control system. An indirect adaptive control scheme is proposed in this paper to address these shortcomings. In particular, model uncertainty and loss of effectiveness of the actuators are handled in a unifying fashion by an adaptive control architecture that preserves physical consistency of the estimated inertial parameters of the manipulated object, while simultaneously providing an anti-windup mechanism for the estimated inertial parameters against actuator saturation. The stability of the closed loop system is proven theoretically and the performance and robustness of the control system are validated by means of comparative simulations with respect to a baseline state-of-the-art controller.
|
|
14:50-15:10, Paper MoB15.5 | |
>Stealthy Attack Detection of Controlled Ramp Meters in Freeway Networks |
|
Menelaou, Charalambos | Kios Research Center |
Zhang, Kangkang | Imperial College London |
Timotheou, Stelios | University of Cyprus |
Panayiotou, Christos | University of Cyprus |
Parisini, Thomas | Imperial C., Aalborg U. & Univ. of Trieste |
Keywords: Traffic control, Transportation networks, Attack Detection
Abstract: The increasing reliance on intelligent transportation systems (ITS) for traffic management has simultaneously heightened the potential for cybersecurity threats. Malicious cyber attacks on such systems can lead to operational inefficiencies, heightened congestion, and compromised safety. This work introduces a stealthy integrity attack tailored for ramp metered freeway systems. The attack model is conceptualized as a closed-loop dynamical system, formulated as an optimization problem based on the Cell Transmission Model. This work further proposes a distributed detection mechanism designed to identify attack residuals, which, by incorporating an adaptive threshold scheme, can detect the presence of an attack. To demonstrate the efficacy of our detection scheme, we present a scenario illustrating its application in freeway road networks.
|
|
15:10-15:30, Paper MoB15.6 | |
>Homography-Based Adaptive Robot Visual Tracking with Camera Parameter Convergence |
|
Lai, Beixian | Sun Yat-Sen University |
Pan, Yongping | Sun Yat-Sen University |
Li, Zhiwen | Sun Yat-Sen University |
Wen, Changyun | Nanyang Tech. Univ |
Keywords: Visual servo control, Adaptive control, Estimation
Abstract: Parametric uncertainties are ubiquitous in vision-based robotic systems. However, existing adaptive visual servoing methods can not simultaneously achieve six-degree-of-freedom (6-DoF) robot pose control in the three-dimensional (3D) space and accurate camera parameter estimation. This paper considers robot manipulators with eye-to-hand monocular cameras under unknown extrinsic parameters and proposes a passivity-based kinematic control method called homography-based visual servoing with composite learning (CL-HBVS) to achieve 6-DoF robot pose tracking. The convergence of camera extrinsic parameters is achieved by designing a composite learning mechanism without the stringent condition of persistent excitation, which ensures the exact estimation of the time-varying depth and mitigates singularity in the estimated rotation matrix. The proposed method eliminates the need to calibrate camera extrinsic parameters and measure the depths of reference feature points. Experiments on a 7-DoF robot manipulator have verified the effectiveness of the proposed CL-HBVS method.
|
|
MoB16 |
Suite 4 |
Fault Detection and Accomodation |
Regular Session |
Chair: Classens, Koen | Eindhoven University of Technology |
Co-Chair: Ferramosca, Antonio | Univeristy of Bergamo |
|
13:30-13:50, Paper MoB16.1 | |
>A Fault-Tolerant Distributed Sensor Reconciliation Scheme Based on Decomposed Steady-State Kalman Filter |
|
Torchiaro, Franco Angelo | University of Calabria |
Gagliardi, Gianfranco | Università Degli Studi Della Calabria |
Casavola, Alessandro | Universita' Della Calabria |
Tedesco, Francesco | Università Della Calabria |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Fault accomodation, Sensor networks, Kalman filtering
Abstract: This paper presents a Distributed Sensor Reconciliation (DSR) methodology based on the decomposition of a centralized steady-state Kalman Filter (KF), which is used to distributively estimate the state of a LTI plant in the presence of unpredictable sensor faults. A scenario where individual local measurements may not ensure system observability, but their collective combination does, is considered. To tackle this challenge, the proposed DSR scheme aims at hiding the faulty sensor measurements in achieving the locally state estimates and in making all of them converge to the true system's state. This is accomplished by fusing, at each agent site, both local measurements and communicated estimates from a subset of neighboring agents. The estimation scheme leverages sensor redundancy to enhance robustness against sensor faults, thereby mitigating both multiplicative and additive faults. Theoretical guarantees regarding the stability of the scheme are provided and a simulation example is discussed to illustrate the effectiveness of the scheme.
|
|
13:50-14:10, Paper MoB16.2 | |
>A Meal Detection Approach Based on Parity Space to Detect Untreated Meals in Subjects with Type 1 Diabetes |
|
Mongini, Paolo Alberto | University of Pavia, Department of Electrical, Computer and Biom |
Mazzoleni, Mirko | University of Bergamo |
Ferramosca, Antonio | Univeristy of Bergamo |
Magni, Lalo | Univ. of Pavia |
Toffanin, Chiara | University of Pavia |
Keywords: Fault detection, Simulation, Healthcare and medical systems
Abstract: In this paper a data-driven fault detection technique based on parity space is applied to the problem of detecting unannounced meals for type 1 diabetes patients. This method involves the generation and evaluation of a residual signal to detect faults (unannounced meals) acting on the system (patient). Insulin on Board (IOB), meal intake, glucose and its second derivative have been selected as input signals to generate the residuals, and the parity space matrices are estimated using 8-hour training in silico data generated by the UVA/Padova simulator. The residual evaluation module compares the residual with a threshold that is optimized for each patient using 2-day tuning data. The complete system is evaluated on 1-week scenario obtaining promising results (TPR=95%, PPV=72%) with a detection delay of 42 minutes for 80% of patients. For the 20 outlier patients, a fine-tuning and patient-tailored input signal processing are proposed as a future development.
|
|
14:10-14:30, Paper MoB16.3 | |
>Distributed Joint Fault Estimation for Multi-Agent Systems Via Dynamic Event-Triggered Communication |
|
Wang, Zeyuan | University of Paris-Saclay |
Chadli, M. | University of Paris-Saclay - UEVE |
Keywords: Fault diagnosis, Estimation, Distributed control
Abstract: This paper studies a novel distributed fault estimation framework for multi-agent systems under directed topology, subject to time-varying multiplicative and additive faults. Both actuator and sensor faults are simultaneously addressed by introducing an augmented system. A two-step design process is presented aimed at joint estimation of faults, system state, and exogenous disturbance, which involves an lth-order proportional-integral observer and a constrained least square estimator. Utilizing the relative output of neighbor information enhances the accuracy of fault and state estimation. Output sharing is realized by a dynamic event-triggered communication protocol, which effectively saves network resources. The design conditions of the observer are formulated as an optimization problem subject to linear matrix inequalities, ensuring guaranteed H-infinity performance with regard to not only estimation error but also event error. Simulation results validate the effectiveness and feasibility of the proposed method.
|
|
14:30-14:50, Paper MoB16.4 | |
>Few-Shot Metric Adversarial Adaptation for Cross-Machine Fault Diagnosis |
|
Chen, Qitong | Soochow University |
Zhuang, Hong | Soochow University |
Zhang, Yueyuan | Soochow University |
Chen, Liang | Soochow University |
Li, Qi | Tsinghua University |
Keywords: Fault diagnosis, Fault detection
Abstract: Research interest in the area of fault diagnosis is shifting from cross-domain to cross-machine, which is crucial for industrial applications with variable operation conditions and different machine configurations. This paper proposes a method named Few-shot Metric Adversarial Adaptation (FMAA) for cross-machine diagnosis of industrial machinery. Firstly, FMAA reduces the data distribution differences between few-shots belonging to the same category in the source domain and the target domain through metric adversarial learning, while increasing the feature distances among different categories. Secondly, the Label Self-Correcting Maximum Mean Discrepancy (LSMMD) method is proposed to correct misclassifications of the model while reducing the conditional distribution differences between the source and target domains. Furthermore, a lightweight attention mechanism-based diagnosis model is proposed to perform cross-machine fault classification tasks. The robustness, universality, and superiority of the proposed method are verified through comprehensive experiments on two platforms for industrial robots and bearings. The code is available on: https://github.com/CCSLab425/FMAA.
|
|
14:50-15:10, Paper MoB16.5 | |
>Optimal Fault Detection for Closed-Loop Linear Uncertain Systems |
|
Classens, Koen | Eindhoven University of Technology |
Ickenroth, Tjeerd | Eindhoven University of Technology |
van de Wijdeven, Jeroen | ASML Netherlands B.V |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Fault detection, Optimization, Fault diagnosis
Abstract: Robust fault detection is crucial for ensuring the reliability and safety of complex engineering systems. However, distinguishing faults from disturbances and modelling uncertainty which are inherently present in any practical system remains remains a challenging task. This paper addresses the robust fault detection filter design problem for continuous-time linear time-invariant uncertain systems operating in open or closed-loop configurations. The proposed framework offers a unified approach to handle both parametric and dynamic uncertainties by solving a single Riccati equation, based on a worst-case disturbance and uncertainty scenario. The efficacy of the proposed approach is demonstrated on a numerical multivariable double mass-spring-damper system. The results illustrate that an optimal compromise is achieved between fault sensitivity and rejection of modelling uncertainties and disturbances. This capability enables the clear differentiation between faults and undesired effects in the residuals, thereby enhancing fault detection reliability, ultimately contributing to improved safety and performance.
|
|
15:10-15:30, Paper MoB16.6 | |
>Convex Reformulation of Information Constrained Linear State Estimation with Mixed-Binary Variables for Outlier Accommodation |
|
Hu, Wang | University of California Riverside |
Jiang, Zeyi | University of California Riverside |
Mohsenian-Rad, Hamed | University of California at Riverside |
Farrell, Jay A. | University of California Riverside |
Keywords: Kalman filtering, Optimization, Fault accomodation
Abstract: This article considers the challenge of accommodating outlier measurements in state estimation. The Risk Averse Performance-Specified (RAPS) state estimation approach addresses outliers as a measurement selection Bayesian risk minimization problem subject to an information accuracy constraint, which is a non-convex optimization problem. Prior explorations into RAPS rely on exhaustive search, which becomes computationally infeasible as the number of measurements increases. This paper derives a convex formulation for the RAPS optimization problems via transforming the mixed-binary variables into linear constraints. The convex reformulation herein can be solved by convex programming toolboxes, significantly enhancing computational efficiency. We explore two specifications: Full-RAPS, utilizing the full information matrix, and Diag-RAPS, focusing on diagonal elements only. The simulation comparison demonstrates that Diag-RAPS is faster and more efficient than Full-RAPS. In comparison with Kalman Filter (KF) and Threshold Decisions (TD), Diag-RAPS consistently achieves the lowest risk, while achieving the performance specification when it is feasible.
|
|
MoB17 |
Suite 6 |
Biological Systems II |
Regular Session |
Chair: Djema, Walid | Centre INRIA d'Université Côte D'Azur |
Co-Chair: Notarstefano, Giuseppe | University of Bologna |
|
13:30-13:50, Paper MoB17.1 | |
>Biological Cell Tracking Via Multi-Agent Identification and Filtering |
|
Tramaloni, Andrea | University of Bologna |
Testa, Andrea | University of Bologna |
Avnet, Sofia | University of Bologna |
Baldini, Nicola | University of Bologna |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Cellular dynamics, Systems biology, Distributed control
Abstract: Understanding cellular dynamics is a fundamental topic in different biomedical applications. Nowadays, optical microscopy is one of the most used techniques to visualize cell movements. In this paper, we consider a novel cell-tracking algorithm to track multiple cells in optical microscopy videos. The proposed methodology combines two steps. First, we model cell movements and their neighboring interactions according to tailored nonlinear multi-agent systems. Then, we identify model parameters from real cellular trajectories and predict cell movements across different frames of a video. In particular, we use an Extended Kalman Filter that exploits the distributed nature of cell dynamics. Numerical experiments on videos from the Cell Tracking Challenge dataset are performed to validate the proposed method and performance metrics are shown.
|
|
13:50-14:10, Paper MoB17.2 | |
>Multi-Variable Control to Mitigate Loads in CRISPRa Networks |
|
Manoj, Krishna | Massachusetts Institute of Technology |
Grunberg, Theodore W. | Massachusetts Institute of Technology |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Genetic regulatory systems, Systems biology, Biomolecular systems
Abstract: The discovery of CRISPR-mediated gene activation (CRISPRa) has transformed the way in which we perform genetic screening, bioproduction and therapeutics through its ability to scale and multiplex. However, the emergence of loads on the key molecular resources constituting CRISPRa by the orthogonal short RNA that guide such resources to gene targets, couple theoretically independent CRISPRa modules. This coupling negates the ability of CRISPRa systems to concurrently regulate multiple genes independent of one another. In this paper, we propose to reduce this coupling by mitigating the loads on the molecular resources that constitute CRISPRa. In particular, we design a multi-variable controller that makes the concentration of these molecular resources robust to variations in the level of the short RNA loads. This work serves as a foundation to design and implement CRISPRa controllers for practical applications.
|
|
14:10-14:30, Paper MoB17.3 | |
>Optimal Control for a Combination of Cancer Therapies in a Model of Cell Competition |
|
Mazel, Pauline | Centre Inria d'Univerté Côte D'Azur |
Djema, Walid | Centre INRIA d'Université Côte D'Azur |
Grognard, Frederic | INRIA Sophia-Antipolis |
Keywords: Optimal control, Biological systems, Nonlinear systems
Abstract: We present a mathematical model of ordinary differential equations that describes the interaction between a healthy cell population and cancerous cell population. This model includes the effects on cell populations of chemotherapy and targeted therapy, which are two bounded control variables. We study this model and seek to optimize the fraction of healthy cells within the total cell population over a given therapy period. We apply the Pontryagin Maximum Principle (PMP) and establish the expressions of singular solutions in different interaction cases between healthy and cancer cells. Then, we use a direct optimization method to validate and illustrate our theoretical results.
|
|
14:30-14:50, Paper MoB17.4 | |
>State Estimation and Dynamic Control of a Natural Co-Culture |
|
Morlock, Jan | ETH Zurich |
Lee, Ting An | University of Oxford |
Allan, John | Newcastle College |
Steel, Harrison | University of Oxford |
Keywords: Systems biology, Kalman filtering, Estimation
Abstract: Achieving compositional control of microbial co-cultures and communities is essential before they can be utilised in bioprocesses. By exploiting natural microbial characteristics, this letter presents the first application of cybernetic control to a bacterial co-culture without metabolically burdensome genetic engineering. This is demonstrated by first integrating several individually insufficient but complementary bioreactor measurements to generate robust population estimates. Measurement and modelling methodology is calibrated on bacterial monocultures, then directly translated to the estimation of co-culture composition over extended time periods. Finally, we show that this modelling approach enables predictable actuation of co-culture composition by regulating its growth temperature to trade-off each member’s optimum temperature niche, using a PI controller to perform closed-loop control capable of tracking dynamic references. This adds to the growing body of work applying control theory to enable a new paradigm of more productive and robust bioprocesses using microbial consortia instead of monocultures.
|
|
14:50-15:10, Paper MoB17.5 | |
>Common Structures of Optimal Solutions for a Crop Irrigation Problem under Various Constraints and Criteria |
|
Chenevat, Ruben | Université De Montpellier, MISTEA - INRAE |
Cheviron, Bruno | G-EAU, Univ Montpellier, AgroParisTech, BRGM, INRAE, Institut Ag |
Rapaport, Alain | INRAE & Univ. Montpellier |
Roux, Sébastien | INRAE |
Keywords: Optimal control, Biological systems
Abstract: We consider a simplified crop irrigation model written as a non-autonomous, non-smooth controlled system. Different operating contexts and objectives lead to the study of optimal control problems with various state constraints, criterion and targets. We look for feedback solutions and we derive the optimality necessary conditions for the unified formulation. We show that there are only two parametrized time-varying feedback strategies consisting of the best and the worst practices, independent of the chosen objective, and suitable for numerical analysis.
|
|
15:10-15:30, Paper MoB17.6 | |
>An Interval Predictor–based Robust Controller for the Blood Glucose Regulation Problem |
|
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Mera, Manuel | Esime Upt Ipn |
Ferreira de Loza, Alejandra | IPN CITEDI |
Keywords: Constrained control, Robust control, Biological systems
Abstract: This paper contributes to designing a robust controller for the blood glucose regulation problem in patients with diabetes mellitus type–1. The proposed switching control approach is based on an interval predictor–based state–feedback, which takes into account the state and input constraints of the insulin–glucose system dynamics (Bergman minimal model), i.e., positive states and input, and minimum and maximum values of the blood glucose level and the insulin infusion rate. The method deals with interpatient variabilities and unannounced food intake. Additionally, the switching structure of the control law allows us to switch off the state–feedback controller stopping the insulin injection for proper glucose level regulation. The stability analysis is based on a Lyapunov function approach and guarantees the asymptotic convergence of the blood glucose level around the desired value. The synthesis of the controller is constructive since it is in terms of linear matrix inequalities. Some simulation results, over a cohort of 4 virtual type 1 diabetes mellitus adult patients, illustrate the performance of the proposed robust controller.
|
|
MoB18 |
Suite 7 |
Linear Systems II |
Regular Session |
Chair: Perdon, Anna Maria | Ancona |
Co-Chair: Komaee, Arash | Southern Illinois University |
|
13:30-13:50, Paper MoB18.1 | |
>The Model Matching Problem for Periodic Max-Plus Systems |
|
Animobono, Davide | Università Politecnica Delle Marche |
Scaradozzi, David | Università Politecnica Delle Marche |
Zattoni, Elena | Alma Mater Studiorum Università Di Bologna |
Perdon, Anna Maria | Ancona |
Conte, Giuseppe | Universita' Politecnica Delle Marche |
Keywords: Linear systems, Discrete event systems, Switched systems
Abstract: Max-plus linear systems are suitable to model discrete event systems with synchronization phenomena, but not competition. In specific situations, competition can be introduced by considering event-varying periodic parameters, which allow us to model shared resources allocated in accordance to a periodic schedule, thus obtaining a periodic max-plus linear system. In this paper, we propose an extension of the geometric approach to systems of such class. The new results can be used to solve the model matching problem, so as to force a given plant to match the output of a given model exactly. A geometric, structural, necessary and sufficient condition for the solvability of such problem is presented.
|
|
13:50-14:10, Paper MoB18.2 | |
>LP-Based Positive Leader-Following Consensus Protocol for PMASs under Sensor and Actuator Faults (I) |
|
Xing, Wei | Hainan University |
Wang, Shinong | Hainan University |
Tang, Hao | Hainan University |
Ahmed, Bakr | Al-Azhar University |
Zhang, Junfeng | Hainan University |
Keywords: Linear systems, Lyapunov methods, Cooperative control
Abstract: To address the challenges posed by sensor faults (SFs), actuator faults (AFs), and external disturbances, this paper introduces a positive leader-following consensus protocol(PLFCP) for positive multi-agent systems, aiming to prevent anomalous operations and system failures. We provide a novel PLFCP for PMASs under SFs using relative output information and states of the leader and followers. The positivity and leader-following consensus of PMASs can be guaranteed in terms of positive systems theory and co-positive Lyapunov function (CPLF). In addition, a linear programming (LP) is presented to obtain the corresponding positivity and consensus conditions. In the presence of SFs and external disturbances, we propose a matrix decomposition-based approach to construct a PLFCP with mathcal{L}_{1}-gain performance, specifically designed for addressing SFs and AFs in positive multi-agent systems. Finally, an illustrative example is provided to verify the validity of the obtained results.
|
|
14:10-14:30, Paper MoB18.3 | |
>Partial Closed-Loop Pole Assignment Via Sylvester Equation for Linear Time-Invariant Systems |
|
Wang, Shang | Wageningen University & Research |
Cheng, Xiaodong | Wageningen University and Research |
van Heijster, Peter | Wageningen University & Research |
Keywords: Linear systems, Model/Controller reduction, Optimal control
Abstract: This paper introduces a novel approach of partial closed-loop pole assignment for linear time-invariant (LTI) systems. A linear algebraic condition, derived from the Sylvester equation of the system, is presented to characterize the state feedback gains that allocate a subset of the closed-loop poles to a prescribed set of points. We show that this condition can serve as a constraint for designing controllers to achieve other control objectives, for instance, maintaining the other unspecified poles unchanged or minimizing the H_infty norm of the closed-loop system. Furthermore, a connection between the proposed method and moment-matching-based model reduction is also discussed, which shows that the controller gain can be designed based on a reduced-order model that matches certain moments of the original system. Finally, some numerical examples are used to illustrate the proposed method.
|
|
14:30-14:50, Paper MoB18.4 | |
>Minimum Number of Observables and System Invariants in Super-Linearization |
|
Ko, Jehyung | University of Illinois Urbana Champaign |
Belabbas, Mohamed Ali | University of Illinois at Urbana-Champaign |
Keywords: Linear systems, Modeling
Abstract: This paper explores the concept of super-linearization, a technique for transforming non-linear systems into a linear system in a higher dimensional space. Our main result is to determine the lowest dimension of this linear representation, provided that one exists. In proving this result, we introduce new universal invariants for super-linearizations.
|
|
14:50-15:10, Paper MoB18.5 | |
>Explicit Computation of Guaranteed State Estimates Using Constrained Convex Generators |
|
Castro Rego, Francisco | ISR/IST |
Silvestre, Daniel | NOVA University of Lisbon |
Keywords: Linear systems, Observers for Linear systems, Estimation
Abstract: One of the main challenges when performing set-based state estimation is the inherent trade-off between accuracy and computing time. When using accurate set representations like polytopes, even if written in Constrained Zonotopes (CZs) format, the data structures keep increasing in size which will lead to the need of some order reduction method that increases the computational load in the iterations when such a routine is run. Moreover, computing a vector estimate will amount to solving an optimization problem or a matrix inversion, which are expensive procedures if the state space is large. In this paper, we propose an efficient approach for the state estimation of discrete-time Linear Time-Invariant (LTI) systems based on Constrained Convex Generators (CCGs) that allows to write explicitly the set in terms of a fixed number of past inputs and measurements. In doing so, the whole estimation task amounts to performing a small number of multiplications with offline-computed matrices which makes the runtime computation significantly faster and removes the need for order reduction methods. Numerical results show the effectiveness of the proposed method.
|
|
15:10-15:30, Paper MoB18.6 | |
>Preventing Actuator Saturation in Linear Quadratic Regulators |
|
Komaee, Arash | Southern Illinois University |
Keywords: Linear systems, Optimal control, Stability of nonlinear systems
Abstract: This paper presents a nonlinear state feedback law for regulation of linear systems with control constraints. Such constraints mathematically describe the physical limitations on the actuators driving these systems, usually known as actuator saturation. The proposed state feedback law resembles a linear quadratic regulator (LQR) with an adaptable gain, dynamically adjusted to prevent actuator saturation. This feedback law is computationally inexpensive for real-time implementation, and offers a viable alternative to the typically more complex control laws derived from a constrained LQR formulation. Simulation results are presented that demonstrate the effectiveness of the presented feedback law.
|
|
MoB19 |
Suite 8 |
Stochastic Optimal Control II |
Regular Session |
Chair: Tanaka, Takashi | University of Texas at Austin |
Co-Chair: Lucia, Sergio | TU Dortmund University |
|
13:30-13:50, Paper MoB19.1 | |
>LQG Problem of Descriptor Systems and Separation Principle |
|
Nosrati, Komeil | Tallinn University of Technology |
Belikov, Juri | Tallinn University of Technology |
Tepljakov, Aleksei | Tallinn University of Technology |
Petlenkov, Eduard | Tallinn University of Technology |
Keywords: Stochastic optimal control, Estimation, Differential-algebraic systems
Abstract: The performance of linear-quadratic regulators (LQRs), which possess robustness with guaranteed levels of gain and phase margin, will be affected by incomplete state information. By furnishing this regulator with a linear-quadratic estimator (LQE), a recursive algorithm with an optimal feedback law can be obtained. Notwithstanding this, the optimal operation of this so-called linear-quadratic Gaussian (LQG) algorithm might be compromised in transformed stochastic descriptor systems if constraints are violated. Inspired by recent results in stochastic control, we deal with the LQG problem using a direct strategy without any transformations and regularity assumptions. First, an expected quadratic cost function using a regularized least-squares is formulated. Using the estimated states with a designed LQE, we then connect the formulation to a constrained recursive minimization problem under Bellman's principle. To accomplish this, a dynamic programming approach is used in a backward policy within a finite horizon to develop an LQG algorithm for the original system. We conclude the study by stating the separation principle of optimal control and state estimation and verifying the results, where despite noisy measurements, the controller effectively track the reference position while stabilizing the DC motor.
|
|
13:50-14:10, Paper MoB19.2 | |
>Structured Reinforcement Learning for Incentivized Stochastic Covert Optimization |
|
Jain, Adit | Cornell University |
Krishnamurthy, Vikram | Cornell University |
Keywords: Stochastic optimal control, Stochastic systems, Machine learning
Abstract: This paper studies how a stochastic gradient algorithm (SG) can be controlled to hide the estimate of the local stationary point from an eavesdropper. Such problems are of significant interest in distributed optimization settings like federated learning and inventory management. A learner queries a stochastic oracle and incentivizes the oracle to obtain noisy gradient measurements and perform SG. The oracle probabilistically returns either a noisy gradient of the function or a non-informative measurement, depending on the oracle state and incentive. The learner's query and incentive are visible to an eavesdropper who wishes to estimate the stationary point. This paper formulates the problem of the learner performing covert optimization by dynamically incentivizing the stochastic oracle and obfuscating the eavesdropper as a finite-horizon Markov decision process (MDP). Using conditions for interval-dominance on the cost and transition probability structure, we show that the optimal policy for the MDP has a monotone threshold structure. We propose searching for the optimal stationary policy with the threshold structure using a stochastic approximation algorithm and a multi-armed bandit approach. The effectiveness of our methods is numerically demonstrated on a covert federated learning hate-speech classification task.
|
|
14:10-14:30, Paper MoB19.3 | |
>Fourth-Order Suboptimality of Nominal Model Predictive Control in the Presence of Uncertainty |
|
Messerer, Florian | University of Freiburg |
Baumgärtner, Katrin | University of Freiburg |
Lucia, Sergio | TU Dortmund University |
Diehl, Moritz | University of Freiburg |
Keywords: Stochastic optimal control, Predictive control for nonlinear systems, Optimization
Abstract: We investigate the suboptimality resulting from the application of nominal model predictive control (MPC) to a nonlinear discrete time stochastic system. The suboptimality is defined with respect to the corresponding stochastic optimal control problem (OCP) that minimizes the expected cost of the closed loop system. In this context, nominal MPC corresponds to a form of certainty-equivalent control (CEC). We prove that, in a smooth and unconstrained setting, the suboptimality growth is of fourth order with respect to the level of uncertainty, a parameter which we can think of as a standard deviation. This implies that the suboptimality does not grow very quickly as the level of uncertainty is increased, providing further insight into the practical success of nominal MPC. Similarly, the difference between the optimal and suboptimal control inputs is of second order. We illustrate the result on a simple numerical example, which we also use to show how the proven relationship may cease to hold in the presence of state constraints.
|
|
14:30-14:50, Paper MoB19.4 | |
>Expected Time-Optimal Control: A Particle Model Predictive Control-Based Approach Via Sequential Convex Programming |
|
Echigo, Kazuya | University of Washington |
Cauligi, Abhishek | NASA Jet Propulsion Lab |
Acikmese, Behcet | University of Washington |
Keywords: Stochastic optimal control, Nonlinear systems, Aerospace
Abstract: In this paper, we consider the problem of minimum-time optimal control for a dynamical system with initial state uncertainties and propose a sequential convex programming (SCP) solution framework. We seek to minimize the expected terminal (mission) time, which is an essential capability for planetary exploration missions where ground rovers have to carry out scientific tasks efficiently within the mission timelines in uncertain environments. Our main contribution is to convert the underlying stochastic optimal control problem into a deterministic, numerically tractable, optimal control problem. To this end, the proposed solution framework combines two strategies from previous methods: i) a partial model predictive control with consensus horizon approach and ii) a sum-of-norm cost, a temporally strictly increasing weighted-norm, promoting minimum-time trajectories. Our contribution is to adopt these formulations into an SCP solution framework and obtain a numerically tractable stochastic control algorithm. We then demonstrate the resulting control method in multiple applications: i) a closed-loop linear system as a representative result (a spacecraft double integrator model), ii) an open-loop linear system (the same model), and then iii) a nonlinear system (Dubin's car).
|
|
14:50-15:10, Paper MoB19.5 | |
>Discrete-Time Stochastic LQR Via Path Integral Control and Its Sample Complexity Analysis |
|
Patil, Apurva | The University of Texas at Austin |
Hanasusanto, Grani A. | University of Illinois Urbana-Champaign |
Tanaka, Takashi | University of Texas at Austin |
Keywords: Stochastic optimal control, Numerical algorithms, Data driven control
Abstract: In this paper, we derive the path integral control algorithm to solve a discrete-time stochastic Linear Quadratic Regulator (LQR) problem and carry out its sample complexity analysis. While the stochastic LQR problem can be efficiently solved by the standard backward Riccati recursion, our primary focus in this paper is to establish the foundation for a sample complexity analysis of the path integral method when the analytical expressions of optimal control law and the cost are available. Specifically, we derive a bound on the error between the optimal LQR input and the input computed by the path integral method as a function of the sample size. Our analysis reveals that the sample size required exhibits a logarithmic dependence on the dimension of the control input. This observation highlights the robustness of the path integral approach against the curse of dimensionality. textcolor{}{Lastly, we formulate a chance-constrained optimization problem whose solution quantifies the worst-case control performance of the path integral approach.}
|
|
15:10-15:30, Paper MoB19.6 | |
>Stochastic Model Predictive Control with Minimal Constraint Violation Probability for Time-Variant Chance Constraints |
|
Fink, Michael | Technical University of Munich |
Wollherr, Dirk | Technische Universität München |
Leibold, Marion | TU Muenchen |
Keywords: Stochastic optimal control, Predictive control for linear systems, Robust control
Abstract: Despite the effectiveness of Robust and Stochastic Model Predictive Control, not all scenarios require robust trajectories or permit constraint violations. Achieving a balance between safety and performance is crucial. We propose a Model Predictive Control approach that provides an optimal control law by minimizing the probability of constraint violations for time-variant constraints while aiming at achieving a performance criterion. Either the constraints are satisfied robustly or with minimal probability of constraint violation. Further, we propose to switch between minimizing the performance criterion and minimizing the constraint violation probability whenever either of them does not meet the requirements anymore. Recursive feasibility and stability of the method are proved. We evaluate the approach for overtaking of an autonomous vehicle in a simulation study.
|
|
MoB20 |
Suite 9 |
Parameter Varying Systems |
Regular Session |
Chair: Tóth, Roland | Eindhoven University of Technology |
Co-Chair: Mercère, Guillaume | University of Poitiers |
|
13:30-13:50, Paper MoB20.1 | |
>Guaranteeing Stability in Structured Input-Output Models: With Application to System Identification |
|
Kon, Johan | Eindhoven University of Technology |
Tóth, Roland | Eindhoven University of Technology |
van de Wijdeven, Jeroen | ASML Netherlands B.V |
Heertjes, Marcel | Eindhoven University of Technology |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Linear parameter-varying systems, Identification, Neural networks
Abstract: Identifying structured discrete-time linear time/parameter-varying (LPV) input-output (IO) models with global stability guarantees is a challenging problem since stability for such models is only implicitly defined through the solution of matrix inequalities (MI) in terms of the model's coefficient functions. In this paper, a structured linear IO model class is developed that results in a quadratically stable model for any choice of coefficient functions, enabling identification using standard optimization routines while guaranteeing stability. This is achieved through transforming the MI-based stability constraints in a necessary and sufficient manner, such that for any choice of transformed coefficient functions the MIs are satisfied. The developed stable LPV-IO model is employed in simulation to estimate the parameter-varying damping of mass-damper-spring system with stability guarantees, while a standard LPV-IO model results in an unstable estimate.
|
|
13:50-14:10, Paper MoB20.2 | |
>Spline-Based Parameter Varying Output Feedback Synthesis with Improved l_2 Gain |
|
Xu, Yicheng | University of California Irvine |
Jabbari, Faryar | Univ. of California at Irvine |
Keywords: Linear parameter-varying systems, Nonlinear output feedback, LMIs
Abstract: Dynamic output feedback synthesis for systems with Linear Parameter Varying (LPV) models is presented. The study focuses on solving Parameter-Dependent (PD) Linear Matrix Inequalities with a grid-based method, providing a modification to existing techniques that improves performance, particularly when a small number of nodes or splines are used. Additionally, a novel approach is proposed for synthesizing LPV Dynamic Output Feedback controllers without relying on the time derivative of scheduling parameters. The gridding-based method is employed for solving the associated PD LMIs in estimating the { L_2} gain of the closed-loop system. Numerical examples are provided to demonstrate the effectiveness of the proposed method.
|
|
14:10-14:30, Paper MoB20.3 | |
>Robust Nonlinear Model Predictive Control Based on the LPV System with Polytopic Uncertainty Identified Using Koopman Operator |
|
Chen, Zhong | Central South University |
Chen, Xiaofang | Central South University |
Cen, Lihui | Central South University |
Gui, Weihua | Central South University |
Keywords: Linear parameter-varying systems, Nonlinear systems identification, Predictive control for nonlinear systems
Abstract: Koopman operator-based model predictive control is a data-driven control approach for nonlinear systems with unknown dynamics. However, the modeling error is almost inevitable when using the Koopman operator. To address this issue, an approach for handling modeling errors in the identification of nonlinear systems via the Koopman operator is proposed. The nonlinear system is represented as a linear parameter-varying (LPV) system with polytopic uncertainty whose vertices of the polytope are obtained through a data-driven approach. A robust model predictive controller is designed for the original nonlinear system based on the identified Koopman operator-polytopic LPV system under the worst-case scenarios. The control design is formulated as an optimization problem subject to the specific linear matrix inequality (LMI) constraints, ensuring recursive feasibility and closed-loop stability. The effectiveness of the proposed modeling and control approach is demonstrated through numerical simulations.
|
|
14:30-14:50, Paper MoB20.4 | |
>Generalized Dynamic Observer Design for the Nonlinear Parameter Varying Systems: Exploiting Matrix-Multiplier-Based LMIs |
|
Mohite, Shivaraj | Research Assitant, RPTU, Kaiserslautern, Germany |
Alma, Marouane | CRAN Lorraine University |
Bajcinca, Naim | University of Kaiserslautern |
Keywords: Linear parameter-varying systems, Observers for nonlinear systems, LMIs
Abstract: This paper deals with the design of a generalized dynamic observer for a class of Nonlinear Parameter-Varying (NLPV) systems. The primary goal is to formulate a less conservative Linear Matrix Inequality (LMI) condition than the existing ones, ensuring the exponential stability of the error dynamics in the proposed observer. Through the incorporation of the reformulated Lipschitz property, Young’s inequalities and a generalized matrix multiplier, two new LMI conditions are established. Due to the deliberate use of these mathematical tools, the obtained LMI conditions contain a larger number of decision variables than existing LMI conditions. As a result, these LMIs have enhanced feasibility than the one presented in the literature. The effectiveness of the newly designed LMI-based generalized dynamic observer is highlighted through a numerical example.
|
|
14:50-15:10, Paper MoB20.5 | |
>Robust Path Following Control of Autonomous Underwater Vehicles Via Gain Scheduling and Integral Quadratic Constraints |
|
Sinha, Sourav | Virginia Tech |
Farhood, Mazen | Virginia Tech |
Stilwell, Daniel J. | Virginia Tech |
Keywords: Robust control, Uncertain systems, Linear parameter-varying systems
Abstract: This paper addresses the design and analysis of linear parameter-varying (LPV) path-following controllers for an autonomous underwater vehicle (AUV) operating in environments affected by ocean currents and subject to measurement noise. Leveraging a recently developed robustness analysis framework based on integral quadratic constraints (IQCs), the work focuses on constructing LPV controllers capable of effectively steering the AUV along planar paths with bounded curvature. The approach involves approximating the path-following dynamics of the AUV as a linear fractional transformation (LFT) on uncertainties, followed by a comprehensive IQC-based robustness analysis of the uncertain LFT system. This approach not only facilitates comparing the performances of various LPV controllers but also guides the control design process. The robust performance level estimates derived through IQC analysis are validated through nonlinear simulations. Subsequently, the most effective LPV controller is deployed on the AUV for underwater testing.
|
|
15:10-15:30, Paper MoB20.6 | |
>A Robust and Regularized Algorithm for Recursive Total Least Squares Estimation |
|
Koide, Hugo | University of Poitiers |
Vayssettes, Jérémy | Michelin |
Mercère, Guillaume | University of Poitiers |
Keywords: Estimation, Adaptive control, Linear parameter-varying systems
Abstract: In this work, a novel recursive total least squares (RTLS) algorithm that is grounded in a constrained Lagrange optimization of the errors-in-variables model is presented. The proposed RTLS method and its regularized counterpart are shown to be computationally efficient and produce robust estimates in the face of heavily unfavorable noise conditions, sub-optimal parametric initializations, and ill-conditioned input-output data. A Monte Carlo simulation study empirically demonstrates the improved stability and convergence properties of the proposed algorithms compared to the well-known recursive least squares algorithm, and a benchmark RTLS algorithm which is based on the minimization of the constrained generalized Rayleigh quotient. Furthermore, the applicability of the proposed method is validated with an experimental case study for online vehicle gear ratio estimation, highlighting its relevance in industrial settings.
|
|
MoC01 |
Auditorium |
Contraction Theory in Systems and Control I |
Invited Session |
Chair: Giaccagli, Mattia | Université De Lorraine |
Co-Chair: Russo, Giovanni | University of Salerno |
Organizer: Giaccagli, Mattia | Cnrs - Ul |
Organizer: Russo, Giovanni | University of Salerno |
Organizer: Astolfi, Daniele | Cnrs - Lagepp |
Organizer: Bullo, Francesco | Univ of California at Santa Barbara |
|
16:00-16:20, Paper MoC01.1 | |
>On Weakly Contracting Dynamics for Convex Optimization |
|
Centorrino, Veronica | Scuola Superiore Meridionale |
Davydov, Alexander | University of California, Santa Barbara |
Gokhale, Anand | University of California, Santa Barbara |
Russo, Giovanni | University of Salerno |
Bullo, Francesco | Univ of California at Santa Barbara |
Keywords: Stability of nonlinear systems, Optimization
Abstract: We analyze the convergence behavior of globally weakly and locally strongly contracting dynamics. Such dynamics naturally arise in the context of convex optimization problems with a unique minimizer. We show that convergence to the equilibrium is emph{linear-exponential}, in the sense that the distance between each solution and the equilibrium is upper bounded by a function that first decreases linearly and then exponentially. As we show, the linear-exponential dependency arises naturally in certain dynamics with saturations. Additionally, we provide a sufficient condition for local input-to-state stability. Finally, we illustrate our results on, and propose a conjecture for, continuous-time dynamical systems solving linear programs.
|
|
16:20-16:40, Paper MoC01.2 | |
>Exponential Stability of Parametric Optimization-Based Controllers Via Lur'e Contractivity |
|
Davydov, Alexander | University of California, Santa Barbara |
Bullo, Francesco | Univ of California at Santa Barbara |
Keywords: Stability of nonlinear systems, Lyapunov methods, Optimization
Abstract: In this letter, we investigate sufficient conditions for the exponential stability of LTI systems driven by controllers derived from parametric optimization problems. Our primary focus is on parametric projection controllers, namely parametric programs whose objective function is the squared distance to a nominal controller. Leveraging the virtual system method of analysis and a novel contractivity result for Lur'e systems, we establish a sufficient LMI condition for the exponential stability of an LTI system with a parametric projection-based controller. Separately, we prove additional results for single-integrator systems. Finally, we apply our results to state-dependent saturated control systems and control barrier function-based control and provide numerical simulations.
|
|
16:40-17:00, Paper MoC01.3 | |
>Stability of Nonexpansive Monotone Systems and Application to Recurrent Neural Networks |
|
Deplano, Diego | University of Cagliari |
Franceschelli, Mauro | University of Cagliari |
Giua, Alessandro | University of Cagliari |
Keywords: Stability of nonlinear systems, Autonomous systems, Neural networks
Abstract: This paper shows that trajectories of continuous-time monotone systems (in the sense of Kamke-Muller) converge to equilibrium points if their vector field is continuously differentiable and if they are nonexpansive w.r.t. a diagonally weighted infinity norm. Differently from the current literature trend, the system is not required to be contractive but merely nonexpansive, thus allowing for multiple equilibrium points. A second result is that of providing easy-to-check conditions on the vector field to verify that the system is both monotone and nonexpansive. This is done by showing that nonexpansiveness is implied by subhomogeneity of the system, a generalization of the translation invariance property. We apply the results in the context of RNNs, thus providing sufficient conditions for convergence of the state trajectories of nonexpansive monotone neural networks that are not contractive.
|
|
17:00-17:20, Paper MoC01.4 | |
>A Remark on Omega Limit Sets for Non-Expansive Dynamics (I) |
|
Duvall, Alon | Northeastern University |
Sontag, Eduardo | Northeastern University |
Keywords: Nonlinear systems, Stability of nonlinear systems
Abstract: In this paper, we study systems of time-invariant ordinary differential equations whose flows are non-expansive with respect to a norm, meaning that the distance between solutions may not increase. Since non-expansiveness (and contractivity) are norm-dependent notions, the topology of omega-limit sets of solutions may depend on the norm. For example, and at least for systems defined by real-analytic vector fields, the only possible omega-limit sets of systems that are non-expansive with respect to polyhedral norms (such as lp norms with p =1 or p=infinity are equilibria. In contrast, for non-expansive systems with respect to Euclidean (l^2) norm, other limit sets may arise (such as multi-dimensional tori): for example linear harmonic oscillators are non-expansive (and even isometric) flows, yet have periodic orbits as omega-limit sets. This paper shows that the Euclidean linear case is what can be expected in general: for flows that are contractive with respect to any strictly convex norm such as lp for any p distinct from 1 and finite), and if there is at least one bounded solution, then the omega-limit set of every trajectory is also an omega limit set of a linear time-invariant system.
|
|
17:20-17:40, Paper MoC01.5 | |
>Construction of a Contraction Metric for Time-Periodic Systems Using Meshless Collocation |
|
Giesl, Peter | University of Sussex |
Hafstein, Sigurdur | University of Iceland |
Keywords: Stability of nonlinear systems, Computational methods, Time-varying systems
Abstract: The existence and stability of a periodic orbit for time-periodic systems as well as its basin of attraction can be determined using a contraction metric. In this paper, we will present a numerical construction method based on meshless collocation with radial basis functions. We will first show the existence of a contraction metric satisfying a PDE and then use meshless collocation to approximately solve this PDE, which results in a contraction metric, if the approximation is sufficiently fine.
|
|
17:40-18:00, Paper MoC01.6 | |
>Singularly Perturbed k-Contractive Linear Systems |
|
Lorenzetti, Pietro | Universite' De Lorraine |
Giaccagli, Mattia | Université De Lorraine |
Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Keywords: Stability of linear systems, LMIs, Linear systems
Abstract: A dynamical system is said to be k-contractive when its trajectories contract k-dimensional volumes. For k=1, this property coincides with the classical notion of contraction. However, for k>1, it allows to characterize a much richer asymptotic behavior. The property of k-contraction has been introduced only recently, thus many analysis tools that are key in relevant applications are currently lacking for k-contractive systems. Motivated by this, we study k-contraction for singularly perturbed systems, which naturally arise in many engineering applications. In particular, we focus on singularly perturbed linear time-invariant (LTI) systems. First we show that, for a ``sufficiently large'' time-scale separation, the k-contraction properties of a singularly perturbed system can be derived from those of the associated boundary-layer (fast) system, and from the dimension of the reduced order (slow) model. Then, we focus on the case in which the reduced order (slow) model is k-contractive and the boundary-layer (fast) system is 1-contractive. In this setting, we provide a stronger result by showing that the overall system is k-contractive when the time-scale separation is ``large''.
|
|
MoC02 |
Amber 1 |
Machine Learning: Physics-Informed and Large Language Models for Control |
Invited Session |
Chair: Amo Alonso, Carmen | ETH |
Co-Chair: Beckers, Thomas | Vanderbilt University |
Organizer: Amo Alonso, Carmen | ETH |
Organizer: Pappas, George J. | University of Pennsylvania |
Organizer: Tabuada, Paulo | University of California at Los Angeles |
Organizer: Beckers, Thomas | Vanderbilt University |
Organizer: Drgona, Jan | Pacific Northwest National Laboratory |
Organizer: Hirche, Sandra | Technische Universität München |
Organizer: Findeisen, Rolf | TU Darmstadt |
|
16:00-16:20, Paper MoC02.1 | |
>Heat Death of Generative Models in Closed-Loop Learning (I) |
|
Marchi, Matteo | University of California, Los Angeles |
Soatto, Stefano | University of California, Los Angeles |
Chaudhari, Pratik | University of California, Los Angeles |
Tabuada, Paulo | University of California at Los Angeles |
Keywords: Machine learning, Stochastic systems, Neural networks
Abstract: Improvement and adoption of generative machine learning models is rapidly accelerating, as exemplified by the popularity of LLMs (Large Language Models) for text, and diffusion models for image generation. As generative models become widespread, data they generate is incorporated into shared content through the public web. This opens the question of what happens when data generated by a model is fed back to the model in subsequent training campaigns. This is a question about the stability of the training process, whether the distribution of publicly accessible content, which we refer to as "knowledge", remains stable or collapses. Small scale empirical experiments reported in the literature show that this closed-loop training process is prone to degenerating. Models may start producing gibberish data, or sample from only a small subset of the desired data distribution (a phenomenon referred to as mode collapse). So far there has been only limited theoretical understanding of this process, in part due to the complexity of the deep networks underlying these generative models. The aim of this paper is to provide insights into this process (that we refer to as "generative closed-loop learning") by studying the learning dynamics of generative models that are fed back their own produced content in addition to their original training dataset. The sampling of many of these models can be controlled via a "temperature" parameter. Using dynamical systems tools, we show that, unless a sufficient amount of external data is introduced at each iteration, any non-trivial temperature leads the model to asymptotically degenerate. In fact, either the generative distribution collapses to a small set of outputs, or becomes uniform over a large set of outputs.
|
|
16:20-16:40, Paper MoC02.2 | |
>Prompt a Robot to Walk with Large Language Models (I) |
|
Wang, Yen-Jen | University of California, Berkeley |
Zhang, Bike | University of California, Berkeley |
Chen, Jianyu | Tsinghua University |
Sreenath, Koushil | University of California, Berkeley |
Keywords: Robotics, Control applications
Abstract: Large language models (LLMs) pre-trained on vast internet-scale data have showcased remarkable capabilities across diverse domains. Recently, there has been escalating interest in deploying LLMs for robotics, aiming to harness the power of foundation models in real-world settings. However, this approach faces significant challenges, particularly in grounding these models in the physical world and in generating dynamic robot motions. To address these issues, we introduce a novel paradigm in which we use few-shot prompts collected from the physical environment, enabling the LLM to autoregressively predict low-level control actions for robots without task-specific fine-tuning. We utilize LLMs as a controller, diverging from the conventional approach of employing them primarily as planners. Simulation experiments across various robots and environments validate that our method can effectively prompt a robot to walk. We thus illustrate how LLMs can function as low-level feedback controllers for dynamic motion control, even in high-dimensional robotic systems. The project website and source code can be found at: prompt2walk.github.io.
|
|
16:40-17:00, Paper MoC02.3 | |
>REAL: Resilience and Adaptation Using Large Language Models on Autonomous Aerial Robots (I) |
|
Tagliabue, Andrea | MIT |
Kondo, Kota | Massachusetts Institute of Technology |
Zhao, Tong | Massachusetts Institute of Technology |
Peterson, Mason Burgon | Massachusetts Institute of Technology |
Tewari, Claudius Taroon | Massachusetts Institute of Technology |
How, Jonathan P. | MIT |
Keywords: Learning, Adaptive control, Aerospace
Abstract: Large Language Models (LLMs) pre-trained on internet-scale datasets have shown impressive capabilities in code understanding, synthesis and in processing extended sequences of symbols, often presented in natural language. This work aims to explore new opportunities in long-term reasoning, natural language comprehension, and the available prior knowledge of LLMs for increased resilience and adaptation in autonomous mobile robots. We introduce REAL, an approach for REsilience and Adaptation using LLMs. REAL interfaces LLMs with the mission planning and control framework of an autonomous robot. The LLM employed by REAL provides (i) a source of prior knowledge to increase resilience for challenging scenarios that the system has not been explicitly designed for; (ii) a way to interpret natural language and other log/diagnostic information available in the autonomy stack, for mission planning; (iii) a way to adapt the control inputs using minimal user-provided prior knowledge about the robot. We integrate REAL in the autonomy stack of a real multirotor, querying onboard an offboard LLM at about 1.0-0.1 Hz as part of the robot's mission planning and control feedback loops. We provide a demonstration of capabilities by showcasing in real-world experiments the ability of the LLM to reduce the position tracking errors of a multirotor, and decision-making to avoid potentially dangerous scenarios (e.g., robot oscillates) that are not explicitly accounted for in the initial prompt design.
|
|
17:00-17:20, Paper MoC02.4 | |
>CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization (I) |
|
Tang, Wei-Ting | The Ohio State University |
Paulson, Joel | The Ohio State University |
Keywords: Machine learning, Optimization algorithms, Reinforcement learning
Abstract: Bayesian optimization (BO) is a popular approach for optimizing expensive-to-evaluate black-box objective functions. An important challenge in BO is its application to high-dimensional search spaces due in large part to the curse of dimensionality. One way to overcome this challenge is to focus on local BO methods that aim to efficiently learn gradients, which have shown strong empirical performance on high-dimensional problems including policy search in reinforcement learning (RL). Current local BO methods assume access to only a single high-fidelity information source whereas, in many problems, one has access to multiple cheaper approximations of the objective. We propose a novel algorithm, Cost-Aware Gradient Entropy Search (CAGES), for local BO of multi-fidelity black-box functions. CAGES makes no assumption about the relationship between different information sources, making it more flexible than other multi-fidelity methods. It also employs a new information-theoretic acquisition function, which enables systematic identification of samples that maximize the information gain about the unknown gradient per evaluation cost. We demonstrate CAGES can achieve significant performance improvements compared to other state-of-the-art methods on synthetic and benchmark RL problems.
|
|
17:20-17:40, Paper MoC02.5 | |
>Metric Learning to Accelerate Convergence of Operator Splitting Methods (I) |
|
King, Ethan | Pacific Northwest National Laboratory |
Kotary, James | University of Virginia |
Fioretto, Ferdinando | University of Virginia |
Drgona, Jan | Pacific Northwest National Laboratory |
Keywords: Optimization algorithms, Machine learning, Computational methods
Abstract: Recent developments in machine learning have led to promising advances in accelerating the solution of constrained optimization problems. Increasing demand for real-time decision-making capabilities in applications such as artificial intelligence and optimal control has led to a variety of proposed strategies. This work proposes a new approach, in which the underlying metric spaces of proximal operator splitting algorithms are learned to maximize their convergence rate. While prior works in optimization theory have derived optimal metrics for some cases, for many practical problem forms including general Quadratic Programming (QP) we are not aware of any such results. This paper shows how differentiable optimization can enable the end-to-end learning of proximal metrics, enhancing the convergence of proximal algorithms for QP problems beyond what is possible based on known theory. Additionally, the results illustrate a strong connection between the learned proximal metrics and active constraints at the optima, leading to an interpretation in which the learning of proximal metrics can be viewed as a form of active set learning.
|
|
17:40-18:00, Paper MoC02.6 | |
>Physics-Constrained Meta-Learning for Online Adaptation and Estimation in Positioning Applications (I) |
|
Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Deshpande, Vedang M. | Mitsubishi Electric Research Laboratories |
Wichern, Gordon | Mitsubishi Electric Research Laboratories |
Berntorp, Karl | Mitsubishi Electric Research Labs |
Keywords: Machine learning, Nonlinear systems identification, Kalman filtering
Abstract: Deep neural state-space models (NSSMs) based on autoencoders are a powerful tool for system identification. Recently, meta-learning approaches have been proposed to efficiently adapt NSSMs to specific dynamical systems within a family of systems with similar dynamics. Sophisticated automatic differentiation tools can enable the use of meta-learned NSSMs as predictive models for online state estimation. This is particularly useful if the underlying target system has uncertain parameters or unmodeled dynamics. Such is often the case in magnetic-field positioning applications: wherein a magnetometer can exhibit uncertain motion dynamics, with measurements obtained from within an unknown magnetic vector field. In this paper, we propose a meta-learning framework where `physics-constrained' NSSMs are trained using a dataset of varied motion dynamics and magnetic vector fields, where the physics-informed constraints are embedded into the network for learning a curl-free magnetic field. Online, such a meta-learned NSSM can rapidly adapt to a target motion model and magnetic field in a few-shot manner without explicitly estimating the parameters of the motion dynamics or magnetic field, and subsequently can inform an extended Kalman filter for state estimation. We demonstrate in simulation that the proposed approach yields accurate position estimates in spite of the unknown magnetic vector field.
|
|
MoC03 |
Amber 2 |
Multi-Agent Systems: Awareness, Learning, and Formal Methods III |
Invited Session |
Chair: Soudjani, Sadegh | Newcastle University |
Co-Chair: Vlahakis, Eleftherios | KTH Royal Institute of Technology |
Organizer: van Huijgevoort, Birgit | Eindhoven University of Technology |
Organizer: Ghosh, Arabinda | Max Planck Institute for Software Systems |
Organizer: Vlahakis, Eleftherios | KTH Royal Institute of Technology |
Organizer: Haesaert, Sofie | Eindhoven University of Technology |
Organizer: Soudjani, Sadegh | Max Planck Institute for Software Systems |
Organizer: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
|
16:00-16:20, Paper MoC03.1 | |
>Compositional Verification for Large-Scale Systems Via Closure Certificates |
|
Galarza-Jimenez, Felipe | University of Colorado, Boulder |
Murali, Vishnu | University of Colorado, Boulder |
Zamani, Majid | University of Colorado Boulder |
Keywords: Large-scale systems, Hybrid systems, Automata
Abstract: Closure certificates (CCs), function analogs of transition invariants, provide a framework to verify discrete-time dynamical systems against omega-regular specifications. Such certificates are similar to barrier certificates (BCs) yet are less conservative than BCs when leveraged to verify omega-regular properties. However, CCs are defined over pairs of states of the system rather than over the state of the system, and seek to overapproximate the transitive closure of the transition relation. Thus, finding these certificates is often harder and computationally more demanding than BCs, especially for large-scale systems. To address this challenge, we propose a dissipativity-inspired approach to construct closure certificates for interconnected systems. In such a setting, we assume our large-scale system to be an interconnection of subsystems under a linear map. We then find local certificates for these subsystems. These local certificates are then composed to form a closure certificate for the interconnected system, acting as proof of the satisfaction of a desired omega-regular specification. Finally, we illustrate our approach with a numerical simulation.
|
|
16:20-16:40, Paper MoC03.2 | |
>Capability Augmentation for Heterogeneous Dynamic Teaming with Temporal Logic Tasks (I) |
|
Berlind, Carter | Boston University |
Liu, Wenliang | Boston University |
Pierson, Alyssa | MIT |
Belta, Calin | Boston University |
Keywords: Robotics, Autonomous robots, Cooperative control
Abstract: Abstract— This paper considers how heterogeneous multi- agent teams can leverage their different capabilities to mutually improve individual agent performance. We present Capability- Augmenting Tasks (CAT), which encode how agents can augment their capabilities based on interactions with other teammates. Our framework integrates CAT into the semantics of a Metric Temporal Logic (MTL) formulation, which defines individual spatiotemporal tasks for all agents, and a centralized Mixed-Integer Program (MIP) synthesizes trajectories for all agents. Simulations demonstrate our approach improves individual performance when agents leverage the capabilities of their teammates, and case studies illustrate the expressiveness of our formulation.
|
|
16:40-17:00, Paper MoC03.3 | |
>An STL Formulation for Intent-Expressive Motion Planning and Intent Estimation with Output Feedback |
|
Gah, Elikplim | Northeastern University |
Yong, Sze Zheng | Northeastern University |
Keywords: Autonomous systems, Estimation, Model Validation
Abstract: This letter presents a tractable signal temporal logic (STL) approach for designing set-based intent-expressive trajectory planning and intent estimation algorithms with (noisy) output feedback for multi-agent teams. These algorithms allow an observed agent to implicitly convey intent information to observer agents while guaranteeing that the agent robustly satisfies state and input constraints, avoids obstacles and achieves its intended STL task specification under worst-case realizations of uncertainties. Specifically, the intent-expressive trajectory planning algorithm encodes intent information by ensuring that the output reachable sets (i.e., all possible measured outputs by the observer agents) for satisfying the intended STL task specifications are disjoint from each other, while the intent estimation algorithm enables the observer agents to decode the intent by eliminating all intent models that are incompatible with noisy run-time observations.
|
|
17:00-17:20, Paper MoC03.4 | |
>Probabilistic Tube-Based Control Synthesis of Stochastic Multi-Agent Systems under Signal Temporal Logic (I) |
|
Vlahakis, Eleftherios | KTH Royal Institute of Technology |
Lindemann, Lars | University of Southern California |
Sopasakis, Pantelis | Queen's University Belfast |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Agents-based systems, Stochastic systems, Formal Verification/Synthesis
Abstract: We consider the control design of stochastic discrete-time linear multi-agent systems (MASs) under a global signal temporal logic (STL) specification to be satisfied at a predefined probability. By decomposing the dynamics into deterministic and error components, we construct a probabilistic reachable tube (PRT) as the Cartesian product of reachable sets of the individual error systems driven by disturbances lying in confidence regions (CRs) with a fixed probability. By bounding the PRT probability with the specification probability, we tighten all state constraints induced by the STL specification by solving tractable optimization problems over segments of the PRT, and convert the underlying stochastic problem into a deterministic one. This approach reduces conservatism compared to tightening guided by the STL structure. Additionally, we propose a recursively feasible algorithm to attack the resulting problem by decomposing it into agent-level subproblems, which are solved iteratively according to a scheduling policy. We demonstrate our method on a ten-agent system, where existing approaches are impractical.
|
|
17:20-17:40, Paper MoC03.5 | |
>Distributionally Robust Control for Chance-Constrained Signal Temporal Logic Specifications (I) |
|
Bahari Kordabad, Arash | Max Planck Institute for Software Systems |
Vlahakis, Eleftherios | KTH Royal Institute of Technology |
Lindemann, Lars | University of Southern California |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Soudjani, Sadegh | Newcastle University |
Keywords: Sampled-data control, Stochastic optimal control, Optimization
Abstract: We consider distributionally robust optimal control of stochastic linear systems under signal temporal logic (STL) chance constraints when the disturbance distribution is unknown. By assuming that the underlying predicate functions are Lipschitz continuous and the noise realizations are drawn from a distribution having a concentration of measure property, we first formulate the underlying chance-constrained control problem as stochastic programming with constraints on expectations and propose a solution using a distributionally robust approach based on the Wasserstein metric. We show that by choosing a proper Wasserstein radius, the original chance-constrained optimization can be satisfied with a user-defined confidence level. A numerical example illustrates the efficacy of the method.
|
|
17:40-18:00, Paper MoC03.6 | |
>Reactive Planning for Teams of Heterogeneous Robots with Dynamic Collaborative Temporal Logic Missions |
|
Zhang, Yuqing | Washington University in St. Louis |
Kalluraya, Samarth | Washington University in St. Louis |
Pappas, George J. | University of Pennsylvania |
Kantaros, Yiannis | Washington University in St. Louis |
Keywords: Autonomous robots, Formal Verification/Synthesis, Automata
Abstract: Several task and motion planning algorithms have been proposed recently for teams of robots assigned to collaborative high-level tasks specified using Linear Temporal Logic (LTL). However, the majority of prior works cannot effectively adapt to new missions that may arise in the field due to unexpected service requests. To address this novel challenge, we propose a reactive planning algorithm for teams of heterogeneous robots with collaborative LTL-encoded missions that dynamically change. The robots are heterogeneous in terms of their skills while the mission requires them to apply these skills in specific regions/objects in a temporal/logical order. Our method designs paths that can adapt to unexpected changes in the mission and effectively address potential mission violations arising due to conflicting logical task requirements or a limited number of robots. We achieve this by locally allocating new sub-tasks to the robots based on their capabilities, minimizing disruptions to the existing team plan, and strategically prioritizing the most crucial sub-tasks according to user-specified priorities. We provide theoretical guarantees and numerical experiments to demonstrate the efficiency of our method.
|
|
MoC04 |
Amber 3 |
Sampled-Data Control |
Regular Session |
Chair: Daafouz, Jamal | Université De Lorraine, CRAN, CNRS |
Co-Chair: Mirkin, Leonid | Technion - IIT |
|
16:00-16:20, Paper MoC04.1 | |
>Computing the L1-Induced Norm of Sampled-Data Systems |
|
Kim, Junghoon | Pohang University of Science and Technology |
Kwak, Dohyeok | POSTECH |
Kim, Jung Hoon | Pohang Univeristy of Science and Technology |
Hagiwara, Tomomichi | Kyoto Univ |
Keywords: Sampled-data control, Computational methods
Abstract: This paper is concerned with developing a method for computing the L1-induced norm of sampled-data systems. We first derive an operator-based form of the L1-induced norm in the lifted representation of sampled-data systems. The corresponding operators are further considered on the top of the fast-lifted treatment, in which the sampling interval [0,h) is divided into M subintervals with an equal width. This treatment allows us to develop a piecewise constant approximation of the input and output signals of sampled-data systems, by which an upper bound and a lower bound on the L1-induced norm can be obtained.The gap between these bounds is shown to converge to 0 at the rate of 1/M with the fast-lifting parameter M.
|
|
16:20-16:40, Paper MoC04.2 | |
>Saturating Sampled-Data Control Design for Lur'e Systems |
|
Gomes da Silva Jr, Joao Manoel | Universidade Federal Do Rio Grande Do Sul |
Tarbouriech, Sophie | LAAS-CNRS |
Keywords: Sampled-data control, Constrained control, Hybrid systems
Abstract: This paper addresses the problem of designing saturating sampled-data control laws for Lur’e type systems. The closed-loop system is modeled as an hybrid system to formally take into account the aperiodic sampling phenomenon and a control law that depends on the time elapsed between two consecutive sampling instants. This control law considers a nonlinear feedback of the sampled state and the value of the previous value of the control signal. Under this setup and the use of timer-dependent quadratic Lyapunov functions, regional stabilizing conditions are proposed as timer-dependent linear matrix inequalities (LMIs). Considering an affine timer dependency, an optimization problem based on semi-definite programming is proposed to compute the control law gains aiming at enlarging an estimate of the region of attraction of the closed-loop system. A numerical example illustrates the application of the proposed results.
|
|
16:40-17:00, Paper MoC04.3 | |
>Asynchronous Sampled-Data Synchronization with Small Communications Delays |
|
Barkai, Gal | Technion—Israel Institute of Technology |
Mirkin, Leonid | Technion - IIT |
Zelazo, Daniel | Technion - Israel Institute of Technology |
Keywords: Sampled-data control, Networked control systems, Delay systems
Abstract: This study investigates the state synchronization of Linear Time-Invariant (LTI) agents within a networked environment characterized by intermittent and asynchronous communication, alongside heterogeneous time-varying transmission delays. These delays are not assumed to be known a-priori but only time-stamped. A hybrid controller, augmented with a special kind of predictor, is proposed to compensate for the delays and guarantee synchronization. Notably, synchronization is achieved under comparable conditions to the delay-free case, provided that transmission delays are smaller than the corresponding sampling interval. This is independent of the agents’ dynamics and requires no additional knowledge of the underlying communication topology. An algorithm is presented for implementing the required predictor buffer with a size of one, offering a straightforward and scalable implementation.
|
|
17:00-17:20, Paper MoC04.4 | |
>Mixed Regular and Impulsive Sampled-Data LQR |
|
Daafouz, Jamal | Université De Lorraine, CRAN, CNRS |
Loheac, Jerome | CNRS, Universite De Lorraine |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Keywords: Sampled-data control, Optimal control, Linear systems
Abstract: We investigate the benefits of combining regular and impulsive inputs for the control of sampled-data linear time-invariant systems. We first observe that adding an impulsive term to a regular, zero-order-hold controller may help enlarging the set of sampling periods under which controllability is preserved by sampling. In this context, we provide a tailored Hautus-like necessary and sufficient condition under which controllability of the mixed regular, impulsive (MRI) sampled-data model is preserved. We then focus on LQR optimal control. After having presented the optimal controllers for the sampled-data LQR control in the MRI setting, we consider the scenario where an impulsive disturbance affects the dynamics and is known ahead of time. The solution to the so-called preview LQR is presented exploiting both regular and impulsive input components. Numerical examples, that include an insulin infusion benchmark, illustrate that leveraging both future disturbance information and MRI controls may lead to significant performance improvements.
|
|
17:20-17:40, Paper MoC04.5 | |
>Quantized Event-Based Sampled-Data Nonlinear Control for PEM Fuel Cell Air Supply |
|
Salucci, Pasquale | Università Degli Studi Dell'Aquila |
Di Ferdinando, Mario | University of L'Aquila |
Di Benedetto, Maria Domenica | University of L'Aquila |
Balluchi, Andrea | DANA ITALIA |
Pepe, Pierdomenico | University of L' Aquila |
Di Gennaro, Stefano | University of L'Aquila |
Keywords: Sampled-data control, Nonlinear systems, Control applications
Abstract: The problem of oxygen starvation avoidance in air-feed proton exchange membrane fuel cells is widely known and studied in the literature because it crucially affects the performances of the overall system. To address this problem, many nonlinear control techniques have been provided which, however, do not take into account the effects on the performances induced by the digital environments commonly used for the practical implementation of control strategies. In this paper, a methodology for the design of quantized sampled-data event-based stabilizers is proposed for a class of time-varying nonlinear systems and applied to the problem of oxygen starvation and maximum net power achievement in Proton Exchange Membrane Fuel Cells (PEMFCs). In particular, by exploiting a Lyapunov-based approach and the stabilization in the sample-and-hold sense theory, it is shown that there exist a suitably fast sampling and an accurate quantization of the input/output channels such that the digital event-triggered implementation of a proposed continuous-time stabilizer ensures the semi-global practical stability property of the related closed-loop system. In the theory developed here, time-varying sampling periods and non-uniform quantization of the input/output channels are allowed. Simulations confirm the effectiveness of the theoretical results.
|
|
17:40-18:00, Paper MoC04.6 | |
>Sampling and Quantization-Aware Control Barrier Functions for Safety-Critical Control of Cyber-Physical Systems |
|
Niu, Luyao | University of Washington |
Ramasubramanian, Bhaskar | Western Washington University |
Clark, Andrew | Washington University in St. Louis |
Poovendran, Radha | University of Washington |
Keywords: Sampled-data control, Quantized systems, Nonlinear systems
Abstract: Safety is critical to a wide range of cyber-physical systems (CPS). Safety violations may damage CPS and cause harm to humans that co-exist in the operating environment. However, it is nontrivial to guarantee safety of complex CPS whose computation and control workload are shifted to the cloud. The reason is that the system states which evolve continuously are sampled periodically and quantized before being sent to the controller to compute control inputs. Moreover, the controller may operate with finite precision, making the coefficients involved in computation different from those of the actual system. Consequently, the synthesized control inputs to the system may lead to safety violations. In this paper, we study the co-design of quantizer and control inputs for such CPS. We construct a control barrier function (CBF) constraint for the digital controller and analyze how it differs from the CBF constraint formulated using the actual system states and dynamics. We observe that this difference is dependent on the sampling error, quantization error, and error induced by finite precision of the controller. We derive upper bounds of these errors and use the bounds to design a state quantizer. We show that the problem of designing a quantizer can be converted to a facility location problem. We prove the submodularity of the quantizer design problem, and leverage the submodularity property to develop an efficient greedy algorithm to construct the quantizer. Given the quantized states calculated by the quantizer, we modify the CBF constraint used by the controller to synthesize control inputs for the system at each sampling interval. We show that the synthesized inputs guarantee the system safety. We demonstrate the proposed approach using a numerical case study on a batch reactor system.
|
|
MoC05 |
Amber 4 |
Numerical Methods for Mixed-Integer and Nonsmooth Optimal Control II |
Invited Session |
Chair: Nurkanovic, Armin | University of Freiburg |
Co-Chair: Diehl, Moritz | University of Freiburg |
Organizer: Nurkanovic, Armin | University of Freiburg |
Organizer: Kronqvist, Jan | KTH Royal Insitute of Technology |
Organizer: Acary, Vincent | INRIA Centre De Recherche De L'université De Grenoble Alpes |
Organizer: Diehl, Moritz | University of Freiburg |
|
16:00-16:20, Paper MoC05.1 | |
>Scaling Mixed-Integer Programming for Certification of Neural Network Controllers Using Bounds Tightening (I) |
|
Sosnin, Philip | Imperial College London |
Tsay, Calvin | Imperial College London |
Keywords: Machine learning, Optimization, Predictive control for linear systems
Abstract: Neural networks offer a computationally efficient approximation of model predictive control, but they lack guarantees on the resulting controlled system’s properties. Formal certification of neural networks is crucial for ensuring safety, particularly in safety-critical domains such as autonomous vehicles. One approach to formally certify properties of neural networks is to solve a mixed-integer program based on the network. This approach suffers from scalability issues due to the complexity of solving the resulting mixed-integer programs. Nevertheless, these issues can be (partially) mitigated via bound-tightening techniques prior to forming the mixed-integer program, which results in tighter formulations and faster optimization. This paper presents bound-tightening techniques in the context of neural network explicit control policies. Bound tightening is particularly important when considering problems spanning multiple time steps of a controlled system, as the bounds must be propagated through the problem depth. Several strategies for bound tightening are evaluated in of both computational complexity and tightness of the bounds.
|
|
16:20-16:40, Paper MoC05.2 | |
>Learning Hierarchical Control Systems for Autonomous Systems with Energy Constraints (I) |
|
Vallon, Charlott | University of California, Berkeley |
Pustilnik, Mark | UC Berkeley |
Pinto, Alessandro | NASA Jet Propulsion Laboratory |
Borrelli, Francesco | Unversity of California at Berkeley |
Keywords: Hierarchical control, Control system architecture, Predictive control for nonlinear systems
Abstract: This paper focuses on the design of hierarchical control architectures for autonomous systems with energy constraints. We focus on systems where energy storage limitations and slow recharge rates drastically affect the way the autonomous systems are operated. Using examples from space robotics and public transportation, we motivate the need for formally designed learning hierarchical control systems. We propose a learning control architecture which incorporates learning mechanisms at various levels of the control hierarchy to improve performance and resource utilization. The proposed hierarchical control scheme relies on high-level energy-aware task planning and assignment, complemented by a low-level predictive control mechanism responsible for the autonomous execution of tasks, including motion control and energy management. Simulation examples show the benefits and the limitations of the proposed architecture when learning is used to obtain a more energy-efficient task allocation.
|
|
16:40-17:00, Paper MoC05.3 | |
>On Methods for Improved Efficiency of Optimal Task and Motion Planning (I) |
|
Hellander, Anja | Linköping University |
Axehill, Daniel | Linköping University |
Keywords: Autonomous vehicles, Optimal control
Abstract: Optimal task and motion planning (TAMP) has seen an increase in interest in recent years. An important performance bottleneck when solving such problems is that solving motion-planning problems for nonholonomic systems to (resolution) optimality is relatively costly, and when this has to be done a potentially large number of times, in the form of a subroutine, time quickly adds up. In this work, we significantly increase the efficiency of our previously presented optimal TAMP algorithm for rearrangement problems. The core idea that we introduce in this work is to use intermediary results from the motion planner to infer solutions to other related motion-planning problems that might be of interest to the overall TAMP problem. We also introduce the concept of equivalent states to recognize state-action pairs that require the solution of the same motion-planning problem in order to compute their associated cost. Evaluations on numerical examples considering rearrangement TAMP problems involving tractor-trailers show that the proposed strategies can significantly reduce the total computation time of the TAMP planner, as well as the number of motion-planning problems that are solved, and the number of candidate task plans that are computed.
|
|
17:00-17:20, Paper MoC05.4 | |
>Addressing Discrete Dynamic Optimization Via a Logic-Based Discrete-Steepest Descent Algorithm (I) |
|
Peng, Zedong | Purdue University |
Lee, Albert | Purdue University |
Bernal Neira, David | Purdue University |
Keywords: Differential-algebraic systems, Optimization, Computational methods
Abstract: Dynamic optimization problems involving discrete decisions have several applications, yet lead to challenging optimization problems that must be addressed efficiently. Combining discrete variables with potentially nonlinear constraints stemming from dynamics within an optimization model results in mathematical programs for which off-the-shelf techniques might be insufficient. This work uses a novel approach, the Logic-based Discrete-Steepest Descent Algorithm (LD-SDA), to solve Discrete Dynamic Optimization problems. The problems are formulated using Boolean variables that enforce differential systems of constraints and encode logic constraints that the optimization problem needs to satisfy. By posing the problem as a generalized disjunctive program with dynamic equations within the disjunctions, the LD-SDA takes advantage of the problem's inherent structure to efficiently explore the combinatorial space of the Boolean variables and selectively include relevant differential equations to mitigate the computational complexity inherent in dynamic optimization scenarios. We rigorously evaluate the LD-SDA with benchmark problems from the literature that include dynamic transitioning modes and find it to outperform traditional methods, i.e., mixed-integer nonlinear and generalized disjunctive programming solvers, in terms of efficiency and capability to handle dynamic scenarios. This work presents a systematic method and provides an open-source software implementation to address these discrete dynamic optimization problems by harnessing the information within its logical-differential structure.
|
|
17:20-17:40, Paper MoC05.5 | |
>Optimization-Based Reference Tracking for Two-Dimensional Multiple Source Heating |
|
Scholz, Stephan | University of Applied Sciences Ravensburg-Weingarten |
Lothar, Berger | University of Applied Sciences Ravensburg-Weingarten |
Keywords: Computational methods, Optimal control, Distributed parameter systems
Abstract: In this contribution, we create a model of two-dimensional linear heat conduction with nonlinear boundary conditions as heat transfer and heat radiation. The temperature is steered via multiple actuators on one boundary side, and measurements are taken with multiple sensors on the opposite boundary side. We design with a parametrized input function an optimization-based control strategy to follow a reference trajectory from low to high temperatures. When reaching the desired temperature value, measurements are stabilized at this operating point with a model predictive control technique. Finally, we demonstrate our proposed procedure with a numerical simulation.
|
|
17:40-18:00, Paper MoC05.6 | |
>Constraint Preconditioning and Parameter Selection for a First-Order Primal-Dual Method Applied to Model Predictive Control |
|
Chari, Govind | University of Washington |
Yu, Yue | University of Minnesota |
Acikmese, Behcet | University of Washington |
Keywords: Optimization, Optimization algorithms, Optimal control
Abstract: Many techniques for real-time trajectory optimization and control require the solution of optimization problems at high frequencies. However, ill-conditioning in the optimization problem can significantly reduce the speed of first-order primal-dual optimization algorithms. We introduce a preconditioning technique and step-size heuristic for Proportional-Integral Projected Gradient (PIPG), a first-order primal-dual algorithm. The preconditioning technique, based on the QR factorization, aims to reduce the condition number of the KKT matrix associated with the optimization problem. Our step-size selection heuristic chooses step-sizes to minimize the upper bound on the convergence of the primal-dual gap for the optimization problem. These algorithms are tested on two model predictive control problems and show a solve-time reduction of at least 3.6x.
|
|
MoC06 |
Amber 5 |
Networked Control Systems II |
Regular Session |
Chair: Satheeskumar Varma, Vineeth | CNRS |
Co-Chair: Athalye, Chirayu D. | BITS Pilani, K.K. Birla Goa Campus |
|
16:00-16:20, Paper MoC06.1 | |
>On the Role of Network Structure in Learning to Coordinate with Bounded Rationality |
|
Zhang, Yifei | Florida State University |
Vasconcelos, Marcos M. | Florida State University |
Keywords: Networked control systems, Learning, Game theory
Abstract: Many socioeconomic phenomena, such as technology adoption, collaborative problem-solving, and content engagement, involve a collection of agents coordinating to take a common action, aligning their decisions to maximize their individual goals. We consider a model for networked interactions where agents learn to coordinate their binary actions under a strict bound on their rationality. We first prove that our model is a potential game and that the optimal action profile is always to achieve perfect alignment at one of the two possible actions, regardless of the network structure. Using a stochastic learning algorithm known as Log Linear Learning, where agents have the same finite rationality parameter, we show that the probability of agents successfully agreeing on the correct decision is monotonically increasing in the number of network links. Therefore, more connectivity improves the accuracy of collective decision-making, as predicted by the phenomenon known as ``Wisdom of Crowds.'' Finally, we show that for a fixed number of links, a regular network maximizes the probability of success. We conclude that when using a network of irrational agents, promoting more homogeneous connectivity improves the accuracy of collective decision-making.
|
|
16:20-16:40, Paper MoC06.2 | |
>Linear Output Regulation of Discrete-Time Networked Systems Subject to Stochastic Packet Drops |
|
Giaccagli, Mattia | Université De Lorraine |
Satheeskumar Varma, Vineeth | CNRS |
Astolfi, Daniele | Cnrs - Lagepp |
Keywords: Networked control systems, Linear systems, Switched systems
Abstract: We investigate a scenario in which a given discrete-time controller communicates with a discrete-time plant via a wireless erasure channel in a linear scenario. The controller is designed for the output of the plant to track a periodic reference generated by a finite superimposition of discrete-time linear oscillators while rejecting additional disturbances, i.e., to solve an output regulation problem. Due to stochastic packet drop induced by the network, the closed-loop behavior switches between two different dynamical systems, depending on the probability of transmission. We show that, when packet drops occur on the channel from the controller to the actuator, the exogenous signal implies that the expected regulation error does not go to zero but converges to a ball centered at zero. Differently, when the packet drops are on the output channel, we provide a set of sufficient conditions such that the regulation error asymptotically shrinks to zero in expectation. Results are validated via numerical examples.
|
|
16:40-17:00, Paper MoC06.3 | |
>Model-Based Dynamic Periodic Event-Triggered Control for Nonlinear Networked Control Systems with Transmission Delays |
|
Li, Wangjiang | Beijing Institute of Technology |
Yu, Hao | Beijing Institute of Technology |
Dhullipalla, Mani Hemanth | University of Alberta |
Shi, Dawei | Beijing Institute of Technology |
Keywords: Networked control systems, Nonlinear systems, Hybrid systems
Abstract: This paper considers dynamic periodic event-triggered control for nonlinear networked control systems. A model-based periodic event-triggering mechanism is proposed to potentially reduce the consumption of transmission resources for the networked control systems that are subject to time-varying inter-sampling intervals, transmission delays, and scheduling protocols. Furthermore, to compensate for the adverse effects of delays, the controller node is equipped with two specialized units: a propagation unit and a model unit. The role of propagation units is to work with the delayed data, using the data to estimate its current value and subsequently updating the state within the model unit. A predictor unit is employed at the sensor node to configure event-triggering conditions. A hybrid system framework is used to model the networked control systems. Moreover, sufficient conditions on the transmission intervals, delays and dynamic event-triggered control are given to ensure closed-loop asymptotic stability. Additionally, an allocation framework of the event-triggering mechanism is proposed, taking into account the information available at the sensor node. Finally, an example of a single-link robot arm is simulated to illustrate the effectiveness and feasibility of the theoretical results.
|
|
17:00-17:20, Paper MoC06.4 | |
>Global Event-Triggered Regulation of Unicycle Dynamics by Bounded Control |
|
Efimov, Denis | Inria |
Khalili, Mata | Nokia Bell Labs |
Liu, Shiyu | Nokia Bell Labs |
Keywords: Networked control systems, Nonlinear systems, Robotics
Abstract: The problem of stabilization of the position of a mobile robot using its cinematic (unicycle type) model is considered. The suggested control is discontinuous and bounded; moreover, it guarantees a global solution to the posed problem. The properties of the proposed control law are analyzed by applying the Lyapunov function method. An event-triggering realization of the control algorithm is presented, and its performance and tuning are evaluated through simulations.
|
|
17:20-17:40, Paper MoC06.5 | |
>Structural Control of Drift-Free Bilinear Systems under Link Failures and Sparsity Constraints |
|
Dilip, Sanand | IIT Kharagpur |
Athalye, Chirayu D. | BITS Pilani, K K Birla Goa Campus |
Keywords: Networked control systems, Optimization, Nonlinear systems
Abstract: We study structural controllability properties of drift-free bilinear systems under link failures using a graph theoretic approach. We give an equivalent condition for structural controllability in the presence of link failures, which can be checked in polynomial time. We show that the problem of finding a sparsest structure for structural controllability under link failures is equivalent to a known NP-hard problem. We also consider the case of probabilistic link failures. The problem of deciding whether the underlying system is structurally controllable with a certain probability is equivalent to an NP-hard network reliability problem. Finally, we observe that the problem of designing a structurally controllable bilinear system from an uncontrollable one under nonuniform cost constraints is equivalent to an NP-complete strong connectivity augmentation problem.
|
|
17:40-18:00, Paper MoC06.6 | |
>Design of Denial-Of-Service Attack Strategy with Energy Constraint: An Approximate Dynamic Programming Approach |
|
Xie, Kaiyun | University of Science and Technology of China |
Shu, Zhan | University of Alberta |
Xiong, Junlin | University of Science and Technology of China |
Keywords: Networked control systems, Optimization algorithms, Markov processes
Abstract: This paper aims to design a denial-of-service attack strategy that maximizes the LQG cost over a finite time. During each attack instance, the attacker chooses from multiple energy levels and expends attack energy to initiate interference. With the limitation of total attack energy, a dynamic programming algorithm is used to search for the optimal attack strategy. However, due to the curse of dimensionality, the algorithm exhibits high computational complexity. To address the issue, an approximate dynamic programming algorithm is presented to design approximately optimal strategies by leveraging the monotonicity of optimal value functions. The new algorithm is capable of obtaining a high-quality attack strategy within a relatively small number of iterations. Finally, a numerical example is provided to demonstrate the superiority of the developed algorithm.
|
|
MoC07 |
Amber 6 |
Game Theory III |
Regular Session |
Chair: Mylvaganam, Thulasi | Imperial College London |
Co-Chair: Brown, Philip N. | University of Colorado Colorado Springs |
|
16:00-16:20, Paper MoC07.1 | |
>Feedback Nash Equilibrium Solutions of Two-Player LQ Differential Games: Synthesis and Analysis Via a State/Costate Interpretation |
|
Scarpa, Maria Luisa | Imperial College London |
Nortmann, Benita Alessandra Lucia | Imperial College London |
Sassano, Mario | University of Rome, Tor Vergata |
Mylvaganam, Thulasi | Imperial College London |
Keywords: Game theory, Optimal control, Linear systems
Abstract: Linear quadratic differential games and their feedback Nash equilibrium (F-NE) solutions are considered. First, it is shown that F-NE strategies can be derived from the restriction to an invariant subspace of a system that is reminiscent of the state/costate dynamics arising in the context of open-loop NE solutions. Second, in terms of synthesis, it is shown that the equilibrium subspace can be rendered externally stable via virtual inputs without modifying the underlying F-NE strategies. Building upon these findings, we propose a gradient descent algorithm to determine a solution of the coupled Algebraic Riccati Equations associated with F-NE, which are generally challenging to solve. Finally, in terms of analysis, we show that the F-NE strategy of each player can be interpreted as the output of a passive Port-Controlled Hamiltonian system, and that the behaviour of the original system under the action of the F-NE strategies can be interpreted as an interconnection of these.
|
|
16:20-16:40, Paper MoC07.2 | |
>A Policy Iteration Algorithm for N-Player General-Sum Linear Quadratic Dynamic Games |
|
Guan, Yuxiang | University of Texas at Dallas |
Salizzoni, Giulio | EPFL |
Kamgarpour, Maryam | EPFL |
Summers, Tyler H. | University of Texas at Dallas |
Keywords: Game theory, Optimal control, Optimization algorithms
Abstract: We present a policy iteration algorithm for the infinite-horizon N-player general-sum deterministic linear quadratic dynamic games and compare it to policy gradient methods. We demonstrate that the proposed policy iteration algorithm is distinct from the Gauss-Newton policy gradient method in the N-player game setting, in contrast to the single-player setting where under suitable choice of step size they are equivalent. We illustrate in numerical experiments that the convergence rate of the proposed policy iteration algorithm significantly surpasses that of the Gauss-Newton policy gradient method and other policy gradient variations. Furthermore, our numerical results indicate that, compared to policy gradient methods, the convergence performance of the proposed policy iteration algorithm is less sensitive to the initial policy and changes in the number of players.
|
|
16:40-17:00, Paper MoC07.3 | |
>Cooperate or Compete: Coalition Formation in Congestion Games |
|
Sultana, Riya | Indian Institute of Technology Bombay |
Veeraruna, Kavitha | IIT Bombay, India |
Keywords: Game theory, Optimization
Abstract: This paper investigates the potential benefits of cooperation in scenarios where finitely many agents compete for shared resources, leading to congestion and thereby reduced rewards. By appropriate coordination the members of the co- operating group (a.k.a., coalition) can minimize the congestion losses due to inmates, while efficiently facing the competition from outsiders (the coalitions indulge in a non-cooperative congestion game). The quest in this paper is to identify the stable partition of coalitions that are not challenged by a new coalition. In contrast to the traditional cooperative games, the worth of a coalition in our game also depends upon the arrangement of the opponents. Every arrangement leads to a partition and a corresponding congestion game; the resultant Nash equilibria (NEs) dictate the ‘worth’. The analysis is further complicated due to the presence of multiple NEs for each such game. The major findings are: a) the grand coalition of all players is stable only in certain scenarios; b) interestingly, in more realistic scenarios, the grand coalition is not stable, but other partitions (mostly the ones with only one non-singleton group) are stable or none of the partitions are stable. When none of the partitions are stable, there is a possibility of cyclic behaviour. Basically the players cycle through different collaborative ar- rangements constantly in pursuit of better outcomes. In essence, this research highlights the stable coalition structures when the resources are congestible and when the coordination efforts carry a price.
|
|
17:00-17:20, Paper MoC07.4 | |
>A Coupled Optimization Framework for Correlated Equilibria in Normal-Form Games |
|
Li, Hui Qing | ETH Zürich |
Yu, Yue | University of Minnesota |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Lygeros, John | ETH Zurich |
Keywords: Game theory, Optimization
Abstract: In competitive multi-player interactions, simultaneous optimality is a key requirement for establishing strategic equilibria. This property is explicit when the game-theoretic equilibrium is the simultaneously optimal solution of coupled optimization problems. However, no such optimization problems exist for the correlated equilibrium, a strategic equilibrium where the players can correlate their actions. We address the lack of a coupled optimization framework for the correlated equilibrium by introducing an unnormalized game---an extension of normal-form games in which the player strategies are lifted to unnormalized measures over the joint actions. We show that the set of fully mixed generalized Nash equilibria of this unnormalized game is a subset of the correlated equilibria of the normal-form game. Furthermore, we introduce an entropy regularization to the unnormalized game and prove that the entropy-regularized generalized Nash equilibrium is a sub-optimal correlated equilibrium where the degree of sub-optimality depends on the magnitude of regularization. We derive a closed form solution for an entropy-regularized generalized Nash equilibrium and verify via simulation its computational complexity.
|
|
17:20-17:40, Paper MoC07.5 | |
>ABRA: An Algorithm Which Cannot Converge to Low-Quality Nash Equilibria |
|
Singh, Vartika | University of Colorado Colorado Springs |
Brown, Philip N. | University of Colorado Colorado Springs |
Keywords: Game theory, Optimization algorithms
Abstract: We consider a game theoretic approach to solve multi-agent coordination problems with submodular objectives. It is known for such problems that the Nash equilibria for the corresponding game are always within 50% of the optimal. A recent work further shows that the equilibria which achieve this worst-case bound are not stable. Leveraging this, we design an Approximate Best Response Algorithm (ABRA) governed by a noise parameter and a rationality parameter. The noise allows ABRA to escape the bad equilibria and the rationality parameter balances any degradation in the objective function caused by the noise. We show for any two-player game that if ABRA converges to a Nash equilibrium, its system objective value is strictly more than 50% of optimal plus a term controlled by the noise parameter. Otherwise, ABRA converges to some recurrent class: if a recurrent class contains any action profile yielding system objective less than 50% of the optimal, the class must also contain either the optimal action profile or an action profile yielding system objective strictly more than 50% of the optimal by the same amount in addition to a factor controlled by noise parameter. The time that ABRA spends in such action profiles can be controlled using the rationality parameter. Using numerical simulations, we show that the minimum expected objective function is typically well above half of the optimal.
|
|
17:40-18:00, Paper MoC07.6 | |
>Solving Monotone Variational Inequalities with Best Response Dynamics |
|
Chen, Yu-Wen | University of California, Berkeley |
Kizilkale, Can | University of California Berkeley, LBL |
Arcak, Murat | University of California, Berkeley |
Keywords: Game theory, Stability of nonlinear systems, Optimization algorithms
Abstract: We leverage best response dynamics to solve monotone variational inequalities on compact and convex sets. Specialization of the method to variational inequalities in game theory recovers convergence results to Nash equilibria when agents select the best response to the current distribution of strategies. We apply the method to generalize population games with additional constraints. Furthermore, we explore the robustness of the method by introducing various types of time-varying disturbances.
|
|
MoC08 |
Amber 7 |
Optimal Control III |
Regular Session |
Chair: Hespanha, Joao P. | Univ. of California, Santa Barbara |
Co-Chair: Diehl, Moritz | University of Freiburg |
|
16:00-16:20, Paper MoC08.1 | |
>Derivation of the Recovery Matrix for Continuous-Time Inverse Optimal Control Problems Using Incomplete Trajectory Observations |
|
Borum, Andy | Vassar College |
Keywords: Optimal control, Optimization, Optimization algorithms
Abstract: This paper considers the problem of continuous-time inverse optimal control—given a trajectory of a control system’s state and the corresponding control input that minimizes an unknown cost function, our goal is to recover the cost being minimized. We consider problems for which the cost function is a linear combination of known basis functions with unknown weights, and we allow for incomplete trajectory observations, i.e., the system state and control input are only observed on a subset of the time interval over which the cost is being minimized. For such problems, we derive a recovery matrix whose kernel contains the unknown cost function weights. We then consider two special cases in which this recovery matrix is straightforward to compute. Finally, we provide an illustrative one-dimensional example that demonstrates how observing the state and control trajectory along different time intervals can affect the recovery of the cost function weights.
|
|
16:20-16:40, Paper MoC08.2 | |
>Markov Chain Monte Carlo for Koopman-Based Optimal Control |
|
Hespanha, Joao P. | Univ. of California, Santa Barbara |
Camsari, Kerem | University of California, Santa Barbara |
Keywords: Optimal control, Optimization algorithms, Randomized algorithms
Abstract: We propose a Markov Chain Monte Carlo (MCMC) algorithm based on Gibbs sampling with parallel tempering to solve nonlinear optimal control problems. The algorithm is applicable to nonlinear systems with dynamics that can be approximately represented by a finite dimensional Koopman model, potentially with high dimension. This algorithm exploits linearity of the Koopman representation to achieve significant computational saving for large lifted states. We use a video-game to illustrate the use of the method.
|
|
16:40-17:00, Paper MoC08.3 | |
>Adaptive Optimal Control of Continuous-Time Nonlinear Systems Via Hybrid Iteration: An Adaptive Dynamic Programming Approach |
|
Qasem, Omar | American International University |
Gao, Weinan | Northeastern University |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Gutierrez, Hector M. | Florida Institute of Technology |
Keywords: Optimal control, Reinforcement learning, Nonlinear systems
Abstract: In this paper, a novel neural-network-based approximation hybrid learning framework, named hybrid iteration (HI), is proposed to solve the adaptive optimal control problem for a class of continuous-time nonlinear affine input systems. Different from the existing adaptive dynamic programming methods such as policy iteration (PI) and value iteration (VI), the HI strategy shows a superior learning performance. Different from the PI method, HI removes the admissibility condition of the initial control policy. Compared to the VI, the HI convergence rate is much faster than that of the VI, since the rate of convergence of HI is quadratic. With that, a large number of learning iterations and CPU-time are saved when using the HI strategy. First, the data-driven HI algorithm is given, wherein the optimal control policy is approximated using the state/input data instead of using the analytical model of the underlying dynamical system. Following that, the proof of convergence of the data-driven HI algorithm is presented. Finally, experimental simulation results are given with technical discussion, which shows the efficacy of the proposed HI method compared with the traditional PI and VI methods.
|
|
17:00-17:20, Paper MoC08.4 | |
>Desensitized Optimal Guidance Using Adaptive Radau Collocation |
|
Winkler, Katrina | University of Florida |
Rao, Anil V. | University of Florida |
Keywords: Optimal control, Optimization algorithms, Robust control
Abstract: An optimal guidance method is developed that reduces sensitivity to parameters in the dynamic model. The method combines a previously developed method for guidance and control using adaptive Legendre-Gauss-Radau (LGR) collocation and a previously developed approach for desensitized optimal control. Guidance updates are performed such that the desensitized optimal control problem is re-solved on the remaining horizon at the start of each guidance cycle. The effectiveness of the method is demonstrated on a simple example using Monte Carlo simulation. The application of the method results in a smaller final state error distribution when compared to desensitized optimal control without guidance as well as a previously developed method for optimal guidance and control.
|
|
17:20-17:40, Paper MoC08.5 | |
>Enhanced Quadratic Programming Via Pseudo-Transient Continuation: An Application to Model Predictive Control |
|
Calogero, Lorenzo | Politecnico Di Torino |
Pagone, Michele | Politecnico Di Torino |
Rizzo, Alessandro | Politecnico Di Torino |
Keywords: Optimal control, Predictive control for nonlinear systems, Numerical algorithms
Abstract: In this letter, we present a novel fast solver for convex quadratic programs (QPs) based on pseudo-transient continuation (PTC). Tailored for real-time applications with strict computational requirements, our solver offers high execution speed and guaranteed global convergence to the optimal solution. PTC is a numerical technique that transforms multivariate nonlinear equations into autonomous systems that converge to the solution sought. In our approach, we recast the general QP Karush-Kuhn-Tucker (KKT) conditions into a system of equations and employ PTC to solve the latter to attain the optimal solution. Importantly, we provide theoretical guarantees demonstrating the global convergence of our PTC-based solver to the optimal solution of any given QP. To showcase the effectiveness of PTC, we employ it within the domain of Model Predictive Control (MPC). Specifically, numerical simulations are carried out on the MPC control of a quadrotor - a demanding dynamical system - highlighting excellent results in accurately executing the control task and ensuring lower computational times compared to conventional QP solvers.
|
|
17:40-18:00, Paper MoC08.6 | |
>MPC4RL - a Sofware Package for Reinforcement Learning Based on Model Predictive Control |
|
Reinhardt, Dirk Peter | Norwegian University of Science and Technology |
Baumgärtner, Katrin | University of Freiburg |
Frey, Jonathan | University of Freiburg |
Diehl, Moritz | University of Freiburg |
Gros, Sebastien | NTNU |
Keywords: Optimal control, Reinforcement learning, Learning
Abstract: In this paper, we present an early software integrating Reinforcement Learning (RL) with Model Predictive Control (MPC). Our aim is to make recent theoretical contributions from the literature more accessible to both the RL and MPC communities. We combine standard software tools developed by the RL community, such as Gymnasium, stable-baselines3, or CleanRL with the acados toolbox, a widely-used software package for efficient MPC algorithms. Our core contribution is MPC4RL, an open-source package that supports learning-enhanced MPC schemes for existing acados implementations. The package is designed to be modular, extensible, and user-friendly, facilitating the tuning of MPC algorithms for a broad range of control problems. It is available on GitHub.
|
|
MoC09 |
Amber 8 |
Predictive Control for Linear Systems III |
Regular Session |
Chair: Findeisen, Rolf | TU Darmstadt |
Co-Chair: Maestre, Jose Maria (Pepe) | University of Seville |
|
16:00-16:20, Paper MoC09.1 | |
>Designing Implicit Invariant Sets for Model Predictive Control for Tracking |
|
Luque, Irene | University of Seville |
Chanfreut, Paula | Eindhoven University of Technology |
Limon, Daniel | Universidad De Sevilla |
Maestre, Jose Maria (Pepe) | University of Seville |
Keywords: Predictive control for linear systems
Abstract: This work introduces a model predictive control (MPC) approach designed for tracking changing setpoints with implicit terminal components. In the presented method, an artificial setpoint is used as a decision variable, and the terminal constraint is implicitly defined for an augmented system that depends on this setpoint. In this respect, instead of constraining the terminal state to belong to an invariant set, an extended prediction horizon is applied, whose minimum length can be efficiently determined by solving linear programs. This methodology overcomes size-related limitations and complexity inherent in invariant set calculations, while simplifying the offline design of the controller components. The applicability of the proposed approach is illustrated through an academic example.
|
|
16:20-16:40, Paper MoC09.2 | |
>Robust-Guaranteed Approximation of Disturbance Invariant Sets for Systems with Near-Unit-Disk Spectral Radius |
|
Nguyen, Duc Giap | Kyungpook National University |
Park, Suyong | Kyungpook National University |
Li, Nan | Tongji University |
Park, Jinrak | Hyundai Motor Company |
Kim, Dohee | Hyundai Motor Company |
Eo, Jeong Soo | Hyundai Motor Company |
Han, Kyoungseok | Hanyang University |
Keywords: Predictive control for linear systems, Robust control, Optimal control
Abstract: This study presents a practical algorithm for approximating the Robust Positively Invariant (RPI) set within the context of robust tube Model Predictive Control (MPC) for discrete-time, linear time-invariant systems. When the stable matrix exhibits a spectral radius close to the unit disk, computing the RPI set becomes challenging, potentially rendering it infeasible. We first analyze the impact of the spectral radius on RPI set convergence, providing an insight into the problem. Subsequently, we propose an approach to integrate approximation into the RPI set computation while preserving robustness. This is achieved by enforcing the upper and lower dimensional bounds of the RPI set. Additionally, we incorporate disturbance estimation error bounding into the Tube MPC framework to address substantial additive disturbances. These disturbances, if directly treated by Tube MPC, otherwise lead to over-conservative or empty tightened state and control sets. Throughout the study, we demonstrate the effectiveness of the proposed algorithm through numerical simulations of a car-following problem.
|
|
16:40-17:00, Paper MoC09.3 | |
>Probabilistically Input-To-State Stable Stochastic Model Predictive Control |
|
Pfefferkorn, Maik | Technical University of Darmstadt |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for linear systems, Stability of linear systems, Uncertain systems
Abstract: Employing model predictive control to systems with unbounded, stochastic disturbances poses the challenge of guaranteeing safety, i.e., repeated feasibility and stability of the closed-loop system. Especially, there are no strict repeated feasibility guarantees for standard stochastic MPC formulations. Thus, traditional stability proofs are not straightforwardly applicable. We exploit the concept of input-to-state stability in probability and outline how it can be used to provide stability guarantees, circumventing the requirement for strict repeated feasibility guarantees. Loss of feasibility is captured by a back-up controller, which is explicitly taken into account in the stability analysis. We illustrate our findings using a numeric example.
|
|
17:00-17:20, Paper MoC09.4 | |
>Probabilistically Safe Controllers Based on Control Barrier Functions and Scenario Model Predictive Control |
|
Andre do Nascimento, Allan | University of Oxford |
Papachristodoulou, Antonis | University of Oxford |
Margellos, Kostas | University of Oxford |
Keywords: Predictive control for linear systems, Data driven control, Stochastic systems
Abstract: Control barrier functions (CBFs) offer an efficient framework for designing real-time safe controllers. However, CBF-based controllers can be short-sighted, resulting in poor performance, a behaviour which is aggravated in uncertain conditions. This motivated research on safety filters based on model predictive control (MPC) and its stochastic variant. MPC deals with safety constraints in a direct manner, however, its computational demands grow with the prediction horizon length. We propose a safety formulation that solves a finite horizon optimization problem at each time instance like MPC, but rather than explicitly imposing constraints along the prediction horizon, we enforce probabilistic safety constraints by means of CBFs only at the first step of the horizon. The probabilistic CBF constraints are transformed in a finite number of deterministic CBF constraints via the scenario based methodology. Capitalizing on results on scenario based MPC, we provide distribution-free, emph{a priori} guarantees on the system's closed loop expected safety violation frequency. We demonstrate our results through a case study on unmanned aerial vehicle collision-free position swapping, and provide a numerical comparison with recent stochastic CBF formulations.
|
|
17:20-17:40, Paper MoC09.5 | |
>On the Sample Complexity of Imitation Learning for Smoothed Model Predictive Control |
|
Pfrommer, Daniel | Massachusetts Institute of Technology |
Padmanabhan, Swati | Massachusetts Institute of Technology |
Ahn, Kwangjun | MIT |
Umenberger, Jack | MIT |
Marcucci, Tobia | Massachusetts Institute of Technology |
Mhammedi, Zakaria | MIT |
Jadbabaie, Ali | Massachusetts Institute of Technology |
Keywords: Predictive control for linear systems, Statistical learning, Learning
Abstract: Recent work in imitation learning has shown that having an expert controller that is both suitably smooth and stable enables much stronger guarantees on the performance of the approximating learned controller. Constructing such smoothed expert controllers for arbitrary systems remains challenging, especially in the presence of input and state constraints. We show how such a smoothed expert can be designed for a general class of systems using a log-barrier-based relaxation of a standard Model Predictive Control (MPC) optimization problem. We validate our findings via experiments, demonstrating the merits of our smoothing approach over randomized smoothing.
|
|
17:40-18:00, Paper MoC09.6 | |
>On State Reconstruction for Linear Reduced-Order Output Feedback Optimal Control |
|
Schurig, Roland | TU Darmstadt, Control and Cyber-Physical Systems Laboratory |
Lenz, Eric | Technische Universität Darmstadt |
Himmel, Andreas | TU Darmstadt |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for linear systems, Model/Controller reduction, Observers for Linear systems
Abstract: The solution of optimal control problems is fundamental to numerous concepts, such as model predictive control. Despite recent advancements, solving these problems remains a challenge, particularly with complex systems involving multiple states. Employing reduced order models is a strategy to simplify these problems, but accurately estimating system states from output measurements continues to be difficult. We address the challenge of reconstructing the state of a linear dynamical system using measured input and output data, intending to use this reconstructed state in an optimal control framework, reducing the problem size. Our objective is to minimize the discrepancy between the optimal solution derived from the true system state and that from the reconstructed state. We approach the reconstruction problem through the lens of observability within a finite horizon, which allows us to confine our search to a subspace of the original state space containing only observable components. This confinement effectively yields a reduced-order system representation. We delineate conditions under which the reconstruction problem can be solved and demonstrate the practicality of our approach with a case study.
|
|
MoC10 |
Brown 1 |
Stochastic Systems, Control and Related Fields |
Invited Session |
Chair: Pasik-Duncan, Bozenna | Univ. of Kansas |
Co-Chair: Stettner, L. | Polish Academy of Sciences |
Organizer: Pasik-Duncan, Bozenna | Univ. of Kansas |
Organizer: Yin, George | University of Connecticut |
Organizer: Stettner, L. | Polish Academy of Sciences |
|
16:00-16:20, Paper MoC10.1 | |
>Stability of Long Run Functionals with Respect to Stationary Markov Controls (I) |
|
Stettner, L. | Polish Academy of Sciences |
Keywords: Stochastic optimal control, Markov processes, Stochastic systems
Abstract: In the paper we study dependence of long run functionals and limit characteristics assuming that Borel measurable Markov controls converge pointwise. We consider two kinds of functionals: average cost per unit time and long run risk sensitive. We impose uniform ergodicity assumption, which is later is relaxed and suitable convergence of controlled transition probabilities.
|
|
16:20-16:40, Paper MoC10.2 | |
>An Inverse Problem for Adaptive Linear Quadratic Stochastic Differential Games (I) |
|
Chen, Zhixing | Academy of Mathematics and Systems Science, Chinese Academy of S |
Guo, Lei | Academy of Mathematics and Systems Science, Chinese Academy of S |
Keywords: Stochastic systems, Game theory, Adaptive control
Abstract: In this paper, we consider an inverse problem for adaptive two-player stochastic linear quadratic differential games where the cost functions of players are unknown to each other, which arise in many practical situations but have rarely been explored theoretically. We will introduce a new feedback pattern where Player 2 chooses the optimal actions given the actions of Player 1, while Player 1 uses an adaptive learning algorithm to recover Player 2's cost function and chooses an adaptive strategy by solving the corresponding coupled Riccati equations. Under some suitable assumptions on the system matrices, it is shown that estimates for the cost function parameters are strongly consistent, and the adaptive game systems asymptotically reach the Nash equilibrium.
|
|
16:40-17:00, Paper MoC10.3 | |
>Large-Population Risk-Sensitive Linear Quadratic Optimal Control (I) |
|
Wang, Yu | Shandong University |
Huang, Minyi | Carleton University |
Keywords: Stochastic systems, Stochastic optimal control, Mean field games
Abstract: We study a risk-sensitive linear-quadratic optimal control problem where a large number of N agents have mean-field interactions. We derive the centralized optimal control law and the resulting decentralized individual control law by passing to the mean-field limit. This procedure is similar to the so-called direct approach in mean-field control. We further compare the asymptotic performances of the above two control laws. The performance difference between the two sets of control laws does not vanish, and instead has an upper bound depending on the risk sensitivity parameter and the noise intensity. This phenomenon is very different from both risk-neural social optimization and risk-sensitive mean-field games, and is inherently due to the exponential functional structure of the cost, which is closely related to large deviations theory.
|
|
17:00-17:20, Paper MoC10.4 | |
>Multi-Layer Mean-Field-Type Games Driven by Five Noises (I) |
|
Duncan, Tyrone E. | Univ. of Kansas |
Pasik-Duncan, Bozenna | Univ. of Kansas |
Tembine, Hamidou | NYU |
Keywords: Game theory, Stochastic systems, Optimal control
Abstract: Gaussian and Poisson noises have been widely studied in mean-field-type game theory. In this paper, we examine the effect of the noise in the equilibrium strategies. We show that the equilibrium strategies obtained under states driven by Brownian motions, Fractional Brownian motions, Gauss-Volterra and Poisson processes no longer provide equilibrium when mixed with Rosenblatt noises. The result suggests a careful understanding of the noise structures involved in the quantities of interests in the state dynamics before approximating it by standard Gaussian noises
|
|
17:20-17:40, Paper MoC10.5 | |
>Asymptotic Normality of Cumulative Cost in Linear Quadratic Regulators (I) |
|
Sayedana, Borna | McGill University |
Caines, Peter E. | McGill University |
Mahajan, Aditya | McGill University |
Keywords: Stochastic optimal control, Stochastic systems, Linear systems
Abstract: The central limit theorem is a fundamental result in probability theory that characterizes the distribution of deviation from the mean in the law of large numbers. Similar distributional behavior emerges in other frameworks such as maximum likelihood estimation, least squares estimation, and stochastic approximation. In this paper, we establish a central limit theorem for the cumulative per-step cost incurred by the optimal policy in linear quadratic regulators using first principles. Our proof technique relies on a decomposition of cumulative cost using a completion of square argument, properties of the noise sequence with even density, and a central limit theorem for martingale difference sequences.
|
|
17:40-18:00, Paper MoC10.6 | |
>Variational Dynamic Programming for Stochastic Optimal Control |
|
Lambert, Marc | Ecole Normale Superieure |
Bach, Francis | INRIA - Ecole Normale Supérieure |
Bonnabel, Silvere | Mines Paris PSL |
Keywords: Stochastic optimal control, Variational methods, Robotics
Abstract: We consider the problem of stochastic optimal control, where the state-feedback control policies take the form of a probability distribution and where a penalty on the entropy is added. By viewing the cost function as a Kullback- Leibler (KL) divergence between two joint distributions, we bring the tools from variational inference to bear on our optimal control problem. This allows for deriving a dynamic programming principle, where the value function is defined as a KL divergence again. We then resort to Gaussian distributions to approximate the control policies and apply the theory to control affine nonlinear systems with quadratic costs. This results in closed-form recursive updates, which generalize LQR control and the backward Riccati equation. We illustrate this novel method on the simple problem of stabilizing an inverted pendulum.
|
|
MoC11 |
Brown 2 |
Data Driven Control III |
Regular Session |
Chair: Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Co-Chair: Forni, Fulvio | University of Cambridge |
|
16:00-16:20, Paper MoC11.1 | |
>The Transient Predictor |
|
Moffat, Keith | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Chiuso, Alessandro | Univ. Di Padova |
Keywords: Data driven control, Closed-loop identification, Predictive control for linear systems
Abstract: This paper introduces the Transient Predictor and describes how it can be used to estimate the Multistep Predictor, which can be applied to applications such as Data-Driven Predictive Control (DDPC). The Transient Predictor has two desirable traits that differentiate it from other methods for estimating the Multistep Predictor, such as the standard Subspace Predictor method: 1) Causality---the Transient Predictor asserts a causal relationship between future inputs and future outputs; and 2) Bias---the Transient Predictor is a consistent predictor of future outputs. This paper provides an easy-to-implement algorithm for estimating the Transient Predictor and in turn the Multistep Predictor, and demonstrates its efficacy for DDPC. In experiments, we find that the Transient Predictor-based DDPC performs remarkably well with small lead-in data lengths, indicating that it is well-suited for tasks in which large amounts of data are not available. In addition, the Transient Predictor is not afflicted by the same bias as subspace-based methods when data is gathered in closed loop.
|
|
16:20-16:40, Paper MoC11.2 | |
>Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process |
|
Miller, Kevin Scott | University of Texas, Austin |
Thorpe, Adam | University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Data driven control, Identification, Machine learning
Abstract: We present an active learning algorithm for learning dynamics that leverages side information by explicitly incorporating prior domain knowledge into the sampling process. Our proposed algorithm guides the exploration toward regions that demonstrate high empirical discrepancy between the observed data and an imperfect prior model of the dynamics derived from side information. Through numerical experiments, we demonstrate that this strategy explores regions of high discrepancy and accelerates learning while simultaneously reducing model uncertainty. We rigorously prove that our active learning algorithm yields a consistent estimate of the underlying dynamics by providing an explicit rate of convergence for the maximum predictive variance. We demonstrate the efficacy of our approach on an under-actuated pendulum system and on the half-cheetah MuJoCo environment.
|
|
16:40-17:00, Paper MoC11.3 | |
>Learning System Dynamics from Sensory Input under Optimal Control Principles |
|
Bounou, Oumayma | Inria, ENS |
Ponce, Jean | Ecole Normale Supérieure |
Carpentier, Justin | Inria |
Keywords: Data driven control, Identification for control, Machine learning
Abstract: Identifying physical systems from sensory input is of high interest in control, robotics, and engineering in general. In the context of control problems, existing approaches decouple the construction of the feature space where the dynamics identification process occurs from the target control tasks, potentially leading to a mismatch between feature and state spaces: the systems may not be controllable in feature space, and synthesized controls may not be applicable in state space. Borrowing from the Koopman formalism, we propose instead to learn an embedding of both the states and controls into a feature space where the dynamics are linear, and include the target control task in the learning objective in the form of a differentiable and robust optimal control problem. We validate the proposed approach with simulation experiments using systems with non-linear dynamics, demonstrating that the controls obtained in feature space can be used to drive the corresponding physical systems and that the learned model can serve for future state prediction.
|
|
17:00-17:20, Paper MoC11.4 | |
>Passive iFIR Filters for Data-Driven Control |
|
Wang, Zixing | University of Cambridge |
Huo, Yongkang | University of Cambridge |
Forni, Fulvio | University of Cambridge |
Keywords: Data driven control, Identification for control, PID control
Abstract: We consider the design of a new class of passive iFIR controllers given by the parallel action of an integrator and a finite impulse response filter. iFIRs are more expressive than PID controllers but retain their features and simplicity. The paper provides a model-free data-driven design for passive iFIR controllers based on virtual reference feedback tuning. Passivity is enforced through constrained optimization (three different formulations are discussed). The proposed design does not rely on large datasets or accurate plant models.
|
|
17:20-17:40, Paper MoC11.5 | |
>Safe Output Feedback Improvement with Baselines |
|
Zhang, Ruoqi | Uppsala Univeristy |
Mattsson, Per | Uppsala University |
Zachariah, Dave | Uppsala University |
Keywords: Data driven control, Identification for control, Uncertain systems
Abstract: In data-driven control design, an important problem is to deal with uncertainty due to limited and noisy data. One way to do this is to use a min-max approach, which aims to minimize some design criteria for the worst-case scenario. However, a strategy based on this approach can lead to overly conservative controllers. To overcome this issue, we apply the idea of baseline regret, and it is seen that minimizing the baseline regret under model uncertainty can guarantee safe controller improvement with less conservatism and variance in the resulting controllers. To exemplify the use of baseline controllers, we focus on the output feedback setting and propose a two-step control design method; first, an uncertainty set is constructed by a data-driven system identification approach based on finite impulse response models; then a control design criterion based on model reference control is used. To solve the baseline regret optimization problem efficiently, we use a convex approximation of the criterion and apply the scenario approach in optimization. The numerical examples show that the inclusion of baseline regret indeed improves the performance and reduces the variance of the resulting controller.
|
|
17:40-18:00, Paper MoC11.6 | |
>Robust and Finite-Time Stable Model-Free Control for Second Order Systems without Velocity Measurements (I) |
|
Wang, Ningshan | Syracuse University |
Sanyal, Amit | Syracuse University |
Keywords: Data driven control, Identification for control, Uncertain systems
Abstract: This article presents a framework for model-free control design for mechanical systems without velocity measurements and with an unknown dynamics, considered as a bounded disturbance input. The system states consist of zeroth-order (e.g position) and first-order (e.g velocity) vectors, but only the zeroth-order states are the measured outputs. This model-free control framework is based on a first-order signal differentiator and a finite-time stable extended state observer that simultaneously estimates the states and the bounded disturbance input in real time with guaranteed bounds on accuracy of the estimates. The estimates provided by this observer are used to track a desired output trajectory and compensate the disturbance in real time. Overall nonlinear stability and robustness of the observer is shown theoretically and verified through numerical simulations. The proposed method can be applied to second-order systems and their teams, like mobile robots, unmanned aerial vehicles, unmanned (under)water vehicles and space vehicles.
|
|
MoC12 |
Brown 3 |
Reinforcement Learning II |
Regular Session |
Chair: Taghvaei, Amirhossein | University of Washington Seattle |
Co-Chair: Harder, Hans | University of Paderborn |
|
16:00-16:20, Paper MoC12.1 | |
>Reinforcement Learning with Quasi-Hyperbolic Discounting |
|
Sure Reddappa Setty, Eshwar | Indian Institute of Science |
Motwani, Mayank | Indian Institute of Technology, Bombay |
Roy, Nibedita | Indian Institute of Science |
Thoppe, Gugan Chandrashekhar Mallika | Indian Institute of Science |
Keywords: Reinforcement learning
Abstract: Reinforcement learning has traditionally been studied with exponential discounting or the average reward setup, mainly due to their mathematical tractability. However, such frameworks fall short of accurately capturing human behavior, which has a bias towards immediate gratification. Quasi-Hyperbolic (QH) discounting is a simple alternative for modeling this bias. Unlike in traditional discounting, though, the optimal QH-policy, starting from some time t_1, can be different to the one starting from t_2. Hence, the future self of an agent, if it is naive or impatient, can deviate from the policy that is optimal at the start, leading to sub-optimal overall returns. To prevent this behavior, an alternative is to work with a policy anchored in a Markov Perfect Equilibrium (MPE). In this work, we propose the first model-free algorithm for finding an MPE. Using a two-timescale analysis, we show that, if our algorithm converges, then the limit must be an MPE. We also validate this claim numerically for the standard inventory system with stochastic demands. Our work significantly advances the practical application of reinforcement learning.
|
|
16:20-16:40, Paper MoC12.2 | |
>Dual Ensemble Kalman Filter for Stochastic Optimal Control |
|
Joshi, Anant A. | University of Illinois at Urbana Champaign |
Taghvaei, Amirhossein | University of Washington Seattle |
Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Meyn, Sean P. | Univ. of Florida |
Keywords: Reinforcement learning, Stochastic optimal control, Stochastic systems
Abstract: In this paper, stochastic optimal control problems in continuous time and space are considered. In recent years, such problems have received renewed attention from the lens of reinforcement learning (RL) which is also one of our motivation. The main contribution is a simulation-based algorithm -- dual ensemble Kalman filter (EnKF) -- to numerically approximate the solution of these problems. The paper extends our previous work where the dual EnKF was applied in deterministic settings of the problem. The theoretical results and algorithms are illustrated with numerical experiments.
|
|
16:40-17:00, Paper MoC12.3 | |
>Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time |
|
Mazumdar, Abhijit | Aalborg University, Denmark |
Wisniewski, Rafal | Aalborg University |
Bujorianu, Luminita Manuela | University College London |
Keywords: Reinforcement learning, Stochastic systems, Constrained control
Abstract: In this paper, we present an online reinforcement learning algorithm for constrained Markov decision processes with a safety constraint. Despite the necessary attention of the scientific community, considering stochastic stopping time, the problem of learning optimal policy without violating safety constraints during the learning phase is yet to be addressed. To this end, we propose an algorithm based on linear programming that does not require a process model. We show that the learned policy is safe with high confidence. We also propose a method to compute a safe baseline policy, which is central in developing algorithms that do not violate the safety constraints. Finally, we provide simulation results to show the efficacy of the proposed algorithm. Further, we demonstrate that efficient exploration can be achieved by defining a subset of the state-space called proxy set.
|
|
17:00-17:20, Paper MoC12.4 | |
>A System-Theoretic Note on Model-Based Actor-Critic |
|
Baroncini, Simone | University of Bologna |
Carnevale, Guido | University of Bologna |
Gharesifard, Bahman | Queen's University |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Reinforcement learning, Markov processes, Optimization algorithms
Abstract: In this paper, we provide preliminary results toward the direction of using systems theory tools for the design and analysis of reinforcement learning algorithms. Specifically, we analyze the convergence properties of a model-based scheme with an actor-critic structure. The distinctive feature of our scheme is that the actor and critic updates are equipped with auxiliary variables that allow for the use of a constant step size. Although idealized due to the assumption on access to the underlying Markov Decision Process (MDP), the investigated setting is a starting point toward a genuine (model-free) actor-critic scheme. A key contribution is the interpretation of this algorithmic framework in terms of discrete-time, interconnected dynamical systems. Specifically, by resorting to Singular Perturbations (SP), we reinterpret the whole algorithm as the interconnection of a fast subsystem (auxiliary variables’ mechanism), an intermediate one (critic), and a slow one (actor). We separately analyze three auxiliary systems each corresponding to one of the identified subsystems. These preparatory results, combined with SP and LaSalle arguments, allow us to prove that the overall method asymptotically converges to a problem stationary point. Some numerical simulations confirm our theoretical findings.
|
|
17:20-17:40, Paper MoC12.5 | |
>On the Continuity and Smoothness of the Value Function in Reinforcement Learning and Optimal Control |
|
Harder, Hans | University of Paderborn |
Peitz, Sebastian | Paderborn University |
Keywords: Reinforcement learning, Optimal control, Stochastic systems
Abstract: The value function plays a crucial role as a measure for the cumulative future reward an agent receives in both reinforcement learning and optimal control. It is therefore of interest to study how similar the values of neighboring states are, i.e., to investigate the continuity of the value function. We do so by providing and verifying upper bounds on the value function's modulus of continuity. Additionally, we show that the value function is always Hölder continuous under relatively weak assumptions on the underlying system and that non-differentiable value functions can be made differentiable by slightly "disturbing" the system.
|
|
17:40-18:00, Paper MoC12.6 | |
>Regret-Optimal Defense against Stealthy Adversaries: A System Level Approach |
|
Tsukamoto, Hiroyasu | NASA JPL, Caltech |
Hajar, Joudi | Caltech |
Chung, Soon-Jo | California Institute of Technology |
Hadaegh, Fred Y. | California Inst. of Tech |
Keywords: Resilient Control Systems, Robust control, Optimization
Abstract: Modern control designs in robotics, aerospace, and cyber-physical systems rely heavily on real-world data obtained through system outputs. However, these outputs can be compromised by system faults and malicious attacks, distorting critical system information needed for secure and reliable operation. In this paper, we introduce a novel regret-optimal control framework for designing controllers that make a linear system robust against stealthy attacks, including both sensor and actuator attacks. Specifically, we present (a) a convex optimization-based system metric to quantify the regret under the worst-case stealthy attack (the difference between actual performance and optimal performance with hindsight of the attack), which adapts and improves upon the mathcal{H}_2 and mathcal{H}_{infty} norms in the presence of stealthy adversaries, (b) an optimization problem for minimizing the regret of (a) in system-level parameterization, enabling localized and distributed implementation in large-scale systems, and (c) a rank-constrained optimization problem equivalent to the optimization of (b), which can be solved using convex rank minimization methods. We also present numerical simulations that demonstrate the effectiveness of our proposed framework.
|
|
MoC13 |
Suite 1 |
Estimation and Control of Distributed Parameter Systems III |
Invited Session |
Chair: Hu, Weiwei | University of Georgia |
Co-Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
16:00-16:20, Paper MoC13.1 | |
>Relaxed Controls and Measure Controls (I) |
|
D'Apice, Ciro | University of Salerno |
Manzo, Rosanna | University of Salerno |
Rarita', Luigi | University of Salerno |
Piccoli, Benedetto | Rutgers University - Camden |
Keywords: Distributed parameter systems, Nonlinear systems, Stability of nonlinear systems
Abstract: In this paper, we introduce a new idea to generalize the concept of relaxed control to the framework of measure differential equations, recently introduced in [15]. A relaxed control is defined as a probability measure on the space of controls, and, similarly, a measure control is a feedback relaxed control which depends on the measure distribution on the state space representing the state of the system. Relaxed controls are useful to solve optimal control and stabilization problems. On the other side, measure differential equations allow deterministic modeling of uncertainty, finite-speed diffusion, concentration, and other phenomena. Moreover, it represents a natural generalization of Ordinary Differential Equations to measures. We establish regularity properties of measure controls to ensure existence and uniqueness of trajectories and show applications to stabilization problems.
|
|
16:20-16:40, Paper MoC13.2 | |
>Concurrent Evacuation Planning and Adaptive Spatial Field Estimation Using Field-Dependent Guidance with On-The-Fly Trajectory Reconfiguration (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems, Adaptive systems
Abstract: This work examines an on-the-fly trajectory reconfiguration for the human evacuation in indoor environments for two different situations. When the spatial field representing carbon monoxide concentration is known but time varying, the evacuation planning uses a level-set guidance to reposition the evacuee towards an exit that will result in the smallest amount of the hazardous substance accumulated in the lungs. As the level-set guidance is based on the time-invariant spatial field, a discrete time snapshot of the time-varying field is used to generate viable escape trajectories. The escape trajectories are recalculated when new spatial field knowledge is updated thus leading to continuous trajectory reconfiguration. The other situation involves an unknown spatial field that is either constant in time or slowly time-varying. The level-set guidance is based on a current estimate of the field furnished via an adaptive spatial field estimation. Such an adaptive estimate is made possible due to the motion of the evacuee which is capable of inducing persistence of excitation and thus yielding convergence of the spatial field estimation. A planning stage is added to the duration of one cycle, consisting now of both a planning stage in which the agent is immobile and computing, and a travelling stage in which the agent is moving towards the currently-declared viable exit. During both stages, the agent is able to adaptively estimate the spatial field, but using the frozen-in-time (snapshots) knowledge of the spatial field's adaptive estimate that is produced at the end of the planning stage. Numerical studies for both cases are included to provide insights on the effects of accumulated amounts of hazardous environments on the escape trajectories in human evacuation.
|
|
16:40-17:00, Paper MoC13.3 | |
>Finite-Dimensional Homogeneous Boundary Control for a 1D Reaction-Diffusion Equation (I) |
|
Ayamou, Mericel | Univ. Lille |
Espitia, Nicolas | University of Lille - CNRS - CRIStAL Lab |
Polyakov, Andrey | Inria, Univ. Lille |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Distributed parameter systems, LMIs, Stability of nonlinear systems
Abstract: In this paper, we address the problem of finitedimensional boundary stabilization of 1D reaction diffusion equation . Using the modal decomposition approach, we propose a finite-dimensional homogeneous controller, stabilizing the unstable dynamics while ensuring the stability of the residual part. The closed loop system with homogeneous feedback is well-posed and stable. The proposed controller is proven superior to a finite-dimensional linear feedback controller in terms of closed-loop performance. The numerical simulations are presented to support the analytical results.
|
|
17:00-17:20, Paper MoC13.4 | |
>Stabilizing Nonlinear ODEs with Diffusive Actuator Dynamics |
|
Irscheid, Abdurrahman | Saarland University |
Gehring, Nicole | Johannes Kepler University Linz |
Deutscher, Joachim | Universität Ulm |
Rudolph, Joachim | Saarland University |
Keywords: Distributed parameter systems
Abstract: This paper presents a design of stabilizing controllers for a cascaded system consisting of a boundary actuated parabolic PDE and nonlinear dynamics at the unactuated boundary. Although the considered PDE is linear, the nonlinearity of the ODE constitutes a significant challenge. In order to solve this problem, it is shown that the classical backstepping transformation of Volterra type directly results from the solution of a Cauchy problem. This new perspective enables the derivation of a controller for the nonlinear setup, where a Volterra integral representation does not exist. Specifically, the solution of an appropriate linear Cauchy problem yields a novel state transformation facilitating the design of a stabilizing state feedback. This control law is shown to ensure asymptotic closed-loop stability of the origin. An efficient implementation of the controller is proposed and demonstrated for an example.
|
|
17:20-17:40, Paper MoC13.5 | |
>On Stabilization of Large-Scale Systems of Linear Hyperbolic PDEs Via Continuum Approximation of Exact Backstepping Kernels (I) |
|
Humaloja, Jukka-Pekka | Technical University of Crete |
Bekiaris-Liberis, Nikolaos | Technical University of Crete |
Keywords: Backstepping, Distributed parameter systems, Large-scale systems
Abstract: We establish that stabilization of a class of linear, hyperbolic PDEs with a large (nevertheless finite) number of components, can be achieved via employment of a backstepping-based control law, which is constructed for stabilization of a continuum version (i.e., as the number of components tends to infinity) of the PDE system. This is achieved by proving that the exact backstepping kernels, constructed for stabilization of the large-scale system, can be approximated (in certain sense such that exponential stability is preserved) by the backstepping kernels constructed for stabilization of a continuum version (essentially an infinite ensemble) of the original PDE system. The proof relies on construction of a convergent sequence of backstepping kernels that is defined such that each kernel matches the exact backstepping kernels (derived based on the original, large-scale system), in a piecewise constant manner with respect to an ensemble variable; while showing that they satisfy the continuum backstepping kernel equations. We present a numerical example that reveals that complexity of computation of stabilizing backstepping kernels may not scale with the number of components of the PDE state, when the kernels are constructed on the basis of the continuum version, in contrast to the case in which they are constructed on the basis of the original, large-scale system. Thus, this approach can be useful for design of computationally tractable, stabilizing backstepping-based control laws for large-scale PDE systems.
|
|
17:40-18:00, Paper MoC13.6 | |
>Sliding Mode Observation for a 1D Wave Equation with Dynamic Boundary Conditions |
|
Chitour, Yacine | Universit'e Paris-Sud, CNRS, Supelec |
Dahmani, Abdelhakim | Research Center for Complex Systems, Aalen University, Aalen, Ge |
Labbadi, Moussa | Aix-Marseille University |
Roman, Christophe | LIS |
Keywords: Distributed parameter systems, Variable-structure/sliding-mode control, Flexible structures
Abstract: This study focuses on employing sliding mode observer for a wave equation subject to two dynamic boundary conditions with anti-damping coefficients and a perturbation and a control at one of the boundaries. The exponential decay rate of the trajectory of the observer is demonstrated through the application of the multiplier method. The well-posedness of the error-system is proven using maximal monotone operator.
|
|
MoC14 |
Suite 2 |
Kalman Filtering |
Regular Session |
Chair: Borri, Alessandro | CNR-IASI |
Co-Chair: Kishida, Masako | National Institute of Informatics |
|
16:00-16:20, Paper MoC14.1 | |
>Asynchronous Variational-Bayes Kalman Filtering |
|
Greiff, Marcus Carl | Mitsubishi Electric Research Laboratries |
Berntorp, Karl | Mitsubishi Electric Research Labs |
Keywords: Estimation, Kalman filtering, Identification
Abstract: We consider the joint state and measurement-noise parameter estimation problem for nonlinear state-space models with asynchronous, variable-rate, and independent measurement sources. We approach the problem using variational Bayes Kalman filters (VB-KFs). By leveraging that the measurements from different sources are independent, we develop an asynchronous VB-KF (textsc{avb-kf}), which processes measurements from different sources sequentially and at a variable rate. Hence, in the measurement update step, we only update the noise parameters of measurements that have been processed at a particular time step. This results in faster computations, especially as the measurement dimension and the number of sensors grow. We validate the approach on a realistic application of autonomous mobile-robot platooning, where we perform fusion of multiple sensor modalities with time-varying noise characteristics. The results indicate more than a factor of two improvements measured as a time-averaged absolute error compared to a nonadaptive implementation.
|
|
16:20-16:40, Paper MoC14.2 | |
>Covariance Intersection-Based Invariant Kalman Filtering (DInCIKF) for Distributed Pose Estimation |
|
Li, Haoying | Chinese University of Hong Kong, Shenzhen |
Li, Xinghan | Zhejiang University |
Huang, Shuaiting | Zhejiang University |
Yang, Chao | East China University of Science and Technology |
Wu, Junfeng | The Chinese Unviersity of Hong Kong, Shenzhen |
Keywords: Kalman filtering, Agents-based systems, Autonomous robots
Abstract: This paper presents a novel approach to distributed pose estimation in multi-agent system based on Invariant Kalman Filter with Covariance Intersection. Our method models uncertainties using Lie algebra and applies object-level observations within Lie groups, which have practical application value. We integrate Covariance Intersection to handle estimates that are correlated and use the Invariant Kalman Filter for merging independent data sources. This strategy allows us to effectively tackle the complex correlations of cooperative localization among agents, ensuring our estimates are neither too conservative nor overly confident. Additionally, we examine the consistency and the stability of our algorithm, providing evidence of its reliability and effectiveness in managing multi-agent systems.
|
|
16:40-17:00, Paper MoC14.3 | |
>Low-Rank Approximated Kalman--Bucy Filters Using Oja’s Principal Component Flow for Linear Time-Invariant Systems |
|
Tsuzuki, Daiki | Kyoto University |
Ohki, Kentaro | Kyoto University |
Keywords: Kalman filtering, Model/Controller reduction, Filtering
Abstract: The Kalman--Bucy filter is extensively utilized across various applications. However, its computational complexity increases significantly in large-scale systems. To mitigate this challenge, a low-rank approximated Kalman--Bucy filter was proposed, comprising Oja's principal component flow and a low-dimensional Riccati differential equation. Previously, the estimation error was confirmed solely for linear time-invariant systems with a symmetric system matrix. This study extends the application by eliminating the constraint on the symmetricity of the system matrix and describes the equilibrium points of the Oja flow along with their stability for general matrices. In addition, the domain of attraction for a set of stable equilibrium points is estimated. Based on these findings, we demonstrate that the low-rank approximated Kalman--Bucy filter with a suitable rank maintains a bounded estimation error covariance matrix if the system is controllable and observable.
|
|
17:00-17:20, Paper MoC14.4 | |
>Invariant Filtering for Wheeled Vehicle Localization with Unknown Wheel Radius and Unknown GNSS Lever Arm |
|
Chauchat, Paul | Aix-Marseille Univ, CNRS, LIS |
Bonnabel, Silvere | Mines Paris PSL |
Barrau, Axel | Offroad |
Keywords: Kalman filtering, Observers for nonlinear systems
Abstract: We consider the problem of observer design for a nonholonomic car (more generally a wheeled robot) equipped with wheel speeds with unknown wheel radius, and whose position is measured via a GNSS antenna placed at an unknown position in the car. In a tutorial and unified exposition, we recall the recent theory of two-frame systems within the field of invariant Kalman filtering. We then show how to adapt it geometrically to address the considered problem, although it seems at first sight out of its scope. This yields an invariant extended Kalman filter having autonomous error equations, and state-independent Jacobians, which is shown to work remarkably well in simulations. The proposed novel construction thus extends the application scope of invariant filtering.
|
|
17:20-17:40, Paper MoC14.5 | |
>Enhanced Quadratic Extended Kalman Filter |
|
Palombo, Giovanni | IASI-CNR |
d'Angelo, Massimiliano | National Research Council of Italy (CNR) |
Papa, Federico | IASI-CNR |
Cusimano, Valerio | CNR-IASI, Italian National Research Council - Institute for Syst |
Borri, Alessandro | CNR-IASI |
Keywords: Filtering, Kalman filtering, Nonlinear systems
Abstract: In this note, we present a new solution to the filtering problem for stochastic discrete-time nonlinear systems, which we refer to as the Enhanced Quadratic Extended Kalman Filter (eQEKF). Starting from the concept underlying the existing formulation of the Quadratic Extended Kalman Filter (QEKF), based on the definition of an augmented output through Kronecker powers, we propose a different method that enables us to overcome certain inevitable standard approximation issues, reducing the computational workload. Also, we show the effectiveness of the proposed approach with respect to the QEKF and with respect to the classical Extended Kalman Filter, as highlighted by two numerical examples, in the case of Gaussian and non-Gaussian noises.
|
|
17:40-18:00, Paper MoC14.6 | |
>Risk-Aware Control of Discrete-Time Stochastic Systems: Integrating Kalman Filter and Worst-Case CVaR in Control Barrier Functions |
|
Kishida, Masako | National Institute of Informatics |
Keywords: Uncertain systems, Kalman filtering, Linear systems
Abstract: This paper proposes control approaches for discrete-time linear systems subject to stochastic disturbances. It employs Kalman filter to estimate the mean and covariance of the state propagation, and the worst-case conditional value-at-risk (CVaR) to quantify the tail risk using the estimated mean and covariance. The quantified risk is then integrated into a control barrier function (CBF) to derive constraints for controller synthesis, addressing tail risks near safe set boundaries. Two optimization-based control methods are presented using the obtained constraints for half-space and ellipsoidal safe sets, respectively. The effectiveness of the obtained results is demonstrated using numerical simulations.
|
|
MoC15 |
Suite 3 |
Mechatronics |
Regular Session |
Chair: Salton, Aurelio Tergolina | Universidade Federal Do Rio Grande Do Sul |
Co-Chair: Tóth, Roland | Eindhoven University of Technology |
|
16:00-16:20, Paper MoC15.1 | |
>Frequency Domain Auto-Tuning of Structured LPV Controllers for High-Precision Motion Control (I) |
|
Broens, Yorick | Eindhoven University of Technology |
Butler, Hans | ASML |
Tóth, Roland | Eindhoven University of Technology |
Keywords: Mechatronics, Linear parameter-varying systems, Optimization
Abstract: Motion systems are a vital part of many industrial processes. However, meeting the increasingly stringent demands of these systems, especially concerning precision and throughput, requires novel control design methods that can go beyond the capabilities of traditional solutions. Traditional control methods often struggle with the complexity and position-dependent effects inherent in modern motion systems, leading to compromises in performance and a laborious task of controller design. This paper addresses these challenges by introducing a novel structured feedback control auto-tuning approach for multiple-input multiple-output (MIMO) motion systems. By leveraging frequency response function (FRF) estimates and the linear-parameter-varying (LPV) control framework, the proposed approach automates the controller design, while providing local stability and performance guarantees. Key innovations include norm-based magnitude optimization of the sensitivity functions, an automated stability check through a novel extended factorized Nyquist criterion, a modular structured MIMO LPV controller parameterization, and a controller discretization approach which preserves the continuous-time (CT) controller parameterization. The proposed approach is validated through experiments using a state-of-the-art moving-magnet planar actuator prototype.
|
|
16:20-16:40, Paper MoC15.2 | |
>Calculation Method of Static Friction Forces for Multi-Joint Manipulators |
|
Murayama, Yusuke | Kyushu Institute of Technology |
Fukui, Yoshiro | Kyushu Institute of Technology |
Keywords: Mechatronics, Modeling, Numerical algorithms
Abstract: The static friction force is an important element of practical problems, such as the model-based control design of mechanical systems described by the Euler-Lagrange equations. The difficulty in calculating the static friction force, represented by a discontinuous function, lies in the fact that the differential equations representing the equations of motion become discontinuous differential-algebraic equations (DAEs). To solve the discontinuous DAEs using numerical methods, we need to solve a non-differential implicit algebraic equation at each step. In this paper, we propose an algorithm for calculating the static friction force by solving the implicit algebraic equation. Theoretical analysis shows that the friction force exists uniquely. This ensures that the proposed algorithm obtains a unique value for the static friction force. Moreover, the number of iterations in the proposed algorithm is only 3^n at worst, where n denotes the number of manipulator joints. Hence, the proposed method matches numerical differential equation solvers, such as the Euler method. The effectiveness of the proposed method was validated through simulations using simple mechanical systems.
|
|
16:40-17:00, Paper MoC15.3 | |
>Nanometer Scanning with Micrometer Sensing: Beating Quantization Constraints in Lissajous Trajectory Tracking |
|
Lohse, Matheus | UFRGS |
Castro, Rafael da Silveira | Pontifical Catholic University of Rio Grande Do Sul |
Salton, Aurelio Tergolina | Universidade Federal Do Rio Grande Do Sul |
Fu, Minyue | Southern University of Science and Technology |
Keywords: Mechatronics, Nonlinear output feedback, Quantized systems
Abstract: This paper addresses the task of tracking Lissajous trajectories in the presence of quantized positioning sensors. To do so, theoretical results on tracking of continuous time periodic signals in the presence of output quantization are provided. With these results in hand, the application to Lissajous tracking is explored. The method proposed relies on the internal model principle and dispenses perfect knowledge of the system equations. Numerical results show that an arbitrary small scanning resolution is achievable despite large sensor quantization intervals.
|
|
17:00-17:20, Paper MoC15.4 | |
>Development of a Current-Position Control Strategy for Motion Systems Utilizing Nonlinear Reluctance Actuators (I) |
|
Al Saaideh, Mohammad | Memorial University of Newfoundland |
Al-Rawashdeh, Yazan Mohammad | Memorial University of Newfoundland |
Alatawneh, Natheer | CYSCA TECHNOLOGIES |
Aljanaideh, Khaled | Jordan University of Science and Technology |
Boker, Almuatazbellah | Virginia Tech |
Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics, Nonlinear systems, Output regulation
Abstract: Reluctance actuators (RA) can replace the current Lorentz actuators in the next generation of positioning and scanning motion systems, such as the wafer stage in lithography machines. However, the nonlinear output force characteristic and the gap dependency of the RA are the main challenges in using the RA to drive motion systems. In this paper, we design a two-loop control approach for a reluctance actuator motion system (RAMS) to achieve tracking performance for a desired motion profile. First, the current control loop linearizes the RA under different conditions. Next, the position control loop is designed using a PID control based on an extended-high gain observer to achieve a desired motion profile considering unknown dynamics in the system. The simulation results show the efficiency of the proposed current control in linearizing the dynamic behavior under different desired forces and nominal air gaps and achieving a frequency response similar to the spring-mass-damper system. Moreover, the position control can achieve different long- and short-stroke motion profiles with different amplitudes and ranges of motion.
|
|
17:20-17:40, Paper MoC15.5 | |
>Vision-Based Estimation of Cable Slab Forces in Precision Motion Applications (I) |
|
Al-Rawashdeh, Yazan Mohammad | Memorial University of Newfoundland |
Al Saaideh, Mohammad | Memorial University of Newfoundland |
Pumphrey, Michael Joseph | University of Guelph |
Alatawneh, Natheer | CYSCA TECHNOLOGIES |
Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics
Abstract: In this study, a novel method for estimating the cable slab internal and reaction forces is presented. Using visual feedback comprising a high-speed camera, the cable slab responses due to cyclic and acyclic motion profiles are recorded, and the positions of a sufficient finite set of markers attached to the cable slab are extracted off-line using image processing. The kinematics of the markers are obtained via numerical differentiation and signal processing. Adopting the Voigt viscoelastic model, the cable slab is segmented into several lumped mass-spring-damper elements whose linear and angular motions are expressed analytically. Using the strain and its rate of change during motion, estimates of internal and reaction forces are written as functions of the cable slab composite material properties that are identified by manual tuning. The link between these forces and the motion system kinematics reveals the tuning process of a proposed fixed-parameter feed- forward/feedback controller/compensator that can be used to enhance the precision of the motion system despite its simplicity. The experimental results reveal the effectiveness of the proposed approach.
|
|
17:40-18:00, Paper MoC15.6 | |
>Asymptotic Trajectory Tracking for Tilt-Rotor Quadcopters with Input-Nonlinearity Compensation |
|
Ijoga, Emmanuel Ogbanje | Embry Riddle Aeronautical University |
Kidambi, Krishna Bhavithavya | University of Dayton |
MacKunis, William | Embry-Riddle Aeronautical University |
Keywords: Control applications, Mechatronics, Robust control
Abstract: A tracking controller is presented for a tilt-rotor quadcopter (TRQ) system, which formally incorporates the uncertain nonlinear actuator dynamics inherent in the TRQ dynamic model. The result is achieved through innovative mathematical development of the error system dynamics, in which the non-affine TRQ actuation dynamics are re-cast in a control-affine form through the introduction of an input gain matrix Jacobian. A robust nonlinear control development is then utilized, which renders the closed-loop system asymptotically stable under proper control gain selection. The specific contributions of this result are three-fold: 1. An error system derivation that re-casts the non-affine TRQ actuator dynamics in terms of a control input-dependent Jacobian input-gain matrix; 2. A detailed controller development that employs a bank of dynamic filters to achieve velocity-only feedback in the control law; 3. A rigorous Lyapunov-based stability analysis that proves semi-global asymptotic trajectory tracking, where the region of convergence can be made arbitrarily large through appropriate control gain selection. To the best of the authors’ knowledge, this is the first result to prove asymptotic trajectory tracking of a non-affine TRQ system, where the error system development and stability analysis formally incorporate the uncertain nonlinear actuator dynamics in addition to norm-bounded exogenous disturbances. To complement the theoretical analysis, a detailed numerical simulation study is presented, which shows that the proposed control law achieves improved disturbance rejection and parametric actuator uncertainty compensation over a standard PD-like robust feedback control law.
|
|
MoC16 |
Suite 4 |
Fault Tolerant Systems |
Regular Session |
Chair: Habibi, Hamed | University of Luxembourg |
Co-Chair: Pazera, Marcin | University of Zielona Gora |
|
16:00-16:20, Paper MoC16.1 | |
>Fixed-Time Multi-UAV Cooperative Fault-Tolerant Control for Sensor and Actuator Faults |
|
Liu, Kun | National University of Defense Technology |
Zheng, Jiayi | National University of Defense Technology |
Zhao, Shulong | National University of Defense Technology |
Keywords: Fault tolerant systems, Cooperative control, Flight control
Abstract: In this research, a fixed-time cooperative fault-tolerant control (CFTC) protocol for multiple unmanned aerial vehicles (UAVs) formation is proposed. First, the assumption that faults are bounded is removed. A fixed-time observer (FTO) is utilized to estimate the pitot tube and actuator faults, which ensures that the estimation errors of the faults converge in a fixed time. Second, the norm-normalized sign function (NNSF) is introduced to make the control output change smoother and reduce the influence of faults, while simplifying the proof process. Finally, numerical simulations demonstrate the superior performance of the proposed CFTC compared to the existing work.
|
|
16:20-16:40, Paper MoC16.2 | |
>Towards Integrated Tracking Fault-Tolerant Control for Takagi-Sugeno Systems |
|
Pazera, Marcin | University of Zielona Gora |
Witczak, Marcin | University of Zielona Gora |
Keywords: Fault tolerant systems, Fuzzy systems, Predictive control for nonlinear systems
Abstract: This paper addresses the challenge of combining fault estimation and fault-tolerant control for constrained non-linear systems subject to bounded external disturbances. To handle the nonlinearities, the fuzzy Takagi-Sugeno (T-S) methodology is utilised. As a consequence, gains of the estimator and controller are calculated for all the vertices of the polytopic set. Specifically, the intertwined nature of control and estimation leads to mutual influence, necessitating a novel integration approach to avoid deteriorating the overall system performance. A new strategy is introduced, grounded on a critical existence condition that is both necessary and sufficient. The suggested method operates through output feedback and unfolds in two distinct phases: the off-line phase involves a straightforward optimization task utilizing the set of Linear Matrix Inequalities (LMIs), while the on-line phase tackles a deterministic model predictive control issue.
|
|
16:40-17:00, Paper MoC16.3 | |
>Fixed-Time Fault-Tolerant Control of Robotic Manipulator with Actuator Faults Based on Fixed-Time Extended State Observer |
|
Ren, Xing | University of Electronic Science and Technology of China |
Kou, Jiange | Beihang University |
Cao, Yuanchao | University of Electronic Science and Technology of China |
Guo, Qing | University of Electronic Science and Technology of China |
Chen, Zhenlei | University of Electronic Science and Technology of China |
Shi, Yan | Beihang University |
Li, Tieshan | University of Electronic Science and Technology of China |
Keywords: Fault tolerant systems, Robotics, Mechatronics
Abstract: This paper proposes a fixed-time fault-tolerant controller for n-DOF robotic manipulators with actuator partial loss of effectiveness (LOE) faults, external disturbances, and unknown nonlinearities. A novel fixed-time extended state observer (FxTESO) is designed to estimate joint velocities and lumped uncertainty, and the estimation error can theoretically be arbitrarily small by increasing the bandwidth. An adaptive law is designed to estimate an upper bound related to FxTESO error, which can enhance the controller robustness. The proposed controller can guarantee practical fixed-time stability of the manipulator, and require no velocity measurement and prior information about lumped uncertainty. The comparative experiments with the other state-of-the-art controllers on a 4-DOF manipulator under different external disturbances and actuator faults verify the superiority of the proposed controller.
|
|
17:00-17:20, Paper MoC16.4 | |
>Interval Observer-Based Active Fault Tolerant Control for Discrete-Time Uncertain Switched LPV Systems |
|
Nguyen, Duc To | University of Évry-Val d'Essonne - University of Paris-Saclay |
Mammar, Said | Université D'Evry |
Ichalal, Dalil | IBISC-Lab, Univ Evry, Paris Saclay University |
Keywords: Fault tolerant systems, Uncertain systems, Automotive control
Abstract: This paper presents a novel method for co-designing a TNL interval observer and fault-tolerant control (FTC) for discrete-time switched linear parameter-varying (LPV) systems. These systems are subject to faults, unknown but bounded uncertainties, state disturbances, and measurement noise. By introducing weighting matrices T and N, the design gains additional flexibility in calculating the observer gain matrices, ensuring the cooperative condition of estimation errors. The fault is incorporated into an augmented state vector, allowing the TNL interval observer to jointly estimate the lower and upper bounds of both the system state and faults. The FTC is then designed to compensate for the estimated fault and stabilize the closed-loop system in the presence of faults. Sufficient conditions for the stability of the proposed methodology are formulated as Linear Matrix Inequalities (LMIs), derived using the Input-to-State Stability (ISS) property with multiple Lyapunov functions and the Average Dwell Time (ADT) technique. The approach is applied to vehicle lateral dynamics, demonstrating its effectiveness in estimating lateral speed within a tight interval between the lower and upper bounds, while successfully controlling the yaw rate.
|
|
17:20-17:40, Paper MoC16.5 | |
>Fault-Tolerant Iterative Learning Control of Distributed Systems Via Reservoir Computing |
|
Patan, Krzysztof | University of Zielona Gora |
Patan, Maciej | University of Zielona Gora |
Keywords: Iterative learning control, Fault tolerant systems, Neural networks
Abstract: Fault-tolerant control for a wide spectrum of lumped processes is well established. Therefore, there is strong interest in developing designs that would produce similar flexibility for classes of distributed-parameter systems. This paper develops an iterative control design for application to repetitive systems capable of effective actuator fault accommodation using the echo-state neural network. The spatio-temporal dynamics in a multidimensional domain are reconstructed based on the measured system response, then the control is applied via an actuating field using a specific sensor/actuator network. Imposing a proper partitioning of the spatial domain, it is possible to delegate the complex system dynamics to an echo-state network with a dedicated structure. Then, the fault-tolerant control design is embedded into an iterative learning control scheme driven by measurement data to properly update the control inputs to adapt to possible actuator fault states. The new design is applied to displacement control of the elastic plate.
|
|
17:40-18:00, Paper MoC16.6 | |
>A Super-Twisting Observer Design for Thrust-Loss Fault Tolerant Control of Quadrotor Vehicles |
|
Venkateswara Rao, Dasari Mohan Krishna Kishor | University of Luxembourg |
Habibi, Hamed | University of Luxembourg |
Menon, Prathyush P | Faculty of Environment, Science and Economy |
Edwards, Christopher | University of Exeter |
Voos, Holger | University of Luxembourg |
Keywords: Observers for nonlinear systems, Flight control, Fault tolerant systems
Abstract: In this paper, we tackle the thrust loss problem in a quadrotor unmanned aerial vehicle, by the design of a super-twisting observer in the body frame, achieving finite-time convergence. On this basis, we design a super-twisting low-level controller to track the desired trajectory and compensate for the loss effects. Then, the acceleration commands are designed, and the thrust, Euler angles, and angular speeds are computed. The stability and the finite-time convergence of the resultant cascade system are studied, and we show the recovery of the nominal behavior, as well as the weak recovery of the separation principle. Finally, high-fidelity experimental analyses are conducted, to evaluate the performance of the proposed algorithm in the presence of a variety of thrust loss situations.
|
|
MoC17 |
Suite 6 |
Biomedical and Biomolecular Systems |
Regular Session |
Chair: Visioli, Antonio | University of Brescia |
Co-Chair: Del Favero, Simone | University of Padova |
|
16:00-16:20, Paper MoC17.1 | |
>Closed-Loop Peripheral Nerve Stimulation for the Restoration of Normal Pain Processing (I) |
|
Myers, Patrick | Johns Hopkins University |
Hwang, Yun | Stanford |
Ehrens, Daniel | Stanford |
Christine, Beauchene | MIT Lincoln Labs |
Cui, Xiang | Johns Hopkins University |
Khasabov, Sergey | University of Minnesota |
Guan, Yun | Johns Hopkins University |
Sarma, Sridevi | Johns Hopkins University |
Keywords: Biomedical, Biological systems, Biotechnology
Abstract: Neuropathic chronic pain, caused by nerve injury, is common and debilitating. Pharmaceuticals, the primary treatment, often lead to severe side effects. Electrical stimulation is a promising alternative with fewer side effects, but its efficacy is limited. Most clinical neuromodulation methods are open-loop, with settings adjusted during office visits, leading to reduced effectiveness over time. Proposed closed-loop strategies typically activate when a pain signal crosses a threshold, acting like an anesthetic by removing all pain. However, since acute pain has a protective role, the ideal therapy should target chronic pain while preserving acute pain responses. We present a proof-of-concept for a model-based closed-loop neuromodulation therapy aimed at normalizing pain responses in neuropathic rats. By measuring evoked pain responses from thalamic local field potentials in nerve-injured and naïve rats, we estimate transfer functions for both groups. We then use a model matching approach to quantify the difference between the nerve-injured and ideal (naïve) system responses as the error signal. Finally, we design an optimal controller using H-infinity methods to minimize this error and control input within relevant frequency bands. The resulting in-silico experiment provides a foundation for a more biologically realistic controller design which can be implemented in-vivo.
|
|
16:20-16:40, Paper MoC17.2 | |
>Experimental Evaluation of Analgesia with an Event-Based PID Control Strategy for Anesthesia |
|
Schiavo, Michele | Università Degli Studi Di Brescia |
Latronico, Nicola | University of Brescia |
Paltenghi, Massimiliano | ASST Spedali Civili Brescia |
Visioli, Antonio | University of Brescia |
Keywords: Biomedical, PID control, Control applications
Abstract: The control problem of total intravenous anesthesia (TIVA) is inherently multivariable. Indeed, to obtain the desired level of hypnosis and analgesia, it is necessary to administer two specific drugs, respectively propofol and remifentanil. In this context, multiple-input multiple-output (MIMO) control structures should be implemented to provide a tight control of both anesthesia components while also taking their interactions into account. However, to date, the development of MIMO controllers has been hindered by the lack of a reliable indicator of analgesia and, thus, multiple-input single-output (MISO) solutions have been mainly developed. Recently, the introduction of the Conox monitoring system has opened new research opportunities. Indeed, it provides the qCON and the qNOX indexes which give a measure of hypnosis and analgesia, respectively. In this paper, we exploited the qCON as the feedback variable for an event-based PID MISO controller and we recorded the qNOX to evaluate the quality of analgesia. A clinical experiment has been performed on four patients and the results obtained showed that the controller can provide a good performance both in terms of hypnosis and analgesia, thus demonstrating its robustness and the soundness of the proposed MISO architecture.
|
|
16:40-17:00, Paper MoC17.3 | |
>Including the Planning of Orally-Ingested Carbohydrates in a Zone Model Predictive Control for Artificial Pancreas |
|
Cester, Lorenzo | Università Degli Studi Di Padova |
Pavan, Jacopo | University of Padova |
Del Favero, Simone | University of Padova |
Keywords: Biomedical, Predictive control for linear systems, Human-in-the-loop control
Abstract: Artificial Pancreas (AP) is a technology that assists people with type-1 diabetes (T1D) in the challenge of administering exogenous insulin in order to maintain the blood glucose concentration in a safe range. In this work, we consider a zone Model Predictive Control (zone-MPC) with asymmetric costs, and we add to it the capability to suggest the assumption of carbohydrates (CHO) to the patient. To produce sparse-in-time and quantized CHO suggestions, a Mixed Integer Programming (MIP) formulation of the MPC problem is considered. The resulting algorithm is tuned and validated on the UVa/Padova T1D Simulator, an accurate model of T1D metabolism, approved by the U.S. Food and Drug Administration (FDA) for pre-clinical testing of T1D therapies. The results show that the zone-MPC + CHO suggestions allows to increase the time spent in the blood-glucose safe range on average to 81.2 %, with respect to 78.4 % achieved by the insulin-only zone-MPC. This is achieved while consistently cutting down the time in hypoglycemia, from 1.6 % to 0.0 % in median.
|
|
17:00-17:20, Paper MoC17.4 | |
>Human-Imitating Control of Depth of Hypnosis Combining MPC and Event-Based PID Strategies |
|
Milanesi, Marco | Dipartimento Di Ingegneria Meccanica E Industriale |
Consolini, Luca | Università Di Parma |
Di Credico, Giulia | University of Parma |
Latronico, Nicola | University of Brescia |
Laurini, Mattia | Università Degli Studi Di Parma |
Paltenghi, Massimiliano | ASST Spedali Civili Brescia |
Schiavo, Michele | Università Degli Studi Di Brescia |
Visioli, Antonio | University of Brescia |
Keywords: Biomedical, Predictive control for linear systems, PID control
Abstract: In this paper we propose a human-imitating control methodology for the Depth-of-Hypnosys (DoH) in total intravenous anesthesia (TIVA) where the bispectral index (BIS) is used as process variable. The method suitably combines a move-blocking model predictive controller (MPC) that minimizes the time-to-target when the BIS value is not in the required range and an event-based Proportional-Integral-Derivative (PID) controller that provides a strong filtering action when the DoH is satisfactory. A fast induction is achieved and awareness episodes are avoided with a control action that is piecewise constant so that it mimics the behavior of the anesthesiologist and is more likely to be accepted in the clinical practice.
|
|
17:20-17:40, Paper MoC17.5 | |
>Assessing Feasibility in Resource Limited Genetic Networks |
|
Celeste Junior, Carlos Eduardo | Massachusetts Institute of Technology |
Grunberg, Theodore W. | Massachusetts Institute of Technology |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Biomolecular systems, Genetic regulatory systems, Systems biology
Abstract: The design of biological systems is a challenging endeavor due to the lack of modularity caused by context effects, such multiple gene expression modules sharing a limited pool of resources. This work considers the problem of determining when specifications on the steady state system behavior can be met for suitable parameter choices, while accounting for resource sharing. We establish necessary and sufficient conditions for the feasibility of a specification for a given network of subsystems that share both production and degradation resources. This extends previous work that considered just production resource sharing and thus lays the foundation for the development of co-design techniques for genetic networks with both production and degradation resources, where one may be able to mitigate the effects of one type of resource sharing by tuning the other.
|
|
17:40-18:00, Paper MoC17.6 | |
>AC-BioSD : A Biomolecular Signal Differentiator Module with Enhanced Performance |
|
Alexis, Emmanouil | Princeton University |
Avalos, José | Princeton |
Cardelli, Luca | University of Oxford |
Papachristodoulou, Antonis | University of Oxford |
Keywords: Biomolecular systems, PID control, Genetic regulatory systems
Abstract: Temporal gradient estimation is a pervasive phenomenon in natural biological systems and holds great promise for synthetic counterparts with broad-reaching applications. Here, we advance the concept of BioSD signal differentiation by introducing a novel biomolecular topology, termed AC-BioSD. Its structure allows for insensitivity to input signal changes and high precision in terms of signal differentiation, even when operating far from nominal conditions. Concurrently, disruptive high-frequency signal components are effectively attenuated. In addition, the usefulness of our topology in biological regulation is highlighted via a PID bio-control scheme with set point weighting and filtered derivative action in both the deterministic and stochastic domains.
|
|
MoC18 |
Suite 7 |
Linear Systems III |
Regular Session |
Chair: Teutsch, Johannes | Technical University of Munich |
Co-Chair: Pepe, Pierdomenico | University of L' Aquila |
|
16:00-16:20, Paper MoC18.1 | |
>On LMIs for Switching Systems with Switching Constraints Described by Digraphs and Non-Uniform Dwell-Time Ranges |
|
Pepe, Pierdomenico | University of L' Aquila |
Keywords: LMIs, Switched systems, Linear systems
Abstract: In this paper novel linear matrix inequalities (LMIs) are provided for checking the exponential stability of continuous-time linear switching systems. Novelty arises from the possibility of efficiently addressing, by these LMIs, constraints in switching as described by a switches digraph and dwell-time ranges for each mode, which are allowed to be nonuniform. Switches digraphs and dwell-time ranges may allow stability even in case of unstable subsystems in the family. The provided LMIs give the chance to show stability in this rather generally and naturally constrained case. The results provided in this paper are the continuous-time counterpart of analogous ones recently obtained for the linear discrete-time case. The utility of provided LMIs is shown with two numerical examples concerning controller failures.
|
|
16:20-16:40, Paper MoC18.2 | |
>From Data-Driven to Model-Driven Learning Via Structured Dynamic Mode Decomposition |
|
Bonenberger, Christopher Martin Amadeus | Ulm University |
Scholz, Stephan | University of Applied Sciences Ravensburg-Weingarten |
Schneider, Markus | Ravensburg-Weingarten University of Applied Sciences |
Keywords: Modeling, Linear systems, Machine learning
Abstract: We consider the modeling process of physics-informed Dynamic Mode Decomposition (piDMD), proposing a novel formalism that allows to formulate physical principles in a detailed fashion. More precisely, we constrain the solution space by explicitly specifying the system to be estimated as a sum of elementary matrices with desired properties. Moreover, we augment this approach towards DMD with control and provide closed-form solutions for a wide class of shift-equivariant systems. Finally, the proposed approach opens an alternative perspective on piDMD that also allows for efficient implementations.
|
|
16:40-17:00, Paper MoC18.3 | |
>Stability Conditions for Structured Multi-Agent Linear Systems |
|
Zattoni, Elena | Alma Mater Studiorum Università Di Bologna |
Perdon, Anna Maria | Ancona |
Conte, Giuseppe | Universita' Politecnica Delle Marche |
Keywords: Linear systems, Stability of linear systems, Agents-based systems
Abstract: In this work, multi-agent systems which consist of a finite set of agents, featuring linear dynamics and influencing each other in a linear way, are considered. On the assumption that the topology of the communication network that connects the various agents is known, while the gains of the communication channels, which can assume any real value, are unknown, the multi-agent system is modeled as a structured system whose dynamics is defined by a set of mutually independent real parameters. In this context, structural properties are defined as properties which hold for all the values that the parameters can take and, in particular, this work is focused on the study of the structural stability of the overall multi-agent system. In the case of interest, where all the agents are assumed to have asymptotically stable dynamics, it is shown that a necessary condition for the structural asymptotic stability of the multi-agent system is that the graph describing the relations between the state variables of the agents does not contain cycles of a special kind, defined herein as simple outer cycles. Namely, if the graph contains simple outer cycles, then the overall multi-agent system is not structurally asymptotically stable.
|
|
17:00-17:20, Paper MoC18.4 | |
>Passivity of Linear Singularly Perturbed Systems |
|
Susca, Mircea | Technical University of Cluj-Napoca |
Mihaly, Vlad Mihai | Technical University of Cluj-Napoca |
Lendek, Zsofia | Technical University of Cluj-Napoca |
Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Keywords: Linear systems, Stability of linear systems
Abstract: The passivity of singularly perturbed systems (SPSs) is generally studied without taking advantage of the time-scale separation present in this class of systems. To fill this gap, the objective of this letter is to provide easy-to-verify well-posed conditions characterizing the passivity of a perturbation variable-dependent SPS starting from the passivity of its associated reduced-order system. To achieve this goal, we rely on the connection between positive realness and passivity, as well as the notion of phase for multi-input multi-output (MIMO) systems. We use a benchmark DC motor to illustrate that classical reasoning used for stability analysis of SPSs, which is based on the stability of the reduced-order (slow) and boundary layer (fast) subsystems, cannot be applied to guarantee the passivity of an SPS. On top of that, our methodology explains how the time-scale separation can be used to analyze the passivity of general linear time-invariant (LTI) systems. The approach is illustrated on a numerical example.
|
|
17:20-17:40, Paper MoC18.5 | |
>Adaptive Stochastic Predictive Control from Noisy Data: A Sampling-Based Approach |
|
Teutsch, Johannes | Technical University of Munich |
Narr, Christopher | Technical University of Munich |
Kerz, Sebastian | Technical University Munich |
Wollherr, Dirk | Technische Universität München |
Leibold, Marion | TU Muenchen |
Keywords: Linear systems, Predictive control for linear systems, Uncertain systems
Abstract: In this work, an adaptive predictive control scheme for linear systems with unknown parameters and bounded additive disturbances is proposed. In contrast to related adaptive control approaches that robustly consider the parametric uncertainty, the proposed method handles all uncertainties stochastically by employing an online adaptive sampling-based approximation of chance constraints. The approach requires initial data in the form of a short input-output trajectory and distributional knowledge of the disturbances. This prior knowledge is used to construct an initial set of data-consistent system parameters and a distribution that allows for sample generation. As new data stream in online, the set of consistent system parameters is adapted by exploiting set membership identification. Consequently, chance constraints are deterministically approximated using a probabilistic scaling approach by sampling from the set of system parameters. In combination with a robust constraint on the first predicted step, recursive feasibility of the proposed predictive controller and closed-loop constraint satisfaction are guaranteed. A numerical example demonstrates the efficacy of the proposed method.
|
|
17:40-18:00, Paper MoC18.6 | |
>Moment Matching for Second-Order Systems with Pole-Zero Placement |
|
Cheng, Xiaodong | Wageningen University and Research |
Ionescu, Tudor C. | Politehnica Uni. of Bucharest & Romanian Acad |
Iftime, Orest V. | University of Groningen |
Necoara, Ion | Universitatea Nationala De Stiinta Si Tehnologie POLITEHNICA Buc |
Keywords: Linear systems, Reduced order modeling
Abstract: In this paper, a structure-preserving model reduction problem for second-order dynamical systems of high dimension using time-domain moment matching with pole-zero placement is studied. The moments of a second-order system are defined based on the solutions of linear matrix equations. Families of second-order reduced models, parametrized in a set of matrix degrees of freedom, that match the moments of a given second-order system at selected interpolation points are computed. We then provide formulae for the set of matrix parameters such that the reduced order approximation has a set of prescribed poles and zeros. The theory is illustrated on a damped vibratory system (e.g., a chain of mechanical oscillators) of degree n, governed by a second order dynamical model.
|
|
MoC19 |
Suite 8 |
Stochastic Optimal Control III |
Regular Session |
Chair: Tsiotras, Panagiotis | Georgia Institute of Technology |
Co-Chair: Komaee, Arash | Southern Illinois University |
|
16:00-16:20, Paper MoC19.1 | |
>Optimal Control of a Large Population of Interacting Agents with Partial Observations of a Lead Agent |
|
Komaee, Arash | Southern Illinois University |
Keywords: Stochastic optimal control, Large-scale systems, Linear systems
Abstract: This paper considers a linear Gaussian state-space model to describe the dynamics of a system of many interacting, identical agents, and the partial observations of their aggregate state via the measurements of the state of a single lead agent. These partial observations are exploited by an optimal control law that modifies the collective dynamics of the agents, aimed to minimize a quadratic performance measure. The structure of this optimal control law is studied for finite number of agents, and its convergence to a limit that is optimal for infinitely many agents is examined. This limit is obtained as the solution to an optimal control problem stated for a finite-dimensional reduced order model representing infinitely many agents.
|
|
16:20-16:40, Paper MoC19.2 | |
>Optimal Dispatch of Hybrid Renewable–Battery Storage Resources: A Stochastic Control Approach |
|
Aung, Thiha | University of California Santa Barbara |
Ludkovski, Mike | University of California Santa Barbara |
Keywords: Stochastic optimal control, Machine learning, Power systems
Abstract: We study the daily operation of hybrid energy resources that couple a renewable generator with a battery energy storage system (BESS). We propose a dynamic stochastic control formulation for optimal dispatch of BESS to maximize the reliability of the hybrid asset relative to a given day-ahead dispatch target or forecast. We develop a machine-learning algorithm based on Gaussian Process regression to efficiently find the dynamic feedback control map. Several numerical case studies highlight the flexibility and extensibility of our methodology, including the ability to consider alternative objectives, such as peak shaving. We also provide a sensitivity analysis with respect to the energy capacity and power rating of the BESS.
|
|
16:40-17:00, Paper MoC19.3 | |
>Optimal Control of Diffusion Processes: Infinite-Order Variational Analysis and Numerical Solution |
|
Chertovskih, Roman | University of Porto |
Pogodaev, Nikolay | Università Degli Studi Di Padova |
Staritsyn, Maxim | Faculty of Engineering, University of Porto |
Aguiar, A. Pedro | Faculty of Engineering, University of Porto |
Keywords: Optimal control, Stochastic optimal control, Numerical algorithms
Abstract: We tackle a nonlinear optimal control problem for a stochastic differential equation in Euclidean space and its state-linear counterpart for the Fokker-Planck-Kolmogorov equation in the space of probabilities. Our approach is founded on a novel concept of local optimality, stronger than conventional Pontryagin’s minimum and originally crafted for deterministic optimal ensemble control problems. A key practical outcome is a rapidly converging numerical algorithm, which proves its feasibility for problems involving Markovian and open-loop strategies.
|
|
17:00-17:20, Paper MoC19.4 | |
>Discrete-Time Maximum Likelihood Neural Distribution Steering |
|
Rapakoulias, George | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Neural networks, Nonlinear systems
Abstract: This paper studies the problem of steering the distribution of a discrete-time dynamical system from an initial distribution to a target distribution in finite time. The formulation is fully nonlinear, allowing the use of general control policies, parametrized by neural networks. Although similar solutions have been explored in the continuous-time context, extending these techniques to systems with discrete dynamics is not trivial. The proposed algorithm results in a regularized maximum likelihood optimization problem, which is solved using machine learning techniques. After presenting the algorithm, we provide several numerical examples that illustrate the capabilities of the proposed method. We start from a simple problem that admits a solution through semidefinite programming, serving as a benchmark for the proposed approach. Then, we employ the framework in more general problems that cannot be solved using existing techniques, such as problems with non-Gaussian boundary distributions and non-linear dynamics.
|
|
17:20-17:40, Paper MoC19.5 | |
>Dubins Vehicle Intercept of a Brownian Target on a Sphere |
|
Milutinovic, Dejan | University of California, Santa Cruz |
Casbeer, David W. | Air Force Research Laboratory |
Keywords: Stochastic optimal control, Nonholonomic systems, Aerospace
Abstract: This paper focuses on a navigation of a Dubins vehicle (DV) to intercept a moving target on a sphere. The uncertainty in the target motion is described by a Brownian motion model. An Ito-type stochastic differential equation for the relative motion between DV and the Brownian moving target is derived. This stochastic differential equation is used to formulate an intercept controller as a minimum time optimal controller. The properties of the controller are incorporated into the numerical computations of the controller, which by its construction can intercept a Brownian target. Our numerical simulation results show that the computed controller can intercept both stationary and moving targets on the sphere.
|
|
17:40-18:00, Paper MoC19.6 | |
>Chance-Constrained Gaussian Mixture Steering to a Terminal Gaussian Distribution |
|
Kumagai, Naoya | Purdue University |
Oguri, Kenshiro | Purdue University |
Keywords: Stochastic optimal control, Stochastic systems, Uncertain systems
Abstract: We address the problem of finite-horizon control of a discrete-time linear system, where the initial state distribution follows a Gaussian mixture model, the terminal state must follow a specified Gaussian distribution, and the state and control inputs must obey chance constraints. We show that, throughout the time horizon, the state and control distributions are fully characterized by Gaussian mixtures. We then formulate the cost, distributional terminal constraint, and affine/2-norm chance constraints on the state and control, as convex functions of the decision variables. This is leveraged to formulate the chance-constrained path planning problem as a single convex optimization problem. A numerical example demonstrates the effectiveness of the proposed method.
|
|
MoC20 |
Suite 9 |
Control with Limited Information |
Regular Session |
Chair: Mattioni, Mattia | Università Degli Studi Di Roma La Sapienza |
Co-Chair: Panteley, Elena | CNRS |
|
16:00-16:20, Paper MoC20.1 | |
>Trajectory Tracking of Unicycles under Sampling and Discrete-Time Passivity-Based Control |
|
Mattioni, Mattia | Università Degli Studi Di Roma La Sapienza |
Monaco, Salvatore | Università Di Roma |
Normand-Cyrot, Dorothée | CNRS-CentraleSupélec-Univ. ParisSaclay |
Keywords: Sampled-data control, Lyapunov methods, Nonholonomic systems
Abstract: The paper provides a new sampled-data and single-rate control law for trajectory tracking of unicycles under sampling. Assuming the reference generated by a continuous-time input sequence that is persistently exciting the design consists of two phases: first a new discrete-time generator is proposed; then, asymptotic tracking is achieved via a discrete-time IDA-PBC strategy. Simulations illustrate the result of enhancing the efficiency of the proposed control law.
|
|
16:20-16:40, Paper MoC20.2 | |
>On the Reachability of Stable Linear Systems with Quantized Input Alphabet |
|
Fan, Donglei | NVIDIA Semiconductor Technology (Beijing) Co. Ltd |
Keywords: Quantized systems, Linear systems
Abstract: In this paper, we study linear systems with a finitely quantized input alphabet. We give motivations for a new notion of reachability for this class of systems with quantized input, and propose a notion of local asymptotic reachability. We then derive a sufficient condition and a necessary condition for reachability, and propose an algorithmic procedure to verify these conditions.
|
|
16:40-17:00, Paper MoC20.3 | |
>Quantized Predictor-Based Tracking Control for Input Delay Systems: Application to a Helicopter System |
|
Schlanbusch, Siri Marte | University of Agder |
Zhou, Jing | University of Agder |
Schlanbusch, Rune | NORCE Norwegian Research Centre |
Keywords: Delay systems, Quantized systems
Abstract: In this paper, the tracking control problem is studied for systems with quantized signals and communication delays. A bounded type of quantizer is introduced to reduce the communication transmission burden, where both control inputs and states are quantized. A predictor-feedback controller is designed to compensate for delays in the inputs, while boundedness of the tracking errors are achieved and practical stability is ensured. An application to a helicopter system is presented to show the effectiveness of the proposed predictor-based controller in a closed-loop configuration, and to illustrate the main findings.
|
|
17:00-17:20, Paper MoC20.4 | |
>Distributed State Estimation of Linear Systems under Jointly Connected Directed Switching Networks |
|
Basu, Himadri | University of California Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Distributed control, Switched systems
Abstract: This paper investigates the distributed state estimation problem by a group of networked agents for jointly observable linear systems in the context of directed dynamic interaction topologies. In this setting, no individual agent can fully observe the system state using only its own measurements. We extend existing decentralized observer design methodologies to more complex scenarios, where the communication topology is directed, aperiodic and switching, and the system measurements may be available either continuously or at sporadic time instants. Leveraging the joint observability condition, and assuming a mild mutual reachability condition over specific time intervals, we demonstrate that the unobservable subspaces of each agent can be reconstructed using the observable subspaces of other agents in the network. The effectiveness of the proposed approach is illustrated through a numerical example.
|
|
17:20-17:40, Paper MoC20.5 | |
>Robust Control Design for Thruster Driven Autonomous Underwater Vehicle |
|
Gurung, Diwakar | Indian Institute of Technology Kharagpur |
Gedam, Vidit | Indian Institute of Technology Kharagpur |
Jagadale, Swapnil Laxman | Indian Institute of Technology Kharagpur |
Kumar, C.S. | IIT, Kharagpur |
Nagarajan, Vishwanath | IIT Kharagpur |
Keywords: Feedback linearization, Autonomous robots, Robust control
Abstract: The paper presents a robust control strategy for an autonomous underwater vehicle (AUV) based on uncertainty and disturbance estimation (UDE) theory. A robust control design is essential for underwater vehicles due to their highly nonlinear dynamics and the presence of external disturbances. An α-UDE based control design has been formulated for the pitch and depth control of an axisymmetric testbed AUV that is entirely controlled by thrusters. Here, the first-order α-filter is used to estimate the uncertainties and disturbance alongside a feedback control law based on feedback linearization (FL) approach. We present the comparative numerical simulations to show the effectiveness of the proposed control system.
|
|
17:40-18:00, Paper MoC20.6 | |
>Orbital Control for Swimming in Underwater Snake Robots Using Energy-Shaping and Consensus Control |
|
Lysø, Mads Erlend Bøe | Norwegian University of Science and Technology (NTNU) |
Panteley, Elena | CNRS |
Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Gravdahl, Jan Tommy | Norwegian Univ. of Science & Tech |
Keywords: Maritime control, Distributed control, Lyapunov methods
Abstract: In this paper, we present a controller for stabilizing and synchronizing a set of phase-shifted oscillations in a system of n double integrators. The motivating application is the control of swimming locomotion in an underwater snake robot. The oscillations are stabilized to a desired amplitude and frequency, and synchronized to achieve a desired phase shift between each oscillator. The result is a controller which stabilizes the desired gait while avoiding the undesirable transient behavior (colloquially known as ”catching-up”-effects) associated with the tracking-based methods which are currently prevalent in the field. The controller is based on energy-shaping control and consensus control, and it is shown to render the desired orbit almost globally asymptotically stable. The emergence and stability of the desired behavior is demonstrated in a simulation study.
|