| |
Last updated on September 22, 2023. This conference program is tentative and subject to change
Technical Program for Thursday December 14, 2023
|
ThA01 Tutorial Session, Orchid Main 4202-4306 |
Add to My Program |
Analysis and Design of Optimization Algorithms Using Tools from Control
Theory |
|
|
Chair: Van Scoy, Bryan | Miami University |
Co-Chair: Lessard, Laurent | Northeastern University |
Organizer: Lessard, Laurent | Northeastern University |
Organizer: Van Scoy, Bryan | Miami University |
|
10:00-10:20, Paper ThA01.1 | Add to My Program |
Optimization Algorithm Synthesis Based on Integral Quadratic Constraints: A Tutorial (I) |
|
Scherer, Carsten W. | University of Stuttgart |
Ebenbauer, Christian | RWTH Aachen University |
Holicki, Tobias | University of Stuttgart |
Keywords: Optimization algorithms, Robust control, LMIs
Abstract: We expose in a tutorial fashion the mechanisms which underlie the synthesis of optimization algorithms based on dynamic integral quadratic constraints. We reveal how these tools from robust control allow to design accelerated gradient descent algorithms with optimal guaranteed convergence rates by solving small-sized convex semi-definite programs. It is shown that this extends to the design of extremum controllers, with the goal to regulate the output of a general linear closed-loop system to the minimum of an objective function. Numerical experiments illustrate that we can not only recover gradient decent and the triple momentum variant of Nesterov's accelerated first order algorithm, but also automatically synthesize optimal algorithms even if the gradient information is passed through non-trivial dynamics, such as time-delays.
|
|
10:20-10:40, Paper ThA01.2 | Add to My Program |
A Tutorial on a Lyapunov-Based Approach to the Analysis of Iterative Optimization Algorithms (I) |
|
Van Scoy, Bryan | Miami University |
Lessard, Laurent | Northeastern University |
Keywords: Optimization algorithms, Robust control, Lyapunov methods
Abstract: Iterative gradient-based optimization algorithms are widely used to solve difficult or large-scale optimization problems. There are many algorithms to choose from, such as gradient descent and its accelerated variants such as Polyak's Heavy Ball method or Nesterov's Fast Gradient method. It has long been observed that iterative algorithms can be viewed as dynamical systems, and more recently, as robust controllers. Here, the "uncertainty" in the dynamics is the gradient of the function being optimized. Therefore, worst-case or average-case performance can be analyzed using tools from robust control theory, such as integral quadratic constraints (IQCs). In this tutorial paper, we show how such an analysis can be carried out using an alternative Lyapunov-based approach. This approach recovers the same performance bounds as with IQCs, but with the added benefit of constructing a Lyapunov function.
|
|
10:40-11:00, Paper ThA01.3 | Add to My Program |
A Tutorial on the Structure of Distributed Optimization Algorithms (I) |
|
Van Scoy, Bryan | Miami University |
Lessard, Laurent | Northeastern University |
Keywords: Optimization algorithms
Abstract: We consider the distributed optimization problem for a multi-agent system. Here, multiple agents cooperatively optimize an objective by sharing information through a communication network and performing computations. In this tutorial, we provide an overview of the problem, describe the structure of its algorithms, and use simulations to illustrate some algorithmic properties based on this structure.
|
|
11:00-11:20, Paper ThA01.4 | Add to My Program |
Interpolation Constraints for Computing Worst-Case Bounds in Performance Estimation Problems (I) |
|
Rubbens, Anne | UCLouvain |
Bousselmi, Nizar | UCLouvain |
Colla, Sebastien | UCLouvain |
Hendrickx, Julien M. | UCLouvain |
Keywords: Optimization, Optimization algorithms, Computer-aided control design
Abstract: The Performance Estimation Problem (PEP) approach consists in computing worst-case performance bounds on optimization algorithms by solving an optimization problem: one maximizes an error criterion over all initial conditions allowed and all functions in a given class of interest. The maximal value is then a worst-case bound, and the maximizer provides an example reaching that worst case. This approach was introduced for optimization algorithms but could in principle be applied to many other contexts involving worst-case bounds. The key challenge is the representation of infinite-dimensional objects involved in these optimization problems such as functions, and complex or non-convex objects as linear operators and their powers, networks in decentralized optimization etc. This challenge can be resolved by interpolation constraints, which allow representing the effect of these objects on vectors of interest, rather than the whole object, leading to tractable finite dimensional problems. We review several recent interpolation results and their implications in obtaining of worst-case bounds via PEP.
|
|
11:20-11:40, Paper ThA01.5 | Add to My Program |
On Fundamental Proof Structures in First-Order Optimization (I) |
|
Goujaud, Baptiste | Ecole Polytechnique |
Dieuleveut, Aymeric | Ecole Polytechnique |
Taylor, Adrien | Inria/Ecole Normale Supérieure |
Keywords: Optimization algorithms, Optimization, LMIs
Abstract: First-order optimization methods have attracted a lot of attention due to their practical success in many applications, including in machine learning. Obtaining convergence guarantees and worst-case performance certificates for first-order methods have become crucial for understanding ingredients underlying efficient methods and for developing new ones. However, obtaining, verifying, and proving such guarantees is often a tedious task. Therefore, a few approaches were proposed for rendering this task more systematic, and even partially automated. In addition to helping researchers finding convergence proofs, these tools provide insights on the general structures of such proofs. We aim at presenting those structures, showing how to build convergence guarantees for first-order optimization methods.
|
|
ThA02 Invited Session, Melati Main 4001AB-4104 |
Add to My Program |
Learning-Based Control III: Model Learning, Analysis and Control |
|
|
Chair: Schoellig, Angela P | University of Toronto |
Co-Chair: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
10:00-10:20, Paper ThA02.1 | Add to My Program |
Physically Consistent Multiple-Step Data-Driven Predictions Using Physics-Based Filters |
|
Lian, Yingzhao | EPFL |
Shi, Jicheng | EPFL |
Jones, Colin N. | EPFL |
Keywords: Sampled-data control, Building and facility automation, Predictive control for linear systems
Abstract: Data-driven control can facilitate the rapid development of controllers, offering an alternative to conventional approaches. In order to maintain consistency between any known underlying physical laws and a data-driven decision-making process, preprocessing of raw data is necessary to account for measurement noise and any inconsistencies it may introduce. In this paper, we present a physics-based filter to achieve this and demonstrate its effectiveness through practical applications, using real-world datasets collected in a building on the École Polytechnique Fédérale de Lausanne (EPFL) campus. Two distinct use cases are explored: indoor temperature control and demand response bidding.
|
|
10:20-10:40, Paper ThA02.2 | Add to My Program |
Data-Driven Feedback Linearization with Complete Dictionaries (I) |
|
De Persis, Claudio | University of Groningen |
Gadginmath, Darshan | University of California, Riverside |
Pasqualetti, Fabio | University of California, Riverside |
Tesi, Pietro | Universitŕ Degli Studi Di Firenze |
Keywords: Data driven control, Nonlinear systems, Feedback linearization
Abstract: We consider the feedback linearization problem, and contribute with a new method that can learn the linearizing controller from a library (a dictionary) of candidate functions. When the dynamics of the system is known, the method boils down to solving a set of linear equations. Remarkably, the same idea extends to the case in which the dynamics of the system is unknown and a linearizing controller must be found using experimental data. In particular, we derive a simple condition (checkable from data) to assess when the linearization property holds over the entire state space of interest and not just on the dataset used to determine the solution. We also discuss important research directions on this topic.
|
|
10:40-11:00, Paper ThA02.3 | Add to My Program |
Unconstrained Parametrization of Dissipative and Contracting Neural Ordinary Differential Equations (I) |
|
Martinelli, Daniele | École Polytechnique Fédérale De Lausanne |
Galimberti, Clara Lucía | École Polytechnique Fédérale De Lausanne |
Manchester, Ian R. | University of Sydney |
Furieri, Luca | EPFL |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Neural networks, Stability of nonlinear systems, Machine learning
Abstract: In this work, we introduce and study a class of Deep Neural Networks (DNNs) in continuous-time. The proposed architecture stems from the combination of Neural Ordinary Differential Equations (Neural ODEs) with the model structure of recently introduced Recurrent Equilibrium Networks (RENs). We show how to endow our proposed NodeRENs with contractivity and dissipativity --- crucial properties for robust learning and control. Most importantly, as for RENs, we derive parametrizations of contractive and dissipative NodeRENs which are unconstrained, hence enabling their learning for a large number of parameters. We validate the properties of NodeRENs, including the possibility of handling irregularly sampled data, in a case study in nonlinear system identification.
|
|
11:00-11:20, Paper ThA02.4 | Add to My Program |
Abstracting Linear Stochastic Systems Via Knowledge Filtering (I) |
|
Engelaar, Maico Hendrikus Wilhelmus | Eindhoven University of Technology |
Romao, Licio | University of Oxford |
Gao, Yulong | University of Oxford |
Lazar, Mircea | Eindhoven University of Technology |
Abate, Alessandro | University of Oxford |
Haesaert, Sofie | Eindhoven University of Technology |
Keywords: Formal Verification/Synthesis, Stochastic systems, Reduced order modeling
Abstract: In this paper, we propose a new model reduction technique for linear stochastic systems that builds upon knowledge filtering and utilizes optimal Kalman filtering techniques. This new technique will reduce the dimension of the noise disturbance and will allow any controller designed for the reduced model to be refined into a controller for the original stochastic system, while preserving any specification on the output. Although initially the reduced model will be time-varying, a method will be provided with which the reduced model can become time-invariant if it satisfies some minor technical conditions. We present our theoretical findings with an example that supports the proposed framework and illustrates how model reduction and controller refinement of stochastic systems can be achieved. We finish the paper by considering specific examples to analyze both completeness with respect to controller synthesis and model order reduction with respect to the state.
|
|
11:20-11:40, Paper ThA02.5 | Add to My Program |
Learning Control of Second-Order Systems Via Nonlinearity Cancellation (I) |
|
Guo, Meichen | Delft University of Technology |
De Persis, Claudio | University of Groningen |
Tesi, Pietro | University of Florence |
Keywords: Data driven control, Stability of nonlinear systems, Learning
Abstract: A technique to design controllers for nonlinear systems from data consists of letting the controllers learn the nonlinearities, cancel them out and stabilize the closed-loop dynamics. When control and nonlinearities are unmatched, the technique leads to an approximate cancellation and local stability results are obtained. In this paper, we show that, if the system has some structure that the designer can exploit, an iterative use of the data leads to a globally stabilizing controller even when control and nonlinearities are unmatched.
|
|
11:40-12:00, Paper ThA02.6 | Add to My Program |
On the Impact of Regularization in Data-Driven Predictive Control (I) |
|
Breschi, Valentina | Eindhoven University of Technology |
Chiuso, Alessandro | Univ. Di Padova |
Fabris, Marco | University of Padua |
Formentin, Simone | Politecnico Di Milano |
Keywords: Data driven control, Predictive control for linear systems, Uncertain systems
Abstract: Model predictive control (MPC) is a control strategy widely used in industrial applications. However, its implementation typically requires a mathematical model of the system being controlled, which can be a time-consuming and expensive task. Data-driven predictive control (DDPC) methods offer an alternative approach that does not require an explicit mathematical model, but instead optimize the control policy directly from data. In this paper, we study the impact of two different regularization penalties on the closed-loop performance of a recently introduced data-driven method called γ-DDPC. Moreover, we discuss the tuning of the related coefficients in different data and noise scenarios, to provide some guidelines for the end user.
|
|
ThA03 Invited Session, Melati Junior 4010A-4111 |
Add to My Program |
Safe Planning and Control with Uncertainty Quantification II |
|
|
Chair: Lindemann, Lars | University of Southern California |
Co-Chair: Gao, Yulong | University of Oxford |
Organizer: Gao, Yulong | University of Oxford |
Organizer: Lindemann, Lars | University of Southern California |
Organizer: Fan, Chuchu | Massachusetts Institute of Technology |
Organizer: Abate, Alessandro | University of Oxford |
Organizer: Pappas, George J. | University of Pennsylvania |
|
10:00-10:20, Paper ThA03.1 | Add to My Program |
Distributionally Robust Uncertainty Quantification Via Data-Driven Stochastic Optimal Control |
|
Pan, Guanru | TU Dortmund University |
Faulwasser, Timm | TU Dortmund University |
Keywords: Stochastic optimal control, Predictive control for linear systems, Optimal control
Abstract: This paper studies optimal control problems of unknown linear systems subject to stochastic disturbances of uncertain distribution. Uncertainty about the stochastic disturbances is usually described via ambiguity sets of probability measures or distributions. Typically, stochastic optimal control requires knowledge of underlying dynamics and is as such challenging. Relying on a stochastic fundamental lemma from data-driven control and on the framework of polynomial chaos expansions, we propose an approach to reformulate distributionally robust optimal control problems with ambiguity sets as uncertain conic programs in a finite-dimensional vector space. We show how to construct these programs from previously recorded data and how to relax the uncertain conic program to numerically tractable convex programs via appropriate sampling of the underlying distributions. The efficacy of our method is illustrated via a numerical example.
|
|
10:20-10:40, Paper ThA03.2 | Add to My Program |
Inner Approximations of Stochastic Programs for Data-Driven Stochastic Barrier Function Design (I) |
|
Mathiesen, Frederik Baymler | Delft University of Technology |
Romao, Licio | University of Oxford |
Abate, Alessandro | University of Oxford |
Calvert, Simeon Craig | Delft University of Technology |
Laurenti, Luca | TU Delft |
Keywords: Formal Verification/Synthesis, Stochastic systems, Lyapunov methods
Abstract: This paper proposes a new framework to compute finite-horizon safety guarantees for discrete-time piece-wise affine systems with stochastic noise of unknown distributions. The approach is based on a novel approach to synthesise a stochastic barrier function (SBF) from noisy data and rely on the scenario optimization theory. In particular, we show that the stochastic program to synthesize a SBF can be relaxed into a chance-constrained optimisation problem on which scenario approach theory applies. We further show that the resulting program can be reduced to a linear programming problem, thus guaranteeing efficiency. In contrast to existing approaches, this method is data efficient as it only requires the number of data to be proportional to the logarithm in the negative inverse of the confidence level and is computationally efficient due to its reduction to linear programming. The efficacy of the method is empirically evaluated on various verification benchmarks. Experiments show a significant improvement with respect to state-of-the-art, obtaining tighter certificates with a confidence that is several orders of magnitude higher.
|
|
10:40-11:00, Paper ThA03.3 | Add to My Program |
Capture, Propagate, and Control Distributional Uncertainty (I) |
|
Aolaritei, Liviu | ETH Zurich |
Lanzetti, Nicolas | ETH Zürich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Stochastic systems, Robust control, Optimization
Abstract: We study stochastic dynamical systems in settings where only partial statistical information about the noise is available, e.g., in the form of a limited number of noise realizations. Such systems are particularly challenging to analyze and control, primarily due to an absence of a distributional uncertainty model which: (1) is expressive enough to capture practically relevant scenarios; (2) can be easily propagated through system maps; (3) is invariant under propagation; and (4) allows for computationally tractable control actions. In this paper, we propose to model distributional uncertainty via Optimal Transport ambiguity sets and show that such modeling choice satisfies all of the above requirements. We then specialize our results to stochastic LTI systems, and start by showing that the distributional uncertainty can be efficiently captured, with high probability, within an Optimal Transport ambiguity set on the space of noise trajectories. Then, we show that such ambiguity sets propagate exactly through the system dynamics, giving rise to stochastic tubes that contain, with high probability, all trajectories of the stochastic system. Finally, we show that the control task is very interpretable, unveiling an interesting decomposition between the roles of the feedforward and the feedback control terms. Our results are actionable and successfully applied in stochastic reachability analysis and in trajectory planning under distributional uncertainty.
|
|
11:00-11:20, Paper ThA03.4 | Add to My Program |
Conformal Off-Policy Evaluation in Markov Decision Processes (I) |
|
Foffano, Daniele | KTH Royal Institute of Technology |
Russo, Alessio | KTH Royal Institute of Technology |
Proutiere, Alexandre | KTH |
Keywords: Markov processes, Statistical learning, Estimation
Abstract: Reinforcement Learning aims at identifying and evaluating efficient control policies from data. In many real-world applications, the learner is not allowed to experiment and cannot gather data in an online manner (this is the case when experimenting is expensive, risky or unethical). For such applications, the reward of a given policy (the target policy) must be estimated using historical data gathered under a different policy (the behavior policy). Most methods for this learning task, referred to as Off-Policy Evaluation (OPE), do not come with accuracy and certainty guarantees. We present a novel OPE method based on Conformal Prediction that outputs an interval containing the true reward of the target policy with a prescribed level of certainty. The main challenge in OPE stems from the distribution shift due to the discrepancies between the target and the behavior policies. We propose and empirically evaluate different ways to deal with this shift. Some of these methods yield conformalized intervals with reduced length compared to existing approaches, while maintaining the same certainty level.
|
|
11:20-11:40, Paper ThA03.5 | Add to My Program |
Risk-Minimizing Two-Player Zero-Sum Stochastic Differential Game Via Path Integral Control (I) |
|
Patil, Apurva | The University of Texas at Austin |
Zhou, Yujing | Princeton University |
Fridovich-Keil, David | The University of Texas at Austin |
Tanaka, Takashi | University of Texas at Austin |
Keywords: Game theory, Stochastic optimal control, Autonomous systems
Abstract: This paper addresses a continuous-time risk-minimizing two-player zero-sum stochastic differential game (SDG), in which each player aims to minimize its probability of failure. Failure occurs in the event when the state of the game enters into predefined undesirable domains, and one player's failure is the other's success. We derive a sufficient condition for this game to have a saddle-point equilibrium and show that it can be solved via a Hamilton-Jacobi-Isaacs (HJI) partial differential equation (PDE) with Dirichlet boundary condition. Under certain assumptions on the system dynamics and cost function, we establish the existence and uniqueness of the saddle-point of the game. We provide explicit expressions for the saddle-point policies which can be numerically evaluated using path integral control. This allows us to solve the game online via Monte Carlo sampling of system trajectories. We implement our control synthesis framework on two classes of risk-minimizing zero-sum SDGs: a disturbance attenuation problem and a pursuit-evasion game. Simulation studies are presented to validate the proposed control synthesis framework.
|
|
11:40-12:00, Paper ThA03.6 | Add to My Program |
Data-Driven Reachability Analysis of Stochastic Dynamical Systems with Conformal Inference (I) |
|
Hashemi, Navid | University of Southern California |
Qin, Xin | University of Southern California |
Lindemann, Lars | University of Southern California |
Deshmukh, Jyotirmoy | University of Southern California |
Keywords: Formal Verification/Synthesis, Neural networks, Statistical learning
Abstract: We consider data-driven reachability analysis of discrete-time stochastic dynamical systems using conformal inference. We assume that we are not provided with a symbolic representation of the stochastic dynamics, but instead have access to a dataset of K-step trajectories. The reachability problem is to construct a probabilistic flowpipe such that the probability that a K-step trajectory can violate the bounds of the flowpipe does not exceed a user-specified failure probability threshold. The key ideas in this paper are: (1) to learn a surrogate predictor model from data, (2) to perform reachability analysis using the surrogate model, and (3) to quantify the surrogate model’s incurred error using conformal inference in order to give probabilistic reachability guarantees. We focus on learning-enabled control systems with complex closed-loop dynamics that are difficult to model symbolically, but where state transition pairs can be queried, e.g., using a simulator. We demonstrate the applicability of our method on examples from the domain of learning-enabled cyber-physical systems.
|
|
ThA04 Invited Session, Simpor Junior 4913 |
Add to My Program |
Electromobility: Transportation, Power Systems, and Markets |
|
|
Chair: Cicic, Mladen | CNRS, GIPSA-Lab |
Co-Chair: Cenedese, Carlo | ETH Zurich |
Organizer: Cicic, Mladen | CNRS, GIPSA-Lab |
Organizer: Cenedese, Carlo | ETH Zurich |
Organizer: Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
Organizer: Lygeros, John | ETH Zurich |
|
10:00-10:20, Paper ThA04.1 | Add to My Program |
Electric Vehicle Charging Station Pricing Control under Balancing Reserve Capacity Commitments (I) |
|
Cicic, Mladen | CNRS, GIPSA-Lab |
Gasnier, Guillaume | GIPSA-Lab, CNRS |
Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
Keywords: Traffic control, Smart grid, Smart cities/houses
Abstract: Electric vehicle charging stations are expected to become key players in the future sustainable power system. We propose a framework for using them to provide balancing services to the grid, by implementing charging price control laws that ensure they are able to deliver their committed balancing capacity. The control laws are based on the Coupled Traffic, Energy, and Charging (CTEC) model, incorporating electric vehicle routing and charging decisions based on the charging price and EV state of charge. Charging stations compete with each other and must ensure a certain number of charging vehicles to maintain their role of frequency containment reserves. The results demonstrate the effectiveness of the proposed pricing control scheme in maximizing charging station profits, without violating their balancing reserve capacity commitments.
|
|
10:20-10:40, Paper ThA04.2 | Add to My Program |
Routing and Charging Game in Ride-Hailing Service with Electric Vehicles (I) |
|
Zhang, Kenan | ETH Zurich |
Lygeros, John | ETH Zurich |
Keywords: Transportation networks, Mean field games, Markov processes
Abstract: This paper studies the routing and charging behaviors of electric vehicles in a competitive ride-hailing market. When the vehicles are idle, they can choose whether to continue cruising to search for passengers, or move a charging station to recharge. The behaviors of individual vehicles are then modeled by a Markov decision process (MDP). The state transitions in the MDP model, however, depend on the aggregate vehicle flows both in service zones and at charging stations. Accordingly, the value function of each vehicle is determined by the collective behaviors of all vehicles. With the assumption of the large population, we formulate the collective routing and charging behaviors as a mean-field Markov game. We characterize the equilibrium of such a game, prove its existence, and numerically show that the competition among vehicles leads to “inefficient congestion” both in service zones and at charging stations.
|
|
10:40-11:00, Paper ThA04.3 | Add to My Program |
Optimal Location of EVs Public Charging Stations Based on a Macroscopic Urban Electromobility Model (I) |
|
Mourgues, Rémi | CNRS - Gipsa Lab |
Rodriguez-Vega, Martin | CNRS, GIPSA-Lab |
Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
Keywords: Transportation networks, Modeling, Optimization
Abstract: This paper introduces a graph-based dynamic model for electric vehicle (EV) mobility in urban areas. The model tracks EV state-of-charge (SoC) changes over time and space, along with power inputs from public charging stations (PCS). It considers driver behavior when deciding when and where to charge, accounting for factors like current SoC, distance to PCS, and charging cost. The model helps identify optimal PCS locations to enhance convenience for EV users and profitability for PCS owners. Additionally, an averaged version of the model is presented to reduce computational overhead while aiding in optimal PCS placement. Simulation results affirm the effectiveness of our model and optimization approach in identifying ideal charging station locations and enhancing EV charging infrastructure accessibility.
|
|
11:00-11:20, Paper ThA04.4 | Add to My Program |
A Receding Horizon Scheme for EV Charging Stations in Demand Response Programs (I) |
|
Zanvettor, Giovanni Gino | Universita' Di Siena |
Fochesato, Marta | ETH Zurich |
Casini, Marco | Universita' Di Siena |
Vicino, Antonio | Univ. Di Siena |
Keywords: Smart grid, Energy systems
Abstract: Demand response is expected to play a fundamental role in providing flexibility for balancing operations to the grid. On the other hand, the fast electrification of the transportation sector calls for new solutions to enforce safe and reliable grid operation. Here we consider an electric vehicle charging station that participates in demand response programs. The demand response program asks for a change of the charging station load profile in exchange for a monetary reward. A stochastic receding horizon scheme that exploits the charging flexibility is then designed to optimally coordinate vehicle charging. Numerical simulations show that the proposed approach ensures substantial cost reduction compared to simpler benchmarks while maintaining the computation time feasible for real-world applications.
|
|
11:20-11:40, Paper ThA04.5 | Add to My Program |
Learning How to Price Charging in Electric Ride-Hailing Markets (I) |
|
Maljkovic, Marko | Ecole Polytechnique Fédérale De Lausanne (EPFL) |
Nilsson, Gustav | EPFL |
Geroliminis, Nikolas | Urban Transport Systems Laboratory, EPFL |
Keywords: Transportation networks, Game theory, Learning
Abstract: With the electrification of ride-hailing fleets, there will be a need to incentivize where and when the ride-hailing vehicles should charge. In this work, we assume that a central authority wants to control the distribution of the vehicles and can do so by selecting charging prices. Since there will likely be more than one ride-hailing company in the market, we model the problem as a single-leader multiple-follower Stackelberg game. The followers, i.e., the companies, then compete about the charging resources under given prices provided by the leader. We present a learning algorithm based on the concept of contextual bandits that allows the central authority to find an efficient pricing strategy. We also show how the exploratory phase of the learning can be improved if the leader has some partial knowledge about the companies’ objective functions. The efficiency of the proposed algorithm is demonstrated in a simulated case study for the city of Shenzhen, China.
|
|
11:40-12:00, Paper ThA04.6 | Add to My Program |
Designing Optimal Personalized Incentive for Traffic Routing Using BIG Hype (I) |
|
Grontas, Panagiotis D | Swiss Federal Institute of Technology (ETH) Zürich |
Cenedese, Carlo | ETH Zurich |
Fochesato, Marta | ETH Zurich |
Belgioioso, Giuseppe | ETH Zürich |
Lygeros, John | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Traffic control, Game theory, Optimization algorithms
Abstract: We study the problem of routing plug-in electric and conventional fuel vehicles on a city scale using incentives. In our model, commuters selfishly aim to minimize a local cost that combines travel time and the financial expenses of using city facilities, i.e., parking and service stations. The traffic authority can influence the commuters' routing choice via personalized discounts on parking tickets and on the energy price at service stations. We formalize the problem of optimally designing these monetary incentives to induce traffic decongestion as a large-scale bilevel game, where constraints arise at both levels due to the finite capacities of city facilities and incentives budget. Then, we develop an efficient scalable solution scheme with convergence guarantees based on BIG Hype, a recently-proposed hypergradient-based algorithm for bilevel games. Finally, we validate our approach via numerical simulations over the Anaheim's traffic network, showcasing its advantages in terms of traffic decongestion and scalability.
|
|
ThA05 Invited Session, Simpor Junior 4912 |
Add to My Program |
Recent Advances in Distributed Coordination of Intelligent Systems |
|
|
Chair: Liu, Lu | City University of Hong Kong |
Co-Chair: Fang, Hao | Beijing Institute of Technology |
Organizer: Liu, Lu | City University of Hong Kong |
Organizer: Fang, Hao | Beijing Institute of Technology |
|
10:00-10:20, Paper ThA05.1 | Add to My Program |
Constructing a Virtual Leader for Sign Consensus of Heterogeneous Multi-Agent Systems (I) |
|
Meng, Yihan | City University of Hong Kong |
Liu, Lu | City University of Hong Kong |
Zhang, Hongwei | Harbin Institute of Technology, Shenzhen |
Keywords: Distributed control, Observers for Linear systems, Adaptive control
Abstract: This paper studies output sign consensus problem for leaderless heterogeneous linear multi-agent systems (MASs) over switching signed graphs. Established on the assumption that the communication graph is jointly eventually positive, a distributed ‘sign observer’ is proposed to estimate a virtual leader. This virtual leader is not pre-determined, but induced from the topology of the graph, the initial conditions of the agents and the structure of the ‘observer’. Based on the distributed ‘sign observer’ and output regulation method, a state feedback controller is designed to drive the output signals of the MAS to have the same sign.
|
|
10:20-10:40, Paper ThA05.2 | Add to My Program |
Adaptive Coverage Control for Heterogeneous Mobile Sensor Networks in an Unknown Environment (I) |
|
Zheng, Boyin | City University of HongKong, Hong Kong, P. R. China |
Liu, Lu | City University of Hong Kong |
Keywords: Cooperative control, Adaptive control, Distributed control
Abstract: This article addresses the coverage control problems for heterogeneous mobile sensor networks (MSNs) in an environment with unknown event density functions. In contrast to existing works, unknown heterogeneous sensing abilities of the mobile sensor network (MSN) are considered by leveraging a weighted Voronoi diagram, namely, the Power diagram. To guarantee that the time-varying Power diagram converges to that defined by the true sensing weights, an online weight learning law is designed. Moreover, to handle certain applications such as forest fire investigation or nuclear radiation leakage mapping where the density information for the events of interest is not known to the MSN, an adaptive law is presented so that the event density approximation of each sensor converges to the real one along its trajectory. In addition, a move-to-centroid control law is proposed to drive the MSN to a near-optimal coverage configuration as time goes to infinity. Finally, the effectiveness of the proposed approach is illustrated by an example.
|
|
10:40-11:00, Paper ThA05.3 | Add to My Program |
A Distributed Algorithm for Solving a Time-Varying Linear Equation (I) |
|
Zhang, Xiaozhen | Beijing Institute of Technology |
Yang, Qingkai | Beijing Institute of Technology |
Wei, Haijiao | China North Vehicle Research Institute |
Chen, Wei | Beijing Institute of Technology Chongqing Innovation Center |
Peng, Zhihong | Beijing Institute of Technology |
Fang, Hao | Beijing Institute of Technology |
Keywords: Cooperative control, Distributed control, Time-varying systems
Abstract: This paper studies the problem of cooperatively solving a time-varying linear equation of the form textbf{A}(t)textbf{emph{x}}(t)=textbf{emph{b}}(t), which always has a unique solution. Each agent has access to only some rows of the time-varying augmented matrix [begin{matrix} textbf{A}(t) & textbf{emph{b}}(t) end{matrix}]. We propose a distributed algorithm for solving the time-varying linear equation.The proposed distributed algorithm enforces local solutions to track local time-varying manifolds corresponding to local linear sub-equations while simultaneously reaching a consensus. This enables all local solutions to converge to the solution of the original time-varying linear equation. Finally, the effectiveness of the proposed algorithm is demonstrated through its application in a cooperative monitoring task.
|
|
11:00-11:20, Paper ThA05.4 | Add to My Program |
Fully Distributed Dynamic Event-Triggering Formation Control of UAV Swarms under DoS Attacks (I) |
|
Cao, Hui | Beihang University |
Han, Liang | Beihang University |
Li, Dongyu | BEIHANG UNIVERSITY |
Hu, Qinglei | Beihang University |
Hao, Pengkun | Beihang University |
Keywords: Cooperative control, Distributed control, Resilient Control Systems
Abstract: For large-scale unmanned aerial vehicle (UAV) swarms, the security of communication networks is critical. When subjected to cyberattacks, the performance of swarm systems will be significantly affected. This paper focuses on the fully distributed time-varying formation (TVF) control problem of UAV swarms under Denial-of-Service (DoS) attacks. First, the theoretical framework of the fully distributed dynamic eventtriggering TVF control protocol is introduced. Then, sufficient conditions and critical proofs are provided to demonstrate that the desired formation configuration can be achieved under the influence of DoS attacks, and Zeno behavior is eliminated. Finally, the framework of a mixed-reality swarm flight platform is presented, which includes virtual nodes and physical nodes and integrates the advantages of both simulation and physical experiments, enabling large-scale swarm experiments with less cost and higher efficiency. The formation experiment using this platform validates the efficacy of the proposed control protocol.
|
|
11:20-11:40, Paper ThA05.5 | Add to My Program |
Multi-Agent Coordination under Temporal Logic Tasks and Team-Wise Intermittent Communication (I) |
|
Wang, Junjie | Peking University |
Guo, Meng | Peking University |
Li, Zhongkui | Peking University |
Keywords: Autonomous systems, Decentralized control, Formal Verification/Synthesis
Abstract: Multi-agent systems outperform single agent in complex collaborative tasks. However, in large-scale scenarios, ensuring timely information exchange during decentralized task execution remains a challenge. This work presents an online decentralized coordination scheme for multi-agent systems under complex local tasks and intermittent communication constraints. Unlike existing strategies that enforce all-time or intermittent connectivity, our approach allows agents to join or leave communication networks at aperiodic intervals, as deemed optimal by their online task execution. This scheme concurrently determines local plans and refines the commu- nication strategy, i.e., where and when to communicate as a team. A decentralized potential game is modeled among agents, for which a Nash equilibrium is generated iteratively through online local search. It guarantees local task completion and intermittent communication constraints. Extensive numerical simulations are conducted against several strong baselines.
|
|
11:40-12:00, Paper ThA05.6 | Add to My Program |
Combinatorial-Hybrid Optimization for Multi-Agent Systems under Collaborative Tasks (I) |
|
Tang, Zili | Peking University |
Chen, Junfeng | Peking University |
Guo, Meng | Peking University |
Keywords: Autonomous systems, Hybrid systems, Optimization
Abstract: Multi-agent systems can be extremely efficient when working concurrently and collaboratively, e.g., for transportation, maintenance, search and rescue. Coordination of such teams often involves two aspects: (i) selecting appropriate sub-teams for different tasks; (ii) designing collaborative control strategies to execute these tasks. The former aspect can be combinatorial w.r.t. the team size, while the latter requires optimization over joint state-spaces under geometric and dynamic constraints. Existing work often tackles one aspect by assuming the other is given, while ignoring their close dependency. This work formulates such problems as combinatorial-hybrid optimizations (CHO), where both the discrete modes of collaboration and the continuous control parameters are optimized simultaneously and iteratively. The proposed framework consists of two interleaved layers: the dynamic formation of task coalitions and the hybrid optimization of collaborative behaviors. Overall feasibility and costs of different coalitions performing various tasks are approximated at different granularities to improve the computational efficiency. At last, a Nash-stable strategy for both task assignment and execution is derived with provable guarantee on the feasibility and quality.
|
|
ThA06 Regular Session, Simpor Junior 4911 |
Add to My Program |
Estimation IV |
|
|
Chair: Bonnabel, Silvere | Mines ParisTech |
Co-Chair: Xie, Junyao | University of Alberta |
|
10:00-10:20, Paper ThA06.1 | Add to My Program |
Adaptive Estimation of Time-Varying Parameters Using DREM |
|
Diget, Emil Lykke | University of Southern Denmark |
Sloth, Christoffer | University of Southern Denmark |
Keywords: Estimation, Time-varying systems, Adaptive control
Abstract: In this paper we present a method for estimating time-varying parameters in a linear regression equation. We combine local polynomial regression with dynamic regressor extension and mixing to independently estimate the parameters. During local polynomial regression, a time-varying parameter is approximated by locally constant polynomial coefficients. We propose to use the Bernstein basis instead of the commonly used monomial basis to improve numerical conditioning. A simulation example shows that our proposed estimator has improved performance compared to a similar method and allows a higher polynomial order.
|
|
10:20-10:40, Paper ThA06.2 | Add to My Program |
Filtered High Gain Interval Observer for LPV Systems with Bounded Uncertainties |
|
Hugo, Antoine | IRSEEM/ONERA |
Thabet, Rihab El Houda | IRSEEM ESIGELEC |
Meyer, Luc | ONERA, Univ Paris Saclay |
Ahmed Ali, Sofiane | IBISC, Evry-Val-d’Essonne University, Universite Paris-Saclay, E |
Piet-Lahanier, Helene | ONERA |
Keywords: Estimation, Uncertain systems, Linear parameter-varying systems
Abstract: In this paper, a new High-Gain Interval Observer (HGIO) structure and its filtered version, named Filtered High-Gain Interval Observer (FHGIO), are proposed for a class of Linear Parameter Varying (LPV) systems subject to additive disturbances and measurement noise. Those uncertainties are assumed to be unknown but bounded with known values. The HGIO is based on a high-gain observer structure from which an interval formulation is deduced taking into account the uncertainties bounds. Then, the proposed HGIO is extended to incorporate a filter for the output estimation error, leading to the FHGIO design whose goal is to reduce the measurement noise amplification. Usually, the design of such interval observers is based on monotone systems theory which is hard to satisfy in many cases. In this paper, suitable changes of coordinates are used to overcome this limitation. Moreover, a sufficient condition for the non-divergence of the radius dynamics and a procedure to design the observers gains ensuring the stability are given for each observer. The efficiency of the proposed observers is illustrated through a simulation on a numerical example.
|
|
10:40-11:00, Paper ThA06.3 | Add to My Program |
Velocity Estimation for Motorcycles Using Image-To-Road Mapping |
|
Pryde, Martin | Université Paris-Saclay |
Nehaoua, Lamri | Evry Univeristy |
Hadj-Abdelkader, Hicham | University of Evry - Paris Saclay |
Arioui, Hichem | Evry Paris-Saclay University |
Keywords: Estimation, Vision-based control, Automotive systems
Abstract: The authors propose a visual-inertial approach to estimate the body-fixed lateral velocity of motorcycles traveling along extra-urban roads. The approach comprises the following steps: First, a monocular camera takes video of the road ahead. Key features from sequential images of the road surface are ex- tracted using the Harris corner detector and matching features are identified using the Fast retina keypoint descriptor. The locations of these features on the road surface are determined using a mapping based on an intuitive ray-casting approach. Next, the feature locations on the road, the angular velocity measurements and the optical flow of the feature projection locations on the image plane are used to formulate the ego- motion of the motorcycle as a system of linear equations from which a velocity estimate is solved for using the least-squares method. Finally, this estimate is fused with readings from an inertial navigation system using a Kalman filter to produce a filtered estimate and correct integrator drift. The approach is validated against simulation data generated using BikeSim and the results are compared against state observer approaches and previously published visual-inertial approaches from the authors.
|
|
11:00-11:20, Paper ThA06.4 | Add to My Program |
Estimation of Dynamic Gaussian Processes (I) |
|
van Hulst, Jilles | Eindhoven University of Technology |
van Zuijlen, Roy | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Estimation, Kalman filtering, Statistical learning
Abstract: Gaussian processes provide a compact representation for modeling and estimating an unknown function, that can be updated as new measurements of the function are obtained. This paper extends this powerful framework to the case where the unknown function dynamically changes over time. Specifically, we assume that the function evolves according to an integro-difference equation and that the measurements are obtained locally in a spatial sense. In this setting, we will provide the expressions for the conditional mean and covariance of the process given the measurements, which results in a generalized estimation framework, for which we coined the term Dynamic Gaussian Process (DGP) estimation. This new framework generalizes both Gaussian process regression and Kalman filtering. For a broad class of kernels, described by a set of basis functions, fast implementations are provided. We illustrate the results on a numerical example, demonstrating that the method can accurately estimate an evolving continuous function, even in the presence of noisy measurements and disturbances.
|
|
11:20-11:40, Paper ThA06.5 | Add to My Program |
A Lie-Theoretic Approach to Propagating Uncertainty Jointly in Attitude and Angular Momentum |
|
Jayaraman, Amitesh | Stanford University |
Ye, Jikai | National University of Singapore |
Chirikjian, Gregory | National University of Singapore |
Keywords: Estimation
Abstract: Dynamic state estimation, as opposed to kinematic state estimation, seeks to estimate not only the orientation of a rigid body but also its angular velocity, through Euler's equations of rotational motion. This paper demonstrates that the dynamic state estimation problem can be reformulated as estimating a probability distribution on a Lie group defined on phase space (the product space of rotation and angular momentum). The propagation equations are derived non-parametrically for the mean and covariance of the distribution. It is also shown that the equations can be approximately solved by ignoring the third and higher moments of the probability distribution. Numerical experiments show that the distribution constructed from the propagated mean and covariance fits the sample data better than an extended Kalman filter.
|
|
11:40-12:00, Paper ThA06.6 | Add to My Program |
Moving Horizon Estimation for Discrete-Time Linear Systems Using Transfer Learning (I) |
|
Xie, Junyao | University of Alberta |
Huang, Biao | Univ. of Alberta |
Keywords: Estimation, Observers for Linear systems, Machine learning
Abstract: In this article, we propose a novel moving horizon estimation method for discrete-time linear systems through transfer learning. Most moving horizon estimation designs require data from the considered systems of interest. However, practical processes might suffer from data sparsity issues, especially in a new or early operating environment. Motivated by the idea of transfer learning, this manuscript proposes a moving horizon estimation design using data from a similar but different system (i.e., source system) instead of the considered system (i.e., target system). Based on the data from the source system, we propose a novel moving horizon state estimation method for the target system and provide convergence and stability analyses. The state estimation error is upper bounded by a time-dependent sequence that is related to three types of similarities/differences between target and source systems, including initial conditions, disturbance levels, and model parameters. The effectiveness of the proposed approach is demonstrated through a numerical example.
|
|
ThA07 Invited Session, Simpor Junior 4813 |
Add to My Program |
Influence, Mechanism, and Information Design in Games |
|
|
Chair: Eksin, Ceyhun | Texas A&M University |
Co-Chair: Brown, Philip N. | University of Colorado, Colorado Springs |
Organizer: Eksin, Ceyhun | Texas A&M University |
Organizer: Marden, Jason R. | University of California, Santa Barbara |
Organizer: Brown, Philip N. | University of Colorado Colorado Springs |
|
10:00-10:20, Paper ThA07.1 | Add to My Program |
Robust Social Welfare Maximization Via Information Design in Linear-Quadratic-Gaussian Games |
|
Sezer, Furkan | Texas A&M University |
Eksin, Ceyhun | Texas A&M University |
Keywords: Game theory, Agents-based systems, Optimization
Abstract: Information design in an incomplete information game includes a designer with the goal of influencing players' actions through signals generated from a designed probability distribution so that its objective function is optimized. We consider a setting in which the designer has partial knowledge on agents' payoffs. We address the uncertainty about players' preferences by formulating a robust information design problem against the worst case payoffs. If the players have quadratic payoffs that depend on the players' actions and an unknown payoff-relevant state, and signals on the state that follow a Gaussian distribution conditional on the state realization, then the information design problem under quadratic design objectives is a semidefinite program (SDP). Specifically, we consider ellipsoid perturbations over payoff coefficients in linear-quadratic-Gaussian (LQG) games. We show that this leads to a SDP formulation. Numerical studies are carried out to identify the relation between the perturbation levels and the optimal information structures.
|
|
10:20-10:40, Paper ThA07.2 | Add to My Program |
Rationality and Behavior Feedback in a Model of Vehicle-To-Vehicle Communication (I) |
|
Gould, Brendan | University of Colorado Colorado Springs |
Brown, Philip N. | University of Colorado Colorado Springs |
Keywords: Game theory, Transportation networks, Agents-based systems
Abstract: Vehicle-to-Vehicle (V2V) communication is intended to improve road safety through distributed information sharing; however, it is difficult to predict and optimize how human agents will respond to this information. In a Bayesian game, agents probabilistically adopt various types from a fixed, exogenous distribution. Agents in such models ostensibly perform Bayesian inference, which may not be a reasonable cognitive demand for most humans. To complicate matters, real-world information provided to agents is often implicitly dependent on agent behavior, meaning that the distribution of agent types is a function of the behavior of agents (i.e., the type distribution is endogenous). In this paper, we study an existing model of V2V communication, but relax it along two dimensions: first, we pose a behavior model which does not require human agents to perform Bayesian inference; second, an equilibrium model which avoids the challenging endogenous recursion. Surprisingly, we show that the simplified non-Bayesian behavior model yields the exact same equilibrium behavior as the original Bayesian model, which may lend credibility to Bayesian models. However, we also show that the endogenous type model is necessary to obtain certain informational paradoxes; these paradoxes do not appear in the simpler exogenous model. This suggests that standard Bayesian game models with fixed type distributions are not sufficient to express certain important phenomena.
|
|
10:40-11:00, Paper ThA07.3 | Add to My Program |
Collaborative Coalitions in Multi-Agent Systems: Quantifying the Strong Price of Anarchy for Resource Allocation Games (I) |
|
Ferguson, Bryce L. | University of California, Santa Barbara |
Paccagnan, Dario | Imperial College London |
Pradelski, Bary S. R. | Centre National De La Recherche Scientifique, France |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Agents-based systems, Cooperative control
Abstract: The emergence of new communication technologies allows us to expand our understanding of distributed control and consider collaborative decision-making paradigms. With collaborative algorithms, certain local decision-making entities (or agents) are enabled to communicate with one another and collaborate on their actions to attain better system behavior. By limiting the amount of communication, these algorithms exist somewhere between centralized and fully distributed approaches. To understand the possible benefits of this inter-agent collaboration, we model a multi-agent system as a common interest game in which groups of agents can collaborate on their actions to jointly increase the system welfare. We specifically consider k-strong Nash equilibria as the emergent behavior of these systems and address how well these states approximate the system optimal, formalized by the k-strong price of anarchy ratio. Our main contributions are in generating tight bounds on the k-strong price of anarchy in finite resource allocation games as the solution to a tractable linear program. By varying k –the maximum size of a collaborative coalition– we observe exactly how much performance is gained from inter-agent collaboration. To investigate further opportunities for improvement, we generate upper bounds on the maximum attainable k-strong price of anarchy when the agents’ utility function can be designed.
|
|
11:00-11:20, Paper ThA07.4 | Add to My Program |
Coordination in Markov Games with Asymmetric Information (I) |
|
Wei, Xupeng | University of Michigan |
Anastasopoulos, Achilleas | University of Michigan |
Keywords: Game theory, Markov processes, Stochastic systems
Abstract: We study coordination in Markov games with asymmetric information. We consider a model where the state consists of different components, each representing the private type of each player. Players' actions depend on their private types and the public observation of past actions. The state components evolve as independent Markov processes conditioned on actions. We propose a solution concept called perfect correlated equilibrium (PCE), realized by a correlation device that observes only the public information of past actions. At time t, the device generates a prescription profile from a commonly known joint distribution, and sends each player a prescription privately before they act. Players are expected to take actions according to the prescriptions at equilibrium by evaluating the suggested prescription at the private types. We introduce "structured" PCE (sPCE), in which the correlation device generates prescriptions based on the common action history through a common belief on the state. We motivate sPCE by showing that any payoff profile induced by a general device can be induced by a structured one. We show that when the correlation device is using structured strategies, players' rationality constraints can be characterized through appropriate Markov decision processes (MDPs). Based on this characterization, we develop a backward dynamic approach, with which one can verify if a structured device is feasible, or even design a structured PCE in a backward recursive manner. Finally, we consider a specific example demonstrating how coordination can improve social welfare.
|
|
11:20-11:40, Paper ThA07.5 | Add to My Program |
Capacity Allocation and Pricing of High Occupancy Toll Lane Systems with Heterogeneous Travelers (I) |
|
Pulyassary, Haripriya | Cornell University |
Yang, Ruifan | Cornell University |
Zhang, Zhanhao | Cornell University |
Wu, Manxi | Cornell University |
Keywords: Game theory, Transportation networks, Intelligent systems
Abstract: In this article, we study the optimal design of High Occupancy Toll (HOT) lanes. In our setup, the traffic authority determines road capacity allocation between HOT lanes and ordinary lanes, as well as the toll price charged for travelers who use the HOT lanes but do not meet the high-occupancy eligibility criteria. We build a game-theoretic model to analyze the decisions made by travelers with heterogeneous values of time and carpool disutilities, who must choose between paying or forming carpools to take the HOT lanes, or taking the ordinary lanes. Travelers' payoffs depend on the congestion cost of the lane that they take, the payment and the carpool disutilities. We provide a complete characterization of travelers' equilibrium strategies and resulting travel times for any capacity allocation and toll price. We also calibrate our model on the California Interstate highway 880 and compute the optimal capacity allocation and toll design.
|
|
11:40-12:00, Paper ThA07.6 | Add to My Program |
Information Asymmetry and Contract Design with Applications to Agriculture (I) |
|
Bonatti, Alessandro | MIT |
Dahleh, Munther A. | Massachusetts Inst. of Tech |
Horel, Thibaut | MIT |
Roozbehani, Mardavij | Massachusetts Institute of Technology |
Keywords: Agents-based systems, Game theory, Emerging control applications
Abstract: We consider situations in which an intermediary facilitates the interactions of one or several players with a downstream market in a game of incomplete information. Our key assumption is the presence of information asymmetries: while the intermediary has in general better (less noisy) information about the observable parameters of the game, the players have better information about their own private parameters and preferences. The intermediary seeks to influence the actions of the agents by offering side information and/or certain guarantees regarding the outcomes. For instance, farmers aim to sell their produce in a downstream market. The intermediary has access to more accurate signals regarding the downstream market (e.g. demand, prices, etc.), while the farmers are aware of their own private cost structure. The central problem is then to understand how to design contracts for exchanging information and mediating the interaction between the players and the downstream market in such a way that generates value to the players and revenue to the intermediary. Prior work on information design with elicitation has shown that in the presence of competition between the players, the intermediary can generate value by coordinating the players' actions in such a way that reduces the negative externalities they exert on each other. In this work, we focus on the question of risk-aversion. Our first result is negative and shows that the intermediary cannot generate revenue when interacting with a single risk-neutral or risk-seeking player. We then explore how this result changes under various relaxations of the model which include altering the risk preference of the player or changing the timing of the game.
|
|
ThA08 Regular Session, Simpor Junior 4812 |
Add to My Program |
Optimal Control IV |
|
|
Chair: Richter, Rebecca | Bunderwehr University Munich |
Co-Chair: Oguri, Kenshiro | Purdue University |
|
10:00-10:20, Paper ThA08.1 | Add to My Program |
Optimization-Based Trajectory Generation and Receding Horizon Control for Systems with Convex Dynamics |
|
Lishkova, Yana | University of Oxford |
Cannon, Mark | University of Oxford |
Keywords: Optimal control, Nonlinear systems, Modeling
Abstract: In this paper we propose an optimization-based control scheme, which can be used for trajectory generation or receding horizon control for system with nonlinear, but convex dynamics, and both explicit and implicit discrete time models. The scheme uses both the nonlinear model and its linearization to construct a tube containing all possible future system trajectories, and uses this tube to predict performance and ensure constraint satisfaction. The controls sequence and tube cross-sections are optimized online in a sequence of convex programs without the need of pre-computed error bounds. We prove feasibility, stability and non-conservativeness of the approach, with the series of convex programs converging to a point which is a local optimum for the original nonlinear optimal control problem. We further present how a structure-preserving model can be implemented within the approach and used to reduce the number of constraints and guarantee a structure-preserving discrete trajectory solution.
|
|
10:20-10:40, Paper ThA08.2 | Add to My Program |
Closed-Loop Neighboring Extremal Optimal Control Using HJ Equation |
|
Rai, Ayush | Purdue University |
Mou, Shaoshuai | Purdue University |
Anderson, Brian D.O. | Australian National University |
Keywords: Optimal control, Nonlinear systems
Abstract: This study introduces a method to obtain a neighboring extremal optimal control (NEOC) solution for a broad class of nonlinear systems with nonquadratic performance indices by investigating the variation to a known closed-loop optimal control law caused by small, known variations in the system parameters or in the performance index. The NEOC solution can formally be obtained by solving a linear partial differential equation similar to those arising in an iterative solution procedure for a nonlinear Hamilton-Jacobi equation. Motivated by numerical procedures for solving such an equation, we also propose a numerical algorithm based on the Galerkin algorithm that uses basis functions to solve the underlying Hamilton-Jacobi equation. This approach allows the determination of the minimum performance index as a function of both the system state and parameters and extends to allow the determination of the adjustment to an optimal control law given a small adjustment of parameters in the system or the performance index, effectively by computing the derivative of the law with respect to those parameters. The validity of the claims and theory is supported by numerical simulations.
|
|
10:40-11:00, Paper ThA08.3 | Add to My Program |
Towards Continuous-Time MPC: A Novel Trajectory Optimization Algorithm |
|
Das, Souvik | Indian Institute of Technology, Bombay |
Ganguly, Siddhartha | Indian Institute of Technology, Bombay |
Muthyala, Anjali | Indian Institute of Technology Bombay |
Chatterjee, Debasish | Indian Institute of Technology, Bombay |
Keywords: Optimal control, Numerical algorithms, Constrained control
Abstract: This article introduces a numerical algorithm that serves as a preliminary step toward solving continuous-time model predictive control (MPC) problems directly without explicit time-discretization. The chief ingredients of the underlying optimal control problem (OCP) are a linear time-invariant system, quadratic instantaneous and terminal cost functions, and convex path constraints. The thrust of the method involves finitely parameterizing the admissible space of control trajectories and solving the OCP satisfying the given constraints at every time instant in a tractable manner without explicit time-discretization. The ensuing OCP turns out to be a convex semi-infinite program (SIP), and some recently developed results are employed to obtain an optimal solution to this convex SIP. A numerical illustration on a benchmark model is included to show the efficacy of the algorithm.
|
|
11:00-11:20, Paper ThA08.4 | Add to My Program |
Dynamic and Nonlinear Programming for Trajectory Planning |
|
Britzelmeier, Andreas | Bundeswehr University |
De Marchi, Alberto | Universität Der Bundeswehr München |
Richter, Rebecca | Bunderwehr University Munich |
Keywords: Optimal control, Numerical algorithms, Robotics
Abstract: Direct optimal control techniques, relying on numerical methods for constrained optimization, are typically used in trajectory planning tasks in high-dimensional spaces. However, general-purpose solvers often fail to find a feasible solution when facing cluttered environments. Sampling- or graph-based methods, instead, can explore complex configuration spaces but struggle with dynamic constraints. Here, we propose to combine dynamic programming (DP) and derivative-based methods to reliably solve trajectory planning problems. Specifically, we exploit DP to generate a sequence of waypoints in a low-dimensional space, which are then encoded as pointwise path constraints for a high-dimensional trajectory, whose constraint violations are then represented as a penalty within the Bellman equation to recompute the waypoints. This iterative approach, alternating path and trajectory optimization, avoids both the curse of dimensionality for DP and problematic nonconvexities (such as obstacles) for motion planning. We demonstrate our strategy using numerical experiments on a six-degree-of-freedom robotic manipulator moving in a confined space.
|
|
11:20-11:40, Paper ThA08.5 | Add to My Program |
Higher-Order Retraction Maps and Construction of Numerical Methods for Optimal Control of Mechanical Systems |
|
Anahory Simőes, Alexandre | IE University |
Barbero-Linan, Maria | Technical University of Madrid |
Colombo, Leonardo Jesus | Spanish National Research Council |
Martin de Diego, David | High Council for Scientific Research |
Keywords: Optimal control, Numerical algorithms, Variational methods
Abstract: Retractions maps are used to define a discretization of the tangent bundle of the configuration manifold as two copies of the configuration manifold where the dynamics take place. Such discretization maps can be conveniently lifted to a higher-order tangent bundle to construct geometric integrators for the higher-order Euler-Lagrange equations. Given a cost function, an optimal control problem for fully actuated mechanical systems can be understood as a higher-order variational problem. In this paper we introduce the notion of a higher-order discretization map associated with a retraction map to construct geometric integrators for the optimal control of mechanical systems. In particular, we study applications to path planning for obstacle avoidance of a planar rigid body.
|
|
11:40-12:00, Paper ThA08.6 | Add to My Program |
Successive Convexification with Feasibility Guarantee Via Augmented Lagrangian for Non-Convex Optimal Control Problems |
|
Oguri, Kenshiro | Purdue University |
Keywords: Optimal control, Optimization, Aerospace
Abstract: This paper proposes an algorithm that solves non-convex optimal control problems with a theoretical guarantee for global convergence to a feasible local solution of the original problem. The proposed algorithm extends the recently proposed successive convexification (SCvx) algorithm to address its key limitation: lack of feasibility guarantee to the original non-convex problem. The main idea of the proposed algorithm is to incorporate the SCvx iteration into an algorithmic framework based on the augmented Lagrangian method to enable the feasibility guarantee while retaining favorable properties of SCvx. Unlike the original SCvx, our approach iterates on both of the optimization variables and the Lagrange multipliers, which facilitates the feasibility guarantee as well as efficient convergence, in a spirit similar to the alternating direction method of multipliers (ADMM). Convergence analysis shows the proposed algorithm's strong global convergence to a feasible local optimum of the original problem and its convergence rate. These theoretical results are demonstrated via numerical examples with comparison against the original SCvx algorithm.
|
|
ThA09 Regular Session, Simpor Junior 4811 |
Add to My Program |
Optimization Algorithms IV |
|
|
Chair: Kalaimani, Rachel Kalpana | Indian Institute of Technology Madras |
Co-Chair: Sato, Hiroyuki | Kyoto University |
|
10:00-10:20, Paper ThA09.1 | Add to My Program |
PANTR: A Proximal Algorithm with Trust-Region Updates for Nonconvex Constrained Optimization |
|
Bodard, Alexander | KU Leuven |
Pas, Pieter | KU Leuven |
Patrinos, Panagiotis | KU Leuven |
Keywords: Optimization algorithms, Optimal control
Abstract: This work presents PANTR, an efficient solver for nonconvex constrained optimization problems, that is well-suited as an inner solver for an augmented Lagrangian method.The proposed scheme combines forward-backward iterations with solutions to trust-region subproblems: the former ensures global convergence, whereas the latter enables fast update directions. We discuss how the algorithm is able to exploit exact Hessian information of the smooth objective term through a linear Newton approximation, while benefiting from the structure of box-constraints or L1-regularization. An open-source C++ implementation of PANTR is made available as part of the NLP solver library ALPAQA. Finally, the effectiveness of the proposed method is demonstrated in nonlinear model predictive control applications.
|
|
10:20-10:40, Paper ThA09.2 | Add to My Program |
Gradient Descent with Low-Rank Objective Functions |
|
Cosson, Romain | INRIA |
Jadbabaie, Ali | Massachusetts Institute of Technology |
Makur, Anuran | Purdue University |
Reisizadeh, Amirhossein | Massachusetts Institute of Technology |
Shah, Devavrat | MIT |
Keywords: Optimization algorithms, Optimization, Machine learning
Abstract: Several recent empirical studies demonstrate that important machine learning tasks, e.g., training deep neural networks, exhibit low-rank structure, where the loss function varies significantly in only a few directions of the input space. In this paper, we leverage such low-rank structure to reduce the high computational cost of canonical gradient-based methods such as gradient descent (GD). Our proposed Low-Rank Gradient Descent (LRGD) algorithm finds an epsilon-minimizer of a p-dimensional function by first identifying r < p significant directions, and then estimating the true p-dimensional gradient at every iteration by computing directional derivatives only along those r directions. We establish that the "directional oracle complexity" of LRGD for strongly convex objective functions is O(r log(1/epsilon) + rp). Therefore, when r << p, LRGD provides significant improvement over the known complexity of O(p log(1/epsilon)) of GD in the strongly convex setting. Furthermore, using real and synthetic data, we empirically find that LRGD provides significant gains over GD when the data has low-rank structure, and in the absence of such structure, LRGD does not degrade performance compared to GD.
|
|
10:40-11:00, Paper ThA09.3 | Add to My Program |
Adaptive Low-Rank Gradient Descent |
|
Jadbabaie, Ali | Massachusetts Institute of Technology |
Makur, Anuran | Purdue University |
Reisizadeh, Amirhossein | Massachusetts Institute of Technology |
Keywords: Optimization algorithms, Optimization, Machine learning
Abstract: Low-rank structures have been observed in several recent empirical studies in many machine and deep learning problems, where the loss function demonstrates significant variation only in a lower dimensional subspace. While traditional gradient-based optimization algorithms are computationally costly for high-dimensional parameter spaces, such low-rank structures provide an opportunity to mitigate this cost. In this paper, we aim to leverage low-rank structures to alleviate the computational cost of first-order methods and study Adaptive Low-Rank Gradient Descent (AdaLRGD). The main idea of this method is to begin the optimization procedure in a very small subspace and gradually and adaptively augment it by including more directions. We show that for smooth and strongly convex objectives and any target accuracy epsilon, AdaLRGD's complexity is O(r ln(r/epsilon)) for some rank r no more than dimension d. This significantly improves upon gradient descent's complexity of O(d ln(1/epsilon)) when r << d. We also propose a practical implementation of AdaLRGD and demonstrate its ability to leverage existing low-rank structures in data.
|
|
11:00-11:20, Paper ThA09.4 | Add to My Program |
Communication-Efficient Distributed Optimization with Adaptability to System Heterogeneity |
|
Yu, Ziyi | University of Science and Technology of China |
Freris, Nikolaos M. | University of Science and Technology of China |
Keywords: Optimization algorithms, Optimization
Abstract: We consider the setting of agents cooperatively minimizing the sum of local objectives plus a regularizer on a graph. This paper proposes a primal-dual method in consideration of three distinctive attributes of real-life multi-agent systems, namely: (i) expensive communication, (ii) lack of synchronization, and (iii) system heterogeneity. In specific, we propose a distributed asynchronous algorithm with minimal communication cost, in which users commit variable amounts of local work on their respective sub-problems. We illustrate this both theoretically and experimentally in the machine learning setting, where the agents hold private data and use a stochastic Newton method as the local solver. Under standard assumptions on Lipschitz continuous gradients and strong convexity, our analysis establishes linear convergence in expectation and characterizes the dependency of the rate on the number of local iterations. We proceed a step further to propose a simple means for tuning agents’ hyperparameters locally, so as to adjust to heterogeneity and accelerate the overall convergence. Last, we validate our proposed method on a benchmark machine learning dataset to illustrate the merits in terms of computation, communication, and run-time saving as well as adaptability to heterogeneity.
|
|
11:20-11:40, Paper ThA09.5 | Add to My Program |
Robust Analysis of Almost Sure Convergence of Zeroth-Order Mirror Descent Algorithm |
|
Paul, Anik Kumar | IIT Madras |
Mahindrakar, Arun D. | Indian Institute of Technology Madras |
Kalaimani, Rachel Kalpana | Indian Institute of Technology Madras |
Keywords: Optimization algorithms, Optimization
Abstract: This paper presents an almost sure convergence of the zeroth-order mirror descent algorithm. The algorithm admits non-smooth convex functions and a biased oracle which only provides noisy function value at any desired point. We approximate the subgradient of the objective function using Nesterov's Gaussian Approximation (NGA) with certain alternations suggested by some practical applications. We prove an almost sure convergence of the iterates' function value to the neighbourhood of optimal function value, which can not be made arbitrarily small, a manifestation of a biased oracle. This letter ends with a concentration inequality, which is a finite time analysis that predicts the likelihood that the function value of the iterates is in the neighbourhood of the optimal value at any finite iteration.
|
|
11:40-12:00, Paper ThA09.6 | Add to My Program |
Conjugate Gradient Methods for Optimization Problems on Symplectic Stiefel Manifold |
|
Yamada, Mitsutaka | Kyoto University |
Sato, Hiroyuki | Kyoto University |
Keywords: Optimization algorithms, Optimization
Abstract: The symplectic Stiefel manifold is a Riemannian manifold that is a generalization of the symplectic group. In this study, we propose novel conjugate gradient methods on the symplectic Stiefel manifold and compare them with the steepest descent method proposed in existing studies through numerical experiments. Although the theoretical basis of the Riemannian conjugate gradient methods has already been established, special treatment is required to address specific manifolds since these methods utilize some mappings, such as a retraction and vector transport, on the manifold. Numerical experiments demonstrate that the proposed method outperforms existing methods and is efficient.
|
|
ThA10 Regular Session, Roselle Junior 4713 |
Add to My Program |
Machine Learning IV |
|
|
Chair: Bhatnagar, Shalabh | Indian Institute of Science |
Co-Chair: Coutinho, Daniel | Universidade Federal De Santa Catarina |
|
10:00-10:20, Paper ThA10.1 | Add to My Program |
An ADMM Solver for the MKL-L_{0/1}-SVM |
|
Shi, Yijie | School of Intelligent Systems Engineering, Sun Yat-Sen Universit |
Zhu, Bin | Sun Yat-Sen University |
Keywords: Machine learning, Optimization, Estimation
Abstract: We formulate the Multiple Kernel Learning (abbreviated as MKL) problem for the support vector machine with the infamous (0, 1)-loss function. Some first-order optimality conditions are given and then exploited to develop a fast ADMM solver for the nonconvex and nonsmooth optimization problem. A simple numerical experiment on synthetic planar data shows that our MKL-L_{0/1}-SVM framework could be promising.
|
|
10:20-10:40, Paper ThA10.2 | Add to My Program |
Combining Robust Control and Machine Learning for Uncertain Nonlinear Systems Subject to Persistent Disturbance |
|
Banderchuk, Ana Cláudia | Universidade Federal De Santa Catarina |
Coutinho, Daniel | Universidade Federal De Santa Catarina |
Camponogara, Eduardo | Federal University of Santa Catarina |
Keywords: Machine learning, Robust control, Uncertain systems
Abstract: This paper proposes a control strategy consisting of a robust controller and an Echo State Network (ESN) based control law for stabilizing a class of uncertain nonlinear discrete-time systems subject to persistent disturbances. Firstly, the robust controller is designed to ensure that the closed-loop system is Input-to-State Stable (ISS) with a guaranteed stability region regardless of the ESN control action and exogenous disturbances. Then, the ESN based controller is trained in order to mitigate the effects of disturbances on the system output. A numerical example demonstrates the potentials of the proposed control design method.
|
|
10:40-11:00, Paper ThA10.3 | Add to My Program |
A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes |
|
Guin, Soumyajit | Indian Institute of Science, Bengaluru |
Bhatnagar, Shalabh | Indian Institute of Science |
Keywords: Machine learning, Stochastic optimal control, Stochastic systems
Abstract: The infinite horizon setting is widely adopted for problems of reinforcement learning (RL). These invariably result in stationary policies that are optimal. In many situations, finite horizon control problems are of interest and for such problems, the optimal policies are time-varying in general. Another setting that has become popular in recent times is of Constrained Reinforcement Learning, where the agent maximizes its rewards while it also aims to satisfy some given constraint criteria. However, this setting has only been studied in the context of infinite horizon MDPs where stationary policies are optimal. We present an algorithm for constrained RL in the Finite Horizon Setting where the horizon terminates after a fixed (finite) time. We use function approximation in our algorithm which is essential when the state and action spaces are large or continuous and use the policy gradient method to find the optimal policy. The optimal policy that we obtain depends on the stage and so is non-stationary in general. To the best of our knowledge, our paper presents the first policy gradient algorithm for the finite horizon setting with constraints. We show the convergence of our algorithm to an optimal policy. We also compare and analyze the performance of our algorithm through experiments and show that our algorithm performs better than other well known algorithms.
|
|
11:00-11:20, Paper ThA10.4 | Add to My Program |
A Multi-Fidelity Bayesian Approach to Safe Controller Design |
|
Lau, Ethan | Michigan State University |
Srivastava, Vaibhav | Michigan State University |
Bopardikar, Shaunak D. | Michigan State University |
Keywords: Machine learning, Uncertain systems, Stochastic systems
Abstract: Safely controlling unknown dynamical systems is one of the biggest challenges in the field of control. Oftentimes, an approximate model of a system's dynamics exists which provides beneficial information for control design. However, differences between the approximate and true systems present challenges as well as safety concerns. We propose an algorithm called SAFESLOPE to safely evaluate points from a Gaussian process model of a function when its Lipschitz constant is unknown. We establish theoretical guarantees for the performance of SAFESLOPE and quantify how multi-fidelity modeling improves the algorithm's performance. Finally, we present a case where SAFESLOPE achieves lower cumulative regret than a naive sampling method by applying it to find the control gains of a linear time-invariant system.
|
|
11:20-11:40, Paper ThA10.5 | Add to My Program |
Safe Neural Control for Non-Affine Control Systems with Differentiable Control Barrier Functions |
|
Xiao, Wei | Massachusetts Institute of Technology |
Allen, Ross | MITLL |
Rus, Daniela | MIT |
Keywords: Lyapunov methods, Machine learning, Constrained control
Abstract: This paper addresses the problem of safety-critical control for non-affine control systems. It has been shown that optimizing quadratic costs subject to state and control constraints can be sub-optimally reduced to a sequence of quadratic programs (QPs) by using Control Barrier Functions (CBFs). Our recently proposed High Order CBFs (HOCBFs) can accommodate constraints of arbitrary relative degree. The main challenges in this approach are that it requires affine control dynamics and the solution of the CBF-based QP is sub-optimal since it is solved point-wise. To address these challenges, we incorporate higher-order CBFs into neural ordinary differential equation-based learning models as differentiable CBFs to guarantee safety for non-affine control systems. The differentiable CBFs are trainable in terms of their parameters, and thus, they can address the conservativeness of CBFs such that the system state will not stay unnecessarily far away from safe set boundaries. Moreover, the imitation learning model is capable of learning complex and optimal control policies that are usually intractable online. We illustrate the effectiveness of the proposed framework on LiDAR-based autonomous driving and compare it with existing methods.
|
|
11:40-12:00, Paper ThA10.6 | Add to My Program |
Supervised Learning of Lyapunov Functions Using Laplace Averages of Approximate Koopman Eigenfunctions |
|
Deka, Shankar | KTH Royal Institute of Technology, Sweden |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Lyapunov methods, Machine learning, Stability of nonlinear systems
Abstract: Modern data-driven techniques have rapidly progressed beyond modelling and systems identification, with a growing interest in learning high-level dynamical properties of a system, such as safe-set invariance, reachability, input-to-state stability etc. In this paper, we propose a novel supervised Deep Learning technique for constructing Lyapunov certificates, by leveraging Koopman Operator theory-based numerical tools (Extended Dynamic Mode Decomposition and Generalized Laplace Analysis) to robustly and efficiently generate explicit ground truth data for training. This is in stark contrast to existing Deep Learning methods where the loss functions plainly penalize Lyapunov condition violation in the absence of labelled data for direct regression. Furthermore, our approach leads to a linear parameterization of Lyapunov candidate functions in terms of stable eigenfunctions of the Koopman operator, making them more interpretable compared to standard DNN-based architecture. We demonstrate and validate our approach numerically using 2-dimensional and 10-dimensional examples.
|
|
ThA11 Regular Session, Roselle Junior 4712 |
Add to My Program |
Agent-Based Systems IV |
|
|
Chair: Wang, Lin | Shanghai Jiao Tong University |
Co-Chair: Hespe, Christian | Hamburg University of Technology |
|
10:00-10:20, Paper ThA11.1 | Add to My Program |
Co-Evolution of Dual Opinions under Asynchronous Updating |
|
Zhang, Qi | Shanghai Jiao Tong University |
Wang, Lin | Shanghai Jiao Tong University |
Wang, Xiaofan | Department of Automation, Shanghai Jiaotong University |
Chen, Guanrong | City University of Hong Kong |
Keywords: Agents-based systems, Network analysis and control, Decentralized control
Abstract: Inspired by the dual attitudes theory that implicit opinions are individuals' inner evaluations affected by experience while explicit opinions are external expressions of these evaluations, we propose an asynchronous co-evolution model of dual opinions, where individuals update explicit opinion at each time step but change their implicit opinion based on their own clock. Furthermore, we introduce the after-effect of observed opinion information in this model, which enables individuals to update implicit opinions not only based on the opinion information observed at the current time but also on the information received from the past period of time. We analyze the dynamics of dual opinions in two discussion scenarios: a group of individuals with similar and opposite initial opinions. In the former scenario, rigorous analysis suggests that dual opinions are polarized to extreme opinions, mathematically verifying the empirical finding that group discussion intensifies individuals' preferences, resulting in group polarization. In the latter scenario, our investigation shows that individuals with low bias show acceptance (inward conformity) while those with high bias exhibit compliance (outward conformity). We further analyze the influence of parameters on the co-evolution of dual opinions.
|
|
10:20-10:40, Paper ThA11.2 | Add to My Program |
On an Extension of the Friedkin-Johnsen Model: The Effects of a Homophily-Based Influence Matrix |
|
Disarň, Giorgia | University of Padova |
Valcher, Maria Elena | Universita' Di Padova |
Keywords: Agents-based systems, Network analysis and control, Modeling
Abstract: In this paper we propose an extended version of the Friedkin-Johnsen (FJ) model that accounts for the effects of homophily mechanisms on the agents’ mutual appraisals. The proposed model consists of two difference equations. The first one describes the opinions’ evolution, namely how agents modify their opinions taking into account both their personal beliefs and the influences of other agents, as in the standard FJ model. Meanwhile, the second equation models how the influence matrix involved in the opinion formation process updates according to a homophily mechanism, by allowing both positive and negative appraisals. We derive necessary and sufficient conditions for the proposed time-varying version of the classical FJ model to asymptotically converge to a constant solution. In the case of a single discussion topic, asymptotic convergence is always ensured and the limit behavior of the system is derived in closed form.
|
|
10:40-11:00, Paper ThA11.3 | Add to My Program |
A Robustness Analysis to Structured Channel Tampering Over Secure-By-Design Consensus Networks |
|
Fabris, Marco | University of Padua |
Zelazo, Daniel | Technion - Israel Institute of Technology |
Keywords: Agents-based systems, Network analysis and control
Abstract: This work addresses multi-agent consensus networks where adverse attackers affect the convergence performances of the protocol by manipulating the edge weights. We generalize [1] and provide guarantees on the agents’ agreement in the presence of attacks on multiple links in the network. A stability analysis is conducted to show the robustness to channel tampering in the scenario where part of the codeword, corresponding to the value of the edge weights, is corrupted. Exploiting the built-in objective coding, we show how to compensate the conservatism that may emerge because of multiple threats in exchange for higher encryption capabilities. Numerical examples related to semi-autonomous networks are provided.
|
|
11:00-11:20, Paper ThA11.4 | Add to My Program |
A Scalable Approach for Analysing Multi-Agent Systems with Heterogeneous Stochastic Packet Loss |
|
Hespe, Christian | Hamburg University of Technology |
Werner, Herbert | Hamburg University of Technology |
Keywords: Agents-based systems, Networked control systems, Robust control
Abstract: An important aspect in jointly analysing networked control systems and their communication is to model the networking in a sufficiently rich but at the same time mathematically tractable way. As such, this paper improves on a recently proposed scalable approach for analysing multi-agent systems with stochastic packet loss by allowing for heterogeneous transmission probabilities and temporal correlation in the communication model. The key idea is to consider the transmission probabilities as uncertain, which facilitates the use of tools from robust control. Due to being formulated in terms of linear matrix inequalities that grow linearly with the number of agents, the result is applicable to very large multi-agent systems, which is demonstrated by numerical simulations with up to 10000 agents.
|
|
11:20-11:40, Paper ThA11.5 | Add to My Program |
Scalable Robust Multi-Agent Reinforcement Learning for Model Uncertainty |
|
Jwa, Younkyung | Gwangju Institute of Science and Technology |
Gwak, Minseon | POSTECH |
Kwak, Jiin | Ulsan National Institute of Science and Technology |
Ahn, Chang Wook | Gwangju Institute of Science and Technology |
Park, PooGyeon | POSTECH (Pohang Univ. of Sci. & Tech.) |
Keywords: Agents-based systems, Machine learning, Cooperative control
Abstract: A robust multi-agent reinforcement learning (MARL) algorithm using a nature actor has been shown to be effective in finding a robust Nash equilibrium (NE) of a Markov game with model uncertainty. However, since a game-size scaling increases the search space and challenges reaching the NE, the robust property of the algorithm is reduced in environments with many agents. This paper proposes an evolutionary diversity-maintaining population curriculum (EDPC) framework with a robust attention-based multi-agent deep deterministic policy gradient (RA-MADDPG) algorithm, which enables an efficient robust NE search by a structured search space expansion. In the EDPC framework, the MARL divides into several stages, and when moving on to the next stage, a population consisting of larger games is made with two parent games from the previous stage. We introduce reward-proportionate parent selection and reward-guided mutation methods to continue reinforcing superior agents and maintain the diversity of the population. Furthermore, the RA-MADDPG is used to solve the robust Markov game at each stage with nature actors with attention-based architectures. The scalability and robustness of the proposed method are evaluated for different numbers of agents and levels of model uncertainty.
|
|
11:40-12:00, Paper ThA11.6 | Add to My Program |
Lexicographic Min-Max Fairness in Task Assignments |
|
Ding, Geoffrey | Massachusetts Institute of Technology |
Balakrishnan, Hamsa | Massachusetts Institute of Technology |
Keywords: Agents-based systems, Optimization, Cooperative control
Abstract: Assignment problems and their variants are ubiquitous across resource allocation applications. While they traditionally focus on minimizing costs or maximizing utility, fairness is also an important consideration, especially for task assignment in multi-agent systems. We propose algorithms for assigning tasks to agents that consider lexicographic min-max fairness, a stronger notion of fairness than min-max fairness, which minimizes the maximum cost to any single agent. We apply our proposed approaches to both one-to-one and one-to-many assignment problems. Due to the computational challenges of one-to-many task assignments, we develop tractable approaches to achieve approximate fairness. Finally, we use the proposed methods to evaluate the trade-offs between efficiency and fairness through numerical experiments
|
|
ThA12 Regular Session, Roselle Junior 4711 |
Add to My Program |
Cooperative Control IV |
|
|
Chair: Hayashi, Naoki | Osaka University |
Co-Chair: Zhang, Hongwei | Harbin Institute of Technology, Shenzhen |
|
10:00-10:20, Paper ThA12.1 | Add to My Program |
Constrained Coverage of Unknown Environment Using Safe Reinforcement Learning |
|
Zhang, Yunlin | University of Electronic Science and Technology of China |
You, JunJie | University of Electronic Science and Technology of China |
Shi, Lei | Henan University |
Shao, Jinliang | University of Electronic Science and Technology of China, Chengd |
Zheng, Wei Xing | Western Sydney University |
Keywords: Cooperative control, Learning, Agents-based systems
Abstract: Achieving a connected, collision-free and time-efficient coverage in unknown environments is challenging for multi-agent systems. Particularly, agents with second-order dynamics are supposed to efficiently search and reach the optimal deployment positions over targets whose distribution is unknown, while preserving the distributed connectivity and avoiding collision. In this paper, a safe reinforcement learning based shield method is proposed for unknown environment exploration while correcting actions of agents for safety guarantee and avoiding invalid samples into policy updating. The shield is achieved distributively by a control barrier function and its validity is proved in theory. Moreover, policies of the optimal coverage are centrally learned via reward engineering and executed distributively. Numerical results show that the proposed approach not only achieves zero safety violations during training, but also speeds up the convergence of learning.
|
|
10:20-10:40, Paper ThA12.2 | Add to My Program |
Minimally Disruptive Cooperative Lane-Change Maneuvers |
|
Chalaki, Behdad | Honda Research Institute |
Tadiparthi, Vaishnav | Honda Research Institute |
Nourkhiz Mahjoub, Hossein | Honda Research Institute USA Inc |
D'sa, Jovin | Honda Research Institute USA Inc |
Moradi Pari, Ehsan | Honda Research Institute USA, Inc |
Chavez Armijos, Andres | Boston University |
Li, Anni | Boston University |
Cassandras, Christos G. | Boston University |
Keywords: Cooperative control, Optimization, Autonomous vehicles
Abstract: A lane-change maneuver on a congested highway could be severely disruptive or even infeasible without the cooperation of neighboring cars. However, cooperation with other vehicles does not guarantee that the performed maneuver will not have a negative impact on traffic flow unless it is explicitly considered in the cooperative controller design. In this letter, we present a socially compliant framework for cooperative lane-change maneuvers for an arbitrary number of CAVs on highways that aims to interrupt traffic flow as minimally as possible. Moreover, we explicitly impose feasibility constraints in the optimization formulation by using reachability set theory, leading to a unified design that removes the need for an iterative procedure used in prior work. We quantitatively evaluate the effectiveness of our framework and compare it against previously offered approaches in terms of maneuver time and incurred throughput disruption.
|
|
10:40-11:00, Paper ThA12.3 | Add to My Program |
Cooperative Learning for Adversarial Multi-Armed Bandit on Open Multi-Agent Systems |
|
Nakamura, Tomoki | Osaka University |
Hayashi, Naoki | Osaka University |
Inuiguchi, Masahiro | Osaka University |
Keywords: Cooperative control, Optimization, Networked control systems
Abstract: This paper considers a cooperative decision-making method for an adversarial bandit problem on open multi-agent systems. In an open multi-agent system, the network configuration changes dynamically as agents freely enter and leave the network. We propose a distributed Exp3 policy in which a group of agents exchanges the estimation of the expected reward of each arm with active neighboring agents. Then, each agent updates the probability distribution of choosing arms by combining the estimated rewards of neighboring agents. We derive a sufficient condition for a sublinear bound of a pseudo regret. The numerical example shows that active agents can cooperatively find the optimal arm by the proposed Exp3 policy algorithm.
|
|
11:00-11:20, Paper ThA12.4 | Add to My Program |
Energy Efficient Optimization-Based Coordination of Electric Automated Vehicles in Confined Areas |
|
Kojchev, Stefan | Chalmers University of Technology |
Hult, Robert | Chalmers University of Technology |
Fredriksson, Jonas | Chalmers University of Technology |
Keywords: Cooperative control, Optimization algorithms, Autonomous vehicles
Abstract: In this paper, we present an optimization-based control strategy for coordinating multiple electric automated vehicles (AVs) in confined sites. The approach focuses on obtaining and keeping energy-efficient driving profiles for the AVs while avoiding collisions in cross-intersections, narrow roads, and merge crossings. Specifically, the approach is composed of two optimization-based components. The first component obtains the energy-efficient profiles for each individual AV by solving a Nonlinear Program (NLP) for the vehicle's complete mission route. The conflict resolution, which is performed by the second component, is accomplished by solving a time-scheduling Mixed Integer Linear Programming (MILP) problem that exploits the application characteristics. We demonstrate the performance of the algorithm through a non-trivial comparative simulation example with an alternative optimization-based heuristic.
|
|
11:20-11:40, Paper ThA12.5 | Add to My Program |
Distributed Event-Triggered Dual Decomposition Method for Cooperative One-Way Car-Sharing Control |
|
Ogawa, Gakuto | Osaka University |
Hayashi, Naoki | Osaka University |
Sakurama, Kazunori | Kyoto University |
Inuiguchi, Masahiro | Osaka University |
Keywords: Cooperative control, Optimization algorithms, Networked control systems
Abstract: In this paper, we present cooperative rebalancing control of a one-way car-sharing service, where several service providers operate vehicles independently while sharing the common rental stations. The objective of service providers is to reduce the number of deadhead vehicles considering limited parking slots at stations. To this end, we propose a rebalancing control method by a distributed dual decomposition algorithm. Each provider transmits the estimation of the dual optimizers to the neighboring providers in an event-triggered manner. A numerical example shows that all service providers can find an optimal rebalancing solution while effectively reducing the number of communications.
|
|
11:40-12:00, Paper ThA12.6 | Add to My Program |
Power Sharing and Voltage Deviation Restriction for Multi-Bus DC Microgrids |
|
Bai, Handong | Yellow River Conservancy Technical Institute |
Liu, Zhancheng | Southwest Jiaotong University |
Zhang, Hongwei | Harbin Institute of Technology, Shenzhen |
Keywords: Cooperative control, Smart grid, Distributed control
Abstract: Power sharing and voltage regulation are fundamental but conflict control objectives of DC microgrids. This paper presents a distributed control strategy to achieve adjustment of the control objectives from accurate power sharing to accurate voltage regulation. At the same time, the bus voltage of critical node is regulated to reach the rated value of the DC microgrid. Based on this control strategy, steady state characteristics of the closed-loop system are analyzed. For a given DC microgrid, the proposed control method is verified experimentally.
|
|
ThA13 Regular Session, Roselle Junior 4613 |
Add to My Program |
Networked Control Systems I |
|
|
Chair: Lunze, Jan | Ruhr-Universität Bochum |
Co-Chair: Stursberg, Olaf | University of Kassel |
|
10:00-10:20, Paper ThA13.1 | Add to My Program |
Phase Locking of Linear Oscillators with Individual Parameters |
|
Lunze, Jan | Ruhr-Universität Bochum |
Keywords: Networked control systems, Agents-based systems, Cooperative control
Abstract: A recent paper has shown that linear oscillators can be synchronised only if their parameters are exactly the same. The main reason for this sensitivity lies in the fact that the oscillator network loses energy whenever the oscillators do not follow precisely the same output trajectory. This paper deals with the question how to extend oscillators to make them synchronisable in a practical sense. The oscillators are equipped with an energy source that replaces the energy lost during the synchronisation. It is shown that a power supply rate exists such that oscillators with different eigenfrequencies can phase lock, which means that they follow the same sinusoidal trajectory with some phase gap. For two coupled oscillators explicit relations for the frequency and the phase shift of the synchronous behaviour are derived.
|
|
10:20-10:40, Paper ThA13.2 | Add to My Program |
Uncorrelated Packet Loss Model for Networked Control Systems with H∞ Design Constraint |
|
Villamil, Andres | TU Dresden |
González, Arturo | Vodafone Tech Innovation Center Dresden |
Fettweis, Gerhard | Technische Universität Dresden |
Keywords: Networked control systems, Communication networks, Optimization
Abstract: Networked Controlled Systems (NCS) are control systems that rely on the performance of the communications to ensure a desired Quality-of-Control (QoC). However, the wireless link is imperfect; the packet has an intrinsic latency, and packets can be lost due to its stochastic nature. Although the newest generation of wireless networks (5G and beyond) can provide Ultra-Reliable Low Latency Communications (URLLC) to attempt to remove the problems caused by the wireless network. This methodology is expensive regarding communications resources, and for NCS, the constant stream of data could be spared since some updates might contain similar information. Although there are multiple methodologies to design a NCS, not many methods attempt to develop the communications system based on reducing the consumption of communications resources. Therefore, this work finds the maximum transmission interval and delay to optimize the maximum peak Age-of-Information (AoI) while getting a model of the Maximum Allowable Packet Loss Probability (MAPLP) for the case that the H∞ norm of the control system must be maintained lower than a specified threshold. Finally, the model is validated in the case of platooning using Cooperative Adaptive Cruise Control(CACC), showing a high accuracy compared to the results of the solution of the optimization problem.
|
|
10:40-11:00, Paper ThA13.3 | Add to My Program |
Communication Demand Minimization for Perturbed Networked Control Systems with Coupled Constraints |
|
Bahraini, Masoud | Chalmers University of Technology |
Zanon, Mario | IMT Institute for Advanced Studies Lucca |
Colombo, Alessandro | Politecnico Di Milano |
Falcone, Paolo | Chalmers University of Technology |
Keywords: Networked control systems, Constrained control, Optimization algorithms
Abstract: Communication scheduling is needed when control loops of several safety-critical systems are closed through a shared communication medium. To enable schedulability, control for each system is designed primarily to minimize its communication demand. In this paper, we study communication demand minimization for a class of perturbed multi-agent networked control systems with a shared communication medium and subject to input and coupled state constraints. First, a framework to design communication schedule and control is recalled such that state and input constraints are satisfied under no coupling assumption. Then, a heuristic method is proposed to decouple state constraints such that the overall communication demand of the systems is minimized. Effectiveness of the proposed results are illustrated through a numerical example.
|
|
11:00-11:20, Paper ThA13.4 | Add to My Program |
Resilience of Time-Varying Communication Graphs for Consensus of Changing Sets of Computing Agents |
|
Schmidtke, Vincent | Universität Kassel |
Liu, Zonglin | University of Kassel |
Stursberg, Olaf | University of Kassel |
Keywords: Networked control systems, Control of networks, Resilient Control Systems
Abstract: System performance of distributed control systems and networked computing systems is strongly dependent of the underlying communication topology. This paper considers the rarely studied problem of how the topology can maintain resilience by reconfiguration in case that agents leave or join the network during online operation. Existing optimization-based approaches which reconfigure the entire network can typically not be used in this case, since the computational burden for online application is too high. Thus, this paper proposes a novel combined offline-online scheme which optimizes the topology for high convergence rate (of e.g. consensus problems) while providing guarantees for the robustness against agent failures. In the offline part, an optimization of the entire topology is carried out using novel constraints to prepare resilience of the online procedure. For the latter, the proposed scheme guarantees that robustness is maintained for joining agents and if a specified number of agents leave the network. In simulation, the proposed scheme is compared to existing approaches and the advantages of the online-offline procedure are demonstrated.
|
|
11:20-11:40, Paper ThA13.5 | Add to My Program |
Large Population Games on Constrained Unreliable Networks |
|
Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Zaman, Muhammad Aneeq uz | UIUC |
Bastopcu, Melih | University of Illinois Urbana Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Networked control systems, Control over communications, Constrained control
Abstract: This paper studies an N–agent cost-coupled game where the agents are connected via an unreliable capacity constrained network. Each agent receives state information over that network which loses packets with probability p. A Base station (BS) actively schedules agent communications over the network by minimizing a weighted Age of Information (WAoI) based cost function under a capacity limit C < N on the number of transmission attempts at each instant. Under a standard information structure, we show that the problem can be decoupled into a scheduling problem for the BS and a game problem for the N agents. Since the scheduling problem is an NP hard combinatorics problem, we propose an approximately optimal solution which approaches the optimal solution as N tends to infinity. In the process, we also provide some insights on the case without channel erasure. Next, to solve the large population game problem, we use the mean-field game framework to compute an approximate decentralized Nash equilibrium. Finally, we validate the theoretical results using a numerical example.
|
|
11:40-12:00, Paper ThA13.6 | Add to My Program |
Localized Privacy Preservation by Innovation Perturbation in a Cooperative LQG Control System |
|
Sheng, Wenliang | East China University of Science and Technology |
Zhao, Zhiyun | East China University of Science and Technology |
Yang, Wen | East China University of Science and Technology |
Yang, Chao | East China University of Science and Technology |
Keywords: Networked control systems, Control Systems Privacy
Abstract: We consider a cooperative Linear Quadratic Gaussian (LQG) control system, in which an individual user owns a local plant whose control inputs are provided by a server. In the cooperation, the user takes the plant states as private information and desires to maximize the privacy preservation while ensuring that the server still provides a certain level of control performance. Moreover, the user requires a privacy scheme that is used locally and is unknown to the server, so that it can create a deviation in the server’s knowledge of the states from the true value. To achieve this, we propose two privacy schemes localized at the user side, which inject perturbations in the innovation data sent to the server. For both schemes, firstly, we analyze the privacy preservation quality provided by the scheme and the performance loss in the LQG control caused by it. Secondly, based on the trade-off between them, we propose an optimization problem. Thirdly, we propose a recovery procedure by which the control performance is recovered to the optimal one, i.e., the privacy preservation is achieved without any performance loss in control. Finally, simulations are provided, and we give discussions on the two schemes based on the simulation results.
|
|
ThA14 Regular Session, Roselle Junior 4612 |
Add to My Program |
Identification III |
|
|
Chair: Smith, Roy S. | ETH Zurich |
Co-Chair: Oomen, Tom | Eindhoven University of Technology |
|
10:00-10:20, Paper ThA14.1 | Add to My Program |
Beyond Nyquist in Frequency Response Function Identification: Applied to Slow-Sampled Systems |
|
van Haren, Max | Eindhoven University of Technology |
Mirkin, Leonid | Technion - IIT |
Blanken, Lennart | Eindhoven University of Technology |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Identification, Sampled-data control
Abstract: Fast-sampled models are essential for control design, e.g., to address intersample behavior. The aim of this paper is to develop a non-parametric identification technique for fast-sampled models of systems that have relevant dynamics and actuation above the Nyquist frequency of the sensor, such as vision-in-the-loop systems. The developed method assumes smoothness of the frequency response function, which allows to disentangle aliased components through local models over multiple frequency bands. The method identifies fast-sampled models of slowly-sampled systems accurately in a single identification experiment. Finally, an experimental example demonstrates the effectiveness of the technique.
|
|
10:20-10:40, Paper ThA14.2 | Add to My Program |
Error Bounds for Kernel-Based Linear System Identification with Unknown Hyperparameters |
|
Yin, Mingzhou | ETH Zurich |
Smith, Roy S. | ETH Zurich |
Keywords: Identification, Statistical learning, Machine learning
Abstract: Applying regularization in reproducing kernel Hilbert spaces has been successful in linear system identification using stable kernel designs. From a Gaussian process perspective, it automatically provides probabilistic error bounds for the identified models from the posterior covariance, which are useful in robust and stochastic control. However, the error bounds require knowledge of the true hyperparameters in the kernel design. They can be inaccurate with estimated hyperparameters for lightly damped systems or in the presence of high noise. In this work, we provide reliable quantification of the estimation error when the hyperparameters are unknown. The bounds are obtained by first constructing a high-probability set for the true hyperparameters from the marginal likelihood function. Then the worst-case posterior covariance is found within the set. The proposed bound is proven to contain the true model with a high probability and its validity is demonstrated in numerical simulation.
|
|
10:40-11:00, Paper ThA14.3 | Add to My Program |
On Concentration Bounds for Bayesian Identification of Linear Non-Gaussian Systems |
|
Kim, Yeoneung | SeoulTech |
Kim, Gihun | Seoul National University |
Yang, Insoon | Seoul National University |
Keywords: Identification, Stochastic systems
Abstract: We adopt a Bayesian perspective to identify the unknown parameters of linear stochastic systems with possibly non-Gaussian disturbance distributions. The key idea of our algorithm is to alternately execute L randomly selected linear state-feedback controllers and keep track of a maximum a posteriori estimator. The proposed algorithm asymptotically achieves the concentration of posterior distributions around the true system parameters. We also derive probabilistic bounds for the concentration based on the classical results regarding the asymptotic properties of posterior distributions. An empirical demonstration is provided as well.
|
|
11:00-11:20, Paper ThA14.4 | Add to My Program |
SINDy-CRN: Sparse Identification of Chemical Reaction Networks from Data |
|
Bhatt, Nirav | Indian Institute of Technology Madras |
Jayawardhana, Bayu | University of Groningen |
Sanchez-Escalonilla, Santiago | University of Groningen |
Keywords: Identification, Systems biology, Nonlinear systems identification
Abstract: This work considers an important problem of identifying the dynamics of chemical reaction networks from time-series data. We propose an approach to identify complex chemical reaction networks (CRN) from concentration data using the concept of sparse model identification. Particularly, we demonstrate challenges associated with the application of the sparse identification of nonlinear dynamics (SINDy) and its variants to data obtained from CRNs. We develop a SINDy-CRN algorithm based on the properties of CRNs for identifying governing equations of a CRN. The proposed algorithm is illustrated using a numerical simulation example.
|
|
11:20-11:40, Paper ThA14.5 | Add to My Program |
Fast Algorithms for Identification of Time-Varying Systems with Both Smooth and Discontinuous Parameter Changes |
|
Niedzwiecki, Maciej | Gdansk University of Technology |
Gancza, Artur | Gdansk University of Technology |
Keywords: Identification, Time-varying systems, Stochastic systems
Abstract: The problem of noncausal identification of a time-varying linear system subject to both smooth and occasional jump-type changes is considered and solved using the preestimation technique combined with the basis function approach to modeling the variability of system parameters. The proposed estimation algorithms yield very good parameter tracking results and are computationally attractive.
|
|
11:40-12:00, Paper ThA14.6 | Add to My Program |
Identifying Single-Input Linear System Dynamics from Reachable Sets |
|
Shafa, Taha | University of Illinois Urbana-Champaign |
Dong, Roy | University of Illinois at Urbana-Champaign |
Ornik, Melkior | University of Illinois Urbana-Champaign |
Keywords: Identification, Uncertain systems, Modeling
Abstract: This paper is concerned with identifying linear system dynamics without the knowledge of individual system trajectories, but from the knowledge of the system's reachable sets observed at different times. Motivated by a scenario where the reachable sets are known from partially transparent manufacturer specifications or observations of the collective behavior of adversarial agents, we aim to utilize such sets to determine the unknown system's dynamics. This paper has two contributions. Firstly, we show that the sequence of the system's reachable sets can be used to uniquely determine the system's dynamics for asymmetric input sets under some generic assumptions, regardless of the system's dimensions. We also prove the same property holds up to a sign change for two-dimensional systems where the input set is symmetric around zero. Secondly, we present an algorithm to determine these dynamics. We apply and verify the developed theory and algorithms on an unknown band-pass filter circuit solely provided the unknown system's reachable sets over a finite observation period.
|
|
ThA15 Regular Session, Roselle Junior 4611 |
Add to My Program |
Robust Adaptive Control |
|
|
Chair: Ge, Shuzhi Sam | National University of Singapore |
Co-Chair: Miller, Daniel E. | University of Waterloo |
|
10:00-10:20, Paper ThA15.1 | Add to My Program |
Robust Adaptive Step-Tracking with Exponential Stability and Convolution Bounds Using Supervisory Control |
|
Lalumiere, Craig | University of Waterloo |
Miller, Daniel E. | University of Waterloo |
Keywords: Robust adaptive control, Adaptive control, Switched systems
Abstract: Supervisory Control has been shown to be a very effective approach to adaptive control which ensures step-tracking, exponential stability, and a degree of robustness to unmodelled dynamics. Here we apply the technique in the discrete-time setting and prove a new linear-like convolution bound on the effect of the noise/disturbance. This property is then leveraged to prove robustness to slow time-variations.
|
|
10:20-10:40, Paper ThA15.2 | Add to My Program |
Adaptive Robust Control Contraction Metrics: Transient Bounds in Adaptive Control with Unmatched Uncertainties |
|
Gessow, Samuel | University of California, Los Angeles |
Lopez, Brett | University of California - Los Angeles |
Keywords: Robust adaptive control, Indirect adaptive control, Adaptive control
Abstract: This work presents a new sufficient condition for synthesizing nonlinear controllers that yield bounded closed-loop tracking error transients despite the presence of unmatched uncertainties that are concurrently being learned online. The approach utilizes contraction theory and addresses fundamental limitations of existing approaches by allowing the contraction metric to depend on the unknown model parameters. This allows the controller to incorporate new model estimates generated online without sacrificing its strong convergence and bounded transients guarantees. The approach is specifically designed for trajectory tracking so the approach is more broadly applicable to adaptive model predictive control as well. Simulation results on a nonlinear system with unmatched uncertainties demonstrates the approach.
|
|
10:40-11:00, Paper ThA15.3 | Add to My Program |
Adaptive Output Regulation and the Use It or Lose It Principle |
|
Mejia Uzeda, Erick | University of Toronto |
Broucke, Mireille E. | Univ. of Toronto |
Keywords: Robust adaptive control, Output regulation, Adaptive control
Abstract: It is well-known in adaptive control that when regressors are not persistently exciting (PE), then parameter adaptation is not robust. A number of adhoc modifications of parameter adaptation laws were developed to overcome this problem. In this paper we examine the PE subspace, a geometric characterization of a regressor’s excitation, which allows a more intrinsic modification of parameter adaptation laws. Our modular method, the mu-modification, is premised on the Use it or Lose it Principle of neuroplasticity, stating that parameters not excited by a regressor may be forgotten. This paper develops these ideas in the context of adaptive output regulation, with attention to the geometric properties of the PE subspace under linear filtering, such as when using augmented errors.
|
|
11:00-11:20, Paper ThA15.4 | Add to My Program |
Unmatched Uncertainty Mitigation through Neural Network Supported Model Predictive Control |
|
Valverde Gasparino, Mateus | University of Illinois at Urbana Champaign |
Mishra, Prabhat Kumar | Massachusetts Institute of Technology |
Chowdhary, Girish | University of Illinois at Urbana Champaign |
Keywords: Robust adaptive control, Predictive control for linear systems, Neural networks
Abstract: This paper presents a deep learning based model predictive control (MPC) algorithm for systems with unmatched and bounded state-action dependent uncertainties of unknown structure. We utilize a deep neural network (DNN) as an oracle in the underlying optimization problem of learning based MPC (LBMPC) to estimate unmatched uncertainties. Generally, DNNs as oracle are considered difficult to employ with LBMPC due to the technical difficulties associated with the estimation of their coefficients in real time. We employ a dual-timescale adaptation mechanism, where the weights of the last layer of the neural network are updated in real time while the inner layers are trained on a slower timescale using the training data collected online and selectively stored in a buffer. Our results are validated through a numerical experiment on the compression system model of a jet engine. These results indicate that the proposed approach is implementable in real time and carries the theoretical guarantees of LBMPC.
|
|
11:20-11:40, Paper ThA15.5 | Add to My Program |
Adaptive Prescribed-Time Tracking Control for Fixed-Wing UAV with the Input Saturation and State Constraints (I) |
|
Zheng, Jiayi | National University of Defense Technology |
Zhao, Shulong | National University of Defense Technology |
Wang, Qipeng | National University of Defense Technology |
Wang, Xiangke | National University of Defense Technology |
Zhou, Han | National University of Defense Technology |
Keywords: Adaptive control, Neural networks, Flight control
Abstract: In this paper, we propose an adaptive prescribed-time control algorithm for the fixed-wing unmanned aerial vehicle (UAV). How to follow the desired trajectory within a predetermined time is a problem worth investigating in fixed-wing UAV tracking missions. To this end, a novel method based on time-varying state feedback and segmented neural network (SNN) is proposed, using practice prescribed-time input-to-state stable to guarantee the convergence of all signals in the prescribed time. Considering the input saturation and state constraints, we give the basis for selecting the prescribed time with different initial conditions, rather than an arbitrary one. Finally, the simulation shows that the proposed method can realize prescribed-time tracking control with input saturation, despite large initial states, and the magnitude of the control changes moderately.
|
|
11:40-12:00, Paper ThA15.6 | Add to My Program |
Synchronized Optimization with Prescribed Performance for High-Order Strict-Feedback System (I) |
|
Zhang, Yuxiang | National University of Singapore |
Liang, Xiaoling | National University of Singapore |
Li, Dongyu | BEIHANG UNIVERSITY |
Ge, Shuzhi Sam | National University of Singapore |
Lee, Tong Heng | National University of Singapore |
Keywords: Adaptive control, Automotive control, Learning
Abstract: This paper investigates synchronized optimization with prescribed performance for the strict-feedback system with time-synchronized convergence property, which is the highly essential performance desired in various real-world high-precision control applications. The prescribed performance is considered to keep the state-variables within a predefined region during the control period to meet the required system performance. To consider optimization performance while also concurrently attaining the time-synchronized properties simultaneously of each backstepping subsystem, optimized backstepping is utilized to establish the learning framework; wherein the norm-normalized sign function is appropriately incorporated in each backstepping subsystem, which generates the decomposition of the optimal system control and gradient term of the cost function with appropriate time-synchronized control items and unknown independently learning parts to be approximated with neural networks. With this decomposition design, the learning objective is transformed to adaptively explore the optimal control parameter in the admissible policy region. By additionally employing the adaptive dynamic programming technique, actor-critic method, and gradient-constrained method, the solution of the Hamilton-Jacobi-Bellman equation is iteratively approximated while the learnable parameter stays within the predefined region. The work here has the outcome of time-synchronized convergence which surpasses the usual typical developments in this class of problems considered. The proposed method is verified with the vehicle platoon problem to show its effectiveness in that the system preserves special properties of time-synchronized stability and control while optimizing the overall system control.
|
|
ThA16 Regular Session, Peony Junior 4512 |
Add to My Program |
Power Systems I |
|
|
Chair: Bosso, Alessandro | University of Bologna |
Co-Chair: Shi, Xiasheng | AnHui University |
|
10:00-10:20, Paper ThA16.1 | Add to My Program |
Nonlinear Stability Analysis of Distributed Self-Interleaving for Driving Signals in Multicellular Converters |
|
Bosso, Alessandro | University of Bologna |
Mannes Hillesheim, Miguel | NXP Semiconductors |
Cousineau, Marc | LAPLACE, Université De Toulouse, CNRS, INPT, UPS, 31071, Toulous |
Zaccarian, Luca | LAAS-CNRS |
Keywords: Power electronics, Stability of hybrid systems, Distributed control
Abstract: We analyze a self-interleaving circuitry for driving signals in multicellular converters based on an interconnection graph with a ring topology that induces desirable fault-tolerant features. Using nonlinear hybrid dynamical tools, we show that the dynamics of this electronic solution can be formulated as a system with a sampled-data feedback law emulating a first-order Kuramoto-like model. For this Kuramoto model, under general conditions on the coupling functions, we provide a Lyapunov-based proof of local asymptotic stability of the splay-state (interleaved) configuration. We then illustrate the relation with the emulation-based sampled-data scenario via simulation results.
|
|
10:20-10:40, Paper ThA16.2 | Add to My Program |
Observer-Based Switched Control of the Three Level Neutral Point Clamped Rectifier |
|
Doré, Manon | LAAS-CNRS |
Ariba, Yassine | INSA |
Garcia, Germain | LAAS-CNRS |
Keywords: Power electronics, Switched systems, Observers for nonlinear systems
Abstract: In this paper, an observer-based switched control law is proposed for the three level neutral point clamped (NPC) converter operating as a rectifier. Modeling the converter as a switched affine system, the proposed control is based on the well known argmin control law to track a varying state reference trajectory. A full-order observer is introduced to compute the control law with only the measure of the input and the output voltages. The control aims at tracking a state reference defined from a power analysis and three objectives are addressed: to stabilize the output at a given DC voltage, to ensure a unit power factor by having the input current and voltage on phase and to have balanced capacitor voltages on the output. Based on a unified modeling methodology, the control and the observer are easily derived from LMI conditions. An outer loop is added to regulate the output when constant perturbations are considered. The results are illustrated by simulations on MATLAB/Simulink.
|
|
10:40-11:00, Paper ThA16.3 | Add to My Program |
Model Predictive Control of Wind Turbines with Piecewise-Affine Power Coefficient Approximation |
|
Sterle, Arnold | Technische Universität Berlin |
Grapentin, Aaron | Technical University Berlin |
Hans, Christian Andreas | TU Berlin |
Raisch, Joerg | Technical University Berlin |
Keywords: Power generation, Predictive control for nonlinear systems, Optimal control
Abstract: In this paper, an offset-free bilinear model predictive control approach for wind turbines is presented. State-of-the-art controllers employ different control loops for pitch angle and generator torque which switch depending on wind conditions. In contrast, the presented controller is based on one unified control law that works for all wind conditions. The inherent nonlinearity of wind turbines is addressed through a piecewise-affine approximation, which is modelled in a mixed-integer fashion. The presented controller is compared to a state-of-the-art baseline controller in a numerical case study using OpenFAST. Simulation results show that the presented controller ensures accurate reference power tracking. Additionally, damage equivalent loads are reduced for higher wind speeds.
|
|
11:00-11:20, Paper ThA16.4 | Add to My Program |
Initialization-Free Distributed Constrained Optimization Algorithms with a Pre-Specified Time |
|
Shi, Xiasheng | AnHui University |
Mu, Chaoxu | Tianjin University |
Sun, Changyin | School of Automation, Southeast University |
Ren, Lu | Anhui University |
Su, Yanxu | Anhui University |
Keywords: Power generation
Abstract: The distributed constrained optimization problem over an undirected communication topology is investigated in this study. It focuses on addressing a global coupled equality constraint that applies to all agents. To tackle this problem, a distributed approach with arbitrary initialization is developed by virtue of the aperiodic sampling control idea and the consensus-based multi-agent system(MAS) technology. This approach is developed to address constrained optimization problems within a pre-specified time. In addition, this predefined time is freely defined by users and irrelevant to the initial states, control coefficients, and network structure of systems. The Lyapunov stability theory completes the convergence proof of the developed method. Then, the developed method is extended to handle distributed nonlinear constrained optimization problems. Finally, The availability of two developed methods is demonstrated through two simulation examples.
|
|
11:20-11:40, Paper ThA16.5 | Add to My Program |
Investigation of Sub-Synchronous Oscillation in HVDC-Connected PMSG-Based Offshore Wind Farm: Comprehensive Modeling and Analysis |
|
Zhang, Zhihao | Xi'an Jiaotong University |
Kou, Peng | Xi’an Jiaotong University |
Mei, Mingyang | Xi'an Jiaotong Univercity |
Tian, Runze | Xi'an Jiaotong University |
Zhang, Yuanhang | Xi'an Jiaotong University |
Liang, Deliang | Xi'an Jiaotong University |
Keywords: Power systems, Electrical machine control, Power generation
Abstract: This paper investigates the sub-synchronous oscillation occurring in high voltage direct current (HVDC)-connected permanent magnet synchronous generator (PMSG)-based offshore wind farm. To do so, firstly, a comprehensive system model is developed, which incorporates the dynamics of the PMSG-based wind energy conversion system (WECS), the ac collection grid, and the HVDC transmission system. Subsequently, small signal model of the comprehensive system is derived. Based on the small signal model, the critical system mode is obtained using modal analysis. Special attention is paid to the influence of sub-synchronous mode on the onshore grid. Moreover, the influence of controller parameters on the sub-synchronous mode is investigated, which facilitates the design of converter controllers in HVDC-integrated PMSG-based offshore wind farm. Additionally, the modal analysis results are verified through time-domain simulations.
|
|
11:40-12:00, Paper ThA16.6 | Add to My Program |
Mean Field Game for Strategic Bidding of Energy Consumers in Congested Distribution Networks |
|
Silani, Amirreza | Delft University of Technology |
Tindemans, Simon H. | TU Delft |
Keywords: Power systems, Game theory, Power generation
Abstract: The proliferation of batteries, photovoltaic cells and Electric Vehicles (EVs) in electric power networks can result in network congestion. A redispatch market that allows the Distribution System Operators (DSOs) to relieve congested networks by asking the energy consumers to adjust their scheduled consumption is an alternative to upgrading network capacity. However, energy consumers can strategically increase their bids on the day-ahead market in anticipation of payouts from the redispatch market. This behaviour, which is called increase-decrease gaming, can aggravate congestion and allow the energy consumers to extract windfall profits from the DSO. In this paper, we model the increase-decrease game for large populations of energy consumers in power networks using a mean field game approach. The agents (energy consumers) maximize their individual welfare on the day-ahead market with anticipation of the redispatch market, coupled via the electricity price. We show that there exists a Nash equilibrium for this game and use an algorithm that converges to the Nash equilibrium for the infinite population case.
|
|
ThA17 Invited Session, Peony Junior 4511 |
Add to My Program |
Inverse Problems in Control, Estimation and Reinforcement Learning |
|
|
Chair: Li, Yibei | Nanyang Technological University |
Co-Chair: Wahlberg, Bo | KTH Royal Institute of Technology |
Organizer: Li, Yibei | Nanyang Technological University |
Organizer: Wahlberg, Bo | KTH Royal Institute of Technology |
|
10:00-10:20, Paper ThA17.1 | Add to My Program |
Inverse Optimal Adaptive Prescribed Performance Control with Application to Compliant Actuator-Driven Robot Manipulators (I) |
|
Lu, Kaixin | National University of Singapore |
Han, Shuaishuai | National University of Singapore |
Jia, Xinyu | National University of Singapore |
Yu, Haoyong | National University of Singapore |
Keywords: Adaptive control, Optimal control, Nonlinear systems
Abstract: In this work, we formulate and solve the problem of inverse optimal adaptive prescribed performance control and consider its application to compliant actuator-driven robot manipulators. A definition and sufficient conditions for this problem are introduced and derived based on adaptive control Lyapunov function method. An auxiliary system is constructed and incorporated with prescribed performance bounds so as to design a new class of inverse optimal adaptive controllers for the control system. By exploring the links between inverse optimality and stability, it is proved that the proposed controller ensures both inverse optimality and prescribed transient performance of the control system. Above developments are illustrated via an application to robot manipulators driven by compliant actuators. The inverse optimal adaptive control problem for robot manipulators with guaranteed transient performance has not been addressed in the literature.
|
|
10:20-10:40, Paper ThA17.2 | Add to My Program |
Finite-Sample Bounds for Adaptive Inverse Reinforcement Learning Using Passive Langevin Dynamics (I) |
|
Snow, Luke | Cornell University |
Krishnamurthy, Vikram | Cornell University |
Keywords: Markov processes, Learning, Estimation
Abstract: Stochastic gradient Langevin dynamics (SGLD) are a useful methodology for sampling from probability distributions. This paper provides a finite sample analysis of a passive stochastic gradient Langevin dynamics algorithm (PSGLD) designed to achieve inverse reinforcement learning. By "passive", we mean that the noisy gradients available to the PSGLD algorithm (inverse learning process) are evaluated at randomly chosen points by an external stochastic gradient algorithm (forward learner). The PSGLD algorithm thus acts as a randomized sampler which recovers the cost function being optimized by this external process. Previous work has analyzed the asymptotic performance of this passive algorithm using stochastic approximation techniques; in this work we analyze the non-asymptotic performance. Specifically, we provide finite-time bounds on the 2-Wasserstein distance between the passive algorithm and its stationary measure, from which the reconstructed cost function is obtained.
|
|
10:40-11:00, Paper ThA17.3 | Add to My Program |
Inverse Kalman Filtering for Systems with Correlated Noises (I) |
|
Li, Yibei | Nanyang Technological University |
Hu, Xiaoming | Royal Institute of Technology |
Wahlberg, Bo | KTH Royal Institute of Technology |
Xie, Lihua | Nanyang Tech. Univ |
Keywords: Kalman filtering, Identification, Optimal control
Abstract: This paper focuses on two inverse problems of the Kalman filter in which the process and measurement noises are correlated. The unknown covariance matrix in a stochastic system is reconstructed from observations of its posterior beliefs. For the standard inverse Kalman filtering problem, a novel duality-based formulation is proposed, where a well-defined inverse optimal control (IOC) problem is solved instead. Identifiability of the underlying model is proved, and a least squares estimator is designed that is statistically consistent. The time-invariant case using the steady-state Kalman gain is further studied. Since this inverse problem is ill-posed, a canonical class of covariance matrices is constructed, which can be uniquely identified from the dataset with asymptotic convergence. Finally, the performances of the proposed methods are illustrated by numerical examples.
|
|
11:00-11:20, Paper ThA17.4 | Add to My Program |
A Data-Driven Approach for Inverse Optimal Control (I) |
|
Liang, Zihao | Purdue University |
Hao, Wenjian | Purdue University |
Mou, Shaoshuai | Purdue University |
Keywords: Optimal control, Autonomous systems
Abstract: This paper proposes a data-driven, iterative approach for inverse optimal control (IOC), which aims to learn the objective function of a nonlinear optimal control system given its states and inputs. The approach solves the IOC problem in a challenging situation when the system dynamics is unknown. The key idea of the proposed approach comes from the deep Koopman representation of the unknown system, which employs a deep neural network to represent observables for the Koopman operator. By assuming the objective function to be learned is parameterized as a linear combination of features with unknown weights, the proposed approach for IOC is able to achieve a Koopman representation of the unknown dynamics and the unknown weights in objective function together. Simulation is provided to verify the proposed approach.
|
|
11:20-11:40, Paper ThA17.5 | Add to My Program |
Diagnosing and Repairing Feature Representations under Distribution Shifts (I) |
|
Lourenço, Inęs | KTH Royal Institute of Technology |
Bobu, Andreea | University of California Berkeley |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Wahlberg, Bo | KTH Royal Institute of Technology |
Keywords: Robotics, Learning, Human-in-the-loop control
Abstract: Robots have been increasingly better at doing tasks for humans by learning from their feedback, but still often suffer from model misalignment due to missing or incorrectly learned features. When the features the robot needs to learn to perform its task are missing or do not generalize well to new settings, the robot will not be able to learn the task the human wants and, even worse, may learn a completely different and undesired behavior. Prior work shows how the robot can detect when its representation is missing some feature and can, thus, ask the human to be taught about the new feature; however, these works do not differentiate between features that are completely missing and those that exist but o not generalize to new environments. In the latter case, the robot would detect misalignment and simply learn a new feature, leading to an arbitrarily growing feature representation that can, in turn, lead to spurious correlations and incorrect learning down the line. In this work, we propose separating the two sources of misalignment: we propose a framework for determining whether a feature the robot needs is incorrectly learned and does not generalize to new environment setups vs. is entirely missing from the robot’s representation. Once we diagnose the source of error, we show how the human can initiate the realignment process for the model: if the feature is missing, we follow prior work for learning new features; however, if the feature exists but does not generalize, we use data augmentation to expand its training and, thus, complete the repair process. We demonstrate the proposed approach in experiments with a simulated 7DoF robot manipulator and physical human corrections.
|
|
11:40-12:00, Paper ThA17.6 | Add to My Program |
Reinforcement Learning-Based Operational Decision-Making in the Process Industry Using Multi-View Data (I) |
|
Liu, Chenliang | Central South University |
Wang, Yalin | Central South University |
Yang, Chunhua | Central South University |
Gui, Weihua | Central South University |
Keywords: Chemical process control, Control applications, Neural networks
Abstract: Owing to the frequent fluctuations encountered in raw material characteristics and operational conditions in the process industry, traditional data-driven approaches prove inadequate in adapting the adjustment of operational variables. Furthermore, the potential of multi-view data, including images, audio, and sensor data, remains underexploited in industrial processes. This study proposes an operational decision-making method based on feedstock-guided multi-view actor-critic (FMAC-ODM) using multi-view data to address these issues. This method utilizes the idea of reinforcement learning (RL) for enhanced decision-making. First, the problem of optimizing operational variables is reformulated into a continuous RL problem to acquire an improved decision-making policy aligned with the current operational conditions. Subsequently, the inclusion of feedstock properties in the state space is implemented to provide essential guidance for the decision-making process. Finally, in pursuit of a comprehensive understanding and bolstering the precision of the decision-making strategy, multi-view data sourced from the industrial site is harnessed as a surrogate for human observation. The effectiveness of the proposed decision-making method is substantiated through its practical application in the industrial flotation process.
|
|
ThA18 Regular Session, Peony Junior 4412 |
Add to My Program |
Nonlinear Systems IV |
|
|
Chair: Bollas, George | University of Connecticut |
Co-Chair: Como, Giacomo | Politecnico Di Torino |
|
10:00-10:20, Paper ThA18.1 | Add to My Program |
On the Existence and Uniqueness of Steady State Solutions of a Class of Dynamic Hydraulic Networks Via Actuator Placement |
|
Jeeninga, Mark | Lund University |
Machado Martínez, Juan Eduardo | University of Groningen |
Cucuzzella, Michele | University of Pavia |
Como, Giacomo | Politecnico Di Torino |
Scherpen, Jacquelien M.A. | University of Groningen |
Keywords: Fluid flow systems, Network analysis and control, Nonlinear systems
Abstract: In this paper, using tools from graph theory we provide verifiable necessary and sufficient conditions for the existence of a unique hydraulic equilibrium in district heating systems of meshed topology and containing multiple heat sources. Even though numerous publications have addressed the design of efficient algorithms for numerically finding hydraulic equilibria in the general context of water distribution networks, this is not the case for the analysis of existence and uniqueness. Moreover, most of the existing work dealing with these aspects exploit the equivalence between the nonlinear algebraic equations describing the hydraulic equilibria and the KKT conditions of a suitably defined nonlinear convex optimization problem. Differently, this paper proposes necessary and sufficient graph-theoretic conditions on the actuator placement for the existence and uniqueness of a hydraulic equilibrium, independent of the actuators' control objective. An example based on a representative district heating network is considered to illustrate the key aspects of our contribution, and an explicit formulation of the steady state solution is given for the case in which pressure drops through pipes are linear with respect to the flow rate.
|
|
10:20-10:40, Paper ThA18.2 | Add to My Program |
The Stabilization Condition for Interval Type-2 Fuzzy Systems Via Relaxed Membership-Parameter Matrix Inequalities |
|
Kim, Kyung Soo | POSTECH (Pohang Univ. of Sci. & Tech.) |
Park, PooGyeon | POSTECH (Pohang Univ. of Sci. & Tech.) |
Keywords: Fuzzy systems, Stability of nonlinear systems, LMIs
Abstract: This paper aims to investigate the relaxed stability condition for interval type-2 Takagi--Sugeno fuzzy systems via membership-parameter matrix inequalities. The membership framework of interval type-2 fuzzy sets is structured in convex polytopes with a straightforward method. The stabilization synthesis with a non-parallel distributed compensator controller incorporating lower and upper membership functions is achieved in the sense of a matrix. Moreover, this paper introduces the relaxation strategy for the orthogonal complement, effectively reducing the number of decision variables related to the linear matrix inequalities. In conclusion, examples are presented to demonstrate the effectiveness and applicability of the proposed methods.
|
|
10:40-11:00, Paper ThA18.3 | Add to My Program |
Bounded Extremum Seeking for Single-Variable Static Map with Large Measurement Delay Via Time-Delay Approach to Averaging |
|
Yang, Xuefei | Harbin Institute of Technology |
Fridman, Emilia | Tel-Aviv Univ |
Zhao, Bowen | Harbin Institute of Technology |
Keywords: Extremum seeking, Delay systems, Constrained control
Abstract: In this paper, we present a time-delay approach to gradient-based bounded extremum seeking (ES) with large measurement constant delay, for an unknown single-input static quadratic map. We assume that the extremum point and the Hessian H belong to known intervals, whereas the sign of H is known. We apply a time-delay approach to the bounded ES system and arrive at the neutral type system with a nominal linear delayed system. We present the latter system as a retarded one and employ variation of constants formula for practical stability analysis. Explicit conditions in terms of simple scalar inequalities depending on tuning parameters and delay are established to guarantee the practical stability of the bounded ES control systems. Given any delay and neighborhood of the extremum point and through the solution of the constructed inequalities, we find lower bounds on the dither period that ensures the practical stability.
|
|
11:00-11:20, Paper ThA18.4 | Add to My Program |
Discovery of Partial Differential Equation Models Using Symbolic Regression Via Genetic Programming |
|
Cohen, Benjamin | University of Connecticut |
Beykal, Burcu | University of Connecticut |
Bollas, George | University of Connecticut |
Keywords: Grey-box modeling, Machine learning, Nonlinear systems identification
Abstract: A framework for dynamic system model identification from scarce and noisy data is proposed. This framework uses symbolic regression via genetic programming with a gradient-based parameter estimation step to identify a differential equation model and its parameters from available system data. The effectiveness of the method is demonstrated by identifying four synthetic systems: an ideal plug flow reactor (PFR) with an irreversible chemical reaction, an ideal continuously stirred tank reactor (CSTR) with an irreversible chemical reaction, a system described by Burgers’ Equation, and an ideal PFR with a reversible chemical reaction. The results show that this framework can identify PDE models of systems from broadly spaced and noisy data. When the data was not sufficiently rich, the framework discovered a surrogate model that described the observations in equal or fewer terms than the true system model. Additionally, the method can select relevant physics terms to describe a system from a list of candidate arguments, providing valuable models for use in controls applications.
|
|
11:20-11:40, Paper ThA18.5 | Add to My Program |
Memory Saving State-Sharing Multi-Observer for a Class of Multi-Observer Based Algorithms |
|
Chong, Michelle | Eindhoven University of Technology |
Wakaiki, Masashi | Kobe Univeristy |
Hespanha, Joao P. | Univ. of California, Santa Barbara |
Keywords: Observers for nonlinear systems
Abstract: A multi-observer is a bank of observers which is used for state estimation in various applications. However, it has an implementation bottleneck when a large number of observers are required for the desired estimation performance. To overcome this problem, we propose the design method of a state-sharing multi-observer for a class of nonlinear systems. The state-sharing multi-observer is a single observer that integrates a bank of observers, and its state size is independent of the number of observers. We analyze the error of the state obtained from the state-sharing multi-observer, and then show its applicability to multi-observer based algorithms such as supervisory observers and in secure state estimation.
|
|
11:40-12:00, Paper ThA18.6 | Add to My Program |
Robust Control of Cascaded H-Bridge Multilevel Inverters for Grid-Tied PV Systems Subject to Faulty Conditions |
|
Katir, Hanane | ESE Laboratory, ENSEM of Casablanca, Hassan II University of Cas |
Abouloifa, Abdelmajid | ESE Lab, ENSEM of Casablanca, Hassan II University of Casablanc |
Elhoussin, Elbouchikhi | ISEN Yn Crea West |
Fekih, Afef | University of Louisiana at Lafayette |
Noussi, Karim | ESE Laboratory, ENSEM of Casablanca, University Hassan II of Cas |
El Aroudi, Abdelali | UNiversitat Rovira I Virgili |
Keywords: Energy systems, Fault tolerant systems, Robust control
Abstract: This paper deals with the design and implementation of a robust control approach for grid-tied PV systems. The main controller’s objectives are to inject the maximum available power to the grid, whilst guaranteeing power quality even under faulty conditions. To this end, a multi-loop regulator is designed for the Cascaded H-Bridge Multilevel Inverters (CHBMIs) by combining a sliding mode control strategy for maximum power point tracking and a Lyapunov approach for Power Factor Correction (PFC). Validation of the proposed approach using Matlab/ SimPowerSimscape environment confirmed the ability of the proposed approach to successfully accomplish its objectives in terms of references tracking and regulation under both matching and mismatching irradiation levels as well as faulty modes stemming from the photovoltaic (PV) panels and/or the dc-dc converters. Additionally, the proposed approach was shown to outperform conventional approaches and enable the continuous operation of the PV system under various failure modes affecting multiple PV panels and/or their associated dc-dc boost converters.
|
|
ThA19 Regular Session, Peony Junior 4411 |
Add to My Program |
Linear Parameter-Varying Systems |
|
|
Chair: Farhood, Mazen | Virginia Tech |
Co-Chair: Kon, Johan | Eindhoven University of Technology |
|
10:00-10:20, Paper ThA19.1 | Add to My Program |
Direct Data-Driven State-Feedback Control of General Nonlinear Systems |
|
Verhoek, Chris | Eindhoven University of Technology |
Koelewijn, Patrick | Eindhoven University of Technology |
Haesaert, Sofie | Eindhoven University of Technology |
Tóth, Roland | Eindhoven University of Technology |
Keywords: Linear parameter-varying systems, Adaptive control
Abstract: Through the use of the Fundamental Lemma for linear systems, a direct data-driven state-feedback control synthesis method is presented for a rather general class of nonlinear (NL) systems. The core idea is to develop a data-driven representation of the so-called velocity-form, i.e., the time-difference dynamics, of the NL system, which is shown to admit a direct linear parameter-varying (LPV) representation. By applying the LPV extension of the Fundamental Lemma in this velocity domain, a state-feedback controller is directly synthesized to provide asymptotic stability and dissipativity of the velocity-form. By using realization theory, the synthesized controller is realized as a NL state-feedback law for the original unknown NL system with guarantees of universal shifted stability and dissipativity, i.e., stability and dissipativity w.r.t. any (forced) equilibrium point, of the closed-loop behavior. This is achieved by the use of a single sequence of data from the system and a predefined basis function set to span the scheduling map. The applicability of the results is demonstrated on a simulation example of an unbalanced disc.
|
|
10:20-10:40, Paper ThA19.2 | Add to My Program |
Minimal Realizations of Input-Output Behaviors by LPV State-Space Representations with Affine Dependency |
|
Petreczky, Mihaly | UMR CNRS 9189, Ecole Centrale De Lille |
Tóth, Roland | Eindhoven University of Technology |
Mercčre, Guillaume | University of Poitiers |
Keywords: Linear parameter-varying systems, Algebraic/geometric methods, Identification
Abstract: The paper makes the first steps towards a behavioral theory of LPV state-space representations with an affine dependency on scheduling, by characterizing minimality of such state-space representations. It is shown that minimality is equivalent to observability, and that minimal realizations of the same behavior are isomorphic. Moreover, this isomorphism does not depend on the scheduling variable. Furthermore, controllability of behaviors is equivalent to span-reachability of their minimal state-space representations. Finally, we establish a formal relationship between minimality of LPV state-space representations with an affine dependence on scheduling and minimality of LPV state-space representations with a dynamic and meromorphic dependence on scheduling.
|
|
10:40-11:00, Paper ThA19.3 | Add to My Program |
Parameter-Varying Koopman Operator for Nonlinear System Modeling and Control |
|
Lee, Changyu | KAIST |
Park, Kiyong | KAIST |
Kim, Jinwhan | KAIST |
Keywords: Linear parameter-varying systems, Predictive control for nonlinear systems, Modeling
Abstract: This paper proposes a novel approach for modeling and controlling nonlinear systems with varying parameters. The approach introduces the use of a parameter-varying Koopman operator (PVKO) in a lifted space, which provides an efficient way to understand system behavior and design control algorithms that account for underlying dynamics and changing parameters. The PVKO builds on a conventional Koopman model by incorporating local time-invariant linear systems through interpolation within the lifted space. This paper outlines a procedure for identifying the PVKO and designing a model predictive control using the identified PVKO model. Simulation results demonstrate that the proposed approach improves model accuracy and enables predictions based on future parameter information. The feasibility and stability of the proposed control approach are analyzed, and their effectiveness is demonstrated through simulation.
|
|
11:00-11:20, Paper ThA19.4 | Add to My Program |
Robust Control of Discrete-Time Systems with Coefficient Matrices Given by Polytopic Martingales |
|
Kitahiro, Tomoya | Kyoto Univ |
Hosoe, Yohei | Kyoto University |
Hagiwara, Tomomichi | Kyoto Univ |
Keywords: Linear parameter-varying systems, Stochastic systems, LMIs
Abstract: This paper is concerned with robust control of discrete-time linear stochastic systems with coefficient matrices given by polytopic martingales. To the best of our knowledge, this class of stochastic systems have not been dealt with as a target of control due to the absence of required theory. For such systems, we discuss the following two types of approaches for robust stabilization: One is the proposed stochastic control approach using the martingale property of the coefficient matrices, and the other is a deterministic control approach without using the information. Through theoretical and numerical comparisons of the two approaches, we demonstrate the effectiveness of the proposed stochastic control approach in the sense of conservativeness.
|
|
11:20-11:40, Paper ThA19.5 | Add to My Program |
Control of Polytopic LPV Systems with Uncertain Initial Conditions |
|
Farhood, Mazen | Virginia Tech |
Keywords: Linear parameter-varying systems, Uncertain systems, Robust control
Abstract: This paper focuses on the control design and analysis for nonstationary linear parameter-varying systems with affine parameter dependence and uncertain initial conditions. The uncertain initial state and the disturbance input are allowed to reside in two separate norm balls. Convex analysis and synthesis conditions are derived, and a reachability analysis result for systems with pointwise-bounded inputs is developed, enabling the construction of ellipsoids in which the state or some output of interest lies at specified time instants. The usefulness of the proposed approach is demonstrated through an illustrative example involving a two-mass rotational system.
|
|
11:40-12:00, Paper ThA19.6 | Add to My Program |
Direct Learning for Parameter-Varying Feedforward Control: A Neural-Network Approach |
|
Kon, Johan | Eindhoven University of Technology |
van de Wijdeven, Jeroen | ASML Netherlands B.V |
Bruijnen, Dennis | Philips Engineering Solutions |
Tóth, Roland | Eindhoven University of Technology |
Heertjes, Marcel | Eindhoven University of Technology |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Mechatronics, Linear parameter-varying systems, Neural networks
Abstract: The performance of a feedforward controller is primarily determined by the extent to which it can capture the relevant dynamics of a system. The aim of this paper is to develop an input-output linear parameter-varying (LPV) feedforward parameterization and a corresponding data-driven estimation method in which the dependency of the coefficients on the scheduling signal are learned by a neural network. The use of a neural network enables the parameterization to compensate a wide class of constant relative degree LPV systems. Efficient optimization of the neural-network-based controller is achieved through a Levenberg-Marquardt approach with analytic gradients and a pseudolinear approach generalizing Sanathanan-Koerner to the LPV case. The performance of the developed feedforward learning method is validated in a simulation study of an LPV system showing excellent performance.
|
|
ThA20 Invited Session, Orchid Junior 4312 |
Add to My Program |
Stochastic and Distributed Models in Systems and Synthetic Biology |
|
|
Chair: Waldherr, Steffen | University of Vienna |
Co-Chair: Singh, Abhyudai | University of Delaware |
Organizer: Waldherr, Steffen | University of Vienna |
Organizer: Singh, Abhyudai | University of Delaware |
|
10:00-10:20, Paper ThA20.1 | Add to My Program |
Robust Microphase Separation through Chemical Reaction Networks |
|
Blanchini, Franco | Univ. Degli Studi Di Udine |
Franco, Elisa | University of California a Los Angeles |
Giordano, Giulia | University of Trento |
Osmanovic, Dino | UCLA |
Keywords: Biological systems, Systems biology, Uncertain systems
Abstract: The interaction of phase-separating systems with chemical reactions is of great interest in various contexts, from biology to material science. In biology, phase separation is thought to be the driving force behind the formation of biomolecular condensates, i.e. organelles without a membrane that are associated with cellular metabolism, stress response, and development. RNA, proteins, and small molecules participating in the formation of condensates are also involved in a variety of biochemical reactions: how do the chemical reaction dynamics influence the process of phase separation? Here we are interested in finding chemical reactions that can arrest the growth of condensates, generating stable spatial patterns of finite size (microphase separation), in contrast with the otherwise spontaneous (unstable) growth of condensates. We consider a classical continuum model for phase separation coupled to a chemical reaction network (CRN), and we seek conditions for the emergence of stable oscillations of the solution in space. Given reaction dynamics with uncertain rate constants, but known structure, we derive easily computable conditions to assess whether microphase separation is impossible, possible for some parameter values, or robustly guaranteed for all parameter values within given bounds. Our results establish a framework to evaluate which classes of CRNs favor the emergence of condensates with finite size, a question that is broadly relevant to understanding and engineering life.
|
|
10:20-10:40, Paper ThA20.2 | Add to My Program |
Modeling Cell Size Distribution with Heterogeneous Flux Balance Analysis |
|
Busschaert, Michiel | KU Leuven |
Vermeire, Florence H. | KU Leuven |
Waldherr, Steffen | University of Vienna |
Keywords: Systems biology, Biological systems, Cellular dynamics
Abstract: For over two decades, Flux Balance Analysis (FBA) has been successfully used for predicting growth rates and intracellular reaction rates in microbiological metabolism. An aspect that is often omitted from this analysis, is segregation or heterogeneity between different cells. In this work, we propose an extended FBA method to model cell size distributions in balanced growth conditions. Hereto, a mathematical description of the concept of balanced growth in terms of cell mass distribution is presented. The cell mass distribution, quantified by the Number Density Function (NDF), is affected by cell growth and cell division. An optimization program is formulated in which the NDF, average cell culture growth rate and reaction rates per cell mass are treated as optimization variables. As qualitative proof of concept, the methodology is illustrated on a core carbon model of Escherichia coli under aerobic growth conditions. This illustrates feasibility and applications of this method, while indicating some shortcomings intrinsic to the simplified biomass structuring and the time invariant approach.
|
|
10:40-11:00, Paper ThA20.3 | Add to My Program |
Multicellular PD Control in Microbial Consortia |
|
Martinelli, Vittoria | Universitŕ Degli Studi Di Napoli Federico II |
Salzano, Davide | Scuola Superiore Meridionale |
Fiore, Davide | University of Naples Federico II |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Biomolecular systems, Genetic regulatory systems, PID control
Abstract: We propose a multicellular implementation of a classical PD feedback controller to regulate gene expression in a microbial consortium. The implementation involves distributing the proportional and derivative control actions between two different cellular populations that can communicate with each other and regulate the output of a third target cellular population. We derive analytical conditions on biological parameters and control gains to adjust the system's static and dynamical properties. We then evaluate the strategy's performance and robustness through extensive in silico experiments in BSim, a realistic simulator of bacterial populations.
|
|
11:00-11:20, Paper ThA20.4 | Add to My Program |
Comparing Negative Feedback Mechanisms in Gene Expression: From Single Cells to Cell Populations (I) |
|
Zhang, Zhanhao | University of Delaware |
Nieto, Cesar | University of Delaware |
Singh, Abhyudai | University of Delaware |
Keywords: Systems biology, Hybrid systems, Nonlinear systems
Abstract: Negative feedback regulation is a well-known motif for suppressing deleterious fluctuations in gene product levels. We systematically compare two scenarios where negative feedback is either implemented in the protein production rate (regulated synthesis) or in the protein degradation rate (regulated degradation). Our results show that while in low-noise regimes both schemes are identical, they begin to show remarkable differences in high-noise regimes. Analytically solving for the probability distributions of the protein levels reveals that regulated synthesis is a better strategy to suppress random fluctuations while also minimizing protein levels dipping below a threshold. In contrast, regulated degradation is preferred if the goal is to minimize protein levels going beyond a threshold. Finally, we compare and contrast these distributions not only in a single cell over time but also in an expanding cell population where these effects can be buffered or exacerbated due to the coupling between expression and cell growth.
|
|
11:20-11:40, Paper ThA20.5 | Add to My Program |
Out-Of-Equilibrium Fluctuations Drive Correlations between Enzyme and Metabolic Product Levels (I) |
|
Borri, Alessandro | CNR-IASI |
Palumbo, Pasquale | University of Milano-Bicocca |
Singh, Abhyudai | University of Delaware |
Keywords: Systems biology, Biomolecular systems, Markov processes
Abstract: Enzyme-driven catalysis of a substrate into a product forms the fundamental backbone of cellular metabolic pathways. In the deterministic formulation of such a reaction scheme, the equilibrium level of the metabolic product is independent of the steady-state enzyme, so that any perturbation in enzyme levels causes a transient change in metabolic product levels that perfectly adapts to the original enzyme-independent steady state. In this work, we consider a stochastic formulation of the problem, where enzyme levels constantly fluctuate due to the inherently noisy gene expression process as well as to the extrinsic noise in substrate availability. Our results show that such out-of-equilibrium fluctuations can result in positive (or negative) enzyme-product and substrate-product correlations, whose behavior qualitatively and quantitatively changes in different scenarios characterized by perturbations of nominal parameters and variable noise levels.
|
|
11:40-12:00, Paper ThA20.6 | Add to My Program |
Error Bound for Hill-Function Approximations in a Class of Stochastic Transcriptional Network Models |
|
Hirsch, Dylan | Massachusetts Institute of Technology |
Grunberg, Theodore W. | Massachusetts Institute of Technology |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Biomolecular systems, Systems biology, Genetic regulatory systems
Abstract: Hill functions are often used in stochastic models of gene regulation to approximate the dependence of gene activity on the concentration of the transcription factor (TF) that regulates the gene. However, it is generally unknown how much error one may incur from this approximation. We investigate this question in the context of transcriptional networks (TNs). Under the assumption of rapid binding and unbinding of TFs with their gene targets, we bound the approximation error (in terms of the total variation distance) between a mass-action stochastic model and a corresponding model with Hill function propensities. To do so, we use a combination of singular perturbation theory and moment analysis for stochastic chemical reaction networks. We assume throughout that TFs regulate genes in a one-to-one fashion, each regulated gene produces a single TF, TFs do not multimerize, and each gene only has a single TF binding site. These results are pertinent for the modeling of TNs and may also carry relevance for more general biological processes.
|
|
ThA21 Regular Session, Orchid Junior 4311 |
Add to My Program |
Predictive Control for Nonlinear Systems I |
|
|
Chair: Saccani, Danilo | École Polytechnique Fédérale De Lausanne (EPFL) |
Co-Chair: Mohammadpour Velni, Javad | Clemson University |
|
10:00-10:20, Paper ThA21.1 | Add to My Program |
Model Predictive Control for Multi-Agent Systems under Limited Communication and Time-Varying Network Topology |
|
Saccani, Danilo | École Polytechnique Fédérale De Lausanne (EPFL) |
Fagiano, Lorenzo | Politecnico Di Milano |
Zeilinger, Melanie N. | ETH Zurich |
Carron, Andrea | ETH |
Keywords: Predictive control for nonlinear systems, Agents-based systems, Autonomous systems
Abstract: In control system networks, reconfiguration of the controller when agents are leaving or joining the network is still an open challenge, in particular when operation constraints that depend on each agent’s behavior must be met. Drawing our motivation from mobile robot swarms, in this paper, we address this problem by optimizing individual agent performance while guaranteeing persistent constraint satisfaction in presence of bounded communication range and time-varying network topology. The approach we propose is a model predictive control (MPC) formulation, building on multi-trajectory MPC (mt-MPC) concepts. To enable plug and play operations when the system is in closed-loop without the need of a request, the proposed MPC scheme predicts two different state trajectories in the same finite horizon optimal control problem. One trajectory drives the system to the desired target, assuming that the network topology will not change in the prediction horizon, while the second one ensures constraint satisfaction assuming a worst-case scenario in terms of new agents joining the network in the planning horizon. Recursive feasibility and stability of the closed-loop system during plug and play operations are shown. The approach effectiveness is illustrated with a numerical simulation.
|
|
10:20-10:40, Paper ThA21.2 | Add to My Program |
Nonlinear Data-Driven Predictive Control Using Deep Subspace Prediction Networks |
|
Lazar, Mircea | Eindhoven University of Technology |
Popescu, Mihai-Serban | Eindhoven University of Technology |
Schoukens, Maarten | Eindhoven University of Technology |
Keywords: Predictive control for nonlinear systems, Data driven control, Nonlinear systems identification
Abstract: Indirect data-driven predictive control (DPC) algorithms for nonlinear systems typically employ multi-step predictors, which are identified from input-output data using neural networks. In this paper we put forward a unifying multi-step prediction network architecture, i.e., the deep subspace prediction network (DSPN). We then prove that the DSPN architecture specialized to multi-layer-perceptron neural networks recovers the linear predictor corresponding to subspace predictive control for a sufficient number of hidden layer neurons. Hence, we establish a well-posed generalization of subspace predictive control for nonlinear systems. Moreover, we develop a regularized DSPN architecture that embeds a linear subspace predictor to improve extrapolation properties for non-training data. Simulation results on a benchmark inverted pendulum show that nonlinear DPC based on DSPN achieves high control performance for both noiseless and noisy data.
|
|
10:40-11:00, Paper ThA21.3 | Add to My Program |
Discrete-Time Control Barrier Functions for Guaranteed Recursive Feasibility in Nonlinear MPC: An Application to Lane Merging |
|
Katriniok, Alexander | Eindhoven University of Technology |
Shakhesi, Erfan | Eindhoven University of Technology |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Predictive control for nonlinear systems, Automotive systems, Optimal control
Abstract: In this paper, we present conditions under which the terminal ingredients, defined by discrete-time control barrier function (DTCBF) certificates, guarantee recursive feasibility in nonlinear MPC. Further, we introduce the notion of quasi-DTCBF (qDTCBF) certificates. Compared to DTCBFs, qDTCBF conditions can be satisfied with tighter control input bounds, which is highly advantageous if only limited actuation is possible. Both certificates encourage an earlier reaction of the control system and result in a lower cumulative MPC cost. The methodology is applied to a lane merging problem in automated driving, in which DTCBF and qDTCBF certificates subject to input constraints form the terminal ingredients to guarantee recursive feasibility of the nonlinear MPC scheme. A simulation study demonstrates the efficacy of the concept.
|
|
11:00-11:20, Paper ThA21.4 | Add to My Program |
Mixed-Integer MPC Strategies for Fueling and Density Control in Fusion Tokamaks |
|
Orrico, Christopher Anthony | Eindhoven University of Technology |
van Berkel, Matthijs | Dutch Institute for Fundamental Energy Research |
Bosman, Thomas | DIFFER |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Krishnamoorthy, Dinesh | TU Eindhoven |
Keywords: Predictive control for nonlinear systems, Hybrid systems, Constrained control
Abstract: Model predictive control (MPC) is promising for fueling and core density feedback control in nuclear fusion tokamaks, where the primary actuators, frozen hydrogen fuel pellets fired into the plasma, are discrete. Previous density feedback control approaches have only approximated pellet injection as a continuous input due to the complexity that it introduces. In this letter, we model plasma density and pellet injection as a hybrid system and propose two MPC strategies for density control: mixed-integer (MI) MPC using a conventional mixed-integer programming (MIP) solver and MPC utilizing our novel modification of the penalty term homotopy (PTH) algorithm. By relaxing the integer requirements, the PTH algorithm transforms the MIP problem into a series of continuous optimization problems, reducing computation complexity. By adding a logarithmic barrier term to the PTH algorithm, we prevent the concave penalty term and active path constraints from causing the optimization problem to yield non-integer solutions. Both strategies perform well with regards to reference tracking without violating path constraints and satisfy the computation time limit for real-time control of the pellet injection system. However, the computation time of the PTH-based MPC strategy consistently outpaces the conventional MI-MPC strategy and is especially beneficial for MPC formulations with longer prediction horizons.
|
|
11:20-11:40, Paper ThA21.5 | Add to My Program |
Relaxed Feasibility and Stability Criteria for Flexible-Step MPC |
|
Fuernsinn, Annika | Queen's University |
Ebenbauer, Christian | RWTH Aachen University |
Gharesifard, Bahman | University of California, Los Angeles |
Keywords: Predictive control for nonlinear systems, Lyapunov methods, Stability of nonlinear systems
Abstract: We provide extensions to the new flexible-step model predictive control (MPC) scheme, which is based on the idea of generalized discrete-time control Lyapunov functions. These facilitate the implementation of a flexible number of control inputs in each iteration of the MPC scheme. We present relaxed recursive feasibility and stability results and provide a converse Lyapunov result. These results combined simplify the design of the flexible-step MPC scheme. We demonstrate the capabilities of the flexible-step MPC algorithm for a nonholonomic system, where the standard one-step implementation may suffer from lack of asymptotic convergence.
|
|
11:40-12:00, Paper ThA21.6 | Add to My Program |
A Hybrid Neural Network Approach for Adaptive Scenario-Based Model Predictive Control in the LPV Framework |
|
Bao, Yajie | The University of Georgia |
Mohammadpour Velni, Javad | Clemson University |
Keywords: Predictive control for nonlinear systems, Machine learning, Linear parameter-varying systems
Abstract: This paper presents a hybrid neural network (NN) approach for adaptive scenario-based model predictive control (SMPC) design of nonlinear systems in the linear parameter-varying (LPV) framework. In particular, a deterministic artificial neural network (ANN)-based LPV model is learned from data as the nominal model. Then, a Bayesian NN (BNN) is used to describe the mismatch between the plant and the LPV-ANN model. Adaptive scenarios are generated online based on the BNN model to reduce the conservativeness of scenario generation. Moreover, a probabilistic safety certificate is incorporated into the scenario generation by ensuring that the trajectories of scenarios contain the trajectory of the system and that all the scenarios satisfy the constraints with a high probability. Furthermore, conditions for the recursive feasibility of the SMPC are given. Experiments on the closed-loop simulations of a two-tank system demonstrate that the proposed approach can better model the behaviors of nonlinear systems than sole ANN/BNN models can, and the SMPC based on the hybrid NN (HyNN) model can improve the control performance compared to the SMPC with a fixed scenario tree.
|
|
ThA22 Regular Session, Orchid Junior 4212 |
Add to My Program |
Stochastic Systems I |
|
|
Chair: Nesic, Dragan | University of Melbourne |
Co-Chair: Chertkov, Michael | University of Arizona |
|
10:00-10:20, Paper ThA22.1 | Add to My Program |
Stability Bounds for Learning-Based Adaptive Control of Discrete-Time Multi-Dimensional Stochastic Linear Systems with Input Constraints |
|
Siriya, Seth | University of Melbourne |
Zhu, Jingge | University of Melbourne |
Nesic, Dragan | University of Melbourne |
Pu, Ye | The University of Melbourne |
Keywords: Stochastic systems, Adaptive control, Constrained control
Abstract: We consider the problem of adaptive stabilization for discrete-time, multi-dimensional linear systems with bounded control input constraints and unbounded stochastic disturbances, where the parameters of the system are unknown. To address this challenge, we propose a certainty-equivalent control scheme combining online parameter estimation with saturated linear control. We establish the existence of a high probability stability bound on the closed-loop system, under additional assumptions on the system and noise processes. Numerical examples are presented to illustrate our results.
|
|
10:20-10:40, Paper ThA22.2 | Add to My Program |
EVENT-DRIVEN L1-GAIN ASYNCHRONOUS FILTER of POSITIVE MARKOV JUMP SYSTEMS |
|
Zhang, Junfeng | Hainan University |
Yang, Yahao | Hainan University |
Huang, Mengxing | Hainan University |
Deng, Xuanjin | South China University of Technology |
Keywords: Stochastic systems, Fault detection, Hybrid systems
Abstract: This paper proposes event-driven asynchronous filters for positive Markov jump systems by employing hidden Markov model. Based on the output of sensor measurement, a weighted event-driven threshold is established in the form of 1-norm. The stability of the corresponding augmented system can be guaranteed by transforming the error signal into interval uncertain form. Under the established triggering condition, an event-driven positive l1-gain asynchronous filter is constructed for positive Markov jump systems. Then, the asynchronous filter design for positive Markov jump systems with partial information of hidden Markov model is further addressed. All presented conditions are described in the linear programming form. Finally, one example is given to illustrate the effectiveness of the proposed design.
|
|
10:40-11:00, Paper ThA22.3 | Add to My Program |
Spectral Decomposition in Kalman Filter Algorithm for Homogeneous Atomic Clock Ensembles |
|
Yan, Yuyue | Tokyo Institute of Technology |
Kawaguchi, Takahiro | Gunma University |
Yano, Yuichiro | National Institute of Information and Communications Technology |
Hanado, Yuko | National Institute of Information and Communications Technology |
Ishizaki, Takayuki | Tokyo Institute of Technology |
Keywords: Stochastic systems, Filtering, Computational methods
Abstract: Existing studies have pointed out numerical instability in the Kalman filter of atomic clocks, but the reasons for such instability have not been clarified mathematically. In this paper, we mathematically clarify the reason for the numerical instability by a new approach of spectral decomposition of the error covariance matrix in the Kalman filter. In particular, we reveal the fact that the error covariance matrix for homogeneous undetectable atomic clock ensembles can be decomposed into a diverging part and a converging part. Furthermore, the Kalman gain is solely influenced by the converging part, but not the diverging part, meaning that the Kalman gain converges to a steady-state value if ideal computation is possible without computation error. We present an alternative method to the conventional Kalman filter to avoid numerical instability and reduce computation cost where the covariance of Kalman filter can be computed rigorously only using three n-dimensional Riccati iterations instead of an nN-dimensional Riccati iterations for an n-order clock model with N clocks. A numerical example is provided to illustrate the efficacy of our approach.
|
|
11:00-11:20, Paper ThA22.4 | Add to My Program |
The First Achievement of a Given Level by a Random Process |
|
Semakov, Sergei | Moscow Institute of Physics and Technology and Moscow Automobile |
Semakov, Aleksei | Moscow Institute of Physics and Technology |
Semakov, Ivan | Moscow Aviation Institute, Tinkoff Bank |
Keywords: Stochastic systems, Flight control
Abstract: We estimate the probability that the first achievement of a given level by the component y_1(x) of n-dimensional continuous process y(x)={y_1(x),..., y_n(x)} occurs at some moment x* from a given interval (x',x") and, at this moment x*, the condition (y_2(x*),...,y_n(x*)∈D holds, where D is a given domain of (n−1)-dimensional Euclidean space R^{n−1}. The need to calculate the above-mentioned probability arises in the problems of aircraft control during landing.
|
|
11:20-11:40, Paper ThA22.5 | Add to My Program |
Universality and Control of Fat Tails |
|
Chertkov, Michael | University of Arizona |
Keywords: Stochastic systems, Fluid flow systems, Control of networks
Abstract: Motivated by applications in hydrodynamics and networks of thermostatically-control loads in buildings we study control of linear dynamical systems driven by additive and also multiplicative noise of a general position. Utilizing mathematical theory of stochastic multiplicative processes we present a universal way to estimate fat, algebraic tails of the state vector probability distributions. This prompts us to introduce and analyze mean-q-power stability criterion, generalizing the mean-square stability criterion, and then juxtapose it to other tools in control.
|
|
11:40-12:00, Paper ThA22.6 | Add to My Program |
Fraud Detection and Deterrence in Electronic Voting Machines: A Game-Theoretic Approach |
|
Vora, Anuj | IIT Bombay |
Kulkarni, Ankur A. | Indian Institute of Technology Bombay |
Keywords: Stochastic systems, Game theory, Attack Detection
Abstract: We study a setting where a detector wishes to detect and deter adversarial manipulation in an electronic voting machine. An adversary tries to win the election by tampering the votes while obfuscating its manipulation. We pose this problem as a game between the detector and the adversary and characterize the equilibrium payoffs for the players and the asymptotic nature of these payoffs. We find that if the detector is too cautious, then in equilibrium the adversary wins with a probability higher than its prior probability of winning. We derive an expression for the deterrence threshold, i.e., the minimum level of false-alarm that the detector should endure so that the adversary is not any better off by the manipulation. With this, asymptotically, the detector can ensure that the probability of missed-detection becomes zero by appropriately adjusting the rate of decay of probability of false-alarm. But if this rate of decay is too `fast', then the adversary can get an arbitrarily high probability of winning in spite of having a vanishing prior probability of winning. We then extend the results to a setting where the detector has incomplete information about the adversary.
|
|
ThA23 Invited Session, Orchid Junior 4211 |
Add to My Program |
Encrypted Control and Optimization |
|
|
Chair: Schulze Darup, Moritz | TU Dortmund University |
Co-Chair: Kim, Junsoo | SEOULTECH |
Organizer: Schulze Darup, Moritz | TU Dortmund University |
Organizer: Alexandru, Andreea B. | Duality Technologies |
Organizer: Kim, Junsoo | Seoul National University of Science and Technology |
|
10:00-10:20, Paper ThA23.1 | Add to My Program |
Optimal Controller and Security Parameter for Encrypted Control Systems under Least Squares Identification |
|
Teranishi, Kaoru | The University of Electro-Communications |
Kogiso, Kiminao | The University of Electro-Communications |
Keywords: Networked control systems, Information theory and control, Control over communications
Abstract: Encrypted control is a framework for the secure outsourcing of controller computation using homomorphic encryption that allows to perform arithmetic operations on encrypted data without decryption. In a previous study, the security level of encrypted control systems was quantified based on the difficulty and computation time of system identification. This study investigates an optimal design of encrypted control systems when facing an attack attempting to estimate a system parameter by the least squares method from the perspective of the security level. This study proposes an optimal H2 controller that maximizes the difficulty of estimation and an equation to determine the minimum security parameter that guarantee the security of an encrypted control system as a solution to the design problem. The proposed controller and security parameter are beneficial for reducing the computation costs of an encrypted control system, while achieving the desired security level. Furthermore, the proposed design method enables the systematic design of encrypted control systems.
|
|
10:20-10:40, Paper ThA23.2 | Add to My Program |
Homomorphically Encrypted Gradient Descent Algorithms for Quadratic Programming (I) |
|
Bertolace, André | University of Oxford |
Gatsis, Konstantinos | University of Oxford |
Margellos, Kostas | University of Oxford |
Keywords: Optimization algorithms, Numerical algorithms, Computer/Network Security
Abstract: In this paper, we evaluate the different fully homomorphic encryption schemes, propose an implementation, and numerically analyze the applicability of gradient descent algorithms to solve quadratic programming in a homomorphic encryption setup. The limit on the multiplication depth of homomorphic encryption circuits is a major challenge for iterative procedures such as gradient descent algorithms. Our analysis not only quantifies these limitations on prototype examples, thus serving as a benchmark for future investigations, but also highlights additional trade-offs like the ones pertaining the choice of gradient descent or accelerated gradient descent methods, opening the road for the use of homomorphic encryption techniques in iterative procedures widely used in optimization based control. In addition, we argue that, among the available homomorphic encryption schemes, the one adopted in this work, namely CKKS, is the only suitable scheme for implementing gradient descent algorithms. The choice of the appropriate step size is crucial to the convergence of the procedure. The paper shows firsthand the feasibility of homomorphically encrypted gradient descent algorithms.
|
|
10:40-11:00, Paper ThA23.3 | Add to My Program |
Oblivious Markov Decision Processes: Planning and Policy Execution (I) |
|
Alsayegh, Murtadha | Florida International University |
Fuentes, Jose | Florida International University |
Bobadilla, Leonardo | Florida International University |
Shell, Dylan | Texas A&M University |
Keywords: Control Systems Privacy, Markov processes, Robotics
Abstract: We examine a novel setting in which two parties have partial knowledge of the elements that make up a Markov Decision Process (MDP) and must cooperate to compute and execute an optimal policy for the problem constructed from those elements. This situation arises when one party wants to give a robot some task, but does not wish to divulge those details to a second party---while the second party possesses sensitive data about the robot's dynamics (information needed for planning). Both parties want the robot to perform the task successfully, but neither is willing to disclose any more information than is absolutely necessary. We utilize techniques from secure multi-party computation, combining primitives and algorithms to construct protocols that can compute an optimal policy while ensuring that the policy remains opaque by being split across both parties. To execute a split policy, we also give a protocol that enables the robot to determine what actions to trigger, while the second party guards against attempts to probe for information inconsistent with the policy's prescribed execution. In order to improve scalability, we find that basis functions and constraint sampling methods are useful in forming effective approximate MDPs. We report simulation results examining performance and precision, and assess the scaling properties of our Python implementation. We also describe a hardware proof-of-feasibility implementation using inexpensive physical robots, which, being a small-scale instance, can be solved directly.
|
|
11:00-11:20, Paper ThA23.4 | Add to My Program |
Hybrid Design of Multiplicative Watermarking for Defense against Malicious Parameter Identification (I) |
|
Zhang, Jiaxuan | Delft University of Technology |
Gallo, Alexander J. | TU Delft |
Ferrari, Riccardo M.G. | Delft University of Technology |
Keywords: Attack Detection, Cyber-Physical Security, Resilient Control Systems
Abstract: Multiplicative watermarking (MWM) is an active diagnosis technique for the detection of highly sophisticated at- tacks, but is vulnerable to malicious agents that use eavesdropped data to identify and then remove or replicate the watermark. In this work, we propose a scheme to protect the parameters of MWM, by proposing a design strategy based on piecewise affine (PWA) hybrid dynamical systems, called hybrid multiplicative watermarking (HMWM). Due to the design decision to make certain states of the HMWM systems unobservable, we show that parameter reconstruction by an eavesdropper is infeasible, from both a computational and a system-theoretic perspective, while not altering the system’s closed-loop performance.
|
|
11:20-11:40, Paper ThA23.5 | Add to My Program |
Feedback Path Delay Attacks and Detection (I) |
|
Wigren, Torbjorn | Uppsala University |
Teixeira, André M. H. | Uppsala University |
Keywords: Attack Detection, Identification, Cyber-Physical Security
Abstract: The paper discusses delay injection attacks on regulator loops and suggests joint recursive prediction error identification of delay and dynamics for supervision and attack detection. The control system is assumed to be operated either in open- or closed-loop mode. It is shown why delay insertion in the feedback path before the user switches to closed-loop operation is advantageous to disguise the attack. Delay attack monitoring is preferably performed continuously, allowing for early attack detection before closed-loop control is initiated. The detection performance is evaluated numerically for a linearized automotive cruise control feedback loop.
|
|
11:40-12:00, Paper ThA23.6 | Add to My Program |
On the Security of Randomly Transformed Quadratic Programs for Privacy-Preserving Cloud-Based Control (I) |
|
Binfet, Philipp | TU Dortmund University |
Schlüter, Nils | TU Dortmund University |
Schulze Darup, Moritz | TU Dortmund University |
Keywords: Control Systems Privacy, Cyber-Physical Security, Optimization
Abstract: Control related data, such as system states and inputs or controller specifications, is often sensitive. Meanwhile, the increasing connectivity of cloud-based or networked control results in vast amounts of such data, which poses a privacy threat, especially when evaluation on external platforms is considered. In this context, a cipher based on a random affine transformation gained attention, which is supposed to enable privacy-preserving evaluations of quadratic programs (QPs) with little computational overhead compared to other methods. This paper deals with the security of such randomly transformed QPs in the context of model predictive control (MPC). In particular, we show how to construct attacks against this cipher and thereby underpin concerns regarding its security in a practical setting. To this end, we exploit invariants under the transformations and common specifications of MPC-related QPs. Our numerical examples then illustrate that these two ingredients suffice to extract information from ciphertexts.
|
|
ThA24 Invited Session, Orchid Main 4201AB |
Add to My Program |
Event-Triggered and Self-Triggered Control II |
|
|
Chair: Tallapragada, Pavankumar | Indian Institute of Science |
Co-Chair: Xie, Yijing | University of Texas at Arlington |
Organizer: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Hirche, Sandra | Technische Universität München |
Organizer: Nowzari, Cameron | George Mason University |
|
10:00-10:20, Paper ThA24.1 | Add to My Program |
Performance Implications of Different P-Norms in Level-Triggered Sampling (I) |
|
Meister, David | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Networked control systems, Sampled-data control, Control over communications
Abstract: This work studies the performance of an event-based control approach, namely level-triggered sampling, in a standard multidimensional single-integrator setup. We falsify a conjecture from the literature that the deployed p-norm in the triggering condition supposedly has no impact on the performance of the sampling scheme in that setting. In particular, we show for the considered setup that the usage of the maximum norm instead of the Euclidean norm induces a performance deterioration of level-triggered sampling for sufficiently large system dimensions, when compared to periodic control at the same sampling rate. Moreover, we investigate the performance for other p-norms in simulation and observe that it degrades with increasing p. In addition, our findings reveal the previously unknown role of the triggering rule in the cause of a recently discovered phenomenon: Previous work has shown for a single-integrator consensus setup that the commonly observed performance advantage of event-based control over periodic control can be lost in distributed settings with a cooperative control goal. In our work, we obtain similar results for a non-cooperative setting only by adjusting the norm in the level-triggered sampling scheme. We therefore demonstrate that the performance degradation found in the distributed setting originates from the triggering rule and not from the considered cooperative control goal.
|
|
10:20-10:40, Paper ThA24.2 | Add to My Program |
Event-Triggered Distributed Optimization Algorithm Over Directed Networks: A Nonsingular Estimator Approach (I) |
|
Xian, Chengxin | Northwestern Polytechnical University |
Tao, Qianle | Northwestern Polytechnical University |
Liu, Yongfang | Northwestern Polytechnical University |
Wang, Huimin | Northeastern University |
Zhao, Yu | Northwestern Polytechnical University |
Keywords: Distributed control, Control of networks, Optimization algorithms
Abstract: This paper investigates the event-triggered distributed optimization problems (ETDOPs) over strongly connected directed networks. By assigning an additional scalar state variable to each agent and utilizing diminishing time-varying gain/step-size, a class of modified event-triggered distributed optimization algorithms (ETDOAs) is proposed, which can address the ETDOPs well and can avoid the inverse operation of some estimators in the existing literature. Compared with the existing DOAs, this paper gives a new idea to solve the DOPs under weighted-unbalanced digraphs and continuous communication of agent networks is avoided. Finally, numerical simulations are given to illustrate the effectiveness of the proposed ETDOAs.
|
|
10:40-11:00, Paper ThA24.3 | Add to My Program |
Distributed Dynamic Event-Triggered Communication Mechanisms for Dynamic Average Consensus (I) |
|
Qian, Yangyang | University of Virginia |
Xie, Yijing | University of Texas at Arlington |
Lin, Zongli | University of Virginia |
Wan, Yan | University of Texas at Arlington |
Shamash, Yacov | SUNY |
Keywords: Agents-based systems, Cooperative control, Distributed control
Abstract: This paper studies the dynamic average consensus problem of multi-agent systems under event-triggered communication. In this problem, each agent has access to a time-varying reference signal and aims to track the average of all reference signals. Distributed algorithms with event-triggered communication have been developed to achieve dynamic average consensus. Nevertheless, these existing event-triggered communication mechanisms cannot guarantee the existence of a designable positive minimum inter-event time (MIET), which is important in their practical implementation. Motivated by this observation, we propose a distributed dynamic event-triggered communication mechanism (ETCM) for each agent. It is shown that the proposed ETCM guarantees the existence of a positive MIET that is locally adjustable by tuning design parameters. It is also shown that the dynamic average consensus is achieved with any pre-specified level of accuracy. As an illustrative example, the theoretical results are applied to a networked battery energy storage system for state-of-charge balancing and desired total power tracking.
|
|
11:00-11:20, Paper ThA24.4 | Add to My Program |
Value of Information in Remote Estimation Subject to Delay and Packet Dropouts (I) |
|
Wang, Siyi | Technical University of Munich |
Hirche, Sandra | Technische Universität München |
Keywords: Networked control systems, Network analysis and control, Estimation
Abstract: Emerging cyber-physical systems impel the development of advanced network scheduling schemes to utilize communication and computation resources efficiently. This paper investigates the event-based schedule for remote state estimation in networked control systems (NCSs) subject to delay and packet dropouts. The scheduler decides whether or not to send out a local estimate according to the Value of Information (VoI) metric, which measures the relative importance of an information update. In addition, we model the triggering intervals as a Markov chain and analyze the tradeoff between the estimation performance and communication cost under the proposed VoI-based scheduling for the first-order system.
|
|
11:20-11:40, Paper ThA24.5 | Add to My Program |
Event-Triggered Parameterized Control for Stabilization of Linear Systems (I) |
|
Rajan, Anusree | Indian Institute of Science, Bangalore |
Tallapragada, Pavankumar | Indian Institute of Science |
Keywords: Control over communications, Networked control systems, Sampled-data control
Abstract: This paper proposes a new control method called event-triggered parameterized control (ETPC). We showcase this method by focusing on the specific problem of stabilization of linear systems. In this control method, between two consecutive events, each control input to the plant is a linear combination of a set of linearly independent scalar functions. At each event, the coefficients of the parameterized control input are chosen to minimize the error in approximating a model based control signal and then they are communicated to the actuator. We design two event-triggering rules that guarantee global asymptotic stability of the origin of the closed loop system under some conditions on the model uncertainty. We also show the existence of a uniform positive lower bound on the inter-event times. We illustrate our results through numerical examples. We compare the proposed control method with event-triggered zero-order-hold control and show a significant improvement in terms of the average inter-event times.
|
|
11:40-12:00, Paper ThA24.6 | Add to My Program |
Consistent Event-Triggered Consensus on Complete Graphs (I) |
|
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Meister, David | University of Stuttgart |
Namerikawa, Toru | Keio University |
Allgöwer, Frank | University of Stuttgart |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Agents-based systems, Stochastic optimal control, Networked control systems
Abstract: This paper starts by considering an optimal control formulation of the consensus problem on complete graphs with a cost capturing disagreement and agents modeled by integrators. An optimal control policy for this problem is shown to be the well-known consensus algorithm by which each agent resets its state to the average of its and other agents' state values received at every time step. The framework is extended to the case where agents can only exchange information periodically, with a period larger than one. Then an event-triggered control strategy is proposed that results in a better cost than that of the optimal periodic one with the same average transmission rate, that is, it is consistent. According to this strategy, each agent distributedly transmits its state if the error between its current state and a common consensus estimate based on previously transmitted agents' data exceeds a threshold.
|
|
ThA25 Invited Session, Lotus Junior 4DE |
Add to My Program |
Informational Perspectives in Control |
|
|
Chair: Ranade, Gireeja | Microsoft Research |
Co-Chair: Tanaka, Takashi | University of Texas at Austin |
Organizer: Ranade, Gireeja | University of California, Berkeley |
Organizer: Tanaka, Takashi | University of Texas at Austin |
|
10:00-10:20, Paper ThA25.1 | Add to My Program |
Online Variable-Length Source Coding for Minimum Bitrate LQG Control (I) |
|
Cuvelier, Travis | University of Texas at Austin |
Tanaka, Takashi | University of Texas at Austin |
Heath Jr., Robert W. | North Carolina State University |
Keywords: Networked control systems, Information theory and control, Control over communications
Abstract: We propose an adaptive coding approach to achieve linear-quadratic-Gaussian (LQG) control with near-minimum bitrate prefix-free feedback. Our approach combines a recent analysis of a quantizer design for minimum rate LQG control with work on universal lossless source coding for sources on countable alphabets. It was recently demonstrated that the aforementioned quantizer's outputs are an asymptotically stationary, ergodic process. To enable LQG control with provably near-minimum bitrate, the quantizer outputs must be encoded into binary codewords efficiently. This is possible given knowledge of the quantizations' probability distributions, or of their limiting distribution. Obtaining such knowledge is challenging; the distributions do not readily admit closed form descriptions. This motivates the application of universal source coding. Our main theoretical contribution in this work is a proof that (after an invertible transformation), the quantizer outputs are random variables that fall within an exponential or power-law envelope class (depending on the plant dimension). Using ideas from universal coding on envelope classes, we develop a practical, zero-delay, fixed precision source code for the quantizer outputs. We evaluate the performance of this approach numerically, and demonstrate competitive results with respect to fundamental tradeoffs between bitrate and LQG control performance.
|
|
10:20-10:40, Paper ThA25.2 | Add to My Program |
Controllability with a Finite Data-Rate of Switched Linear Systems (I) |
|
Scabin Vicinansa, Guilherme | The University of Melbourne |
Liberzon, Daniel | Univ of Illinois, Urbana-Champaign |
Keywords: Quantized systems, Switched systems, Information theory and control
Abstract: In this work, we argue that the usual notion of controllability is unfit for systems that operate with finite data-rate constraints. We deal with this issue by defining a new concept of controllability with finite data-rate. Then, we specialize our discussion to the case of switched linear systems. We state a necessary condition and a sufficient condition for our new controllability notion to hold. Next, we take advantage of the switched linear system's structure to present a simple sufficient condition for controllability with finite data-rate that only involves the controllable subspace of the individual modes and some mild assumptions about the switching signal that guarantee that our sufficient condition holds. We also present another sufficient condition for systems that activate some controllable mode often enough. In particular, we illustrate the power of this result by deriving relations between the sampling time and the Average Dwell-Time (ADT) of the switching signal that guarantee that the switched system is controllable with finite data-rate. Finally, we discuss the gap between the necessary and the sufficient conditions and show that the sufficient condition is not necessary.
|
|
10:40-11:00, Paper ThA25.3 | Add to My Program |
Experiment Design with Gaussian Process Regression with Applications to Chance-Constrained Control (I) |
|
Anderson, Sean | University of California Santa Barbara |
Byl, Katie | University of California at Santa Barbara |
Hespanha, Joao P. | Univ. of California, Santa Barbara |
Keywords: Learning, Data driven control, Identification for control
Abstract: Learning for control in repeated tasks allows for well-designed experiments to gather the most useful data. We consider the setting in which we use a data-driven controller that does not have access to the true system dynamics. Rather, the controller uses inferred dynamics based on the available information. In order to acquire data that is beneficial for this controller, we present an experimental design approach that leverages the current data to improve expected control performance. We focus on the setting in which inference on the unknown dynamics is performed using Gaussian processes. Gaussian processes not only provide uncertainty quantification but also allow us to leverage structures inherit to Gaussian random variables. Through this structure, we design experiments via gradient descent on the expected control performance with respect to the experiment input. In particular, we focus on a chance-constrained minimum expected time control problem. Numerical demonstrations of our approach indicate our experimental design outperforms relevant benchmarks.
|
|
11:00-11:20, Paper ThA25.4 | Add to My Program |
Reinforcement Learning for Zero-Delay Coding Over a Noisy Channel with Feedback (I) |
|
Cregg, Liam | Queen's University |
Alajaji, Fady | Queen's University |
Yuksel, Serdar | Queen's University |
Keywords: Information theory and control, Machine learning, Stochastic optimal control
Abstract: In Shannon’s classical information-theoretic lossy coding problem, one is allowed to encode long sequences of source symbols at once in order to achieve a lower distortion, which is optimal in the limit of unbounded block lengths. Such a block-coding approach is undesirable in many delay-sensitive applications, such as networked control, sensor networks and live-streaming, among others. Accordingly, we are interested in a variant of Shannon’s lossy coding problem, where one wishes to send an information source causally at a fixed rate with no delay over a channel with feedback, while minimizing the average distortion at the receiver. Thus, the classical block-coding approach is not viable. This problem has previously been studied using stochastic control techniques, leading to existence, structural, and general approximation results. However, these techniques do not provide closed-form solutions for either optimal performance or code designs, and they lead to algorithmic implementations that are computationally difficult. To address this problem, we propose a reinforcement learning approach by building on recent results on quantized Q-learning. We will consider the case of a finite-alphabet Markov source over a discrete memoryless channel. After developing some supporting technical results on regularity and stability properties of the associated Markov process, we rigorously justify convergence of a quantized Q-learning algorithm to a near-optimal policy for this problem. Finally, we illustrate our theoretical findings via simulations.
|
|
11:20-11:40, Paper ThA25.5 | Add to My Program |
Information Design in Bayesian Routing Games (I) |
|
Cianfanelli, Leonardo | Politecnico Di Torino |
Ambrogio, Alexia | Politecnico Di Torino |
Como, Giacomo | Politecnico Di Torino |
Keywords: Transportation networks, Game theory
Abstract: We study optimal information provision in transportation networks when users are strategic and the network state is uncertain. An omniscient planner observes the network state and discloses information to the users with the goal of minimizing the expected travel time at the user equilibrium. Public signal policies, including full-information disclosure, are known to be inefficient in achieving optimality. For this reason, we focus on private signals and restrict without loss of generality the analysis to signals that coincide with path recommendations that satisfy obedience constraints, namely users have no incentive in deviating from the received recommendation according to their posterior belief. We first formulate the general problem and analyze its properties for arbitrary network topologies and delay functions. Then, we consider the case of two parallel links with affine delay functions, and provide sufficient conditions under which optimality can be achieved by information design. Interestingly, we observe that the system benefits from uncertainty, namely it is easier for the planner to achieve optimality when the variance of the uncertain parameters is large. We then provide an example where optimality can be achieved even if the sufficient conditions for optimality are not met.
|
|
11:40-12:00, Paper ThA25.6 | Add to My Program |
Control of Systems with Multiplicative Observation Noise (I) |
|
Won, Moses | University of California, Berkeley |
Ranade, Gireeja | University of California, Berkeley |
Keywords: Information theory and control, Uncertain systems, Stability of linear systems
Abstract: We consider the control of a linear system observed over multiplicative-noise. Specifically, the controller must stabilize the system using a control action based on observations of the system state that have been multiplied by i.i.d. random variables. While there is a long history of work on this fundamental problem, much of it has focused on understanding the performance of linear controllers, and the optimal control strategy for such a system remains unknown. In this paper, we consider the case of uniform multiplicative observation noise, and provide a non-linear control strategy based on the maximum a-posteriori (MAP) estimator of the state. We explicitly compute the convergence rates of different moments of the system under this control strategy, and find that the MAP-based strategy outperforms the best memoryless linear strategy when the ``signal-to-noise'' ratio (SNR) of the multiplicative noise, i.e. the ratio of the mean to the standard deviation, is low. In the high SNR regime we see that the MAP strategy is also a linear memoryless strategy, however, it is suboptimal and is outperformed by the optimal linear controller.
|
|
ThA26 Regular Session, Orchid Main 4301AB |
Add to My Program |
Modeling |
|
|
Chair: Mironchenko, Andrii | University of Passau |
Co-Chair: Eising, Jaap | ETH Zurich |
|
10:00-10:20, Paper ThA26.1 | Add to My Program |
Live Systems of Varying Dimension: Modeling and Stability |
|
Mironchenko, Andrii | University of Passau |
Keywords: Modeling, Hybrid systems, Nonlinear systems
Abstract: A major limitation of the classical control theory is the assumption that the state space remains stationary in time. This prevents analyzing and even formalizing the stability and control problems for open multi-agent systems whose agents may enter or leave the network, industrial processes where the sensors or actuators may be exchanged frequently, smart grids, etc. In this work, we propose a framework of live systems that covers a rather general class of systems with a time-varying state space. We argue that input-to-state stability is a proper stability notion for this class of systems, and many of the classic tools and results, such as Lyapunov methods and superposition theorems, can be extended to this setting.
|
|
10:20-10:40, Paper ThA26.2 | Add to My Program |
Design of Limit-Cycle Oscillators with Prescribed Trajectories and Phase-Response Properties Via Phase Reduction and Floquet Theory |
|
Namura, Norihisa | Tokyo Institute of Technology |
Nakao, Hiroya | Tokyo Institute of Technology |
Keywords: Modeling, Model/Controller reduction
Abstract: We propose a method for designing stable limit-cycle oscillators with prescribed periodic trajectories and phase-response properties in general dimensions based on the phase reduction theory. The vector field of the oscillator is approximated by polynomials and their coefficients are optimized to satisfy required conditions. Linear stability of the periodic trajectory is ensured by imposing conditions on the eigenvalues of the monodromy matrix based on Floquet theory. We verify the validity of the proposed method by designing several types of oscillators with given properties. As an application, we design two oscillators with the same periodic trajectory but with different phase-response properties and show their distinct synchronization dynamics under the same periodic input.
|
|
10:40-11:00, Paper ThA26.3 | Add to My Program |
Interconnection Schemes in Modeling and Control |
|
Borja, Pablo | University of Plymouth |
Ferguson, Joel | University of Newcastle |
van der Schaft, Arjan | Univ. of Groningen |
Keywords: Modeling, Nonlinear output feedback
Abstract: Interconnection schemes are ubiquitous in physical systems. For instance, in multi-domain systems consisting of interconnected subsystems from different physical domains. Furthermore, the interconnection of two or more systems has also been exploited to analyze and control dynamical systems, especially passive ones. To this end, the most common interconnection structure is the negative feedback interconnection. However, this approach is unsuitable to directly couple the states of the subsystems in the overall system's energy as customarily occurs in physical systems. This letter provides two interconnection approaches that overcome this issue. Notably, it is shown that these interconnection structures are suitable for decomposing passive systems into the interconnection of simpler passive subsystems. Moreover, these interconnections schemes allow the interpretation of some existing nonlinear control approaches as the interconnection of a passive plant with a passive controller. Additionally, the interpretation of the proposed interconnection structures is provided via bond graphs.
|
|
11:00-11:20, Paper ThA26.4 | Add to My Program |
Attitude Dynamics Modelling: Fractional Consensus Approach |
|
Baranowski, Jerzy | AGH University of Science and Technology |
Bauer, Waldemar | AGH University of Science and Technology |
Dukała, Karolina | SWPS University of Social Sciences and Humanities |
Mozyrska, Dorota | Bialystok University of Technology |
Wyrwas, Malgorzata | Bialystok University of Technology |
Keywords: Modeling, Nonlinear systems, Stability of linear systems
Abstract: In this paper we propose a consensus model using fractional calculus, which is an emerging topic in multi-agent modeling. Fractional models have infinite memory and can be understood as a relatively simple extension of traditional calculus. We propose a model structure motivating it by psychological research. For such model we also provide a stability analysis allowing results on possibilities of consensus arising in the modelled group of agents. To achieve this, we use fractional difference equations, which illustrate our considerations for agent groups of increasing complexity.
|
|
11:20-11:40, Paper ThA26.5 | Add to My Program |
Synchronisation in Electrical Circuits with Memristors and Grounded Capacitors |
|
Huijzer, Anne-Men | University of Groningen |
van der Schaft, Arjan | Univ. of Groningen |
Besselink, Bart | University of Groningen |
Keywords: Modeling, Stability of nonlinear systems, Network analysis and control
Abstract: Motivated by neuromorphic computing applications, this paper considers electrical circuits comprising memristors and grounded capacitors, connected to external sources. By using the flux-charge domain modelling approach, we will derive an initial value problem describing the dynamic behaviour of this circuit. Given an initial value and a fixed input, we will show that the fluxes in this circuit converge to an equilibrium. Furthermore, we show that when the fluxes reach this equilibrium, we achieve voltage synchronisation, i.e. no more currents are flowing through the circuit. These results are emphasised in an illustration.
|
|
11:40-12:00, Paper ThA26.6 | Add to My Program |
A Controlled Mean Field Model for Chiplet Population Dynamics |
|
Nodozi, Iman | University of California, Santa Cruz |
Halder, Abhishek | Iowa State University |
Matei, Ion | Palo Alto Research Center |
Keywords: Modeling, Stochastic systems, Uncertain systems
Abstract: In micro-assembly applications, ensemble of chiplets immersed in a dielectric fluid are steered using dielectrophoretic forces induced by an array of electrode population. Generalizing the finite population deterministic models proposed in prior works for individual chiplet position dynamics, we derive a controlled mean field model for a continuum of chiplet population in the form of a nonlocal, nonlinear partial differential equation. The proposed model accounts for the stochastic forces as well as two different types of nonlocal interactions, viz. chiplet-to-chiplet and chiplet-to-electrode interactions. Both of these interactions are nonlinear functions of the electrode voltage input. We prove that the deduced mean field evolution can be expressed as the Wasserstein gradient flow of a Lyapunov-like energy functional. With respect to this functional, the resulting dynamics is a gradient descent on the manifold of joint population density functions with finite second moments that are supported on the position coordinates.
|
|
ThB01 Tutorial Session, Orchid Main 4202-4306 |
Add to My Program |
Control and Optimization for Autonomous Energy Systems |
|
|
Chair: Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Co-Chair: Zhao, Changhong | The Chinese University of Hong Kong |
Organizer: Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Organizer: Cavraro, Guido | National Renewable Energy Laboratory |
|
13:30-13:50, Paper ThB01.1 | Add to My Program |
Tutorial on Congestion Control in Multi-Area Transmission Grids Via Online Feedback Equilibrium Seeking (I) |
|
Belgioioso, Giuseppe | ETH Zürich |
Bolognani, Saverio | ETH Zurich |
Pejrani, Giulia | ETHz |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Power systems, Game theory, Optimization
Abstract: Online feedback optimization (OFO) is an emerging control methodology for real-time optimal steady-state control of complex dynamical systems. This tutorial focuses on the application of OFO for the autonomous operation of large-scale transmission grids, with a specific goal of minimizing renewable generation curtailment and losses while satisfying voltage and current limits. When this control methodology is applied to multi-area transmission grids, where each area independently manages its congestion while being dynamically interconnected with the rest of the grid, a non-cooperative game arises. In this context, OFO must be interpreted as an online feedback equilibrium seeking (FES) scheme. Our analysis incorporates technical tools from game theory and monotone operator theory to evaluate the stability and robustness of multi-area grid operation. Through numerical simulations, we illustrate the key challenge of this non-cooperative setting: on the one hand, independent multi-area decisions are suboptimal compared to a centralized control scheme; on the other hand, some areas are heavily penalized by the centralized decision, which may discourage participation in the coordination mechanism.
|
|
13:50-14:10, Paper ThB01.2 | Add to My Program |
Time-Varying Feedback Optimization for Quadratic Programs with Heterogeneous Gradient Step Sizes (I) |
|
Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Comden, Joshua | National Renewable Energy Laboratory |
Chen, Yue | National Renewable Energy Laboratory |
Wang, Jing | National Renewable Energy Laboratory |
Keywords: Optimization algorithms, Adaptive control, Power systems
Abstract: Online feedback-based optimization has become a promising framework for real-time optimization and control of complex engineering systems. This paper surveys the recent advances in the field as well as provides novel convergence results for primal-dual online algorithms with heterogeneous step sizes for different elements of the gradient. The analysis is performed for quadratic programs and the approach is illustrated on applications for adaptive step-size and model-free online algorithms, in the context of optimal control of modern power systems.
|
|
14:10-14:30, Paper ThB01.3 | Add to My Program |
Balancing the Power Grid with Cheap Assets (I) |
|
Meyn, Sean P. | Univ. of Florida |
Lu, Fan | University of Florida |
Mathias, Joel | Arizona State University |
Keywords: Power systems, Smart grid, Optimal control
Abstract: We have all heard that there is growing need to secure resources to obtain supply-demand balance in a power grid facing increasing volatility from renewable sources of energy. There are mandates for utility scale battery systems in regions all over the world, and there is a growing science of "demand dispatch" to obtain virtual energy storage from flexible electric loads such as water heaters, air conditioning, and pumps for irrigation. The question addressed in this tutorial is how to manage a large number of assets for balancing the grid. The focus is on variants of the economic dispatch problem, which may be regarded as the "feed-forward" component in an overall control architecture. 1) The resource allocation problem is identical to a finite horizon optimal control problem with degenerate cost---so called "cheap control". This implies a form of state space collapse, whose form is identified: the marginal cost for each load class evolves in a two-dimensional subspace, spanned by a scalar co-state process and its derivative. 2) The implication to distributed control is remarkable. Once the co-state process is synthesized, this common signal may be broadcast to each asset for optimal control. However, the optimal solution is extremely fragile, in a sense made clear through results from numerical studies. 3) Several remedies are proposed to address fragility. One is described through "robust training" in a particular Q-learning architecture (one approach to reinforcement learning). In numerical studies it is found that specialized training leads to more robust control solutions.
|
|
14:30-14:50, Paper ThB01.4 | Add to My Program |
Tutorial on Dynamics and Control of Grid-Connected Power Electronics and Renewable Generation (I) |
|
Gross, Dominic | University of Wisconsin-Madison |
Keywords: Power systems, Power electronics, Power generation
Abstract: Electrical power systems are transitioning from fuel-based generation to renewable generation and transmission interfaced by power electronics. This transition challenges standard power system modeling, analysis, and control paradigms across timescales from milliseconds to seasons. This tutorial focuses on frequency stability on timescales of milliseconds to seconds. We first review basic results for grid-following (GFL) and grid-forming (GFM) control of voltage source converters (VSCs), typical renewable generation, and high voltage direct current (HVdc) transmission. In this context, it becomes apparent that GFL and GFM control functions are needed to operate emerging power systems. However, combining GFL resources, GFM resources, and legacy generation on the same system results in highly complex dynamics that are a significant obstacle to stability analysis. The remainder of the tutorial provides an overview of recent developments in universal GFM controls that bridge the gap between GFL and GFM control and provide a pathway to a coherent control and analysis framework accounting for power generation, power conversion, and power transmission.
|
|
14:50-15:10, Paper ThB01.5 | Add to My Program |
Modeling Unbalanced Power Flow with ∆-Connected Devices (I) |
|
Low, Steven | California Institute of Technology |
Keywords: Power systems, Smart grid
Abstract: In this tutorial we present a simple approach to modeling unbalanced three-phase power flows. We allow general non-ideal models of voltage sources and ZIP loads. The basic idea is to explicitly separate a device/transformer model into an internal model, that depends on the characteristics of the single-phase devices or transformers, and a conversion rule, that depends on their configuration. This allows us to exploit common structures across different device/transformer variants and derive their external models that are general and unified. We illustrate the model by formulating a three-phase optimal power flow problem as a quadratically constrained quadratic program.
|
|
15:10-15:30, Paper ThB01.6 | Add to My Program |
Convergence of Backward/Forward Sweep for Power Flow Solution in Radial Networks (I) |
|
Fang, Bohang | The Chinese University of Hong Kong |
Zhao, Changhong | The Chinese University of Hong Kong |
Low, Steven | California Institute of Technology |
Keywords: Power systems, Nonlinear systems
Abstract: Solving power flow is perhaps the most fundamental calculation related to the steady state behavior of alternating-current (AC) power systems. The normally radial (tree) topology of a distribution network induces a spatially recursive structure in power flow equations, which enables a class of efficient solution methods called backward/forward sweep (BFS). In this paper, we revisit BFS from a new perspective, focusing on its convergence. Specifically, we describe a general formulation of BFS, interpret it as a special Gauss-Seidel algorithm, and then illustrate it in a single-phase power flow model. We prove a sufficient condition under which the BFS is a contraction mapping on a closed set of safe voltages and thus converges geometrically to a unique power flow solution. We verify the convergence condition, as well as the accuracy and computational efficiency of BFS, through numerical experiments in IEEE test systems.
|
|
ThB02 Invited Session, Melati Main 4001AB-4104 |
Add to My Program |
Learning-Based Control IV: Data-Driven Controller Design |
|
|
Chair: Zeilinger, Melanie N. | ETH Zurich |
Co-Chair: Schoellig, Angela P | University of Toronto |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
13:30-13:50, Paper ThB02.1 | Add to My Program |
Data-Driven Model-Reference Control with Closed-Loop Stability: The Output-Feedback Case |
|
de Jong, Thomas O. | Eindhoven University of Technology |
Breschi, Valentina | Eindhoven University of Technology |
Schoukens, Maarten | Eindhoven University of Technology |
Formentin, Simone | Politecnico Di Milano |
Keywords: Identification for control, Output regulation
Abstract: We generalize a recently introduced data-driven approach for model-reference control design with closed-loop stability guarantees to the case of single-input single-output systems with inaccessible state. By considering a dynamic controller with fixed structure and leveraging a data-based description of the closed-loop dynamics, we propose a two-stage strategy for the optimization of the controller's parameters to match the desired closed-loop behavior. By means of a benchmark simulation example, we show the potential of the proposed approach and the impact of a simple strategy to handle noisy measurements.
|
|
13:50-14:10, Paper ThB02.2 | Add to My Program |
Model-Free Data-Driven Predictive Control Using Reinforcement Learning (I) |
|
Sawant, Shambhuraj | NTNU Trondheim |
Reinhardt, Dirk Peter | Norwegian University of Science and Technology |
Bahari Kordabad, Arash | Norwegian University of Science and Technology |
Gros, Sebastien | NTNU |
Keywords: Data driven control, Optimal control, Learning
Abstract: This paper proposes a novel approach for Predictive Control utilizing Reinforcement Learning (RL) and Data-Driven techniques to derive optimal control policies for real systems. Using pure input-output multi-step predictors based on Subspace Identification and RL techniques, the resulting predictive control scheme can approximate the optimal control policy of a system with high accuracy, even if the predictor cannot accurately capture the true system dynamics. One of the key contributions of the proposed approach is the extension of the framework connecting Model Predictive Control (MPC) and RL to one that does not require explicit state-space models, nor to define a notion of state at all. The paper demonstrates the efficacy of the proposed approach through an illustrative example, highlighting the ability of our approach to provide an optimal control policy for a real system without requiring any prior knowledge about its internal dynamics.
|
|
14:10-14:30, Paper ThB02.3 | Add to My Program |
The Fundamental Limitations of Learning Linear-Quadratic Regulators (I) |
|
Lee, Bruce | University of Pennsylvania |
Ziemann, Ingvar | University of Pennsylvania |
Tsiamis, Anastasios | ETH Zurich |
Sandberg, Henrik | KTH Royal Institute of Technology |
Matni, Nikolai | University of Pennsylvania |
Keywords: Identification for control, Statistical learning, Linear systems
Abstract: We present a local minimax lower bound on the excess cost of designing a linear-quadratic controller from offline data. The bound is valid for any offline exploration policy that consists of a stabilizing controller and an energy bounded exploratory input. The derivation leverages a relaxation of the minimax estimation problem to Bayesian estimation, and an application of van Trees inequality. We show that the bound aligns with system-theoretic intuition. In particular, we demonstrate that the lower bound increases when the optimal control objective value increases. We also show that the lower bound increases when the system is poorly excitable, as characterized by the spectrum of the controllability gramian of the system mapping the noise to the state and the H-infinity norm of the system mapping the input to the state. We further show that for some classes of systems, the lower bound may be exponential in the state dimension, demonstrating exponential sample complexity for learning the linear-quadratic regulator.
|
|
14:30-14:50, Paper ThB02.4 | Add to My Program |
On the Design of Persistently Exciting Inputs for Data-Driven Control of Linear and Nonlinear Systems |
|
Alsalti, Mohammad | Leibniz University Hannover |
Lopez, Victor G. | Leibniz University Hannover |
Müller, Matthias A. | Leibniz University Hannover |
Keywords: Identification for control
Abstract: In the context of data-driven control, persistence of excitation (PE) of an input sequence is defined in terms of a rank condition on the Hankel matrix of the input data. For nonlinear systems, recent results employed rank conditions involving collected input and state/output data, for which no guidelines are available on how to satisfy them a priori. In this paper, we first show that a set of discrete impulses is guaranteed to be persistently exciting for any controllable LTI system. Based on this result, for certain classes of nonlinear systems, we guarantee persistence of excitation of sequences of basis functions a priori, by design of the physical input only.
|
|
14:50-15:10, Paper ThB02.5 | Add to My Program |
Online Control for Adaptive Tapering of Medications (I) |
|
Gradu, Paula | UC Berkeley |
Recht, Benjamin | University of California, Berkeley |
Keywords: Healthcare and medical systems, Optimal control, Statistical learning
Abstract: We investigate adaptive protocols for the elimination or reduction of the use of medications or addictive substances. We formalize this problem as online optimization, minimizing the cumulative dose subject to constraints on individual well-being. We adapt a model of addiction from the psychology literature and show how it can be described by a class of linear time-invariant systems. For such systems, the optimal policy amounts to taking the smallest dose that maintains well-being. We derive a simple protocol based on integral control that requires no system identification, only needing approximate knowledge of the instantaneous dose response. This protocol is robust to model misspecification and is able to maintain an individual's well-being during the tapering process. Numerical experiments demonstrate that the adaptive protocol outperforms non-adaptive methods in terms of both maintenance of well-being and rate of dose reduction.
|
|
15:10-15:30, Paper ThB02.6 | Add to My Program |
Differentially Flat Learning-Based Model Predictive Control Using a Stability, State, and Input Constraining Safety Filter |
|
Hall, Adam W. | University of Toronto |
Greeff, Melissa | University of Toronto |
Schoellig, Angela P | University of Toronto |
Keywords: Predictive control for nonlinear systems, Machine learning, Robotics
Abstract: Learning-based optimal control algorithms control unknown systems using past trajectory data and a learned model of the system dynamics. These controllers use either a linear approximation of the learned dynamics, trading performance for faster computation, or nonlinear optimization methods, which typically perform better but can limit real-time applicability. In this work, we present a novel nonlinear controller that exploits differential flatness to achieve similar performance to state-of-the-art learning-based controllers but with significantly less computational effort. Differential flatness is a property of dynamical systems whereby nonlinear systems can be exactly linearized through a nonlinear input mapping. Here, the nonlinear transformation is learned as a Gaussian process and is used in a safety filter that guarantees, with high probability, stability as well as input and flat state constraint satisfaction. This safety filter is then used to refine inputs from a Flat Model Predictive Controller to perform constrained nonlinear learning-based optimal control through two successive convex optimizations. We compare our method to state-of-the-art learning-based control strategies and achieve similar performance, but with significantly better computational efficiency, while also respecting flat state and input constraints, and guaranteeing stability.
|
|
ThB03 Invited Session, Melati Junior 4010A-4111 |
Add to My Program |
Gaussian Process Based Optimization and Control |
|
|
Chair: Beckers, Thomas | Vanderbilt University |
Co-Chair: Bethge, Johanna | Otto-Von-Guericke University Magdeburg |
Organizer: Beckers, Thomas | Vanderbilt University |
Organizer: Bethge, Johanna | Otto-Von-Guericke University Magdeburg |
Organizer: Hirche, Sandra | Technische Universität München |
Organizer: Findeisen, Rolf | TU Darmstadt |
|
13:30-13:50, Paper ThB03.1 | Add to My Program |
Early Intention Prediction of Lane-Changing Based on Dual Gaussian-Mixed Hidden Markov Models (I) |
|
Li, Zheng | Tianjin University |
Wang, Yijing | Tianjin University |
Zuo, Zhiqiang | Tianjin University |
Liu, Zhengxuan | Tianjin University |
Chen, Yining | Tianjin University |
Li, Hongchao | Hebei University of Technology |
Keywords: Learning, Pattern recognition and classification
Abstract: Adjacent lane-changing is one of the most dangerous maneuvers which may lead to rear-end crash, uncomfortable braking and sharp steering. If the autonomous driving system can predict the potential latent lane-changing intentions of surrounding vehicles in advance, the driver will have more time to make reasonable response. In this paper, we focus on how to give accurate and reliable prediction for latent lanechangings, especially before the vehicles merge into the target lanes. A prediction model based on dual Gaussian-mixed hidden Markov models is developed to exploit the advantages of different features more effectively. Since there is no comprehensive criteria to evaluate the accuracy and predictability performance simultaneously, we propose two new metrics for quantitative analysis as supplement to the classical indicators. Comparative validation on Next Generation Simulation (NGSIM) database shows that our model has a high recognition accuracy of 93.05% for lane-changing intention with earlier prediction over the existing homologous methods.
|
|
13:50-14:10, Paper ThB03.2 | Add to My Program |
Robust Stability of Gaussian Process Based Moving Horizon Estimation (I) |
|
Wolff, Tobias M. | Leibniz University Hannover |
Lopez, Victor G. | Leibniz University Hannover |
Müller, Matthias A. | Leibniz University Hannover |
Keywords: Observers for nonlinear systems, Learning, Uncertain systems
Abstract: In this paper, we introduce a Gaussian process based moving horizon estimation (MHE) framework. The scheme is based on offline collected data and offline hyperparameter optimization. In particular, compared to standard MHE schemes, we replace the mathematical model of the system by the posterior mean of the Gaussian process. To account for the uncertainty of the learned model, we exploit the posterior variance of the learned Gaussian process in the weighting matrices of the cost function of the proposed MHE scheme. We prove practical robust exponential stability of the resulting estimator using a recently proposed Lyapunov-based proof technique. Finally, the performance of the Gaussian process based MHE scheme is illustrated via a nonlinear system.
|
|
14:10-14:30, Paper ThB03.3 | Add to My Program |
Learning a Gaussian Process Approximation of a Model Predictive Controller with Guarantees (I) |
|
Rose, Alexander | Technical University of Darmstadt |
Pfefferkorn, Maik | Otto-Von-Guericke-Universität Magdeburg |
Nguyen, Hoang Hai | TU Darmstadt |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for nonlinear systems, Learning, Optimization
Abstract: Model predictive control effectively handles complex dynamical systems with constraints, but its high computational demand often makes real-time application infeasible. We propose using Gaussian process regression to learn an approximation of the controller offline for online use. Our approach incorporates a robust predictive control scheme and provides bounds on approximation errors to ensure recursive feasibility and input-to-state stability. Exploiting a sampling-based scenario approach, we develop an efficient sampling strategy and guarantee that, with high probability, the approximation error remains within acceptable bounds. Our method demonstrates enhanced efficiency and reduced computational demand in an example application.
|
|
14:30-14:50, Paper ThB03.4 | Add to My Program |
Data-Driven Reachability Analysis for Gaussian Process State Space Models (I) |
|
Griffioen, Paul | University of California, Berkeley |
Arcak, Murat | University of California, Berkeley |
Keywords: Data driven control, Uncertain systems, Statistical learning
Abstract: Gaussian process state space models are becoming common tools for the analysis and design of nonlinear systems with uncertain dynamics. When designing control policies for these systems, safety is an important property to consider. In this paper, we provide safety guarantees by computing finite-horizon forward reachable sets for Gaussian process state space models. We use data-driven reachability analysis to provide exact probability measures for state trajectories of arbitrary length, even when no data samples are available. We investigate two numerical examples to demonstrate the power of this approach, such as providing highly non-convex reachable sets and detecting holes in the reachable set.
|
|
14:50-15:10, Paper ThB03.5 | Add to My Program |
Safe Explorative Bayesian Optimization - towards Personalized Treatments in Plasma Medicine (I) |
|
Chan, Kimberly J | University of California Berkeley |
Paulson, Joel | The Ohio State University |
Mesbah, Ali | University of California, Berkeley |
Keywords: Optimization, Process Control
Abstract: This paper considers the problem of Bayesian optimization (BO) for systems with safety-critical constraints. Recent work has shown that a theoretically consistent way to account for constraints in BO is to relax the constraint functions such that the feasible region has a high probability of containing the global solution. However, by construction, these approaches are unable to ensure safe/feasible operation at every query, which is unacceptable in safety-critical applications. Alternatively, safe BO methods force the query points to remain in the interior of a partially-revealed safety region, which may result in unacceptable (and unquantified) performance losses. This paper presents a new safe BO method that avoids these performance losses by systematically incorporating potential performance gains from enlargement of the safety region. The proposed method avoids getting stuck at suboptimal points based on a potentially small initial safety region due to limited initial exploration of the safety boundary. The performance of the proposed method is demonstrated for safe control of a cold atmospheric plasma jet towards personalized plasma medicine.
|
|
15:10-15:30, Paper ThB03.6 | Add to My Program |
Primal-Dual Contextual Bayesian Optimization for Control System Online Optimization with Time-Average Constraints (I) |
|
Xu, Wenjie | EPFL |
Jiang, Yuning | EPFL |
Svetozarevic, Bratislav | University of Zurich |
Jones, Colin N. | EPFL |
Keywords: Machine learning, Optimization
Abstract: This paper studies the problem of online performance optimization of constrained closed-loop control systems, where both the objective and the constraints are unknown black-box functions affected by exogenous time-varying contextual disturbances. A primal-dual contextual Bayesian optimization algorithm is proposed that achieves sublinear cumulative regret with respect to the dynamic optimal solution under certain regularity conditions. Furthermore, the algorithm achieves zero time-average constraint violation, ensuring that the average value of the constraint function satisfies the desired constraint. The method is applied to both sampled instances from Gaussian processes and a continuous stirred tank reactor parameter tuning problem; simulation results show that the method simultaneously provides close-to-optimal performance and maintains constraint feasibility on average. This contrasts current state-of-the-art methods, which either suffer from large cumulative regret or severe constraint violations for the case studies presented.
|
|
ThB04 Invited Session, Simpor Junior 4913 |
Add to My Program |
Learning and Control for Accessible, Safe, and Equitable Transportation |
|
|
Chair: Malikopoulos, Andreas A. | University of Delaware |
Co-Chair: Salazar, Mauro | Eindhoven University of Technology |
Organizer: Malikopoulos, Andreas A. | Cornell University |
Organizer: Cassandras, Christos G. | Boston University |
Organizer: Wu, Cathy | UC Berkeley |
|
13:30-13:50, Paper ThB04.1 | Add to My Program |
A Time-Invariant Network Flow Model for Two-Person Ride-Pooling Mobility-On-Demand (I) |
|
Paparella, Fabio | Eindhoven University of Technology |
Pedroso, Leonardo | Eindhoven University of Technology |
Hofman, Theo | Technische Universiteit Eindhoven |
Salazar, Mauro | Eindhoven University of Technology |
Keywords: Transportation networks, Traffic control, Optimization
Abstract: This paper presents a time-invariant network flow model capturing two-person ride-pooling that can be integrated within design and planning frameworks for Mobility-on-Demand systems. In these type of models, the arrival process of travel requests is described by a Poisson process, meaning that there is only statistical insight into request times, including the probability that two requests may be pooled together. Taking advantage of this feature, we devise a method to capture ride-pooling from a stochastic mesoscopic perspective. This way, we are able to transform the original set of requests into an equivalent set including pooled ones which can be integrated within standard network flow problems, which in turn can be efficiently solved with off-the-shelf LP solvers for a given ride-pooling request assignment. Thereby, to compute such an assignment, we devise a polynomial-time algorithm that is optimal w.r.t. an approximated version of the problem. Finally, we perform a case study of Sioux Falls, USA, where we quantify the effects that waiting time and experienced delay have on the vehicle-hours traveled. Our results suggest that the higher the demands per unit time, the lower the waiting time and delay experienced by users. In addition, for a sufficiently large number of demands per unit time, with a maximum waiting time and experienced delay of 5 minutes, more than 90% of the requests can be pooled.
|
|
13:50-14:10, Paper ThB04.2 | Add to My Program |
Credit-Based Congestion Pricing: Equilibrium Properties and Optimal Scheme Design (I) |
|
Jalota, Devansh | Stanford University |
Lazarus, Jessica | University of California, Berkeley |
Bayen, Alexandre | University of California, Berkeley |
Pavone, Marco | Stanford University |
Keywords: Transportation networks, Game theory, Optimization
Abstract: Credit-based congestion pricing (CBCP) has emerged as a mechanism to alleviate the social inequity concerns of road congestion pricing - a promising strategy for traffic congestion mitigation - by providing low-income users with travel credits to offset some of their toll payments. While CBCP offers immense potential for addressing inequity issues that hamper the practical viability of congestion pricing, the deployment of CBCP in practice is nascent, and the potential efficacy and optimal design of CBCP schemes have yet to be formalized. In this work, we study the design of CBCP schemes to achieve particular societal objectives and investigate their influence on traffic patterns when routing heterogeneous users with different values of time (VoTs) on a multi-lane highway with an express lane (EL). To this end, we introduce a new non-atomic congestion game model of a mixed-economy, wherein eligible users receive travel credits while the remaining ineligible users pay out-of-pocket to use the EL. In this setting, we investigate the effect of CBCP schemes on traffic patterns by characterizing the properties (i.e., existence, comparative statics) of the corresponding Nash equilibria and, in the setting when eligible users have time-invariant VoTs, develop a convex program to compute these equilibria. We further present a bi-level optimization framework to design optimal CBCP schemes to achieve a central planner’s societal objectives. Finally, we conduct numerical experiments based on a case study of the San Mateo 101 Express Lanes Project, one of the first CBCP pilots. Our results demonstrate the potential of CBCP to enable low-income users to avail of the travel time savings provided by congestion pricing on ELs while having comparatively low impacts on the travel costs of other road users.
|
|
14:10-14:30, Paper ThB04.3 | Add to My Program |
Decentralized Control of Intercity Electric Automated Buses Via Time-Varying Objective Prioritization (I) |
|
Pasquale, Cecilia | University of Genova |
Sacone, Simona | University of Genova |
Siri, Silvia | University of Genova |
Ferrara, Antonella | University of Pavia |
Keywords: Autonomous vehicles, Emerging control applications, Optimal control
Abstract: This paper considers electric automated buses traveling in inter-urban roads and following a given route including stops, that must be reached according to a given timetable. Some of these stops are provided with a charging infrastructure allowing to charge the bus batteries. The paper proposes a decentralized control scheme for determining the optimal speed profiles, the dwell and charging times of the buses, by taking into account the traffic conditions along the road through a suitable traffic flow prediction model. Two objectives are considered contemporarily: the minimization of the deviations from the timetable and the minimization of the energy lack at the end of the bus route. To attain both these conflicting objectives, a lexicographic approach is adopted to design the controller which considers that, depending on the system state, the priority of the two objectives can change. Accordingly, the proposed control scheme changes the objective prioritization in real time and switches between two different lexicographic-based optimal control solutions. Some tests are discussed in the paper to show the effectiveness of the proposed control scheme.
|
|
14:30-14:50, Paper ThB04.4 | Add to My Program |
Mean-Field Learning for Day-To-Day Departure Time Choice with Mode Switching (I) |
|
Wang, Ben | University of Michigan |
Luo, Qi | Clemson University |
Yin, Yafeng | University of Michigan |
Keywords: Transportation networks, Mean field games, Iterative learning control
Abstract: Understanding travelers' day-to-day departure time choice (DDTC) is vital for managing traffic congestion, especially in multi-modal transportation systems. While providing real-time traffic information and alternative trip plans brings convenience to travelers, their collective travel patterns may conversely lead to unstable traffic equilibrium states. We investigate a DDTC problem with mode switching in this paper. A group of heterogeneous agents can adaptively choose their modes and departure times to minimize total travel costs in a dynamic game. Using a customized hierarchical soft actor-critic (HSAC) algorithm with a continuum approximation of other agents, the traffic dynamics will converge to an approximate Markovian Perfect Equilibrium (MPE). Our findings also shed light on changes in long-term travel behavior due to the widespread deployment of emerging mobility and travel information technology. This approach serves as a foundation for promoting intelligent travel plans through adaptive traffic control policies.
|
|
14:50-15:10, Paper ThB04.5 | Add to My Program |
Urgency-Aware Routing in Single Origin-Destination Itineraries through Artificial Currencies (I) |
|
Pedroso, Leonardo | Eindhoven University of Technology |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Salazar, Mauro | Eindhoven University of Technology |
Keywords: Game theory, Traffic control
Abstract: Within mobility systems, the presence of self-interested users can lead to aggregate routing patterns that are far from the societal optimum which could be achieved by centrally controlling the users' choices. In this paper, we design a fair incentive mechanism to steer the selfish behavior of the users to align with the societally optimal aggregate routing. The proposed mechanism is based on an artificial currency that cannot be traded or bought, but only spent or received when traveling. Specifically, we consider a parallel-arc network with a single origin and destination node within a repeated game setting whereby each user chooses from one of the available arcs to reach their destination on a daily basis. In this framework, taking faster routes comes at a cost, whereas taking slower routes is incentivized by a reward. The users are thus playing against their future selves when choosing their present actions. To capture this complex behavior, we assume the users to be rational and to minimize an urgency-weighted combination of their immediate and future discomfort. To design the optimal pricing, we first derive a closed-form expression for the best individual response strategy. Second, we formulate the pricing design problem for each arc to achieve the societally optimal aggregate flows, and reformulate it so that it can be solved with gradient-free optimization methods. Our numerical simulations show that it is possible to achieve a near-optimal routing whilst significantly reducing the users' perceived discomfort when compared to a centralized optimal but urgency-unaware policy.
|
|
15:10-15:30, Paper ThB04.6 | Add to My Program |
Coordination for Connected Automated Vehicles at Merging Roadways in Mixed Traffic Environment (I) |
|
Le, Viet-Anh | University of Delaware |
Wang, Hao | University of Michigan |
Orosz, Gabor | University of Michigan |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Traffic control, Autonomous vehicles, Optimal control
Abstract: In this paper, we present an optimal control framework to address motion coordination of connected automated vehicles (CAVs) in the presence of human-driven vehicles (HDVs) in merging scenarios. Our framework combines an unconstrained trajectory solution of a low-level energy-optimal control problem with an upper-level optimization problem that yields the minimum travel time for CAVs. We predict the future trajectories of the HDVs using Newell's car-following model. To handle potential deviations of HDVs' actual behavior from the predicted one, we design a safety filter for CAVs based on control barrier functions. The effectiveness of the proposed control framework is demonstrated via simulations with heterogeneous human driving behaviors.
|
|
ThB05 Invited Session, Simpor Junior 4912 |
Add to My Program |
Recent Advances in Distributed Optimization and Its Applications |
|
|
Chair: Xu, Jinming | Zhejiang University |
Co-Chair: Pu, Shi | The Chinese University of Hong Kong, Shenzhen |
Organizer: Xu, Jinming | Zhejiang University |
Organizer: Pu, Shi | Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong, Shenzhen |
Organizer: Sun, Ying | The Pennsylvania State University |
Organizer: Wai, Hoi-To | The Chinese University of Hong Kong |
|
13:30-13:50, Paper ThB05.1 | Add to My Program |
A Linearly Convergent Robust Compressed Push-Pull Method for Decentralized Optimization (I) |
|
Liao, Yiwei | The Chinese University of Hong Kong, Shenzhen |
Li, Zhuorui | Shenzhen Research Institute of Big Data |
Pu, Shi | Shenzhen Research Institute of Big Data, the Chinese University |
Keywords: Communication networks, Optimization algorithms, Quantized systems
Abstract: In the modern paradigm of multi-agent networks, communication has become one of the main bottlenecks for decentralized optimization, where a large number of agents are involved in minimizing the average of the local cost functions. In this paper, we propose a robust compressed push-pull algorithm (RCPP) that combines gradient tracking with communication compression. In particular, RCPP is compatible with a much more general class of compression operators that allow both relative and absolute compression errors. We show that RCPP achieves linear convergence rate for smooth objective functions satisfying the Polyak-Łojasiewicz condition over general directed networks. Numerical examples verify the theoretical findings and demonstrate the efficiency, flexibility, and robustness of the proposed algorithm.
|
|
13:50-14:10, Paper ThB05.2 | Add to My Program |
Differentially-Private Distributed Optimization with Guaranteed Optimality (I) |
|
Wang, Yongqiang | Clemson University |
Nedich, Angelia | Arizona State University |
Keywords: Optimization algorithms, Control Systems Privacy, Cooperative control
Abstract: Privacy protection is gaining increased attention in distributed optimization and learning. As differential privacy is becoming a de facto standard for privacy preservation, recently results have emerged integrating differential privacy with distributed optimization. However, to ensure differential privacy (with a finite cumulative privacy budget), all existing approaches have to sacrifice provable convergence to the optimal solution. In this paper, we propose a differentially-private distributed optimization algorithm that can ensure, for the first time, both epsilon-differential privacy and optimality, even on the infinite time horizon. Numerical simulation results confirm the effectiveness of the proposed approach.
|
|
14:10-14:30, Paper ThB05.3 | Add to My Program |
A Distributed Stochastic First-Order Method for Strongly Concave-Convex Saddle Point Problems (I) |
|
Qureshi, Muhammad I. | Tufts University |
Khan, Usman A. | Tufts University |
Keywords: Optimization algorithms, Distributed control, Machine learning
Abstract: In this paper, we propose a distributed stochastic first-order method for saddle point problems over strongly connected graphs. Existing methods generally suffer from a steady-state error that arises due to the heterogeneous nature of data distribution (captured by the local versus global cost gaps) and the variance of the stochastic gradients. We propose~SGDA, a distributed stochastic gradient descent ascent method that uses network-level textit{gradient tracking} to eliminate the steady-state error component due to the local versus global cost gap. We show that~SGDA converges linearly to an error ball around the unique saddle point for sufficiently small constant step-sizes when the global cost is strongly concave-convex (a necessary condition for the existence of a unique saddle point). Moreover, we show that the size of this error ball depends on the variance of the stochastic gradients. We provide numerical experiments to illustrate the convergence properties of~SGDA for different applications and highlight the significance of gradient tracking. We also show the performance of~SGDA for training modern applications like distributed generative adversarial networks (GANs).
|
|
14:30-14:50, Paper ThB05.4 | Add to My Program |
On First-Order Meta-Reinforcement Learning with Moreau Envelopes (I) |
|
Toghani, Mohammad Taha | Rice University |
Perez-Salazar, Sebastian | Rice University |
Uribe, Cesar A. | Rice University |
Keywords: Machine learning, Optimization, Markov processes
Abstract: Meta-Reinforcement Learning (MRL) is a promising framework for training agents that can quickly adapt to new environments and tasks. In this work, we study the MRL problem under the policy gradient formulation, where we propose a novel algorithm that uses Moreau envelope surrogate regularizers to jointly learn a meta policy that is adjustable to the environment of each individual task. Our algorithm, called Moreau Envelope Meta-Reinforcement Learning (MEMRL), learns a meta-policy that can adapt to a distribution of tasks by efficiently updating the policy parameters using a combination of gradient-based optimization and Moreau Envelope regularization. Moreau Envelopes provide a smooth approximation of the policy optimization problem, which enables us to apply standard optimization techniques and converge to an appropriate stationary point. We provide a detailed analysis of the MEMRL algorithm, where we show a sublinear convergence rate to a first-order stationary point for non-convex policy gradient optimization. We finally show the effectiveness of MEMRL on a multi-task 2D-navigation problem.
|
|
14:50-15:10, Paper ThB05.5 | Add to My Program |
Distributed Nash Equilibrium Seeking in N-Cluster Games with Fully Uncoordinated Constant Step-Sizes (I) |
|
Pang, Yipeng | Nanyang Technological University |
Hu, Guoqiang | Nanyang Technological University, Singapore |
Keywords: Game theory, Optimization algorithms, Distributed control
Abstract: This paper studies a class of non-cooperative games, known as N-cluster game, which subsumes both cooperative and non-cooperative nature among multiple agents in the two problems. Moreover, we consider a partial-decision information game setup, i.e., the agents have no direct access to the decisions of other agents in all clusters, and hence need to communicate with each other. We propose a distributed NE seeking algorithm by a synthesis of consensus and gradient tracking. Unlike other existing discrete-time methods for N-cluster games where a common step-size is either publicly known by all agents or only known by agents from the same cluster, the proposed algorithm can work with fully uncoordinated constant step-sizes, which allows the agents (both within and across the clusters) to choose their own preferred step-sizes. We prove that all agents' decisions converge linearly to their corresponding NE so long as the largest step-size and the heterogeneity of the step-sizes are small. We verify the derived results through a numerical example in a Cournot competition game.
|
|
15:10-15:30, Paper ThB05.6 | Add to My Program |
A Loopless Distributed Algorithm for Personalized Bilevel Optimization (I) |
|
Niu, Youcheng | Zhejiang University |
Sun, Ying | The Pennsylvania State University |
Huang, Yan | Zhejiang University |
Xu, Jinming | Zhejiang University |
Keywords: Optimization algorithms, Cooperative control, Agents-based systems
Abstract: This paper studies a class of personalized distributed bilevel optimization problems over networks, where nodes aim at jointly optimizing the sum of outer-level objectives that depend on the solution of inner-level optimization problems. The existing algorithms for distributed bilevel optimization problems usually require extra computation loops for estimating hypergradients. To facilitate computational efficiency, we develop a loopless distributed algorithm that employs certain steps to approximate the optimal solution of inner-level optimization problems, and track Hessian-inverse-vector products in a recursive manner. We prove that for stochastic nonconvex-strongly-convex problems, the proposed algorithm achieves the state of the art O(epsilon ^{-2}) communication cost, while improving the computational cost by O(log(frac{1}{epsilon})). Numerical experiments validate our theoretical findings.
|
|
ThB06 Regular Session, Simpor Junior 4911 |
Add to My Program |
Estimation V |
|
|
Chair: Solo, Victor | University of New South Wales |
Co-Chair: Dokoupil, Jakub | CEITEC, Brno University of Technology |
|
13:30-13:50, Paper ThB06.1 | Add to My Program |
Stator Flux Linkage Estimation of Synchronous Machines Based on Integration Error Estimation for Improved Transient Performance |
|
Jang, Seunghoon | GIST |
Choi, Kyunghwan | GIST |
Keywords: Electrical machine control, Estimation, Identification for control
Abstract: The stator flux linkages of synchronous machines (SMs) are generally estimated by integrating their differential equations in the stationary frame. The technical challenge is removing the integration error arising from inaccurate integrator inputs and initial values. The conventional method uses a frequency domain approach to remove the integration error as a DC component by designing a high-pass filter. However, the frequency domain approach also affects irrelevant frequency components other than the DC component; thus, the magnitude or phase of the estimates could be distorted. Therefore, this study presents a novel stator flux linkage estimator for SMs, where the integration error is estimated in the time domain and subtracted from the integration result. This time domain approach does not affect other components than the integration error, guaranteeing accurate estimation. The key idea to estimating the integration error is using a linear state observer based on a circular motion of the stator flux linkages in the stationary frame. Simulation results obtained using a 35-kW SM drive demonstrate that the proposed estimator has significantly improved transient performance compared to existing methods.
|
|
13:50-14:10, Paper ThB06.2 | Add to My Program |
State Space Subspace Noise Modeling with Guaranteed Stability |
|
Solo, Victor | University of New South Wales |
Rong, Xinhui | University of New South Wales |
Keywords: Subspace methods, Identification, Estimation
Abstract: A fundamental problem for state space system identification is guaranteeing stability of the fitted model. Here we consider state space subspace methods for noise models. The few existing algorithms that guarantee stability have various limitations. Most do not scale well to large state space dimension; have statistical biases and some have arbitrary tuning parameters that can cause bias. Here we present a new simple, computationally cheap method that guarantees stability and needs no tuning parameters. We illustrate its strong performance in comparative simulations.
|
|
14:10-14:30, Paper ThB06.3 | Add to My Program |
Recursive Variational Inference for Total Least-Squares |
|
Friml, Dominik | Brno University of Technology |
Vaclavek, Pavel | Brno University of Technology |
Keywords: Variational methods, Estimation, Identification
Abstract: This article analyzes methods for deriving credible intervals to facilitate errors-in-variables identification by expanding on Bayesian total least squares. The credible intervals are approximated employing Laplace and variational approximations of the intractable posterior density function. Three recursive identification algorithms providing an approximation of the credible intervals for inference with the Bingham and the Gaussian priors are proposed. The introduced algorithms are evaluated on numerical experiments, and a practical example of application on battery cell total capacity estimation compared to the state-of-the-art algorithms is presented.
|
|
14:30-14:50, Paper ThB06.4 | Add to My Program |
Recursive Identification of the ARARX Model Based on the Variational Bayes Method |
|
Dokoupil, Jakub | CEITEC, Brno University of Technology |
Vaclavek, Pavel | Brno University of Technology |
Keywords: Variational methods, Identification, Estimation
Abstract: Bayesian parameter estimation of autoregressive (AR) with exogenous input (X) systems in the presence of colored model noise is addressed. The stochastic system under consideration is driven by colored noise that arises from passing an initially white noise through an AR filter. Owing to the additional AR filter, the ARARX schema provides more flexibility than the ARX one. The gained flexibility is countered by the fact that the ARARX system is no longer linear-in-parameters unless the white noise components or the AR noise filter are available. This paper analyzes the problem of estimating the unknown coefficients of the ARARX system and the model noise precision under conditions where the AR noise filter is both available and unavailable. While the former condition reduces the estimation problem to standard linear least squares, the latter one gives rise to an analytically intractable estimation problem. The intractability is resolved by the distributional approximation technique based on the variational Bayes (VB) method.
|
|
14:50-15:10, Paper ThB06.5 | Add to My Program |
A Plane-Based LiDAR Odometry Method for Man-Made Scene |
|
Yan, Zihao | Harbin Institute of Technology, Shenzhen |
Li, Peng | Harbin Institute of Technology, Shenzhen |
Wang, Rui | Harbin Institute of Technology, Shenzhen |
Chen, Boli | University College London |
Keywords: Autonomous robots, Estimation, Learning
Abstract: In this paper, a plane-based LiDAR odometry method is proposed. SLAM is an essential part of the autonomous robotic design that provides the estimated pose of a robot. Instead of using the point cloud map as in most existing works, the proposed method constructs a map consisting of a series of planes for estimating the pose in an efficient and accurate way. The plane map method reduces the number of objects processed in the map compared to point cloud map methods. Every time a LiDAR scan is received, the scan is voxelized and the planes included are extracted. The planes are matched with their counterparts in the plane map. Subsequently, the pose is optimized iteratively to get an accurate pose estimate. With the optimized pose, the plane map is updated. The effectiveness of the proposed method is verified by both public datasets and real-world experiments. The results show that the plane map based method can achieve accurate SLAM with a processing rate of more than 20 Hz in both indoor and outdoor scenarios in comparison with some recent LiDAR SLAM algorithms.
|
|
15:10-15:30, Paper ThB06.6 | Add to My Program |
Responsible and Effective Federated Learning in Financial Services: A Comprehensive Survey |
|
Shi, Yueyue | South China University of Technology |
Song, Hengjie | South China University of Technology |
Xu, Jun | Standard Chartered Bank |
Keywords: Finance, Machine learning, Control Systems Privacy
Abstract: The financial sector is increasingly leveraging Artificial Intelligence (AI) to deliver intelligent, automated, and personalized services. However, it encounters significant data privacy challenges due to the dispersion of financial data across various entities. Federated Learning (FL) offers a potential solution by facilitating AI model training at the source of data, albeit with certain challenges. Irresponsible utilization of FL can compromise stakeholder interests, and the prevalent heterogeneity in data spaces in numerous financial FL scenarios can impede FL's performance. These complications necessitate the development of a Responsible and Effective Federated Learning (RE-FL) system in finance. In this paper, we explore the interdisciplinary field of RE-FL in finance and guide readers to understand this area thoroughly. We present a taxonomy of RE-FL approaches that address the concerns of stakeholders in FL-based financial services and identify six major dimensions: accountability, controllability, fairness, privacy, security, and effectiveness. We also propose potential directions for future research. To our understanding, this is the first literature review conducted on RE-FL in the financial sector.
|
|
ThB07 Regular Session, Simpor Junior 4813 |
Add to My Program |
Game Theory IV |
|
|
Chair: Zhu, Quanyan | New York University |
Co-Chair: Reddy, Puduru Viswanadha | Indian Institute of Technology Madras |
|
13:30-13:50, Paper ThB07.1 | Add to My Program |
Optimal Intervention in Non-Binary Super-Modular Games |
|
Messina, Sebastiano | Polytechnic of Turin |
Como, Giacomo | Politecnico Di Torino |
Durand, Stephane | Politecnico Di Torino |
Fagnani, Fabio | Politecnico Di Torino |
Keywords: Game theory, Network analysis and control
Abstract: We study intervention design problems for general finite non-binary super-modular games. The considered interventions consist in constraining or incentivizing the players to play actions above designed lower bounds, with a cost for the system planner that is a separable increasing function of such bounds. We study the intervention of minimum cost for which a best response learning algorithm leads the system to its greatest Nash equilibrium. We show that, if the utility functions are unimodal, then the optimal intervention problem can be reformulated in terms of improvement paths, leading to a low complexity distributed iterative algorithm for its solution.
|
|
13:50-14:10, Paper ThB07.2 | Add to My Program |
A Semi-Decentralized Tikhonov-Based Algorithm for Optimal Generalized Nash Equilibrium Selection |
|
Benenati, Emilio | Technische Universiteit Delft |
Ananduta, Wicak | Flemish Institute for Technological Research (VITO) |
Grammatico, Sergio | Delft University of Technology |
Keywords: Game theory, Optimization algorithms, Variational methods
Abstract: To optimally select a generalized Nash equilibrium, in this paper, we consider a semi-decentralized algorithm based on a double-layer Tikhonov regularization algorithm. Technically, we extend the Tikhonov method for equilibrium selection to generalized games. Next, we couple such an algorithm with the preconditioned forward-backward splitting, which guarantees linear convergence to a solution of the inner layer problem and allows for a semi-decentralized implementation. We then establish a conceptual connection and draw a comparison between the considered algorithm and the hybrid steepest descent method, the other known distributed approach for solving the equilibrium selection problem.
|
|
14:10-14:30, Paper ThB07.3 | Add to My Program |
Equilibration of Coordinating Imitation and Best-Response Dynamics |
|
Hasheminejad, Nazanin | Brock University |
Ramazi, Pouria | Brock University |
Keywords: Game theory, Network analysis and control
Abstract: Decision-making individuals are often considered to be either emph{imitators} who copy the action of their most successful neighbors or emph{best-responders} who maximize their benefit based on the frequency of their neighbors. In the context of emph{coordination games}, where individuals earn more if they take the same action as those of their neighbors, by means of potential functions, it was shown that populations of all imitators and populations of all best-responders equilibrate in finite time when they become active to update their decisions sequentially. However, for mixed populations of the two, the equilibration was shown only for specific finite activation sequences. It is therefore, unknown, whether a potential function also exists for mixed populations or if there actually exists a counter example where an activation sequence prevents the population from reaching an equilibrium. We show that the number of consecutive individuals who have taken the same action in a emph{path} network serves as a potential function, leading to equilibration, and that this result can be extended to emph{sparse trees}. The existence of a potential function for other types of networks remains an open problem.
|
|
14:30-14:50, Paper ThB07.4 | Add to My Program |
Linear-Quadratic Mean-Field-Type Difference Games with Coupled Affine Inequality Constraints |
|
Mohapatra, Partha Sarathi | Indian Institute of Technology, Madras |
Reddy, Puduru Viswanadha | Indian Institute of Technology Madras |
Keywords: Game theory, Optimal control, Mean field games
Abstract: In this paper, we study a class of linear-quadratic mean-field-type difference games with coupled affine inequality constraints. We show that the mean-filed-type equilibrium can be characterized by the existence of a multiplier process which satisfies some implicit complementarity conditions. Further, we show that the equilibrium strategies can be computed by reformulating these conditions as a single large-scale linear complementarity problem. We illustrate our results with an energy storage problem arising in the management of microgrids.
|
|
14:50-15:10, Paper ThB07.5 | Add to My Program |
Learning Rationality in Potential Games |
|
Clarke, Stefan | Princeton University |
Dragotto, Gabriele | Princeton University |
Fernández Fisac, Jaime | Princeton University |
Stellato, Bartolomeo | Princeton University |
Keywords: Game theory, Optimization, Optimization algorithms
Abstract: We propose a stochastic first-order algorithm to learn the rationality parameters of simultaneous and non- cooperative potential games, i.e., the parameters of the agents’ optimization problems. Our technique combines (i.) an active- set step that enforces that the agents play at a Nash equilibrium and (i.) an implicit-differentiation step to update the estimates of the rationality parameters. We detail the convergence prop- erties of our algorithm and perform numerical experiments on Cournot and congestion games, showing that our algorithm effectively finds high-quality solutions (in terms of out-of- sample loss) and scales to large datasets.
|
|
15:10-15:30, Paper ThB07.6 | Add to My Program |
On the Price of Transparency: A Comparison between Overt Persuasion and Covert Signaling |
|
Li, Tao | New York University |
Zhu, Quanyan | New York University |
Keywords: Game theory, Optimization
Abstract: Transparency of information disclosure has always been considered an instrumental component of effective governance, accountability, and ethical behavior in any organization or system. However, a natural question follows: emph{what is the cost or benefit of being transparent}, as one may suspect that transparency imposes additional constraints on the information structure, decreasing the maneuverability of the information provider. This work proposes and quantitatively investigates the emph{price of transparency} (PoT) in strategic information disclosure by comparing the perfect Bayesian equilibrium payoffs under two representative information structures: overt persuasion and covert signaling models. PoT is defined as the ratio between the payoff outcomes in covert and overt interactions. As the main contribution, this work develops a two-stage-bilinear (TSB) programming approach to solve for non-degenerate perfect Bayesian equilibria of dynamic incomplete information games with finite states and actions. Using TSB, we show that it is always in the information provider's interest to choose the transparent information structure, as 0leq textrm{PoT}leq 1. The upper bound is attainable for any strictly Bayesian-posterior competitive games, of which zero-sum games are a particular case. For continuous games, the PoT, still upper-bounded by 1, can be arbitrarily close to 0, indicating the tightness of the lower bound. This tight lower bound suggests that the lack of transparency can result in significant loss for the provider.
|
|
ThB08 Regular Session, Simpor Junior 4812 |
Add to My Program |
Optimal Control V |
|
|
Chair: Nikitina, Viktoriya | University of the Bundeswehr Munich |
Co-Chair: Kerrigan, Eric C. | Imperial College London |
|
13:30-13:50, Paper ThB08.1 | Add to My Program |
Accelerating Soft-Constrained MPC for Linear Systems through Online Constraint Removal |
|
Nouwens, S.A.N. | Eindhoven University of Technology |
Paulides, Maarten | Erasmus MC Cancer Institute |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Optimal control, Predictive control for linear systems, Constrained control
Abstract: Optimization-based controllers, such as Model Predictive Control (MPC), have attracted significant research interest due to their intuitive concept, constraint handling capabilities, and natural application to multi-input multi-output systems. However, the computational complexity of solving a receding horizon problem at each time step remains a challenge for the deployment of MPC. This is particularly the case for systems constrained by many inequalities. In this paper, we present an extension to the recently introduced concept of constraint-adaptive MPC (ca-MPC), where at each time step a subset of the constraints is removed from the optimization problem, thereby accelerating the optimization procedure, while resulting in identical closed-loop behavior. The present paper extends this framework to soft-constrained MPC by detecting and removing constraints based on sub-optimal predicted input sequences. These input sequences, in turn, provide an ellipsoidal bound on the true minimizer, which can be used to remove constrains from the optimization problem, as we will show. Generating sub-optimal input sequences for soft-constrained MPC is easy due to the receding horizon principle and the inclusion of slack variables. We will translate these new ideas explicitly into an offset-free output tracking problem. We then demonstrate its effectiveness on a two-dimensional thermal output tracking problem. Here, we will show a three order of magnitude improvement in computational time and a large reduction in constraints required for the optimization problem.
|
|
13:50-14:10, Paper ThB08.2 | Add to My Program |
Complexity-Bounded Relaxed Dynamic Programming |
|
Beumer, Ruben | Eindhoven University of Technology (TU/e) |
Molengraft, René van de | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Optimal control, Optimization algorithms, LMIs
Abstract: The idea behind relaxed dynamic programming for optimal control problems is to settle with a suboptimal but simpler control policy that guarantees a cost within a fixed constant factor from the optimal cost. Such a policy results from parameterized approximate value functions and the complexity of these functions determines the complexity of the policy. Typically, the more stringent the constant factor from the optimal cost is, the larger the complexity. However, relaxed dynamic programming does not give any guarantees on the complexity, which might still be unpractical. To tackle this issue, we propose to rather find the best factor away from optimality for a given complexity bound. We consider a large class of problems where the value functions can be represented as the minimum of quadratic functions. For this class, we propose a modified relaxed dynamic programming algorithm that ensures bounded complexity while still providing tight cost guarantees. A crucial step in the algorithm is the search for the best cost factor for a given policy with desired complexity, shown to be an optimization problem subject to Linear Matrix Inequalities (LMIs). We provide a new subclass of problems within this class and illustrate the effectiveness of our policy in a numerical instance of this subclass.
|
|
14:10-14:30, Paper ThB08.3 | Add to My Program |
Numerical Comparison of Collocation vs Quadrature Penalty Methods |
|
Neuenhofen, Martin P. | Imperial College London |
Kerrigan, Eric C. | Imperial College London |
Nie, Yuanbo | University of Sheffield |
Keywords: Optimal control, Optimization algorithms, Predictive control for nonlinear systems
Abstract: Direct transcription with collocation-type methods (CTM) is a popular approach for solving dynamic optimization problems. It is known that these types of methods can fail to converge for problems that feature singular-arc solutions, high-index differential-algebraic equations and overdetermined constraints. Recently, we proposed the use of quadrature penalty methods (QPM) as an alternative numerical approach to collocation-type methods. In contrast to the concept of collocation, which requires constraint-residuals to equal zero at individual points (e.g. at collocation points), the main idea of QPM is to simply oversample this number of points and use their respective quadrature weights in a quadratic penalty term, coining the name of quadrature penalty. In this paper, we provide numerical case studies and a broad numerical comparison on a wide range of problems, highlighting the benefits of QPM over CTM not only in difficult problems, but also in solving problems competitively to CTM. These results show that QPM can be considered an attractive first go-to method when solving general dynamic optimization problems.
|
|
14:30-14:50, Paper ThB08.4 | Add to My Program |
Discrete-Time Finite-Horizon Optimization of Singularly Perturbed Nonlinear Control Systems with State-Action Constraints |
|
Pi, Jianzong | The Ohio State University |
Gupta, Abhishek | The Ohio State University |
Keywords: Optimal control, Optimization algorithms, Constrained control
Abstract: In this paper, an algorithm is introduced for the computation of an approximate optimal control policy for discrete-time finite-horizon nonlinear singularly perturbed systems. This is achieved through timescale separation and by utilizing ideas from parametric optimization and dynamic programming. We demonstrate that our proposed method produces a control policy that is both theoretically robust and nearly optimal.
|
|
14:50-15:10, Paper ThB08.5 | Add to My Program |
Multi-Agent Dynamic Scheduling with a Posteriori Path Tracking and Collision Avoidance Using Model Predictive Control |
|
Bertoncini, Jeremy | Universität Der Bundeswehr |
Nikitina, Viktoriya | University of the Bundeswehr Munich |
Gerdts, Matthias | University of Munchen |
Keywords: Optimal control, Predictive control for linear systems, Constrained control
Abstract: This research work investigates a coordinated multi-agent path planning and tracking method. The solution of a pre-processed dynamic scheduling problem performs target assignment and provides optimal starting times and paths for each agent. Afterwards, a linear model predictive controller ensures robust and fast path tracking while preventing agents from collisions. This task is formulated as a discretized quadratic programming (QP) problem and is solved using an in-house developed semi-smooth Newton method. Numerical experiments have demonstrated the efficiency of the approach.
|
|
15:10-15:30, Paper ThB08.6 | Add to My Program |
Model Predictive Control for the Scheduling of Seedings in an Adaptive Vertical Farm |
|
Bagnerini, Patrizia | University of Genoa |
Gaggero, Mauro | National Research Council of Italy |
Ghio, Marco | Space V Srl |
Keywords: Optimal control, Predictive control for linear systems, Optimization
Abstract: A model predictive control approach is presented for the scheduling of sowings in an adaptive vertical farm, i.e., an innovative vertical greenhouse in which the spacing between shelves is automatically adapted to crop growth. First, a dynamic model describing the evolution of occupancy and shelf height is developed. The model is affected by disturbances to account for possible deviations of crop growth from the nominal pattern. Then, an optimal control problem over a given timeframe is defined to determine the best time instants to perform seedings in the various shelves with the goal of maximizing production yield. The repeated solution of the optimal control problem over a shorter, moving window over time, according to the receding horizon paradigm, allows to devise robust control strategies with respect to disturbances, even in the absence of predictions about their future realizations. Preliminary simulation results are reported for different control horizons and type of disturbances to showcase the effectiveness of the proposed approach in maximizing production yield while exploiting almost all the available vertical space.
|
|
ThB09 Regular Session, Simpor Junior 4811 |
Add to My Program |
Optimization Algorithms V |
|
|
Chair: Wai, Hoi-To | The Chinese University of Hong Kong |
Co-Chair: Parisio, Alessandra | The University of Manchester |
|
13:30-13:50, Paper ThB09.1 | Add to My Program |
On the Performance of Gradient Tracking with Local Updates (I) |
|
Nguyen, Edward Duc Hien | Rice University |
Alghunaim, Sulaiman A. | Kuwait University |
Yuan, Kun | Peking University |
Uribe, Cesar A. | Rice University |
Keywords: Optimization algorithms, Large-scale systems, Optimization
Abstract: We study the decentralized optimization problem where a network of n agents seeks to minimize the average of a set of heterogeneous non-convex cost functions distributedly. State-of-the-art decentralized algorithms like Exact Diffusion and Gradient Tracking (GT) involve communicating every iteration. However, communication is expensive, resource intensive, and slow. This work analyzes a locally updated GT method (LU-GT), where agents perform local recursions before interacting with their neighbors. While local updates have been shown to reduce communication overhead in practice, their theoretical influence has not been fully characterized. We show LU-GT has the same communication complexity as the Federated Learning setting but allows decentralized (symmetric) network topologies. In addition, we prove that the number of local updates does not degrade the quality of the solution achieved by LU-GT. Numerical results reveal that local updates may lead to lower communication costs in specific regimes (e.g., well-connected graphs).
|
|
13:50-14:10, Paper ThB09.2 | Add to My Program |
Linear Speedup of Incremental Aggregated Gradient Methods on Streaming Data (I) |
|
Wang, Xiaolu | The Chinese University of Hong Kong |
Jin, Cheng | Tsinghua University |
Wai, Hoi-To | The Chinese University of Hong Kong |
Gu, Yuantao | Tsinghua University |
Keywords: Optimization algorithms
Abstract: This paper considers an incremental aggregated gradient (IAG) method for large-scale distributed optimization. The IAG method is well suited for the parameter server architecture as the latter can easily aggregate potentially staled gradients contributed by workers. Although the convergence of IAG in the case of deterministic gradient is well known, there are only a few results for the case of its stochastic variant based on streaming data. Considering strongly convex optimization, this paper shows that the streaming IAG method achieves linear speedup when the workers are updating frequently enough, even if the data sample distribution across workers are heterogeneous. We show that the expected squared distance to optimal solution decays at O((1+T)/(nt)), where n is the number of workers, t is the iteration number, and T/n is the update frequency of workers. Our analysis involves careful treatments of the conditional expectations with staled gradients and a recursive system with both delayed and noise terms, which are new to the analysis of IAG-type algorithms. Numerical results are presented to verify our findings.
|
|
14:10-14:30, Paper ThB09.3 | Add to My Program |
Distributionally Robust Optimization for Nonconvex QCQPs with Stochastic Constraints |
|
Brock, Eli | University of California Berkeley |
Zhang, Haixiang | University of California, Berkeley |
Mulvaney-Kemp, Julie | University of California, Berkeley |
Lavaei, Javad | UC Berkeley |
Sojoudi, Somayeh | UC Berkeley |
Keywords: Optimization algorithms, Power systems, Optimization
Abstract: The quadratically constrained quadratic program (QCQP) with stochastic constraints appears in a wide range of real-world problems, including but not limited to the control of power systems. The randomness in the constraints prohibits the application of classic stochastic optimization algorithms. In this work, we utilize the techniques from the distributionally robust optimization (DRO) and propose a novel optimization formulation to solve the QCQP problems under strong duality. The proposed formulation does not contain stochastic constraints. The solutions to the optimization formulation attain the optimal objective value among all solutions that satisfy the stochastic constraints with high probability under the data-generating distribution, even when only a few samples from the distribution are available. We design corresponding algorithms to solve the optimization problems under the new formulation. Numerical experiments are conducted to verify the theory and illustrate the empirical performance of the proposed algorithm. This work provides the first results on the application of DRO techniques to non-convex optimization problems with stochastic constraints and the approach can be extended to a broad class of optimization problems.
|
|
14:30-14:50, Paper ThB09.4 | Add to My Program |
Fractional Budget Allocation for Influence Maximization |
|
Umrawal, Abhishek Kumar | Purdue University; University of Illinois Urbana-Champaign |
Aggarwal, Vaneet | Purdue University |
Quinn, Christopher J. | Iowa State University |
Keywords: Optimization algorithms, Stochastic systems, Control of networks
Abstract: We consider a generalization of the widely studied discrete influence maximization problem. We consider that instead of marketers using a budget to send free products to a few influencers, they can provide discounts to partly incentivize a larger set of influencers with the same budget. We show that this problem is an instance of maximizing the multilinear extension of a monotone submodular set function subject to an L_1 constraint. We propose and analyze an efficient (1-1/e)-approximation algorithm. We run experiments on a real-world social network to show the performance of our method in contrast to methods proposed for other generalizations of influence maximization.
|
|
14:50-15:10, Paper ThB09.5 | Add to My Program |
Distributed Feedforward Optimization for Control of Multi-Energy Network with Temporal Variations (I) |
|
Xu, Yiqiao | University of Manchester |
Zhang, Zhengfa | Aalborg University |
Ding, Zhengtao | The University of Manchester |
Jiang, Shuoying | University of Manchester |
Parisio, Alessandra | The University of Manchester |
Keywords: Optimization algorithms, Distributed control, Energy systems
Abstract: Multi-Energy Network (MEN) is a promising approach to improve the overall efficiency of energy utilization. Yet, balancing its electrical and thermal power in real-time is challenging due to variable demands. In this paper, we formulate a distributed Time Varying Optimization Problem (TVOP) and solve it in continuous-time to track the unknown time-varying optimal trajectories. First, we apply the principles of output regulation theory to reverse engineer the feedforward laws in the presence of projection. These laws are responsible for proactively canceling the effects of temporal demand variations. Then, a projection-based distributed optimization algorithm, alongside a distributed auxiliary protocol based on weighted-sum consensus, result in a novel scheme we term distributed feedforward optimization. One of the key features of our scheme is its data-driven nature, where temporal variations are captured from Ultra-Short-Term Forecasting (USTF) profiles using an exosystem. Under mild assumptions, the proposed scheme provides a guarantee for asymptotic convergence. Simulation results demonstrate the effectiveness of our scheme under an non-ideal case.
|
|
15:10-15:30, Paper ThB09.6 | Add to My Program |
Variance Reduction for Faster Decentralized General Convex Optimization (I) |
|
Xin, Ran | ByteDance |
Das, Subhro | IBM Research |
Kar, Soummya | Carnegie Mellon University |
Khan, Usman A. | Tufts University |
Keywords: Optimization algorithms, Machine learning, Cooperative control
Abstract: This paper studies decentralized stochastic empirical risk minimization over a network of nodes, where each node has access to a finite collection of risk functions. While this formulation has been well-studied when each local function is strongly convex or nonconvex, it is still not clear if acceleration (in the stochastic settings) can be achieved for general convex functions. In this paper, we show that GT-SAGA, an algorithm that combines gradient tracking and incremental variance reduction, converges to a global minimizer at a provably faster rate than the existing decentralized methods for this general convex formulation. In particular, GT-SAGA achieves a topology-independent iteration and gradient complexity when the local sample size is sufficiently large. Our proof techniques hinge on a simple linear coupling of convex descent inequality and variance bounds developed for nonconvex optimization, which can be of independent interest. To the best of our knowledge, these are the first such results in decentralized general convex empirical risk minimization.
|
|
ThB10 Regular Session, Roselle Junior 4713 |
Add to My Program |
Machine Learning V |
|
|
Chair: Gomes, Diogo | King Abdullah University of Science and Technology |
Co-Chair: Bai, Ting | KTH Royal Institute of Technology |
|
13:30-13:50, Paper ThB10.1 | Add to My Program |
Machine Learning Architectures for Price Formation Models with Common Noise |
|
Gutierrez, Julian | King Abdullah University of Science and Technology |
Gomes, Diogo | King Abdullah University of Science and Technology |
Lauriere, Mathieu | NYU Shanghai |
Keywords: Mean field games, Machine learning, Stochastic optimal control
Abstract: We propose a machine-learning method to solve a mean-field game price formation model with common noise. This involves determining the price of a commodity traded among rational agents subject to a market clearing condition imposed by random supply, which presents additional challenges compared to the deterministic counterpart. Our approach uses a dual recurrent neural network encoding noise dependence and a particle approximation of the mean-field model with a single loss function optimized by adversarial training. We provide a posteriori estimates for convergence and illustrate our method through numerical experiments.
|
|
13:50-14:10, Paper ThB10.2 | Add to My Program |
Reinforcement Learning Based Demand Charge Minimization Using Energy Storage |
|
Weber, Lucas | Inria |
Busic, Ana | Inria |
Zhu, Jiamin | IFPEN |
Keywords: Machine learning, Smart grid, Optimization algorithms
Abstract: Utilities have introduced demand charges to encourage customers to reduce their demand peaks, since a high peak may cause very high costs for both the utility and the consumer. We herein study the bill minimization problem for customers equipped with an energy storage device and a self-owned renewable energy production. A model-free reinforcement learning algorithm is carefully designed to reduce both the energy charge and the demand charge of the consumer. The proposed algorithm does not need forecasting models for the energy demand and the renewable energy production. The resulting controller can be used online, and progressively improved with newly gathered data. The algorithm is validated on real data from an office building of IFPEN Solaize site. Numerical results show that our algorithm can reduce electricity bills with both daily and monthly demand charges.
|
|
14:10-14:30, Paper ThB10.3 | Add to My Program |
Reinforcement Learning for Image-Based Visual Servo Control |
|
Dani, Ashwin | University of Connecticut |
Bhasin, Shubhendu | Indian Institute of Technology Delhi |
Keywords: Learning, Vision-based control, Adaptive control
Abstract: In this paper, a continuous-time reinforcement learning (RL)-based controller is developed for image-based visual servoing (IBVS). The IBVS control dynamics is of the form where the drift term is absent and there is an uncertainty in the Jacobian matrix that is multiplied with the input. This poses a challenge for developing a continuous-time RL controller. The paper presents an actor-critic or synchronous policy iteration (PI)-based RL controller along with a parameter update law for the unknown parameter in the image Jacobian and proves closed-loop stability with the proposed controller. An infinite-horizon value function minimization objective is achieved by regulating the current image features to the desired with near-optimal control efforts. The proposed controller is tested using a simulation use case and the results validate the proposed theory.
|
|
14:30-14:50, Paper ThB10.4 | Add to My Program |
Observability-Based Energy Efficient Path Planning with Background Flow Via Deep Reinforcement Learning |
|
Mei, Jiazhong | University of Washington |
Kutz, J. Nathan | University of Washington |
Brunton, Steven L. | University of Washington |
Keywords: Optimal control, Machine learning, Nonlinear systems
Abstract: In many sensor estimation and monitoring tasks, the mobile sensor travels through the state-space under the influence of a complex background flow environment. System observability is commonly used to assess the performance of the sensor-based estimation, although for a mobile sensor there are other important metrics. We consider the path planning problem under the environmental background flow and focus on a cyclic trajectory that (i) maximizes the log determinant of the observability matrix, (ii) minimizes total energy consumption, and (iii) returns close to the initial location at the end of the period. We formulate a reinforcement learning (RL) scheme and define a reward function that justifies multiple objectives. We investigate the performance of a policy-based proximal policy optimization (PPO) algorithm and address the issue of partially observed states with an additional recurrent module. We present our results on two complex unsteady fluid dynamical systems.
|
|
14:50-15:10, Paper ThB10.5 | Add to My Program |
Safe Reinforcement Learning with Probabilistic Guarantees Satisfying Temporal Logic Specifications in Continuous Action Spaces |
|
Krasowski, Hanna | Technical University of Munich |
Akella, Prithvi | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Althoff, Matthias | Technische Universität München |
Keywords: Autonomous robots, Machine learning, Formal Verification/Synthesis
Abstract: Vanilla Reinforcement Learning (RL) can efficiently solve complex tasks but does not provide any guarantees on system behavior. To bridge this gap, we propose a three-step safe RL procedure for continuous action spaces that provides probabilistic guarantees with respect to temporal logic specifications. First, our approach probabilistically verifies a candidate controller with respect to a temporal logic specification while randomizing the control inputs to the system within a bounded set. Second, we improve the performance of this probabilistically verified controller by adding an RL agent that optimizes the verified controller for performance in the same bounded set around the control input. Third, we verify probabilistic safety guarantees with respect to temporal logic specifications for the learned agent. Our approach is efficiently implementable for continuous action and state spaces. The separation of safety verification and performance improvement into two distinct steps realizes both explicit probabilistic safety guarantees and a straightforward RL setup that focuses on performance. We evaluate our approach on an evasion task where a robot has to reach a goal while evading a dynamic obstacle with a specific maneuver. Our results show that our safe RL approach leads to efficient learning while maintaining its probabilistic safety specification.
|
|
15:10-15:30, Paper ThB10.6 | Add to My Program |
Federated Learning in Wireless Networks Via Over-The-Air Computations |
|
Oksuz, Halil Yigit | TU Berlin |
Molinari, Fabio | TU Berlin |
Sprekeler, Henning | TU Berlin |
Raisch, Joerg | Technical University Berlin |
Keywords: Optimization, Communication networks, Machine learning
Abstract: In a multi-agent system, agents can cooperatively learn a model from data by exchanging their estimated model parameters, without the need to exchange the locally available data used by the agents. This strategy, often called federated learning, is mainly employed for two reasons: (i) improving resource-efficiency by avoiding to share potentially large data sets and (ii) guaranteeing privacy of local agents' data. Efficiency can be further increased by adopting a beyond-5G communication strategy that goes under the name of Over-the-Air Computation. This strategy exploits the interference property of the wireless channel. Standard communication schemes prevent interference by enabling transmissions of signals from different agents at distinct time or frequency slots, which is not required with Over-the-Air Computation, thus saving resources. In this case, the received signal is a weighted sum of transmitted signals, with unknown weights (fading channel coefficients). State of the art papers in the field aim at reconstructing those unknown coefficients. In contrast, the approach presented here does not require reconstructing channel coefficients by complex encoding-decoding schemes. This improves both efficiency and privacy.
|
|
ThB11 Regular Session, Roselle Junior 4712 |
Add to My Program |
Autonomous Systems I |
|
|
Chair: Langbort, Cedric | University of Illinois at Urbana Champaign |
Co-Chair: Zhu, Shanying | Shanghai Jiao Tong University |
|
13:30-13:50, Paper ThB11.1 | Add to My Program |
Pointwise-In-Time Explanation for Linear Temporal Logic Rules |
|
Brindise, Noel | University of Illinois at Urbana-Champaign |
Langbort, Cedric | University of Illinois at Urbana Champaign |
Keywords: Autonomous systems, Automata, Human-in-the-loop control
Abstract: The new field of Explainable Planning (XAIP) has produced a variety of approaches to explain and describe the behavior of autonomous agents to human observers. Many summarize agent behavior in terms of the constraints, or ''rules,'' which the agent adheres to during its trajectories. In this work, we narrow the focus from summary to specific moments in individual trajectories, offering a ''pointwise-in-time'' view. Our novel framework, which we define on Linear Temporal Logic (LTL) rules, assigns an intuitive status to any rule in order to describe the trajectory progress at individual time steps; here, a rule is classified as active, satisfied, inactive, or violated. Given a trajectory, a user may query for status of specific LTL rules at individual trajectory time steps. In this paper, we present this novel framework, named Rule Status Assessment (RSA), and provide examples of its implementation. We find that pointwise-in-time status assessment is useful as a post-hoc diagnostic, enabling a user to systematically track the agent's behavior with respect to a set of rules.
|
|
13:50-14:10, Paper ThB11.2 | Add to My Program |
Safe Control Design through Risk-Tunable Control Barrier Functions |
|
Sharma, Vipul Kumar | Purdue University |
Sivaranjani, S | Purdue University |
Keywords: Autonomous systems, Constrained control, Robust control
Abstract: We consider the problem of designing controllers to guarantee safety for a class of nonlinear systems under uncertainties in the system dynamics and/or the environment. We define a class of uncertain control barrier functions (CBFs), and formulate the safe control design problem as a chance-constrained optimization problem with uncertain CBF constraints. We leverage the scenario approach for chance-constrained optimization to develop a risk-tunable control design that provably guarantees the satisfaction of uncertain CBF safety constraints up to a user-defined probabilistic risk bound, and provides a trade-off between the sample complexity and risk tolerance. We demonstrate the performance of this approach through simulations on a quadcopter navigation problem with obstacle avoidance constraints.
|
|
14:10-14:30, Paper ThB11.3 | Add to My Program |
Cooperative Receding Horizon 3D Coverage Control with a Team of Networked Aerial Agents |
|
Papaioannou, Savvas | KIOS CoE |
Kolios, Panayiotis | University of Cyprus |
Theocharides, Theocharis | University of Cyprus |
Panayiotou, Christos | University of Cyprus |
Polycarpou, Marios M. | University of Cyprus |
Keywords: Autonomous systems, Cooperative control, Optimization
Abstract: This work proposes a receding horizon coverage control approach which allows multiple autonomous aerial agents to work cooperatively in order cover the total surface area of a 3D object of interest. The cooperative coverage problem which is posed in this work as an optimal control problem, jointly optimizes the agents' kinematic and camera control inputs, while considering coupling constraints amongst the team of agents which aim at minimizing the duplication of work. To generate look-ahead coverage trajectories over a finite planning horizon, the proposed approach integrates visibility constraints into the proposed coverage controller in order to determine the visible part of the object with respect to the agents' future states. In particular, we show how non-linear and non-convex visibility determination constraints can be transformed into logical constraints which can easily be embedded into a mixed integer optimization program.
|
|
14:30-14:50, Paper ThB11.4 | Add to My Program |
Distributed Optimal Formation Control of Second-Order Multiagent Systems with Obstacle Avoidance |
|
Huang, Fengping | Shanghai Jiao Tong University |
Duan, Mengmeng | Shanghai Jiao Tong University |
Su, Haifan | Shanghai Jiao Tong University |
Zhu, Shanying | Shanghai Jiao Tong University |
Keywords: Autonomous systems, Cooperative control, Optimization algorithms
Abstract: This paper formulates a class of generic optimal formation control problems for second-order multiagent systems, where agents are steered to achieve the optimal formation determined by a convex optimization problem with generic formation constraints and admissible range constraints. These constraints determine the geometric pattern and limit the range of the optimal formation, respectively. A generic optimal algorithm based on the primal-dual dynamics is proposed for various formation requirements. Based on Lyapunov stability and optimization theories, the states of the second-order multiagent system are shown to converge to the optimal solutions. Moreover, an obstacle avoidance mechanism based on the control barrier function is introduced to make our algorithm more practical. Finally, numerical simulations illustrate the effectiveness of the proposed algorithm.
|
|
14:50-15:10, Paper ThB11.5 | Add to My Program |
On a Probabilistic Approach for Inverse Data-Driven Optimal Control |
|
Garrabé, Émiland | University of Salerno |
Jesawada, Hozefa Zuzer | University of Sannio |
Del Vecchio, Carmen | Universitŕ Del Sannio |
Russo, Giovanni | University of Salerno |
Keywords: Autonomous systems, Data driven control, Optimal control
Abstract: We consider the problem of estimating the possibly non-convex cost of an agent by observing its interactions with a nonlinear, non-stationary and stochastic environment. For this inverse problem, we give a result that allows to estimate the cost by solving a convex optimization problem. To obtain this result we also tackle a forward problem. This leads to the formulation of a finite-horizon optimal control problem for which we show convexity and find the optimal solution. Our approach leverages certain probabilistic descriptions that can be obtained both from data and/or from first-principles. The effectiveness of our results, which are turned in an algorithm, is illustrated via simulations on the problem of estimating the cost of an agent that is stabilizing the unstable equilibrium of a pendulum.
|
|
15:10-15:30, Paper ThB11.6 | Add to My Program |
Distributed Algorithms for Edge-Agreements: More Than Consensus |
|
Rai, Ayush | Purdue University |
Mou, Shaoshuai | Purdue University |
Keywords: Autonomous systems, Distributed control
Abstract: In this paper, we propose distributed algorithms for multi-agent systems to achieve edge-agreements. Different from consensus, where all agents’ states converge to be the same value, the edge agreement is characterized by linear constraints defined for edges, i.e. one linear constraint involving two neighboring agents’ states for each edge. Such agreement allows more general coordination among agents, with consensus on a special case. Given the underlying graph of the multiagent system is undirected (not necessarily to be connected), we propose two discrete-time distributed algorithms that enable all agents’ states to converge to constants satisfying edge agreements. Besides theoretical proofs, effectiveness of the proposed algorithms is also shown by simulations on a four-agent multi-agent system.
|
|
ThB12 Regular Session, Roselle Junior 4711 |
Add to My Program |
Cooperative Control V |
|
|
Chair: Liu, Shuai | Shandong University |
Co-Chair: Li, Dongyu | National University of Singapore |
|
13:30-13:50, Paper ThB12.1 | Add to My Program |
Consensus Control Based on Privacy-Preserving Two-Party Relationship Test Protocol |
|
Wang, Hanzhou | Beihang University |
Li, Dongyu | BEIHANG UNIVERSITY |
Guan, Zhenyu | Beihang University |
Liu, Yizhong | Beihang University |
Liu, Jianwei | Beihang University |
Keywords: Agents-based systems, Constrained control, Cooperative control
Abstract: Preservation of privacy is a challenging and significant constraint in multi-agent systems. This paper aims to introduce a framework that enables the states of a multi-agent system to reach a consensus while preserving the confidentiality of each agent's initial states from others. First, a protocol for a privacy-preserving two-party relationship test is proposed. Subsequently, the protocol is employed to devise the average consensus controller for the first-order system, and the rendezvous controller for the second-order system. In contrast to prior research that relies on stochastic coupling weights, our approach circumvents the random chattering problem of the control input, resulting in improved convergence performance. Finally, numerical verification is conducted to demonstrate the effectiveness of the proposed controllers in both first- and second-order systems.
|
|
13:50-14:10, Paper ThB12.2 | Add to My Program |
Bearing-Based Formation Control Simultaneously Involving Several Heterogeneous Multi-Agent Systems with Nonlinear Uncertainties |
|
Wang, Yujie | Shandong University |
Liu, Shuai | Shandong University |
Keywords: Agents-based systems, Cooperative control, Adaptive control
Abstract: For a large-scale multi-agent system consisting of agents that have different types of dynamics, employing bearing rigidity theory to handle formation problems is unrealistic since the bearing-based rigid graph is extremely complicated and heterogeneous agents are hard to analyze as a whole. Therefore, we inventively propose to separate the large-scale system into smaller subsystems, and each subsystem is generated by agents which share the same dynamics. In such sense, formation control turns to focus on several systems with milder conditions rather than a system with complex analysis. The control objectives are to drive all systems to acquire the desired formation shapes, and make all systems simultaneously maneuver along with the desired velocities and maintain the formation shapes. To reduce communication cost, the leader-follower strategy is applied. To make formation control suitable for general environments, nonlinear uncertainties are considered, and the desired maneuvering velocities are time-varying. Adaptive nonsmooth distributed controllers are appropriately designed for all agents in bearing-based formation control.
|
|
14:10-14:30, Paper ThB12.3 | Add to My Program |
Distributed Prescribed-Time and Adaptive Synchronization of Complex Dynamical Networks under Directed Topologies |
|
Feng, Zhi | Beihang University |
Dong, Xiwang | Beihang University |
Lu, Jinhu | Beihang University |
Keywords: Agents-based systems, Cooperative control, Adaptive control
Abstract: This paper addresses an adaptively distributed prescribed-time synchronization problem of complex dynamical networks (CDNs) via distributed pinning control strategies using neighboring information over a directed graph. The novel distributed prescribed-time synchronization pinning control algorithms with static and dynamic coupling laws are developed to achieve global synchronization in a specified time, where each node can adjust its strategy on its procurable synchronization error. Based on the time transformation method and Lyapunov analysis theory, it is proved that global synchronization can be guaranteed in a pre-defined time and moreover, this synchronization can be preserved after the time, and further the control inputs are kept uniformly bounded. Lastly, the numerical simulation results are further presented to illustrate the effectiveness of the developed synchronization control methods.
|
|
14:30-14:50, Paper ThB12.4 | Add to My Program |
Bipartite Containment Control of Nonuniform Delayed Fractional-Order Multi-Agent Systems Over Signed Networks |
|
Li, Weihao | School of Aeronautics and Astronautics |
Qin, Kaiyu | University of Electronic Science and Technology of China |
Shao, Jinliang | University of Electronic Science and Technology of China, Chengd |
Shi, Lei | Henan University |
Shi, Mengji | University of Electronic Science and Technology of China |
Zheng, Wei Xing | Western Sydney University |
Keywords: Agents-based systems, Cooperative control, Network analysis and control
Abstract: In this study, the bipartite containment control problem of fractional-order multi-agent systems with nonuniform time delays is addressed. An in-depth analysis of the system stability and bipartite containment control performance from a delay margin perspective is provided. Theoretically, the corresponding delay margin (maximum allowable time delay) over undirected and directed signed networks is obtained in the presence of nonuniform time delays, respectively. In addition, numerical relationships between the delay margin and the control coefficients, fractional order, and topology parameters are established, thus enabling easy and direct calculation of the maximum allowable time delay and facilitating distributed controller design and controller parameter tuning. Finally, some simulation examples are given to verify the effectiveness of the proposed bipartite containment controller and the obtained delay margin.
|
|
14:50-15:10, Paper ThB12.5 | Add to My Program |
Impact of Relational Networks in Multi-Agent Learning: A Value-Based Factorization View |
|
Findik, Yasin | University of Massachusetts Lowell |
Robinette, Paul | UMass Lowell |
Jerath, Kshitij | University of Massachusetts Lowell |
Ahmadzadeh, S. Reza | University of Massachusetts Lowell |
Keywords: Agents-based systems, Cooperative control
Abstract: Effective coordination and cooperation among agents are crucial for accomplishing individual or shared objectives in multi-agent systems. In many real-world multi-agent systems, agents possess varying abilities and constraints, making it necessary to prioritize agents based on their specific properties to ensure successful coordination and cooperation within the team. However, most existing cooperative multi-agent algorithms do not take into account these individual differences, and lack an effective mechanism to guide coordination strategies. We propose a novel multi-agent learning approach that incorporates relationship awareness into value-based factorization methods. Given a relational network, our approach utilizes inter-agents relationships to discover new team behaviors by prioritizing certain agents over other, accounting for differences between them in cooperative tasks. We evaluated the effectiveness of our proposed approach by conducting fifteen experiments in two different environments. The results demonstrate that our proposed algorithm can influence and shape team behavior, guide cooperation strategies, and expedite agent learning. Therefore, our approach shows promise for use in multi-agent systems, especially when agents have diverse properties.
|
|
15:10-15:30, Paper ThB12.6 | Add to My Program |
Designing Cluster Consensus on Higher-Order Interaction Networks |
|
Wei, Haoyu | Shanghai Jiao Tong University |
Pan, Lulu | University of Washington |
Shao, Haibin | Shanghai Jiao Tong University |
Li, Dewei | Shanghai Jiao Tong University |
Yu, Wenbin | Shanghai Jiao Tong University |
Xue, Shibei | Shanghai Jiao Tong University |
Keywords: Agents-based systems, Cooperative control
Abstract: This paper examines the cluster consensus design problem on higher-order interaction networks. Specifically, the higher-order interaction mechanism is captured by matrix-weighted networks that allow the interdependency across the dimensions of the agents’ states, and the matrix-valued weight matrices Aij ∈ Rd×d associated with specific edges are further assumed to share the same nullspace for design purposes. Under mild assumptions on network connectivity, we first examine the case that the nullspace of positive semi-definite edges is spanned by a nonzero vector ξ ∈ Rd and show that the predictable cluster consensus can be achieved, which is eventually located in the 1−dimensional linear space determined by span{ξ} and the average of agents’ initial states. Moreover, the transient state of agents in each cluster can also be explicitly characterized. Namely, the derivative of the average state of agents in each cluster is perpendicular to span{ξ}. To generalize the above results, we proceed to examine the case that the nullspace of positive semi-definite edges is spanned by more than one linearly independent d−dimensional vector, in which case, analogous results can be obtained, and the explicit geometric interpretation is also provided.
|
|
ThB13 Regular Session, Roselle Junior 4613 |
Add to My Program |
Networked Control Systems II |
|
|
Chair: Kishida, Masako | National Institute of Informatics |
Co-Chair: Batista, Pedro | Instituto Superior Técnico / University of Lisbon |
|
13:30-13:50, Paper ThB13.1 | Add to My Program |
Consensus on Lie Groups for the Riemannian Center of Mass |
|
Kraisler, Spencer | University of Washington |
Talebi, Shahriar | University of Washington |
Mesbahi, Mehran | University of Washington |
Keywords: Networked control systems, Cooperative control, Optimization algorithms
Abstract: In this paper, we develop a consensus algorithm for distributed computation of the Riemannian Center of Mass (RCM) on Lie Groups. The algorithm is built upon a distributed optimization reformulation that allows developing an intrinsic, distributed (without relying on a consensus subroutine), and a computationally efficient protocol for the RCM computation. The novel idea for developing this fast distributed algorithm is to utilize a Riemannian version of distributed gradient flow combined with a gradient tracking technique. We first guarantee that, under certain conditions, the limit point of our algorithm is the RCM point of interest. We then provide a proof of global convergence in the Euclidean setting, that can be viewed as a "geometric" dynamic consensus that converges to the average from arbitrary initial points. Finally, we proceed to showcase the superior convergence properties of the proposed approach as compared with other classes of consensus optimization-based algorithms for the RCM computation.
|
|
13:50-14:10, Paper ThB13.2 | Add to My Program |
Greedy Synthesis of Event and Self-Triggered Controls with Control Lyapunov-Barrier Function |
|
Kishida, Masako | National Institute of Informatics |
Keywords: Networked control systems, Cyber-Physical Security, Stability of nonlinear systems
Abstract: This paper addresses the co-design problem of control inputs and execution decisions for event- and self-triggered controls subject to constraints given by the control Lyapunov function and control barrier function. The proposed approach computes the control input in a way that allows for longer inter-execution intervals, which distinguishes it from many existing event- and self-triggered controllers or control Lyapunov-barrier function controllers. The proposed approach guarantees lower bounds on the minimum inter-execution times. The effectiveness of the proposed approach is demonstrated and compared with existing approaches using a numerical example.
|
|
14:10-14:30, Paper ThB13.3 | Add to My Program |
First and Second-Order Consensus with Constant Uniform Delays |
|
Trindade, Pedro | Institute for Systems and Robotics, Instituto Superior Técnico, |
Cunha, Rita | Instituto Superior Técnico, Universidade De Lisboa |
Batista, Pedro | Instituto Superior Técnico / University of Lisbon |
Keywords: Networked control systems, Decentralized control, Stability of linear systems
Abstract: This paper analyzes first- and second-order consensus protocols subject to a constant uniform delay, when these are applied to single and double integrator agents interacting over a directed network. First, the consensus protocols are analyzed using frequency domain tools and necessary and sufficient bounds on the delay such that the agents achieve consensus are derived. Then, assuming that the delay is known, bounds on the coupling gains such that the agents achieve consensus for a given delay are sought. For first-order consensus, it turns out that it suffices to invert the bound obtained for the delay, but for second-order consensus, that is no longer possible. Instead, the Padé approximation of the delay is used to derive sufficient bounds on the coupling gains for second-order consensus.
|
|
14:30-14:50, Paper ThB13.4 | Add to My Program |
A Distributed Protocol for Finite-Time Supremum or Infimum Dynamic Consensus: The Directed Graph Case |
|
Furchě, Antonio | Roma Tre University |
Lippi, Martina | Roma Tre University |
Marino, Alessandro | Universitŕ degli Studi di Cassino e del Lazio Meridionale |
Gasparri, Andrea | Roma Tre University |
|
14:50-15:10, Paper ThB13.5 | Add to My Program |
Natural Policy Gradient Preserves Spatial Decay Properties for Control of Networked Dynamical Systems |
|
Xu, Eric | Carnegie Mellon University |
Qu, Guannan | Carnegie Mellon University |
Keywords: Networked control systems, Distributed control, Learning
Abstract: We consider the distributed control of networked linear time-invariant systems. Previous work has established the spatial decay property of the centralized controller, which allows truncating the centralized controller to obtain a κhop distributed controller with small performance loss. This paper makes a step further by showing a policy optimization approach, Natural Policy Gradient (NPG), preserves the spatial decay property of controllers. This enables “truncating” Natural Policy Gradient to directly learn a κ-hop distributed controller.
|
|
15:10-15:30, Paper ThB13.6 | Add to My Program |
Connectivity-Preserving Formation Tracking for Multiple Double Integrators by a Self-Tuning Adaptive Distributed Observer |
|
Pan, Zini | The Chinese University of Hong Kong |
Chen, Ben M. | Chinese University of Hong Kong |
Keywords: Networked control systems, Distributed control
Abstract: In this letter, we study the distributed formation tracking problem for multiple double-integrator systems with connectivity preservation over a state-dependent communication network. In particular, we employ an adaptive distributed observer for the leader system that can estimate both the state and the system matrix of the leader. As a result, unlike the existing results, we do not require all vehicles to know the system matrix of the leader. Furthermore, the adaptive distributed observer incorporates a self-tuning dynamic observer gain, which eliminates the need of computing the observer gain in advance. The effectiveness of our approach is illustrated by an example.
|
|
ThB14 Regular Session, Roselle Junior 4612 |
Add to My Program |
Identification IV |
|
|
Chair: Sznaier, Mario | Northeastern University |
Co-Chair: Chang, Chin-Yao | National Renewable Energy Laboratory |
|
13:30-13:50, Paper ThB14.1 | Add to My Program |
A Privacy Preserving Distributed Model Identification Algorithm for Power Distribution Systems |
|
Chang, Chin-Yao | National Renewable Energy Laboratory |
Keywords: Identification, Distributed control, Data driven control
Abstract: Distributed control/optimization is a promising approach for network systems due to its advantages over centralized schemes, such as robustness, cost-effectiveness, and improved privacy. However, distributed methods can have drawbacks, such as slower convergence rates due to limited knowledge of the overall network model. Additionally, ensuring privacy in the communication of sensitive information can pose implementation challenges. To address this issue, we propose a distributed model identification algorithm that enables each agent to identify the sub-model that characterizes the relationship between its local control and the overall system outputs. The proposed algorithm maintains the privacy of local agents by only communicating through dummy variables. We demonstrate the efficacy of our algorithm in the context of power distribution systems by applying it to the voltage regulation of a modified IEEE distribution system. The proposed algorithm is well-suited to the needs of power distribution controls and offers an effective solution to the challenges of distributed model identification in network systems.
|
|
13:50-14:10, Paper ThB14.2 | Add to My Program |
A Dual System-Level Parameterization for Identification from Closed-Loop Data |
|
Srivastava, Amber | Indian Institute of Technology Delhi |
Yin, Mingzhou | ETH Zurich |
Iannelli, Andrea | University of Stuttgart |
Smith, Roy S. | ETH Zurich |
Keywords: Closed-loop identification, Identification, Estimation
Abstract: This work presents a dual system-level parameterization (D-SLP) method for closed-loop identification of linear time-invariant systems. The recent system-level synthesis framework parameterizes all stabilizing controllers via linear constraints on closed-loop response functions, known as system-level parameters. It was demonstrated that several structural, locality, and communication constraints on the controller can be posed as convex constraints on these system-level parameters. In the current work, the identification problem is treated as a dual of the system-level synthesis problem. The plant model is identified from the dual system-level parameters associated to the plant. In comparison to existing closed-loop identification approaches (such as the dual-Youla parameterization), the D-SLP framework neither requires the knowledge of a nominal plant that is stabilized by the known controller, nor depends upon the choice of factorization of the nominal plant and the stabilizing controller. Numerical simulations demonstrate the efficacy of the proposed D-SLP method in terms of identification errors, compared to existing closed-loop identification techniques.
|
|
14:10-14:30, Paper ThB14.3 | Add to My Program |
Efficient MIMO Iterative Feedback Tuning Via Randomization |
|
Aarnoudse, Leontine | TU Eindhoven |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Identification for control, Closed-loop identification, Identification
Abstract: Iterative feedback tuning (IFT) enables the tuning of feedback controllers based on measured data without the need for a parametric model. The aim of this paper is to develop an efficient method for MIMO IFT that reduces the required number of experiments. Using a randomization technique, an unbiased gradient estimate is obtained from a single dedicated experiment, regardless of the size of the MIMO system. This gradient estimate is employed in a stochastic gradient descent algorithm. Simulation examples illustrate that the approach reduces the number of experiments required to converge.
|
|
14:30-14:50, Paper ThB14.4 | Add to My Program |
Certified Control Oriented Learning: A Robust Predictor Based Approach |
|
Singh, Rajiv | The MathWorks |
Sznaier, Mario | Northeastern University |
Keywords: Identification for control, Identification, Robust control
Abstract: We present an efficient and scalable solution to the problem of learning the behavior of dynamical systems for the purpose of robust control design. The approach is centered around the derivation of stable predictors of potentially unstable systems and using them to identify plant models that can be ranked by their complexity (order) vs. empirical nu-gap value.
|
|
14:50-15:10, Paper ThB14.5 | Add to My Program |
An Efficient Method for the Joint Estimation of System Parameters and Noise Covariances for Linear Time-Variant Systems |
|
Simpson, Léo | Tool-Temp AG |
Diehl, Moritz | University of Freiburg |
Asprion, Jonas | Tool-Temp AG |
Ghezzi, Andrea | University of Freiburg |
Keywords: Identification for control, Linear systems, Optimization
Abstract: We present an optimization-based method for the joint estimation of system parameters and noise covariances of linear time-variant systems. Given measured data, this method maximizes the likelihood of the parameters. We solve the optimization problem of interest via a novel structure-exploiting solver. We present the advantages of the proposed approach over commonly used methods in the framework of Moving Horizon Estimation. Finally, we show the performance of the method through numerical simulations on a realistic example of a thermal system. In this example, the method can successfully estimate the model parameters in a short computational time.
|
|
15:10-15:30, Paper ThB14.6 | Add to My Program |
Data-Driven Feedforward Control Design for Nonlinear Systems: A Control-Oriented System Identification Approach |
|
Bolderman, Max | Eindhoven University of Technology |
Lazar, Mircea | Eindhoven University of Technology |
Butler, Hans | ASML |
Keywords: Identification for control, Neural networks, Iterative learning control
Abstract: Feedforward controllers typically rely on accurately identified inverse models of the system dynamics to achieve high reference tracking performance. However, the impact of the (inverse) model identification error on the resulting tracking error is only analyzed a posteriori in experiments. Therefore, in this work, we develop an approach to feedforward control design that aims at minimizing the tracking error a priori. To achieve this, we present a model of the system in a lifted space of trajectories, based on which we derive an upperbound on the reference tracking performance. Minimization of this bound yields a feedforward control-oriented system identification cost function, and a finite-horizon optimization to compute the feedforward control signal. The nonlinear feedforward control design method is validated using physics-guided neural networks on a nonlinear, nonminimum phase mechatronic example, where it outperforms linear ILC.
|
|
ThB15 Regular Session, Roselle Junior 4611 |
Add to My Program |
Robust Control I |
|
|
Chair: Tan, Ying | The University of Melbourne |
Co-Chair: Turner, Matthew C. | University of Southampton |
|
13:30-13:50, Paper ThB15.1 | Add to My Program |
Data-Driven Robust Backward Reachable Sets for Set-Theoretic Model Predictive Control |
|
Attar, Mehran | Concordia University |
Lucia, Walter | Concordia University |
Keywords: Robust control, Constrained control, Predictive control for linear systems
Abstract: In this paper, we propose a novel approach for computing robust backward reachable sets from noisy data for unknown constrained linear systems subject to bounded disturbances. In particular, we develop an algorithm for obtaining zonotopic inner approximations that can be used for control purposes. It is shown that such sets, if built on an extended space including states and inputs, can be used to embed the system's one-step evolution in the computed extended regions. Such a result is then exploited to build a set-theoretic model predictive controller that, offline, builds a recursive family of robust data-driven reachable sets and, online, computes recursively admissible control actions without explicitly resorting to either a model of the system or the available data. The validity of the proposed data-driven solution is verified by means of a numerical simulation and its performance is contrasted with the model-based counterpart.
|
|
13:50-14:10, Paper ThB15.2 | Add to My Program |
Robust Admittance Control with Complementary Passivity |
|
Xu, Jiapeng | University of Windsor |
Chen, Xiang | University of Windsor |
Tan, Ying | The University of Melbourne |
Zou, Wulin | Hong Kong University of Science and Technology |
Keywords: Robust control, Control system architecture, Control applications
Abstract: This paper studies a robust admittance control problem with a passivity requirement for stable and unstable linear time-invariant systems, motivated by control issues originated from physical human-robot interaction. A complementary admittance control structure is proposed and analyzed, revealing that the nominal performance (admittance tracking and passivity) is decoupled from robustness. Simulations on the admittance control for human arm strength augmentation with a passivity requirement validate the proposed controller design.
|
|
14:10-14:30, Paper ThB15.3 | Add to My Program |
On the Benefit of Nonlinear Control for Robust Logarithmic Growth: Coin Flipping Games As a Demonstration Case |
|
Proskurnikov, Anton V. | Politecnico Di Torino |
Barmish, B. Ross | Boston University |
Keywords: Robust control, Finance, Markov processes
Abstract: The takeoff point for this paper is the voluminous body of literature addressing recursive betting games with expected logarithmic growth of wealth being the performance criterion. Whereas almost all existing papers involve use of linear feedback, the use of nonlinear control is conspicuously absent. This is epitomized by the large subset of this literature dealing with Kelly Betting. With this as the high-level motivation, we study the potential for use of nonlinear control in this framework. To this end, we consider a "demonstration case" which is one of the simplest scenarios encountered in this line of research: repeated flips of a biased coin with probability of heads p and even-money payoff on each flip. First, we formulate a new robust nonlinear control problem which we believe is both simple to understand and apropos for dealing with concerns about distributional robustness; i.e., instead of assuming that p is perfectly known as in the case of the classical Kelly formulation, we begin with a bounding set P for this probability. Then, we provide a theorem, our main result, which gives a closed-form description of the optimal robust nonlinear controller and a corollary which establishes that it robustly outperforms linear controllers such as those found in the literature. A second contribution of this paper bears upon the computability of our solution. For an n-flip game, whereas an admissible controller has 2^n-1 parameters, at the optimum only O(n^2) of them turn out to be distinct. Finally, we provide some illustrations comparing robust performance with what is possible when working with the so-called perfect-information Kelly optimum.
|
|
14:30-14:50, Paper ThB15.4 | Add to My Program |
Mixed Gain/Phase Robustness Criterion for Structured Perturbations with an Application to Power System Stability |
|
Woolcock, Luke | University of Melbourne |
Schmid, Robert | The University of Melbourne |
Keywords: Robust control, Linear systems, Power systems
Abstract: A novel conception of phase for linear time-invariant multivariable systems was recently introduced. It enables robustness of such systems to be determined in terms of a phase-bounded set of perturbations via a so-called small phase theorem, in analogy to the well-known small gain theorem. However, it requires the system's frequency response to satisfy the relatively strong condition known as "sectoriality," which not all practical systems have. This paper aims to show that if the perturbation is assumed to have a block diagonal structure, a matrix-valued multiplier function can be calculated that can enable phase-based robustness margins to be defined in some cases when the original system is not sectorial. A real-world power systems example is presented to show how the small phase criterion using a multiplier can significantly reduce the conservatism of the small gain theorem, providing computationally straightforward methods to inform further nonlinear stability analysis of power systems.
|
|
14:50-15:10, Paper ThB15.5 | Add to My Program |
Strengthened Circle and Popov Criteria for the Stability Analysis of Feedback Systems with ReLU Neural Networks |
|
Richardson, Carl Robert | University of Southampton |
Turner, Matthew C. | University of Southampton |
Gunn, Steve | University of Southampton |
Keywords: Robust control, Neural networks, Stability of nonlinear systems
Abstract: This paper considers the stability analysis of a Lurie system with a static repeated ReLU (rectified linear unit) nonlinearity. Properties of the ReLU function are leveraged to derive new tailored quadratic constraints (QCs) which are satisfied by the repeated ReLU. These QCs are used to strengthen the Circle and Popov Criteria for this specialised Lurie system. It is shown that the criteria can be cast as a set of linear matrix inequalities (LMIs) with less restrictive conditions on the matrix variables. Many systems involving a neural network (NN) with ReLU activations are important instances of this specialised Lurie system; for example, a continuous time recurrent neural network (RNN) or the interconnection of a linear system with a feedforward NN. Numerical examples show the strengthened criteria strike an appealing balance between reduced conservatism and complexity, compared to existing criteria.
|
|
15:10-15:30, Paper ThB15.6 | Add to My Program |
Verification of Low-Dimensional Neural Network Control |
|
Gronqvist, Johan | Lund University |
Rantzer, Anders | Lund University |
Keywords: Robust control, Nonlinear systems, Neural networks
Abstract: We verify safety of a nonlinear continuous-time system controlled by a neural network controller. The system is decomposed into low-dimensional subsystems connected in a feedback loop. Our application is a rocket landing, and open-loop properties of the two-dimensional altitude subsystem are verified using worst-case simulations. Closed-loop safety properties (crash-avoidance) of the full system are obtained from composition of contracts for open-loop safety properties of subsystems in a fashion analogous to the small-gain theorem.
|
|
ThB16 Regular Session, Peony Junior 4512 |
Add to My Program |
Power Systems II |
|
|
Chair: Jiang, Yuning | EPFL |
Co-Chair: Glista, Elizabeth | University of California, Berkeley |
|
13:30-13:50, Paper ThB16.1 | Add to My Program |
Hypergraph-Based Fast Distributed AC Power Flow Optimization |
|
Dai, Xinliang | Karlsruhe Institute of Technology |
Lian, Yingzhao | EPFL |
Jiang, Yuning | EPFL |
Jones, Colin N. | EPFL |
Hagenmeyer, Veit | Karlsruhe Institute of Technology (KIT) |
Keywords: Power systems, Optimization algorithms, Nonlinear systems
Abstract: This paper presents a novel distributed approach for solving AC power flow (PF) problems. The optimization problem is reformulated into a distributed form using a communication structure corresponds to a hypergraph, by which complex relationships between subgrids can be expressed as hyperedges. Then, a hypergraph-based distributed sequential quadratic programming (HDSQP) approach is proposed to handle the reformulated problems, and the hypergraph based distributed quadratic optimization algorithm (HDQ) is used as the inner algorithm to solve the corresponding QP subproblems, which are respectively condensed using Schur complements with respect to coupling variables defined by hyperedges. Furthermore, we rigorously establish the convergence guarantee of the proposed algorithm with a locally quadratic rate and the one-step convergence of the inner algorithm when using the Levenberg-Marquardt regularization. Our analysis also demonstrates that the computational complexity of the proposed algorithm is much lower than the state-of-art distributed algorithm. We implement the proposed algorithm in an open-source toolbox, rapidPF, and conduct numerical tests that validate the proof and demonstrate the great potential of the proposed distributed algorithm in terms of communication effort and computational speed.
|
|
13:50-14:10, Paper ThB16.2 | Add to My Program |
A Control Leonov Function Guaranteeing Global ISS of Two Coupled Synchronverters |
|
Mercado Uribe, José Angel | Brandenburgische Technische Universität Cottbus-Senftenberg |
Mendoza-Avila, Jesus | INRIA Lille-Nord Europe |
Efimov, Denis | Inria |
Schiffer, Johannes | Brandenburg University of Technology |
Keywords: Power systems, Nonlinear systems, Control applications
Abstract: Abstract—The synchronverter control algorithm is a highly promising option for operating AC power inverters in future low-inertia power systems. Yet, as with conventional synchronous generators, the standard synchronverter algorithm only provides limited robustness guarantees. Therefore, we propose an additional control that confers the closed-loop system with global robustness in the Input-to-State Stability (ISS) sense. In this paper, such a control law is derived for the case of two identical synchronverters interconnected over dynamic power lines by using the Control Leonov Function (CLeF) framework. The control is illustrated by simulations.
|
|
14:10-14:30, Paper ThB16.3 | Add to My Program |
An Optimization-Based Method for Transient Stability Assessment |
|
Gao, Jianli | Imperial College London |
Chaudhuri, Balarko | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Power systems, Lyapunov methods, Electrical machine control
Abstract: The paper proposes an optimization-based method for the transient stability assessment of lossy multi-machine power systems. To achieve this objective, a global control Lyapunov function candidate including an auxiliary state is introduced. On this basis, a new excitation control law is proposed. This control law is well-defined provided that an ‘index’ matrix remains non-singular along the closed-loop trajectories. Such a matrix plays a key role in the formulation of an optimization problem, which allows calculating the so-called critical value associated to the introduced Lyapunov function. This permits a direct assessment of transient stability property of the considered post-fault power system. To illustrate the effectiveness of such an optimization-based method, a case study on the model of a three-machine system is presented.
|
|
14:30-14:50, Paper ThB16.4 | Add to My Program |
Differentially Private Algorithms for Synthetic Power System Datasets |
|
Dvorkin, Vladimir | Massachusetts Institute of Technology |
Botterud, Audun | MIT |
Keywords: Power systems, Optimization, Randomized algorithms
Abstract: While power systems research relies on the availability of real-world network datasets, data owners (e.g., system operators) are hesitant to share data due to privacy risks. To control these risks, we develop privacy-preserving algorithms for synthetic generation of optimization and machine learning datasets. Taking a real-world dataset as input, the algorithms output its noisy, synthetic version, which preserves the accuracy of the real data on a specific downstream model or even a large population of those. We control the privacy loss using Laplace and Exponential mechanisms of differential privacy and preserve data accuracy using a post-processing convex (or mixed-integer) optimization. We apply the algorithms to generate synthetic network parameters and wind power data.
|
|
14:50-15:10, Paper ThB16.5 | Add to My Program |
Optimization-Based Bound Tightening Using a Strengthened QC-Relaxation of the Optimal Power Flow Problem |
|
Sundar, Kaarthik | Los Alamos National Laboratory |
Nagarajan, Harsha | Los Alamos National Laboratory |
Misra, Sidhant | Los Alamos National Laboratory |
Lu, Mowen | Walmart Global Tech |
Coffrin, Carleton | Los Alamos National Laboratory |
Bent, Russell | Los Alamos National Laboratory |
Keywords: Power systems, Optimization, Optimization algorithms
Abstract: This paper develops a novel strengthened convex quadratic convex (QC) relaxation of the AC Optimal Power Flow (AC-OPF) problem and presents an optimization-based bound-tightening (OBBT) algorithm to compute tight, feasible bounds on the voltage magnitude variables for each bus and the phase angle difference variables for each branch in the network. Theoretical properties of the strengthened QC relaxation, that show its dominance over the other variants of the QC relaxation studied in the literature, are also derived. The effectiveness of the strengthened QC relaxation is corroborated via extensive numerical results on benchmark AC-OPF test networks. In particular, the results demonstrate that the proposed relaxation consistently provides the tightest variable bounds and optimality gaps with negligible impacts on runtime performance.
|
|
15:10-15:30, Paper ThB16.6 | Add to My Program |
Leveraging the Physics of AC Power Flow in Support Vector Regression to Identify Power System Topology |
|
Glista, Elizabeth | University of California, Berkeley |
Sojoudi, Somayeh | UC Berkeley |
Keywords: Power systems, Smart grid, Machine learning
Abstract: Understanding an electric power system's topology, including both its nodal connectivity and physical parameters, is critically important to the reliable operation and control of the power grid. In cases where this power system topology may be unavailable, due to data collection deficiencies, real-time line switching, or intentional cyberattacks, it is important to be able to estimate the real power system topology with high accuracy. In this paper, we propose a new data-driven constrained support vector regression (SVR) method that aims to map voltage data collected from phasor measurement units (PMUs) to data collected by Supervisory Data Acquisition and Control (SCADA) systems. We show that the dual of the constrained SVR model can be formulated as a quadratic program (QP) and solved efficiently with off-the-shelf solvers. Testing our method on standard IEEE test cases, we demonstrate that our proposed method significantly outperforms existing state-of-the-art SVR methods in learning the true network topology, even in the presence of measurement noise, outliers, and missing data.
|
|
ThB17 Regular Session, Peony Junior 4511 |
Add to My Program |
Iterative Learning Control I |
|
|
Chair: Rogers, Eric | University of Southampton |
Co-Chair: Poot, Maurice | Eindhoven University of Technology |
|
13:30-13:50, Paper ThB17.1 | Add to My Program |
Trackability-Based Distributed Learning Control for Multi-Agent Systems under Switching Topologies |
|
Wu, Yuxin | Beihang University (BUAA) |
Meng, Deyuan | Beihang University (BUAA) |
Wang, Jing | North China University of Technology |
Keywords: Iterative learning control, Agents-based systems, Cooperative control
Abstract: This paper aims to address the distributed learning control problem for irregular multi-agent systems subject to switching topologies. The cooperative trackability property for the desired reference is discussed, which ensures the existence of the desired inputs for realizing the cooperative perfect tracking objective. Then, a trackability-based distributed learning control algorithm is presented with the integration of the complete experience information from the previous iteration. It is shown that for the cooperatively trackable desired reference, all agents learn to achieve the cooperative perfect tracking objective in the presence of the developed distributed learning control algorithm despite their irregular dynamics, provided that their associated directed graphs jointly have a spanning tree. The simulation is implemented to illustrate the validity of the trackability-based distributed learning control algorithm.
|
|
13:50-14:10, Paper ThB17.2 | Add to My Program |
Online Stochastic Allocation of Reusable Resources |
|
Zhang, Xilin | National University of Singapore |
Cheung, Wang Chi | National University of Singapore |
Keywords: Iterative learning control, Data driven control, Stochastic optimal control
Abstract: We study a multi-objective model on the allocation of reusable resources under model uncertainty. Heterogeneous customers arrive sequentially according to a latent stochastic process, request for certain amounts of resources, and occupy them for random durations of time and return them. The decision maker's goal is to simultaneously maximize multiple types of rewards generated by the customers, while satisfying the resource capacity constraints in each time step. We develop models and algorithms for deciding on the allocation actions. We show that when the usage duration is relatively small compared with the length of the planning horizon, our policy achieves 1-O(epsilon) fraction of the optimal expected rewards, where epsilon decays to zero at a near optimal rate as the resource capacities grow. We further conduct numerical experiments to justify the performance of our algorithm.
|
|
14:10-14:30, Paper ThB17.3 | Add to My Program |
Data-Driven Iterative Learning Control for Continuous-Time Systems |
|
Chu, Bing | University of Southampton |
Rapisarda, Paolo | Univ. of Southampton |
Keywords: Iterative learning control, Data driven control
Abstract: We develop a data-driven iterative learning control design framework for continuous-time systems that does not require explicit or implicit identification of a system model. Using Chebyshev polynomial orthogonal bases, we show that all system trajectories can be characterised from sufficiently rich input/output data. Using this crucial result we develop a data-driven version of the model-based norm-optimal iterative learning control algorithm, and provide a computationally efficient implementation thereof. We rigorously analyse the convergence properties of the resulting design and also present a numerical example to illustrate its effectiveness.
|
|
14:30-14:50, Paper ThB17.4 | Add to My Program |
Boundary Iterative Learning Control for Repetitive Spatio-Temporal Processes |
|
Patan, Maciej | University of Zielona Gora |
Klimkowicz, Kamil | University of Zielona Gora |
Patan, Krzysztof | University of Zielona Gora |
Rogers, Eric | University of Southampton |
Keywords: Iterative learning control, Distributed parameter systems, Data driven control
Abstract: Iterative learning control for lumped processes is well established. Therefore, there is strong interest in developing designs that would produce similar flexibility for classes of distributed parameter systems. This paper develops a design for application to examples described by partial differential equations of convection-diffusion type in the multidimensional spatial domain, which have many applications, such as heat transfer problems. The system response is measured, and then the control is applied via specific boundary conditions using a sensor/actuator network, i.e., boundary control, as opposed to designs that require sensing and actuating application over the domain the dynamics are defined over. The convergence properties of the design are established together with rules for tuning its parameters for performance enhancement. Finally, the new design is applied to a laser heating problem in wafer staging, which requires boundary control.
|
|
14:50-15:10, Paper ThB17.5 | Add to My Program |
Risk-Constrained Control of Mean-Field Linear Quadratic Systems |
|
Roudneshin, Masoud | Concordia University |
Sanami, Saba | Concordia University |
Aghdam, Amir G. | Concordia University |
Keywords: Iterative learning control, Linear systems, Control of networks
Abstract: The risk-neutral LQR controller is optimal for stochastic linear dynamical systems. However, the classical optimal controller performs inefficiently in the presence of low probability yet statistically significant (risky) events. The present research focuses on infinite-horizon risk-constrained linear quadratic regulators in a mean-field setting. We address the risk constraint by bounding the cumulative one-stage variance of the state penalty of all players. It is shown that the optimal controller is affine in the state of each player with an additive term that controls the risk constraint. In addition, we propose a solution independent of the number of players. Finally, simulations are presented to verify the theoretical findings.
|
|
15:10-15:30, Paper ThB17.6 | Add to My Program |
Rational Basis Functions in Iterative Learning Control for Multivariable Systems |
|
Poot, Maurice | Eindhoven University of Technology |
Portegies, Jim | Eindhoven University of Technology |
Kostic, Dragan | ASM Pacific Technology |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Iterative learning control, Mechatronics, Data driven control
Abstract: Feedforward control with task flexibility for MIMO systems is essential to meet ever-increasing demands on throughput and accuracy. The aim of this paper is to develop a framework for data-driven tuning of rational feedforward controllers in iterative learning control (ILC) for noncommutative MIMO systems. A convex optimization problem in ILC is achieved by rewriting the nonlinear terms in the control scheme as a function of the previous feedforward parameters. A simulation study on an multivariable industrial printer shows that the developed framework converges and achieves significant better performance than direct application of the RBF algorithm using SK-iterations for SISO systems.
|
|
ThB18 Regular Session, Peony Junior 4412 |
Add to My Program |
Stability of Nonlinear Systems I |
|
|
Chair: Heath, William Paul | University of Manchester |
Co-Chair: Zaccarian, Luca | LAAS-CNRS and University of Trento |
|
13:30-13:50, Paper ThB18.1 | Add to My Program |
A Compositional Approach to Certifying Almost Global Asymptotic Stability of Cascade Systems |
|
Welde, Jake | University of Pennsylvania |
Kvalheim, Matthew | University of Maryland, Baltimore County |
Kumar, Vijay | University of Pennsylvania |
Keywords: Stability of nonlinear systems, Algebraic/geometric methods
Abstract: In this work, we give sufficient conditions for the almost global asymptotic stability of a cascade in which the subsystems are only almost globally asymptotically stable. The result is extended to upper triangular systems of arbitrary size. In particular, if the unforced subsystems are almost globally asymptotically stable and their only chain recurrent points are hyperbolic equilibria, then the boundedness of forward trajectories is sufficient for the almost global asymptotic stability of the full upper triangular system. We show that unboundedness of such cascades is prohibited by growth rate conditions on the interconnection term and a Lyapunov function for the unforced outer subsystem, and the required structure for the chain recurrent set is enjoyed by classes of systems common in geometric control e.g. dissipative mechanical systems. Our results stand in contrast to prior works that require either time scale separation, strong disturbance robustness properties, or global asymptotic stability in the subsystems.
|
|
13:50-14:10, Paper ThB18.2 | Add to My Program |
Phase Limitations of Multipliers at Harmonics Via Frequency Intervals |
|
Heath, William Paul | University of Manchester |
Carrasco, Joaquin | University of Manchester |
Keywords: Stability of nonlinear systems, Constrained control
Abstract: The absolute stability of a Lurye system with a monotone nonlinearity is guaranteed by the existence of a suitable O'Shea-Zames-Falb (OZF) multiplier. A numerically tractable phase condition has recently been proposed under which there can be no suitable OZF multiplier for the transfer function of a given continuous-time plant. The condition has been derived via the so-called duality approach. Here we show that the condition may also be derived from an established frequency interval approach providing an important link between the two hitherto distinct approaches. We show that it leads to significantly improved results compared with the frequency interval approach on a benchmark example.
|
|
14:10-14:30, Paper ThB18.3 | Add to My Program |
Some Relations between Different Stability Notions for Discrete-Time Systems with Inputs |
|
Dashkovskiy, Sergey | University of Wuerzburg |
Schroll, Andreas | University of Würzburg |
|
|
14:30-14:50, Paper ThB18.4 | Add to My Program |
Local Static Anti-Windup Design with Sign-Indefinite Quadratic Forms |
|
Pantano-Calderón, Santiago | LAAS-CNRS |
Tarbouriech, Sophie | LAAS-CNRS |
Zaccarian, Luca | LAAS-CNRS |
Keywords: Stability of nonlinear systems, LMIs, Lyapunov methods
Abstract: This note proposes static anti-windup gains design for closed-loop linear systems with saturating inputs providing maximized non-ellipsoidal estimates of the basin of attraction. The proposed design uses sign-indefinite quadratic forms leading to locally positive definite nonquadratic Lyapunov functions. An iterative algorithm that solves the bilinear matrix conditions inherent to this problem is proposed, based on a convex-concave decomposition. A numerical application is presented to illustrate the effectiveness of the proposed method.
|
|
14:50-15:10, Paper ThB18.5 | Add to My Program |
Control of Bilinear Systems Using Gain-Scheduling: Stability and Performance Guarantees |
|
Strässer, Robin | University of Stuttgart |
Berberich, Julian | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Stability of nonlinear systems, LMIs
Abstract: In this paper, we present a state-feedback controller design method for bilinear systems. To this end, we write the bilinear system as a linear fractional representation by interpreting the state in the bilinearity as a structured uncertainty. Based on that, we derive convex conditions in terms of linear matrix inequalities for the controller design, which are efficiently solvable by semidefinite programming. Further, we prove asymptotic stability and quadratic performance of the resulting closed-loop system locally in a predefined region. The proposed design uses gain-scheduling techniques and results in a state feedback with rational dependence on the state, which can substantially reduce conservatism and improve performance in comparison to a simpler, linear state feedback. Moreover, the design method is easily adaptable to various scenarios due to its modular formulation in the robust control framework. Finally, we apply the developed approaches to numerical examples and illustrate the benefits of the approach.
|
|
15:10-15:30, Paper ThB18.6 | Add to My Program |
Estimation of Regions of Attraction with Formal Certificates in a Purely Data-Driven Setting |
|
Mauroy, Alexandre | University of Namur |
Sootla, Aivar | University of Oxford |
Keywords: Stability of nonlinear systems, Lyapunov methods, Computational methods
Abstract: We provide a Koopman operator based method to estimate the region of attraction of equilibria in a purely data-driven setting. The proposed method yields formal stability certificates, while not requiring prior knowledge of the system dynamics or online addition of data points along the way. It consists in three steps. First, a candidate Lyapunov is constructed through an approximated linear lifted dynamics. Next, the validity domain of the Lyapunov function is assessed from the data set. This validation step is performed with the sole knowledge of a (possibly loose) second-order bound on the flow, and without the usual a priori knowledge of a Lipschitz constant. Finally, an inner approximation of the region of attraction is obtained on an adaptive grid via a branch-and-bound algorithm.
|
|
ThB19 Regular Session, Peony Junior 4411 |
Add to My Program |
Stability of Linear Systems |
|
|
Chair: Lindquist, Anders | Shanghai Jiao Tong University |
Co-Chair: Park, PooGyeon | Pohang Univ. of Sci. & Tech |
|
13:30-13:50, Paper ThB19.1 | Add to My Program |
A Novel Free-Matrix-Based Summation Inequality for Stability Analysis of Discrete-Time Delayed System |
|
Park, Yongbeom | POSTECH |
Park, PooGyeon | POSTECH (Pohang Univ. of Sci. & Tech.) |
Keywords: Stability of linear systems, Delay systems, Lyapunov methods
Abstract: This paper introduces an improved stability criterion for discrete-time systems with time-varying delay. A novel summation inequality based on the free-matrix is suggested which considers the augmented vector of the state and its forward difference. Additionally, the proposed summation inequality is employed to derive an improved stability criterion for the discrete-time system with time-varying delay. A new Lyapunov-Krasovskii functional is established for applying the summation lemma to reduce the conservatism of the stability analysis. To verify the effectiveness of the proposed approach, the maximum admissible upper bounds of the proposed method is presented in comparison to existing methods with two numerical examples.
|
|
13:50-14:10, Paper ThB19.2 | Add to My Program |
Linear Stability of Plane Poiseuille Flow in the Sense of Lyapunov |
|
Peet, Yulia | Arizona State University |
Edwards, Collin | Arizona State University |
Keywords: Stability of linear systems, Fluid flow systems, LMIs
Abstract: In this paper, we present a linear stability analysis formulation for a plane Poiseuille flow developed in a continuous time domain. Contrary to a conventional approach based on an eigenvalue analysis, which can only proof stability with respect to certain solutions that are assumed to be time harmonics modulated by an exponentially growing or decaying amplitude, the presented methodology does not make any assumptions on a solution form. By analyzing all time-varying solutions and not only the ones restricted to a specific functional form, the developed stability test provides a stronger condition with regard to the system stability. Stability analysis is performed by first casting the corresponding linearized partial differential equation into a partial integral equation (PIE) form, and subsequently employing a linear partial inequality (LPI) stability test, which searches for a corresponding Lyapunov function parameterized through polynomial expansions to prove or disprove stability. Stability results of the continuous-time formulation for the plane Poiseuille flow are compared with a traditional eigenvalue-based analysis, demonstrating that the developed methodology represents a stricter condition on stability.
|
|
14:10-14:30, Paper ThB19.3 | Add to My Program |
Coercive Quadratic ISS Lyapunov Functions for Analytic Systems |
|
Mironchenko, Andrii | University of Passau |
Schwenninger, Felix | University of Twente |
Keywords: Stability of linear systems, Lyapunov methods, Distributed parameter systems
Abstract: We investigate the relationship between input-to-state stability (ISS) of linear infinite-dimensional systems and existence of coercive ISS Lyapunov functions. We show that input-to-state stability of a linear system does not imply existence of a coercive quadratic ISS Lyapunov function, even if the underlying semigroup is analytic, and the input operator is bounded. However, if in addition the semigroup is similar to a contraction semigroup on a Hilbert space, then a quadratic ISS Lyapunov function always exists. Next we consider analytic and similar to contraction semigroups in Hilbert spaces with unbounded input operator B. If B is slightly stronger than 2-admissible, we construct explicitly a coercive L^2-ISS Lyapunov function. If the generator of a semigroup is additionally self-adjoint, this Lyapunov function is precisely a square norm in the state space.
|
|
14:30-14:50, Paper ThB19.4 | Add to My Program |
A Generalized Stopping Criterion for Real-Time MPC with Guaranteed Stability |
|
Fedorová, Kristína | Slovak University of Technology in Bratislava |
Jiang, Yuning | EPFL |
Oravec, Juraj | Slovak University of Technology in Bratislava |
Jones, Colin N. | EPFL |
Kvasnica, Michal | Slovak University of Technology in Bratislava |
Keywords: Stability of linear systems, Predictive control for linear systems, Optimization algorithms
Abstract: Most of the real-time implementations of the stabilizing optimal control actions suffer from the necessity to provide high computational effort. This paper presents a cutting-edge approach for real-time evaluation of linear-quadratic model predictive control (MPC) that employs a novel generalized stopping criterion achieving asymptotic stability in the presence of input constraints. The proposed method evaluates a fixed number of iterations independent of the initial condition, eliminating the necessity for computationally expensive methods. We demonstrate the effectiveness of the introduced technique by its implementation of two widely-used first-order optimization methods: the projected gradient descent method (PGDM) and the alternating directions method of multipliers (ADMM). The numerical simulation confirmed a significantly reduced number of iterations resulting in suboptimality rates of less than 2, while the reductions exceeded 80. These results nominate the proposed criterion as an efficient real-time implementation method of MPC controllers.
|
|
14:50-15:10, Paper ThB19.5 | Add to My Program |
A Parameterized Solution to the Simultaneous Stabilization Problem |
|
Cui, Yufang | Shanghai Jiao Tong University |
Lindquist, Anders | Shanghai Jiao Tong University |
Keywords: Stability of linear systems, Modeling, Computational methods
Abstract: In a series of fundamental papers BK Ghosh reduced the simultaneous stabilization problem to a Nevanlinna-Pick interpolation problem. In this paper we generalize some of these results allowing for derivative constraints. Moreover, we apply a method based on a Riccati-type matrix equation, called the Covariance Extension Equation, which provides a parameterization of all solutions in terms of a monic Schur polynomial. The procedure is illustrated by examples.
|
|
15:10-15:30, Paper ThB19.6 | Add to My Program |
On Solving Infinite-Dimensional Toeplitz Block LMIs |
|
Vernerey, Flora | Université De Lorraine, CNRS |
Riedinger, Pierre | Université De Lorraine - CNRS |
Daafouz, Jamal | Université De Lorraine, CRAN, CNRS |
Keywords: LMIs, Stability of linear systems, Time-varying systems
Abstract: This paper focuses on the resolution of infinite-dimensional Toeplitz Block LMIs, which are frequently encountered in the context of stability analysis and control design problems formulated in the harmonic framework. We propose {a consistent truncation method that makes this infinite dimensional problem tractable} and demonstrate that a solution to the truncated problem can always be found at any order, provided that the original infinite-dimensional Toeplitz Block LMI problem is feasible. Using this approach, we illustrate how the infinite dimensional solution of a Toeplitz Block LMI based convex optimization problem can be recovered up to {an arbitrarily} small error, by solving a finite dimensional truncated problem. The obtained results are applied to stability analysis and harmonic LQR for linear time periodic (LTP) systems.
|
|
ThB20 Regular Session, Orchid Junior 4312 |
Add to My Program |
Robotics |
|
|
Chair: Campolo, Domenico | Nanyang Technological University, Singapore |
Co-Chair: Schoellig, Angela P | University of Toronto |
|
13:30-13:50, Paper ThB20.1 | Add to My Program |
Multi-Step Model Predictive Safety Filters: Reducing Chattering by Increasing the Prediction Horizon |
|
Pizarro Bejarano, Federico | University of Toronto |
Brunke, Lukas | University of Toronto |
Schoellig, Angela P | University of Toronto |
Keywords: Robotics
Abstract: Learning-based controllers have demonstrated superior performance compared to classical controllers in various tasks. However, providing safety guarantees is not trivial. Safety, the satisfaction of state and input constraints, can be guaranteed by augmenting the learned control policy with a safety filter. Model predictive safety filters (MPSFs) are a common safety filtering approach based on model predictive control (MPC). MPSFs seek to guarantee safety while minimizing the difference between the proposed and applied inputs in the immediate next time step. This limited foresight can lead to jerky motions and undesired oscillations close to constraint boundaries, known as chattering. In this paper, we reduce chattering by considering input corrections over a longer horizon. Under the assumption of bounded model uncertainties, we prove recursive feasibility using techniques from robust MPC. We verified the proposed approach in both extensive simulation and quadrotor experiments. In experiments with a Crazyflie 2.0 drone, we show that, in addition to preserving the desired safety guarantees, the proposed MPSF reduces chattering by more than a factor of 4 compared to previous MPSF formulations.
|
|
13:50-14:10, Paper ThB20.2 | Add to My Program |
An SDP Optimization Formulation for the Inverse Kinematics Problem |
|
Wu, Liangting | Boston University |
Tron, Roberto | Boston University |
Keywords: Robotics
Abstract: Inverse kinematics (IK) is an important problem in robot control and motion planning; however, the nonlinearity of the map from joint angles to robot configurations makes the problem nonconvex. In this paper, we propose an inverse kinematics solver that works in the space of rotation matrices of the link reference frames rather than joint angles. To overcome the nonlinearity of the manifold of rotation matrices SO(3), we propose a semidefinite programming (SDP) relaxation of the kinematic constraints followed by a fixed-trace rank minimization via maximization of a convex function. Along the way, we show that the feasible set of an IK problem is exactly the intersection of a convex set and fixed-trace rank-1 matrices. Thanks to the use of matrices with fixed trace, our algorithm to obtain rank-1 solutions has guaranteed local convergence. Unlike some traditional solvers, our method does not require an initial guess, and can be applied to robots with closed kinematic chains without ad-hoc modifications such as splitting the kinematic chain. Compared to other work that performs SDP relaxation for IK problems, our formulation is simpler, and uses variables with smaller sizes. We validate our approach via simulations on a closed kinematic chain constituted by two robotic arms holding a box, comparing against a standard IK method.
|
|
14:10-14:30, Paper ThB20.3 | Add to My Program |
Acceleration-Free Recursive Composite Learning Control of High-DoF Robot Manipulators |
|
Zhu, Yuejiang | Sun Yat-Sen University |
Shi, Tian | Sun Yat-Sen University |
Li, Weibing | Sun Yat-Sen University |
Pan, Yongping | Sun Yat-Sen University |
Keywords: Robotics, Adaptive control, Closed-loop identification
Abstract: Composite learning robot control (CLRC) is an adaptive control approach that achieves exponential parameter convergence without using a stringent condition termed persistent excitation (PE). For robots with low degrees of freedom (DoFs), a filtered regressor of the robot dynamics needed in CLRC can be calculated analytically without joint accelerations, but this is difficult for high-DoF robots. Under the linear parameterization by the recursive Newton-Euler algorithm, this paper proposes an acceleration-free recursive CLRC (RCLRC) method for high-DoF robots to achieve exponential parameter convergence under a weakened condition termed interval excitation (IE). The proposed method has a low computational cost and avoids undesirable acceleration estimation that seriously affects performance. Simulations and experiments on a 7-DoF robot manipulator have verified the superiority of the proposed RCLRC, where it outperforms its analytical version in both estimation and tracking.
|
|
14:30-14:50, Paper ThB20.4 | Add to My Program |
The Inherent Representation of Tactile Manipulation Using Unified Force-Impedance Control |
|
Karacan, Kübra | TUM, MIRMI |
Kirschner, Robin | TU Munich - MIRMI |
Sadeghian, Hamid | Technical University of Munich |
Wu, Fan | Technical University of Munich |
Haddadin, Sami | Technische Universität München |
Keywords: Robotics, Autonomous robots
Abstract: Different robotic manipulation tasks require different execution and planning strategies. Nevertheless, the versatility of tasks in assembly and disassembly demands flexible control strategies. Fundamental to achieving such adaptive control methods is understanding and generalizing the interactions between tools, the object that is manipulated, and the environment required to perform a manipulation. In this paper, we address the problem of generating adaptive manipulation by introducing the force-velocity task phase plot that represents the inherent nature of tactile manipulation skills. This representation enables us to identify the basic phases of the interaction in the force-velocity domain. Using unified force-impedance control, we establish a tactile manipulation plan to robustly conduct versatile manipulation tasks even in case of disturbances or imprecise task information. The proposed control scheme features a dynamic process for impedance shaping based on the external force applied to the robot and the skill motion error for collision response, as well as a force-shaping function that enables both a smooth transition from free motion to contact and force regulation. We implement and compare the control strategy to previously proposed strategies using peg-in-hole reference experiments that include force disturbance and positioning inaccuracies and show the respective task phase plots. As a result, we observe high controller robustness and conclude that using the task phase plot as the inherent representation of tactile manipulation via unified force-impedance control enables successful adaptive controller design and creates a quantifiable basis for robotic skill solution comparison.
|
|
14:50-15:10, Paper ThB20.5 | Add to My Program |
Quasi-Static Mechanical Manipulation As an Optimal Process |
|
Campolo, Domenico | Nanyang Technological University, Singapore |
Cardin, Franco | Univ. Di Padova |
Keywords: Robotics, Differential-algebraic systems, Optimization algorithms
Abstract: This work focuses on quasi-static manipulation of elastically-interconnected rigid bodies by an agent, e.g. a robot assembling mechanical parts. The whole system is seen as an underactuated control problem, where only certain degrees of freedom (e.g. robot end-effector) are directly controllable. Mechanical contact is regularized via nonlinear yet smooth elastic interaction giving rise to a smooth total potential energy. The squared-Hessian of such a potential is used as optimality measure and quasi-static mechanical manipulation is rephrased as optimal path planning on the manifold of mechanical equilibria. A simple example of an elastically-driven inverted pendulum is presented as a toy model. Numerical implementation as a minimum path on graphs is also described.
|
|
15:10-15:30, Paper ThB20.6 | Add to My Program |
MrDMD-Based Sensor Placement in Distributed Flow Estimation for the Design of the Artificial Lateral Line of an Underwater Robot |
|
Wang, Jun | Peking University |
Shen, Tongsheng | National Innovation Institute of Defense Technology |
Zhao, Dexin | National Innovation Institute of Defense Technology |
Zhang, Feitian | Peking University |
Keywords: Robotics, Estimation, Optimization
Abstract: An artificial lateral line (ALL) is a sensing system that imitates the distributed perception organs of fish and plays a major role in enhancing the flow estimation capability of underwater robots. Whereas various ALLs have been designed and developed, it is still an open question how to better place ALL sensors on underwater robots, especially for those with complex shapes and working in dynamic flow and robot operating conditions. Aiming to answer this question, this paper presents a novel data-driven sensor placement method for ALLs of underwater robots. This method adopts distributed pressure sensors to measure the flow field along the profile or the outermost boundary of an underwater robot, and quantifies the dynamic information embedded within these measurements using multi-resolution dynamic mode decomposition (mrDMD). The sensors are then positioned by optimizing the dynamic flow information to enhance the perception. Compared with existing sensor placement methods, such as observability maximization and exhaustive experimental search, the proposed method focuses on the modes of dynamics variability at various spatio-temporal scales, thus leading to improved sensing ability especially in complex and dynamic flows. In addition, comprehensively considering the sensor placement under different flow and robot operating conditions, the proposed method is expected to provide an optimal solution for the overall sensing performance of the ALL system. To demonstrate the effectiveness of the proposed method, a case study of background flow speed estimation of oscillating underwater robots of different shapes in a uniform flow is presented.
|
|
ThB21 Regular Session, Orchid Junior 4311 |
Add to My Program |
Predictive Control for Nonlinear Systems II |
|
|
Chair: Zeilinger, Melanie N. | ETH Zurich |
Co-Chair: Krishnamoorthy, Dinesh | TU Eindhoven |
|
13:30-13:50, Paper ThB21.1 | Add to My Program |
Imitation Learning from Nonlinear MPC Via the Exact Q-Loss and Its Gauss-Newton Approximation |
|
Ghezzi, Andrea | University of Freiburg |
Hoffmann, Jasper | University of Freiburg |
Frey, Jonathan | University of Freiburg |
Boedecker, Joschka | University of Freiburg |
Diehl, Moritz | University of Freiburg |
Keywords: Predictive control for nonlinear systems, Machine learning, Optimal control
Abstract: This work presents a novel loss function for learning nonlinear Model Predictive Control policies via Imitation Learning. Standard approaches to Imitation Learning neglect information about the expert and generally adopt a loss function based on the distance between expert and learned controls. In this work, we present a loss based on the Q-function directly embedding the performance objectives and constraint satisfaction of the associated Optimal Control Problem (OCP). However, training a Neural Network with the Q-loss requires solving the associated OCP for each new sample. To alleviate the computational burden, we derive a second Q-loss based on the Gauss-Newton approximation of the OCP resulting in a faster training time. We validate our losses against Behavioral Cloning, the standard approach to Imitation Learning, on the control of a nonlinear system with constraints. The final results show that the Q-function-based losses significantly reduce the amount of constraint violations while achieving comparable or better closed-loop costs.
|
|
13:50-14:10, Paper ThB21.2 | Add to My Program |
An Improved Data Augmentation Scheme for Model Predictive Control Policy Approximation |
|
Krishnamoorthy, Dinesh | TU Eindhoven |
Keywords: Predictive control for nonlinear systems, Machine learning, Optimization
Abstract: This paper considers the problem of data generation for MPC policy approximation. Learning an approximate MPC policy from expert demonstrations requires a large data set consisting of optimal state-action pairs, sampled across the feasible state space. Yet, the key challenge of efficiently generating the training samples has not been studied widely. Recently, a sensitivity-based data augmentation framework for MPC policy approximation was proposed, where the parametric sensitivities are exploited to cheaply generate several additional samples from a single offline MPC computation. The error due to augmenting the training data set with inexact samples was shown to increase with the size of the neighborhood around each sample used for data augmentation. Building upon this work, this letter paper presents an improved data augmentation scheme based on predictor-corrector steps that enforces a user-defined level of accuracy, and shows that the error bound of the augmented samples are independent of the size of the neighborhood used for data augmentation.
|
|
14:10-14:30, Paper ThB21.3 | Add to My Program |
Data-Enabled Neighboring Extremal Optimal Control: A Computationally Efficient DeePC |
|
Vahidimoghaddam, Amin | Miichigan State University |
Zhang, Kaixiang | Michigan State University |
Li, Zhaojian | Michigan State University |
Wang, Yan | Self Employed |
Keywords: Predictive control for nonlinear systems, Optimal control, Constrained control
Abstract: Model-based optimal control strategies typically rely on accurate parametric representations of the underlying systems, which can be challenging to obtain, especially for nonlinear and complex systems. Therefore, data-driven optimal controllers have become increasingly attractive to both academics and industry practitioners. As a data-driven optimal control approach that can explicitly handle constraints, data-enabled predictive control (DeePC) makes a transition from model-based optimal control strategies (e.g. model predictive control (MPC)) to a data-driven one such that it seeks an optimal control policy from raw input/output (I/O) data without requiring system identification prior to control deployment, achieving remarkable successes in various applications. However, this approach involves high computational cost due to the dimension of the decision variable, which is generally significantly higher than its MPC counterpart. Several approaches have been proposed to reduce the computational cost of the DeePC for linear time-invariant (LTI) systems. However, finding a computationally efficient method to implement the DeePC for the nonlinear systems is still an open challenge. In this paper, we propose a data-enabled neighboring extremal (DeeNE) to approximate the DeePC policy and reduce its computational cost for the constrained nonlinear systems. The DeeNE adapts a pre-computed nominal DeePC solution to the perturbations of the initial I/O trajectory and the reference trajectory from the nominal ones. We also develop a scheme to handle nominal non-optimal solutions so that we can use the DeeNE solution as the nominal solution during the control process. Promising simulation results on the cart inverted pendulum problem demonstrate the efficacy of the DeeNE framework.
|
|
14:30-14:50, Paper ThB21.4 | Add to My Program |
Robust Optimal Control for Nonlinear Systems with Parametric Uncertainties Via System Level Synthesis |
|
Leeman, Antoine | ETH Zurich |
Sieber, Jerome | ETH Zurich |
Bennani, Samir | European Space Agency |
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Predictive control for nonlinear systems, Optimal control, Uncertain systems
Abstract: This paper addresses the problem of optimally controlling nonlinear systems with norm-bounded disturbances and parametric uncertainties while robustly satisfying constraints. The proposed approach jointly optimizes a nominal nonlinear trajectory and an error feedback, requiring minimal offline design effort and offering low conservatism. This is achieved by decomposing the affine-in-the-parameter uncertain nonlinear system into a nominal nonlinear system and an uncertain linear time-varying system. Using this decomposition, we can apply established tools from system level synthesis to convexly over-bound all uncertainties in the nonlinear optimization problem. Moreover, it enables tight joint optimization of the linearization error bounds, parametric uncertainties bounds, nonlinear trajectory, and error feedback. With this novel controller parameterization, we can formulate a convex constraint to ensure robust performance guarantees for the nonlinear system. The presented method is relevant for numerous applications related to trajectory optimization, e.g., in robotics and aerospace engineering. We demonstrate the performance of the approach and its low conservatism through the simulation example of a post-capture satellite stabilization.
|
|
14:50-15:10, Paper ThB21.5 | Add to My Program |
Tracking MPC Tuning in Continuous Time: A First-Order Approximation of Economic MPC |
|
Facchino, Matteo | IMT School for Advanced Studies Lucca |
Bemporad, Alberto | IMT School for Advanced Studies Lucca |
Zanon, Mario | IMT Institute for Advanced Studies Lucca |
Keywords: Predictive control for nonlinear systems, Predictive control for linear systems, Optimal control
Abstract: Economic MPC (EMPC) optimizes closed-loop performance by directly minimizing a given objective function, as opposed to Tracking MPC (TMPC) which instead penalizes deviations from a precalculated optimal reference. The main difference between the two approaches can be observed during transients, as the former always acts optimally, while the latter is only optimal when the reference is accurately tracked. Unfortunately, stability for EMPC is in general difficult to prove, as opposed to TMPC which builds on a rich theory. Additionally, many efficient algorithms are available for TMPC, while solving the EMPC problem can be much harder. A family of discrete-time TMPC schemes that provide approximate economic optimality has been developed in order to partially overcome these issues. In this paper, we aim at extending such a family of TMPC schemes by deriving them also in continuous time. Similarly to the discrete-time version, also our TMPC scheme provides a first-order approximation of the EMPC control law. We demonstrate the theory with a numerical example that confirms the first-order approximation and show that our continuous-time formulation can be made equivalent to the discrete-time one.
|
|
15:10-15:30, Paper ThB21.6 | Add to My Program |
Robust Nonlinear Reduced-Order Model Predictive Control |
|
Alora, John Irvin | Stanford University |
Pabon, Luis A. | Stanford University |
Köhler, Johannes | ETH Zurich |
Cenedese, Mattia | ETH Zurich |
Schmerling, Edward | Stanford University |
Zeilinger, Melanie N. | ETH Zurich |
Haller, George | ETH Zurich |
Pavone, Marco | Stanford University |
Keywords: Predictive control for nonlinear systems, Reduced order modeling, Robotics
Abstract: Real-world systems are often characterized by high-dimensional nonlinear dynamics, making them challenging to control in real time. While reduced-order models (ROMs) are frequently employed in model-based control schemes, dimensionality reduction introduces model uncertainty which can potentially compromise the stability and safety of the original high-dimensional system. In this work, we propose a novel reduced-order model predictive control (ROMPC) scheme to solve constrained optimal control problems for nonlinear, high-dimensional systems. To address the challenges of using ROMs in predictive control schemes, we derive an error bounding system that dynamically accounts for model reduction error. Using these bounds, we design a robust MPC scheme that ensures robust constraint satisfaction, recursive feasibility, and asymptotic stability. We demonstrate the effectiveness of our proposed method in simulations on a high-dimensional soft robot with nearly 10,000 states.
|
|
ThB22 Regular Session, Orchid Junior 4212 |
Add to My Program |
Stochastic Systems II |
|
|
Chair: Li, Tao | East China Normal University |
Co-Chair: Nishimura, Yuki | Kagoshima University |
|
13:30-13:50, Paper ThB22.1 | Add to My Program |
A Large-Scale Stochastic Gradient Descent Algorithm Over a Graphon |
|
Chen, Yan | East China Normal University |
Li, Tao | East China Normal University / New York University Shanghai |
Keywords: Stochastic systems, Large-scale systems, Mean field games
Abstract: We study the large-scale stochastic gradient descent algorithm over a graphon with a continuum of nodes, which is regarded as the limit of the distributed networked optimization as the number of nodes goes to infinity. Each node has a private local cost function. The global cost function, which all nodes cooperatively minimize, is the integral of the local cost functions on the node set. We propose a stochastic gradient descent algorithm evolving as a graphon particle system, where each node heterogeneously interacts with others through a coupled mean field term. It is proved that if the graphon is connected, then by properly choosing the algorithm gains, all nodes’states achieve consensus uniformly in mean square. Furthermore, if the local cost functions are strongly convex, then all nodes’states converge uniformly to the minimizer of the global cost function in mean square.
|
|
13:50-14:10, Paper ThB22.2 | Add to My Program |
Prescribed-Time Nonlinear Control with Multiplicative Noise |
|
Li, Wuquan | Ludong University |
Krstic, Miroslav | University of California, San Diego |
Keywords: Stochastic systems, Lyapunov methods
Abstract: We study the prescribed-time design problem for strict-feedback nonlinear systems with multiplicative measurement noise. With the assumption that the noise is small and linearly vanishing, we propose a new postulated feedback to solve the prescribed-time mean-square stabilization problem. In contrast to the existing stochastic prescribed-time designs, the merit of our design is that it can effectively deal with multiplicative measurement noise. The existence of measurement noise makes the design rather challenging since the resulting process noise intensity, in closed loop, depends on the feedback gains and even goes to infinity. Finally, a simulation example is given to illustrate the design.
|
|
14:10-14:30, Paper ThB22.3 | Add to My Program |
Safety-Probability Analysis and Control for Stochastic Systems Based on Lyapunov Candidate Functions |
|
Nishimura, Yuki | Kagoshima University |
Hoshino, Kenta | Kyoto University |
Keywords: Stochastic systems, Lyapunov methods
Abstract: In recent control theory, safety analysis and safety-critical control based on a (control) barrier function have been actively pursued. The barrier function is closely related to a Lyapunov function, which is an important property that guarantees asymptotic stability of the system, i.e., the settling to the target state, which is a fundamental control performance. Therefore, control strategies that simultaneously guarantee safety and stability are important in the recent control scene. In this paper, we propose a method for quantitative evaluation of safety probability for stochastic systems based on barrier functions generated from Lyapunov functions, and then develop control design methods to increase the safety probability. In particular, safety analysis and safety-critical control of linear stochastic systems having additive noises are performed based on linear algebra. We also discuss design methods for safety and safety-critical control for input-affine stochastic systems. The effectiveness of the proposed method is demonstrated based on a simple example.
|
|
14:30-14:50, Paper ThB22.4 | Add to My Program |
The Impact of Recommendation Systems on Opinion Dynamics: Microscopic versus Macroscopic Effects |
|
Lanzetti, Nicolas | ETH Zürich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Pagan, Nicolň | University of Zürich |
Keywords: Stochastic systems, Network analysis and control, Agents-based systems
Abstract: Recommendation systems are widely used in web services, such as social networks and e-commerce platforms, to serve personalized content to the users and, thus, enhance their experience. While personalization assists users in navigating through the available options, there have been growing concerns regarding its repercussions on the users and their opinions. Examples of negative impacts include the emergence of filter bubbles and the amplification of users' confirmation bias, which can cause opinion polarization and radicalization. In this paper, we study the impact of recommendation systems on users, both from a microscopic (i.e., at the level of individual users) and a macroscopic (i.e., at the level of a homogenous population) perspective. Specifically, we build on recent work on the interactions between opinion dynamics and recommendation systems to propose a model for this closed loop, which we then study both analytically and numerically. Among others, our analysis reveals that shifts in the opinions of individual users do not always align with shifts in the opinion distribution of the population. In particular, even in settings where the opinion distribution appears unaltered (e.g., measured via surveys across the population), the opinion of individual users might be significantly distorted by the recommendation system.
|
|
14:50-15:10, Paper ThB22.5 | Add to My Program |
Adaptive Sampling for Online Learning Spectral Properties of Networks |
|
Abdullah, Mohammed | Telecom-Sud Paris |
Hayel, Yezekael | University of Avignon |
Reiffers, Alexandre | IMT Atlantique |
Chonavel, Thierry | IMT Atlantique |
Keywords: Stochastic systems, Network analysis and control, Learning
Abstract: Recently, the area of decision and control has been interested in studying the connectivity of large-scale networks. As networks under study are large, to have a complete knowledge of the network is impossible, whereas little but representative information is available with an efficient exploration scheme. Machine learning approaches were presented and used to tackle this difficulty to hold it up. In this regard, we present and prove the convergence of an efficient algorithm that converges to the Fielder vector when the topology is initially unknown and the only accessible information is gathered by a random walk process throughout the entire network. The Rayleigh quotient optimization problem and the notion of stochastic approximation are the foundations of our technique. We consider multiple sampling strategies that are categorized under random walks, as well as adapting another sampling approach that are considered random walk, the Gibbs sampling, and it showed better results. Finally, we demonstrate its performance on different network topologies.
|
|
15:10-15:30, Paper ThB22.6 | Add to My Program |
Peak Value-At-Risk Estimation for Stochastic Differential Equations Using Occupation Measures |
|
Miller, Jared | ETH Zurich |
Tacchi, Matteo | Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-Lab |
Sznaier, Mario | Northeastern University |
Jasour, Ashkan | NASA JPL |
Keywords: Stochastic systems, Nonlinear systems, LMIs
Abstract: This paper proposes an algorithm to upper-bound maximal quantile statistics of a state function over the course of a Stochastic Differential Equation (SDE) system execution. This chance-peak problem is posed as a nonconvex program aiming to maximize the Value-at-Risk (VaR) of a state function along SDE state distributions. The VaR problem is upper-bounded by an infinite-dimensional Second-Order Cone Program in occupation measures through the use of one-sided Cantelli or Vysochanskii-Petunin inequalities. These upper bounds on the true quantile statistics may be approximated from above by a sequence of Semidefinite Programs in increasing size using the moment-Sum-of-Squares hierarchy when all data is polynomial. Effectiveness of this approach is demonstrated on example stochastic polynomial dynamical systems.
|
|
ThB23 Regular Session, Orchid Junior 4211 |
Add to My Program |
Fault Detection |
|
|
Chair: Previdi, Fabio | Universitŕ Degli Studi Di Bergamo |
Co-Chair: Qiu, Gen | University of Electronic Science and Technology of China |
|
13:30-13:50, Paper ThB23.1 | Add to My Program |
Quickest Change Point Detection with Measurements Over a Lossy Link |
|
Chaythanya KV, Krishna | Indian Institute of Science |
Chattopadhyay, Arpan | Indian Institute of Technology, Delhi |
Kumar, Anurag | Indian Institute of Science |
Sundaresan, Rajesh | Indian Institute of Science |
Keywords: Fault detection, Estimation, Sensor networks
Abstract: Motivated by Industry 4.0 applications, we consider quickest change point detection (QCD) when process measurements are transmitted by a sensor over a lossy wireless link to a decision maker (DM). The sensor node samples measurements using a Bernoulli sampling process, and places the measurement samples in a transmit queue of the transmitter. The transmitter uses a retransmit-until-success transmission strategy to deliver packets to the DM over the lossy link, which is modeled as an independent Bernoulli process and has different loss probabilities before and after the change. We pose the QCD problem in the non-Bayesian setting under Lorden's framework, and derive a CUSUM algorithm. By defining a suitable Markov process, involving the DM measurements and the queue length process, we show that the problem reduces to QCD of a Markov process. Characterizing the information measure per measurement sample at the DM, our analysis proves the asymptotic optimality of our algorithm when the false alarm rate tends to zero. We discuss extensions of the analysis to periodic sampling and no-retransmission cases. Through numerical analysis, we demonstrate trade-offs that can be used to optimize system design parameters such as the sampling rate of the measurement process in the non-asymptotic regime.
|
|
13:50-14:10, Paper ThB23.2 | Add to My Program |
Model Uncertainty-Aware Residual Generators for SISO LTI Systems Based on Kernel Identification and Randomized Approaches |
|
Mazzoleni, Mirko | University of Bergamo |
Valceschini, Nicholas | University of Bergamo |
Previdi, Fabio | Universitŕ Degli Studi Di Bergamo |
Keywords: Fault detection, Identification
Abstract: Robustness of residual signals to model uncertainties and noise in the measurements is of paramount importance in model-based fault diagnosis. Model uncertainty has been mainly represented in a structured way by considering known bounds on the model parameters, thus relying on prior knowledge about the plant structure and values of its physical parameters. When the plant is completely unknown, system identification techniques must be used for model-based diagnosis. In this work, we present a data-driven approach to represent the uncertainty in the identified model. This uncertainty is described in the frequency domain using kernel based identification and robust control tools. The estimated model uncertainty region overlaps with the true uncertainty region with a probability specified by the user. The user choices are thus reduced to the selection of only some interpretable hyperparameters. Then, a residual generator robust to the estimated model uncertainty and measurements noise is designed by a standard H∞ approach. Simulation results on SISO LTI systems show the effectiveness of the approach in producing a residual signal viable for the detection of additive faults.
|
|
14:10-14:30, Paper ThB23.3 | Add to My Program |
Fault Detection Via Occupation Kernel Principal Component Analysis |
|
Morrison, Zachary | Oklahoma State University |
Russo, Benjamin | Oak Ridge National Laboratory |
Lian, Yingzhao | EPFL |
Kamalapurkar, Rushikesh | Oklahoma State University |
Keywords: Fault detection, Statistical learning, Machine learning
Abstract: The reliable operation of automatic systems is heavily dependent on the ability to detect faults in the underlying dynamical system. While traditional model-based methods have been widely used for fault detection, data-driven approaches have garnered increasing attention due to their ease of deployment and minimal need for expert knowledge. In this paper, we present a novel principal component analysis (PCA) method that uses occupation kernels. Occupation kernels result in feature maps that are tailored to the measured data, have inherent noise-robustness due to the use of integration, and can utilize irregularly sampled system trajectories of variable lengths for PCA. The occupation kernel PCA method is used to develop a reconstruction error approach to fault detection and its efficacy is validated using numerical simulations.
|
|
14:30-14:50, Paper ThB23.4 | Add to My Program |
Statistical Times Series Based Damage Detection in the Fiber Rope Mooring Lines of the Semi-Submersible OO-STAR Wind Floater |
|
Sakaris, Christos | Norwegian Research Center NORCE AS |
Schlanbusch, Rune | Norwegian Research Centre |
Nygaard, Tor | Institute for Energy Technology |
Sakellariou, John | University of Patras |
Tutkun, Murat | Institute for Energy Technology |
Keywords: Fault detection, Modeling, Machine learning
Abstract: The Floating Offshore Wind Turbines (FOWTs) based on semi-submersible floaters constitute a popular choice in most markets due to their installation being flexible and in need of low infrastructural requirements. A simple and robust three-legged semi-submersible floater for FOWTs, the OO-STAR wind floater, has been introduced and it can be anchored to the seabed with steel chain mooring lines or hybrid mooring lines - a combination of chains and synthetic fiber ropes. The fiber rope mooring lines present a number of advantages thus leading to a lighter and less costly mooring system. These lines are important for the FOWT's integrity as their loss can lead to the change of the floater's position, a damaged power cable, a possible collision with other infrastructure and high maintenance costs. This why an early detection of damages in the mooring system is crucial. In this study, damage detection in the main part of fiber rope mooring lines of semi-submersible based FOWTs is investigated for the first time. In particular, the OO-STAR floater based FOWT is considered. Two Statistical Time Series based detection methods, the Multiple Model-AutoRegressive (MM-AR) method and the Functional Model Based Method (FMBM) are used and compared. The MM-AR is based on multiple AR models whereas the FMBM on a single Functional Model, for the description of the healthy FOWT's dynamics under varying environmental conditions. The results based on seven healthy and eight damage cases under varying wind speed and wave height show that the two methods are able to achieve damage detection in fiber rope mooring lines without any false alarm or missed damage despite of damages having small effects on the FOWT's dynamics and the fiber ropes presenting a non-linear behaviour.
|
|
14:50-15:10, Paper ThB23.5 | Add to My Program |
Incipient Fault Detection with Feature Ensemble Based on One-Class Machine Learning Methods |
|
Wang, Min | University of Electronic Science and Technology of China |
Cheng, Feiyang | University of Electronic Science and Technology of China |
Chen, Kai | University of Electronic Science and Technology of China |
Mi, Jinhua | University of Electronic Science and Technology of China |
Xu, Zhiwei | Shandong University |
Qiu, Gen | University of Electronic Science and Technology of China |
Keywords: Fault detection, Machine learning, Neural networks
Abstract: Considering production quality and process safety, incipient fault detection has drawn more and more attention. With the rapid development of machine learning, numerous researches for fault detection based on machine learning have been published. However, almost all machine learning methods used for fault detection need abnormal data to construct models. Unfortunately, it is difficult to obtain sufficient fault samples in practical industrial processes. In addition, the existing fault detection methods are based on single feature extraction strategy. Process monitoring methods with different working principles often extract and utilize different process information. Reasonable integration of features extracted by multiple methods can usually effectively improve the performance of incipient fault detection. Therefore, this paper proposes an one-class machine learning feature ensemble model (OCMLFEM) for incipient fault detection. In OCMLFEM, various one-class machine learning models are constructed as basic detectors. In order to effectively mine the features obtained by basic detectors, a feature ensemble strategy with the technologies of sliding window singular value and principal component analysis is adopted. Then, Tennessee Eastman process is utilized to verify the validity of the proposed detection model, which proves that OCMLFEM has significant superiority.
|
|
15:10-15:30, Paper ThB23.6 | Add to My Program |
Anomaly Search Over Many Sequences with Switching Costs |
|
Ubl, Matthew | University of Florida |
Robinson, Benjamin | AFRL |
Hale, Matthew | University of Florida |
Keywords: Fault detection, Statistical learning
Abstract: This paper considers the quickest search problem to identify anomalies among large numbers of data streams. These streams can model, for example, disjoint regions monitored by a mobile robot. A particular challenge is a version of the problem in which the experimenter must suffer a cost each time the data stream being sampled changes, such as the time the robot must spend moving between regions. In this paper, we propose an algorithm which accounts for switching costs by varying a confidence threshold that governs when the algorithm switches to a new data stream. Our main contributions are easily computable approximations for both the optimal value of this threshold and the optimal value of the parameter that determines when a stream is flagged as an anomaly, using the Brownian motion approximations. Further, we empirically show (i) a uniform improvement for switching costs of interest and (ii) roughly equivalent performance for small switching costs when comparing to the closest available algorithm.
|
|
ThB24 Regular Session, Orchid Main 4201AB |
Add to My Program |
Switched Systems I |
|
|
Chair: Incremona, Gian Paolo | Politecnico Di Milano |
Co-Chair: Zhang, Wentao | Nanyang Technological University |
|
13:30-13:50, Paper ThB24.1 | Add to My Program |
Design of a Distributed Switching Model Predictive Control for Quadrotor UAVs Aggregation |
|
Yuca Huanca, Chrystian Pool Edmundo | Politecnico Di Milano |
Incremona, Gian Paolo | Politecnico Di Milano |
Colaneri, Patrizio | Politecnico Di Milano |
Keywords: Switched systems, Distributed control, Optimization algorithms
Abstract: This letter proposes a novel distributed model predictive control (MPC) strategy to address the swarm aggregation of a team of quadrotor unmanned aerial vehicles (UAVs). First, a switched formulation of the quadrotor model is derived by mapping the UAVs dynamics into a set of finite motion modes. Then, relying on a suitably selected control Lyapunov function (CLF), the inter-agent collisions and the aggregation task are taken into account to design a switching MPC (SMPC) strategy. A clustering method is also introduced to define the communication network among the agents, which is essential to sequentially solve the optimal control problem. Finally, the efficacy of the proposal, also in comparison with other methodologies, is satisfactorily shown in simulation.
|
|
13:50-14:10, Paper ThB24.2 | Add to My Program |
Identification of Piecewise Affine Systems with Online Deterministic Annealing |
|
Mavridis, Christos | University of Maryland, College Park |
Baras, John S. | University of Maryland |
Keywords: Switched systems, Identification, Intelligent systems
Abstract: We propose a new online identification scheme for discrete-time piece-wise affine models based on a system of adaptive algorithms. A stochastic approximation algorithm based on online deterministic annealing runs at a slow timescale, estimating the partition of the space that defines the modes of the system. At the same time, a recursive identification algorithm, running at a higher timescale, updates the parameters of local identification models based on the estimate of the modes. Convergence results under mild assumptions are given based on the theory of two timescale stochastic approximation. In contrast to standard identification algorithms for piece-wise affine systems, the proposed approach is appropriate for online system identification using sequential data acquisition, and is computationally more efficient compared to standard algebraic, mixed-integer programming, and clustering-based methods. The progressive nature of the algorithm provides real-time control over the performance-complexity trade-off, desired in practical applications. Experimental results validate the efficacy of the proposed methodology.
|
|
14:10-14:30, Paper ThB24.3 | Add to My Program |
Sufficient Stability Conditions for a Class of Switched Systems with Multiple Steady States |
|
Piccini, Jacopo | Reykjavik University |
August, Elias | Reykjavik University |
Hafstein, Sigurdur | University of Iceland |
Andersen, Stefania | University of Iceland |
Keywords: Switched systems, Lyapunov methods
Abstract: In this paper, we present a novel approach to determine the stability of switched linear and nonlinear systems using Sum of Squares optimisation. Particularly, we use Sum of Squares optimisation to search for a Lyapunov function that defines an absorbing set that confines solution trajectories. For linear systems, we show that this also implies global asymptotic stability. Using this approach, we can study stability for a broader range of switched systems, particularly, we can search for a global attractor for switched nonlinear systems, whose dynamics are given by polynomial vector fields and which have multiple equilibria or limit cycles.
|
|
14:30-14:50, Paper ThB24.4 | Add to My Program |
False Data Injection Attack for Switched Systems |
|
Zhao, Rui | Tianjin University |
Zuo, Zhiqiang | Tianjin University |
Wang, Yijing | Tianjin University |
Zhang, Wentao | Nanyang Technological University |
Keywords: Switched systems, Networked control systems, Estimation
Abstract: This paper studies the secure state estimation problem for switched systems. The single/joint false data injection attacks are designed with the aim at altering the sensor signal and/or switching signal. Firstly, it is shown that the attack will steer system state to infinity but could be detectable by kappa^2 detector when only the switching signal is attacked. In addition, the attack acting on sensor signal is designed, which can be recognized by the summation (SUM) detector but fails by kappa^2 detector. Then a joint attack strategy is devised and a sufficient condition is given to guarantee that the joint attack is strictly stealthy. The joint attack performs well since it can launch a strictly stealthy attack compared with the sensor signal attack. Finally, a numerical example is given to verify the theoretical results.
|
|
14:50-15:10, Paper ThB24.5 | Add to My Program |
Approximate Model Predictive Control of Switched Affine Systems Using Multitask Learning with Safety and Stability Guarantees |
|
Ghawash, Faiq | Norwegian University of Science and Technology (NTNU) |
Hovd, Morten | Norwegian Univ of Sci & Tech |
Schofield, Brad | CERN |
Keywords: Switched systems, Neural networks, Optimal control
Abstract: We study the problem of designing an approximate model predictive control (MPC) for discrete time switched affine systems. The MPC design for the switched affine system requires an online solution of a mixed integer program. However, the combinatorial nature of the mixed integer problems might require a large computational time limiting its applicability in real time scenarios. To this end, we propose a framework based on the multitask learning paradigm to approximate the solution of mixed integer MPC for switched affine systems. We also provide a computational method to overapproximate the reachable sets of the closed-loop system that helps to analyze the safety and stability of the system under the influence of the learned controller. Once trained offline, the resulting controller results in a solver free approach especially suited for implementation on resource constrained embedded hardware. We demonstrate the efficacy of the approach on a real world example of an induced draft cooling tower.
|
|
15:10-15:30, Paper ThB24.6 | Add to My Program |
On Some Geometric Behavior of Value Iteration on the Orthant: Switching System Perspective |
|
Lee, Donghwan | KAIST |
Keywords: Switched systems, Stability of linear systems, Optimal control
Abstract: In this paper, the primary goal is to offer additional insights into the value iteration through the lens of switching system models in the control community. These models establish a connection between value iteration and switching system theory and reveal additional geometric behaviors of value iteration in solving discounted Markov decision problems. Specifically, the main contributions of this paper are twofold: 1) We provide a switching system model of value iteration and, based on it, offer a different proof for the contraction property of the value iteration. 2) Furthermore, from the additional insights, new geometric behaviors of value iteration are proven when the initial iterate lies in a special region. We anticipate that the proposed perspectives might have the potential to be a useful tool, applicable in various settings. Therefore, further development of these methods could be a valuable avenue for future research.
|
|
ThB25 Regular Session, Lotus Junior 4DE |
Add to My Program |
Information Theory and Control |
|
|
Chair: Papadopoulos, Alessandro Vittorio | Mälardalen University |
Co-Chair: Petersen, Ian R. | Australian National University |
|
13:30-13:50, Paper ThB25.1 | Add to My Program |
Multi-Criteria Optimization of Application Offloading in the Edge-To-Cloud Continuum |
|
Miloradovic, Branko | Malardalen University |
Papadopoulos, Alessandro Vittorio | Mälardalen University |
Keywords: Information technology systems, Optimization
Abstract: Applications are becoming increasingly data-intensive, requiring a significant amount of computational resources for meeting their demand. Cloud-based services are not sufficient to meet such demand, leading to a shift of the computation towards the devices closer to the edge of the network, leading to the emergence of an Edge-to-Cloud compute Continuum (E2C). Application can offload part of their computation towards the E2C. The allocation of applications to a set of available computing nodes is a challenging problem, as the allocation needs to take into account several factors, including the application requirements and demands as well as the optimization of the resource utilization in the E2C infrastructure and the minimization the CO2 footprint of the executed applications. Control and optimization techniques provide a vast array of tools for the optimized management of the Edge-to-Cloud continuum. In this paper, we provide a mathematical formulation for the application offloading with specific requirements in the cloud computing domain. The problem is modeled both as integer linear programming and constraint programming model, and implemented in commercially available software. Finally, we provide the results of performed comparison between the two models.
|
|
13:50-14:10, Paper ThB25.2 | Add to My Program |
Sensor-Based Planning and Control for Robotic Systems: Introducing Clarity and Perceivability |
|
Agrawal, Devansh Ramgopal | University of Michigan |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Information theory and control, Lyapunov methods, Constrained control
Abstract: In this paper, we first introduce an information measure, termed clarity, motivated by information entropy, and show that it has intuitive properties relevant to dynamic coverage control and informative path planning. Clarity defines on a scale of [0, 1] the quality of the information that we have about a variable of interest in an environment. Clarity lower bounds the expected estimation error of any estimator, and is used as the information metric in the notion of perceivability, which is defined later on and is the primary contribution of the paper. Perceivability captures whether a given robotic (or more generally, sensing and control) system has sufficient sensing and actuation capabilities to gather desired information about an environment. We show that perceivability relates to the reachability of an augmented system, which encompasses the robot dynamics and the clarity about the environment, and we derive the corresponding Hamilton-Jacobi-Bellman equations. Thus, we provide an algorithm to measure an environment's perceivability, and obtain optimal controllers that maximize information gain. In simulations, we demonstrate how clarity is a useful concept for planning trajectories, how perceivability can be determined using reachability analysis, and how a Control Barrier Function controller can be used to design controllers to maintain a desired level of information.
|
|
14:10-14:30, Paper ThB25.3 | Add to My Program |
Steering a Linear System at the Minimum Information Rate: Quantifying Redundancy Via Covariance Assignment Theory |
|
Wendel, Eric | Boston University, Draper |
Baillieul, John | Boston Univ |
Hollmann, Joseph | The Charles Stark Draper Laboratory, Inc |
Keywords: Information theory and control, Sampled-data control, Linear systems
Abstract: We compute fundamental performance limitations in the data rate constrained control of continuous-time linear stochastic control systems using information theoretic tools and principles. Specifically, we find the minimum achievable mean square error for steering a linear system to sequences of uncertain steering objectives under a rate constraint as the solution of a convex optimization problem. We propose the redundancy of a control system as a measure of the relative inefficiency of information transmission through a linear control system vs. an ideal communications channel.
|
|
14:30-14:50, Paper ThB25.4 | Add to My Program |
On Iterative Parameter Identification of FIR Systems with Batched Possibly Incorrect Binary-Valued Observations |
|
Guo, Jian | Academy of Mathematics and Systems Science, Chinese Academy of S |
Xue, Wenchao | Academy of Mathematics and Systems Science, Chinese Academy of S |
Wang, Ting | Chinese Academy of Sciences |
Zhang, Ji-Feng | Chinese Academy of Sciences |
Zhang, Yanjun | Beijing Institute of Technology |
Keywords: Quantized systems, Identification, Linear systems
Abstract: This paper considers the problem of parameter identification for a binary output finite impulse response (FIR) system with measurement error, where the measurement error makes the binary measurement values take opposite values with a certain probability. First, the maximum likelihood estimation (MLE) of the parameters is given, and an iterative algorithm with projection based on the Expectation-Maximization algorithm is presented to calculate the MLE. Furthermore, the necessary and sufficient condition for the likelihood function to have a unique maximum point is obtained. It is proved that the iterative estimation error converges to zero at an exponential rate under persistently excitation input conditions. Finally, some numerical simulation results based on a typical system show the effectiveness of the proposed algorithm.
|
|
14:50-15:10, Paper ThB25.5 | Add to My Program |
Reachable Set-Based Dynamic Quantization for the Remote State Estimation of Linear Systems |
|
Li, Yaodong | Eindhoven University of Technology |
Chong, Michelle | Eindhoven University of Technology |
Keywords: Quantized systems, Observers for Linear systems
Abstract: We employ reachability analysis in designing dynamic quantization schemes for the remote state estimation of linear systems over a finite date rate communication channel. The quantization region is dynamically updated at each transmission instant, with an approximated reachable set of the linear system. We propose a set-based method using zonotopes and compare it to a norm-based method in dynamically updating the quantization region. For both methods, we guarantee that the quantization error is bounded and consequently, the remote state reconstruction error is also bounded. To the best of our knowledge, the set-based method using zonotopes has no precedent in the literature and admits a larger class of linear systems and communication channels, where the set-based method allows for a longer inter-transmission time and lower bit rate. Finally, we corroborate our theoretical guarantees with a numerical example.
|
|
15:10-15:30, Paper ThB25.6 | Add to My Program |
A Coherent LQG Approach to Quantum Equalization |
|
Thien, Rebbecca Tze Yean | Australian National University |
Vuglar, Shanon Leigh | Princeton University |
Petersen, Ian R. | Australian National University |
Keywords: Quantum information and control, Control applications
Abstract: We propose a method to design a suboptimal, coherent quantum LQG controller to solve a quantum equalization problem. Our method involves reformulating the problem as a control problem and then designing a classical LQG controller and implementing it as a quantum system. Illustrative examples are included which demonstrate the algorithm for both active and passive systems, i.e., systems where the dynamics are described in terms of both position and momentum operators and systems with dynamics in terms of annihilation operators only.
|
|
ThB26 Regular Session, Orchid Main 4301AB |
Add to My Program |
Model Reduction |
|
|
Chair: Moreschini, Alessio | Imperial College London |
Co-Chair: Kamalapurkar, Rushikesh | Oklahoma State University |
|
13:30-13:50, Paper ThB26.1 | Add to My Program |
Closed-Loop Model Reduction by Moment Matching for Linear Systems |
|
Bhattacharjee, Debraj | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Reduced order modeling, Model/Controller reduction
Abstract: We study the model reduction by moment matching problem for linear systems in a closed-loop configuration. First we show that the moments of a linear system can be expressed in a form that is independent of the structure of the signal generator. Then we define a class of reduced-order models that can replicate the steady-state response of the original system from input-output data. Finally, we demonstrate the applicability of the results using two simple numerical examples.
|
|
13:50-14:10, Paper ThB26.2 | Add to My Program |
Learning Latent Representations in High-Dimensional State Spaces Using Polynomial Manifold Constructions |
|
Geelen, Rudy | University of Texas at Austin |
Balzano, Laura | University of Michigan |
Willcox, Karen Elizabeth | Massachusetts Institute of Technology |
Keywords: Reduced order modeling, Large-scale systems, Optimization
Abstract: We present a novel framework for learning cost-efficient latent representations in problems with high-dimensional state spaces through nonlinear dimension reduction. By enriching linear state approximations with low-order polynomial terms we account for key nonlinear interactions existing in the data thereby reducing the problem's intrinsic dimensionality. Two methods are introduced for learning the representation of such low-dimensional, polynomial manifolds for embedding the data. The manifold parametrization coefficients can be obtained by regression via either a proper orthogonal decomposition or an alternating minimization based approach. Our numerical results focus on the one-dimensional Korteweg-de Vries equation where accounting for nonlinear correlations in the data was found to lower the representation error by up to two orders of magnitude compared to linear dimension reduction techniques.
|
|
14:10-14:30, Paper ThB26.3 | Add to My Program |
Model Reduction by Matching Zero-Order Moments for 2-D Discrete Systems |
|
Mao, Junyu | Imperial College London |
Scarciotti, Giordano | Imperial College London |
Keywords: Reduced order modeling, Distributed parameter systems, Model/Controller reduction
Abstract: In this paper, the problem of model reduction for two-dimensional (2-D) systems in the Fornasini-Marchesini local state-space form is addressed by matching zero-order moments. Two characterizations of zero-order moments are proposed: the first based on the notion of interpolation of complex points and the second based on the concept of steady state. A parameterized family of reduced-order models that achieves moment matching while preserving the 2-D structure of the original system is presented. The developed theory is illustrated by means of a 2-D low-pass filter reduction problem.
|
|
14:30-14:50, Paper ThB26.4 | Add to My Program |
Convergent Dynamic Mode Decomposition |
|
Rosenfeld, Joel A. | University of South Florida |
Kamalapurkar, Rushikesh | Oklahoma State University |
Keywords: Nonlinear systems identification, Reduced order modeling, Data driven control
Abstract: This manuscript addresses convergence of dynamic mode decomposition (DMD) algorithms and the existence of associated Koopman modes. Convergence relies on reformulation of dynamic mode decomposition in terms of newly defined compact operators defined with pairs of Hilbert spaces selected separately as the domain and range of the operator. With the Hilbert spaces selected so that the domain is embedded in the range, an eigenfunction approach to DMD is developed by leveraging a finite rank representation. The finite rank representation is proven to converge, in norm, to the original operator with increasing rank. The manuscript concludes with the description of a DMD algorithm that converges when a dense collection of occupation kernels, arising from the data, are leveraged in the analysis.
|
|
14:50-15:10, Paper ThB26.5 | Add to My Program |
Moment Matching for Nonlinear Systems of Second-Order Equations |
|
Simard, Joel David | Imperial College London |
Moreschini, Alessio | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Reduced order modeling, Nonlinear systems, Differential-algebraic systems
Abstract: In this paper we consider the problem of constructing nonlinear systems of second-order equations that achieve moment matching. In particular, necessary and sufficient conditions are given for which a system of second-order equations achieves moment matching, and a family of systems of second-order equations achieving moment matching is directly constructed by extracting it, via particular choices of the free mappings, from a parameterization of all systems achieving moment matching. The results are specialized for the scenario in which the signal generator is a linear system. Finally, the results of the paper are demonstrated by constructing reduced order models of a two link robotic manipulator in the second-order equation form.
|
|
15:10-15:30, Paper ThB26.6 | Add to My Program |
Globally Optimal SISO H2-Norm Model Reduction Using Walsh's Theorem |
|
Lagauw, Sibren | KU Leuven |
Agudelo, Oscar Mauricio | Katholieke Universiteit Leuven |
De Moor, Bart L.R. | Katholieke Universiteit Leuven |
Keywords: Model/Controller reduction, Linear systems, Optimization
Abstract: We present a novel methodology for single-input single-output (SISO) H2-norm model reduction that guarantees global optimality of the obtained solution(s). By exploiting Walsh's theorem, which is an elegant formulation of the first-order necessary conditions for optimality, we reformulate the model reduction problem as a multiparameter eigenvalue problem (MEVP), the real-valued eigentuples of which characterize the globally optimal solution(s) of the model reduction problem. While aiming for global optimality comes at the cost of a combinatorial growth of the problem complexity for increasing model orders, the novel methodology allows us to tackle larger problems compared to the few other globally optimal approaches in the literature. In particular, the degree of the obtained MEVP is independent of the order of the original higher order and obtained reduced-order model, a property that is favorable from a computational point of view. We perform three numerical experiments to illustrate the effectiveness of the methodology.
|
|
ThC01 Invited Session, Orchid Main 4202-4306 |
Add to My Program |
Learning, Optimization, and Game Theory II |
|
|
Chair: Sayin, Muhammed Omer | Bilkent University |
Co-Chair: Sundaram, Shreyas | Purdue University |
Organizer: Doan, Thinh T. | Virginia Tech |
Organizer: Sayin, Muhammed Omer | Bilkent University |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Zhang, Kaiqing | University of Maryland |
|
16:00-16:20, Paper ThC01.1 | Add to My Program |
Analysis of Contagion Dynamics with Active Cyber Defenders (I) |
|
Paarporn, Keith | University of Colorado, Colorado Springs |
Brown, Philip N. | University of Colorado Colorado Springs |
Xu, Shouhuai | UTSA |
Keywords: Computer/Network Security, Cyber-Physical Security, Stability of nonlinear systems
Abstract: In this paper, we analyze the infection spreading dynamics of malware in a population of cyber nodes (i.e., computers or devices). Unlike most prior studies where nodes are reactive to infections, in our setting some nodes are active defenders meaning that they are able to clean up malware infections of their neighboring nodes, much like how spreading malware exploits the network connectivity properties in order to propagate. We formulate these dynamics as an Active Susceptible-Infected-Susceptible (A-SIS) compartmental model of contagion. We completely characterize the system's asymptotic behavior by establishing conditions for the global asymptotic stability of the infection-free equilibrium and for an endemic equilibrium state. We show that the presence of active defenders counter-acts infectious spreading, effectively increasing the epidemic threshold on parameters for which an endemic state prevails. Leveraging this characterization, we investigate a general class of problems for finding optimal investments in active cyber defense capabilities given limited resources. We show that this class of problems has unique solutions under mild assumptions. We then analyze an Active Susceptible-Infected-Recovered (A-SIR) compartmental model, where the peak infection level of any trajectory is explicitly derived.
|
|
16:20-16:40, Paper ThC01.2 | Add to My Program |
Online Learning for Equilibrium Pricing in Markets under Incomplete Information (I) |
|
Jalota, Devansh | Stanford University |
Sun, Haoyuan | Massachusetts Institute of Technology |
Azizan, Navid | MIT |
Keywords: Learning, Game theory, Smart grid
Abstract: The study of market equilibria is central to economic theory, particularly in efficiently allocating scarce resources. However, the computation of equilibrium prices at which the supply of goods matches their demand typically relies on having access to complete information on private attributes of agents, e.g., suppliers' cost functions, which are often unavailable in practice. Motivated by this practical consideration, we consider the problem of setting equilibrium prices in the incomplete information setting wherein a market operator seeks to satisfy the customer demand for a commodity by purchasing the required amount from competing suppliers with privately known cost functions unknown to the market operator. In this incomplete information setting, we consider the online learning problem of learning equilibrium prices over time while jointly optimizing three performance metrics -- unmet demand, cost regret, and payment regret -- pertinent in the context of equilibrium pricing over a horizon of T periods. In the general setting when suppliers' cost functions are time-varying, we show that no online algorithm can achieve sublinear regret on all three metrics. Thus, we consider the setting when suppliers' cost functions are fixed and develop algorithms that achieve a regret of (i) O(log log T) when the customer demand is constant over time and (ii) O(sqrt{T} log log T) when the demand is variable over time.
|
|
16:40-17:00, Paper ThC01.3 | Add to My Program |
Robust Online Covariance and Sparse Precision Estimation under Arbitrary Data Corruption (I) |
|
Yao, Tong | Purdue University |
Sundaram, Shreyas | Purdue University |
Keywords: Statistical learning, Machine learning
Abstract: Gaussian graphical models are widely used to represent correlations among entities, but they remain vulnerable to data corruption. In this work, we introduce a modified trimmed-inner-product algorithm to robustly estimate the covariance in an online scenario even in the presence of arbitrary and adversarial data attacks. At each time step, data points, drawn nominally independently and identically from a multivariate Gaussian distribution, arrive. However, a certain fraction of these points may have been arbitrarily corrupted. We propose an online algorithm to estimate the sparse inverse covariance (i.e., precision) matrix despite this corruption. We provide the error-bound and the convergence properties of the estimates to the true precision matrix under our algorithms.
|
|
17:00-17:20, Paper ThC01.4 | Add to My Program |
Asynchronous Decentralized Q-Learning in Stochastic Games (I) |
|
Yongacoglu, Bora | Queen's University |
Arslan, Gurdal | University of Hawaii at Manoa |
Yuksel, Serdar | Queen's University |
Keywords: Game theory, Learning, Decentralized control
Abstract: Non-stationarity is a fundamental challenge in multi-agent reinforcement learning (MARL), where agents update their behaviour as they learn. In multi-agent settings, individual agents may have an incomplete view of the actions of others, which can complicate the learning process. Many theoretical advances in MARL avoid the challenge of non-stationarity by coordinating the policy updates of agents in various ways, including synchronizing times at which agents are allowed to revise their policies. In this paper, we study an asynchronous variant of the decentralized Q-learning algorithm, a recent MARL algorithm for stochastic games. We provide sufficient conditions under which the asynchronous algorithm drives play to equilibrium with high probability. In this generalization, players need not agree on the schedule of policy update times, and may change their policies at their own separately selected times. This work extends the applicability of the decentralized Q-learning algorithm to settings in which parameters are selected in an independent manner, and tames non-stationarity without imposing the coordination assumptions of prior work.
|
|
17:20-17:40, Paper ThC01.5 | Add to My Program |
PrimeTime: A Finite-Time Consensus Protocol for Open Networks (I) |
|
Abrahamson, Henry Waichi | Northwestern University |
Wei, Ermin | Northwestern Univeristy |
Keywords: Agents-based systems, Autonomous vehicles, Sensor networks
Abstract: In distributed problems where consensus between agents is required but average consensus is not desired, it can be necessary for each agent to know not only the data of each other agent in the network, but also the origin of each piece of data before consensus can be reached. However, transmitting large tables of data with IDs can cause the size of an agent's message to increase dramatically, while truncating down to fewer pieces of data to keep the message size small can lead to problems with the speed of achieving consensus. Also, many existing consensus protocols are not robust against agents leaving and entering the network. We introduce PrimeTime, a novel communication protocol that exploits the properties of prime numbers to quickly and efficiently share small integer data across an open network. For sufficiently small networks or small integer data, we show that messages formed by PrimeTime require fewer bits than messages formed by simply tabularizing the data and IDs to be transmitted.
|
|
17:40-18:00, Paper ThC01.6 | Add to My Program |
Distributed Learning Dynamics for Coalitional Games (I) |
|
Hamed, Aya | University of Illinois Urbana-Champaign |
Shamma, Jeff S. | University of Illinois at Urbana-Champaign |
Keywords: Game theory, Learning, Agents-based systems
Abstract: In the framework of transferable utility coalitional games, a scoring (characteristic) function determines the value of any subset/coalition of agents. Agents decide on both which coalitions to form and the allocations of the values of the formed coalitions among their members. An important concept in coalitional games is that of a core solution, which is a partitioning of agents into coalitions and an associated allocation to each agent under which no group of agents can get a higher allocation by forming an alternative coalition. We present distributed learning dynamics for coalitional games that converge to a core solution whenever one exists. In these dynamics, an agent maintains a state consisting of (i) an aspiration level for its allocation and (ii) the coalition, if any, to which it belongs. In each stage, a randomly activated agent proposes to form a new coalition and changes its aspiration based on the success or failure of its proposal. The coalition membership structure is changed, accordingly, whenever the proposal succeeds. Required communications are that: (i) agents in the proposed new coalition need to reveal their current aspirations to the proposing agent, and (ii) agents are informed if they are joining the proposed coalition or if their existing coalition is broken. The proposing agent computes the feasibility of forming the coalition. We show that the dynamics hit an absorbing state whenever a core solution is reached. We further illustrate the distributed learning dynamics on a multi-agent task allocation setting.
|
|
ThC02 Invited Session, Melati Main 4001AB-4104 |
Add to My Program |
Online Learning for Optimization and Control |
|
|
Chair: Balta, Efe C. | ETH Zurich |
Co-Chair: Didier, Alexandre | ETH Zurich |
Organizer: Balta, Efe C. | ETH Zurich |
Organizer: Didier, Alexandre | ETH Zurich |
Organizer: Karapetyan, Aren | ETH Zürich |
Organizer: Iannelli, Andrea | University of Stuttgart |
Organizer: Martin, Andrea | École Polytechnique Fédérale De Lausanne |
Organizer: Tsiamis, Anastasios | ETH Zurich |
|
16:00-16:20, Paper ThC02.1 | Add to My Program |
Online Distributed Learning with Quantized Finite-Time Coordination (I) |
|
Bastianello, Nicola | KTH Royal Institute of Technology |
Rikos, Apostolos I. | KTH Royal Institute of Technology |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Optimization algorithms, Agents-based systems, Learning
Abstract: In this paper we consider online distributed learning problems. Online distributed learning refers to the process of training learning models on distributed data sources. In our setting a set of agents need to cooperatively train a learning model from streaming data. Differently from federated learning, the proposed approach does not rely on a central server but only on peer-to-peer communications among the agents. This approach is often used in scenarios where data cannot be moved to a centralized location due to privacy, security, or cost reasons. In order to overcome the absence of a central server, we propose a distributed algorithm that relies on a quantized, finite-time coordination protocol to aggregate the locally trained models. Furthermore, our algorithm allows for the use of stochastic gradients during local training. Stochastic gradients are computed using a randomly sampled subset of the local training data, which makes the proposed algorithm more efficient and scalable than traditional gradient descent. In our paper, we analyze the performance of the proposed algorithm in terms of the mean distance from the online solution. Finally, we present numerical results for a logistic regression task.
|
|
16:20-16:40, Paper ThC02.2 | Add to My Program |
Safe Non-Stochastic Control of Linear Dynamical Systems (I) |
|
Zhou, Hongyu | University of Michigan |
Tzoumas, Vasileios | University of Michigan, Ann Arbor |
Keywords: Optimization, Time-varying systems
Abstract: We study the problem of safe control of linear dynamical systems corrupted with non-stochastic noise, and provide the first algorithm that guarantees (i) zero constraint violation of convex time-varying constraints, and (ii) bounded dynamic regret, i.e., bounded suboptimality against an optimal clairvoyant controller that knows the future noise a priori. The constraints bound the values of the state and of the control input such as to ensure collision avoidance and bounded control effort. We are motivated by the future of autonomy where robots will safely perform complex tasks despite real-world unpredictable disturbances such as wind and wake disturbances. To develop the algorithm, we capture our problem as a sequential game between a linear feedback controller and an adversary, assuming a known upper bound on the noise's magnitude. Particularly, at each step t=1,ldots, T, first the controller chooses a linear feedback control gain K_t in mathcal{K}_t, where mathcal{K}_t is constructed such that it guarantees that the safety constraints will be satisfied; then, the adversary reveals the current noise w_t and the controller suffers a loss f_t(K_t) ---eg f_t represents the system's tracking error at t upon the realization of the noise. The controller aims to minimize its cumulative loss, despite knowing w_t only after K_t has been chosen, and despite mathcal{K}_{t-1} being possibly disjoint from mathcal{K}_t. We validate our algorithm in simulated scenarios of safe control of linear dynamical systems in the presence of bounded noise.
|
|
16:40-17:00, Paper ThC02.3 | Add to My Program |
Change Point Detection Approach for Online Control of Unknown Time Varying Dynamical Systems (I) |
|
Muthirayan, Deepan | University of California at Irvine |
Du, Ruijie | University of California Irvine |
Shen, Yanning | UCI |
Khargonekar, Pramod | Univ. of California, Irvine |
Keywords: Learning, Optimization, Adaptive control
Abstract: We propose a novel change point detection approach for online learning control with full information feedback (state, disturbance, and cost feedback) for unknown time-varying dynamical systems. We show that our algorithm can achieve a sub-linear regret with respect to the class of Disturbance Action Control (DAC) policies, which are a widely studied class of policies for online control of dynamical systems, for any sub-linear number of changes and very general class of systems: (i) matched disturbance system with general convex cost functions, (ii) general system with linear cost functions. Specifically, a (dynamic) regret of Sigma^{1/5}_TT^{4/5} can be achieved for these class of systems, where Sigma_T is the number of changes of the underlying system and T is the duration of the control episode. That is, the change point detection approach achieves a sub-linear regret for any sub-linear number of changes, which other previous algorithms such as in Minasyan et al. (2021) cannot. Numerically, we demonstrate that the change point detection approach is superior to Minasyan et al. (2021) and to standard online learning approaches for time-invariant dynamical systems. Our work presents the first regret guarantee for unknown time-varying dynamical systems in terms of a stronger notion of variability like the number of changes in the underlying system. The extension of the present work to state and output feedback controllers is a subject of future work.
|
|
17:00-17:20, Paper ThC02.4 | Add to My Program |
On-Policy Data-Driven Linear Quadratic Regulator Via Combined Policy Iteration and Recursive Least Squares (I) |
|
Sforni, Lorenzo | Alma Mater Studiorum - Universitŕ Di Bologna |
Carnevale, Guido | University of Bologna |
Notarnicola, Ivano | University of Bologna |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Data driven control, Optimal control, Optimization algorithms
Abstract: In this paper, we address infinite-horizon Linear Quadratic Regulator (LQR) problems for unknown discrete-time systems. As an additional challenge, we address an on-policy setup in which system matrices are identified while controlling the real system with a progressively optimized policy. Specifically, we consider a time-varying control policy that, while applied to the real unknown system, is iteratively refined (based on the most updated estimate of the system matrices) towards the optimal LQR solution. The overall learning procedure combines a recursive least squares method with a direct policy search based on the gradient method. By resorting to Lyapunov-based analysis tools in combination with averaging theory for nonlinear systems, exponential stability for the closed-loop scheme can be proven. Finally, a numerical example showing the effectiveness of the considered strategy corroborates the theoretical findings.
|
|
17:20-17:40, Paper ThC02.5 | Add to My Program |
On the Finite-Time Behavior of Suboptimal Linear Model Predictive Control (I) |
|
Karapetyan, Aren | ETH Zürich |
Balta, Efe C. | ETH Zurich |
Iannelli, Andrea | University of Stuttgart |
Lygeros, John | ETH Zurich |
Keywords: Predictive control for linear systems, Optimal control, Optimization algorithms
Abstract: Inexact methods for model predictive control (MPC), such as real-time iterative schemes or time-distributed optimization, alleviate the computational burden of exact MPC by providing suboptimal solutions. While the asymptotic stability of such algorithms is well studied, their finite-time performance has not received much attention. In this work, we quantify the performance of suboptimal linear model predictive control in terms of the additional cost incurred due to performing only a finite number of optimization iterations. Leveraging this novel analysis framework, we propose a novel suboptimal MPC algorithm with a diminishing horizon length and guaranteed closed-loop stability and finite-time performance. This analysis allows the designer to plan a limited computational power budget distribution to achieve a desired performance level. We provide numerical examples to illustrate the algorithm's transient behavior and computational complexity.
|
|
17:40-18:00, Paper ThC02.6 | Add to My Program |
Safe and Stable Adaptive Control for a Class of Dynamic Systems (I) |
|
Autenrieb, Johannes | German Aerospace Center (DLR) |
Annaswamy, Anuradha M. | Massachusetts Inst. of Tech |
Keywords: Adaptive control, Uncertain systems, Flight control
Abstract: Adaptive control has focused on online control of dynamic systems in the presence of parametric uncertainties, with solutions guaranteeing stability and control performance. Safety, a related property to stability, is becoming increasingly important as the footprint of autonomous systems grows in society. One of the popular ways for ensuring safety is through the notion of a control barrier function (CBF). In this paper, we combine adaptation and CBFs to develop a real-time controller that guarantees stability and remains safe in the presence of parametric uncertainties. The class of dynamic systems that we focus on is linear time-invariant systems whose states are accessible and where the inputs are subject to a magnitude limit. Conditions of stability, state convergence to a desired value, and parameter learning are all elucidated. One of the elements of the proposed adaptive controller that ensures stability and safety is the use of a CBF-based safety filter that suitably generates safe reference commands, employs error-based relaxation (EBR) of Nagumo’s theorem, and leads to guarantees of set invariance. To demonstrate the effectiveness of our approach, we present two numerical examples, an obstacle avoidance case and a missile flight control case.
|
|