| |
Last updated on September 22, 2023. This conference program is tentative and subject to change
Technical Program for Wednesday December 13, 2023
|
WeA01 Tutorial Session, Orchid Main 4202-4306 |
Add to My Program |
Nonstandard Linear-Quadratic Decision Making |
|
|
Chair: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Co-Chair: Zhang, Huanshui | Shandong University of Science and Technology |
Organizer: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Organizer: Zhang, Huanshui | Shandong University of Science and Technology |
|
10:00-10:20, Paper WeA01.1 | Add to My Program |
Variations Around the Standard LQG Model (I) |
|
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
|
10:20-11:00, Paper WeA01.2 | Add to My Program |
Advances in Linear-Quadratic Stochastic Differential Games (I) |
|
Moon, Jun | Hanyang University |
|
11:00-11:20, Paper WeA01.3 | Add to My Program |
Progress on Nonstandard LQ Control and Applications in Networked Control Systems (I) |
|
Zhang, Huanshui | Shandong University of Science and Technology |
|
11:20-12:00, Paper WeA01.4 | Add to My Program |
Stochastic Linear-Quadratic Optimal Control Problems Some Recent Results (I) |
|
Yong, Jiongmin | University of Central Florida |
|
WeA02 Invited Session, Melati Main 4001AB-4102 |
Add to My Program |
Learning-Based Control I: Policy Learning and Optimization |
|
|
Chair: Müller, Matthias A. | Leibniz University Hannover |
Co-Chair: Meyn, Sean P. | Univ. of Florida |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
10:00-10:20, Paper WeA02.1 | Add to My Program |
Differentiable Sparse Optimal Control |
|
Shima, Ryotaro | Toyota Central R&D Labs |
Moriyasu, Ryuta | Toyota Central R&D Labs |
Kawaguchi, Sho | Toyota Industries Corporation |
Kashima, Kenji | Kyoto University |
Keywords: Optimal control, Machine learning, Computer-aided control design
Abstract: This paper develops a framework for differentiating sparse optimal control inputs with respect to cost parameters. The difficulty lies in the non-smoothness induced by a sparsity-enhancing regularizer. To avoid this, we identify the optimal inputs as a unique zero point of a function using the proximal technique. This enables us to characterize the differentiability and employ the implicit function theorem. We also demonstrate the effectiveness of our approach using a numerical example of inverse optimal control.
|
|
10:20-10:40, Paper WeA02.2 | Add to My Program |
An Efficient Off-Policy Reinforcement Learning Algorithm for the Continuous-Time LQR Problem (I) |
|
Lopez, Victor G. | Leibniz University Hannover |
Müller, Matthias A. | Leibniz University Hannover |
Keywords: Iterative learning control, Data driven control, Optimal control
Abstract: In this paper, an off-policy reinforcement learning algorithm is designed to solve the continuous-time linear quadratic regulator (LQR) problem using only input-state data measured from the system. Different from other algorithms in the literature, we propose the use of a specific persistently exciting input as the exploration signal during the data collection step. We then show that, using this persistently excited data, the solution of the matrix equation in our algorithm is guaranteed to exist and to be unique at every iteration. Convergence of the algorithm to the optimal control input is also proven. Moreover, we formulate the policy evaluation step as the solution of a Sylvester-transpose equation, which increases the efficiency of its solution. A method to determine an initial stabilizing policy using only measured data is proposed. Finally, the advantages of the proposed method are tested via simulation.
|
|
10:40-11:00, Paper WeA02.3 | Add to My Program |
No-Regret Bayesian Optimization with Gradients Using Local Optimality-Based Constraints: Application to Closed-Loop Policy Search (I) |
|
Makrygiorgos, Georgios | University of California, Berkeley |
Paulson, Joel | The Ohio State University |
Mesbah, Ali | University of California, Berkeley |
Keywords: Optimization
Abstract: Bayesian optimization (BO) has emerged as a data-efficient method for global optimization of expensive black-box functions, which commonly arise in learning-based control applications. Recent work has shown that BO can be augmented with gradient measurements to further improve its convergence behavior. These approaches mostly rely on standard acquisition functions and indirectly incorporate gradient information into a probabilistic surrogate model of the performance function to improve its local predictions. This paper presents a new strategy to simultaneously exploit performance (zeroth-order) and gradient (first-order) data within a single constrained acquisition optimization. This is done by enforcing a set of black-box constraints that mimic the necessary optimality conditions for the original global optimization problem. We establish how the incorporation of these constraints restricts the allowable search space of BO, leading to less exploration than zeroth-order BO. The performance of the proposed method is demonstrated for closed-loop policy search via reinforcement learning on a benchmark LQR problem.
|
|
11:00-11:20, Paper WeA02.4 | Add to My Program |
Scenario Optimization with Constraint Relaxation in a Non-Convex Setup: A Flexible and General Framework for Data-Driven Design (I) |
|
Garatti, Simone | Politecnico Di Milano |
Campi, M. C. | Universita' Di Brescia |
Keywords: Uncertain systems, Randomized algorithms, Statistical learning
Abstract: The scenario approach, originally developed as a computational tool for robust problems, has through the years developed into a solid, general, framework for data-driven decision making and design. One main driving force that has fostered this process has certainly been the increasing generality of the considered schemes. In this paper, we move a further step forward in this process. By leveraging some recent results in the wake of the so-called wait-and-judge paradigm, we fully develop a scheme for scenario optimization with constraint relaxation in a non-convex setup, so greatly expanding previous achievements valid under a convexity assumption. We show that a purely data-driven, and yet tight and informative, quantification of the solution robustness is possible regardless of the mechanism through which uncertainty is generated. The generality of this new non-convex setup provides an extremely versatile scheme for data-driven design that can be applied to a variety of problems ranging from mixed-integer optimization to design in abstract spaces.
|
|
11:20-11:40, Paper WeA02.5 | Add to My Program |
On-Policy Data-Driven Linear Quadratic Regulator Via Model Reference Adaptive Reinforcement Learning (I) |
|
Borghesi, Marco | University of Bologna |
Bosso, Alessandro | University of Bologna |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Data driven control, Optimal control, Adaptive control
Abstract: In this paper, we address a data-driven linear quadratic optimal control problem in which the regulator design is performed on-policy by resorting to approaches from reinforcement learning and model reference adaptive control. In particular, a continuous-time identifier of the value function is used to generate online a reference model for the adaptive stabilizer. By introducing a suitably selected dithering signal, the resulting policy is shown to achieve asymptotic convergence to the optimal gain while the controlled plant reaches asymptotically the behavior of the optimal closed-loop system.
|
|
11:40-12:00, Paper WeA02.6 | Add to My Program |
Learning Optimal Policies in Mean Field Models with Kullback-Leibler Regularization (I) |
|
Busic, Ana | Inria |
Meyn, Sean P. | Univ. of Florida |
Cammardella, Neil | University of Florida |
Keywords: Machine learning, Distributed control, Stochastic optimal control
Abstract: The theory and application of mean field games has grown significantly since its origins less than two decades ago. This paper considers a special class in which the game is cooperative, and the cost includes a control penalty defined by Kullback-Leibler divergence, as commonly used in reinforcement learning and other fields. Its use as a control cost or regularizer is often preferred because this leads to an attractive solution. This paper considers a particular control paradigm called Kullback-Leibler Quadratic (KLQ) optimal control, and arrives at the following conclusions: 1. in application to distributed control of electric loads, a new modeling technique is introduced to obtain a simple Markov model for each load (the `agent' in mean field theory). 2. It is argued that the optimality equations may be solved using Monte-Carlo techniques---a specialized version of stochastic gradient descent (SGD). 3. The use of averaging minimizes the asymptotic covariance in the SGD algorithm; the form of the optimal covariance is identified for the first time.
|
|
WeA03 Invited Session, Melati Main 4003-4104 |
Add to My Program |
Safe Planning and Control with Uncertainty Quantification I |
|
|
Chair: Gao, Yulong | University of Oxford |
Co-Chair: Gatsis, Konstantinos | University of Oxford |
Organizer: Gao, Yulong | University of Oxford |
Organizer: Lindemann, Lars | University of Southern California |
Organizer: Fan, Chuchu | Massachusetts Institute of Technology |
Organizer: Abate, Alessandro | University of Oxford |
Organizer: Pappas, George J. | University of Pennsylvania |
|
10:00-10:20, Paper WeA03.1 | Add to My Program |
Koopman-Inspired Implicit Backward Reachable Sets for Unknown Nonlinear Systems |
|
Balim, Haldun | ETH Zurich |
Aspeel, Antoine | University of Michigan |
Liu, Zexiang | University of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Constrained control, Hybrid systems
Abstract: Koopman liftings have been successfully used to learn high dimensional linear approximations for autonomous systems for prediction purposes, or for control systems for leveraging linear control techniques to control nonlinear dynamics. In this paper, we show how learned Koopman approximations can be used for state-feedback correct-by-construction control. To this end, we introduce the Koopman over-approximation, a (possibly hybrid) lifted representation that has a simulation-like relation with the underlying dynamics. Then, we prove how successive application of controlled predecessor operation in the lifted space leads to an implicit backward reachable set for the actual dynamics. Finally, we demonstrate the approach on two nonlinear control examples with unknown dynamics.
|
|
10:20-10:40, Paper WeA03.2 | Add to My Program |
Exact Characterization of the Convex Hulls of Reachable Sets (I) |
|
Lew, Thomas | Stanford |
Bonalli, Riccardo | Laboratoire Des Signaux Et Systèmes |
Pavone, Marco | Stanford University |
Keywords: Uncertain systems, Optimal control, Predictive control for nonlinear systems
Abstract: We study the convex hulls of reachable sets of nonlinear systems with bounded disturbances. Reachable sets play a critical role in control, but remain notoriously challenging to compute, and existing over-approximation tools tend to be conservative or computationally expensive. In this work, we exactly characterize the convex hulls of reachable sets as the convex hulls of solutions of an ordinary differential equation from all possible initial values of the disturbances. This finite-dimensional characterization unlocks a fast sampling-based method to accurately over-approximate reachable sets. We give applications to neural feedback loop analysis and robust model predictive control.
|
|
10:40-11:00, Paper WeA03.3 | Add to My Program |
Logical Zonotopes: A Set Representation for the Formal Verification of Boolean Functions (I) |
|
Alanwar, Amr | Technical University of Munich |
Jiang, Frank J. | Royal Institute of Technology |
Amin, Samy | Constructor University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Formal Verification/Synthesis, Discrete event systems, Computational methods
Abstract: A logical zonotope, which is a new set representation for binary vectors, is introduced in this paper. A logical zonotope is constructed by XOR-ing a binary vector with a combination of other binary vectors called generators. Such a zonotope can represent up to 2^n binary vectors using only n generators. It is shown that logical operations over sets of binary vectors can be performed on the zonotopes' generators and, thus, significantly reduce the computational complexity of various logical operations (e.g., XOR, NAND, AND, OR, and semi-tensor products). Similar to traditional zonotopes' role in the formal verification of dynamical systems over real vector spaces, logical zonotopes can efficiently analyze discrete dynamical systems defined over binary vector spaces. We illustrate the approach and its ability to reduce the computational complexity in two use cases: (1) encryption key discovery of a linear feedback shift register and (2) safety verification of a road traffic intersection protocol.
|
|
11:00-11:20, Paper WeA03.4 | Add to My Program |
Scalable Forward Reachability Analysis of Multi-Agent Systems with Neural Network Controllers (I) |
|
Gates, Oliver | University of Oxford |
Newton, Matthew | University of Oxford |
Gatsis, Konstantinos | University of Oxford |
Keywords: Machine learning, Neural networks, Robust control
Abstract: Neural networks (NNs) have been shown to learn complex control laws successfully, often with performance advantages or decreased computational cost compared to alternative methods. Neural network controllers (NNCs) are, however, highly sensitive to disturbances and uncertainty, meaning that it can be challenging to make satisfactory robustness guarantees for systems with these controllers. This problem is exacerbated when considering multi-agent NN-controlled systems, as existing reachability methods often scale poorly for large systems. This paper addresses the problem of finding overapproximations of forward reachable sets for discrete-time uncertain multi-agent systems with distributed NNC architectures. We first reformulate the dynamics, making the system more amenable to reachablility analysis. Next, we take advantage of the distributed architecture to split the overall reachability problem into smaller problems, significantly reducing computation time. We use quadratic constraints, along with a convex representation of uncertainty in each agent's model, to form semidefinite programs, the solutions of which give overapproximations of forward reachable sets for each agent. Finally, the methodology is tested on two realistic examples: a platoon of vehicles and a power network system.
|
|
11:20-11:40, Paper WeA03.5 | Add to My Program |
Distributionally Robust Optimization Using Cost-Aware Ambiguity Sets |
|
Schuurmans, Mathijs | KU Leuven |
Patrinos, Panagiotis | KU Leuven |
Keywords: Optimization, Statistical learning
Abstract: We present a novel framework for distributionally robust optimization (DRO), called cost-aware DRO (CADRO). The key idea of CADRO is to exploit the cost structure in the design of the ambiguity set to reduce conservatism. Particularly, the set specifically constrains the worst-case distribution along the direction in which the expected cost of an approximate solution increases most rapidly. We prove that CADRO provides both a high-confidence upper bound and a consistent estimator of the out-of-sample expected cost, and show empirically that it produces solutions that are substantially less conservative than existing DRO methods, while providing the same guarantees.
|
|
11:40-12:00, Paper WeA03.6 | Add to My Program |
Bounding Optimality Gaps for Non-Convex Optimization Problems: Applications to Nonlinear Safety-Critical Systems (I) |
|
Akella, Prithvi | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Randomized algorithms, Optimization algorithms, Nonlinear systems
Abstract: Efficient methods to provide sub-optimal solutions to non-convex optimization problems with knowledge of their sub-optimality have been long sought after. To that end, by leveraging recent work in risk-aware verification, we provide two algorithms to (1) probabilistically bound the optimality gaps of solutions reported by novel percentile optimization techniques, and (2) probabilistically bound the maximum optimality gap reported by percentile approaches for repetitive applications, e.g. Model Predictive Control (MPC). Notably, our results work for a large class of optimization problems. We showcase the efficacy and repeatability of our results on a few, benchmark non-convex optimization problems and the utility of our results for controls in a Nonlinear MPC setting.
|
|
WeA04 Invited Session, Simpor Junior 4913 |
Add to My Program |
Control of Connected and Autonomous Vehicles in Mixed Traffic |
|
|
Chair: Cicic, Mladen | CNRS, GIPSA-Lab |
Co-Chair: Delle Monache, Maria Laura | University of California, Berkeley |
Organizer: Cicic, Mladen | CNRS, GIPSA-Lab |
Organizer: Delle Monache, Maria Laura | University of California, Berkeley |
Organizer: Miao, Fei | University of Connecticut |
Organizer: Pasquale, Cecilia | University of Genova |
|
10:00-10:20, Paper WeA04.1 | Add to My Program |
Cyber-Attack Detection Framework for Connected Vehicles in V2X Networks Based on an Iterative UFIR Filter (I) |
|
Jiang, Kai | Nanyang Technological University |
Ju, Zhiyang | Beihang University |
Huang, Lingying | Nanyang Technological University |
Su, Rong | Nanyang Technological University |
Keywords: Attack Detection, Estimation, Transportation networks
Abstract: Connected vehicles have great advantages in driving safety and energy efficiency under the support of vehicle-to-everything (V2X) networks, while they are also vulnerable to malicious cyber-attacks. To enhance the cyber security of connected vehicles, a cyber-attack detection framework is proposed based on multi-source information fusion specifically for the vehicle localization system. In this framework, an iterative unbiased finite impulse response (UFIR) filter is utilized to estimate the vehicle position with low computational load, based on the vehicle dynamics model and information from the inertial measurement system (IMU), GPS, and V2X networks. In addition, a discriminator module is developed to analyze the residuals between estimations and position information from different sources for cyber-attack detection. Finally, multiple simulation cases are implemented to validate the effectiveness of the proposed framework.
|
|
10:20-10:40, Paper WeA04.2 | Add to My Program |
Connected and Automated Vehicles in Mixed-Traffic: Learning Human Driver Behavior for Effective On-Ramp Merging (I) |
|
Senthil Kumar, Nishanth Venkatesh | Cornell University |
Le, Viet-Anh | University of Delaware |
Dave, Aditya | Cornell University |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Traffic control, Optimal control, Machine learning
Abstract: Highway merging scenarios featuring mixed traffic conditions pose significant modeling and control challenges for connected and automated vehicles (CAVs) interacting with incoming on-ramp human-driven vehicles (HDVs). In this paper, we present an approach to learn an approximate information state (AIS) model of CAV-HDV interactions. Thus, the CAV learns the behavior of an incoming HDV using the AIS model and uses it to generate a control strategy for merging. First, we validate the efficacy of this framework on real-world data by using it to predict the behavior of an HDV in situations with other HDVs extracted from the Next-Generation Simulation repository. Then, we generate simulation data for HDV-CAV interactions in a highway merging scenario using a standard inverse reinforcement learning approach. Without assuming a prior knowledge of the generating model, we show that our AIS model learns to predict the future trajectory of the HDV using only observations. Subsequently, we generate safe merging control policies for a CAV when merging with HDVs that demonstrate a spectrum of driving behaviors, from aggressive to conservative. We establish the effectiveness of the proposed approach by performing numerical simulations.
|
|
10:40-11:00, Paper WeA04.3 | Add to My Program |
Exploring CAV-Based Traffic Control for Improving Traffic Conditions in the Face of Bottlenecks (I) |
|
Vishnoi, Suyash | The University of Texas at Austin |
Ji, Junyi | Vanderbilt University |
Bahavarnia, MirSaleh | Vanderbilt University |
Zhang, Yuhang | Vanderbilt University |
Taha, Ahmad | Vanderbilt University |
Claudel, Christian G. | UT Austin |
Work, Daniel B. | Vanderbilt University |
Keywords: Traffic control, Control applications, Autonomous vehicles
Abstract: This work investigates traffic control via controlled connected and automated vehicles (CAVs) using novel controllers derived from the linear-quadratic regulator (LQR) theory. CAV-platoons are modeled as moving bottlenecks impacting the surrounding traffic with their speeds as control inputs. An iterative controller algorithm based on the LQR theory is proposed along with a variant that allows for penalizing abrupt changes in platoon speeds. The controllers use the Lighthill-Whitham-Richards (LWR) model implemented using an extended cell transmission model (CTM) which considers the capacity drop phenomenon for a realistic representation of traffic in congestion. The effectiveness of the proposed traffic control algorithms is tested using a traffic control example and compared with existing proportional-integral (PI)- and model predictive control (MPC)- based controllers from the literature. A case study using the TransModeler traffic microsimulation software is conducted to test the usability of the proposed controller in a realistic setting. It is observed that the proposed controller works well in both settings to mitigate the impact of the jam caused by a fixed bottleneck. The computation time required by the controller is also small making it suitable for real-time control.
|
|
11:00-11:20, Paper WeA04.4 | Add to My Program |
Optimal Control of Autonomous Vehicles for Flow Smoothing in Mixed Autonomy Traffic (I) |
|
Alanqary, Arwa | University of California, Berkeley |
Gong, Xiaoqian | Arizona State University |
Keimer, Alexander | UC Berkeley |
Seibold, Benjamin | Temple University |
Piccoli, Benedetto | Rutgers University - Camden |
Bayen, Alexandre | University of California, Berkeley |
Keywords: Traffic control, Optimal control
Abstract: This article studies the optimal control of autonomous vehicles over a given time horizon to smooth traffic. We model the dynamics of a mixed-autonomy platoon as a system of non-linear ODEs, where the acceleration of human-driven vehicles is governed by a car-following model, and the acceleration of autonomous vehicles is to be controlled. We formulate the car-following task as an optimal control problem and propose a computational method to solve it. Our approach uses an adjoint formulation to compute gradients of the optimization problem explicitly, resulting in more accurate and efficient numerical computations. The gradients are then used to solve the problem using gradient-based optimization solvers. We consider an instance of the problem with the objective of improving the fuel efficiency of the vehicles in the platoon. The effectiveness of the proposed approach is demonstrated through numerical experiments. We apply the proposed approach to different scenarios of lead vehicle trajectories and platoon sizes. The results suggest that introducing an AV can produce significant energy savings for the platoon. It also reveals that the solution is agnostic to the platoon size thus the fuel saving is mainly due to optimizing the trajectory of the AV.
|
|
11:20-11:40, Paper WeA04.5 | Add to My Program |
Robust Decentralised Proof-Of-Position Algorithms for Smart City Applications (I) |
|
Manzano Kharman, Aida | Imperial College London |
Ferraro, Pietro | University College Dublin |
Quinn, Anthony | Imperial College London |
Shorten, Robert | Imperial College London |
Keywords: Agents-based systems, Automotive systems, Smart cities/houses
Abstract: Motivated by the ever growing use of location-based services, we present a decentralised class of algorithms called Tree-Proof-of-Position (T-PoP). Most of the current proofs of location are centralised, thus forgoing verifiablity and privacy for the users. Decentralised solutions also exist, but they suffer from drawbacks that make them unsuitable for realistic, adversarial use cases. T-PoP algorithms rely on the web of interconnected devices in a smart city to establish how likely it is that an agent is in their claimed position. T-PoP operates under adversarial assumptions, where some agents are incentivised to be dishonest. We present a theoretical model for T-PoP and its security properties, and we validate this model through a large number of Monte-Carlo simulations. We specifically focus on two instances of T-PoP and analyse their security and reliability properties under a range of adversarial conditions. Use-cases and applications are discussed towards the end of the paper.
|
|
11:40-12:00, Paper WeA04.6 | Add to My Program |
A Dynamic Population Game Model of Non-Monetary Bottleneck Congestion Management under Elastic Demand Using Karma (I) |
|
Elokda, Ezzat | ETH Zurich |
Cenedese, Carlo | ETH Zurich |
Zhang, Kenan | EPFL |
Censi, Andrea | ETH Zurich |
Bolognani, Saverio | ETH Zurich |
Frazzoli, Emilio | ETH Zürich |
Keywords: Traffic control, Game theory, Mean field games
Abstract: The morning commute bottleneck congestion problem has classically been modelled as a static game in which commuters act strategically based on their immediate Value of Time (VOT). This has restricted existing congestion mitigation techniques to rely on essentially monetary incentives to affect the static costs of the commuters. In contrast, a dynamic model enables characterizing the strategic trade-off between immediate and future resource access rights and inspires the design of new classes of fair, non-monetary congestion mitigation schemes. In this paper, we show how the recently proposed Dynamic Population Game (DPG) framework can be leveraged to study a non-monetary economy for bottleneck congestion management based on karma, a non-tradable mobility credit. Our DPG model allows to consider an elastic demand of commuters that only travel if congestion is reduced, and we show that a Stationary Nash Equilibrium (SNE) is guaranteed to exist despite of the dynamic participation of these commuters. Through numerical case studies we illustrate how our tools can assist policy makers in taking informed decisions about complex policy outcomes. In particular, we show how the dynamic karma scheme is robust to a potentially detrimental rebound effect that would manifest in a static monetary scheme.
|
|
WeA05 Invited Session, Simpor Junior 4912 |
Add to My Program |
Decentralized Optimization and Learning |
|
|
Chair: Yuan, Kun | Peking University |
Co-Chair: You, Keyou | Tsinghua University |
Organizer: Yuan, Kun | Peking University |
Organizer: You, Keyou | Tsinghua University |
|
10:00-10:20, Paper WeA05.1 | Add to My Program |
Smoothing Gradient Tracking for Decentralized Optimization Over the Stiefel Manifold with Non-Smooth Regularizers (I) |
|
Wang, Lei | Academy of Mathematics and Systems Science, Chinese Academy of S |
Liu, Xin | Academy of Mathematics and Systems Science, Chinese Academy of S |
Keywords: Optimization, Decentralized control, Machine learning
Abstract: Recently, decentralized optimization over the Stiefel manifold has attracted tremendous attentions due to its wide range of applications in various fields. Existing methods rely on the gradients to update variables, which are not applicable to the objective functions with non-smooth regularizers, such as sparse PCA. In this paper, to the best of our knowledge, we propose the first decentralized algorithm for non-smooth optimization over Stiefel manifolds. Our algorithm approximates the non-smooth part of objective function by its Moreau envelope, and then existing algorithms for smooth optimization can be deployed. We establish the convergence guarantee with the iteration complexity of O (epsilon^{-4}). Numerical experiments conducted under the decentralized setting demonstrate the effectiveness and efficiency of our algorithm.
|
|
10:20-10:40, Paper WeA05.2 | Add to My Program |
Gradient Tracking with Multiple Local SGD for Decentralized Non-Convex Learning (I) |
|
Ge, Songyang | The Chinese University of Hong Kong, Shenzhen |
Chang, Tsung-Hui | The Chinese University of Hong Kong, Shenzhen |
Keywords: Optimization algorithms, Machine learning, Distributed control
Abstract: The stochastic Gradient Tracking (GT) method for distributed optimization, is known to be robust against the inter-client variance caused by data heterogeneity. However, the stochastic GT method can be communication-intensive, requiring a large number of communication rounds of message exchange for convergence. To address this challenge, this paper proposes a new communication-efficient stochastic GT algorithm called the Local Stochastic GT (LSGT) algorithm, which adopts the local stochastic gradient descent (local SGD) technique in the GT method. With LSGT, each agent can perform multiple SGD updates locally within each communication round. Although it is not known previously whether the stochastic GT method can benefit from the local SGD, we establish the conditions under which our proposed LSGT algorithm enjoys the linear speedup brought by local SGD. Compared with the existing work, our analysis requires less restrictive conditions on the mixing matrix and algorithm stepsize. Moreover, it reveals that the local SGD does not only reserve the resilience of the stochastic GT method against the data heterogeneity but also speeds up reducing the tracking error reduction in the optimization process. The experimental results demonstrate that the proposed LSGT exhibits improved convergence speed and robust performance in various heterogeneous environments.
|
|
10:40-11:00, Paper WeA05.3 | Add to My Program |
Achieving Linear Speedup with Network-Independent Learning Rates in Decentralized Stochastic Optimization (I) |
|
Yuan, Hao | Peking University |
Alghunaim, Sulaiman A. | Kuwait University |
Yuan, Kun | Peking University |
Keywords: Optimization, Machine learning, Decentralized control
Abstract: Decentralized stochastic optimization has become a crucial tool for addressing large-scale machine learning and control problems. In decentralized algorithms, all computing nodes are connected through a network topology, and each node communicates only with its direct neighbors. Decentralized algorithms can significantly reduce communication overhead by eliminating the need for global communication. However, existing research on the linear speedup analysis of decentralized stochastic algorithms is limited to the condition of network-dependent learning rates, which rarely holds in practice since the network connectivity is typically unknown to each node. As a result, it remains an open question whether a linear speedup bound can be achieved using network-independent learning rates. This paper provides an affirmative answer. By utilizing a new analysis framework, we prove D-SGD and Exact-Diffusion, two representative decentralized stochastic algorithms, can achieve linear speedup with network-independent learning rates. Simulations are provided to validate our theories.
|
|
11:00-11:20, Paper WeA05.4 | Add to My Program |
Fully Stochastic Distributed Convex Optimization on Time-Varying Graph with Compression (I) |
|
Yau, Chung-Yiu | The Chinese University of Hong Kong |
Wai, Hoi-To | The Chinese University of Hong Kong |
Keywords: Optimization algorithms, Communication networks, Machine learning
Abstract: This paper develops a fully stochastic proximal primal-dual (FSPPD) algorithm for distributed convex optimization. At each iteration, the distributed algorithm has agents communicating on a randomly drawn graph and applies random sparsification on the transmitted messages, while the agents only have access to a stochastic gradient oracle. To our best knowledge, this is the first compression-enabled distributed stochastic gradient algorithm on random graphs utilizing the primal-dual framework. With diminishing step size, we show that the FSPPD algorithm converges almost surely to an optimal solution of the strongly convex optimization problem. Numerical experiments are provided to verify our results.
|
|
11:20-11:40, Paper WeA05.5 | Add to My Program |
Asynchronous Byzantine-Robust Stochastic Aggregation with Variance Reduction for Distributed Learning (I) |
|
Zhu, Zehan | Zhejiang University |
Huang, Yan | Zhejiang University |
Zhao, Chengcheng | Zhejiang University |
Xu, Jinming | Zhejiang University |
Keywords: Optimization algorithms, Cyber-Physical Security, Machine learning
Abstract: We consider Byzantine-robust distributed learning with asynchronous participation of clients at a certain probability, where Byzantine clients can send malicious messages to the server. Instead of relying on traditional robust aggregation rules, such as Krum and Median, that can only tolerate a limited proportion of Byzantine clients, we propose an asynchronous Byzantine-robust stochastic aggregation method that employs regularization-based techniques to mitigate Byzantine attacks, and adopts variance-reduced techniques to eliminate the effect of stochastic noise of gradient sampling. Leveraging a properly designed Lyapunov function, we show that the proposed algorithm converges linearly to an error ball that is independent of stochastic gradient variance. Extensive experiments are conducted to show its efficacy in dealing with Byzantine attacks compared to the existing counterparts.
|
|
11:40-12:00, Paper WeA05.6 | Add to My Program |
Linear Model Predictive Control under Continuous Path Constraints Via Parallelized Primal-Dual Hybrid Gradient Algorithm (I) |
|
Li, Zishuo | Tsinghua University |
Yang, Bo | Tsinghua University |
Li, Jiayun | Tsinghua University |
Yan, Jiaqi | Tokyo Institute of Technology |
Mo, Yilin | Tsinghua University |
Keywords: Predictive control for linear systems, Optimization algorithms, Constrained control
Abstract: In this paper, we consider a Model Predictive Control (MPC) problem of a continuous-time linear time-invariant system subject to continuous-time path constraints on the states and the inputs. By leveraging the concept of differential flatness, we can replace the differential equations governing the system with linear mapping between the states, inputs, and flat outputs (including their derivatives). The flat outputs are then parameterized by piecewise polynomials, and the model predictive control problem can be equivalently transformed into a Semi-Definite Programming (SDP) problem via Sum-of-Squares (SOS), ensuring constraint satisfaction at every continuous-time interval. We further note that the SDP problem contains a large number of small-size semi-definite matrices as optimization variables. To address this, we develop a Primal-Dual Hybrid Gradient (PDHG) algorithm that can be efficiently parallelized to speed up the optimization procedure. Simulation results on a quadruple-tank process demonstrate that our formulation can guarantee strict constraint satisfaction, while the standard MPC controller based on the discretized system may violate the constraint inside a sampling period. Moreover, the computational speed superiority of our proposed algorithm is collaborated by numerical simulation.
|
|
WeA06 Regular Session, Simpor Junior 4911 |
Add to My Program |
Estimation I |
|
|
Chair: Dunik, Jindrich | University of West Bohemia |
Co-Chair: Anil Meera, Ajith | Radboud University Nijmegen |
|
10:00-10:20, Paper WeA06.1 | Add to My Program |
Adaptive Noise Covariance Estimation under Colored Noise Using Dynamic Expectation Maximization |
|
Anil Meera, Ajith | Radboud University Nijmegen |
Lanillos, Pablo | Spanish National Research Council |
Keywords: Estimation, Biologically-inspired methods, Observers for Linear systems
Abstract: The accurate estimation of the noise covariance matrix (NCM) in a dynamic system is critical for state estimation and control, as it has a major influence in their optimality. Although a large number of NCM estimation methods have been developed, most of them assume the noises to be white. However, in many real-world applications, the noises are colored (e.g., they exhibit temporal autocorrelations), resulting in suboptimal solutions. Here, we introduce a novel brain-inspired algorithm that accurately and adaptively estimates the NCM for dynamic systems subjected to colored noise. Particularly, we extend the Dynamic Expectation Maximization algorithm to perform both online noise covariance and state estimation by optimizing the free energy objective. We mathematically prove that our NCM estimator converges to the global optimum of this free energy objective. Using randomized numerical simulations, we show that our estimator outperforms nine baseline methods with minimal noise covariance estimation error under colored noise conditions. Notably, we show that our method outperforms the best baseline (Variational Bayes) in joint noise and state estimation for high colored noise. We foresee that the accuracy and the adaptive nature of our estimator make it suitable for online estimation in real-world applications.
|
|
10:20-10:40, Paper WeA06.2 | Add to My Program |
Sharp Performance Bounds for PASTA |
|
Marchi, Matteo | University of California, Los Angeles |
Bunton, Jonathan | University of California, Los Angeles |
Gas, Yskandar | University of California, Los Angeles |
Gharesifard, Bahman | University of California, Los Angeles |
Tabuada, Paulo | University of California at Los Angeles |
Keywords: Estimation, Control applications, Robust control
Abstract: LiDAR is a standard sensor choice for self-localization and SLAM of indoor autonomous robots. While there are many methods to estimate a robot's location using LiDAR measurements, most rely on algorithms that solve a generic LiDAR scan matching problem. When safety is a concern, these algorithms must provide a bound on the localization error to enable safety enforcing controllers, such as those based on Control Barrier Functions. Unfortunately, most existing scan matching algorithms offer no formal guarantees, and are tailored to structured, high-resolution 3D point clouds. In this paper, we present an improved theoretical analysis for a low-cost alternative to these methods named PASTA (Provably Accurate Simple Transformation Alignment), originally introduced in [8]. We provide a formal worst-case error guarantee on the localization error and show, experimentally, that it is tight. This characterization of the localization error simplifies the use of high-dimensional perception data for safety-critical control algorithms.
|
|
10:40-11:00, Paper WeA06.3 | Add to My Program |
Finite-Gain L1 Interval Impulsive Observer Design under Denial-Of-Service Attacks |
|
Tagne Mogue, Ruth Line | Univ. Orleans |
Courtial, Estelle | Laboratory PRISME, University of Orleans |
Becis-Aubry, Yasmina | Univ. of Orléans |
Rabehi, Djahid | University of Orléans |
Meslem, Nacim | GIPSA-LAB, CNRS |
Ramdani, Nacim | University of Orléans |
Keywords: Estimation, Hybrid systems
Abstract: The design of a robust observer under denial-of-service attacks is addressed for linear time invariant systems in the bounded-error framework. The cyber-attacks occur between the output of the sensors localized on the physical plant and the cyber part embedding the observer. The data required by the observer are thus available at sporadic measurement time instants. In this setting, an interval impulsive observer is synthesized. The stability analysis of the dynamics of the state estimation error is done leveraging finite-gain L1 stability theory for hybrid systems. The observer L1 gain is computed by combining interval analysis and the resolution of algebraic inequalities that greatly reduces the synthesis complexity when compared to the state-of-the-art approaches that usually rely on solving many bilinear matrix inequalities. A numerical example illustrates the approach and the performance of the designed robust observer.
|
|
11:00-11:20, Paper WeA06.4 | Add to My Program |
Modeling and State Estimation for Lithium Sulfur Batteries As a Piecewise Affine System |
|
Goujard, Guillaume | UC Berkeley |
Dangwal, Chitra | University of California Berkeley |
Gill, Preet | University of California Berkeley |
Kato, Dylan | University of California, Berkeley |
Moura, Scott | University of California, Berkeley |
Keywords: Estimation, Hybrid systems, Machine learning
Abstract: Lithium-sulfur (Li-S) is a promising battery chem- istry for applications demanding high energy densities, such as electrified aircraft and heavy-duty trucks, among others. A critical challenge in modeling the Li-S chemistry lies in the use of differential algebraic (DAE) equations for representing the electrochemical dynamics. Due to their constrained and stiff nature, these equations are not conducive to real-time state estimation. In this study, we propose a novel approach to constrained state estimation for Li-S batteries by integrating a piecewise affine (PWA) model into a moving horizon estimation (MHE) framework. We begin by deriving the PWA model using a linear tree algorithm based on data obtained from simulations of a calibrated DAE model. We further leverage the unique structural advantages of the proposed PWA model to formulate a real-time state estimation algorithm grounded in a mixed-integer quadratic program. Overall, our initial findings, based on a single constant current trajectory, demonstrate that our approach offers an accurate and computationally efficient method for modeling and state estimation of Li-S batteries. The coupled PWA-MHE framework effectively captures the dynamics of the DAE system, even in the presence of high observational noise (20mV).
|
|
11:20-11:40, Paper WeA06.5 | Add to My Program |
Regularization for Distributionally Robust State Estimation and Prediction |
|
Brouillon, Jean-Sébastien | EPFL |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Estimation, Filtering, Time-varying systems
Abstract: The increasing availability of sensing techniques provides a great opportunity for engineers to design state estimation methods, which are optimal for the system under observation and the observed noise patterns. However, these patterns often do not fulfill the assumptions of existing methods. We provide a direct method using samples of the noise to create a moving horizon observer for linear time-varying and nonlinear systems, which is optimal under the empirical noise distribution. Moreover, we show how to enhance the observer with distributional robustness properties in order to handle unmodeled components in the noise profile, as well as different noise realizations. We prove that, even though the design of distributionally robust estimators is a complex minmax problem over an infinite-dimensional space, it can be transformed into a regularized linear program using a system level synthesis approach. Numerical experiments with the Van der Pol oscillator show the benefits of not only using empirical samples of the noise to design the state estimator, but also of adding distributional robustness. We show that our method can significantly outperform state-of-the-art approaches under challenging noise distributions, including multi-modal and deterministic components.
|
|
WeA07 Regular Session, Simpor Junior 4813 |
Add to My Program |
Game Theory I |
|
|
Chair: Hayakawa, Tomohisa | Tokyo Institute of Technology |
Co-Chair: Riess, Hans | Duke University |
|
10:00-10:20, Paper WeA07.1 | Add to My Program |
Stability and Stabilization of Nash Equilibrium for Noncooperative Systems with Vector-Valued Payoff Functions |
|
Guo, Zehui | Tokyo Institute of Technology |
Hayakawa, Tomohisa | Tokyo Institute of Technology |
Yan, Yuyue | Tokyo Institute of Technology |
Keywords: Game theory, Agents-based systems, Linear systems
Abstract: A zero-sum tax/subsidy approach and a necessary condition for stabilizing unstable Nash equilibria in pseudo-gradient-based noncooperative dynamical systems with vector-valued payoff functions are proposed. Specifically, we first present a necessary and sufficient condition for the Nash equilibrium of the noncooperative game with vector-valued payoff functions to be bounded. Then we give a sufficient condition for such Nash equilibrium to be stable. After that, we develop a framework where a system manager constructs a zero-sum tax/subsidy incentive structure by collecting taxes from one agent and giving the same amount of subsidy to the other agent to make the incentivized Nash equilibrium stable and bounded, which can make the trajectories converge to the interior of original Nash equilibrium set. Finally, we present a numerical example to illustrate the utility of the zero-sum tax/subsidy approach.
|
|
10:20-10:40, Paper WeA07.2 | Add to My Program |
A Nash Equilibrium Solution for Periodic Double Auctions |
|
Manvi, Bharat | Tata Consultancy Services |
Subramanian, Easwar | Senior Scientist, TCS Research, Tata Consultancy Services, India |
Keywords: Game theory, Agents-based systems, Markov processes
Abstract: We consider a periodic double auction (PDA) setting where buyers of the auction have multiple (but finite)opportunities to procure multiple but fixed units of a commodity. The goal of each buyer participating in such auctions is to reduce their cost of procurement by planning their purchase across multiple rounds of the PDA. Formulating such optimal bidding strategies in a multi-agent periodic double auction setting is a challenging problem as such strategies involve planning across current and future auctions. In this work, we consider one such setup wherein the composite supply curve is known to all buyers. Specifically, for the complete information setting, we model the PDA as a Markov game and derive Markov perfect Nash equilibrium (MPNE) solution to devise an optimal bidding strategy for the case when each buyer is allowed to make one bid per round of the PDA. Thereafter, the efficacy of the Nash policies obtained is demonstrated with numerical experiments.
|
|
10:40-11:00, Paper WeA07.3 | Add to My Program |
The Cost of Informing Decision-Makers in Multi-Agent Maximum Coverage Problems with Random Resource Values |
|
Ferguson, Bryce L. | University of California, Santa Barbara |
Paccagnan, Dario | Imperial College London |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Agents-based systems, Uncertain systems
Abstract: The emergent behavior of a distributed system is conditioned by the information available to the local decision makers. Therefore, one may expect that providing decision makers with more information will improve system performance; in this work, we find that this is not necessarily the case. In multi-agent maximum coverage problems, we find that even when agents objectives are aligned with the global welfare, informing agents about the realization of the resources random values can reduce equilibrium performance by a factor of 1/2. This affirms an important aspect of designing distributed systems: information need be shared carefully. We further this understanding by providing lower and upper bounds on the ratio of system welfare when information is (fully or partially) revealed and when it is not, termed the value-of-informing. We then identify a trade-off that emerges when optimizing the performance of the best-case and worst-case equilibrium.
|
|
11:00-11:20, Paper WeA07.4 | Add to My Program |
Max-Plus Synchronization in Decentralized Trading Systems |
|
Riess, Hans | Duke University |
Munger, Michael | Duke University |
Zavlanos, Michael M. | Duke University |
Keywords: Game theory, Algebraic/geometric methods, Discrete event systems
Abstract: We introduce a decentralized mechanism for pricing and exchanging alternatives constrained by transaction costs. We characterize the time-invariant solutions of a heat equation involving a (weighted) Tarski Laplacian operator, defined for max-plus matrix-weighted graphs, as approximate equilibria of the trading system. We study algebraic properties of the solution sets as well as convergence behavior of the dynamical system. We apply these tools to the "economic problem"' of allocating scarce resources among competing uses. Our theory suggests differences in competitive equilibrium, bargaining, or cost-benefit analysis, depending on the context, are largely due to differences in the way that transaction costs are incorporated into the decision-making process. We present numerical simulations of the synchronization algorithm (RRAggU), demonstrating our theoretical findings.
|
|
11:20-11:40, Paper WeA07.5 | Add to My Program |
Almost-Bayesian Quadratic Persuasion with a Scalar Prior |
|
Massicot, Olivier | UIUC |
Langbort, Cedric | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Communication networks, Uncertain systems
Abstract: In this article, we consider a problem of strategic communication between a sender (Alice) and a receiver (Bob) akin to the now-traditional model of Bayesian Persuasion intro- duced by Kamenica & Gentzkow, with the crucial difference that Bob is not assumed Bayesian. In lieu of the Bayesian assumption, Alice assumes that Bob behaves almost like a Bayesian agent, in some sense, without resorting to any specific model. Under this assumption, we study Alices strategy when both utilities are quadratic and the prior is scalar. We show that, contrary to the Bayesian case, Alices optimal response may be more subtle than revealing all or nothing. More precisely, Alice reveals the state of the world when it lies outside a specific interval, and nothing otherwise. This interval increases (and the amount of information shared decreases) as Bob further departs from Bayesianity, much to his detriment.
|
|
11:40-12:00, Paper WeA07.6 | Add to My Program |
Collaboration As a Mechanism for More Robust Strategic Classification |
|
Jin, Kun | University of Michigan, Ann Arbor |
Huang, Ziyuan | University of Michigan |
Liu, Mingyan | University of Michigan |
Keywords: Game theory, Control of networks, Learning
Abstract: A conventional strategic classification problem takes on a Stackelberg form: a decision maker commits to a decision rule (e.g., in the form of a binary classifier) and agents best respond to the published decision rule by deciding on an effort level so as to maximize their chance of getting a favorable decision less the cost of the effort. This problem becomes significantly more complex when we allow agents access to two types of effort: honest (improvement actions) and dishonest (or cheating/gaming). While the former improves an agent's underlying unobservable states (e.g., certain types of qualification), the latter merely improves an agent's outward observable feature, serving as input to the classifier. Under the natural assumption that honest effort is more costly than cheating, prior work has shown that the decision maker has limited ability to mitigate cheating by simply adjusting the decision rule. In this paper, we consider a collaboration mechanism, which the decision maker establishes at a cost and offers to the agents together with the decision rule. In this case, an agent best responds by choosing not only its effort but also whether to participate in the mechanism and if so, with which other agents it wishes to form a connection or collaboration relation. While agents outside the mechanism remain independent of each other, those inside the mechanism are connected to a group of collaborators and enjoy positive externality in the form of a boost in their observable features and consequently enhanced probability of a favorable decision outcome. We show how the collaboration mechanism can induce agents to participate and take improvement actions over gaming and how it can benefit both sides. We also discuss the social values of the system, including social welfare, social qualification status, and the mechanism surplus.
|
|
WeA08 Regular Session, Simpor Junior 4812 |
Add to My Program |
Optimal Control I |
|
|
Chair: Cassandras, Christos G. | Boston University |
Co-Chair: Tanaka, Takashi | University of Texas at Austin |
|
10:00-10:20, Paper WeA08.1 | Add to My Program |
Safe Q-Learning for Continuous-Time Linear Systems |
|
Bandyopadhyay, Soutrik | Indian Institute of Technology Delhi |
Bhasin, Shubhendu | Indian Institute of Technology Delhi |
Keywords: Optimal control, Adaptive control, Constrained control
Abstract: Q-learning is a promising method for solving optimal control problems for uncertain systems without the explicit need for system identification. However, approaches for continuous-time Q-learning have limited provable safety guarantees, which restrict their applicability to real-time safety-critical systems. This paper proposes a safe Q-learning algorithm for partially unknown linear time-invariant systems to solve the linear quadratic regulator problem with user-defined state constraints. We frame the safe Q-learning problem as a constrained optimal control problem using reciprocal control barrier functions and show that such an extension provides a safety-assured control policy. To the best of our knowledge, Q-learning for continuous-time systems with state constraints has not yet been reported in the literature.
|
|
10:20-10:40, Paper WeA08.2 | Add to My Program |
A Bilevel Optimization Scheme for Persistent Monitoring |
|
Hall, Jonas | Boston University |
Beaver, Logan E. | Boston University |
Cassandras, Christos G. | Boston University |
Andersson, Sean B. | Boston University |
Keywords: Optimal control, Autonomous systems, Optimization
Abstract: In this paper we study an infinite-horizon persistent monitoring problem in a two-dimensional mission space containing a finite number of statically placed targets, at each of which we assume a constant rate of uncertainty accumulation. Equipped with a sensor of finite range, the agent is capable of reducing the uncertainty of nearby targets. We derive a steady-state minimum time periodic trajectory over which each of the target uncertainties is driven down to zero during each visit. A hierarchical decomposition leads to purely local optimal control problems, coupled via boundary conditions. We optimize both the local trajectory segments as well as the boundary conditions in an on-line bilevel optimization scheme.
|
|
10:40-11:00, Paper WeA08.3 | Add to My Program |
Navigation with Shadow Prices to Optimize Multi Commodity Flow Rates |
|
Boero, Ignacio | Univerisidad De La Republica |
Spasojevic, Igor | MIT |
del Castillo, Mariana | Facultad De Ingeniería, Universidad De La República |
Pappas, George J. | University of Pennsylvania |
Kumar, Vijay | University of Pennsylvania |
Ribeiro, Alejandro | University of Pennsylvania |
Keywords: Optimal control, Communication networks, Autonomous systems
Abstract: We propose a method for providing communication network infrastructure in autonomous multi-agent teams. In particular, we consider a set of communication agents that are placed alongside regular agents from the system. In order to find the optimal positions to place such agents, we define a flexible performance function that adapts to network requirements for different systems. We provide an algorithm based on shadow prices of a related convex optimization problem in order to arrive at a local maximum. We run the algorithm for three different performance functions associated to three practical scenarios, in which we show both the performance of the algorithm and the flexibility for different network requirements.
|
|
11:00-11:20, Paper WeA08.4 | Add to My Program |
Separable Approximations of Optimal Value Functions under a Decaying Sensitivity Assumption |
|
Sperl, Mario | University of Bayreuth |
Saluzzi, Luca | Imperial College London |
Gruene, Lars | University of Bayreuth |
Kalise, Dante | Imperial College London |
Keywords: Optimal control, Computational methods
Abstract: An efficient approach for the construction of separable approximations of optimal value functions from interconnected optimal control problems is presented. The approach is based on assuming decaying sensitivities between subsystems, enabling a curse-of-dimensionality free approximation, for instance by deep neural networks.
|
|
11:20-11:40, Paper WeA08.5 | Add to My Program |
Economic Model Predictive Control of Water Distribution Systems with Solar Energy and Batteries |
|
Zheng, Xiangyi | The University of Melbourne |
Wang, Ye | The University of Melbourne |
Weyer, Erik | Univ. of Melbourne |
Manzie, Chris | The University of Melbourne |
Keywords: Optimal control, Control applications, Smart grid
Abstract: Pumping in water distribution systems (WDSs) consumes a significant amount of power from the grid and may incur large electricity cost. WDSs with solar panels and batteries can greatly reduce the electricity cost by displacing the use of grid-based electricity with solar or stored energy, while also utilising water storage elements to allow selective pumping. A novel economic model predictive control (EMPC) scheme is proposed in this paper to facilitate optimal operation of pumps, batteries and solar panels in the WDSs. The proposed EMPC controller seeks to minimize the energy cost for water pumping while keeping the water levels in tanks and the battery state of charge within restricted limits. The EMPC is applied to an EPANET model of the Richmond Pruned Network, a Doyle Fuller Newman model of a lithium-ion battery, and simulated output from solar panels to to determine the efficacy of the proposed scheme.
|
|
11:40-12:00, Paper WeA08.6 | Add to My Program |
Simulator-Driven Deceptive Control Via Path Integral Approach |
|
Patil, Apurva | The University of Texas at Austin |
Karabag, Mustafa O. | The University of Texas at Austin |
Tanaka, Takashi | University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Optimal control, Cyber-Physical Security, Autonomous systems
Abstract: We consider a setting where a supervisor delegates an agent to perform a certain control task, while the agent is incentivized to deviate from the given policy to achieve its own goal. In this work, we synthesize the optimal deceptive policies for an agent who attempts to hide its deviations from the supervisor's policy. We study the deception problem in the continuous-state discrete-time stochastic dynamics setting and, using motivations from hypothesis testing theory, formulate a Kullback-Leibler control problem for the synthesis of deceptive policies. This problem can be solved using backward dynamic programming in principle, which suffers from the curse of dimensionality. However, under the assumption of deterministic state dynamics, we show that the optimal deceptive actions can be generated using path integral control. This allows the agent to numerically compute the deceptive actions via Monte Carlo simulations. Since Monte Carlo simulations can be efficiently parallelized, our approach allows the agent to generate deceptive control actions online. We show that the proposed simulation-driven control approach asymptotically converges to the optimal control distribution.
|
|
WeA09 Regular Session, Simpor Junior 4811 |
Add to My Program |
Optimization Algorithms I |
|
|
Chair: Notarstefano, Giuseppe | University of Bologna |
Co-Chair: Xu, Jinming | Zhejiang University |
|
10:00-10:20, Paper WeA09.1 | Add to My Program |
Counter-Examples in First-Order Optimization: A Constructive Approach |
|
Goujaud, Baptiste | Ecole Polytechnique |
Dieuleveut, Aymeric | Ecole Polytechnique |
Taylor, Adrien | Inria/Ecole Normale Supérieure |
Keywords: Optimization algorithms, Computer-aided control design
Abstract: While many approaches were developed for obtaining worst-case complexity bounds for first-order optimization methods in the last years, there remain theoretical gaps in cases where no such bound can be found. In such cases, it is often unclear whether no such bound exists (e.g., because the algorithm might fail to systematically converge) or simply if the current techniques do not allow finding them. In this work, we propose an approach to automate the search for cyclic trajectories generated by first-order methods. This provides a constructive approach to show that no appropriate complexity bound exists, thereby complementing the approaches providing sufficient conditions for convergence. Using this tool, we provide ranges of parameters for which some of the famous heavy-ball, Nesterov accelerated gradient, inexact gradient descent, and three-operator splitting algorithms fail to systematically converge, and show that it nicely complements existing tools searching for Lyapunov functions.
|
|
10:20-10:40, Paper WeA09.2 | Add to My Program |
On the Computation-Communication Trade-Off with a Flexible Gradient Tracking Approach |
|
Huang, Yan | Zhejiang University |
Xu, Jinming | Zhejiang University |
Keywords: Optimization algorithms, Cooperative control, Agents-based systems
Abstract: We propose a flexible gradient tracking approach with adjustable computation and communication steps for solving distributed stochastic optimization problems over networks. The proposed method allows each node to perform multiple local gradient updates and multiple inter-node communications in each round, aiming to strike a balance between computation and communication costs according to the properties of objective functions and network topology in non-i.i.d. settings. Leveraging a properly designed Lyapunov function, we derive both the computation and communication complexities for achieving arbitrary accuracy on smooth and strongly convex objective functions. Our analysis demonstrates sharp dependence of the convergence performance on graph topology and properties of objective functions, highlighting the trade-off between computation and communication. Numerical experiments are conducted to validate our theoretical findings.
|
|
10:40-11:00, Paper WeA09.3 | Add to My Program |
Distributed Consensus Optimization Via ADMM-Tracking Gradient |
|
Carnevale, Guido | University of Bologna |
Bastianello, Nicola | KTH Royal Institute of Technology |
Carli, Ruggero | University of Padova |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Optimization algorithms, Distributed control, Networked control systems
Abstract: In this paper, we propose a novel distributed algorithm for consensus optimization over networks. The key idea is to achieve dynamic consensus on the agents' average and on the global descent direction by iteratively solving an online auxiliary optimization problem through the Alternating Direction Method of Multipliers (ADMM). Such a mechanism is suitably interlaced with a local proportional action steering each agent estimate to the solution of the original consensus optimization problem. The analysis uses tools from system theory to prove the linear convergence of the scheme with strongly convex costs. Finally, some numerical simulations confirm our findings and show the robustness of the proposed scheme.
|
|
11:00-11:20, Paper WeA09.4 | Add to My Program |
Distributed Optimization of Clique-Wise Coupled Problems |
|
Watanabe, Yuto | Kyoto University |
Sakurama, Kazunori | Kyoto University |
Keywords: Optimization algorithms, Distributed control, Agents-based systems
Abstract: This study addresses a distributed optimization with a novel class of coupling of variables, called clique-wise coupling. A clique is a node set of a complete subgraph of an undirected graph. This setup is an extension of pairwise coupled optimization problems (e.g., consensus optimization) and allows us to handle coupling of variables consisting of more than two agents systematically. To solve this problem, we propose a clique-based linearized ADMM algorithm, which is proved to be distributed. Additionally, we consider objective functions given as a sum of nonsmooth and smooth convex functions and present a more flexible algorithm based on the FLiP-ADMM algorithm. Moreover, we provide convergence theorems of these algorithms. Notably, all the algorithmic parameters and the derived condition in the theorems depend only on local information, which means that each agent can choose the parameters in a distributed manner. Finally, we apply the proposed methods to a consensus optimization problem and demonstrate their effectiveness via numerical experiments.
|
|
11:20-11:40, Paper WeA09.5 | Add to My Program |
On the Convergence of Decentralized Federated Learning under Imperfect Information Sharing |
|
Chellapandi, Vishnu Pandi | Purdue University |
Upadhyay, Antesh | Purdue University, West Lafayette |
Hashemi, Abolfazl | Purdue University |
Zak, Stanislaw H. | Purdue Univ |
Keywords: Optimization algorithms, Distributed control, Stochastic systems
Abstract: Most of the current literature focused on centralized learning is centered around the celebrated average-consensus paradigm and less attention is devoted to scenarios where the communication between the agents may be imperfect. This paper presents three different algorithms of Decentralized Federated Learning (DFL) in the presence of imperfect information sharing modeled as noisy communication channels. The first algorithm, Federated Noisy Decentralized Learning (FedNDL1) comes from the literature, where the noise is added to the algorithm parameters to simulate the scenario of the presence of noisy communication channels. This algorithm shares parameters to form a consensus with the clients based on a communication graph topology through a noisy communication channel. The proposed second algorithm (FedNDL2) is similar to the first algorithm but with added noise to the parameters and it performs the gossip averaging before the gradient optimization. The proposed third algorithm (FedNDL3), on the other hand, shares the gradients through noisy communication channels instead of the parameters. Theoretical and experimental results show that under imperfect information sharing, the third scheme that mixes gradients is more robust in the presence of a noisy channel compared with the algorithms from the literature that mix the parameters.
|
|
11:40-12:00, Paper WeA09.6 | Add to My Program |
Event-Triggered Distributed Nonconvex Optimization with Progress-Based Threshold |
|
Liu, Changxin | KTH Royal Institute of Technology |
Shi, Eric | Harvard University |
Keywords: Optimization algorithms, Distributed control
Abstract: This work studies the distributed nonconvex optimization problem in bandwidth-limited communication environments. We develop a communication-efficient algorithm based on the gradient-tracking based distributed optimization method, where each computation node is equipped with a new event-triggered communication scheduler. Such scheduler approves the broadcasting only when the innovation of exchanged variables exceeds the change of decision variables in two consecutive updates. Compared to the conventional scheduler with time-dependent vanishing thresholds, the proposed one adapts better to the optimization dynamics and thus leads to more significant communication reduction. Finally, we prove the convergence of the algorithm and illustrate its performance via numerical examples.
|
|
WeA10 Regular Session, Roselle Junior 4713 |
Add to My Program |
Machine Learning I |
|
|
Chair: Mahony, Robert | Australian National University, |
Co-Chair: Pappas, George J. | University of Pennsylvania |
|
10:00-10:20, Paper WeA10.1 | Add to My Program |
Reprojection Methods for Koopman-Based Modelling and Prediction |
|
van Goor, Pieter | Australian National University |
Mahony, Robert | Australian National University, |
Schaller, Manuel | Technische Universität Ilmenau |
Worthmann, Karl | Technische Universität Ilmenau |
Keywords: Machine learning, Algebraic/geometric methods, Predictive control for nonlinear systems
Abstract: Extended Dynamic Mode Decomposition (eDMD) is a powerful tool to generate data-driven surrogate models for the prediction and control of nonlinear dynamical systems in the Koopman framework. In eDMD a compression of the lifted system dynamics on the space spanned by finitely many observables is computed, in which the original space is embedded as a low-dimensional manifold. While this manifold is invariant for the infinite-dimensional Koopman operator, this invariance is typically not preserved for its eDMD-based approximation. Hence, an additional (re-)projection step is often tacitly incorporated to improve the prediction capability. We propose a novel framework for consistent reprojectors respecting the underlying manifold structure. Further, we present a new geometric reprojector based on maximum-likelihood arguments, which significantly enhances the approximation accuracy and preserves known finite-data error bounds.
|
|
10:20-10:40, Paper WeA10.2 | Add to My Program |
Robust Meta-Learning of Vehicle Yaw Rate Dynamics Via Conditional Neural Processes |
|
Ullrich, Lars | Chair of Automatic Control, Friedrich-Alexander-Universität Erla |
Völz, Andreas | Friedrich-Alexander-University Erlangen-Nürnberg |
Graichen, Knut | University Erlangen-Nürnberg (FAU) |
Keywords: Machine learning, Autonomous vehicles, Data driven control
Abstract: Trajectory planners of autonomous vehicles usually rely on physical models to predict the vehicle behavior. However, despite their suitability, physical models have some shortcomings. On the one hand, simple models suffer from larger model errors and more restrictive assumptions. On the other hand, complex models are computationally more demanding and depend on environmental and operational parameters. In each case, the drawbacks can be associated to a certain degree to the physical modeling of the yaw rate dynamics. Therefore, this paper investigates the yaw rate prediction based on conditional neural processes (CNP), a data-driven meta-learning approach, to simultaneously achieve low errors, adequate complexity and robustness to varying parameters. Thus, physical models can be enhanced in a targeted manner to provide accurate and computationally efficient predictions to enable safe planning in autonomous vehicles. High fidelity simulations for a variety of driving scenarios and different types of cars show that CNP makes it possible to employ and transfer knowledge about the yaw rate based on current driving dynamics in a human-like manner, yielding robustness against changing environmental and operational conditions.
|
|
10:40-11:00, Paper WeA10.3 | Add to My Program |
Combinatorial Optimization Approach to Client Scheduling for Federated Learning |
|
Omori, Tomohito | Kyoto University |
Kashima, Kenji | Kyoto University |
Keywords: Machine learning, Control over communications, Optimization algorithms
Abstract: For machine learning in situations where data is scattered and cannot be aggregated, federated learning, in which aggregators and agents send and receive model parameters, is one of the most promising methods. The scheduling problem of deciding which agents to communicate with has been studied in various ways, but it is not easy to solve due to its combinatorial optimization nature. In this letter, we attempt to solve this scheduling problem using combinatorial optimization theory. Specifically, we propose an efficient exact solution method based on dynamic programming and a greedy method whose superiority is confirmed by numerical examples. We also discuss the applicability of the proposed methods to a more realistic federated learning setting.
|
|
11:00-11:20, Paper WeA10.4 | Add to My Program |
Robust Safe Reinforcement Learning under Adversarial Disturbances |
|
Li, Zeyang | Tsinghua University |
Hu, Chuxiong | Tsinghua University |
Li, Shengbo Eben | Tsinghua University |
Cheng, Jia | Tsinghua University |
Wang, Yunan | Tsinghua University |
Keywords: Machine learning, Data driven control, Constrained control
Abstract: Safety is a primary concern when applying reinforcement learning to real-world control tasks, especially in the presence of external disturbances. However, existing safe reinforcement learning algorithms rarely account for external disturbances, limiting their applicability and robustness in practice. To address this challenge, this paper proposes a robust safe reinforcement learning framework that tackles worst-case disturbances. First, this paper presents a policy iteration scheme to solve for the robust invariant set, i.e., a subset of the safe set, where persistent safety is only possible for states within. The key idea is to establish a two-player zero-sum game by leveraging the safety value function in Hamilton-Jacobi reachability analysis, in which the protagonist (i.e., control inputs) aims to maintain safety and the adversary (i.e., external disturbances) tries to break down safety. This paper proves that the proposed policy iteration algorithm converges monotonically to the maximal robust invariant set. Second, this paper integrates the proposed policy iteration scheme into a constrained reinforcement learning algorithm that simultaneously synthesizes the robust invariant set and uses it for constrained policy optimization. This algorithm tackles both optimality and safety, i.e., learning a policy that attains high rewards while maintaining safety under worst-case disturbances. Experiments on classic control tasks show that the proposed method achieves zero constraint violation with learned worst-case adversarial disturbances, while other baseline algorithms violate the safety constraints substantially. Our proposed method also attains comparable performance as the baselines even in the absence of the adversary.
|
|
11:20-11:40, Paper WeA10.5 | Add to My Program |
Uncertainty Quantification for Learning-Based MPC Using Weighted Conformal Prediction |
|
Chee, Kong Yao | University of Pennsylvania |
Hsieh, M. Ani | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Machine learning, Identification for control, Predictive control for nonlinear systems
Abstract: Nonlinear model predictive control (MPC) is an established control framework that not only provides a systematic way to handle state and input constraints, but also offers the flexibility to incorporate data-driven models. With the proliferation of machine learning techniques, there is an uptrend in the development of learning-based MPC, with neural networks (NN) being an important cornerstone. Although it has been shown that NNs are expressive enough to model the dynamics of complex systems and produce accurate state predictions, these predictions often do not include uncertainty estimates or have practical finite sample guarantees. In contrast to existing work that either requires the data samples to be exchangeable or relies on properties of the underlying data distribution, we propose an approach that utilizes weighted conformal prediction to alleviate these assumptions and to synthesize provably valid, finite-sample uncertainty estimates for data-driven dynamics models, in a distribution-free manner. These uncertainty estimates are generated online and incorporated into a novel uncertainty-aware learning-based MPC framework. Through a case study with a cartpole system controlled by a state-of-the-art learning-based MPC framework, we demonstrate that our approach not only provides well-calibrated uncertainty estimates, but also enhances the closed-loop performance of the system.
|
|
11:40-12:00, Paper WeA10.6 | Add to My Program |
Federated TD Learning Over Finite-Rate Erasure Channels: Linear Speedup under Markovian Sampling |
|
Dal Fabbro, Nicolò | Università Degli Studi Di Padova |
Mitra, Aritra | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Machine learning, Large-scale systems, Communication networks
Abstract: Federated learning (FL) has recently gained much attention due to its effectiveness in speeding up supervised learning tasks under communication and privacy constraints. However, whether similar speedups can be established for reinforcement learning remains much less understood theoretically. Towards this direction, we study a federated policy evaluation problem where agents communicate via a central aggregator to expedite the evaluation of a common policy. To capture typical communication constraints in FL, we consider finite capacity up-link channels that can drop packets based on a Bernoulli erasure model. Given this setting, we propose and analyze QFedTD - a quantized federated temporal difference learning algorithm with linear function approximation. Our main technical contribution is to provide a finite-sample analysis of QFedTD that (i) highlights the effect of quantization and erasures on the convergence rate; and (ii) establishes a linear speedup w.r.t. the number of agents under Markovian sampling. Notably, while different quantization mechanisms and packet drop models have been extensively studied in the FL, distributed optimization, and networked control systems literature, our work is the first to provide a non-asymptotic analysis of their effects in multi-agent and federated reinforcement learning.
|
|
WeA11 Regular Session, Roselle Junior 4712 |
Add to My Program |
Agent-Based Systems I |
|
|
Chair: Liu, Ji | Stony Brook University |
Co-Chair: Mou, Shaoshuai | Purdue University |
|
10:00-10:20, Paper WeA11.1 | Add to My Program |
A Distributed Algorithm for Solving Linear Equations in Clustered Multi-Agent Systems |
|
Rai, Ayush | Purdue University |
Mou, Shaoshuai | Purdue University |
Anderson, Brian D.O. | Australian National University |
Keywords: Agents-based systems, Autonomous systems, Distributed control
Abstract: A new approach to solving the linear algebraic equation Ax = b is presented by introducing a leaderless clustered multi-agent system. Each agent is associated with a certain submatrix of A and a vector obtained from b by a certain decomposition process. A distributed algorithm has each agent processing information solely with its own information about A and b, but sharing a time-varying estimate of part of the solution of Ax = b with its neighbors, as defined by a graphical structure overlaying the agents. This graphical structure divides the agents into clusters, with agents in one cluster being associated with one block column of A, and different rows of that block column; with each intra-cluster graph being connected. The update law uses consensus within individual clusters, utilizing their estimated states together with a process of inter-cluster conservation aimed at maintaining certain necessary constraints. Unlike previous literature that uses clustered multi-agent systems or double-layered networks, this framework does not require the presence of an aggregator or central node in the networks clusters. The algorithm demonstrates exponential convergence, evidenced by both theoretical proof and numerical simulations.
|
|
10:20-10:40, Paper WeA11.2 | Add to My Program |
Riemannian Polarization of Multi-Agent Gradient Flows |
|
Mi, La | University of Luxembourg |
Goncalves, Jorge | University of Luxembourg |
Markdahl, Johan | University of Luxembourg |
Keywords: Agents-based systems, Algebraic/geometric methods, Network analysis and control
Abstract: Stable polarization of multi-agent systems has been shown to exist over Rn and highly symmetric nonlinear spaces, especially the n-sphere S^n. Toward a more generalized setting without assuming linearity or symmetry, our previous work established the same type of emergent behavior over general hypersurfaces, subsuming the n-sphere case. In this paper, we discuss our ongoing work of extending our previous hypersurface results to study the stability of polarized equilibria of multi-agent gradient flows evolving on general Riemannian manifolds. The aim is to provide sufficient conditions in terms of the manifold geometry. Special attention is paid to two nonlinear manifolds of interest, the Stiefel manifold and the Grassmannian. While the polarization of the former share similar traits to that of the n-sphere, the latter is shown to have distinct polarization behaviors.
|
|
10:40-11:00, Paper WeA11.3 | Add to My Program |
Constructive Barrier Feedback for Collision Avoidance in Leader-Follower Formation Control |
|
Tang, Zhiqi | Instituto Superior Técnico, Universidade De Lisboa |
Cunha, Rita | Instituto Superior Técnico, Universidade De Lisboa |
Hamel, Tarek | I3S-CNRS-UCA |
Silvestre, Carlos | University of Macau |
Keywords: Agents-based systems, Decentralized control, Nonlinear systems
Abstract: This paper proposes a novel constructive barrier feedback for reactive collision avoidance between two agents. It incorporates this feature in a formation tracking control strategy for a group of 2nd-order dynamic robots defined in three-dimensional space. Using only relative measurements between neighboring agents, we propose an elegant decentralized controller as the sum of a nominal tracking controller and the constructive barrier feedback for leader-follower formations under a directed single-spanning tree graph topology. The key ingredient is the use of divergent flow as a dissipative term, which slows down the relative velocity in the direction of the neighboring robots without compromising the nominal controllers performance. Compared to traditional barrier function-based optimization controllers, the proposed constructive barrier feedback avoids feasibility issues and results in more computationally efficient control algorithms with systematic equilibrium analysis.
|
|
11:00-11:20, Paper WeA11.4 | Add to My Program |
A Two-Layer Opinion Dynamics Model Coupling Static and Bounded-Confidence Interactions |
|
Kravitzch, Emmanuel | Avignon Université, Computer Sciences (LIA UAPV) |
Varma, Vineeth S. | CRAN, Université De Lorraine |
Berthet, Antoine O. | L2S Centrale-Supélec |
Hayel, Yezekael | University of Avignon |
Keywords: Agents-based systems, Discrete event systems, Network analysis and control
Abstract: In this paper, we present a new bounded-confidence model of opinion dynamics where agents with a substantially different opinion may still be able to influence each other along a static graph. Our intention is to account for hard and fast ties present due to physical or social proximity. This additional feature allows weak but persistent interaction between disagreeing agents. Albeit simple, the model remains difficult to analyse due to its separation of time scales. We introduce an appropriate notion of stability, give some properties of the system and formulate a conjecture supported by numerical simulations.
|
|
11:20-11:40, Paper WeA11.5 | Add to My Program |
A Resilient Distributed Algorithm for Solving Linear Equations |
|
Zhu, Jingxuan | Stony Brook University |
Velasquez, Alvaro | Air Force Research Laboratory, AFRL/RISC, Rome, NY |
Liu, Ji | Stony Brook University |
Keywords: Agents-based systems, Distributed control, Cooperative control
Abstract: This paper presents a resilient distributed algorithm for solving a system of linear algebraic equations over a multi-agent network in the presence of Byzantine agents capable of arbitrarily introducing untrustworthy information in communication. It is shown that the algorithm causes all non-Byzantine agents' states to converge to the same least squares solution exponentially fast, provided appropriate levels of graph redundancy and objective redundancy are established. An explicit convergence rate is also provided.
|
|
11:40-12:00, Paper WeA11.6 | Add to My Program |
A Plug & Play Command Generator for Swarm Formation on Multiple Arbitrary Shaped Orbits: A Diffeomorphism-Based Approach |
|
Bono, Antonio | University of Calabria |
D'Alfonso, Luigi | Università Della Calabria |
Fedele, Giuseppe | University of Calabria |
Gazi, Veysel | Yildiz Technical University |
Keywords: Agents-based systems, Distributed control, Robotics
Abstract: In this paper, we develop a strategy for arranging a team of agents in orbits of arbitrary shape that pass through a common point that may move. In particular, the shapes are defined by star-shaped sets. The solution is based on its transformation into the traditional circular formation problem and subsequent reconversion. Given the large number of solutions to such a problem, the developed algorithm is well suited as a plug & play command generator to extend these solutions to the promising field of aerial robotic swarms. Two examples of such an application show its ease of implementation.
|
|
WeA12 Regular Session, Roselle Junior 4711 |
Add to My Program |
Cooperative Control I |
|
|
Chair: Shao, Jinliang | University of Electronic Science and Technology of China, Chengdu |
Co-Chair: He, Jianping | Shanghai Jiao Tong University |
|
10:00-10:20, Paper WeA12.1 | Add to My Program |
Simultaneous Synchronization and Topology Identification of Complex Dynamical Networks |
|
Restrepo, Esteban | CNRS, INRIA Rennes Bretagne Atlantique |
Wang, Nana | Royal Institute of Technology (KTH) |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Cooperative control, Adaptive control, Identification
Abstract: We propose a new method for simultaneous synchronization and topology identification of a complex dynamical network that relies on the edge-agreement framework and on adaptive-control approaches by design of an auxiliary network. Our method guarantees the identification of the unknown topology and it guarantees that once the topology is identified the complex network achieves synchronization. Under our identification algorithm we are able to provide stability results for the estimation errors in the form of uniform semiglobal practical asymptotic stability. Finally, we demonstrate the effectiveness of our approach with an illustrating example.
|
|
10:20-10:40, Paper WeA12.2 | Add to My Program |
Topology-Preserving Second-Order Consensus: A Strategic Compensation Approach |
|
Wang, Zitong | Shanghai Jiao Tong University |
Li, Yushan | Shanghai Jiao Tong University |
Duan, Xiaoming | Shanghai Jiao Tong University |
He, Jianping | Shanghai Jiao Tong University |
Keywords: Cooperative control, Agents-based systems, Cyber-Physical Security
Abstract: The interaction topology plays a significant role in the collaboration of multi-agent systems. How to preserve the topology against inference attacks has become an imperative task for security concerns. In this paper, we propose a distributed topology-preserving algorithm for second-order multiagent systems by adding noisy inputs. The major novelty is that we develop a strategic compensation approach to overcome the noise accumulation issue in the second-order dynamic process while ensuring the exact second-order consensus. Specifically, we design two distributed compensation strategies that make the topology more invulnerable against inference attacks. Furthermore, we derive the relationship between the inference error and the number of observations by taking the ordinary least squares estimator as a benchmark. Extensive simulations are conducted to verify the topology-preserving performance of the proposed algorithm.
|
|
10:40-11:00, Paper WeA12.3 | Add to My Program |
An Adaptive Distributed Observer for a Class of Discrete-Time Uncertain Linear Systems Over Acyclic Digraphs |
|
Liu, Tao | Southern University of Science and Technology |
Huang, Jie | The Chinese University of Hong Kong |
Keywords: Cooperative control, Adaptive systems, Uncertain systems
Abstract: This paper proposes an adaptive distributed observer for a class of discrete-time uncertain linear leader systems. The leader system is assumed to be neutrally stable with unknown parameters in the system matrix. Such a leader system can produce multi-tone sinusoidal signals with unknown frequencies, magnitudes, and phases. Under the assumption that the digraph of the communication network is a spanning tree with the leader system as the root, the proposed adaptive distributed observer is shown to be capable of estimating over the communication network not only the leader's state, but also the unknown parameters of the leader's system matrix.
|
|
11:00-11:20, Paper WeA12.4 | Add to My Program |
Mutualistic Interactions in Heterogeneous Multi-Agent Systems |
|
Nguyen, Alexander A. | University of California, Irvine |
Jabbari, Faryar | Univ. of California at Irvine |
Egerstedt, Magnus | University of California, Irvine |
Keywords: Cooperative control, Agents-based systems, Autonomous robots
Abstract: This paper presents a collaboration strategy that enables heterogeneous agents, i.e., different capabilities and dynamics, to accomplish tasks by working together. The collaboration between multiple agents is inspired by the ecological concept known as a mutualism, an interaction between two or more species that benefits everyone involved. A collaborative act is made possible through the composition of barrier functions, which allows the heterogeneous agents to work together safely. Moreover, a measure of collaborative potential is established to assess the merit of agents interacting with each other. Furthermore, the collaboration framework is provided for a general multi-agent setting. Finally, the collaboration framework's efficacy is demonstrated in two case studies that necessitate collaboration between the heterogeneous agents to complete their respective tasks.
|
|
11:20-11:40, Paper WeA12.5 | Add to My Program |
Bipartite Flocking Control for Multi-Agent Systems with Switching Topologies and Time Delays under Coopetition Interactions |
|
Ma, Zhuangzhuang | University of Electronic Science and Technology of China |
Li, Bowen | University of Electronic Science and Technology of China |
Shao, Jinliang | University of Electronic Science and Technology of China, Chengd |
Cheng, Yuhua | University of Electronic Science and Technology of China |
Zheng, Wei Xing | Western Sydney University |
Keywords: Cooperative control, Agents-based systems, Communication networks
Abstract: This paper investigates the bipartite flocking behavior of multi-agent systems with coopetition interactions, where communications between agents are described by signed digraphs. The scenario with switching topologies due to the movement of agents, and time delays caused by the limited data transmission capability, is considered comprehensively. Nonlinear weight functions are designed to describe the relationship between the communication distance of agents and the coopetition degree in real biological networks. A distributed update rule based on the neighbors' information and the designed weight functions is proposed. By the aid of the graph theory and sub-stochastic matrix properties, the effectiveness of the proposed update rule is proved theoretically, and the algebraic conditions for achieving the bipartite flocking behavior are obtained. Finally, the theoretical results are verified by numerical simulations.
|
|
11:40-12:00, Paper WeA12.6 | Add to My Program |
Finite-Time Topology Identification for Complex Dynamical Networks |
|
Wang, Nana | Royal Institute of Technology (KTH) |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Cooperative control, Adaptive control, Identification for control
Abstract: This paper presents a finite-time topology identification method for complex dynamical networks. This method prevents the difficulty of verifying linear independence conditions and ensures the success of accurate topology identification. The topology identification scheme first renders the error dynamics between the networks and reference signals zero in finite time, and afterward, the topology is estimated by building an auxiliary network. The identification of topology is achieved once a relaxed excitation condition holds. The excitation condition is guaranteed by the proposed tracking control scheme. A finite-time topology identification and synchronization scheme for complex systems is further proposed where synchronization is realized by removing the exciting signals after the identification of the topology. At last, the simulation results verify the feasibility of the proposed method.
|
|
WeA13 Invited Session, Roselle Junior 4613 |
Add to My Program |
Identification in Networked Systems |
|
|
Chair: Hendrickx, Julien M. | UCLouvain |
Co-Chair: Van den Hof, Paul M.J. | Eindhoven University of Technology |
Organizer: Hendrickx, Julien M. | UCLouvain |
Organizer: Van den Hof, Paul M.J. | Eindhoven University of Technology |
Organizer: Vizuete, Renato | UCLouvain |
|
10:00-10:20, Paper WeA13.1 | Add to My Program |
Local Identification in Dynamic Networks Using a Multi-Step Least Squares Method (I) |
|
Fonken, Stefanie J.M. | Eindhoven University of Technology |
Ramaswamy, Karthik R. | Eindhoven University of Technology |
Van den Hof, Paul M.J. | Eindhoven University of Technology |
Keywords: Closed-loop identification, Large-scale systems, Networked control systems
Abstract: For identification of a single module in a linear dynamic network with correlated disturbances different methods are available in a prediction error setting. While indirect methods fully rely on the presence of a sufficient number of external excitation signals for achieving data-informativity, the local direct method with a MIMO predictor model can exploit also non-measured disturbance signals for data-informativity. However, a simple two-node example shows that this local direct method can also be conservative in terms of the number of external excitation signals that is required. Inspired by a recently introduced multi-step method for full network identification, we present a multi-step least squares method for single module identification. In a first indirect step a model is estimated that is used to reconstruct the innovation on a set of output signals, which in a second step is used to directly estimate the module dynamics with a MISO predictor model. The resulting path based conditions for data-informativity show that the multi-step method requires a smaller number of excitation signals for data-informativity than the local direct method.
|
|
10:20-10:40, Paper WeA13.2 | Add to My Program |
Connecting Graphical Notions of Separation and Statistical Notions of Independence for Topology Reconstruction (I) |
|
Materassi, Donatello | University of Minnesota |
Keywords: Network analysis and control, Stochastic systems, Identification
Abstract: Over the last decade, there has been a significant increase in interest for techniques that can infer the connectivity structure of a network of dynamic systems. This article examines a flexible class of network systems and reviews various methods for reconstructing their underlying graph. However, these techniques typically only guarantee consistent reconstruction if additional assumptions on the model are made, such as the network topology being a tree, the dynamics being strictly causal, or the absence of directed loops in the network. The central theme of the article is to reinterpret these methodologies under a unified framework where a graphical notion of separation between nodes of the underlying graph corresponds to a probabilistic notion of separation among associated stochastic processes. This duality property enables the creation of algorithms for reconstructing the network topology. The article also endeavors to connect the considered class of networks with other semantically similar network models.
|
|
10:40-11:00, Paper WeA13.3 | Add to My Program |
Nonlinear Network Identifiability: The Static Case (I) |
|
Vizuete, Renato | UCLouvain |
Hendrickx, Julien M. | UCLouvain |
Keywords: Network analysis and control, Identification
Abstract: We analyze the problem of network identifiability with nonlinear functions associated with the edges. We consider a static model for the output of each node and by assuming a perfect identification of the function associated with the measurement of a node, we provide conditions for the identifiability of the edges in a specific class of functions. First, we analyze the identifiability conditions in the class of all nonlinear functions and show that even for a path graph, it is necessary to measure all the nodes except by the source. Then, we consider analytic functions satisfying f(0)=0 and we provide conditions for the identifiability of paths and trees. Finally, by restricting the problem to a smaller class of functions where none of the functions is linear, we derive conditions for the identifiability of directed acyclic graphs. Some examples are presented to illustrate the results.
|
|
11:00-11:20, Paper WeA13.4 | Add to My Program |
Boolean Internal Structure Reconstruction from Collapsed Small-Scale Networks (I) |
|
Teng, Basi | University of Cambridge |
Zhao, Yuxuan | Huazhong University of Science and Technology |
Yuan, Ye | Huazhong University of Science and Technology |
Goncalves, Jorge | University of Luxembourg |
Keywords: Boolean control networks and logic networks, Identification, Large-scale systems
Abstract: Dynamic network reconstruction aims to infer network structure from input-output data. Dynamical structure functions (DSFs) have been introduced to represent structural information between observable nodes of linear time-invariant systems. However, reconstructing large-scale DSFs can be difficult since most existing methods do not scale. Instead of inferring large DSFs directly, an alternative approach is to reconstruct many small-scale DSFs that are easier to infer. Given a sparsity constraint on the network, this paper proposes a necessary and sufficient condition for perfect reconstruction of the Boolean network using collapsed small-scale networks. For sparse networks, such as gene regulatory networks, this method can significantly reduce time and computational costs of Boolean network inference for most links in the network, especially when using parallel computing.
|
|
11:20-11:40, Paper WeA13.5 | Add to My Program |
Optimal PMU Placement for Voltage Estimation in Partially Known Power Networks (I) |
|
Mishra, Aditya | University of California, San Diego |
de Callafon, Raymond A. | Univ. of California, San Diego |
Keywords: Estimation, Power systems, Optimization
Abstract: Observability of all bus voltages in a power network enables overall monitoring and fault detection of power flow. Information on this voltage state is often a combination of voltage, current measurements obtained by Phasor Measurement Units (PMUs) and state estimation techniques that use admittance information of the connections between nodes within the power network. Voltage state estimation is a challenge for a power network in which limited PMUs need to be combined with partially know network admittance information. The challenge lies in choosing locations of PMUs such that full voltage state reconstruction is possible, despite the lack of full knowledge on network admittance information. This paper proposes a methodology of placing PMUs across a network with incomplete network admittance information that guarantees complete observability of the voltage states. The methodology separates network nodes in distinct nodal sets based on voltage, current and admittance information. Permutations of the sets are uses to establish the minimum number of PMUs required for full voltage state observability for a power network with partially known admittance information. Subsequently, an additional optimal placement can be used to further minimize the variance of the estimated voltage states. The proposed PMU placement approach is tested on a modified IEEE-14 bus with incomplete network admittance information.
|
|
11:40-12:00, Paper WeA13.6 | Add to My Program |
Latent Dynamic Networked System Identification with High-Dimensional Networked Data (I) |
|
Yu, Jiaxin | CityU of Hong Kong |
Mo, Yanfang | City University of Hong Kong |
Qin, S. Joe | Lingnan University |
Keywords: Identification, Networked control systems, Statistical learning
Abstract: Networked dynamic systems are ubiquitous in various domains such as industrial processes, social networks, and biological systems. These systems produce high-dimensional data that reflect the complex interactions among the network nodes with rich sensor measurements. In this paper, we propose a novel algorithm for latent dynamic networked system identification that leverages the network structure and performs dimension reduction for each node via dynamic latent variables (DLV). The algorithm assumes that the DLVs of each node have an auto-regressive model with exogenous input and interactions from other nodes. The DLVs of each node are extracted to capture the most predictable latent variables in the high dimensional data, while the residual factors are not predictable. The advantage of the proposed framework is demonstrated on an industrial process network for system identification and dynamic data analytics.
|
|
WeA14 Invited Session, Roselle Junior 4612 |
Add to My Program |
Modeling, Analysis, and Control of Complex Systems |
|
|
Chair: Ye, Mengbin | Curtin University |
Co-Chair: Cao, Ming | University of Groningen |
Organizer: Ye, Mengbin | Centre for Optimisation and Decision Science, Curtin University |
Organizer: Zino, Lorenzo | Politecnico Di Torino |
Organizer: Cao, Ming | University of Groningen |
Organizer: Leonard, Naomi Ehrich | Princeton University |
|
10:00-10:20, Paper WeA14.1 | Add to My Program |
Species Coexistence and Extinction Resulting from Higher-Order Lokta-Volterra Two-Faction Competition (I) |
|
Cui, Shaoxuan | University of Groningen |
Zhao, Qi | Qingdao University of Science and Technology |
Jardón-Kojakhmetov, Hildeberto | University of Groningen |
Cao, Ming | University of Groningen |
Keywords: Biological systems, Stability of nonlinear systems, Agents-based systems
Abstract: It is known that the effect of species' density on species' growth is non-additive in real ecological systems. This challenges the conventional Lokta-Volterra model, where the interactions are always pairwise and their effects are additive. To address this challenge, we introduce HOIs (Higher-Order Interactions) and are able to capture, for example, the indirect effect of one species on a second one correlating to a third species. Towards this end, we propose a purely cooperative higher-order Lokta-Volterra model and a higher-order Lokta-Volterra two-faction competition model. By utilizing the theory of monotone systems, we provide stability conditions for both models. The stability analysis further shows that small HOIs usually promote the coexistence of all species, while the extinction of some species is usually caused by a huge difference among the higher-order competitive terms. Finally, illustrative numerical examples are provided to highlight our contributions.
|
|
10:20-10:40, Paper WeA14.2 | Add to My Program |
Population Games with Replicator Dynamics under Event-Triggered Payoff Provider and a Demand Response Application |
|
Martinez-Piazuelo, Juan | Universitat Politècnica De Catalunya |
Ananduta, Wicak | Flemish Institute for Technological Research (VITO) |
Ocampo-Martinez, Carlos | Universitat Politècnica De Catalunya (UPC) |
Grammatico, Sergio | Delft University of Technology |
Quijano, Nicanor | Universidad De Los Andes |
Keywords: Game theory, Hybrid systems, Optimization
Abstract: We consider a large population of decision makers that choose their evolutionary strategies based on simple pairwise imitation rules. We describe such a dynamic process by the replicator dynamics. Differently from the available literature, where the payoffs signals are assumed to be updated continuously, we consider a more realistic scenario where they are updated occasionally. Our main technical contribution is to devise two event-triggered communication schemes with asymptotic convergence guarantees to a Nash equilibrium. Finally, we show how our proposed approach is applicable as an efficient distributed demand response mechanism.
|
|
10:40-11:00, Paper WeA14.3 | Add to My Program |
Structural Properties of Optimal Risk-Aware Controllers for Spatially Invariant Systems |
|
Arbelaiz, Juncal | Princeton University |
Bamieh, Bassam | Univ. of California at Santa Barbara |
Leonard, Naomi Ehrich | Princeton University |
Keywords: Control of networks, Distributed control, Optimal control
Abstract: We analyze the optimal Linear Exponential Quadratic Gaussian (LEQG) control synthesis of a spatially distributed system with a shift invariance in its spatial coordinate, perturbed by additive white Gaussian noise. We refer to such a system as spatially invariant. The LEQG framework accounts for the risk attitude of the controller in its synthesis by appropriate selection of the value of a free parameter, providing the possibility to continuously tune the degree of risk awareness of the controller. We prove important structural properties of the optimal LEQG control problem for spatially invariant systems, namely that: (i) the optimal LEQG control gain is spatially invariant itself; (ii) the LEQG control synthesis problem is equivalent to a family of decoupled LEQG optimization problems of smaller dimension; and (iii) under some further assumptions, the optimal LEQG control gain is spatially localized. Through a case study, we illustrate how the risk attitude of the controller tunes the degree of spatial localization of the optimal control gain. We argue that the proven structural properties can be leveraged to reduce the computational complexity of obtaining the optimal LEQG control gain in large-scale systems and to design distributed risk-aware controller implementations.
|
|
11:00-11:20, Paper WeA14.4 | Add to My Program |
On Adaptive-Gain Control of Replicator Dynamics in Population Games (I) |
|
Zino, Lorenzo | Politecnico Di Torino |
Ye, Mengbin | Centre for Optimisation and Decision Science, Curtin University |
Rizzo, Alessandro | Politecnico Di Torino |
Calafiore, Giuseppe C. | Politecnico Di Torino |
Keywords: Game theory, Nonlinear systems, Adaptive control
Abstract: Controlling evolutionary game-theoretic dynamics is a problem of paramount importance for the systems and control community, with several applications spanning from social science to engineering. Here, we study a population of individuals who play a generic 2-action matrix game, and whose actions evolve according to a replicator equation - a nonlinear ordinary differential equation that captures salient features of the collective behavior of the population. Our objective is to steer such a population to a specified equilibrium that represents a desired collective behavior - e.g., to promote cooperation in the prisoner's dilemma. To this aim, we devise an adaptive-gain controller, which regulates the system dynamics by adaptively changing the entries of the payoff matrix of the game. The adaptive-gain controller is tailored according to distinctive features of the game, and conditions to guarantee global convergence to the desired equilibrium are established.
|
|
11:20-11:40, Paper WeA14.5 | Add to My Program |
Propagation of Stubborn Opinions on Signed Graphs (I) |
|
Razaq, Muhammad Ahsan | Linkoping University |
Altafini, Claudio | Linkoping University |
Keywords: Network analysis and control, Agents-based systems, Communication networks
Abstract: This paper addresses the problem of propagation of opinions in a Signed Friedkin-Johnsen (SFJ) model, i.e., an opinion dynamics model in which the agents are stubborn and the interaction graph is signed. We provide sufficient conditions for the stability of the SFJ model and for convergence to consensus of a concatenation of such SFJ models.
|
|
11:40-12:00, Paper WeA14.6 | Add to My Program |
Nash-Equilibrium Seeking Algorithm for Power Allocation Games on Networks of International Relations (I) |
|
Zhang, Chuanzhe | Peking University |
Li, Yuke | Peking University |
Mei, Wenjun | Peking University |
Keywords: Game theory, Network analysis and control, Optimization algorithms
Abstract: In the field of international security, understanding the strategic interactions between countries within a networked context is crucial. Our previous research has introduced a games-on-signed graphs framework to analyze these interactions. While the framework is intended to be basic and general, there is much left to be explored, particularly in capturing the complexity of strategic scenarios in international relations. Our paper aims to fill this gap in two key ways. First, we modify the existing preference axioms to allow for a more nuanced understanding of how countries pursue self-survival, defense of allies, and offense toward adversaries. Second, we introduce a novel algorithm that proves the existence of a pure strategy Nash equilibrium for these revised games. To validate our model, we employ historical data from the year 1940 as the game input and predict countries survivability. Our contributions thus extend the real-world applicability of the original framework, offering a more comprehensive view of strategic interactions in a networked security environment.
|
|
WeA15 Regular Session, Roselle Junior 4611 |
Add to My Program |
Adaptive Control I |
|
|
Chair: Shin, Hyo-Sang | Cranfield University |
Co-Chair: Baldi, Simone | Southeast University |
|
10:00-10:20, Paper WeA15.1 | Add to My Program |
Adaptive Anti-Swing Control for Clasping Operations in Quadrotors with Cable-Suspended Payload |
|
Dantu, Swati | IIIT HYDERABAD |
Yadav, Rishabh Dev | IIIT HYDERABAD |
Rachakonda, Ananth | IIIT Hyderabad |
Roy, Spandan | IIIT HYDERABAD |
Baldi, Simone | Southeast University |
Keywords: Adaptive control, Aerospace
Abstract: Crucial phases in aerial transportation and delivery of suspended payloads are the clasping and unclasping of the payload to the cable. During these phases, along with the uncertainties in the quadrotor and in the environment, the inevitable payload swings induced by the human interaction or by other external interaction will create additional state-dependent uncertainties; such uncertainties pose a significant challenge in terms of control. If they continue unabated, these uncertainties can cause safety hazard for the quadrotor, the payload and, most importantly, for the human operating the clasping/unclasping tasks. As the state-of-the-art adaptive controllers cannot tackle such uncertainties or considers them as bounded terms, this paper presents an adaptive anti-swing controller where all uncertainties are taken in a state-dependent form. This choice is made to better capture uncertain clasping and unclasping operations of the suspended payload. The closed-loop stability is studied analytically and the real-time experiments confirm significant performance improvements for the proposed scheme over the state of the art.
|
|
10:20-10:40, Paper WeA15.2 | Add to My Program |
Collision Avoidance in Longitudinal Platooning: Graceful Degradation and Adaptive Designs |
|
Liu, Di | Technical University of Munich |
Baldi, Simone | Southeast University |
Hirche, Sandra | Technische Universität München |
Keywords: Adaptive control, Autonomous vehicles, Cooperative control
Abstract: An externally positive system has the property of giving a nonnegative output for any nonnegative input. By making the inter-vehicle spacing a nonnegative output, this system property is significant for collision avoidance in platooning. Yet, existing platooning results based on external positivity just apply to adaptive cruise control (ACC): as ACC uses on-board sensing only, these results do not apply when onboard sensing is integrated with inter-vehicle communication, as in cooperative adaptive cruise control (CACC). This work provides an integrated external positivity design for CACC. When unreliable communication requires transitions between CACC and ACC, the design still guarantees graceful degradation in terms of collision avoidance and disturbance rejection. Such graceful transitions can be attained also in the presence of vehicle parameter uncertainty, via a suitable adaptive control design.
|
|
10:40-11:00, Paper WeA15.3 | Add to My Program |
Least-Squares Composite Learning Backstepping Control with High-Order Tuners |
|
Pan, Yongping | Sun Yat-Sen University |
Shi, Tian | Sun Yat-Sen University |
Wen, Changyun | Nanyang Tech. Univ |
Keywords: Adaptive control, Closed-loop identification, Nonlinear systems
Abstract: Transient performance improvement in adaptive backstepping control is beneficial for the stability and robustness of control systems. In addition, parameter convergence in classical adaptive control is dependent on a stringent condition named persistent excitation (PE). This paper proposes a least squares-based composite learning backstepping control (LS-CLBC) strategy with high-order tuners for strict-feedback uncertain nonlinear systems such that exponential stability with parameter convergence is achieved under interval excitation (IE) or even partial IE that is strictly weaker than PE. In the LS-CLBC, the storage and forgetting of online historical data are determined by the exciting strength of a novel excitation matrix consisting of only active regressor channels, such that excitation information of regressor channels is exploited more effectively and efficiently to achieve parameter estimation. The learning rate is adjusted online based on LS and integrated into a high-order tuner to obtain the high-order time derivatives of parameter estimates. The closed-loop system is proven exponentially stable. Simulation results have demonstrated the superiority of the proposed approach.
|
|
11:00-11:20, Paper WeA15.4 | Add to My Program |
A Model Reference Adaptive Controller Based on Operator-Valued Kernel Functions |
|
Oesterheld, Derek | Virginia Tech |
Stilwell, Daniel J. | Virginia Tech |
Kurdila, Andrew J. | Virginia Tech |
Guo, Jia | Georgia Institute of Technology |
Keywords: Adaptive control, Distributed parameter systems, Autonomous vehicles
Abstract: This paper extends recent results on model reference adaptive control using reproducing kernel Hilbert space (RKHS) learning techniques for some general cases of multi-input systems. We leverage recent results on error bounds for nonlinear observers in a vector-valued RKHS to design adaptive model reference adaptive controllers (MRAC) that are induced by operator-valued kernels. This paper formulates a model reference adaptive control strategy based on a dead zone robust modification, and derives for this case conditions for the ultimate boundedness of the tracking error. The RKHS setting allows the control designer to influence the ultimate bound by selection and placement of operator-valued kernels. As in the scalar-valued setting, closed-form expressions are obtained for the ultimate upper bound. But in this case the upper bound depends on a generalization of the power function for an operator-valued kernel space. Finally, we provide a detailed illustration our results in practice for the case of attitude control of a streamlined tailed-controlled underwater vehicle.
|
|
11:20-11:40, Paper WeA15.5 | Add to My Program |
Composite Model Reference Adaptive Control under Finite Excitation with Unstructured Uncertainties |
|
Cho, Namhoon | Cranfield University |
Shin, Hyo-Sang | Cranfield University |
Kim, Youdan | Seoul National University |
Tsourdos, Antonios | Cranfield University |
Keywords: Adaptive control, Estimation, Aerospace
Abstract: This paper presents an online parameter update algorithm in the context of composite model reference adaptive control based on intermittent signal holding to improve convergence properties of the parameters representing the unstructured uncertainties in the absence of persistent excitation. The present study extends the algorithm which was previously developed by considering only the structured uncertainties for which the basis functions are known a priori. The proposed extension utilises the Gaussian radial basis function neural network as the model for the uncertainty assuming appropriate placement of the local basis functions in the state space. A notable distinction from the case with full knowledge of the features constituting the linearly-parameterised uncertainty model is that the extended algorithm introduces a robustifying modification in the earlier phase of operation to deal with the inevitable learning residual.
|
|
11:40-12:00, Paper WeA15.6 | Add to My Program |
Rate-Matching the Regret Lower-Bound in the Linear Quadratic Regulator with Unknown Dynamics |
|
Wang, Feicheng | Harvard University |
Janson, Lucas | Harvard University |
Keywords: Adaptive control, Identification for control, Linear systems
Abstract: The theory of adaptive learning-based control currently suffers from a mismatch between its empirical performance and the theoretical characterization of its performance, with consequences for, e.g., the understanding of sample efficiency, safety, and robustness. The linear quadratic regulator with unknown dynamics is a fundamental adaptive control setting with significant structure in its dynamics and cost function, yet even in this setting the ratio between the best-known regret or estimation error upper bounds and their corresponding best-known lower bounds is unbounded due to polylogarithmic factors in T. This gap has not been closed in any of the many papers theoretically studying the linear quadratic regulator with unknown dynamics, and indeed similar gaps have plagued other areas of theoretical online learning such as reinforcement learning. The contribution of this paper is to close that gap by establishing a novel regret upper-bound of O_p(sqrt{T}) , and simultaneously establishes an estimation error bound on the dynamics of O_p(T^{-1/4}). The two keys to our improved proof technique are (1) a more precise upper- and lower-bound on the system Gram matrix by establishing exact rates of eigenvalues from different sub-spaces and (2) a self-bounding argument for the expected estimation error of the optimal controller. Our technique may shed light on removing polylogarithmic factors in other adaptive learning problems.
|
|
WeA16 Invited Session, Peony Junior 4512 |
Add to My Program |
Computational Techniques for Automation in Energy Systems |
|
|
Chair: Guo, Yi | ETH Zürich |
Co-Chair: Jiang, Yuning | EPFL |
Organizer: Guo, Yi | ETH Zürich |
Organizer: Jiang, Yuning | EPFL |
Organizer: Mallada, Enrique | Johns Hopkins University |
Organizer: Jones, Colin N. | EPFL |
Organizer: Hug, Gabriela | ETH Zurich |
Organizer: Lygeros, John | ETH Zurich |
|
10:00-10:20, Paper WeA16.1 | Add to My Program |
MIMO Grid Impedance Identification of Three-Phase Power Systems: Parametric vs. Nonparametric Approaches (I) |
|
Häberle, Verena | ETH Zurich |
Huang, Linbin | ETH Zurich |
He, Xiuqiang | ETH Zurich |
Prieto-Araujo, Eduardo | CITCEA-UPC |
Smith, Roy S. | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Power systems, Power electronics, Identification
Abstract: A fast and accurate grid impedance measurement of three-phase power systems is crucial for online assessment of power system stability and adaptive control of grid-connected converters. Existing grid impedance measurement approaches typically rely on pointwise sinusoidal injections or sequential wideband perturbations to identify a nonparametric grid impedance curve via fast Fourier computations in the frequency domain. This is not only time-consuming, but also inaccurate during time-varying grid conditions, while on top of that, the identified nonparametric model cannot be immediately used for stability analysis or control design. To tackle these problems, we propose to use parametric system identification techniques (e.g., prediction error or subspace methods) to obtain a parametric impedance model directly from time-domain current and voltage data. Our approach relies on injecting wideband excitation signals in the converter's controller and allows to accurately identify the grid impedance in closed loop within one injection and measurement cycle. Even though the underlying parametric system identification techniques are well-studied in general, their utilization in a grid impedance identification setup poses specific challenges, is vastly underexplored, and has not gained adequate attention in urgent and timely power systems applications. To this end, we demonstrate in numerical experiments how the proposed parametric approach can accomplish a significant improvement compared to prevalent nonparametric methods.
|
|
10:20-10:40, Paper WeA16.2 | Add to My Program |
Tractable Identification of Electric Distribution Networks (I) |
|
Stanojev, Ognjen | ETH Zürich |
Werner, Lucien | California Institute of Technology |
Low, Steven | California Institute of Technology |
Hug, Gabriela | ETH Zurich |
Keywords: Power systems, Smart grid, Network analysis and control
Abstract: The identification of distribution network topology and parameters is a critical problem that lays the foundation for improving network efficiency, enhancing reliability, and increasing its capacity to host distributed energy resources. Network identification problems often involve estimating a large number of parameters based on highly correlated measurements, resulting in an ill-conditioned and computationally demanding estimation process. We address these challenges by proposing two admittance matrix estimation methods. In the first method, we use the eigendecomposition of the admittance matrix to generalize the notion of stationarity to electrical signals and demonstrate how the stationarity property can be used to facilitate a maximum a posteriori estimation procedure. We relax the stationarity assumption in the second proposed method by employing Linear Minimum Mean Square Error (LMMSE) estimation. Since LMMSE estimation is often ill-conditioned, we introduce an approximate well-conditioned solution. Our quantitative results demonstrate the improvement in computational efficiency compared to the state-of-the-art methods while preserving the estimation accuracy.
|
|
10:40-11:00, Paper WeA16.3 | Add to My Program |
Safe Zeroth-Order Optimization Using Linear Programs (I) |
|
Guo, Baiwei | EPF Lausanne |
Wang, Yang | Delft University of Technology |
Jiang, Yuning | EPFL |
Kamgarpour, Maryam | EPFL |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Optimization, Power systems, Data driven control
Abstract: To solve unmodeled optimization problems with hard constraints, this paper proposes a novel zeroth-order approach called Safe Zeroth-order Optimization using Linear Programs (SZO-LP). The SZO-LP method solves a linear program in each iteration to find a descent direction, followed by a step length determination. We prove that, under mild conditions, the iterates of SZO-LP have an accumulation point that is also the primal of a KKT pair. We then apply SZO-LP to solve an Optimal Power Flow (OPF) problem on the IEEE 30-bus system. The results demonstrate that SZO-LP requires less computation time and samples compared to state-of-the-art approaches.
|
|
11:00-11:20, Paper WeA16.4 | Add to My Program |
Battery Optimization for Power Systems: Feasibility and Optimality (I) |
|
Elsaadany, Mazen | University of Vermont |
Almassalkhi, Mads | University of Vermont |
Keywords: Optimization, Modeling, Power systems
Abstract: The deployment of battery energy storage systems (BESS) is necessary to integrate terawatts of renewable generation while supporting grid resilience and reliability efforts. Optimizing battery dispatch requires predictive battery models that accurately characterize the battery state of charge (SOC) to ensure that the battery operates within the energy and power limits and avoids unexpected saturation effects. Furthermore, most BESS are unable to simultaneously charge and discharge, which begets an additional, non-convex complementary constraint. This paper presents and compares recently developed predictive battery models that side-step the non-convexity while providing supporting analysis on modeling error and optimal parameter selection. Specifically, insights for four different predictive BESS formulations are presented, including non-linear, mixed-integer, linear convex relaxation, and linear robust formulations. Additionally, two two-stage approaches are also considered. Analysis is conducted on optimal parameter selection for two of the methods, as well, as providing a new and improved SOC error bound on the relaxed formulation and the role of sustainability constraints on the robust formulation. Through the lens of relevant BESS use-cases, the paper discusses optimality and feasibility guarantees between the different models and provides extensive simulation-based analysis.
|
|
11:20-11:40, Paper WeA16.5 | Add to My Program |
Leveraging Predictions in Power System Frequency Control: An Adaptive Approach (I) |
|
Cui, Wenqi | University of Washington |
Shi, Guanya | Carnegie Mellon University |
Shi, Yuanyuan | University of California San Diego |
Zhang, Baosen | University of Washington |
Keywords: Power systems, Learning
Abstract: Ensuring the frequency stability of electric grids with increasing renewable resources is a key problem in power system operations. In recent years, a number of advanced controllers have been designed to optimize frequency control. These controllers, however, almost always assume that the net load in the system remains constant over a sufficiently long time. Given the intermittent and uncertain nature of renewable resources, it is becoming important to explicitly consider net load that is time-varying. This paper proposes an adaptive approach to frequency control in power systems with significant time-varying net load. We leverage the advances in short-term load forecasting, where the net load in the system can be accurately predicted using weather and other features. We integrate these predictions into the design of adaptive controllers, which can be seamlessly combined with most existing controllers including conventional droop control and emerging neural network-based controllers. We prove that the overall control architecture achieves frequency restoration decentralizedly. Case studies verify that the proposed method improves both transient and frequency-restoration performances compared to existing approaches.
|
|
11:40-12:00, Paper WeA16.6 | Add to My Program |
Fast Constraint Screening for Multi-Interval Unit Commitment (I) |
|
He, Xuan | Hong Kong University of Science and Technology (Guangzhou) |
Tian, Jiayu | Sun Yat-Sen University |
Zhang, Yufan | University of California, San Diego |
Wen, Honglin | Shanghai Jiaotong University |
Chen, Yize | Hong Kong University of Science and Technology |
Keywords: Power systems, Power generation, Machine learning
Abstract: Power systems Unit Commitment (UC) problem determines the generator commitment schedule and dispatch decisions for power networks based on forecasted electricity demand. However, with the increasing penetration of renewables and stochastic demand behaviors, it becomes challenging to solve the large-scale, multi-interval UC problem in an efficient manner. The main objective of this paper is to propose a fast and reliable scheme to eliminate a set of redundant or inactive physical constraints in the high-dimensional, multi-interval, mixed-integer UC problem, while the reduced problem is equivalent to the original full problem in terms of commitment decisions. Our key insights lie on pre-screening the constraints based on the load distribution and considering the physical feasibility regions of multi-interval UC problem. For the multi-step UC formulation, we overcome screening conservativeness by utilizing the multi-step ramping relationships, and can reliably screen out more constraints compared to current practice. Extensive simulations on both specific load samples and load regions validate the proposed technique can screen out more than 80% constraints while preserving the feasibility of multi-interval UC problem.
|
|
WeA17 Regular Session, Peony Junior 4511 |
Add to My Program |
Data-Driven Control I |
|
|
Chair: Padoan, Alberto | ETH Zürich |
Co-Chair: Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
|
10:00-10:20, Paper WeA17.1 | Add to My Program |
Informativity for Identification for 2D State-Representable Autonomous Systems, with Applications to Data-Driven Simulation |
|
Rapisarda, Paolo | Univ. of Southampton |
Pal, Debasattam | Indian Institute of Technology Bombay |
Keywords: Data driven control, Autonomous systems, Behavioural systems
Abstract: We define persistency of excitation and informativity for system identification for the class of 2D state- representable autonomous systems. We characterize informativity for system identification in terms of properties of a matrix constructed from the restrictions of a system trajectory on successive consecutive lines. We state a procedure to compute arbitrary trajectories from a "sufficiently rich" one.
|
|
10:20-10:40, Paper WeA17.2 | Add to My Program |
Direct Data-Driven Computation of Polytopic Robust Control Invariant Sets and State-Feedback Controllers |
|
Mejari, Manas | University of Applied Sciences and Arts of Southern Switzerland |
Gupta, Ankit | Zenseact AB |
Keywords: Data driven control, Constrained control, Robust control
Abstract: This paper presents a direct data-driven approach for computing robust control invariant (RCI) sets and their associated state-feedback control laws for linear time-invariant systems affected by bounded disturbances. The proposed method utilizes a single state-input trajectory generated from the system, to compute a polytopic RCI set with a desired complexity and an invariance-inducing feedback controller, without the need to identify a model of the system. The problem is formulated in terms of a set of sufficient linear matrix inequality conditions that are then combined in a semi-definite program to maximize the volume of the RCI set while respecting the state and input constraints. We demonstrate through a numerical case study that the proposed data-driven approach can generate RCI sets that are of comparable size to those obtained by a model-based method in which exact knowledge of the system matrices is assumed.
|
|
10:40-11:00, Paper WeA17.3 | Add to My Program |
Data-Driven Representations of Conical, Convex, and Affine Behaviors |
|
Padoan, Alberto | ETH Zürich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Lygeros, John | ETH Zurich |
Keywords: Data driven control, Behavioural systems, Identification
Abstract: The paper studies conical, convex, and affine models in the framework of behavioral systems theory. We investigate basic properties of such behaviors and address the problem of constructing models from measured data. We prove that closed, shift-invariant, conical, convex, and affine models have the intersection property, thereby enabling the definition of most powerful unfalsified models based on infinite-horizon measurements. We then provide necessary and sufficient conditions for representing conical, convex, and affine finite-horizon behaviors using raw data matrices, expressing persistence of excitation requirements in terms of non-negative rank conditions. The applicability of our results is demonstrated by a numerical example arising in population ecology.
|
|
11:00-11:20, Paper WeA17.4 | Add to My Program |
On the Sample Complexity of the Linear Quadratic Gaussian Regulator |
|
Al Makdah, Abed AlRahman | University of California Riverside |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Data driven control, Learning, Optimal control
Abstract: In this paper we provide direct data-driven expressions for the Linear Quadratic Regulator (LQR), the Kalman filter, and the Linear Quadratic Gaussian (LQG) controller using a finite dataset of noisy input, state, and output trajectories. We show that our data-driven expressions are consistent, since they converge as the number of experimental trajectories increases, we characterize their convergence rate, and quantify their error as a function of the system and data properties. These results complement the body of literature on data-driven control and finite-sample analysis, and provide new ways to solve canonical control and estimation problems that do not assume, nor require the estimation of, a model of the system and noise and do not rely on solving implicit equations.
|
|
11:20-11:40, Paper WeA17.5 | Add to My Program |
Combining Q-Learning and Deterministic Policy Gradient for Learning-Based MPC |
|
Seel, Katrine | NTNU |
Gros, Sebastien | NTNU |
Gravdahl, Jan Tommy | Norwegian Univ. of Science & Tech |
Keywords: Data driven control, Learning, Optimization
Abstract: This paper considers adjusting a fully parametrized model predictive control (MPC) scheme to approximate the optimal policy for a system as accurately as possible. By adopting MPC as a function approximator in reinforcement learning (RL), the MPC parameters can be adjusted using Q-learning or policy gradient methods. However, each method has its own specific shortcomings when used alone. Indeed, Q-learning does not exploit information about the policy gradient and therefore may fail to capture the optimal policy, while policy gradient methods miss any cost function corrections not affecting the policy directly. The former is a general problem, whereas the latter is an issue when dealing with economic problems specifically. Moreover, it is notoriously difficult to perform second-order steps in the context of policy gradient methods while it is straightforward in the context of Q-learning. This calls for an organic combination of these learning algorithms, in order to fully exploit the MPC parameterization as well as speed up convergence in learning.
|
|
11:40-12:00, Paper WeA17.6 | Add to My Program |
Data-Driven Eigenstructure Assignment for Sparse Feedback Design |
|
Celi, Federico | University of California, Riverside |
Baggio, Giacomo | University of Padova, Italy |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Data driven control, Linear systems, Stability of linear systems
Abstract: This paper presents a novel approach for solving the pole placement and eigenstructure assignment problems through data-driven methods. By using open-loop data alone, the paper shows that it is possible to characterize the allowable eigenvector subspaces, as well as the set of feedback gains that solve the pole placement problem. Additionally, the paper proposes a closed-form expression for the feedback gain that solves the eigenstructure assignment problem. Finally, the paper discusses a series of optimization problems aimed at finding sparse feedback gains for the pole placement problem.
|
|
WeA18 Regular Session, Peony Junior 4412 |
Add to My Program |
Nonlinear Systems I |
|
|
Chair: Isidori, Alberto | Universita Di Roma |
Co-Chair: Kaldmäe, Arvo | Tallinn University of Technology |
|
10:00-10:20, Paper WeA18.1 | Add to My Program |
Synthesizing Stable Reduced-Order Visuomotor Policies for Nonlinear Systems Via Sums-Of-Squares Optimization |
|
Chou, Glen | MIT |
Tedrake, Russ | MIT |
Keywords: Nonlinear output feedback, Robotics, Vision-based control
Abstract: We present a method for synthesizing dynamic, reduced-order output-feedback polynomial control policies for control-affine nonlinear systems which guarantees runtime stability to a goal state, when using visual observations and a learned perception module in the feedback control loop. We leverage Lyapunov analysis to formulate the problem of synthesizing such policies. This problem is nonconvex in the policy parameters and the Lyapunov function that is used to prove the stability of the policy. To solve this problem approximately, we propose two approaches: the first solves a sequence of sum-of-squares optimization problems to iteratively improve a policy which is provably-stable by construction, while the second directly performs gradient-based optimization on the parameters of the polynomial policy, and its closed-loop stability is verified a posteriori. We extend our approach to provide stability guarantees in the presence of observation noise, which realistically arises due to errors in the learned perception module. We evaluate our approach on several underactuated nonlinear systems, including pendula and quadrotors, showing that our guarantees translate to empirical stability when controlling these systems from images, while baseline approaches can fail to reliably stabilize the system.
|
|
10:20-10:40, Paper WeA18.2 | Add to My Program |
Feasibility Detection for Nested Codesign of Hypersonic Vehicles |
|
van der Heide, Chris | University of Melbourne |
Cudmore, Peter | University of Melbourne |
Jahn, Ingo | The University of Queensland |
Bone, Viv | The University of Melbourne |
Dower, Peter M. | University of Melbourne |
Manzie, Chris | The University of Melbourne |
Keywords: Nonlinear systems, Aerospace
Abstract: Controllability and feasaiblity measures are used to determine whether a given system can achieve its specified objective. However, for nonlinear systems with space constraints, the controllable and feasible sets may be highly sensitive to minor perturbations in the system's constraints, initial states and parameters. This becomes particularly important in codesign of hypersonic vehicles, where functions governing the dynamics must be estimated from expensive computational fluid dynamics simulations, and poor initialization can lead to significant wasted resources. By relaxation of the constraints and introduction of a surrogate cost, we provide a method for detection and quantification of which constraints are violating feasibility. To demonstrate the method in a concrete example, we apply the technique to simulation of hypersonic vehicle trajectories.
|
|
10:40-11:00, Paper WeA18.3 | Add to My Program |
Further Results on the Structure of Normal Forms of Input-Affine Nonlinear MIMO Systems |
|
Isidori, Alberto | Universita Di Roma |
Keywords: Nonlinear systems, Algebraic/geometric methods
Abstract: In a recent paper it has been shown that the existence, for a MIMO nonlinear system, of normal forms with a special structure that proves to be useful in the design of feedback laws is implied by an assumption introduced a long time ago by Hirschorn in his work on systems invertibility. In this paper, we provide an alternative viewpoint and prove that a necessary and sufficient condition for the existence of such kind of normal forms can be identified in a special feature of the so-called maximal controlled invariant distribution algorithm.
|
|
11:00-11:20, Paper WeA18.4 | Add to My Program |
Relations between Modules Associated to Input-Output Nonlinear Equations with Delays and Their Realizations |
|
Bartosiewicz, Zbigniew | Bialystok University of Technology |
Kaldmäe, Arvo | Tallinn University of Technology |
Kotta, Ülle | Tallinn University of Technology |
Wyrwas, Malgorzata | Bialystok University of Technology |
Keywords: Nonlinear systems, Algebraic/geometric methods
Abstract: The relations between a control system with delays given by a nonlinear input-output equation and its realization are addressed. The algebraic formalism based on rings of polynomials over the rings associated with the considered systems and modules of differential one-forms is used to show the relations between submodules corresponding to the input-output equation and its realization.
|
|
11:20-11:40, Paper WeA18.5 | Add to My Program |
Controlled Density Transport Using Perron Frobenius Generators |
|
Buzhardt, Jake | Clemson University |
Tallapragada, Phanindra | Clemson University |
Keywords: Nonlinear systems, Fluid flow systems, Data driven control
Abstract: We consider the problem of the transport of a density of states from an initial state distribution to a desired final state distribution through a dynamical system with actuation. In particular, we consider the case where the control signal is a function of time, but not space; that is, the same actuation is applied at every point in the state space. This is motivated by several problems in fluid mechanics, such as mixing and manipulation of a collection of particles by a global control input such as a uniform magnetic field, as well as by more general control problems where a density function describes an uncertainty distribution or a distribution of agents in a multi-agent system. We formulate this problem using the generators of the Perron-Frobenius operator associated with the drift and control vector fields of the system. By considering finite-dimensional approximations of these operators, the density transport problem can be expressed as a control problem for a bilinear system in a high-dimensional, lifted state. With this system, we frame the density control problem as a problem of driving moments of the density function to the moments of a desired density function, where the moments of the density can be expressed as an output which is linear in the lifted state. This output tracking problem for the lifted bilinear system is then solved using differential dynamic programming, an iterative trajectory optimization scheme.
|
|
11:40-12:00, Paper WeA18.6 | Add to My Program |
Convergence Rates for Approximations of Deterministic Koopman Operators Via Inverse Problems |
|
Powell, Nathan | EPFL |
Bouland, Ali | Virginia Tech |
Burns, John A | Virginia Tech |
Kurdila, Andrew J. | Virginia Tech |
Keywords: Nonlinear systems, Estimation, Statistical learning
Abstract: This paper derives rates of convergence of approximations of the deterministic Koopman operator using a framework based on estimating solutions of inverse problems. By restricting the domain of the Koopman operator, simple sufficient conditions are derived that ensure that the resulting Koopman operator is compact when acting on a suitable reproducing kernel Hilbert space (RKHS). Approximations of the Koopman operator, or its inverse, are derived in terms of Galerkin approximations of solutions to an associated inverse problem which depends on noisy data. The resulting bounds on accuracy of approximations to the Koopman operator then take a classical form: we obtain explicit representations of the contributions of the approximation error and the generalization error in terms of the reduced dimension and noise level. As the reduced dimension of the approximations increases, the approximation error decreases, while the generalization error increases. The generalization error increases with an increase in the noise level. In the case of a discrete evolution over a smooth, compact, connected, Riemannian manifold, we show that these two contributions to the error can be bounded in terms of the fill distance of centers of approximation and samples in the manifold
|
|
WeA19 Regular Session, Peony Junior 4411 |
Add to My Program |
Linear Systems I |
|
|
Chair: Iannelli, Andrea | University of Stuttgart |
Co-Chair: Bianchin, Gianluca | University of Louvain |
|
10:00-10:20, Paper WeA19.1 | Add to My Program |
Condition for Sensitivity Unidentifiability of Linear Systems with Affinely Parameter-Dependent Coefficient Matrices |
|
Yamakawa, Masafumi | Nagoya University |
Asai, Toru | Nagoya University |
Ariizumi, Ryo | Nagoya University |
Azuma, Shun-ichi | Kyoto University |
Keywords: Linear systems, Estimation, Identification
Abstract: In this paper, we analyze "sensitivity identifiability" of initial states and parameters in affinely parametrized linear systems. If the true parameter is sensitivity unidentifiable (non-SI), optimization-based estimation algorithms may face computational problems. Thus, it is important to detect whether the parameter is non-SI a priori. To this aim, we address a problem to find the condition that the parameter is non-SI for any initial state and input. Then, we obtain a sufficient condition for the problem. The condition is given by algebraic equations and is expected to be the foundation of structural conditions. We show systems that satisfy the condition and are observable and controllable.
|
|
10:20-10:40, Paper WeA19.2 | Add to My Program |
A Data-Driven Approach to System Invertibility and Input Reconstruction |
|
Mishra, Vikas Kumar | Technische Universitat Kaiserlautern |
Iannelli, Andrea | University of Stuttgart |
Bajcinca, Naim | University of Kaiserslautern |
Keywords: Linear systems, Estimation
Abstract: We consider the problems of system invertibility and input reconstruction for linear time-invariant (LTI) systems using only measured data. The two problems are connected in the sense that input reconstruction is possible provided that the system is left invertible. To verify the latter property without model knowledge, we leverage behavioral systems theory and develop two data-driven algorithms: one based on input/state/output data and the other based only on input/output data. We then consider the problem of input reconstruction for both noise-free and noisy data settings. In the case of noisy data, a statistical approach is leveraged to formulate the problem as a maximum likelihood estimation (MLE) problem. The proposed approaches are finally illustrated with numerical examples that show: exact input reconstruction in the noise-free setting; and the better performance of the MLE-based approach compared to the standard least-norm solution.
|
|
10:40-11:00, Paper WeA19.3 | Add to My Program |
Non-Overshooting Tracking Controllers Based on Combinatorial Polynomials |
|
Taghavian, Hamed | KTH Royal Institute of Technology |
Johansson, Mikael | KTH - Royal Institute of Technology |
Keywords: Linear systems
Abstract: This paper presents a technique for designing two-parameter compensators that stabilize a plant and provide offset-free tracking of set-points without overshooting or undershooting. We first represent the impulse response of linear systems using combinatorial polynomials, based on which a new set of conditions is derived for the system to be externally positive. This result is then used in control synthesis to achieve monotonic tracking. In contrast to the methods available in the literature, the proposed technique always gives a solution whenever the problem is feasible, can yield as small a settling time as desired, and provides the freedom to choose the closed-loop poles arbitrarily inside the unit circle, all obtained by low-degree controllers.
|
|
11:00-11:20, Paper WeA19.4 | Add to My Program |
Data-Driven Exact Pole Placement for Linear Systems |
|
Bianchin, Gianluca | University of Louvain |
Keywords: Linear systems, Identification for control, Stability of linear systems
Abstract: The exact pole placement problem concerns computing a static feedback law for a linear dynamical system that will assign its poles at a set of pre-specified locations. This is a classic problem in feedback control and numerous methodologies have been proposed in the literature for cases where a model of the system to control is available. In this paper, we study the problem of computing feedback laws for pole placement (and, more generally, eigenstructure assignment) directly from experimental data. Interestingly, we show that the closed-loop poles can be placed exactly at arbitrary locations without relying on any model description but by using only finite-length trajectories generated by the open-loop system. In turn, these findings imply that classical control goals, such as feedback stabilization or meeting transient transient performance specifications, can be achieved directly from data without first identifying a system model. Numerical experiments demonstrate the benefits of the data-driven pole-placement approach compared to its model-based counterpart.
|
|
11:20-11:40, Paper WeA19.5 | Add to My Program |
Phase of Multivariable Systems: A Revisit Via mathcal{H}_2^T-Dissipativity |
|
Wang, Dan | KTH Royal Institute of Technology |
Chen, Wei | Peking University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Linear systems
Abstract: A new notion of phase of multi-input multi-output (MIMO) systems was recently defined and studied, leading to new understandings in various fronts including a formulation of small phase theorem, a performance criterion named mathcal{H}_{infty} phase sector, and a sectored real lemma, etc. In this paper, we define a new notion of mathcal{H}_2^T-dissipativity and show the connection between the phase of a multivariable linear time-invariant (LTI) system and the mathcal{H}_2^T-dissipativity. The mathcal{H}_2^T-dissipativity, roughly speaking, is dissipativity restricted to the time-domain mathcal{H}_2 space which consists of mathcal{L}_2 signals with only positive frequency components. In addition, by exploiting the newly defined mathcal{H}_2^T-dissipativity, we also study the phase of a feedback system and provide a physical interpretation of the sectored real lemma.
|
|
11:40-12:00, Paper WeA19.6 | Add to My Program |
Markov Chain Monte Carlo for Gaussian: A Linear Control Perspective |
|
Yuan, Bo | Georgia Institute of Technology |
Fan, Jiaojiao | Georgia Institute of Technology |
Wang, Yuqing | Georgia Institute of Technology |
Tao, Molei | Georgia Institute of Technology |
Chen, Yongxin | Georgia Institute of Technology |
Keywords: Linear systems, Lyapunov methods, Filtering
Abstract: Drawing samples from a given target probability distribution is a fundamental task in many science and engineering applications. A commonly used method for sampling is the Markov chain Monte Carlo (MCMC) which simulates a Markov chain whose stationary distribution coincides with the target one. In this work, we study the convergence and complexity of MCMC algorithms from a dynamic system point of view. We focus on the special cases with Gaussian target distributions and provide a Lyapunov perspective to them using tools from linear control theory. In particular, we systematically analyze two popular MCMC algorithms: Langevin Monte Carlo (LMC) and kinetic Langevin Monte Carlo (KLMC). By applying Lyapunov theory we derive tight complexity bounds to these algorithms. Our analysis also highlights subtle differences between sampling and optimization that could inform the more challenging task to sample from general distributions. Overall, our findings offer valuable insights for improving MCMC algorithms.
|
|
WeA20 Regular Session, Orchid Junior 4312 |
Add to My Program |
Biological Systems I |
|
|
Chair: Duenas, Victor H | Syracuse University |
Co-Chair: Giordano, Giulia | University of Trento |
|
10:00-10:20, Paper WeA20.1 | Add to My Program |
Robust Online Estimation of Biophysical Neural Circuits |
|
Schmetterling, Raphael | University of Cambridge |
B. Burghi, Thiago | University of Cambridge |
Sepulchre, Rodolphe | University of Cambridge |
Keywords: Biological systems, Robust adaptive control, Estimation
Abstract: The control of neuronal networks, whether biological or neuromorphic, relies on tools for estimating parameters in the presence of model uncertainty. In this work, we explore the robustness of adaptive observers for neuronal estimation. Inspired by biology, we show that decentralization and redundancy help recover the performance of a centralized recursive mean square algorithm in the presence of uncertainty and mismatch on the internal dynamics of the model.
|
|
10:20-10:40, Paper WeA20.2 | Add to My Program |
Dynamic Brain Networks with Prescribed Functional Connectivity |
|
Casti, Umberto | University of Padova |
Baggio, Giacomo | University of Padova, Italy |
Benozzo, Danilo | University of Padova |
Zampieri, Sandro | Univ. Di Padova |
Bertoldo, Alessandra | University of Padova |
Chiuso, Alessandro | Univ. Di Padova |
Keywords: Biological systems, Linear systems, Stochastic systems
Abstract: In this paper, we consider stable stochastic linear systems modeling whole-brain resting-state dynamics. We parametrize the state matrix of the system (effective connectivity) in terms of its steady-state covariance matrix (functional connectivity) and a skew-symmetric matrix S. We examine how the matrix S influences some relevant dynamic properties of the system. Specifically, we show that a large S enhances the degree of stability and excitability of the system, and makes the latter more responsive to high-frequency inputs.
|
|
10:40-11:00, Paper WeA20.3 | Add to My Program |
Modelling Pathogen Response of the Human Immune System in a Reduced State Space |
|
Tajvar, Pouria | KTH, Royal Institute of Technology |
Forlin, Rikard | Karolinska Institutet |
Brodin, Petter | Karolinska Institutet |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Cellular dynamics, Systems biology, Identification
Abstract: The immune system response to pathogens is organized by a network of cells communicating through expression of a variety of proteins and signaling molecules. A high number of genes are involved in encoding these communicating agents, but the relatively low number of data points is a major challenge in modelling the gene expression response. In this work we propose a feature-selection approach based on gene expression distributions at the single-cell level that improves dynamics identification at the population level. We investigate common approaches to differential expression analysis and show that Earth Mover's Distance (EMD) is a relatively robust measure for gene selection as reflected by the coefficient of variation as well as accuracy of a naive Bayes classifier based on the selected genes. We ultimately propose the bootstrap standard deviation metric as an estimate of state uncertainty and show that statistically significant signals in pathogen response can be recovered in the reduced state space constructed with the selected genes.
|
|
11:00-11:20, Paper WeA20.4 | Add to My Program |
Data-Based Extended Moving Horizon Estimation for MISO Anesthesia Dynamics |
|
Moussa, Kaouther | INSA Hauts-De-France and LAMIH |
Aubouin--Pairault, Bob | Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA Lab, 38000 Gren |
Alamir, Mazen | CNRS / University of Grenoble |
Dang, Thao | VERIMAG |
Keywords: Biomedical, Estimation, Optimization
Abstract: This paper presents an extended moving horizon observer to estimate both the states and the pharmacodynamic(PD) parameters of an anesthesia model, based on real data. The inputs of this model are the injection rates of Propofol and Remifentanil. The states represent the concentration of the anesthetic agents in different compartments of the human body (muscles, fat, blood) and in the effect site. The considered output is the Bispectral index (BIS) which is derived from the electroencephalogram (EEG). The observer is designed such that the parameters are estimated during the anesthesia induction phase, and then almost frozen for the rest of the surgery. The estimator is validated on real data that were extracted from the VitalDB database (Lee et al., 2022).
|
|
11:20-11:40, Paper WeA20.5 | Add to My Program |
Passivity-Based Hybrid Systems Approach to Repetitive Learning Control for FES-Cycling with Control Input Saturation |
|
Sweatland, Hannah | University of Florida |
Griffis, Emily | University of Florida |
Duenas, Victor H | Syracuse University |
Dixon, Warren E. | University of Florida |
Keywords: Biomedical, Stability of hybrid systems, Stability of nonlinear systems
Abstract: Functional electrical stimulation (FES)-cycling is an effective method of rehabilitation for people with neuromuscular disorders. Muscle stimulation and electric motor inputs are designed to complement the riders volitional pedaling, but open challenges remain in the analysis of the stability and robustness of the human-machine system under the influence of switching between muscle and motor inputs. Discontinuous switching between muscle stimulation inputs and motor input motivates the use of a hybrid systems analysis, reducing gain conditions compared to a switched systems analysis and yielding robustness to disturbances. In this paper, repetitive learning control (RLC)-based feedforward terms for each muscle group and electric motor are designed to improve cadence tracking and reduce high-gain feedback terms that can cause chattering effects. Muscle stimulation limits are systematically considered for the safety and comfort of the rider, and a cadence controller is designed integrating RLC and robust control terms to account for input saturation. A passivity-based analysis ensures the hybrid system is flow output strictly passive from the riders volitional effort to the tracking error output. Moreover, the position and cadence tracking errors are shown to asymptotically converge based on a Lyapunov-like stability analysis.
|
|
11:40-12:00, Paper WeA20.6 | Add to My Program |
A Novel Viral Infection Model to Guide Optimal Mpox Treatment |
|
de Jong, Maarten N. | Delft University of Technology |
Calà Campana, Francesca | University of Trento |
Li, Pengfei | Erasmus MC - University Medical Center, Rotterdam |
Pan, Qiuwei | Erasmus MC - University Medical Center, Rotterdam |
Giordano, Giulia | University of Trento |
Keywords: Healthcare and medical systems, Biological systems, Systems biology
Abstract: In 2022, worldwide mpox outbreaks have called attention to mpox virus infection and treatment opportunities using the drugs cidofovir and tecovirimat, which target different stages of in-host viral proliferation, respectively production and shedding. We propose a new model of in-host viral infection dynamics that distinguishes between the two stages, so as to explore the distinct effects of the two drugs, and we analyse the model properties and behaviour. Reducing the model order via timescale separation is shown to lead to the classical target-cell limited model, with a lumped viral proliferation rate depending on both production and shedding. We explicitly introduce the effect of the two drugs and we exemplify how to formulate and solve an optimal control problem that leverages the model dynamics to schedule optimal combined treatments.
|
|
WeA21 Regular Session, Orchid Junior 4311 |
Add to My Program |
Constrained Control I |
|
|
Chair: Panagou, Dimitra | University of Michigan, Ann Arbor |
Co-Chair: You, Keyou | Tsinghua University |
|
10:00-10:20, Paper WeA21.1 | Add to My Program |
Enhancements on a Saturated Control for Stabilizing a Quadcopter: Adaptive and Robustness Analysis in the Flat Output Space |
|
Do, Huu-Thinh | Grenoble Institute of Technology (Grenoble INP) |
Blanchini, Franco | Univ. Degli Studi Di Udine |
Prodan, Ionela | Grenoble Institute of Technology (Grenoble INP) - Esisar |
Keywords: Constrained control, Adaptive control, Feedback linearization
Abstract: This paper extends our previous study on an explicit saturated control for a quadcopter, which ensures both constraint satisfaction and stability thanks to the linear representation of the system in the flat output space. The novelty here resides in the adaptivity of the controllers gain to enhance the systems performance without exciting its parasitic dynamics and avoid lavishing the input actuation with excessively high gain parameters. Moreover, we provide a thorough robustness analysis of the proposed controller when additive disturbances are affecting the system behavior. Finally, simulation and experimental tests validate the performances of the proposed controller.
|
|
10:20-10:40, Paper WeA21.2 | Add to My Program |
Adaptive Safe Backstepping for Collaborative Levitation Control of Maglev Trains with Unknown Mass |
|
Zhang, Tianbo | Tsinghua University |
Li, Xingchen | Tsinghua University |
Zhang, TiYao | Beijing Jiaotong University |
You, Keyou | Tsinghua University |
Keywords: Constrained control, Adaptive control
Abstract: Maglev trains are levitated by magnetic forces to maintain a desired air gap between guideways and magnets. Although every two magnets are mechanically coupled via a levitation bogie, the existing controllers are usually designed only for each individual magnet, which may result in unstable air gaps or levitation failures. Differently, this paper proposes an adaptive safe backstepping scheme to collaboratively control two magnets of the same bogie. In particular, safety constraints are introduced to adaptive backstepping control via quadratic programs, which ensures the air gap within a permissible range and significantly reduces overshoots. Finally, simulation results validate the effectiveness of the proposed collaborative controller.
|
|
10:40-11:00, Paper WeA21.3 | Add to My Program |
Adaptation for Validation of Consolidated Control Barrier Functions |
|
Black, Mitchell | Toyota Motor North America |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Autonomous robots, Nonlinear systems
Abstract: We develop a novel adaptation-based technique for safe control design in the presence of multiple state constraints. Specifically, we introduce an approach for synthesizing any number of candidate control barrier functions (CBFs), each encoding a different state constraint, into one consolidated CBF (C-CBF) candidate. We then propose a parameter adaptation law for the weights of the C-CBF's constituent functions such that its controllable dynamics are non-vanishing. We prove that the adaptation law certifies the consolidated CBF candidate as valid for a class of nonlinear, control-affine, multi-agent systems, which permits its use in a quadratic program based control law. We highlight the success of our approach in simulation on a multi-robot goal-reaching problem in a warehouse environment, and further demonstrate its efficacy via a laboratory study with an AION ground rover operating amongst other vehicles behaving both aggressively and conservatively.
|
|
11:00-11:20, Paper WeA21.4 | Add to My Program |
Anti-Windup Coordination Strategy Around a Fair Equilibrium in Resource Sharing Networks |
|
Agner, Felix | Lund University |
Kergus, Pauline | CNRS |
Rantzer, Anders | Lund University |
Tarbouriech, Sophie | LAAS-CNRS |
Zaccarian, Luca | LAAS-CNRS |
Keywords: Constrained control, Energy systems, Distributed control
Abstract: We coordinate interconnected agents where the control input of each agent is limited by the control input of others. In that sense, the systems have to share a limited resource over a network. Such problems can arise in different areas and it is here motivated by a district heating example. When the shared resource is insufficient for the combined need of all systems, the resource will have to be shared in an optimal fashion. In this scenario, we want the systems to automatically converge to an optimal equilibrium. The contribution of this paper is the proposal of a control architecture where each separate system is controlled by a local PI controller. The controllers are then coordinated through a global rank-one anti- windup signal. It is shown that the equilibrium of the proposed closed-loop system minimizes the infinity-norm of stationary state deviations. A proof of linear-domain passivity is given, and a numerical example highlights the benefits of the proposed method with respect to the state-of-the-art.
|
|
11:20-11:40, Paper WeA21.5 | Add to My Program |
Safety-Critical Control for Systems with Impulsive Actuators and Dwell Time Constraints |
|
Breeden, Joseph | University of Michigan, Ann Arbor |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Hybrid systems, Aerospace
Abstract: This paper presents extensions of control barrier function (CBF) and control Lyapunov function (CLF) theory to systems wherein all actuators cause impulsive changes to the state trajectory, and can only be used again after a minimum dwell time has elapsed. These rules define a hybrid system, wherein the controller must at each control cycle choose whether to remain on the current state flow or to jump to a new trajectory. We first derive a sufficient condition to render a specified set forward invariant using extensions of CBF theory. We then derive related conditions to ensure asymptotic stability in such systems, and apply both conditions online in an optimization-based control law with aperiodic impulses. We simulate both results on a spacecraft docking problem with multiple obstacles.
|
|
11:40-12:00, Paper WeA21.6 | Add to My Program |
On the Relationship between Control Barrier Functions and Projected Dynamical Systems |
|
Delimpaltadakis, Giannis | Eindhoven University of Technology |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Constrained control, Hybrid systems, Stability of nonlinear systems
Abstract: In this paper, we study the relationship between systems controlled via Control Barrier Function (CBF) approaches and a class of discontinuous dynamical systems, called Projected Dynamical Systems (PDSs). In particular, under appropriate assumptions, we show that the vector field of CBF-controlled systems is a Krasovskii-like perturbation of the set-valued map of a differential inclusion, that abstracts PDSs. This result provides a novel perspective to analyze and design CBF-based controllers. Specifically, we show how, in certain cases, it can be employed for designing CBF-based controllers that, while imposing safety, preserve asymptotic stability and do not introduce undesired equilibria or limit cycles. Finally, we briefly discuss about how it enables continuous implementations of certain projection-based controllers, that are gaining increasing popularity.
|
|
WeA22 Regular Session, Orchid Junior 4212 |
Add to My Program |
Stochastic Optimal Control I |
|
|
Chair: Bhatnagar, Shalabh | Indian Institute of Science |
Co-Chair: Chakravorty, Suman | Texas A&M University |
|
10:00-10:20, Paper WeA22.1 | Add to My Program |
Convex Q Learning in a Stochastic Environment |
|
Lu, Fan | University of Florida |
Meyn, Sean P. | Univ. of Florida |
Keywords: Stochastic optimal control, Adaptive control, Machine learning
Abstract: The paper introduces the first formulation of convex Q-learning for Markov decision processes with function approximation. The algorithms and theory rest on a relaxation of a dual of Manne's celebrated linear programming characterization of optimal control. The main contributions firstly concern properties of the relaxation, described as a deterministic convex program: we identify conditions for a bounded solution, a significant connection between the solution to the new convex program, and the solution to standard Q-learning with linear function approximation. The second set of contributions concern algorithm design and analysis: (i) A direct model-free method for approximating the convex program for Q-learning shares properties with its ideal. In particular, a bounded solution is ensured subject to a simple property of the basis functions; (ii) The proposed algorithms are convergent and new techniques are introduced to obtain the rate of convergence in a mean-square sense; (iii) The approach can be generalized to a range of performance criteria, and it is found that variance can be reduced by considering ``relative'' dynamic programming equations; (iv) The theory is illustrated with an application to a classical inventory control problem.
|
|
10:20-10:40, Paper WeA22.2 | Add to My Program |
Signalling of Information Via Coding in a Series Network of Unstable Stochastic Dynamical Control Systems |
|
Charalambous, Charalambos D. | University of Cyprus |
Kourtellaris, Christos | University of Cyprus |
Tzortzis, Ioannis | University of Cyprus |
Keywords: Stochastic optimal control, Control over communications, Networked control systems
Abstract: We address the asymptotic problem of signalling information from one controller to another controller, in a series network, consisting of control system 1 (CS-1) and control system 2 (CS-2), as shown in the Figure, first analyzed in [1] for finite-horizon. Controller 2 of CS-2 has access to feedback information from its output, while controller 1 of CS-1 does not have access to feedback information from its output. Under suitable detectability and stabilizability conditions of matrix algebraic Riccati equations (AREs), it is shown that, if the rate of generating information by CS-1 is below the asymptotic control-coding (CC) capacity of CS-2, then we can synthesize, i) a controller-encoder for CS-2 that simultaneously controls the CS-2 and encodes the state of the CS-1, and operates at the CC capacity of CS-2, ii) a decoder for CS-2 that is optimal with respect to a mean-square error (MSE) criterion, and iii) a controller for CS-1, which acts on the decoder output, and it is optimal with respect to the pay-off of CS-1. Compared to [1], this paper includes bounds to MSE and Error probability of communicating digital messages.
|
|
10:40-11:00, Paper WeA22.3 | Add to My Program |
Learning to Control under Uncertainty with Data-Based Iterative Linear Quadratic Regulator |
|
Wang, Ran | Texas A&M University |
Goyal, Raman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Stochastic optimal control, Iterative learning control, Nonlinear output feedback
Abstract: This paper studies the learning-to-control problem under process and sensing uncertainties for dynamical systems. In our previous work, we developed a data-based generalization of the iterative linear quadratic regulator (iLQR) to design closed-loop feedback control for high-dimensional dynamical systems with partial state observation. This method required perfect simulation rollouts which are not realistic in real applications. In this work, we briefly introduce this method and explore its efficacy under process and sensing uncertainties. We prove that in the fully observed case where the system dynamics are corrupted with noise but the measurements are perfect, it still converges to the global minimum. However, in the partially observed case where both process and measurement noise exist in the system, this method converges to a biased ``optimum". Thus multiple rollouts need to be averaged to retrieve the true optimum. The analysis is verified in two nonlinear robotic examples simulated in the above cases.
|
|
11:00-11:20, Paper WeA22.4 | Add to My Program |
Stochastic Nonlinear Control Via Finite-Dimensional Spectral Dynamic Embedding |
|
Ren, Tongzheng | University of Texas, Austin |
Ren, Zhaolin | Harvard University |
Li, Na | Harvard University |
Dai, Bo | Google Brain & Georgia Tech |
Keywords: Stochastic optimal control, Iterative learning control, Optimal control
Abstract: Optimal control is notoriously difficult for stochastic nonlinear systems. Ren et.al. 2022 introduced Spectral Dynamics Embedding for developing reinforcement learning methods for controlling an unknown system. It uses an infinite-dimensional feature to linearly represent the state-value function and exploits finite-dimensional truncation approximation for practical implementation. However, the finite-dimensional approximation properties in control have not been investigated even when the model is known. In this paper, we provide a tractable stochastic nonlinear control algorithm that exploits the nonlinear dynamics upon the finite-dimensional feature approximation, Spectral Dynamics Embedding Control (SDEC), with an in-depth theoretical analysis to characterize the approximation error induced by the finite-dimension truncation and statistical error induced by finite-sample approximation in both policy evaluation and policy optimization. We also empirically test the algorithm and compare the performance with Koopman-based methods and iLQR methods on the pendulum swingup problem.
|
|
11:20-11:40, Paper WeA22.5 | Add to My Program |
Large-Population Optimal Control with Mixed Agents: The Multi-Scale Analysis and Decentralized Control |
|
Huang, Minyi | Carleton University |
Nguyen, Son | University of Puerto Rico, Rio Piedras |
Keywords: Stochastic optimal control, Large-scale systems, Decentralized control
Abstract: We consider a large-population optimal control problem involving a major agent and a large number of minor agents. By starting with a centralized optimal control problem, we employ a re-scaling method to derive decentralized control laws. This re-scaling method is further used to obtain a tight upper bound of O(1/N) for the performance loss resulting from decentralized control. This improves upon known results of O(1/sqrt{N}) in the literature for similar models.
|
|
11:40-12:00, Paper WeA22.6 | Add to My Program |
Actor-Critic or Critic-Actor? a Tale of Two Time Scales |
|
Bhatnagar, Shalabh | Indian Institute of Science |
Borkar, Vivek S. | Indian Institute of Technology |
Guin, Soumyajit | Indian Institute of Science, Bengaluru |
Keywords: Stochastic optimal control, Machine learning, Iterative learning control
Abstract: We revisit the standard formulation of tabular actor-critic algorithm as a two time-scale stochastic approximation with value function computed on a faster time-scale and policy computed on a slower time-scale. This emulates policy iteration. We begin by observing that reversal of the time scales will in fact emulate value iteration and is a legitimate algorithm. We provide a proof of convergence and compare the two empirically with and without function approximation (with both linear and nonlinear function approximators) and observe that our proposed critic-actor algorithm performs on par with actor-critic in terms of both accuracy and computational effort.
|
|
WeA23 Regular Session, Orchid Junior 4211 |
Add to My Program |
Cyber-Physical Security I |
|
|
Chair: Chen, Tongwen | University of Alberta |
Co-Chair: Ishii, Hideaki | Tokyo Institute of Technology |
|
10:00-10:20, Paper WeA23.1 | Add to My Program |
Model Extraction Attacks against Reinforcement Learning Based Controllers |
|
Sajid, Momina | University of California, Irvine |
Shen, Yanning | UCI |
Shoukry, Yasser | University of California, Irvine |
Keywords: Cyber-Physical Security, Attack Detection, Machine learning
Abstract: We introduce the problem of model-extraction attacks in cyber-physical systems in which an attacker attempts to estimate (or extract) the feedback controller of the system. Extracting (or estimating) the controller provides an unmatched edge to attackers since it allows them to predict the future control actions of the system and plan their attack accordingly. Hence, it is important to understand the ability of the attackers to perform such an attack. In this paper, we focus on the setting when a Deep Neural Network (DNN) controller is trained using Reinforcement Learning (RL) algorithms and is used to control a stochastic system. We play the role of the attacker that aims to estimate such an unknown DNN controller, and we propose a two-phase algorithm. In the first phase, also called the offline phase, the attacker uses side-channel information about the RL-reward function and the system dynamics to identify a set of candidate estimates of the unknown DNN. In the second phase, also called the online phase, the attacker observes the behavior of the unknown DNN and uses these observations to shortlist the set of final policy estimates. We provide theoretical analysis of the error between the unknown DNN and the estimated one. We also provide numerical results showing the effectiveness of the proposed algorithm.
|
|
10:20-10:40, Paper WeA23.2 | Add to My Program |
Physical Backdoor Trigger Activation of Autonomous Vehicle Using Reachability Analysis |
|
Li, Wenqing | New York University Abu Dhabi |
Wang, Yue | New York University |
Shafique, Muhammad | New York University Abu Dhabi |
Jabari, Saif | New York University Abu Dhabi |
Keywords: Cyber-Physical Security, Autonomous vehicles, Traffic control
Abstract: Recent studies reveal that Autonomous Vehicles (AVs) can be manipulated by hidden backdoors, causing them to perform harmful actions when activated by physical triggers. However, it is still unclear how these triggers can be activated while adhering to traffic principles. Understanding this vulnerability in a dynamic traffic environment is crucial. This work addresses this gap by presenting physical trigger activation as a reachability problem of controlled dynamic system. Our technique identifies security-critical areas in traffic systems where trigger conditions for accidents can be reached, and provides intended trajectories for how those conditions can be reached. Testing on typical traffic scenarios showed the system can be successfully driven to trigger conditions with near 100% activation rate. Our method benefits from identifying AV vulnerability and enabling effective safety strategies.
|
|
10:40-11:00, Paper WeA23.3 | Add to My Program |
Stealthy Linear Deception Attacks against Kalman Filtering with Partially Secured Measurements |
|
Zhou, Jing | University of Alberta |
Chen, Tongwen | University of Alberta |
Keywords: Cyber-Physical Security, Computer/Network Security, Estimation
Abstract: In this paper, we investigate an optimal strategy for malicious agents to compromise remote state estimators where only a portion of the transmitted packets are secured. First, the analysis of the performance evolution and stealthiness properties of innovation-based linear attacks that can compromise unsafe transmission channels is provided. An optimal attack policy is then derived by numerically solving an optimization problem step by step. Different from the scenario where all links are vulnerable to cyber-attacks, the existence of secured channels poses a tighter stealthiness constraint and thus can significantly reduce the worst-case attack impact. Additionally, it is shown that the well-studied flipping-sign-attack in existing work cannot remain stealthy. Finally, a numerical example and comparative studies are included to verify the effectiveness of the proposed method.
|
|
11:00-11:20, Paper WeA23.4 | Add to My Program |
Consensus Resiliency of Stochastic Observation Via Ring Lattices of Sensors Facing Byzantine Attacks |
|
Peng, Haotian | Shanghai Jiao Tong University |
Li, Yuke | Peking University |
Jin, Li | Shanghai Jiao Tong University |
Keywords: Cyber-Physical Security, Game theory, Fault tolerant systems
Abstract: We consider the observation of a random, binary environment state via a set of sensing nodes connected through a ring lattice. Each node obtains a correct observation with a positive probability and broadcasts its observation to its neighbors. A system operator selects a consensus threshold for the number of consistent observations, and a consensus is reached when any node has accumulated sufficient consistent observations. A Byzantine attacker can manipulate a certain number of nodes to broadcast misleading information, and thus prohibit a correct consensus. We formulate this problem as a zero-sum game and analyze the equilibria. We show that the attacker has a dominant strategy for selecting the nodes to manipulate and the information to broadcast/block. We show that, unless the attacker's budget is abundant, the system operator can select an optimal consensus threshold to balance between the chance of a correct consensus and the risk of a wrong consensus. We also use the equilibrium structure to characterize the network resiliency, i.e., the minimal number of Byzantine nodes that would eliminate the chance of a correct consensus, given the size and connectivity of the ring lattice. The results are relevant for hardware surveillance, infrastructure inspection, disaster response, etc.
|
|
11:20-11:40, Paper WeA23.5 | Add to My Program |
Optimal Linear Attack in Cyber-Physical Systems with Periodical Detection |
|
Qi, Jia | Shanghai Jiao Tong University |
Fang, Chongrong | Shanghai Jiao Tong University |
He, Jianping | Shanghai Jiao Tong University |
Keywords: Cyber-Physical Security, Networked control systems, Kalman filtering
Abstract: Security issues are of significant importance for cyber-physical systems (CPS), where the attack design is a major concern. Most related studies on attack design implicitly consider that the control period and detection period are the same. However, the two periods could be different in practical systems with remote detection such as SCADA systems, which could lead to new vulnerabilities for attackers. In this paper, we consider the design of innovation-based linear attack strategies for CPS when the control period and detection period are inconsistent. Specifically, we propose an attack framework that consists of attack strategies for detection and non-detection instants under the period discrepancy. On this basis, we design the optimal stealthy innovation-based linear attack strategies for state estimation and LQG control to maximize the estimation error or control cost, respectively. Simulations are given to demonstrate the effectiveness of the proposed attack strategies.
|
|
11:40-12:00, Paper WeA23.6 | Add to My Program |
Effects of Quantization on Zero-Dynamics Attacks to Closed-Loop Sampled-Data Control Systems |
|
Kang, Xile | Tokyo Institute of Technology |
Ishii, Hideaki | Tokyo Institute of Technology |
Keywords: Cyber-Physical Security, Networked control systems, Sampled-data control
Abstract: This paper focuses on cyber-security issues of networked control systems in closed-loop forms from the perspective of quantized sampled-data systems. As sampling can introduce non-minimum phase zeros in discretized systems, we consider zero dynamics attacks, which is a class of false data injection attacks. Quantization of control inputs disables such attacks to be made exactly, resulting in certain errors in the system output. Specifically, we characterize a trade-off relation between attack performance and stealthiness, and then show that the attacker can reduce the output error with a modified approach by considering the quantization error of the attack signal. We provide a numerical example to demonstrate the effectiveness of the proposed approaches.
|
|
WeA24 Invited Session, Orchid Main 4201AB |
Add to My Program |
Event-Triggered and Self-Triggered Control I |
|
|
Chair: Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Co-Chair: Nesic, Dragan | University of Melbourne |
Organizer: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Hirche, Sandra | Technische Universität München |
Organizer: Nowzari, Cameron | George Mason University |
|
10:00-10:20, Paper WeA24.1 | Add to My Program |
Event-Triggered Robust Stabilization by Using Fast-Varying Square Wave Dithers (I) |
|
Zhang, Jin | Shanghai University |
Zhang, Zhihao | Shanghai University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Stability of linear systems, Robust control
Abstract: This paper is concerned with event-triggered robust static output-feedback stabilization of the second-order linear uncertain systems by a fast-varying square wave with high gain. Recently, a constructive time-delay approach for designing such a fast-varying output-feedback controller was suggested by using continuous measurements. In the present paper, we employ an event-trigger (ET) based on switching approach that determines the measurement transmission instants for this design. For the resulting switching system, we construct appropriate coordinate transformations that cancel the high gains and apply the time-delay approach to periodic averaging of the system in new coordinates. By employing appropriate Lyapunov functionals, we derive linear matrix inequalities (LMIs) for finding an efficient upper bound on the square wave frequency that guarantees the stability of the original systems. Numerical examples illustrate the efficiency of the method.
|
|
10:20-10:40, Paper WeA24.2 | Add to My Program |
Output-Based Event-Holding Control in Presence of Measurement Noise (I) |
|
Scheres, Koen | Eindhoven University of Technology |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Nesic, Dragan | University of Melbourne |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Hybrid systems, Networked control systems, Control over communications
Abstract: We present rules to stabilize the origin of a networked system, where data exchanges between the plant and the controller only occur when an output-dependent inequality has been satisfied for a given amount of time. This strategy, called Event-Holding Control (EHC), differs from time-regularized event-triggered control (ETC) techniques, which generate transmissions as soon as a triggering condition is verified and the time elapsed since the last transmission is larger than a given bound. Indeed, the clock involved in EHC is not running continuously after each transmission instant, but only when a criterion is verified. We propose an output-based design of these triggering mechanisms that are robust to additive measurement noise and ensure an input-to-state stability (ISS) property. This EHC scheme naturally has a positive lower bound on the transmission interval. Additionally, we show via an example that, in presence of measurement noise, Zeno-like behavior, where events are generated near the minimum inter-event time consistently, may occur when the system is close to the attractor. We introduce space-regularization to mitigate this issue, resulting in an input-to-state practical stability (ISpS) property rather than ISS.
|
|
10:40-11:00, Paper WeA24.3 | Add to My Program |
Forward Invariance-Based Hybrid Control Using Uncertified Controllers (I) |
|
Wintz, Paul K. | University of California, Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Supervisory control, Hybrid systems, Fault accomodation
Abstract: For a constrained nonlinear control system, an automated supervisor is proposed that determines switching between a barrier functioncertified controller and an uncertified controller. The switching strategy allows for properties of the uncertified controller to be exploited while preserving the forward invariance that is guaranteed by the barrier function for the certified controller. Tunable threshold functions determine regions of the state space where the supervisor switches between controllers. Conditions are given to prevent chattering by establishing a positive minimum time between switches. An example illustrates achieving forward invariance despite using an uncertified MPC controller with delayed computations.
|
|
11:00-11:20, Paper WeA24.4 | Add to My Program |
Intermittent Safety Filters for Event-Triggered Safety Maneuvers with Application to Satellite Orbit Transfers (I) |
|
Ong, Pio | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Discrete event systems, Hybrid systems, Nonlinear systems
Abstract: In balancing safety with the nominal control objectives, e.g., stabilization, it is desirable to reduce the time period when safety filters are in effect. Inspired by traditional spacecraft maneuvers, and with the ultimate goal of reducing the duration when safety is of concern, this paper proposes an event-triggered control framework with switching state-based triggers. Our first trigger in the scheme monitors safety constraints encoded by barrier functions, and thereby ensures safety without the need to alter the nominal controller---and when the boundary of the safety constraint is approached, the controller drives the system to the region where control actions are not needed. The second trigger condition determines if the safety constraint has improved enough for the success of the first trigger. We begin by motivating this framework for impulsive control systems, e.g., a satellite orbiting an asteroid. We then expand the approach to more general nonlinear system through the use of safety filtered controllers. Simulation results demonstrating satellite orbital maneuvers illustrate the utility of the proposed event-triggered framework.
|
|
11:20-11:40, Paper WeA24.5 | Add to My Program |
Event-Triggered Sensor Scheduling for Remote State Estimation with Error-Detecting Code |
|
Zhong, Yuxing | The Hong Kong University of Science and Technology |
Tang, Jiawei | Hong Kong University of Science and Technology |
Yang, Nachuan | Hong Kong University of Science and Technology |
Shi, Dawei | Beijing Institute of Technology |
Shi, Ling | Hong Kong University of Science and Technology |
Keywords: Discrete event systems, Kalman filtering, Networked control systems
Abstract: This paper addresses the problem of remote state estimation subject to packet dropouts, focusing on the use of an event-triggered sensor scheduler to conserve communication resources. However, packet dropouts introduce significant challenges, as the remote estimator cannot distinguish between packet loss caused by poor channel conditions and packet loss due to the event trigger. To overcome this issue, we propose a novel formulation that incorporates error-detecting codes. We prove that the Gaussian property, commonly assumed in the literature, does not hold in this setup, and instead, the system state follows an extended Gaussian mixture model (GMM). We present an exact minimum mean-squared error (MMSE) estimator and an approximate estimator, which significantly reduces algorithm complexity without sacrificing performance. Our simulation results show that the approximate estimator achieves nearly the same performance as the exact estimator while requiring much less computation time. Moreover, the proposed event trigger outperforms existing schedulers in terms of estimation accuracy.
|
|
11:40-12:00, Paper WeA24.6 | Add to My Program |
Event-Triggered Transmission Policies for Nonlinear Control Systems Over Erasure Channels |
|
Satheeskumar Varma, Vineeth | CNRS |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Quevedo, Daniel E. | Queensland University of Technology |
Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Keywords: Networked control systems, Stochastic systems
Abstract: We investigate the scenario where a controller communicates with a nonlinear plant via a wireless erasure channel. We present an event-based control strategy to stabilize the plant while sporadically using the unreliable wireless network. In particular, control packets may be lost at any time with a certain probability. Consequently, stability is ensured in a stochastic sense. We then compare the proposed event-based policy with a baseline policy that transmits according to the age of information, i.e., the time elapsed since the last successful reception. For any given baseline policy, we show how to design an event-based policy that ensures the same guaranteed control performance while leading, on average, to a strictly smaller channel utilization. Numerical simulations suggest that the achieved channel utilization may in fact be significantly smaller.
|
|
WeA25 Invited Session, Lotus Junior 4DE |
Add to My Program |
Contraction Theory for Analysis, Synchronization and Regulation I |
|
|
Chair: Astolfi, Daniele | Cnrs - Lagepp |
Co-Chair: Bullo, Francesco | Univ of California at Santa Barbara |
Organizer: Astolfi, Daniele | Cnrs - Lagepp |
Organizer: Bullo, Francesco | Univ of California at Santa Barbara |
|
10:00-10:20, Paper WeA25.1 | Add to My Program |
Euclidean Contractivity of Neural Networks with Symmetric Weights |
|
Centorrino, Veronica | Scuola Superiore Meridionale |
Gokhale, Anand | University of California, Santa Barbara |
Davydov, Alexander | University of California, Santa Barbara |
Russo, Giovanni | University of Salerno |
Bullo, Francesco | Univ of California at Santa Barbara |
Keywords: Stability of nonlinear systems, Neural networks, Optimization
Abstract: This paper investigates stability conditions of continuous-time Hopfield and firing-rate neural networks by leveraging contraction theory. First, we present a number of useful general algebraic results on matrix polytopes and products of symmetric matrices. Then, we give sufficient conditions for strong and weak Euclidean contractivity, i.e., contractivity with respect to the ell_2 norm, of both models with symmetric weights and (possibly) non-smooth activation functions. Our contraction analysis leads to contraction rates which are log-optimal in almost all symmetric synaptic matrices. Finally, we use our results to propose a firing-rate neural network model to solve a quadratic optimization problem with box constraints.
|
|
10:20-10:40, Paper WeA25.2 | Add to My Program |
Parametrization of Linear Controllers for P-Dominance |
|
Kawano, Yu | Hiroshima University |
Sato, Yusuke | Hiroshima University |
Wada, Nobutaka | Hiroshima University |
Keywords: Stability of nonlinear systems, Variational methods, LMIs
Abstract: Recently, the concept of p-dominance has been proposed as a unified framework to study rich behaviors of nonlinear systems. In this paper, we consider finding a set of linear dynamic output feedback controllers rendering the closed-loop systems p-dominant. We first derive an existence condition. Based on this condition, we then provide a parametrization of controllers. For Lure's systems, the proposed method can be applied only by solving a finite family of linear matrix inequalities, which is illustrated by achieving multi-stabilization and stabilization of a limit cycle.
|
|
10:40-11:00, Paper WeA25.3 | Add to My Program |
Range-Only Bearing Estimator for Localization and Mapping |
|
Marcantoni, Matteo | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Bunte, Kerstin | University of Groningen |
Keywords: Observers for nonlinear systems, Estimation, Autonomous vehicles
Abstract: Navigation and exploration within unknown environments are typical examples in which simultaneous localization and mapping (SLAM) algorithms are applied. When mobile agents deploy only range sensors without bearing information, the agents must estimate the bearing using the online distance measurement for the localization and mapping purposes. In this paper, we propose a scalable dynamic bearing estimator to obtain the relative bearing of the static landmarks in the local coordinate frame of a moving agent in real-time. Using contraction theory, we provide convergence analysis of the proposed range-only bearing estimator and present upper and lower-bound for the estimator gain. Numerical simulations demonstrate the effectiveness of the proposed method.
|
|
11:00-11:20, Paper WeA25.4 | Add to My Program |
A Small-Gain Approach to Incremental Input-To-State Stability Analysis of Hybrid Integrator-Gain Systems |
|
van den Eijnden, Sebastiaan | Eindhoven University of Technology |
Heertjes, Marcel | Eindhoven University of Technology |
Nijmeijer, Hendrik | Eindhoven University of Technology |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Switched systems, Stability of hybrid systems, Hybrid systems
Abstract: Incremental input-to-state stability plays an important role in the analysis of nonlinear systems, as it opens up the possibility for accurate performance characterizations beyond classical approaches. In this paper, we are interested in deriving conditions for incremental stability of a specific class of discontinuous dynamical systems containing a so-called hybrid integrator. Recently, it was shown that hybrid integrators have the potential for overcoming fundamental performance limitations of linear time-invariant control, thereby making them interesting for use in, e.g., high-precision motion control applications. The main contribution of this paper is to show that these hybrid integrators have incremental input-to-state stability properties, and that, under an incremental small-gain condition, the feedback interconnection of a hybrid integrator and a linear time-invariant plant is incrementally input-to-state stable.
|
|
11:20-11:40, Paper WeA25.5 | Add to My Program |
Nonlinear Singular Switched Systems in Discrete-Time: Solution Theory and Incremental Stability under Restricted Switching Signals (I) |
|
Sutrisno, Sutrisno | University of Groningen and Diponegoro Unversity |
Yin, Hao | University of Groningen |
Trenn, Stephan | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Keywords: Switched systems, Differential-algebraic systems, Stability of nonlinear systems
Abstract: In this article the solvability analysis of discrete-time nonlinear singular switched systems with restricted switching signals is studied. We provide necessary and sufficient conditions for the solvability analysis under fixed switching signals and fixed mode sequences. The so-called surrogate systems (ordinary systems that have the equivalent behavior to the original switched systems) are introduced for solvable switched systems. Incremental stability, which ensures that all solution trajectories converge with each other, is then studied by utilizing these surrogate systems. Sufficient (and necessary) conditions are provided for this stability analysis using single and switched Lyapunov function approaches.
|
|
11:40-12:00, Paper WeA25.6 | Add to My Program |
Semicontraction and Synchronization of Kuramoto-Sakaguchi Oscillator Networks |
|
Delabays, Robin | University of Applied Sciences and Arts of Western Switzerland / |
Bullo, Francesco | Univ of California at Santa Barbara |
Keywords: Stability of nonlinear systems
Abstract: This paper studies the celebrated Kuramoto-Sakaguchi model of coupled oscillators adopting two recent concepts. First, we consider appropriately-defined subsets of the n-torus called winding cells. Second, we analyze the semicontractivity of the model, i.e., the property that the distance between trajectories decreases when measured according to a seminorm. This paper establishes the local semicontractivity of the Kuramoto-Sakaguchi model, which is equivalent to the local contractivity for the reduced model. The reduced model is defined modulo the rotational symmetry. The domains where the system is semicontracting are convex phase-cohesive subsets of winding cells. Our sufficient conditions and estimates of the semicontracting domains are less conservative and more explicit than in previous works. Based on semicontraction on phase-cohesive subsets, we establish the "at most uniqueness" of synchronous states within these domains, thereby characterizing the multistability of this model.
|
|
WeA26 Regular Session, Orchid Main 4301AB |
Add to My Program |
Delay Systems |
|
|
Chair: Sahoo, Soumya Ranjan | Indian Institute of Technology Kanpur |
Co-Chair: Bekiaris-Liberis, Nikolaos | Technical University of Crete |
|
10:00-10:20, Paper WeA26.1 | Add to My Program |
Delay Feedback Active Inference Based on Predicted States for Uncertain Systems with Input Delay |
|
Ji, Mingyue | Northwestern Polytechnical University |
Lyu, Yang | Northwestern Polytechnical University |
Pan, Kunpeng | Northwestern Polytechnical University |
Zhang, Xiaoxuan | Northwestern Polytechnical University |
Pan, Quan | Northwestern Polytechnical University |
Li, Yang | Northwestern Polytechnical University |
Zhao, Chunhui | Northwestern Polytechnical University |
Hu, Jinwen | Northwestern Polytechnical University |
Keywords: Delay systems, Agents-based systems, Adaptive control
Abstract: Active inference (AIF) as a comprehensive theory has been proven that it is promising in state estimation and adaptive control of uncertain systems. However, the input delay in the controller was ignored in the normal framework. When taking input delay into consideration in the uncertain system, the optimal estimation state in the normal AIF differs greatly from the real state of the system due to the accumulation of the delay effect. Therefore, delay feedback active inference(D- AIF) is proposed in this paper. Different from normal AIF, a predictive state based on the delay state becomes the expectation of the state of normal AIF. Meanwhile, the expectation of the controller becomes an epitaxial delayed feedback Proportional- Integral (PI) control. The variational free energy (VFE) is extended by adding a quadratic of control consumption, In order to minimize the variational free energy, the model uncertainty and measurement uncertainty are compensated by approximate Gaussian distributions. Surprisingly, the state estimation does not depend on the given target state in D-AIF. When AIF and D-AIF are applied for trajectory tracking of an unmanned aircraft with input delay, it can be seen from the results that delay feedback active inference control (D-AIFC) can accurately estimate the current state and has stronger robustness when dealing with sudden disturbance than active inference control (AIFC).
|
|
10:20-10:40, Paper WeA26.2 | Add to My Program |
Nonlinear Predictor-Feedback Cooperative Adaptive Cruise Control |
|
Bekiaris-Liberis, Nikolaos | Technical University of Crete |
Keywords: Delay systems, Autonomous vehicles, Stability of nonlinear systems
Abstract: We construct a nonlinear predictor-feedback Cooperative Adaptive Cruise Control (CACC) design for homogeneous vehicular platoons subject to actuators delays, which achieves: i) positivity of vehicles' speed and spacing states, ii) mathcal{L}_{infty} string stability of the platoon, iii) stability of each individual vehicular system, and iv) tracking of a constant reference speed (dictated by the leading vehicle) and spacing. The design relies on a nominal, nonlinear control law, which guarantees i)--iv) in the absence of actuator delay, and nonlinear predictor feedback. We consider a second-order, nonlinear vehicle model with input delay. The proofs of the theoretical guarantees i)--iv) rely on derivation of explicit estimates on solutions (both during open-loop and closed-loop operation), capitalizing on the ability of predictor feedback to guarantee complete delay compensation after the dead-time interval has elapsed, and derivation of explicit conditions on initial conditions and parameters of the nominal control law. We also present consistent simulation results.
|
|
10:40-11:00, Paper WeA26.3 | Add to My Program |
On the Intelligent Proportional Controller Applied to Linear Systems |
|
Belhadjoudja, Mohamed Camil | Gipsa Lab / Cnrs |
Maghenem, Mohamed Adlene | Gipsa Lab, CNRS, France |
Witrant, Emmanuel | Université Grenoble Alpes |
Keywords: Delay systems, Data driven control, Linear systems
Abstract: We analyze in this paper the effect of the well-known intelligent proportional controller on the stability of linear control systems. Inspired by the literature on neutral time-delay systems and advanced-type systems, we derive sufficient conditions on the order of the control system, under which, the used controller fails to achieve exponential stability. Furthermore, we obtain conditions, relating the systems and the control parameters, such that the closed-loop system is either unstable or not exponentially stable. After that, we provide cases where the used controller achieves exponential stability. The obtained results are illustrated on an experimental benchmark that consists of an electronic throttle valve.
|
|
11:00-11:20, Paper WeA26.4 | Add to My Program |
Synchronization for Linear Networked Systems Subject to Input and Communication Delays |
|
De, Souradip | Indian Institute of Technology Kanpur |
Sahoo, Soumya Ranjan | Indian Institute of Technology Kanpur |
Wahi, Pankaj | Indian Institute of Technology |
Keywords: Delay systems, Distributed control, LMIs
Abstract: This paper investigates the synchronization problem for generic linear multi-agent systems with known or unknown heterogeneous input and communication delays. We propose two protocols that consist of consensus-based internal controller states and decentralized controllers. This kind of distributed dynamic control methodology is able to circumvent the interactive nature of two delays by translating the synchronization problem of agents into the stability of a set of delay differential equations. We examine the synchronization problem for two distinct cases, namely, known delays and unknown delays. When the delays are known, the stability criteria are satisfied by the feasibility of an input-delay-dependent linear matrix inequality and a communication-delay-dependent coupling strength bound. The margin on the communication delay is dependent not only on the network topology but also on the system matrix, which does not have any eigenvalues with positive real parts. We also develop a distributed dynamic control protocol that can handle unknown input and communication delays, and the stability criteria are realized by using the feasibility of a linear matrix inequality and a positive coupling strength. Synchronization is guaranteed even if the unknown communication delays are arbitrarily large but bounded and the upper bound on the heterogeneous input delays is known. The proposed control methodology guarantees that inaccurate measurements of the actual states of a particular agent will not lead to an irretrievable failure of the mission.
|
|
11:20-11:40, Paper WeA26.5 | Add to My Program |
Some Remarks on LQ Mean-Field Games for Stochastic Input Delay Systems |
|
Mukaidani, Hiroaki | Hiroshima University |
Irie, Shunpei | Hiroshima University |
Xu, Hua | Univ. of Tsukuba |
Zhuang, Weihua | University of Waterloo |
Keywords: Delay systems, Mean field games, Stochastic optimal control
Abstract: In this study, we consider linear-quadratic (LQ) mean-field social control problems for a class of stochastic systems with ordinary control input and delay control input. We define a stabilization problem via a memoryless static output feedback (SOF) strategy and then solve the problem of minimizing the upper bound of the cost function using guaranteed cost control theory. It is found that the minimization of the upper bound of the cost function cannot be attained if only a delay control input exists. Futhermore, it is proved that it is impossible to implement a mean-field SOF strategy to solve the minimization problem, and the input matrix must have the same dimension as the state matrix. To solve this minimiztion problem, the necessary conditions for the sub-optimality are established via stochastic cross-coupled matrix equations (SCCMEs) using the Karush-Kuhn-Tucker condition and the state feedback strategy. Finally, the performance and usefulness of the proposed strategy are investigated using an order-reduced scheme based on the direct method.
|
|
11:40-12:00, Paper WeA26.6 | Add to My Program |
On Stability of Second-Order Nonlinear Time-Delay Systems without Damping |
|
Aleksandrov, Alexander | St. Petersburg State University |
Efimov, Denis | Inria |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Delay systems, Nonlinear systems, Stability of nonlinear systems
Abstract: For a second-order system with time delays and power nonlinearity of the degree higher than one, which does not contain a velocity-proportional damping term, the conditions of local asymptotic stability of the zero solution are proposed. The result is based on application of the Lyapunov-Razumikhin approach, and it is illustrated by simulations. Our local stability conditions for nonlinear systems are less restrictive than stability conditions of the corresponding linear models.
|
|
WeB01 Tutorial Session, Orchid Main 4202-4306 |
Add to My Program |
Statistical Learning Theory for Identification and Control |
|
|
Chair: Ziemann, Ingvar | University of Pennsylvania |
Co-Chair: Tsiamis, Anastasios | ETH Zurich |
Organizer: Jedra, Yassir | KTH |
Organizer: Matni, Nikolai | University of Pennsylvania |
Organizer: Pappas, George J. | University of Pennsylvania |
Organizer: Tsiamis, Anastasios | ETH Zurich |
Organizer: Ziemann, Ingvar | University of Pennsylvania |
|
13:30-13:50, Paper WeB01.1 | Add to My Program |
Introduction and Roadmap (I) |
|
Ziemann, Ingvar | University of Pennsylvania |
Tsiamis, Anastasios | ETH Zurich |
|
13:50-14:10, Paper WeB01.2 | Add to My Program |
Basic Concentration Inequalities (I) |
|
Jedra, Yassir | KTH |
|
14:10-14:30, Paper WeB01.3 | Add to My Program |
The Lower Tail of the Empirical Covariance (I) |
|
Ziemann, Ingvar | University of Pennsylvania |
|
14:30-14:50, Paper WeB01.4 | Add to My Program |
Self-Normalized Martingales (I) |
|
Lee, Bruce | University of Pennsylvania |
|
14:50-15:10, Paper WeB01.5 | Add to My Program |
Non-Asymptotic Linear System Identification (I) |
|
Tsiamis, Anastasios | ETH Zurich |
|
15:10-15:30, Paper WeB01.6 | Add to My Program |
Beyond Linear System Identification (I) |
|
Ziemann, Ingvar | University of Pennsylvania |
|
WeB02 Invited Session, Melati Main 4001AB-4102 |
Add to My Program |
Learning-Based Control II: Safety and Robustness |
|
|
Chair: Zeilinger, Melanie N. | ETH Zurich |
Co-Chair: Rantzer, Anders | Lund University |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
13:30-13:50, Paper WeB02.1 | Add to My Program |
A Computationally Lightweight Safe Learning Algorithm (I) |
|
Baumann, Dominik | Aalto University |
Kowalczyk, Krzysztof | Wroclaw University of Science and Technology |
Tiels, Koen | Eindhoven University of Technology |
Wachel, Pawel | Wroclaw University of Science and Technology, Poland |
Keywords: Statistical learning, Machine learning, Learning
Abstract: Safety is an essential asset when learning control policies for physical systems, as violating safety constraints during training can lead to expensive hardware damage. In response to this need, the field of safe learning has emerged with algorithms that can provide probabilistic safety guarantees without knowledge of the underlying system dynamics. Those algorithms often rely on Gaussian process inference. Unfortunately, Gaussian process inference scales cubically with the number of data points, limiting applicability to high-dimensional and embedded systems. In this paper, we propose a safe learning algorithm that provides probabilistic safety guarantees but leverages the Nadaraya-Watson estimator instead of Gaussian processes. For the Nadaraya-Watson estimator, we can reach logarithmic scaling with the number of data points. We provide theoretical guarantees for the estimates, embed them into a safe learning algorithm, and show numerical experiments on a simulated seven-degrees-of-freedom robot manipulator.
|
|
13:50-14:10, Paper WeB02.2 | Add to My Program |
Learning Over Contracting and Lipschitz Closed-Loops for Partially-Observed Nonlinear Systems (I) |
|
Barbara, Nicholas H. | The University of Sydney |
Wang, Ruigang | The University of Sydney |
Manchester, Ian R. | University of Sydney |
Keywords: Learning, Nonlinear output feedback, Robust control
Abstract: This paper presents a policy parameterization for learning-based control on nonlinear, partially-observed dynamical systems. The parameterization is based on a nonlinear version of the Youla parameterization and the recently proposed Recurrent Equilibrium Network (REN) class of models. We prove that the resulting Youla-REN parameterization automatically satisfies stability (contraction) and user-tunable robustness (Lipschitz) conditions on the closed-loop system. This means it can be used for safe learning-based control with no additional constraints or projections required to enforce stability or robustness. We test the new policy class in simulation on two reinforcement learning tasks: 1) magnetic suspension, and 2) inverting a rotary-arm pendulum. We find that the Youla-REN performs similarly to existing learning-based and optimal control methods while also ensuring stability and exhibiting improved robustness to adversarial disturbances.
|
|
14:10-14:30, Paper WeB02.3 | Add to My Program |
An Online Learning Analysis of Minimax Adaptive Control (I) |
|
Renganathan, Venkatraman | Lund University |
Iannelli, Andrea | University of Stuttgart |
Rantzer, Anders | Lund University |
Keywords: Robust adaptive control, Adaptive control, Robust control
Abstract: We present an online learning analysis of minimax adaptive control for the case where the uncertainty includes a finite set of linear dynamical systems. Precisely, for each system inside the uncertainty set, we define the model-based regret by comparing the state and input trajectories from the minimax adaptive controller against that of an optimal controller in hindsight that knows the true dynamics. We then define the total regret as the worst case model-based regret with respect to all models in the considered uncertainty set. We study how the total regret accumulates over time and its effect on the adaptation mechanism employed by the controller. Moreover, we investigate the effect of the disturbance on the growth of the regret over time and draw connections between robustness of the controller and the associated regret rate.
|
|
14:30-14:50, Paper WeB02.4 | Add to My Program |
Risk-Sensitive Inhibitory Control for Safe Reinforcement Learning (I) |
|
Lederer, Armin | Technical University of Munich |
Noorani, Erfaun | University of Maryland College Park |
Baras, John S. | University of Maryland |
Hirche, Sandra | Technische Universität München |
Keywords: Machine learning, Learning, Uncertain systems
Abstract: Humans have the ability to deviate from their natural behavior when necessary, which is a cognitive process called response inhibition. Similar approaches have independently received increasing attention in recent years for ensuring the safety of control. Realized using control barrier functions or predictive safety filters, these approaches can effectively ensure the satisfaction of state constraints through an online adaptation of nominal control laws, e.g., obtained through reinforcement learning. While the focus of these realizations of inhibitory control has been on risk-neutral formulations, human studies have shown a tight link between response inhibition and risk attitude. Inspired by this insight, we propose a flexible, risk-sensitive method for inhibitory control. Our method is based on a risk-aware condition for value functions, which guarantees the satisfaction of state constraints. We propose a method for learning these value functions using common techniques from reinforcement learning and derive sufficient conditions for its success. By enforcing the derived safety conditions online using the learned value function, risk-sensitive inhibitory control is effectively achieved. The effectiveness of the developed control scheme is demonstrated in simulations.
|
|
14:50-15:10, Paper WeB02.5 | Add to My Program |
Learning Safety Filters for Unknown Discrete-Time Linear Systems |
|
Farokhi, Farhad | The University of Melbourne |
Leong, Alex S. | DST Group |
Zamani, Mohammad | University of Melbourne |
Shames, Iman | Australian National University |
Keywords: Networked control systems, Linear systems, Stochastic systems
Abstract: A learning-based safety filter is developed for discrete-time linear time-invariant systems with unknown models subject to Gaussian noises with unknown covariance. Safety is characterized using polytopic constraints on the states and control inputs. The empirically learned model and process noise covariance with their confidence bounds are used to construct a robust optimization problem for minimally modifying nominal control actions to ensure safety with high probability. The optimization problem relies on tightening the original safety constraints. The magnitude of the tightening is larger at the beginning since there is little information to construct reliable models, but shrinks with time as more data becomes available.
|
|
15:10-15:30, Paper WeB02.6 | Add to My Program |
Data-Driven Synthesis of Safety Controllers for Partially-Observable Systems with Unknown Models (I) |
|
Jahanshahi, Niloofar | Ludwig Maximilian University of Munich |
Zamani, Majid | University of Colorado Boulder |
Keywords: Optimization
Abstract: This paper is concerned with the formal synthesis of safety controllers for partially-observable discrete-time control systems with unknown mathematical models. Given a state estimator with unknown dynamics but a known upper bound on the estimation error, we propose a data-driven approach to compute controllers that render the partially-observable systems with unknown dynamics safe. Our proposed method is based on the construction of control barrier certificates, where we first formulate the barrier-based safety problem as a robust program (RP). The proposed RP is not tractable since the unknown model of the estimator appears in one of its constraints. To tackle this issue, we collect a set of data from the black-box system and its estimator and replace the original RP with a scenario program (SP). Due to the existence of a max-min constraint in the SP, we construct an analogous scenario program, denoted by SP^alpha, in which the max-min constraint is replaced with a single inequality constraint. The control barrier certificates together with their corresponding controllers can then be computed by solving SP^alpha via the collected data. By connecting the feasible solutions of SP^alpha and SP, the safety of the partially-observable system equipped with the synthesized controller can be guaranteed with 100% confidence. We show the effectiveness of our results by synthesizing a safety controller for a partially-observable Van der Pol oscillator with unknown dynamics.
|
|
WeB03 Invited Session, Melati Main 4003-4104 |
Add to My Program |
Learning Algorithms for Optimization and Applications |
|
|
Chair: Li, Xiuxian | Tongji University |
Co-Chair: Yi, Xinlei | Massachusetts Institute of Technology |
Organizer: Li, Xiuxian | Tongji University |
Organizer: Yi, Xinlei | Massachusetts Institute of Technology |
Organizer: Yang, Tao | Northeastern University |
Organizer: Xie, Lihua | Nanyang Tech. Univ |
|
13:30-13:50, Paper WeB03.1 | Add to My Program |
Variance-Reduced Shuffling Gradient Descent with Momentum for Finite-Sum Minimization |
|
Jiang, Xia | Beijing Institute of Technology |
Zeng, Xianlin | Beijing Institute Fo Technology |
Xie, Lihua | Nanyang Tech. Univ |
Sun, Jian | Beijing Institute of Technology |
Keywords: Optimization, Numerical algorithms
Abstract: Finite-sum minimization is a fundamental optimization problem in signal processing and machine learning. This paper proposes a variance-reduced shuffling gradient descent with Nesterovs momentum for smooth convex finite-sum optimization. We integrate an explicit variance reduction into the shuffling gradient descent to deal with the variance introduced by shuffling gradients. The proposed algorithm with a unified shuffling scheme converges at a rate of O(1/T), where T is the number of epochs. The convergence rate independent of gradient variance is better than most existing shuffling gradient algorithms for convex optimization. Finally, numerical simulations demonstrate the convergence performance of the proposed algorithm.
|
|
13:50-14:10, Paper WeB03.2 | Add to My Program |
Quantized Distributed Online Projection-Free Convex Optimization |
|
Zhang, Wentao | Nanjing University of Science and Technology |
Shi, Yang | University of Victoria |
Zhang, Baoyong | Nanjing University of Science and Technology |
Lu, Kaihong | Shandong University of Science and Technology |
Yuan, Deming | Nanjing University of Science and Technology |
Keywords: Network analysis and control, Optimization algorithms, Cooperative control
Abstract: This paper considers online distributed convex constrained optimization over a time-varying multi-agent network. Agents in this network cooperate to minimize the global objective function through information exchange with their neighbors and local computation. Since the capacity or bandwidth of communication channels often is limited, a random quantizer is introduced to reduce the transmission bits. Through incorporating this quantizer, we develop a quantized distributed online projection-free optimization algorithm, which can achieve the saving of communication resources and computational costs. The dynamic regret bound mathcal{O}( max{T^{1-gamma}, T^{gamma}(1+H_T) } +D_T) of the proposed algorithm is rigorously established, which depends on the total time T, function variation H_T, gradient variation D_T, and the parameter 0
|
|
14:10-14:30, Paper WeB03.3 | Add to My Program |
A Game Theoretic Approach for Safe and Distributed Control of Unmanned Aerial Vehicles (I) |
|
Andre do Nascimento, Allan | University of Oxford |
Papachristodoulou, Antonis | University of Oxford |
Margellos, Kostas | University of Oxford |
Keywords: Optimization algorithms, Game theory, Predictive control for linear systems
Abstract: This paper presents a distributed methodology to produce collision-free control laws for an Unmanned Aerial Vehicles (UAVs) fleet. We use a game theoretic framework, where UAVs accommodate for individual and fleet goals, while respecting safety requirements. The method combines Control Barrier Functions (CBFs) and a primal-dual algorithm for Nash equilibrium (NE) seeking in generalized games. Feedback is introduced by Model Predictive Control (MPC) and we analyze its stability properties. The combination of these tools allows for a distributed, collision-free pointwise equilibrium solution, despite the agents coupling, due to common target tracking and the collision avoidance constraints. Our algorithmic results are supported theoretically and its efficacy is demonstrated via extensive numerical simulations.
|
|
14:30-14:50, Paper WeB03.4 | Add to My Program |
Linear Last-Iterate Convergence for Continuous Games with Coupled Inequality Constraints (I) |
|
Meng, Min | Tongji University |
Li, Xiuxian | Tongji University |
Keywords: Game theory
Abstract: In this paper, the generalized Nash equilibrium (GNE) seeking problem for continuous games with coupled affine inequality constraints is investigated in a partial-decision information scenario, where each player can only access its neighbors' information through local communication although its cost function possibly depends on all other players' strategies. To this end, a novel decentralized primal-dual algorithm based on consensus and dual diffusion methods is devised for seeking the variational GNE of the studied games. This paper also provides theoretical analysis to show that the designed algorithm converges linearly for the last-iterate, which, to our best knowledge, is the first to propose a linearly convergent GNE seeking algorithm under coupled affine inequality constraints. Finally, a numerical example is presented to demonstrate the efficiency of the obtained theoretical results.
|
|
14:50-15:10, Paper WeB03.5 | Add to My Program |
Delay-Agnostic Asynchronous Distributed Optimization (I) |
|
Wu, Xuyang | KTH Royal Institute of Technology |
Liu, Changxin | KTH Royal Institute of Technology |
Magnusson, Sindri | Stockholm University |
Johansson, Mikael | KTH - Royal Institute of Technology |
Keywords: Optimization algorithms, Optimization
Abstract: Existing asynchronous distributed optimization algorithms often use diminishing step-sizes that cause slow practical convergence, or fixed step-sizes that depend on an assumed upper bound of delays. Not only is such a delay bound hard to obtain in advance, but it is also large and therefore results in unnecessarily slow convergence. This paper develops asynchronous versions of two distributed algorithms, DGD and DGD-ATC, for solving consensus optimization problems over undirected networks. In contrast to alternatives, our algorithms can converge to the fixed point set of their synchronous counterparts using step-sizes that are independent of the delays. We establish convergence guarantees under both partial and total asynchrony. The practical performance of our algorithms is demonstrated by numerical experiments.
|
|
15:10-15:30, Paper WeB03.6 | Add to My Program |
PIQP: A Proximal Interior-Point Quadratic Programming Solver (I) |
|
Schwan, Roland | EPFL |
Jiang, Yuning | EPFL |
Kuhn, Daniel | EPFL |
Jones, Colin N. | EPFL |
Keywords: Numerical algorithms, Computational methods, Optimization algorithms
Abstract: This paper presents PIQP, a high-performance toolkit for solving generic sparse quadratic programs (QP). Combining an infeasible Interior Point Method (IPM) with the Proximal Method of Multipliers (PMM), the algorithm can handle ill-conditioned convex QP problems without the need for linear independence of the constraints. The open-source implementation is written in C++ with interfaces to C, Python, Matlab, and R leveraging the Eigen3 library. The method uses a pivoting-free factorization routine and allocation-free updates of the problem data, making the solver suitable for embedded applications. The solver is evaluated on the Maros-Mészáros problem set and optimal control problems, demonstrating state-of-the-art performance for both small and large-scale problems, outperforming commercial and open-source solvers.
|
|
WeB04 Invited Session, Simpor Junior 4913 |
Add to My Program |
Connected Automated Traffic Systems |
|
|
Chair: Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Co-Chair: Molnar, Tamas G. | California Institute of Technology |
Organizer: Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
|
13:30-13:50, Paper WeB04.1 | Add to My Program |
Safety-Critical Traffic Control for Mixed Autonomy Systems with Input Delay and Disturbances (I) |
|
Zhao, Chenguang | The Hong Kong University of Science and Technology |
Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Keywords: Traffic control, Delay systems, Lyapunov methods
Abstract: While the connected automated vehicle (CAV) has been applied in vehicle-based traffic control that aims to stabilize a string of car-following human-driven vehicles (HV), the safety impact of a CAV controller on the overall traffic flow system is still an open question. In this paper, we propose a Robust Safety-critical Traffic Control (RSTC) design to impart safety for the mixed autonomy traffic system, in which the speed disturbance from a leading HV and input delay of the stabilizing CAV controller are considered. We employ Control Barrier Function (CBF) to impose safety constraints on a nominal CAV control input that achieves string stability. The key challenge lies in incorporating effect of the input delay and external disturbances into the CBF constraints. The predicted speed and spacing gap in the robust CBF design is obtained using a delay-compensating state predictor. The forward invariance of the safe set is proved given the derivative of speed disturbance, i.e., acceleration of the leading vehicle, is bounded. The safety improvement of RSTC over the nominal controller is then validated via numerical simulation.
|
|
13:50-14:10, Paper WeB04.2 | Add to My Program |
Observer-Based Output Feedback Stabilization for Stop-And-Go Waves of Vehicle Traffic Flow (I) |
|
Luan, Haoran | Beijing University of Technology |
Zhan, Jingyuan | Beijing University of Technology |
Li, Xiaoli | Beijing University of Technology |
Zhang, Liguo | Beijing University of Technology |
Keywords: Distributed parameter systems, Backstepping, Traffic control
Abstract: This paper designs an observerbased output feedback controller for traffic flow with stopandgo waves and disturbances in order to dissipate traffic congestion. The macroscopic traffic flow dynamics in the congestion regime is described by the linearized AwRascleZhang (ARZ) traffic flow model over a timevarying moving spatial domain, and according to the RankineHugoniot condition and the characteristic velocities of the ARZ model, a novel propagation model of the stopandgo waves is proposed. To stabilize the traffic state and the stopandgo waves, and to suppress traffic disturbances, an observerbased output feedback controller is designed by using the PDE backstepping method. The controller utilizes the estimated state of an observer, which is constructed based on boundary measurements only. The exponential stability of the closedloop system in the H1 norm is proved by the Lyapunov analysis. Finally, the effectiveness of the controller for stabilizing the traffic state with stop-and-go waves and disturbances is verified by numerical simulations.
|
|
14:10-14:30, Paper WeB04.3 | Add to My Program |
On the Safety of Connected Cruise Control: Analysis and Synthesis with Control Barrier Functions (I) |
|
Molnar, Tamas G. | California Institute of Technology |
Orosz, Gabor | University of Michigan |
Ames, Aaron D. | California Institute of Technology |
Keywords: Autonomous vehicles, Constrained control, Traffic control
Abstract: Connected automated vehicles have shown great potential to improve the efficiency of transportation systems in terms of passenger comfort, fuel economy, stability of driving behavior and mitigation of traffic congestions. Yet, to deploy these vehicles and leverage their benefits, the underlying algorithms must ensure their safe operation. In this paper, we address the safety of connected cruise control strategies for longitudinal car following using control barrier function (CBF) theory. In particular, we consider various safety measures such as minimum distance, time headway and time to conflict, and provide a formal analysis of these measures through the lens of CBFs. Additionally, motivated by how stability charts facilitate stable controller design, we derive safety charts for existing connected cruise controllers to identify safe choices of controller parameters. Finally, we combine the analysis of safety measures and the corresponding stability charts to synthesize safety-critical connected cruise controllers using CBFs. We verify our theoretical results by numerical simulations.
|
|
14:30-14:50, Paper WeB04.4 | Add to My Program |
Learning Optimal Robust Control of Connected Vehicles in Mixed Traffic Flow (I) |
|
Li, Jie | Tsinghua University |
Wang, Jiawei | Tsinghua University |
Li, Shengbo Eben | Tsinghua University |
Li, Keqiang | Tsinghua University, Beijing, China |
Keywords: Traffic control, Robust control, Autonomous vehicles
Abstract: Connected and automated vehicles (CAVs) technologies promise to attenuate undesired traffic disturbances. However, in mixed traffic where human-driven vehicles (HDVs) also exist, the nonlinear human-driving behavior has brought critical challenges for effective CAV control. This paper employs the policy iteration method to learn the optimal robust controller for nonlinear mixed traffic systems. Precisely, we consider the H_{infty} control framework and formulate it as a zero-sum game, the equivalent condition for whose solution is converted into a HamiltonJacobi inequality with a Hamiltonian constraint. Then, a policy iteration algorithm is designed to generate stabilizing controllers with desired attenuation performance. Based on the updated robust controller, the attenuation level is further optimized in sum of squares program by leveraging the gap of the Hamiltonian constraint. Simulation studies verify that the obtained controller enables the CAVs to dampen traffic perturbations and smooth traffic flow.
|
|
14:50-15:10, Paper WeB04.5 | Add to My Program |
How Does Driver Non-Compliance Destroy Traffic Routing Control? (I) |
|
Tang, Yu | New York University |
Jin, Li | Shanghai Jiao Tong University |
Ozbay, Kaan | New York University |
Keywords: Transportation networks, Traffic control
Abstract: Routing control is one of important traffic man- agement strategies against urban congestion. However, it could be compromised by heterogeneous driver non-compliance with routing instructions. In this article, we model the compliance in a stochastic manner and investigate its impacts on routing control. We consider traffic routing for two parallel links. Particularly, we consider two scenarios: one ignores congestion spillback while the other one considers it. We formulate the problem as a Markov chain, given random drivers adherence. Then, we propose the stability and instability conditions to reveal when the routing is able or unable to stabilize the traffic. We show that for links without congestion spillback there exists a necessary and sufficient stability criterion. For links admiting congestion propagation, we present one stability condition and one instability condition. These stability conditions allow us to quantify the impacts of driver non-compliance on the two-link network in terms of throughput. Finally, we illustrate the results with a set of numerical examples.
|
|
15:10-15:30, Paper WeB04.6 | Add to My Program |
Uncertainty-Aware Grounded Action Transformation towards Sim-To-Real Transfer for Traffic Signal Control (I) |
|
Da, Longchao | New Jersey Institute of Technology |
Mei, Hao | New Jersey Institue of Technology |
Sharma, Romir | West Windsor-Plainsboro High School South |
Wei, Hua | Arizona State University |
Keywords: Adaptive control, Traffic control, Intelligent systems
Abstract: Traffic signal control (TSC) is a complex and important task that affects the daily lives of millions of people. Reinforcement Learning (RL) has shown promising results in optimizing traffic signal control, but current RL-based TSC methods are mainly trained in simulation and suffer from the performance gap between simulation and the real world. In this paper, we propose a simulation-to-real-world (sim-to-real) transfer approach called UGAT, which transfers a learned policy trained from a simulated environment to a real-world environment by dynamically transforming actions in the simulation with uncertainty to mitigate the domain gap of transition dynamics. We evaluate our method on a simulated traffic environment and show that it significantly improves the performance of the transferred RL policy in the real world.
|
|
WeB05 Invited Session, Simpor Junior 4912 |
Add to My Program |
Distributed Optimization and Learning for Networked Systems |
|
|
Chair: Yang, Tao | Northeastern University |
Co-Chair: Uribe, Cesar A. | Rice University |
Organizer: Yang, Tao | Northeastern University |
Organizer: Uribe, Cesar A. | Rice University |
Organizer: Lu, Jie | ShanghaiTech University |
Organizer: Nedich, Angelia | Arizona State University |
Organizer: Hong, Yiguang | Chinese Academy of Sciences |
|
13:30-13:50, Paper WeB05.1 | Add to My Program |
A Robust Dynamic Average Consensus Algorithm That Ensures Both Differential Privacy and Accurate Convergence (I) |
|
Wang, Yongqiang | Clemson University |
Keywords: Control Systems Privacy, Cooperative control, Decentralized control
Abstract: We propose a new dynamic average consensus algorithm that is robust to information-sharing noise arising from differential-privacy design. Not only is dynamic average consensus widely used in cooperative control and distributed tracking, it is also a fundamental building block in numerous distributed computation algorithms such as multi-agent optimization and distributed Nash equilibrium seeking. We propose a new dynamic average consensus algorithm that is robust to persistent and independent information-sharing noise added for the purpose of differential-privacy protection. In fact, the algorithm can ensure both provable convergence to the exact average reference signal and rigorous ϵ-differential privacy (even when the number of iterations tends to infinity), which, to our knowledge, has not been achieved before in average consensus algorithms. Given that channel noise in communication can be viewed as a special case of differential-privacy noise, the algorithm can also be used to counteract communication imperfections. Numerical simulation results confirm the effectiveness of the proposed approach.
|
|
13:50-14:10, Paper WeB05.2 | Add to My Program |
Distributed Optimization on Directed Graphs Based on Inexact ADMM with Partial Participation (I) |
|
Yi, Dingran | University of Science and Technology of China |
Freris, Nikolaos M. | University of Science and Technology of China |
Keywords: Optimization, Optimization algorithms
Abstract: We consider the problem of minimizing the sum of cost functions pertaining to agents over a network whose topology is captured by a directed graph (i.e., asymmetric communication). We cast the problem into the ADMM setting, via a consensus constraint, for which both primal subproblems are solved inexactly. In specific, the computationally demanding local minimization step is replaced by a single gradient step, while the averaging step is approximated in a distributed fashion. Furthermore, partial participation is allowed in the implementation of the algorithm. Under standard assumptions on strong convexity and Lipschitz continuous gradients, we establish linear convergence and characterize the rate in terms of the connectivity of the graph and the conditioning of the problem. Our line of analysis provides a sharper convergence rate compared to Push-DIGing. Numerical experiments corroborate the merits of the proposed solution in terms of superior rate as well as computation and communication savings over baselines.
|
|
14:10-14:30, Paper WeB05.3 | Add to My Program |
Convergence Analysis of the Best Response Algorithm for Time-Varying Games (I) |
|
Wang, Zifan | KTH Royal Institute of Technology |
Shen, Yi | Duke University |
Zavlanos, Michael M. | Duke University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Optimization, Game theory
Abstract: This paper studies a class of strongly monotone games involving non-cooperative agents that optimize their own time-varying cost functions. We assume that the agents can observe other agents' historical actions and choose actions that best respond to other agents' previous actions; we call this a best response scheme. We start by analyzing the convergence rate of this best response scheme for standard time-invariant games. Specifically, we provide a sufficient condition on the strong monotonicity parameter of the time-invariant games under which the proposed best response algorithm achieves exponential convergence to the static Nash equilibrium. We further illustrate that this best response algorithm may oscillate when the proposed sufficient condition fails to hold, which indicates that this condition is tight. Next, we analyze this best response algorithm for time-varying games where the cost functions of each agent change over time. Under similar conditions as for time-invariant games, we show that the proposed best response algorithm stays asymptotically close to the evolving equilibrium. We do so by analyzing both the equilibrium tracking error and the dynamic regret. Numerical experiments on economic market problems are presented to validate our analysis.
|
|
14:30-14:50, Paper WeB05.4 | Add to My Program |
Distributed Nonsmooth Optimization with Different Local Constraints Via Exact Penalty (I) |
|
Liu, Shuyu | Tongji University |
Liang, Shu | Tongji University |
Hong, Yiguang | Chinese Academy of Sciences |
Keywords: Optimization algorithms, Optimization, Lyapunov methods
Abstract: In this paper, we study a distributed optimization problem, where decision variables of agents need to satisfy different local constraints and the consensus constraint to minimize the sum of local cost functions. We propose a novel method to remove all these constraints by employing the exact penalty so that the derived equivalent unconstrained problem can be directly solved by subgradient descent type differential inclusions. The algorithm achieves O(1/t) rate of convergence when the cost functions are convex. Moreover, it achieves exponential convergence when the cost functions are strongly convex. Our method needs no dual variable to deal with the constraints so that computation and communication resources are saved in comparison with primal-dual methods. In addition, the method overcomes a divergence problem arising from an existing exponentially convergent distributed algorithm based on the exact penalty when the local constraints are different.
|
|
14:50-15:10, Paper WeB05.5 | Add to My Program |
Resilient Federated Learning under Byzantine Attack in Distributed Nonconvex Optimization with 2-F Redundancy (I) |
|
Dutta, Amit | Virginia Polytechnic Institute and State University |
Doan, Thinh T. | Virginia Tech |
Reed, Jeffrey | Virginia Tech |
Keywords: Optimization, Distributed control, Fault tolerant systems
Abstract: We study the problem of Byzantine fault tolerance in a distributed optimization setting, where there is a group of N agents communicating with a trusted centralized coordinator. Among these agents, there is a subset of f agents that may not follow a prescribed algorithm and may share arbitrarily incorrect information with the coordinator. The goal is to find the optimizer of the aggregate cost functions of the honest agents. We will be interested in studying the local gradient descent method, also known as federated learning, to solve this problem. However, this method often returns an approximate value of the underlying optimal solution in the Byzantine setting. Recent work showed that by incorporating the so-called comparative elimination (CE) filter at the coordinator, one can provably mitigate the detrimental impact of Byzantine agents and precisely compute the true optimizer in the convex setting. The focus of the present work is to provide theoretical results to show the convergence of local gradient methods with the CE filter in a nonconvex setting. We will also provide a number of numerical simulations to support our theoretical results.
|
|
15:10-15:30, Paper WeB05.6 | Add to My Program |
Distributed Online Optimization with Coupled Inequality Constraints Over Unbalanced Directed Networks (I) |
|
Wang, Dandan | ShanghaiTech University |
Zhu, Daokuan | ShanghaiTech University |
Sou, Kin Cheong | National Sun Yat-Sen University |
Lu, Jie | ShanghaiTech University |
Keywords: Optimization algorithms, Optimization, Networked control systems
Abstract: This paper studies a distributed online convex optimization problem, where agents in an unbalanced network cooperatively minimize the sum of their time-varying local cost functions subject to a coupled inequality constraint. To solve this problem, we propose a distributed dual subgradient tracking algorithm, called DUST, which attempts to optimize a dual objective by means of tracking the primal constraint violations and integrating dual subgradient and push-sum techniques. Different from most existing works, we allow the underlying network to be unbalanced with a column stochastic mixing matrix. We show that DUST achieves sublinear dynamic regret and constraint violations, provided that the accumulated variation of the optimal sequence grows sublinearly. If the standard Slaters condition is additionally imposed, DUST acquires a smaller constraint violation bound than the alternative existing methods applicable to unbalanced networks. Simulations on a plug-in electric vehicle charging problem demonstrate the superior convergence of DUST.
|
|
WeB06 Regular Session, Simpor Junior 4911 |
Add to My Program |
Estimation II |
|
|
Chair: Dong, Daoyi | University of New South Wales |
Co-Chair: Chen, Hua | Southern University of Science and Technology |
|
13:30-13:50, Paper WeB06.1 | Add to My Program |
A Unified Approach to Optimally Solving Sensor Scheduling and Sensor Selection Problems in Kalman Filtering |
|
Dutta, Shamak | University of Waterloo |
Wilde, Nils | TU Delft |
Smith, Stephen L. | University of Waterloo |
Keywords: Estimation, Kalman filtering, Optimization
Abstract: We consider a general form of the sensor scheduling problem for state estimation of linear dynamical systems, which involves selecting sensors that minimize the trace of the Kalman filter error covariance (weighted by a positive semidefinite matrix) subject to polyhedral constraints. This general form captures several well-studied problems including sensor placement, sensor scheduling with budget constraints, and Linear Quadratic Gaussian (LQG) control and sensing co-design. We present a mixed integer optimization approach that is derived by exploiting the optimality of the Kalman filter. While existing work has focused on approximate methods to specific problem variants, our work provides a unified approach to computing optimal solutions to the general version of sensor scheduling. In simulation, we show this approach finds optimal solutions for systems with 30 to 50 states in seconds.
|
|
13:50-14:10, Paper WeB06.2 | Add to My Program |
A General Iterative Extended Kalman Filter Framework for State Estimation on Matrix Lie Groups |
|
Liu, Ben | Southern University of Science and Technology |
Chen, Hua | Southern University of Science and Technology |
Zhang, Wei | Southern University of Science and Technology |
Keywords: Estimation, Kalman filtering
Abstract: In this paper, we focus on state estimation prob- lem for nonlinear systems on joint matrix Lie group G and Euclidean space Rn. We propose a general iterative Kalman filter, aiming to integrate the prediction step into the iteration scheme, which is not considered in the conventional iterative extended Kalman filter framework. Such an extra iteration scheme in the prediction step helps improving the accuracy of probability density function propagation through nonlinearities, which can further lead to more accurate estimations of the system states. In addition, the proposed framework unifies the Kalman filter based estimation schemes on studied manifold by adopting the perspective from Gaussian Bayesian inference. The improvement of the proposed framework is illustrated by the ES-GIKF algorithm that is instantiated from the proposed framework in a numerical simulation.
|
|
14:10-14:30, Paper WeB06.3 | Add to My Program |
Data Informativity for Lyapunov Equations |
|
Banno, Ikumi | Nagoya University |
Azuma, Shun-ichi | Kyoto University |
Ariizumi, Ryo | Tokyo University of Agriculture and Technology |
Asai, Toru | Nagoya University |
Imura, Jun-ichi | Tokyo Institute of Technology |
Keywords: Estimation, Linear systems, Computational methods
Abstract: Recently, the novel framework for data-driven analysis and control, called data informativity, was proposed. This notion represents whether the given data contain sufficient information to solve a problem or not. However, data informativity for solving a Lyapunov equation has never been addressed before. This letter characterizes the data informativity for the Lyapunov equations in the form of AP + PA^T = -Q, where A and Q are square matrices and P is an unknown matrix. First, we clarify the relationship between the unique solution to the Lyapunov equation and the controllable subspace of a system. Second, based on this result, we provide a necessary and sufficient condition for the data informativity, which is characterized by the possibility of a certain matrix decomposition of Q, called the data-basis decomposition. Finally, we present a direct data-driven method for solving the Lyapunov equation based on our data informativity condition. This method has a potential to compute the solution even if the data do not contain sufficient information to identify the system.
|
|
14:30-14:50, Paper WeB06.4 | Add to My Program |
Estimation of Quantum Channels Using Neural Networks |
|
Ma, Hailan | University of New South Wales |
Sun, Zhenhong | UNSW |
Xiao, Shuixin | Australian National University |
Dong, Daoyi | Australian National University |
Petersen, Ian R. | Australian National University |
Keywords: Estimation, Machine learning
Abstract: Quantum process tomography is an essential task for characterizing the dynamics of quantum systems and achieving precise quantum control. In this work, we propose a machine learning-based quantum process tomography method to reconstruct the Choi matrices of quantum channels from the measurements of the output states. Numerical results demonstrate that the proposed method exhibits a significant potential to achieve accurate reconstruction of different quantum channels.
|
|
14:50-15:10, Paper WeB06.5 | Add to My Program |
Minimax Two-Stage Gradient Boosting for Parameter Estimation |
|
Lakshminarayanan, Braghadeesh | KTH Royal Institute of Technology |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Keywords: Estimation, Machine learning, Identification
Abstract: Parameter estimation is an important sub-field in statistics and system identification. Various methods for parameter estimation have been proposed in the literature, among which the Two-Stage (TS) approach is particularly promising, due to its ease of implementation and reliable estimates. Among the different statistical frameworks used to derive TS estimators, the min-max framework is attractive due to its mild dependence on prior knowledge about the parameters to be estimated. However, the existing implementation of the minimax TS approach has currently limited applicability, due to its heavy computational load. In this paper, we overcome this difficulty by using a gradient boosting machine (GBM) in the second stage of TS approach. We call the resulting algorithm the Two-Stage Gradient Boosting Machine (TSGBM) estimator. Finally, we test our proposed TSGBM estimator on several numerical examples including models of dynamical systems.
|
|
15:10-15:30, Paper WeB06.6 | Add to My Program |
Log-Sparse Hawkes Network Identification |
|
Solo, Victor | University of New South Wales |
Rong, Xinhui | University of New South Wales |
Seneviratne, Akila | The University of New South Wales |
Keywords: Estimation, Model Validation, Identification
Abstract: Point process networks are emerging in wide range of applications, such as finance and social media. Such history dependent data are often modeled by the multivariate Hawkes processes. In this paper, we propose a log-sparsity penalized least squares (log-LS) estimation for the Hawkes intensity to capture the network dynamics, while eliminating the inactive nodes. We develop a new continuous-time log-LS formulation correcting an error in previous work by finding an underlying true global minimum. We use a cyclic descent + BIC algorithm for efficient optimization. We finally compare various penalties in simulations demonstrating advantages of log-sparsity.
|
|
WeB07 Regular Session, Simpor Junior 4813 |
Add to My Program |
Game Theory II |
|
|
Chair: Hu, Jianghai | Purdue University |
Co-Chair: Bistritz, Ilai | Tel Aviv University |
|
13:30-13:50, Paper WeB07.1 | Add to My Program |
A Bandit Learning Method for Continuous Games under Feedback Delays with Residual Pseudo-Gradient Estimate |
|
Huang, Yuanhanqing | Purdue University |
Hu, Jianghai | Purdue University |
Keywords: Game theory, Delay systems, Learning
Abstract: Learning in multi-player games can model a large variety of practical scenarios, where each player seeks to optimize its own local objective function, which at the same time relies on the actions taken by others. Motivated by the frequent absence of first-order information such as partial gradients in solving local optimization problems and the prevalence of asynchronicity and feedback delays in multi-agent systems, we introduce a bandit learning algorithm, which integrates mirror descent, residual pseudo-gradient estimates, and the priority-based feedback utilization strategy, to contend with these challenges. We establish that for pseudo-monotone plus games, the actual sequences of play generated by the proposed algorithm converge a.s. to critical points. Compared with the existing method, the proposed algorithm yields more consistent estimates with less variation and allows for more aggressive choices of parameters. Finally, we illustrate the validity of the proposed algorithm through a thermal load management problem of building complexes.
|
|
13:50-14:10, Paper WeB07.2 | Add to My Program |
Logit Learning by Valuation in Extensive-Form Games with Simultaneous Moves |
|
Castiglione, Jason | University of Hawaii |
Arslan, Gurdal | University of Hawaii at Manoa |
Keywords: Game theory, Iterative learning control, Learning
Abstract: We study the long-term behavior of the logit learning rule in multiplayer repeated extensive-form games. Our model involves the possibility of simultaneous moves by multiple players as well as chance moves by nature in every node of the game tree. The logit learning rule considered in this paper is based on average payoff valuations. In certain class of extensive-form games with simultaneous moves, we show that player strategies converge to a perturbed subgame perfect equilibrium of the stage game when every player uses the logit learning rule in the repeated game. In extensive-form games with perfect information, we also show that the long run average payoff of a player using the logit learning rule is guaranteed to be nearly as high as the player's maxmin payoff in the stage game.
|
|
14:10-14:30, Paper WeB07.3 | Add to My Program |
Admission Control for Games with a Dynamic Set of Players |
|
Bistritz, Ilai | Tel Aviv University |
Bambos, Nicholas | Stanford University |
Keywords: Game theory, Learning, Control of networks
Abstract: We consider open games where players arrive according to a Poisson process with rate lambda and stay in the game for an exponential random duration with rate mu. The game evolves in continuous time where each player n sets an exponential random clock and updates her action a_{n}inleft{ 0,ldots,Kright} when it expires. The players take independent best-response actions that, uninterrupted, can converge to a Nash Equilibrium (NE). This models open multiagent systems such as wireless networks, cloud computing, and online marketplaces. When lambda is small, the game spends most of the time in a (time-varying) equilibrium. This equilibrium exhibits predictable behavior and can have performance guarantees by design. However, when lambda is too small, the system is under-utilized since not many players are in the game on average. Choosing the maximal lambda that the game can support while still spending a target fraction 0
|
|
14:30-14:50, Paper WeB07.4 | Add to My Program |
The Stability of Matrix Multiplicative Weights Dynamics in Quantum Games |
|
Lotidis, Kyriakos | Stanford University |
Mertikopoulos, Panayotis | French National Center for Scientific Research (CNRS) |
Bambos, Nicholas | Stanford University |
Keywords: Game theory, Learning, Quantum information and control
Abstract: In this paper, we study the equilibrium convergence and stability properties of the widely used matrix multiplicative weights (MMW) dynamics for learning in general quantum games. A key difficulty in this endeavor is that the induced quantum state dynamics decompose naturally into (i) a classical, commutative component which governs the dynamics of the system's eigenvalues in a way analogous to the evolution of mixed strategies under the classical replicator dynamics; and (ii) a non-commutative component for the system's eigenvectors. This non-commutative component has no classical counterpart and, as a result, requires the introduction of novel notions of (asymptotic) stability to account for the nonlinear geometry of the game's quantum space. In this general context, we show that (i) only pure quantum equilibria can be stable and attracting under the MMW dynamics; and (ii) as a partial converse, pure quantum states that satisfy a certain ``variational stability'' condition are always attracting. This allows us to fully characterize the structure of quantum Nash equilibria that are stable and attracting under the MMW dynamics, a fact with significant implications for predicting the outcome of a multi-agent quantum learning process.
|
|
14:50-15:10, Paper WeB07.5 | Add to My Program |
Bandit Online Learning in Merely Coherent Games with Multi-Point Pseudo-Gradient Estimate |
|
Huang, Yuanhanqing | Purdue University |
Hu, Jianghai | Purdue University |
Keywords: Game theory, Learning, Variational methods
Abstract: Non-cooperative games serve as a powerful framework for capturing the interactions among self-interested players and have broad applicability in modeling a wide range of practical scenarios, ranging from power management to drug delivery. Although most existing solution algorithms assume the availability of first-order information or full knowledge of the objectives and others' action profiles, there are situations where the only accessible information at players' disposal is the realized objective function values. In this paper, we devise a bandit online learning algorithm for merely coherent games that integrates the optimistic mirror descent scheme and multi-point pseudo-gradient estimates. We further demonstrate that the generated actual sequence of play can converge a.s. to a critical point if the sequences of query radius and sample size are chosen properly, without resorting to extra Tikhonov regularization terms or additional norm conditions. Finally, we illustrate the validity of the proposed algorithm via a Rock-Paper-Scissors game and a least square estimation game.
|
|
15:10-15:30, Paper WeB07.6 | Add to My Program |
A Distributed Linear Quadratic Discrete-Time Game Approach to Formation Control with Collision Avoidance |
|
Aditya, Prima | Student |
Werner, Herbert | Hamburg University of Technology, Institute of Control Systems |
Keywords: Game theory, Linear systems, Distributed control
Abstract: Formation control problems can be expressed as linear quadratic discrete-time games (LQDTG) for which Nash equilibrium solutions are sought. However, solving such problems requires solving coupled Riccati equations, which cannot be done in a distributed manner. A recent study showed that a distributed implementation is possible for a consensus problem when fictitious agents are associated with edges in the network graph rather than nodes. This paper proposes an extension of this approach to formation control with collision avoidance, where collision is precluded by including appropriate penalty terms on the edges. To address the problem, a state-dependent Riccati equation needs to be solved since the collision avoidance term in the cost function leads to a state-dependent weight matrix. This solution provides relative control inputs associated with the edges of the network graph. These relative inputs then need to be mapped to the physical control inputs applied at the nodes; this can be done in a distributed manner by iterating over a gradient descent search between neighbors in each sampling interval. Unlike inter-sample iteration frequently used in distributed MPC, only a matrix-vector multiplication is needed for each iteration step here, instead of an optimization problem to be solved. This approach can be implemented in a receding horizon manner, this is demonstrated through a numerical example.
|
|
WeB08 Regular Session, Simpor Junior 4812 |
Add to My Program |
Optimal Control II |
|
|
Chair: Chong, Edwin K. P. | Colorado State University |
Co-Chair: Watson, Jeremy | University of Canterbury |
|
13:30-13:50, Paper WeB08.1 | Add to My Program |
Inverse Optimal Control and Passivity-Based Design for Converter-Based Microgrids |
|
Hallinan, Liam | University of Cambridge |
Watson, Jeremy | University of Canterbury |
Lestas, Ioannis | University of Cambridge |
Keywords: Optimal control, Decentralized control, Power systems
Abstract: Passivity-based approaches have been suggested as a solution to the problem of decentralised control design in many multi-agent network control problems due to the plug- and-play functionality they provide. However, it is not clear if these controllers are optimal at a network level due to their inherently local formulation, with designers often relying on heuristics to achieve desired global performance. On the other hand, solving for an optimal controller is not guaranteed to produce a passive system. In this paper, we address these dual problems by using inverse optimal control theory to formulate a set of sufficient local conditions, which when satisfied ensure that the resulting decentralised control policies are the solution to a network optimal control problem, while at the same time satisfying appropriate passivity properties. These conditions are then reformulated into a set of linear matrix inequalities (LMIs) which can be solved to obtain such controllers for linear systems. The proposed approach is demonstrated through a DC microgrid case study. The results substantiate the feasibility and efficacy of the presented method.
|
|
13:50-14:10, Paper WeB08.2 | Add to My Program |
Penalized Least-Squares Method for LQR Problem of Singular Systems |
|
Nosrati, Komeil | Tallinn University of Technology |
Belikov, Juri | Tallinn University of Technology |
Tepljakov, Aleksei | Tallinn University of Technology |
Petlenkov, Eduard | Tallinn University of Technology |
Keywords: Optimal control, Differential-algebraic systems, Linear systems
Abstract: The linear quadratic regulator (LQR) algorithms devised using the Riccati equation possess two key attributes---they are recursive and have easily met conditions of existence. Nevertheless, these features only apply for the transformed structure of the regulated dynamics in singular systems, otherwise their optimal performance will be compromised under violation of constraints in non-singular versions. This technical note presents the LQR problem for a time-varying discrete linear singular system in a direct manner avoiding any transformations. This approach eliminates the requirement of making regularity assumptions for the system. To achieve this, first, we formulate a quadratic cost function for LQR derivation based on a penalized weighted least-squares method. Then, by using Bellman's principle of optimality and performing variable substitutions, we connect the formulation to a constrained and recursive minimization problem. We then proceed with investigating the existence conditions and using dynamic programming in a backward strategy at the finite horizon to derive a recursive regulator algorithm for the original system in a matrix array framework, without degrading its optimal performance. The achieved algorithm has more general features compared to the classical LQR problem for standard systems. This study concludes with numerical evaluation of the algorithm and confirmation of the results.
|
|
14:10-14:30, Paper WeB08.3 | Add to My Program |
An Improved Greedy Curvature Bound in Finite-Horizon String Optimization with an Application to a Sensor Coverage Problem |
|
Van Over, Brandon | Colorado State University |
Li, Bowen | Colorado State University |
Chong, Edwin K. P. | Colorado State University |
Pezeshki, Ali | Colorado State University |
Keywords: Optimal control, Discrete event systems, Optimization
Abstract: We study the optimization problem of choosing strings of finite length to maximize string submodular functions on string matroids, which is a broader class of problems than maximizing set submodular functions on set matroids. We provide a lower bound for the performance of the greedy algorithm in our problem, and then prove that our bound is superior to the greedy curvature bound of Conforti and Cornuéjols. Our bound has lower computational complexity than most previously proposed curvature bounds. Finally, we demonstrate the strength of our result on a sensor coverage problem.
|
|
14:30-14:50, Paper WeB08.4 | Add to My Program |
Optimal Boundary State Feedback Control by Triangularization of the Counterflow Heat Exchanger Model |
|
Kadima Kazaku, Jacques | Université Catholique De Louvain, Université De Lubumbashi |
Dochain, Denis | Univ. Catholique De Louvain |
Keywords: Optimal control, Distributed parameter systems, Process Control
Abstract: In this paper we are interested in the LQ-optimal boundary control of counterflow heat exchanger. The dynamics of this system is described (under some assumptions) by hyperbolic partial differential equations (PDEs) and contains singularities which do not guarantee in some cases the uniqueness of solution of the operator Riccati equation. To address this issue, we first propose a state transformation that involves solving a Riccati differential equation, and that allows to put the system in a lower triangular form. Next, for the reachability analysis, the model has been rewritten as an abstract boundary control system with bounded control and observation operators. Finally, the design of an optimal control law with integral action is considered. The results are illustrated by means of numerical simulations for the set point tracking, and show the interest of the control approach proposed in this paper.
|
|
14:50-15:10, Paper WeB08.5 | Add to My Program |
Sparse Approximate Hamilton-Jacobi Solutions for Optimal Feedback Control with Terminal Constraints |
|
Jain, Amit | Penn State University |
Eapen, Roshan Thomas | Pennsylvania State University |
Singla, Puneet | The Pennsylvania State University |
Keywords: Optimal control
Abstract: A semi-analytic method is proposed to solve a class of optimal control problems while exploiting its underlying Hamiltonian structure. Optimal control problems with a fixed final state at a fixed terminal time are considered. The solution methodology proposed in this work solves the Hamilton-Jacobi equation over a predefined domain of states and co-states. The advantage over traditional methods is that an approximate generating function (analogous to the value function of HJB theory) is obtained as a function of time, which allows for the computation of co-states for any final time and final state specified. Numerical experiments are conducted to demonstrate the efficacy of developed method while considering benchmark problems including spin stabilization.
|
|
15:10-15:30, Paper WeB08.6 | Add to My Program |
A Model-Free Iteration Algorithm for Markov Jump Linear Systems Based on Gauss-Seidel Method |
|
Fan, Wenwu | University of Science and Technology of China |
Xiong, Junlin | University of Science and Technology of China |
Keywords: Optimal control, Linear parameter-varying systems, Machine learning
Abstract: This paper focuses on the linear quadratic regulator problem of discrete-time Markov jump linear systems without knowing the system matrices. A model-free fixed-point iteration algorithm is proposed to learn the optimal state feedback control law without the requirement of an initial admissible control policy. Analogous to the Gauss-Seidel method for linear equations, the model-free algorithm is constantly iterating with the latest information of each mode. It is proved that the algorithm converges monotonically to the optimal solution. In addition, our algorithm is faster than the classical model-based value iteration method. Finally, an example is used to illustrate our results.
|
|
WeB09 Regular Session, Simpor Junior 4811 |
Add to My Program |
Optimization Algorithms II |
|
|
Chair: Van Scoy, Bryan | Miami University |
Co-Chair: Breschi, Valentina | Eindhoven University of Technology |
|
13:30-13:50, Paper WeB09.1 | Add to My Program |
Accelerated Nonconvex ADMM with Self-Adaptive Penalty for Rank-Constrained Model Identification |
|
Liu, Qingyuan | Tsinghua University |
Huang, Zhengchao | Tsinghua University |
Ye, Hao | Tsinghua University |
Huang, Dexian | Tsinghua University |
Shang, Chao | Tsinghua University |
Keywords: Optimization algorithms, Identification, Numerical algorithms
Abstract: The alternating direction method of multipliers (ADMM) has been widely adopted in low-rank approximation and low-order model identification tasks; however, the performance of nonconvex ADMM is highly reliant on the choice of penalty parameter. To accelerate ADMM for solving rank-constrained identification problems, this paper proposes a new self-adaptive strategy for automatic penalty update. Guided by first-order analysis of the increment of the augmented Lagrangian, the self-adaptive penalty updating enables effective and balanced minimization of both primal and dual residuals and thus ensures a stable convergence. Moreover, improved efficiency can be obtained within the Anderson acceleration scheme. Numerical examples show that the proposed strategy significantly accelerates the convergence of nonconvex ADMM while alleviating the critical reliance on tedious tuning of penalty parameters.
|
|
13:50-14:10, Paper WeB09.2 | Add to My Program |
Online Learning with Adversaries: A Differential-Inclusion Analysis |
|
Ganesh, Swetha | Indian Institute of Science |
Reiffers, Alexandre | IMT Atlantique |
Thoppe, Gugan | Indian Institute of Science |
Keywords: Optimization algorithms, Learning, Randomized algorithms
Abstract: We introduce an observation-matrix-based framework for fully asynchronous online Federated Learning (FL) with adversaries. In this work, we demonstrate its effectiveness in estimating the mean of a random vector. Our main result is that the proposed algorithm almost surely converges to the desired mean mu. This makes ours the first asynchronous FL method to have an a.s. convergence guarantee in the presence of adversaries. We derive this convergence using a novel differential-inclusion-based two-timescale analysis. Two other highlights of our proof include (a) the use of a novel Lyapunov function to show that mu is the unique global attractor for our algorithm's limiting dynamics, and (b) the use of martingale and stopping-time theory to show that our algorithm's iterates are almost surely bounded.
|
|
14:10-14:30, Paper WeB09.3 | Add to My Program |
META-SMGO-∆: Similarity As a Prior in Black-Box Optimization |
|
Busetto, Riccardo | Politecnico Di Milano |
Breschi, Valentina | Eindhoven University of Technology |
Formentin, Simone | Politecnico Di Milano |
Keywords: Optimization algorithms, Learning
Abstract: When solving global optimization problems in practice, one often ends up repeatedly solving problems that are similar to each others. By introducing a rigorous definition of similarity to exploit priors obtained from past experience to efficiently solve new (similar) problems, in this work we incorporate the META-learning rationale into SMGO-∆, a global optimization approach recently proposed in the literature. Through a benchmark numerical example we show the practical benefits of our META-extension of the baseline algorithm, while providing theoretical bounds on its performance.
|
|
14:30-14:50, Paper WeB09.4 | Add to My Program |
Performance of Noisy Three-Step Accelerated First-Order Optimization Algorithms for Strongly Convex Quadratic Problems |
|
Samuelson, Samantha | University of Southern California |
Mohammadi, Hesameddin | University of Southern California |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Optimization algorithms, Linear systems, Stochastic optimal control
Abstract: We study the class of first-order algorithms in which the optimization variable is updated using information from three previous iterations. While two-step momentum algorithms akin to heavy-ball and Nesterov's accelerated methods achieve the optimal convergence rate, it is an open question if the three-step momentum method can offer advantages for problems in which exact gradients are not available. For strongly convex quadratic problems, we identify algorithmic parameters which achieve the optimal convergence rate and examine how additional momentum terms affects the trade-offs between acceleration and noise amplification. Our results suggest that for parameters that optimize the convergence rate, introducing additional momentum terms does not provide improvement in variance amplification relative to standard accelerated algorithms.
|
|
14:50-15:10, Paper WeB09.5 | Add to My Program |
Automated Lyapunov Analysis of Primal-Dual Optimization Algorithms: An Interpolation Approach |
|
Van Scoy, Bryan | Miami University |
Simpson-Porco, John W. | University of Toronto |
Lessard, Laurent | Northeastern University |
Keywords: Optimization algorithms, LMIs
Abstract: Primal-dual algorithms are frequently used for iteratively solving large-scale convex optimization problems. The analysis of such algorithms is usually done on a case-by-case basis, and the resulting guaranteed rates of convergence can be conservative. Here we consider a class of first-order algorithms for linearly constrained convex optimization problems, and provide a linear matrix inequality (LMI) analysis framework for certifying worst-case exponential convergence rates. Our approach builds on recent results for interpolation of convex functions and linear operators, and our LMI directly constructs a Lyapunov function certifying the guaranteed convergence rate. By comparing to rates established in the literature, we show that our approach can certify significantly faster convergence for this family of algorithms.
|
|
15:10-15:30, Paper WeB09.6 | Add to My Program |
Switch and Conquer: Efficient Algorithms by Switching Stochastic Gradient Oracles for Decentralized Saddle Point Problems |
|
Sharma, Chhavi | Indian Institute of Technology Bombay |
Narayanan, Vishnu | Indian Institute of Technology |
Palaniappan, Balamurugan | Indian Institute of Technology Bombay |
Keywords: Optimization algorithms, Machine learning, Large-scale systems
Abstract: We consider a class of non-smooth strongly convex-strongly concave saddle point problems in a decentralized setting without a central server. To solve a consensus formulation of problems in this class, we develop an inexact primal dual hybrid gradient (inexact PDHG) procedure that allows generic gradient computation oracles to update the primal and dual variables. We first investigate the performance of inexact PDHG with stochastic variance reduction gradient (SVRG) oracle. Our numerical study uncovers a significant phenomenon of initial conservative progress of iterates of IPDHG with SVRG oracle. To tackle this, we develop a simple and effective switching idea, where a generalized stochastic gradient (GSG) computation oracle is employed to hasten the iterates' progress to a saddle point solution during the initial phase of updates, followed by a switch to the SVRG oracle at an appropriate juncture. The proposed algorithm is named Decentralized Proximal Switching Stochastic Gradient method with Compression (C-DPSSG), and is proven to converge to an epsilon-accurate saddle point solution with linear rate. Apart from delivering highly accurate solutions, our study reveals that utilizing the best convergence phases of GSG and SVRG oracles makes C-DPSSG well suited for obtaining solutions of low/medium accuracy faster, useful for certain applications. Numerical experiments on two benchmark machine learning applications show C-DPSSG's competitive performance which validate our theoretical findings. The codes used in the experiments can be found href{https://github.com/chhavisharma123/C-DPSSG-CDC2023}{here}.
|
|
WeB10 Regular Session, Roselle Junior 4713 |
Add to My Program |
Machine Learning II |
|
|
Chair: Amidzadeh, Mohsen | Aalto University |
Co-Chair: Rodrigues, Luis | Concordia University |
|
13:30-13:50, Paper WeB10.1 | Add to My Program |
Stable and Safe Reinforcement Learning Via a Barrier-Lyapunov Actor-Critic Approach |
|
Zhao, Liqun | University of Oxford |
Gatsis, Konstantinos | University of Oxford |
Papachristodoulou, Antonis | University of Oxford |
Keywords: Machine learning, Learning, Autonomous robots
Abstract: Reinforcement learning (RL) has demonstrated impressive performance in various areas such as video games and robotics. However, ensuring safety and stability, which are two critical properties from a control perspective, remains a significant challenge when using RL to control real-world systems. In this paper, we first provide definitions of safety and stability for the RL system, and then combine the control barrier function (CBF) and control Lyapunov function (CLF) methods with the actor-critic method in RL to propose a Barrier-Lyapunov Actor-Critic (BLAC) framework which helps maintain the aforementioned safety and stability for the system. In this framework, CBF constraints for safety and CLF constraint for stability are constructed based on the data sampled from the replay buffer, and the augmented Lagrangian method is used to update the parameters of the RL-based controller. Furthermore, an additional backup controller is introduced in case the RL-based controller cannot provide valid control signals when safety and stability constraints cannot be satisfied simultaneously. Simulation results show that this framework yields a controller that can help the system approach the desired state and cause fewer violations of safety constraints compared to baseline algorithms.
|
|
13:50-14:10, Paper WeB10.2 | Add to My Program |
A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis |
|
Amidzadeh, Mohsen | Aalto University |
Keywords: Machine learning, Learning, Iterative learning control
Abstract: Many sequential decision-making problems need optimization of different objectives which possibly conflict with each other. The conventional way to deal with a multi-task problem is to establish a scalar objective function based on a linear combination of different objectives. However, for the case of having conflicting objectives with different scales, this method needs a trial-and-error approach to properly find proper weights for the combination. As such, in most cases, this approach cannot guarantee an optimal Pareto solution. In this paper, we develop a single-agent scale-independent multi-objective reinforcement learning on the basis of the Advantage Actor-Critic (A2C) algorithm. A convergence analysis is then done for the devised multi-objective algorithm providing a convergence-in-mean guarantee. We then perform some experiments over a multi-task problem to evaluate the performance of the proposed algorithm. Simulation results show the superiority of developed multi-objective A2C approach against the single-objective algorithm.
|
|
14:10-14:30, Paper WeB10.3 | Add to My Program |
KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Discrete-Time Systems |
|
Lale, Sahin | Caltech |
Shi, Yuanyuan | University of California San Diego |
Qu, Guannan | Carnegie Mellon University |
Azizzadenesheli, Kamyar | Purdue University |
Wierman, Adam | California Institute of Technology |
Anandkumar, Animashree | California Institute of Technology |
Keywords: Machine learning, Learning, Stability of nonlinear systems
Abstract: Learning a dynamical system requires stabilizing the unknown dynamics to avoid state blow-ups. However, the standard reinforcement learning (RL) methods lack formal stabilization guarantees, which limits their applicability for the control of real-world dynamical systems. We propose a novel policy optimization method that adopts Krasovskii's family of Lyapunov functions as a stability constraint. We show that solving this stability-constrained optimization problem using a primal-dual approach recovers a stabilizing policy for the underlying system even under modeling error. Combining this method with model learning, we propose a model-based RL framework with formal stability guarantees, Krasovskii-Constrained Reinforcement Learning (KCRL). We theoretically study KCRL with kernel-based feature representation in model learning and provide a sample complexity guarantee to learn a stabilizing controller for the underlying system. Further, we empirically demonstrate the effectiveness of KCRL in learning stabilizing policies in online voltage control of a distributed power system. We show that KCRL stabilizes the system under various real-world solar and electricity demand profiles, whereas standard RL methods often fail to stabilize.
|
|
14:30-14:50, Paper WeB10.4 | Add to My Program |
Distributionally Robust Behavioral Cloning for Robust Imitation Learning |
|
Panaganti, Kishan | Texas A&M University |
Xu, Zaiyan | Texas A&M University |
Kalathil, Dileep | Texas A&M University (TAMU) |
Ghavamzadeh, Mohammad | Adobe Systems Inc |
Keywords: Machine learning, Learning
Abstract: Robust reinforcement learning (RL) aims to learn a policy that can withstand uncertainties in model parameters, which often arise in practical RL applications due to modeling errors in simulators, variations in real-world system dynamics, and adversarial disturbances. This paper introduces the robust imitation learning (IL) problem in a Markov decision process (MDP) framework where an agent learns to mimic an expert demonstrator that can withstand uncertainties in model parameters without additional online environment interactions. The agent is only provided with a dataset of state-action pairs from the expert on a single (nominal) dynamics, without any information about the true rewards from the environment. Behavioral cloning (BC), a supervised learning method, is a powerful algorithm to address the vanilla IL problem. We propose an algorithm for the robust IL problem that utilizes distributionally robust optimization (DRO) with BC. We call the algorithm DR-BC and show its robust performance against parameter uncertainties both in theory and in practice. We also demonstrate the empirical performance of our approach to addressing model perturbations on several MuJoCo continuous control tasks.
|
|
14:50-15:10, Paper WeB10.5 | Add to My Program |
Learning Stable and Robust Linear Parameter-Varying State-Space Models |
|
Verhoek, Chris | Eindhoven University of Technology |
Wang, Ruigang | The University of Sydney |
Tóth, Roland | Eindhoven University of Technology |
Keywords: Machine learning, Linear parameter-varying systems, Stability of linear systems
Abstract: This paper presents two direct parameterizations of stable and robust linear parameter-varying state-space (LPV-SS) models. The model parametrizations guarantee a priori that for all parameter values during training, the allowed models are stable in the contraction sense or have their Lipschitz constant bounded by a user-defined value gamma. Furthermore, since the parametrizations are direct, the models can be trained using unconstrained optimization. The fact that the trained models are of the LPV-SS class makes them useful for, e.g., further convex analysis or controller design. The effectiveness of the approach is demonstrated on an LPV identification problem.
|
|
15:10-15:30, Paper WeB10.6 | Add to My Program |
System Identification and Control Using Quadratic Neural Networks |
|
Rodrigues, Luis | Concordia University |
Givigi, Sidney | Queen's University |
Keywords: Machine learning, LMIs, Nonlinear systems identification
Abstract: This paper proposes convex formulations of system identification and control for nonlinear systems using two layer quadratic neural networks. The results in the paper cast system identification, stability and control design as convex optimization problems, which can be solved efficiently with polynomial-time algorithms. The main advantage of using quadratic neural networks for system identification and control as opposed to other neural networks is the fact that they provide a smooth (quadratic) mapping between the input and the output of the network. This allows one to cast stability and control for quadratic neural network models as a Sum of Squares (SOS) optimization, which is a convex optimization program that can be efficiently solved. Additionally, these networks offer other advantages, such as the fact that the architecture is a by-product of the design and is not determined a-priori, and the training can be done by solving a convex optimization problem so that the global optimum of the weights is achieved. It also appears from the examples in this paper that quadratic networks work extremely well using only a small fraction of the training data.
|
|
WeB11 Regular Session, Roselle Junior 4712 |
Add to My Program |
Agent-Based Systems II |
|
|
Chair: Tayebi, Abdelhamid | Lakehead University |
Co-Chair: Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
|
13:30-13:50, Paper WeB11.1 | Add to My Program |
Convergence of Opinion Dynamics under Social Pressure for General Networks |
|
Tang, Jennifer | MIT |
Adler, Aviv | MIT |
Ajorlou, Amir | Massachusetts Institute of Technology |
Jadbabaie, Ali | Massachusetts Institute of Technology |
Keywords: Agents-based systems, Distributed control, Stochastic systems
Abstract: Social pressure is a key factor affecting the evolution of opinions on networks in many types of settings, pushing people to conform to their neighbors opinions. To study this, the interacting P ́olya urn model was introduced by Jadbabaie et al. [1], in which each agent has two kinds of opinion: inherent beliefs, which are hidden from the other agents and fixed; and declared opinions, which are randomly sampled at each step from a distribution which depends on the agents inherent belief and her neighbors past declared opinions (the social pressure component), and which is then communicated to their neighbors. Each agent also has a bias parameter denoting her level of resistance to social pressure. At every step, each agent updates her declared opinion (simultaneously with all other agents) according to her neighbors aggregate past declared opinions, her inherent belief, and her bias parameter. We study the asymptotic behavior of this opinion dynamics model and show that agents declaration probabilities converge almost surely in the limit using Lyapunov theory and stochastic approximation techniques. We also derive a sufficient condition for the agents to approach consensus on their declared opinions.
|
|
13:50-14:10, Paper WeB11.2 | Add to My Program |
Continuum Swarm Tracking Control: A Geometric Perspective in Wasserstein Space |
|
Emerick, Max | University of California Santa Barbara |
Bamieh, Bassam | Univ. of California at Santa Barbara |
Keywords: Agents-based systems, Distributed parameter systems, Algebraic/geometric methods
Abstract: We consider a setting in which one swarm of agents is to service or track a second swarm, and formulate an optimal control problem which trades off between the competing objectives of servicing and motion costs. We consider the continuum limit where large-scale swarms are modeled in terms of their time-varying densities, and where the Wasserstein distance between two densities captures the servicing cost. We show how this non-linear infinite-dimensional optimal control problem is intimately related to the geometry of Wasserstein space, and provide new results in the case of absolutely continuous densities and constant-in-time references. Specifically, we show that optimal swarm trajectories follow Wasserstein geodesics, while the optimal control tradeoff determines the time-schedule of travel along these geodesics. We briefly describe how this solution provides a basis for a model-predictive control scheme for tracking time-varying and real-time reference trajectories as well.
|
|
14:10-14:30, Paper WeB11.3 | Add to My Program |
Continuification Control of Large-Scale Multiagent Systems under Limited Sensing and Structural Perturbations |
|
Maffettone, Gian Carlo | Scuola Superiore Meridionale |
Porfiri, Maurizio | New York University Tandon School of Engineering |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Agents-based systems, Distributed parameter systems, Large-scale systems
Abstract: We investigate the stability and robustness properties of a continuification-based strategy for the control of large-scale multiagent systems. Within this framework, one transforms the microscopic, agent-level description of the system dynamics into a macroscopic continuum-level, for which a control action can be synthesized to steer the macroscopic dynamics towards a desired distribution. Such an action is ultimately discretized to obtain a set of deployable control inputs for the agents to achieve the goal. The mathematical proof of convergence toward the desired distribution typically relies on the assumptions that no disturbance is present and that each agent possesses global knowledge of all the others' positions. Here, we analytically and numerically address the possibility of relaxing these assumptions for the case of a one-dimensional system of agents moving in a ring. We offer compelling evidence in favor of the use of a continuification-based strategy when agents only possess a finite sensing capability and spatio-temporal perturbations affect the macroscopic dynamics of the ensemble. We also discuss some preliminary results about the benefits of adding an integral action in the macroscopic control solution.
|
|
14:30-14:50, Paper WeB11.4 | Add to My Program |
Tuning Convergence Rate Via Non-Bayesian Social Learning: A Trade-Off between Internal Belief and External Information |
|
Sui, Dongyan | Fudan University |
Guan, Chun | Fudan University |
Gan, Zhongxue | Fudan University |
Lin, Wei | Fudan University |
Leng, Siyang | Fudan University |
Keywords: Agents-based systems, Distributed parameter systems, Networked control systems
Abstract: Social learning strategies have been recently developed for multi-agents to learn progressively an underlying state of nature by information communications and evolutions. Existing works define algorithms mainly by swapping the Bayesian update and belief aggregation steps and/or discovering diverse underlying network structures. Inspired by the diversity of agents when they are exposed to new information, this work designs a non-Bayesian learning strategy, named as Parametric Social Learning, by introducing an agent stubbornness parameter to trade-off the significance between its internal belief and external information. This strategy thus allows for tuning the convergence rate by adjusting the introduced parameter, which is consistent highly with the sociological intuition. Theoretical analyses and numerical examples are provided to illustrate several sociological insights. Our work therefore has appealing potential in practical tasks such as dispersed information aggregation and distributed parameter estimation.
|
|
14:50-15:10, Paper WeB11.5 | Add to My Program |
Pure Pursuit Strategy Enhanced with Defense Margin under Noisy Measurements for Protective Missions |
|
Sung, Minjun | University of Illinois at Urbana-Champaign |
Hiltebrandt-McIntosh, Christophe | University of Illinois at Urbana-Champaign |
Kim, Hunmin | Mercer University |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Keywords: Agents-based systems, Emerging control applications, Uncertain systems
Abstract: This paper investigates the problem of protecting a safe zone against rogue drone intrusion when the defender has noisy observations. The conventional strategies were not sufficient to achieve high mission success rates, prompting the introduction of a concept called defense margin. The proposed strategy improves upon the Pure Pursuit (PP) strategy by incorporating the defense margin strategy, offering better performance compared to using either strategy alone. Simulation results demonstrate the effectiveness of the proposed strategy, resulting in higher mission success rates.
|
|
15:10-15:30, Paper WeB11.6 | Add to My Program |
Bearing-Based Distributed Pose Estimation for Multi-Agent Networks |
|
Boughellaba, Mouaad | Lakehead University |
Tayebi, Abdelhamid | Lakehead University |
Keywords: Agents-based systems, Estimation, Observers for nonlinear systems
Abstract: In this paper, we address the distributed pose estimation problem for multi-agent systems under a directed graph topology, where two agents have access to their respective poses, and the other agents have unknown static positions and time-varying orientations. The proposed estimation scheme consists of two cascaded distributed observers, an almost globally asymptotically stable (AGAS) attitude observer and an input-to-state stable (ISS) position observer, leading to an overall AGAS distributed localization scheme. Numerical simulation results are presented to illustrate the performance of our proposed distributed pose estimation scheme.
|
|
WeB12 Regular Session, Roselle Junior 4711 |
Add to My Program |
Cooperative Control II |
|
|
Chair: Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Co-Chair: Jeeninga, Mark | Politecnico Di Torino |
|
13:30-13:50, Paper WeB12.1 | Add to My Program |
Formation Control for Moving Target Enclosing Via Relative Localization |
|
Liu, Xueming | Sun Yat-Sen University |
Liu, Kunda | Sun Yat-Sen University |
Hu, Tianjiang | Sun Yat-Sen University |
Zhang, Qingrui | Sun Yat-Sen University |
Keywords: Cooperative control, Autonomous systems, Control applications
Abstract: In this paper, we investigate the problem of controlling multiple unmanned aerial vehicles (UAVs) to enclose a moving target in a distributed fashion based on a relative distance and self-displacement measurements. A relative localization technique is developed based on the recursive least square estimation (RLSE) technique with a forgetting factor to estimates both the ``UAV-UAV'' and ``UAV-target'' relative positions. The formation enclosing motion is planned using a coupled oscillator model, which generates desired motion for UAVs to distribute evenly on a circle. The coupled-oscillator-based motion can also facilitate the exponential convergence of relative localization due to its persistent excitation nature. Based on the generation strategy of desired formation pattern and relative localization estimates, a cooperative formation tracking control scheme is proposed, which enables the formation geometric center to asymptotically converge to the moving target. The asymptotic convergence performance is analyzed theoretically for both the relative localization technique and the formation control algorithm. Numerical simulations are provided to show the efficiency of the proposed algorithm. Experiments with three quadrotors tracking one target are conducted to evaluate the proposed target enclosing method in real platforms.
|
|
13:50-14:10, Paper WeB12.2 | Add to My Program |
Bearing-Only Formation Control with Bounded Disturbances in Agents' Local Coordinate Frames |
|
Garanayak, Chinmay | Indian Institute of Technology Bombay |
Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Keywords: Cooperative control, Autonomous systems, Decentralized control
Abstract: This paper studies formation control using bearing-only measurements for elevation angle rigid configurations in the presence of time-varying bounded disturbances. Elevation angle rigidity-based control laws ensure bearing-only formation control in agents' local frame of reference sans any orientation synchronization or orientation estimation algorithms. However, existing control laws do not account for bounded disturbances in the agents' dynamics. Motivated by this, we design bearing-only control laws for single integrators in agents' frame of reference and prove local finite-time convergence to the desired formation. Then control laws for double integrators are proposed, and local asymptotic stability is proved when agents' accelerations are affected by bounded disturbances. Simulations are provided to validate the claims.
|
|
14:10-14:30, Paper WeB12.3 | Add to My Program |
Formation Control of Underactuated AUVs Using the Hand Position Concept |
|
Lie, Erling S. | Norwegian University of Science and Technology - NTNU |
Matous, Josef | NTNU (Norwegian University of Science and Technology) |
Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Keywords: Cooperative control, Autonomous systems, Stability of nonlinear systems
Abstract: This paper presents an extended null-space-based behavioral algorithm for the formation control of fleets of underactuated autonomous underwater vehicles. The null-space-based controller is developed to work directly with second-order integrator systems, handling their dynamics in task space. The method is applied to the formation-path-following problem of a fleet of underactuated autonomous underwater vehicles. The nonlinear six-degrees-of-freedom model of the vehicle is transformed into a second-order integrator system using the 3D hand position output linearizing controller. The behavioral controller implements a hierarchy of path-following, formation-keeping, and collision-avoidance tasks. The closed-loop system is proven uniformly globally asymptotically stable, and the proposed method is validated through numerical simulations.
|
|
14:30-14:50, Paper WeB12.4 | Add to My Program |
Decentralized Lateral and Longitudinal Control of Vehicle Platoons with Constant Headway Spacing |
|
Wijnbergen, Paul | KTH Royal Institute of Technology |
Jeeninga, Mark | Politecnico Di Torino |
de Haan, Redmer | Eindhoven University of Technology |
Lefeber, Erjen | Eindhoven University of Technology |
Keywords: Cooperative control, Autonomous vehicles, Nonlinear systems
Abstract: The formation of platoons, where groups of vehicles follow each other at close distances, has the potential to increase road capacity. In this paper, a decentralized control approach is presented that extends the well-known constant headway vehicle following approach to the two-dimensional case, i.e., lateral control is included in addition to the longitudinal control. The presented control scheme employs a direct vehicle following approach where each vehicle in the platoon is responsible for following the directly preceding vehicle according to a nonlinear spacing policy. The proposed constant headway spacing policy is motivated by an approximation of a delay-based spacing policy and results in a generalization of the constant headway spacing policy to the two-dimensional case. By input-output linearization, necessary and sufficient conditions for the tracking of the nonlinear spacing policy are obtained, which motivate the synthesis of the lateral and longitudinal controllers of each vehicle in the platoon. By deriving an internal state representation of the follower vehicle and showing input-to-state stability, the internal dynamics for each leader-follower subsystem are shown to be well-behaved in case the leader drives in steady state conditions (i.e., the leader vehicle's trajectory is unexcited). The results are illustrated by a simulation.
|
|
14:50-15:10, Paper WeB12.5 | Add to My Program |
Finite-Time Control Protocol for Uniform Allocation of Second-Order Agents |
|
Zimenko, Konstantin | ITMO University |
Efimov, Denis | Inria |
Polyakov, Andrey | Inria, Univ. Lille |
Ping, Xubin | Xidian University |
Keywords: Cooperative control, Decentralized control, Lyapunov methods
Abstract: The paper addresses the problem of uniform finite-time robust allocations of second-order agents on a straight line. A decentralized homogeneous control protocol is proposed that uses only agent's states and local interactions (distances between two closest neighbors). Sufficient conditions of finite-time (input-to-state) stability are proposed in the form of Linear Matrix Inequalities. The theoretical results are illustrated via numerical simulations.
|
|
15:10-15:30, Paper WeB12.6 | Add to My Program |
A Novel Point-Based Algorithm for Multi-Agent Control Using the Common Information Approach |
|
Tang, Dengwang | University of Southern California |
Nayyar, Ashutosh | University of Southern California |
Jain, Rahul | University of Southern California |
Keywords: Cooperative control, Decentralized control, Stochastic optimal control
Abstract: The Common Information (CI) approach provides a systematic way to transform a multi-agent stochastic control problem to a single-agent partially observed Markov decision problem (POMDP) called the coordinator's POMDP. However, such a POMDP can be hard to solve due to its extraordinarily large action space. We propose a new algorithm for multi-agent stochastic control problems, called coordinator's heuristic search value iteration (CHSVI), that combines the CI approach and point-based POMDP algorithms for large action spaces. We demonstrate the algorithm through optimally solving several benchmark problems.
|
|
WeB13 Regular Session, Roselle Junior 4613 |
Add to My Program |
Control of Networks |
|
|
Chair: Kawano, Yu | Hiroshima University |
Co-Chair: Mousavi, Shima Sadat | ETH Zurich |
|
13:30-13:50, Paper WeB13.1 | Add to My Program |
Bipartite Formation Over Undirected Signed Networks with Collision Avoidance |
|
Sekercioglu, Pelin | ONERA, Univ Paris-Saclay |
Sarras, Ioannis | ONERA |
Loria, Antonio | CNRS |
Panteley, Elena | CNRS |
Marzat, Julien | ONERA - the French Aerospace Lab |
Keywords: Control of networks, Constrained control, Lyapunov methods
Abstract: We address the problem of bipartite formation control, with collision avoidance, for double integrators with limited sensing ranges. We assume that the systems are interconnected over an undirected, signed, and structurally balanced network. Then, to ensure that the proximity constraints are satisfied, we design a barrier-Lyapunov-function-based control law that guarantees connectivity maintenance for cooperative agents, and inter-agent collision avoidance for all agents. Relying on the edge-based agreement, we establish asymptotic stability of the bipartite formation control for signed networks. Finally, we illustrate our theoretical results via numerical simulations.
|
|
13:50-14:10, Paper WeB13.2 | Add to My Program |
Distributed Spatial Filtering by a Two-Hop Consensus-Type Algorithm |
|
Izumi, Shinsaku | Kochi University of Technology |
Xin, Xin | Okayama Prefectural University |
Yamasaki, Taiga | Okayama Prefectural University |
Keywords: Control of networks, Distributed control, Sensor networks
Abstract: In this study, we discuss distributed spatial filtering (DSF) on networked systems to obtain signal values with a desired spatial frequency characteristic from those assigned to nodes by a distributed algorithm. We present a two-hop consensus-type algorithm for DSF based on an existing one-hop algorithm. We prove that the range of the filter characteristics the presented algorithm can achieve is broader than that for the existing algorithm by deriving a necessary and sufficient condition for achieving DSF. Simulation results show that our filtering algorithm and a new filter characteristic it provides are effective in distributed anomaly detection by sensor networks.
|
|
14:10-14:30, Paper WeB13.3 | Add to My Program |
Krasovskii and Shifted Passivity Approaches to Mixed Input/Output Consensus |
|
Kawano, Yu | Hiroshima University |
Cucuzzella, Michele | University of Pavia |
Scherpen, Jacquelien M.A. | University of Groningen |
Keywords: Control of networks, Distributed control, Lyapunov methods
Abstract: In this letter, we consider nonlinear network systems under unknown disturbance and address the problem of mixed input/output consensus, i.e., consensus among disjoint sets of input and output nodes. We develop two control schemes based on different notions of passivity: 1) Krasovskii passivity and 2) shifted passivity. Furthermore, we propose an input consensus controller which is applicable to either Krasovskii or shifted passive systems. Finally, we validate the proposed controllers in simulation by achieving current sharing in a heterogeneous DC microgrid and power sharing in an AC power system, which are Krasovskii and shifted passive, respectively.
|
|
14:30-14:50, Paper WeB13.4 | Add to My Program |
Minimal Control Placement of Turing's Model Using Symmetries |
|
Cao, Yuexin | KTH Royal Institute of Technology |
Li, Yibei | Nanyang Technological University |
Liu, Zhixin | Academy of Mathematics and Systems Science, ChineseAcademyof Scie |
Zheng, Lirong | Fudan University |
Hu, Xiaoming | Royal Institute of Technology |
Keywords: Control of networks, Linear systems, Control system architecture
Abstract: In this paper, the minimal control placement problem for the Turing's reaction-diffusion system is investigated. The two-dimensional RD system is discretized into square grids and the nodes in the outermost layer are considered as control candidates. Symmetric control sets are defined naturally using the property of symmetry of the network structure. The minimal control placement problem for the diffusion system is investigated first. The necessary condition is provided based on the idea of symmetric control sets. Then we prove that this condition is also sufficient to ensure controllability when the multiplicity of eigenvalues satisfies certain conditions. We show further that symmetric control sets can be extended and prove that the necessary condition can also be applied to the reaction-diffusion system, i.e., the Turing's model. The sufficient condition is proved to be effective for the reaction-diffusion system under similar circumstances. Our conclusion can also be useful for other multi-agent systems with the same topology. Directions of future study include nonlinear reaction terms and time-varying systems.
|
|
14:50-15:10, Paper WeB13.5 | Add to My Program |
Influencing Opinions in a Nonlinear Pinning Control Model |
|
Ancona, Camilla | Università Di Napoli Federico II |
De Lellis, Pietro | University of Naples Federico II |
Lo Iudice, Francesco | Università Di Napoli Federico II |
Keywords: Control of networks, Network analysis and control, Control applications
Abstract: This letter studies how opinions and subsequent actions of groups of individuals are shaped by opinion leaders, nowadays denoted influencers. We model an influencer as a pinner that exerts a control input on a small subset of individuals, and leverages the interaction network to affect the action of a large fraction of individuals. We provide sufficient conditions so that a given agent takes the same action as the pinner. Based on these conditions, we design a heuristic for the pinned node selection that maximizes the number of nodes taking the action elected by the pinner. The performance of the heuristic is then numerically tested against standard pinning strategies.
|
|
WeB14 Regular Session, Roselle Junior 4612 |
Add to My Program |
Identification I |
|
|
Chair: Lee, Jin Gyu | INRIA |
Co-Chair: Moreschini, Alessio | Imperial College London |
|
13:30-13:50, Paper WeB14.1 | Add to My Program |
Parametric Continuous-Time Blind System Identification |
|
Elton, Augustus | University of Newcastle |
González, Rodrigo A. | Eindhoven University of Technology |
Welsh, James S. | University of Newcastle |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Fu, Minyue | Southern University of Science and Technology |
Keywords: Identification, Linear systems
Abstract: In this paper, the blind system identification problem for continuous-time systems is considered. A direct continuous-time estimator is proposed by utilising a state-variable-filter least squares approach. In the proposed method, coupled terms between the numerator polynomial of the system and input parameters appear in the parameter vector which are subsequently separated using a rank-1 approximation. An algorithm is then provided for the direct identification of a single-input single-output linear time-invariant continuous-time system which is shown to satisfy the property of correctness under some mild conditions. Monte Carlo simulations demonstrate the performance of the algorithm and verify that a model and input signal can be estimated to a proportion of their true values.
|
|
13:50-14:10, Paper WeB14.2 | Add to My Program |
An Efficient Implementation for Kernel-Based Regularized System Identification with Periodic Input Signals |
|
Shen, Zhuohua | The Chinese University of Hong Kong, Shenzhen |
Xu, Yu | The Chinese University of Hong Kong, Shenzhen |
Andersen, Martin S. | Technical University of Denmark |
Chen, Tianshi | The Chinese University of Hong Kong, Shenzhen, China |
Keywords: Identification, Linear systems, Computational methods
Abstract: Efficient implementation of algorithms for kernel-based regularized system identification is an important issue. The state of art result is based on semiseparable kernels and a class of commonly used test input signals in system identification and automatic control, and with such input signals, the output kernel is semiseparable and exploring this structure gives rise to very efficient implementation. In this paper, we consider instead the periodic input signals, which is another class of commonly used test input signals. Unfortunately, with periodic input signals, the output kernel is NOT semiseparable. Nevertheless, it can be shown that the output kernel matrix is hierarchically semiseparable. Moreover, it is possible to develop efficient implementation of algorithms by exploring the hierarchically semiseparable structure of the output kernel matrix and the periodic structure of the regression matrix. The efficiency of the proposed implementation of algorithms is demonstrated by Monte Carlo simulations.
|
|
14:10-14:30, Paper WeB14.3 | Add to My Program |
Realization from Moments: The Linear Case |
|
Lee, Jin Gyu | INRIA |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Identification, Model/Controller reduction, Linear systems
Abstract: By exploiting the time-domain notion of moments we recover a time-domain counterpart of the fact that a certain number of steady-state responses is sufficient to realize a linear system. This may pave the way to a realization theory for nonlinear systems based on their steady-state responses.
|
|
14:30-14:50, Paper WeB14.4 | Add to My Program |
Kernel-Based Continuous-Time System Identification: A Parametric Approximation |
|
Scandella, Matteo | Imperial College London |
Moreschini, Alessio | Imperial College London |
Parisini, Thomas | Imperial College & Univ. of Trieste |
Keywords: Identification, Linear systems, Statistical learning
Abstract: In this paper, we discuss the non-parametric estimate problem using kernel-based LTI system identification techniques by constructing a Loewner-based interpolant of the estimated model. Through this framework, we have been able to retrieve a finite-dimensional approximation of the infinite-dimensional estimate obtained using the classical kernel-based methodologies. The employment of the Loewner framework constitutes an enhancement of recent results which propose to use a Padé approximant to obtain a rational transfer function from an irrational transfer function corresponding to the identified impulse response. The enhancement has been illustrated for the identification of the Rao-Garnier benchmark.
|
|
14:50-15:10, Paper WeB14.5 | Add to My Program |
Towards Scalable Kernel-Based Regularized System Identification |
|
Chen, Lujing | Technical University of Denmark |
Chen, Tianshi | The Chinese University of Hong Kong, Shenzhen, China |
Detha, Utkarsh | MOSEK ApS |
Andersen, Martin S. | Technical University of Denmark |
Keywords: Identification, Linear systems, Computational methods
Abstract: This paper proposes a methodology for scalable kernel-based regularized system identification based on indirect methods. It leverages stochastic trace estimation methods and an iterative solver such as LSQR for the efficient evaluation of hyperparameter selection criteria. It also uses a derivative-free optimization approach to hyperparameter estimation, which avoids the need for computing gradients or Hessians of the objective function. Moreover, the method is matrix-free, which means it only relies on a matrix-vector oracle and exploits fast routines for various structured matrix-vector products. Our preliminary numerical experiments indicate that the methodology scales significantly better than direct methods, especially when dealing with large datasets and slowly decaying impulse responses.
|
|
15:10-15:30, Paper WeB14.6 | Add to My Program |
On the Relation between Discrete and Continuous-Time Refined Instrumental Variable Methods |
|
González, Rodrigo A. | Eindhoven University of Technology |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Pan, Siqi | University of Newcastle |
Welsh, James S. | University of Newcastle |
Keywords: Identification, Linear systems, Estimation
Abstract: The Refined Instrumental Variable method for discrete-time systems (RIV) and its variant for continuous-time systems (RIVC) are popular methods for the identification of linear systems in open-loop. The continuous-time equivalent of the transfer function estimate given by the RIV method is commonly used as an initialization point for the RIVC estimator. In this paper, we prove that these estimators share the same converging points for finite sample size when the continuous-time model has relative degree zero or one. This relation does not hold for higher relative degrees. Then, we propose a modification of the RIV method whose continuous-time equivalent is equal to the RIVC estimator for any non-negative relative degree. The implications of the theoretical results are illustrated via a simulation example.
|
|
WeB15 Regular Session, Roselle Junior 4611 |
Add to My Program |
Adaptive Control II |
|
|
Chair: Kamalapurkar, Rushikesh | Oklahoma State University |
Co-Chair: Ampountolas, Konstantinos | University of Thessaly |
|
13:30-13:50, Paper WeB15.1 | Add to My Program |
Deep Lyapunov-Based Physics-Informed Neural Networks (DeLb-PINN) for Adaptive Control Design |
|
Hart, Rebecca | University of Florida |
Patil, Omkar Sudhir | University of Florida |
Griffis, Emily | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Adaptive control, Neural networks, Stability of nonlinear systems
Abstract: Physics-informed learning is an emerging machine learning technique driven by the desire to leverage known physical principles in machine learning algorithms. Recent developments have produced physics-informed neural networks (PINNs) which are neural networks designed to be constrained by known physical principles. However, developing real-time adaptive control methods with stability guarantees for PINNs remains an open problem. This paper develops the first result for a deep Lyapunov-based physics-informed neural network (DeLb-PINN) architecture to adaptively control uncertain Euler-Lagrange systems. Lyapunov-derived weight adaptation laws provide continuous, online learning using the DeLb-PINN architecture without the need for offline training. A nonsmooth desired compensation adaptation law (DCAL) Lyapunov-based analysis is provided to guarantee global asymptotic tracking error convergence.
|
|
13:50-14:10, Paper WeB15.2 | Add to My Program |
State and Parameter Estimation for Affine Nonlinear Systems |
|
Ogri, Tochukwu Elijah | Oklahoma State University |
Bell, Zachary I. | Air Force |
Kamalapurkar, Rushikesh | Oklahoma State University |
Keywords: Adaptive control, Nonlinear output feedback, Observers for nonlinear systems
Abstract: This paper proposes a new approach to online state and parameter estimation for affine nonlinear systems. Unlike conventional methods limited to specific classes of nonlinear systems and reliant on stringent excitation conditions, the proposed approach uses multiplier matrices and a data-driven concurrent learning method to develop an adaptive observer for affine nonlinear systems. Through rigorous Lyapunov-based analysis, the technique is proven to guarantee locally uniformly ultimately bounded state estimates and ultimately bounded parameter estimation errors. Additionally, under certain excitation conditions, the parameter estimation error is guaranteed to converge to a given neighborhood of the origin.
|
|
14:10-14:30, Paper WeB15.3 | Add to My Program |
Physics-Informed Neural Networks for Learning the Parameters of Commercial Adaptive Cruise Control Systems |
|
Apostolakis, Theocharis | University of Thessaly |
Ampountolas, Konstantinos | University of Thessaly |
Keywords: Adaptive control, Nonlinear systems identification, Neural networks
Abstract: This paper develops a physics-informed neural network (PINN) for learning the parameters of commercially implemented adaptive cruise control (ACC) systems. The constant time-headway policy (CTHP) is adopted to emulate the core functionality of stock ACC systems (proprietary control logic and its parameters) which is not publicly available. Multi-layer artificial neural networks is a class of universal approximators, and thus the developed PINN can serve as a surrogate approximator to capture the longitudinal dynamics of ACC-engaged vehicles and efficiently learn the unknown parameters of the CTHP. The ability of the PINN to infer the unknown ACC parameters is tested on both synthetic and empirical data of space-gap and relative velocity involved ACC-engaged vehicles in platoon formation. The results have demonstrated the superior predictive ability of the proposed PINN to learn the unknown design parameters of stock ACC systems of different vehicle makes. The set of ACC model parameters obtained from the PINN revealed that the stock ACC system of the considered vehicles in three experimental campaigns is neither L2 nor L∞ string stable.
|
|
14:30-14:50, Paper WeB15.4 | Add to My Program |
Convex Q-Learning in Continuous Time with Application to Dispatch of Distributed Energy Resources |
|
Lu, Fan | University of Florida |
Mathias, Joel | Arizona State University |
Meyn, Sean P. | Univ. of Florida |
Kalsi, Karan | Pacific Northwest National Lab |
Keywords: Adaptive control, Optimal control, Machine learning
Abstract: Convex Q-learning is a recent approach to reinforcement learning, motivated by the possibility of a firmer theory for convergence, and the possibility of making use of greater a~priori knowledge regarding policy or value function structure. This paper explores algorithm design in the continuous time domain, with a finite-horizon optimal control objective. The main contributions are (i) The new Q-ODE: a model-free characterization of the Hamilton-Jacobi-Bellman equation. (ii) A formulation of Convex Q-learning that avoids approximations appearing in prior work. The Bellman error used in the algorithm is defined by filtered measurements, which is necessary in the presence of measurement noise. (iii) Convex Q-learning with linear function approximation is a convex program. It is shown that the constraint region is bounded, subject to an exploration condition on the training input. (iv) The theory is illustrated in application to resource allocation for distributed energy resources, for which the theory is ideally suited.
|
|
14:50-15:10, Paper WeB15.5 | Add to My Program |
Adaptive Output Regulation of MIMO LTI Systems with Unmodeled Input Dynamics |
|
Borisov, Oleg | ITMO University |
Isidori, Alberto | Universita Di Roma |
Pyrkin, Anton | ITMO University, Hangzhou Dianzi University |
Keywords: Adaptive control, Output regulation, Uncertain systems
Abstract: This paper addresses the problem of adaptive output regulation of a minimum-phase MIMO LTI system in the presence of un-modeled (fast) input dynamics. The adoption of a post-processing tunable internal model makes it possible to implement standard methods for the analysis of two-time-scale systems. The proposed adaptation law guarantees, under suitable hypotheses, convergence to zero of the regulation error as well as of the parameter estimation error.
|
|
15:10-15:30, Paper WeB15.6 | Add to My Program |
Human Multi-Agent Interaction for Optical Manipulation of Micro-Objects |
|
Khan, Gulam Dastagir | Sultan Qaboos University |
Ta, Quang Minh | Nanyang Technological University |
Cheah, Chien Chern | Nanyang Tech. Univ |
Keywords: Adaptive control, Robotics, Agents-based systems
Abstract: Automated optical manipulation systems are faster and more precise than humans, but less adaptable to uncertainty. A collaborative approach between humans and automated agents can leverage both human reasoning skills and the precision of automated systems. This paper describes a human multi-agent interaction approach for dealing with unexpected events during optical manipulation. The proposed multimode control system improves the flexibility and dependability of existing systems, allowing for stable interaction between humans and machines. Experimental validation confirms the effectiveness of the proposed method.
|
|
WeB16 Regular Session, Peony Junior 4512 |
Add to My Program |
Energy Systems I |
|
|
Chair: Gießler, Armin | Karlsruhe Institute of Technology |
Co-Chair: Jané Soneira, Pol | Karlsruher Institute Für Technology |
|
13:30-13:50, Paper WeB16.1 | Add to My Program |
Passivity-Based Economic Ports for Optimal Operation of Networked DC Microgrids |
|
Jané Soneira, Pol | Karlsruher Institute Für Technology |
Malan, Albertus J. | Karlsruhe Institute of Technology |
Prodan, Ionela | Grenoble Institute of Technology (Grenoble INP) - Esisar |
Hohmann, Soeren | KIT |
Keywords: Energy systems, Optimal control, Distributed control
Abstract: In this paper, we introduce the novel concept of economic ports, allowing modular and distributed optimal operation of networked microgrids. Firstly, we design a novel price-based controller for optimal operation of a single microgrid and show asymptotic stability. Secondly, we define novel physical and economic interconnection ports for the microgrid and study the dissipativity properties of these ports. Lastly, we propose an interconnection scheme for microgrids via the economic ports. This interconnection scheme requires only an exchange of the local prices and allows a globally economic optimal operation of networked microgrids at steady state, while guaranteeing asymptotic stability of the networked microgrids via the passivity properties of economic ports. The methods are demonstrated through various academic examples.
|
|
13:50-14:10, Paper WeB16.2 | Add to My Program |
Economic Dispatch for DC Microgrids: An Optimal Power Sharing Approach with Batteries |
|
Gießler, Armin | Karlsruhe Institute of Technology |
Jané Soneira, Pol | Karlsruher Institute Für Technology |
Malan, Albertus J. | Karlsruhe Institute of Technology |
Hohmann, Soeren | KIT |
Keywords: Energy systems, Power generation, Optimal control
Abstract: In this paper, we propose a hierarchical control structure comprising three layers which is able to (i) achieve economic dispatch for islanded DC microgrids, (ii) compensate load and generation disturbances with batteries performing power sharing and (iii) stabilize non-passive constant power loads. The batteries are charged economically optimally by the third layer controller such the state of charges (SOCs) remain constant. The proportional power sharing of the batteries is achieved by employing a novel control law which solves the linearized steady-state power flow equations in real time. The microgrid is stabilized by using voltage controllers for batteries and active damping elements. A numeric method to verify closed-loop asymptotic stability is derived. The power sharing of the batteries and stability achieved with the proposed control is demonstrated in a simulation.
|
|
14:10-14:30, Paper WeB16.3 | Add to My Program |
On the Optimality of Procrastination Policy for EV Charging under Net Energy Metering |
|
Jeon, Minjae | Cornell University |
Tong, Lang | Cornell University |
Zhao, Qing | Cornell University |
Keywords: Energy systems, Stochastic optimal control, Smart grid
Abstract: We consider the problem of behind-the-meter EV charging by a prosumer, co-optimized with rooftop solar, electric battery, and flexible consumptions such as water heaters and HVAC. Under the time-of-use net energy metering tariff with the stochastic solar production and random EV charging demand, a finite-horizon surplus-maximization problem is formulated. We show that a procrastination threshold policy that delays EV charging to the last possible moment is optimal when EV charging is co-optimized with flexible demand, and the policy thresholds can be computed easily offline. When battery storage is part of the co-optimization, it is shown that the net consumption of the prosumer is a two-threshold piecewise linear function of the behind-the-meter renewable generation under the optimal policy, and the procrastination threshold policy remains optimal, although the thresholds cannot be computed easily. We propose a simple myopic solution and demonstrate in simulations that the performance gap between the myopic policy and an oracle upper bound appears to be 0.5-7.5%.
|
|
14:30-14:50, Paper WeB16.4 | Add to My Program |
Online Learning of Effective Turbine Wind Speed in Wind Farms |
|
Henry, Aoife | University of Colorado Boulder |
Sinner, Michael | National Renewable Energy Laboratory |
King, Jennifer | National Renewable Energy Laboratory |
Pao, Lucy Y. | University of Colorado Boulder |
Keywords: Energy systems, Statistical learning, Identification for control
Abstract: To develop better wind farm controllers that can meet more complex objectives, methods of modeling the wind turbine wakes at low computational expense are needed. Gaussian Process (GP) regression offers a computationally inexpensive framework for learning complex functions from noisy measurements with very few datapoints. In this work, an online learning approach is presented to learn the rotor-averaged wind velocity at downstream wind turbines with GPs, using the available datastream of wind field measurements and wind turbine control set-points. This framework can readily be integrated into model-based controls methods because the model a) is updated online at low computational expense, b) assumes a mathematically favorable Gaussian form, and c) explicitly quantifies the stochastic nature of the wake field so that the trade-off between exploration and exploitation, and the uncertainty in the prediction, can be utilized. We show that a GP-learned model can match true values with errors within 0.5 % on average, with as few as 5 training data points.
|
|
14:50-15:10, Paper WeB16.5 | Add to My Program |
Emission-Constrained Optimization of Gas Networks: Input-Convex Neural Network Approach |
|
Dvorkin, Vladimir | Massachusetts Institute of Technology |
Chevalier, Samuel | MIT |
Chatzivasileiadis, Spyros | Technical University of Denmark |
Keywords: Energy systems, Smart grid, Neural networks
Abstract: Planning optimization of gas networks under emission constraints prioritizes gas supply with the smallest emission footprint. As this problem includes complex gas flow physical laws, standard optimization solvers cannot guarantee convergence to a feasible solution, especially under strict emission constraints. To address this issue, we develop an input-convex neural network (ICNN) aided optimization routine which incorporates a set of trained ICNNs approximating the gas flow equations with high precision. Numerical tests on the Belgium gas network demonstrate that the ICNN-aided optimization dominates non-convex and relaxation-based solvers, with larger optimality gains pertaining to stricter emission targets.
|
|
15:10-15:30, Paper WeB16.6 | Add to My Program |
Pricing Uncertainty in Stochastic Multi-Stage Electricity Markets |
|
Werner, Lucien | California Institute of Technology |
Christianson, Nicolas | California Institute of Technology |
Zocca, Alessandro | VU Amsterdam |
Wierman, Adam | California Institute of Technology |
Low, Steven | California Institute of Technology |
Keywords: Power systems, Smart grid, Stochastic systems
Abstract: This work proposes a pricing mechanism for multi-stage electricity markets that does not explicitly depend on the choice of dispatch procedure or optimization method. Our approach is applicable to a wide range of methodologies for the economic dispatch of power systems under uncertainty, including multi-interval dispatch, multi-settlement markets, scenario-based dispatch, and chance-constrained dispatch policies. We prove that our pricing scheme provides both ex-ante and ex-post dispatch-following incentives by simultaneously supporting per-stage and ex-post competitive equilibria. In numerical experiments on a ramp-constrained test system, we demonstrate the benefits of scheduling under uncertainty and show how our price decomposes into components corresponding to energy, intertemporal coupling, and uncertainty.
|
|
WeB17 Regular Session, Peony Junior 4511 |
Add to My Program |
Data-Driven Control II |
|
|
Chair: Ferramosca, Antonio | Univeristy of Bergamo |
Co-Chair: Liu, Tao | The University of Hong Kong |
|
13:30-13:50, Paper WeB17.1 | Add to My Program |
Data-Driven Control of Positive Linear Systems Using Linear Programming |
|
Miller, Jared | ETH Zurich |
Dai, Tianyu | Northeastern University |
Sznaier, Mario | Northeastern University |
Shafai, Bahram | Northeastern Univ |
Keywords: Data driven control, Linear systems, Uncertain systems
Abstract: This paper presents a linear-programming based algorithm to perform data driven stabilizing control of linear positive systems. A set of state-input-transition observations is collected up to magnitude-bounded noise. A state feedback controller and dual linear copositive Lyapunov function are created such that the set of all data-consistent plants is contained within the set of all stabilized systems. This containment is certified through the use of the Extended Farkas Lemma and solved via Linear Programming. Sign patterns and sparsity structure for the controller may be imposed using linear constraints. The complexity of this algorithm scales in a polynomial manner with the number of states and inputs. Effectiveness is demonstrated on example systems.
|
|
13:50-14:10, Paper WeB17.2 | Add to My Program |
Data-Driven Finite-Time Control for Discrete-Time Linear Time-Invariant Systems |
|
Li, Jinjiang | The University of Hong Kong; HKU Shenzhen Institute of Research |
Liu, Tao | The University of Hong Kong |
Liu, Tengfei | Northeastern University |
Keywords: Data driven control, Linear systems
Abstract: This paper studies the data-driven finite-time control (FTC) problem of unknown discrete-time linear time-invariant (LTI) systems with unknown and bounded noise. The proposed FTC aims to guarantee that the state of such a system does not exceed a given bound over a finite time interval under bounded initial conditions. Data-dependent representations are built for the unknown system without and with noise from pre-collected input/state data, based on which a finite-time controller is designed. Sufficient conditions of finite-time stability/boundness of the closed-loop system without/with noise are derived. Compared with model-based FTC methods that strongly depend on some accurate system models, the proposed method is model-free and only relies on pre-collected data. Numerical simulations are performed to illustrate the effectiveness of the proposed scheme.
|
|
14:10-14:30, Paper WeB17.3 | Add to My Program |
Data-Driven Input-To-State Stabilization with Respect to Measurement Errors |
|
Chen, Hailong | University of Groningen |
Bisoffi, Andrea | Politecnico Di Milano |
De Persis, Claudio | University of Groningen |
Keywords: Data driven control, Lyapunov methods, Uncertain systems
Abstract: We consider noisy input/state data collected from an experiment on a polynomial input-affine nonlinear system. Motivated by event-triggered control, we provide data-based conditions for input-to-state stability with respect to measurement errors. Such conditions, which take into account all dynamics consistent with data, lead to the design of a feedback controller, an ISS Lyapunov function, and comparison functions ensuring ISS with respect to measurement errors. When solved alternately for two subsets of the decision variables, these conditions become a convex sum-of-squares program. Feasibility of the program is illustrated in a numerical example.
|
|
14:30-14:50, Paper WeB17.4 | Add to My Program |
Distributed Event-Triggered Consensus Control from Noisy Data Using Matrix Polytopes |
|
Li, Yifei | Beijing Institute of Technology |
Liu, Wenjie | Beijing Institute of Technology, Beijing, China |
Sun, Jian | Beijing Institute of Technology |
Wang, Gang | Beijing Institute of Technology |
Chen, Jie | Beijing Institute of Technology |
Keywords: Data driven control, Networked control systems, Network analysis and control
Abstract: This paper presents a novel data-driven polytopic approach to event-triggered consensus control of unknown leader-following multi-agent systems (MASs). A distributed data-driven event-triggered consensus control protocol is proposed that utilizes noisy input-state data to enable all followers to track the leader while reducing communication and computational burden. Unlike previous research that relies on quadratic matrix inequalities to characterize system uncertainties, this paper devises a data-based polytopic representation for MASs, which enables addressing the consensus control problem without using explicit system matrices. Based on this representation, a data-based criterion is established, utilizing matrix polytopes to ensure the asymptotic stability of the closed-loop MAS. Moreover, a co-design method is presented for the distributed controller gain and the triggering matrix, using only data and expressed in terms of linear matrix inequalities. Finally, numerical simulations are conducted to demonstrate the validity and effectiveness of the proposed data-driven approach.
|
|
14:50-15:10, Paper WeB17.5 | Add to My Program |
Data-Driven Control of Nonlinear Systems from Input-Output Data |
|
Dai, Xiaoyan | University of Groningen |
De Persis, Claudio | University of Groningen |
Monshizadeh, Nima | University of Groningen |
Tesi, Pietro | University of Florence |
Keywords: Data driven control, Nonlinear systems
Abstract: The design of controllers from data for nonlinear systems is a challenging problem. In a recent paper, De Persis, Rotulo and Tesi, Learning controllers from data via approximate nonlinearity cancellation, IEEE Transactions on Automatic Control, 2023, a method to learn controllers that make the closed-loop system stable and dominantly linear was proposed. The approach leads to a simple solution based on data-dependent semidefinite programs. The method uses input-state measurements as data, while in a realistic setup it is more likely that only input-output measurements are available. In this note we report how the design principle of the above mentioned paper can be adjusted to deal with input-output data and obtain dynamic output feedback controllers in a favourable setting.
|
|
15:10-15:30, Paper WeB17.6 | Add to My Program |
CHoKI-Based MPC for Blood Glucose Regulation in Artificial Pancreas with Probabilistic Constraints |
|
Sonzogni, Beatrice | University of Bergamo |
Manzano, Jose Maria | Universidad Loyola Andalucía |
Polver, Marco | Università Degli Studi Di Bergamo |
Previdi, Fabio | Università Degli Studi Di Bergamo |
Ferramosca, Antonio | Univeristy of Bergamo |
Keywords: Data driven control, Predictive control for nonlinear systems, Healthcare and medical systems
Abstract: This work presents a Model Predictive Control (MPC) algorithm for the Artificial Pancreas. In this work, we assume that an a-priori model is unknown and the Componentwise Hölder Kinky Inference (CHoKI) data-based learning method is used to make glucose predictions. A stochastic formulation of the MPC with chance constraints is considered to have a less conservative controller. The data collection and the testing of the proposed controller are performed by exploiting the virtual patients of the FDA-accepted UVA/Padova simulator. The simulation results are quite satisfying since the time in hypoglycemia is reduced.
|
|
WeB18 Regular Session, Peony Junior 4412 |
Add to My Program |
Nonlinear Systems II |
|
|
Chair: Dochain, Denis | Univ. Catholique De Louvain |
Co-Chair: Smeraldo, Simone | Politecnico Di Milano |
|
13:30-13:50, Paper WeB18.1 | Add to My Program |
Capturing Persistence of High-Dimensional Delayed Complex Balanced Chemical Reaction Systems Via Decomposition of Semilocking Sets |
|
Zhang, Xiaoyu | Zhejiang University |
Gao, Chuanhou | Zhejiang University |
Dochain, Denis | Univ. Catholique De Louvain |
Keywords: Nonlinear systems
Abstract: With the increasing complexity of time-delayed systems, the diversification of boundary types of chemical reaction systems poses a challenge for persistence analysis. This paper focuses on delayed complex balanced mass-action systems (DeCBMAS) and it derives that some boundaries of a DeCBMAS cannot contain an omega-limit point of some trajectory with positive initial conditions by using the method of semilocking set decomposition and the property of the facet, further expanding the range of persistence of DeCBMASs. These findings demonstrate the effectiveness of semilocking set decomposition to address the complex boundaries and offer insights into the persistence analysis.
|
|
13:50-14:10, Paper WeB18.2 | Add to My Program |
A Collaborative Multi-Agent Nonlinear System Identification Algorithm with Spectral Regularization |
|
Smeraldo, Simone | Politecnico Di Milano |
Bianchi, Federico | Ricerca Sul Settore Energetico, RSE SpA |
Busboom, Axel | Munich University of Applied Sciences |
Prandini, Maria | Politecnico Di Milano |
Keywords: Nonlinear systems, Nonlinear systems identification
Abstract: We address nonlinear system identification in a multi-agent setting, where each agent collects input-output data from a different instance of the same process, possibly in a different operating condition. In particular, we introduce a novel scheme for nonlinear auto-regressive with exogenous input (NARX) model identification where agents make a tentative estimate of their individual parameters, based on local information only, while a cloud-based application further manipulates these estimates so as to disclose the common model structure and adjust the values of the individual parameters around some reference parameter vector. The proposed scheme is inspired by the spectral regularization framework recently introduced in multi-task feature learning and is shown to be competitive against state-of-the art cloud-based algorithms addressing the same problem but under more restrictive assumptions.
|
|
14:10-14:30, Paper WeB18.3 | Add to My Program |
Coherence-Based Input Design for Nonlinear Systems |
|
Parsa, Javad | KTH Royal Inst. of Tech |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Hjalmarsson, Hċkan | KTH Royal Inst. of Tech |
Keywords: Nonlinear systems identification, Estimation, Optimization
Abstract: Many off-the-shelf generic non-linear model structures have inherent sparse parametrizations. Volterra series and non-linear Auto-Regressive with eXogeneous inputs (NARX) models are examples of this. It is well known that sparse estimation requires low mutual coherence, which translates into input sequences with certain low correlation properties. This paper highlights that standard optimal input design methods do not account for this requirement which may lead to designs unsuitable for this type of model structure. To tackle this problem, the paper proposes incorporating a coherence constraint to standard input design problems. The coherence constraint is defined as the ratio between the diagonal and non-diagonal entries of the Fisher information matrix (FIM) and can be easily added to any input design problem for nonlinear systems, while the resulting problem remains convex. The paper provides a theoretical analysis of how the range of the optimal objective function of the original problem is affected by the coherence constraint. Additionally, the paper presents numerical evaluations of the proposed approachs performance on a Volterra series model in comparison to state-of-the-art algorithms.
|
|
14:30-14:50, Paper WeB18.4 | Add to My Program |
An Optimal Solution to Infinite Horizon Nonlinear Control Problems |
|
Gul Mohamed, Mohamed Naveed | Texas A&M University |
Goyal, Raman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Nonlinear systems, Optimal control, Lyapunov methods
Abstract: In this paper, we consider the infinite horizon optimal control problem for nonlinear systems. Under the conditions of controllability of the linearized system around the origin, and nonlinear controllability of the system to a terminal set containing the origin, we establish an approximate regularized solution approach consisting of a ``finite free final time" optimal transfer problem to the terminal set, and an infinite horizon linear regulation problem within the terminal set, that is shown to render the origin globally asymptotically stable. Further, we show that the approximations converge to the true optimal cost function as the size of the terminal set decreases to zero. The approach is empirically evaluated on the pendulum and cart-pole swing-up problems to show that the finite time transfer is far shorter than the effective horizon required to solve the infinite horizon problem without the proposed regularization.
|
|
14:50-15:10, Paper WeB18.5 | Add to My Program |
A Simple Bounded Controller for the FiniteTime Stabilization of the Heisenberg System |
|
Mera, Manuel | Esime Upt Ipn |
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Keywords: Nonlinear systems, Nonholonomic systems, Variable-structure/sliding-mode control
Abstract: Abstract--- In this paper a novel controller design is proposed to stabilize the Heisenberg system in a finite time. Although the controller design is based on the unit vector control for the SlidingMode Control theory, no sliding manifold design is required. Instead, some inherent properties of the Heisenberg system, e.g., the skew symmetric/diagonal structure, are exploited to obtain a simple to tune and bounded controller that ensures the finitetime stabilization of the origin for any arbitrary initial condition outside the origin. Additionally, the resulting controller is globally bounded and, contrasting with other similar approaches for the Heisenberg system, this design allows the estimation of the settlingtime function.
|
|
15:10-15:30, Paper WeB18.6 | Add to My Program |
A Generalized-Moment Method for Control-Affine Ensemble Systems |
|
Kuan, Yuan-Hung | Washington University in St. Louis |
Ning, Nancy | Washington University in St.Louis |
Li, Jr-Shin | Washington University in St. Louis |
Keywords: Nonlinear systems, Large-scale systems, Emerging control applications
Abstract: Controlling dynamic ensemble systems is an essential yet challenging step to enable diverse applications in science and engineering. In this paper, we present a generalized moment method that gives rise to a moment representation of the control-affine ensemble system. The induced moment system is equipped with a banded structure that is beneficial to conducting systems-theoretic analysis and control design for ensemble systems. In addition, we introduce a Lie algebraic technique for exact bilinearization of nonlinear ensemble systems. This transformation provides a unified paradigm for studying highly intricate nonlinear ensemble systems through the associated bilinear moment systems. To demonstrate the applicability of the proposed method, we present numerical examples involving the control of nonlinear ensemble systems.
|
|
WeB19 Regular Session, Peony Junior 4411 |
Add to My Program |
Linear Systems II |
|
|
Chair: Besselink, Bart | University of Groningen |
Co-Chair: Grussler, Christian | Technion - Israel Institute of Technology |
|
13:30-13:50, Paper WeB19.1 | Add to My Program |
Reachable Set Estimation for Discrete-Time Periodic Piecewise Systems |
|
Liu, Yun | East China University of Science and Technolog |
Yang, Wen | East China University of Science and Technology |
Yang, Chao | East China University of Science and Technology |
Zhao, Zhiyun | East China University of Science and Technology |
Keywords: Linear systems, Hybrid systems, Optimization
Abstract: This paper investigates the reachable set estimation problem for discrete-time periodic piecewise systems subject to bounded-peak disturbances for the first time. Based on the periodic linear-interpolative formulation, the discrete time-scheduling Lyapunov functions with continuous or jumping modes at the switching instant are constructed to develop criteria of reachable set estimation that can ensure the asymptotic stability and reachability of the investigated system. Moreover, an index optimizing the bounding region of the desirable reachable set is given via resorting to the ellipsoid technique, and their results are compared. Finally, numerical examples are given to validate the effectiveness of the proposed results.
|
|
13:50-14:10, Paper WeB19.2 | Add to My Program |
Control of Linear Systems with Guarantee of Outputs in Given Sets at Any Time |
|
Nguyen, Ba Huy | ITMO University |
Furtat, Igor | Institute of Problems of Mechanical Engineering Russian Academy |
Vrazhevsky, Sergey | ITMO University |
Keywords: Linear systems, Nonlinear output feedback, LMIs
Abstract: A new method for designing the control law for linear plants with a guarantee of finding outputs in given sets under conditions of unknown bounded disturbances is proposed. The problem is solved in two stages. In the first stage, a coordinate transformation is used to reduce the original constrained problem to the problem of studying the input-to-state stability of a new extended system without constraints. In the second stage, the control law for the transformed system is designed, where the adjustable parameters are selected from the solution of linear matrix inequalities (LMI). The simulations, which are performed in MATLAB, show the methods efficiency and confirm the theoretical results.
|
|
14:10-14:30, Paper WeB19.3 | Add to My Program |
Relaxation Systems and Cyclic Monotonicity |
|
Chaffey, Thomas Lawrence | University of Cambridge |
van Waarde, Henk J. | University of Groningen |
Sepulchre, Rodolphe | University of Cambridge |
Keywords: Linear systems, Variational methods, Optimization
Abstract: It is shown that an LTI system is a relaxation system if and only if its Hankel operator is cyclic monotone. Cyclic monotonicity of the Hankel operator implies the existence of a storage function whose gradient is the Hankel operator. This storage is a function of past inputs alone, is independent of the state space realization, and admits a generalization to nonlinear circuit elements.
|
|
14:30-14:50, Paper WeB19.4 | Add to My Program |
Characterizing Compositionality of LQR from the Categorical Perspective |
|
She, Baike | University of Florida |
Hanks, Tyler | University of Florida |
Fairbanks, James | University of Florida |
Hale, Matthew | University of Florida |
Keywords: Linear systems, Optimal control
Abstract: Composing systems is a fundamental concept in modern control systems, yet it remains challenging to formally analyze how controllers designed for individual subsystems can differ from controllers designed for the composition of those subsystems. To address this challenge, we propose a novel approach to composing control systems based on resource sharing machines, a concept from applied category theory. We use resource sharing machines to investigate the differences between (i) the linear-quadratic regulator (LQR) designed directly for a composite system and (ii) the LQR that is attained through the composition of LQRs designed for each subsystem. We first establish novel formalisms to compose LQR control designs using resource sharing machines. Then we develop new sufficient conditions to guarantee that the LQR designed for a composite system is equal to the LQR attained through composition of LQRs for its subsystems. In addition, we reduce the developed condition to that of checking the controllability and observability of a certain linear, time-invariant system, which provides a simple, computationally efficient procedure for evaluating the equivalence of controllers for composed systems.
|
|
14:50-15:10, Paper WeB19.5 | Add to My Program |
On the Monotonicity of Frequency Response Gains |
|
Grussler, Christian | Technion - Israel Institute of Technology |
B. Burghi, Thiago | University of Cambridge |
Keywords: Compartmental and Positive systems, Linear systems, PID control
Abstract: Linear time-invariant single-input-single-output systems with nonnegative impulse responses, commonly called externally positive systems, carry well-known monotonicity properties such as: (i) the static gain equals the H_infty-norm (peak of the Bode magnitude diagram), (ii) monotone inputs are mapped to monotone outputs, (iii) the transfer function is totally monotone on the positive reals. In this paper, we complement these properties by proving monotonicity properties of the frequency response gain with the help of variation diminishing theory. While our results give new insights into proving monotonicity properties of the gains of positive systems, they are not limited to such systems, and extend to systems that preserve the periodic monotonicity of their inputs. In particular, our results also provide an interesting sufficient condition for positive dominance.
|
|
15:10-15:30, Paper WeB19.6 | Add to My Program |
Specification Verification and Controller Synthesis Using (gamma, Delta)-Similarity (I) |
|
Pirastehzad, Armin | University of Groningen |
van der Schaft, Arjan | Univ. of Groningen |
Besselink, Bart | University of Groningen |
Keywords: Linear systems, LMIs, Formal Verification/Synthesis
Abstract: We address the problems of specification verification and controller synthesis in the context of (gamma,delta)-similarity, a notion of approximate system comparison that measures to what extent the external behaviors of two potentially non-deterministic systems are similar in an L2 sense. Expressing specifications in terms of input-output trajectories of a dynamical system, we use (gamma,delta)-similarity to verify whether the external behavior of a system satisfies such specifications in an approximate sense. We characterize this problem as a linear matrix inequality feasibility problem. In case a control system fails to satisfy specifications with a desired accuracy, we synthesize a dynamic controller that enforces specification satisfaction. We characterize the synthesis problem in terms of a bilinear matrix inequality feasibility problem. Aware of the computational costs for solving such problem, we obtain a sufficient condition for the existence of the controller that can be expressed in terms of a linear matrix inequality. Based on this, we propose an algorithm to construct the controller.
|
|
WeB20 Regular Session, Orchid Junior 4312 |
Add to My Program |
Biological Systems II |
|
|
Chair: Borri, Alessandro | CNR-IASI |
Co-Chair: Niazi, Muhammad Umar B. | Massachusetts Institute of Technology |
|
13:30-13:50, Paper WeB20.1 | Add to My Program |
Undetectable Attacks on Boolean Networks (I) |
|
Zhu, Shiyong | Southeast University |
Lin, Lin | The University of Hong Kong |
Lam, James | The University of Hong Kong |
Ng, Michael | Hong Kong Baptist University |
Lu, Jianquan | Southeast University |
Azuma, Shun-ichi | Kyoto University |
Cao, Jinde | Southeast University |
Keywords: Biological systems, Network analysis and control, Attack Detection
Abstract: In this paper, driven by the ever-increasing cyber-security threats, we study the undetectable attack problems for Boolean networks (BNs), which model distributed systems with a limited capacity of storage and bandwidth of communication. Given a consistent monitor, undetectable attacks are formalized for BNs as those do not yield an output sequence out of the nominal output sequence set. By the graph-theoretic approach, undetectable attacks are characterized by a reachability problem of a directed cycle in the augmented transition graph. On the other hand, the algebraic approach also derives a necessary and sufficient criterion for undetectable attacks by testing the existence of the nonzero elements in the constructed matrix. While all these derived results are only computationally efficient for relatively small-size BNs. The detection of attack signals is indeed NP-hard. In other words, there is no polynomial-time algorithm to check the detectability of an attack signal or an attack node set unless NP=P.
|
|
13:50-14:10, Paper WeB20.2 | Add to My Program |
Optimal Safety-Critical Control of Epidemics |
|
Butler, Brooks A. | Purdue University |
Pare, Philip E. | Purdue University |
Keywords: Biological systems, Emerging control applications, Optimal control
Abstract: We present a generalized model for epidemic processes that partitions control into changes in linear and non-linear flow rates between compartments, respectively. We then define an optimal control problem that minimizes the weighted cost of rate control on the generalized model while maintaining conditions that guarantee system safety at any time using control barrier functions. Using this formulation, we prove that under homogeneous penalties the optimal controller will always favor increasing the linear flow out of an infectious process over reducing nonlinear flow in. Further, in the case of heterogeneous penalties, we provide necessary and sufficient conditions under which the optimal controller will set control of non-linear rates (i.e., the reduction of flow rate into the infection process) to zero. We then illustrate these results through the simulation of a bi-virus SEIQRS model.
|
|
14:10-14:30, Paper WeB20.3 | Add to My Program |
The Long-Term Effects of Physical Activity on Blood Glucose Regulation: A Model to Unravel Diabetes Progression |
|
De Paola, Pierluigi Francesco | Consiglio Nazionale Delle Ricerche (CNR) |
Paglialonga, Alessia | Consiglio Nazionale Delle Ricerche (CNR) |
Palumbo, Pasquale | University of Milano-Bicocca |
Keshavjee, Karim | University of Toronto |
Dabbene, Fabrizio | CNR-IEIIT |
Borri, Alessandro | CNR-IASI |
Keywords: Modeling, Biomedical, Metabolic systems
Abstract: Physical activity plays a key role in the prevention of type 2 diabetes. However, despite the numerous clinical evidences, there are still no mathematical models that satisfactorily describe the effects of physical activity on the progression of diabetes, preventing its onset or slowing down its course. Instead, there are models describing the influence of single training sessions of physical activity on blood glucose and insulin levels in the short term. In this article we propose a novel model for the long term effects of physical activity on diabetes progression, by exploiting and adapting an existing short-term model of physical activity. A pivotal role in the proposed model is played by interleukin-6 released during physical activity and known to be fundamental in maintaining pancreatic beta cells production and therefore satisfactory insulin secretion. The proposed simulation scenarios show how a modeling approach of physical activity that neglects the interleukin-6 action is not sufficient to capture the cumulative effects of physical exercise on disease progression. Indeed, preliminary results pave the way to natural extensions of the model to account for model-based control techniques for the long-term control of diabetes through personalized lifestyle interventions, properly accounting for the effects of physical activity on the long-term dynamics of blood glucose.
|
|
14:30-14:50, Paper WeB20.4 | Add to My Program |
A Probabilistic Finite-State Automata Framework for Monitoring Long-Term Activities of Daily Living |
|
Li, Mianjun | National University of Singapore |
Chen, Peter C. Y. | National University of Singapore |
Keywords: Healthcare and medical systems, Modeling, Automata
Abstract: Aging is an increasingly challenging healthcare issue with long-term sociopolitical implications, requiring more sophisticated management of healthcare services for the elderly. A key element in managing such services is the knowledge about a typical elderly persons Activities of Daily Living (ADLs). However, there is little study on the model-based objective and standardized ADLs assessment. This paper presents the concept of tracking the health status of the elderly by monitoring their individual patterns of ADLs: the specific pattern of the elderly model as probabilistic state-transition structures, and the transition probabilities in such a probabilistic model consider as a representation of the health status of a typical older person. The typical pattern of ADLs changes with the health status due to changes in the residential environment, progressive aging, or the onset of certain diseases. Such changes are reflected (over time) in the model, and therefore monitoring the transition probabilities to assess ADLs represents a possible way to track the health status of an older person and to alert relevant health service professionals if some change that warrants subsequent careful medical attention.
|
|
14:50-15:10, Paper WeB20.5 | Add to My Program |
Observer Design for Nonlinear Systems without Parameterizing Nonlinearities: Application to Networked SIR Model (I) |
|
Niazi, M. Umar B. | MIT |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Observers for nonlinear systems, Nonlinear systems, Biological systems
Abstract: Traditional observer design methods rely on certain properties of the system's nonlinearity, such as Lipschitz continuity, one-sided Lipschitzness, a bounded Jacobian, or quadratic boundedness. These properties are described by parameterized inequalities. However, enforcing these inequalities globally can lead to very large parameters, resulting in overly conservative observer design criteria. These criteria become infeasible for highly nonlinear applications, such as networked epidemic processes. In this paper, we present an observer design approach for estimating the state of nonlinear systems, without requiring any parameterization of the system's nonlinearities. The proposed observer design depends only on systems' matrices and applies to systems with any nonlinearity. We establish different design criteria for ensuring both asymptotic and exponential convergence of the estimation error to zero. To demonstrate the efficacy of our approach, we employ it for estimating the state of a networked SIR epidemic model. We show that, even in the presence of measurement noise, the observer can accurately estimate the epidemic state of each node in the network. To the best of our knowledge, the proposed observer is the first that is capable of estimating the state of networked SIR models.
|
|
15:10-15:30, Paper WeB20.6 | Add to My Program |
Data-Driven Forward Stochastic Reachability Analysis for Human-In-The-Loop Systems |
|
Choi, Joonwon | Purdue University |
Byeon, Sooyung | Purdue University |
Hwang, Inseok | Purdue University |
Keywords: Human-in-the-loop control, Linear systems, Data driven control
Abstract: We propose a data-driven forward stochastic reachability analysis algorithm for Human-In-The-Loop (HITL) systems. We focus on a certain type of HITL system whose behavior is dominated by a human operator, for example, a multi-rotor controlled by a human operator. In such a system, the intervention of the human operator may generate a conservative reachable set due to the unpredictable control strategy of the human operator. The proposed algorithm computes a less conservative reachable set of the HITL system by accounting for the human operators behavior, i.e., we present the data-driven reachability analysis algorithm that considers the unknown controller information of the HITL system. The behavior of the human operator is trained as a Gaussian Mixture Model (GMM) from the state and input trajectories of the system. Then, the conditional probability distribution of the human operators behavior is obtained from the Gaussian Mixture Regression (GMR) for the closed-loop reachability analysis. The proposed algorithm is tested and demonstrated by the data collected from human subject experiments.
|
|
WeB21 Regular Session, Orchid Junior 4311 |
Add to My Program |
Constrained Control II |
|
|
Chair: Khorrami, Farshad | NYU Tandon School of Engineering |
Co-Chair: Tzes, Anthony | New York University Abu Dhabi |
|
13:30-13:50, Paper WeB21.1 | Add to My Program |
An Interval Predictor--Based Robust Control for a Class of Constrained Nonlinear Systems |
|
Gutiérrez, Ariana | Tecnológico Nacional De México/I.T. De La Laguna |
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Mera, Manuel | Esime Upt Ipn |
Efimov, Denis | Inria |
Ushirobira, Rosane | Inria |
Keywords: Constrained control, Nonlinear systems, Predictive control for nonlinear systems
Abstract: This paper proposes the design of a robust sampledtime controller to stabilize continuoustime nonlinear systems, taking into account state and input constraints. The proposed controller comprises the design of a robust control law, which is based on an interval predictorbased statefeedback controller and a Model Predictive Control (MPC) approach, which deals with the state and input constraints. The interval predictorbased statefeedback controller is designed based on a Lyapunov function approach that provides a safe set, where the state constraints are not transgressed. Out this set, the MPC is activated guaranteeing the fulfillment of the state and input constraints. The proposed switched control strategy guarantees the practical Uniform Asymptotic Stability of the considered nonlinear systems. A constructive method, based on linear matrix inequalities (LMIs), is proposed to compute the controller gains and the state of the system is not required. Some simulation results illustrate the feasibility of the proposed scheme.
|
|
13:50-14:10, Paper WeB21.2 | Add to My Program |
On Enlarging the Domain of Attraction for Linear Systems Subject to Asymmetric Actuator Saturation |
|
Lai, Wenxin | Shanghai Jiao Tong University |
Li, Yuanlong | Shanghai Jiao Tong University |
Lin, Zongli | University of Virginia |
Keywords: Constrained control, Nonlinear systems
Abstract: In this paper, we revisit the problem of enlarging the domain of attraction for linear systems with asymmetric actuator saturation. We partition the state space into several regions according to the sign of each input and rewrite the linear system subject to asymmetric actuator saturation as an equivalent switched system, each subsystem of which is associated with one partition of the state space and is a linear system subject to symmetric actuator saturation. Based on this equivalent representation of the system, we present a Lyapunov function, which is composed of a set of quadratic functions associated with matrices that are not required to be positive definite. We establish sufficient conditions for regional stability and, based on them, formulate optimization problems to enlarge the estimate of the domain of attraction. Simulation results illustrate the effectiveness of the proposed approach.
|
|
14:10-14:30, Paper WeB21.3 | Add to My Program |
Safety-Critical Control under Multiple State and Input Constraints and Application to Fixed-Wing UAV |
|
Oh, Donggeon David | Seoul National University |
Lee, Dongjae | Seoul National University |
Kim, H. Jin | Seoul National University |
Keywords: Constrained control, Optimal control, Control applications
Abstract: This study presents a framework to guarantee safety for a class of second-order nonlinear systems under multiple state and input constraints. To facilitate real-world applications, a safety-critical controller must consider multiple constraints simultaneously, while being able to impose general forms of constraints designed for various tasks (e.g., obstacle avoidance). With this in mind, we first devise a zeroing control barrier function (ZCBF) using a newly proposed nominal evading maneuver. By designing the nominal evading maneuver to 1) be continuously differentiable, 2) satisfy input constraints, and 3) be capable of handling other state constraints, we deduce an ultimate invariant set, a subset of the safe set that can be rendered forward invariant with admissible control inputs. Thanks to the development of the ultimate invariant set, we then propose a safety-critical controller, which is a computationally tractable one-step model predictive controller (MPC) with guaranteed recursive feasibility. We validate the proposed framework in simulation, where a fixed-wing UAV tracks a circular trajectory while satisfying multiple safety constraints including collision avoidance, bounds on flight speed and flight path angle, and input constraints.
|
|
14:30-14:50, Paper WeB21.4 | Add to My Program |
PWA Control Functions: From the Projection of mpQP Solution and Back to the Convexification by Lifting |
|
Yang, Songlin | CentraleSupele, Paris Saclay University |
Olaru, Sorin | CentraleSupélec |
Rodriguez-Ayerbe, Pedro | CentraleSupelec |
Keywords: Constrained control, Optimal control, Model/Controller reduction
Abstract: This paper focuses on the geometric properties of the Piece-Wise Affine (PWA) feedback function as they appear from the optimal solution of the multi-parameter quadratic programming (mpQP) problem. Such optimization problems are popular formulations, for example, in the design of model- based predictive controllers (MPC) for discrete linear systems subject to input and state constraints. The paper considers such a PWA function as input data and provides a method for reconstructing a feasible convex set and a PWA curve within it, which retrieves the identical structure of the solution in the original parametric feasible set. The proposed method involves establishing and decomposing the topology structure of the polyhedral critical regions, which form the domain of the PWA function by means of a graph of interconnections. The regions are split into the boundary and interior collections using convex- concave lifting. The explicit solution is then merged based on the convex-concave liftings to reconstruct the feasible domain and the PWA curves.
|
|
14:50-15:10, Paper WeB21.5 | Add to My Program |
Constraint Inference in Control Tasks from Expert Demonstrations Via Inverse Optimization |
|
Papadimitriou, Dimitris | UC Berkeley |
Li, Jingqi | University of California, Berkeley |
Keywords: Constrained control, Optimization, Estimation
Abstract: Inferring unknown constraints is a challenging and crucial problem in many robotics applications. When only expert demonstrations are available, it becomes essential to infer the unknown domain constraints to deploy additional agents effectively. In this work, we propose an approach to infer affine constraints in control tasks after observing expert demonstrations. We formulate the constraint inference problem as an inverse optimization problem, and we propose an alternating optimization scheme that infers the unknown constraints by minimizing a KKT residual objective. We demonstrate the effectiveness of our method in a number of simulations, and show that our method can infer less conservative constraints than a recent baseline method while maintaining comparable safety guarantees.
|
|
15:10-15:30, Paper WeB21.6 | Add to My Program |
Using Circulation to Mitigate Spurious Equilibria in Control Barrier Function |
|
Goncalves, Vinicius Mariano | New York University Abu Dhabi, United Arab Emirates |
Krishnamurthy, Prashanth | NYU Tandon School of Engineering |
Tzes, Anthony | New York University Abu Dhabi |
Khorrami, Farshad | NYU Tandon School of Engineering |
Keywords: Constrained control, Optimization, Nonlinear systems
Abstract: Control Barrier Functions and Quadratic Programming are increasingly used for designing controllers that consider critical safety constraints. However, like Artificial Potential Fields, they can suffer from the stable spurious equilibrium point problem, which can result in the controller failing to reach the goal. To address this issue, we propose introducing circulation inequalities as a constraint. These inequalities force the system to explicitly circulate the obstacle region in configuration space, thus avoiding undesirable equilibria. We conduct a theoretical analysis of the proposed framework and demonstrate its efficacy through simulation studies. By mitigating spurious equilibria, our approach enhances the reliability of CBF-based controllers, making them more suitable for real-world applications.
|
|
WeB22 Regular Session, Orchid Junior 4212 |
Add to My Program |
Stochastic Optimal Control II |
|
|
Chair: Tsiotras, Panagiotis | Georgia Institute of Technology |
Co-Chair: Khonji, Majid | Khalifa University of Science and Technology |
|
13:30-13:50, Paper WeB22.1 | Add to My Program |
Optimal Stopping Problems in Low-Dimensional Feature Spaces: Lossless Conditions and Approximations |
|
van Zutphen, Menno Johannes Theodorus Cornelis | Eindhoven University of Technology |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Stochastic optimal control, Markov processes, Model/Controller reduction
Abstract: Optimal control problems can be solved by dynamic programming. However, this method suffers from the curse of dimensionality. To resolve this, simplified versions of the original problem are often constructed in lower-dimensional feature spaces, leading to approximate policies. Yet, the connections between the original and the approximate policy and costs are rarely formalized. This paper addresses this challenge for optimal stopping problems. We start by providing conditions for lossless feature representations. This means that from an optimal policy obtained in feature space, an optimal policy in the original space can be constructed. Then, we search for modified versions of the original problem that (i) admit a lossless feature representation of far lower dimension; and (ii) provide upper and lower bounds on the optimal cost of the original problem. We can then use policies obtained in feature space using these modified problems to provide approximate policies for the original problem that are guaranteed to perform better than or equal to this aforementioned cost upper bound. We apply our tools in a high-dimensional precision farming intervention problem, where our tools allow for a dramatic decrease in complexity with only a small increase in the cost.
|
|
13:50-14:10, Paper WeB22.2 | Add to My Program |
A Fully Polynomial Time Approximation Scheme for Constrained MDPs under Local Transitions |
|
Khonji, Majid | Khalifa University of Science and Technology |
Keywords: Stochastic optimal control, Markov processes, Uncertain systems
Abstract: The fixed-horizon constrained Markov Decision Process (C-MDP) is a well-known model for planning in stochastic environments under operating constraints. Chance-constrained MDP (CC-MDP) is a variant that allows bounding the probability of constraint violation, which is desired in many safety-critical applications. CC-MDP can also model a class of MDPs, called Stochastic Shortest Path (SSP), under dead-ends, where there is a trade-off between the probability-to-goal and cost-to-goal. This work studies the structure of (C)C-MDP, particularly an important variant that involves local transition. In this variant, the state reachability exhibits a certain degree of locality and independence from the remaining states. More precisely, the number of states, at a given time, that share some reachable future states is always constant. (C)C-MDP under local transition is NP-Hard even for a planning horizon of two. In this work, we propose a fully polynomial-time approximation scheme for (C)C-MDP that computes (near) optimal deterministic policies. Such an algorithm is among the best approximation algorithms attainable in theory and gives insights into the approximability of constrained MDP and its variants.
|
|
14:10-14:30, Paper WeB22.3 | Add to My Program |
Covariance Steering for Systems Subject to Unknown Parameters |
|
Knaup, Jacob | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Optimal control, Uncertain systems
Abstract: This work considers the optimal covariance steering problem for systems subject to both additive noise and uncertain parameters which may enter multiplicatively with the state and the control. The unknown parameters are modeled as constant random variables sampled from distributions with known moments. The optimal covariance steering problem is formulated to include dependence between the unknown parameters and future states, and is solved using sequential convex programming. The proposed approach is demonstrated numerically using a spacecraft control application.
|
|
14:30-14:50, Paper WeB22.4 | Add to My Program |
Computationally Efficient Covariance Steering for Systems Subject to Parametric Disturbances and Chance Constraints |
|
Knaup, Jacob | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Optimal control
Abstract: This work investigates the finite-horizon optimal covariance steering problem for discrete-time linear systems subject to both additive and multiplicative uncertainties as well as state and input chance constraints. In particular, a tractable convex approximation of the optimal covariance steering problem is developed by tightening the chance constraints and by introducing a suitable change of variables. The solution of the convex approximation is shown to be a valid (albeit potentially suboptimal) solution to the original chance-constrained covariance steering problem.
|
|
14:50-15:10, Paper WeB22.5 | Add to My Program |
Discrete-Time Optimal Covariance Steering Via Semidefinite Programming |
|
Rapakoulias, George | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Optimization, Linear systems
Abstract: This paper addresses the optimal covariance steering problem for stochastic discrete-time linear systems subject to probabilistic state and control constraints. A method is presented to efficiently attain the exact solution of the problem based on a lossless convex relaxation of the original non-linear program using semidefinite programming. Both the constrained and the unconstrained versions of the problem with either equality or inequality terminal covariance boundary conditions are addressed. We first prove that the proposed relaxation is lossless for all of the above cases. Numerical examples are then provided to illustrate the proposed method. Finally, a comparative study is performed on systems of various sizes and steering horizons to illustrate the advantages of the proposed method in terms of computational resources compared to the state of the art.
|
|
15:10-15:30, Paper WeB22.6 | Add to My Program |
Safe Stochastic Model-Based Policy Iteration with Chance Constraints |
|
Zhai, Lijing | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Hugues, Jerome | Carnegie Mellon University / Software Engineering Institute |
Keywords: Stochastic optimal control, Resilient Control Systems, Learning
Abstract: In this paper, we consider optimal control problems of stochastic discrete-time systems subject to additive disturbances. Safety of such systems is guaranteed in a probabilistic sense via chance constraints. We solve the corresponding chance constrained stochastic control problems by extending the unconstrained model-based Policy Iteration (PI), and thus chance constrained PI with safety guarantees is proposed. Additionally, the stability of generated control policies is analyzed in the mean square sense. Numerical simulations are provided to validate the proposed algorithm performance.
|
|
WeB23 Regular Session, Orchid Junior 4211 |
Add to My Program |
Cyber-Physical Security II |
|
|
Chair: Li, Yuzhe | Northeastern University |
Co-Chair: Zamani, Majid | University of Colorado Boulder |
|
13:30-13:50, Paper WeB23.1 | Add to My Program |
Model-Unknown Spoofing Attack Via False Data Injections (I) |
|
Yang, Nachuan | Hong Kong University of Science and Technology |
Zhong, Yuxing | The Hong Kong University of Science and Technology |
Li, Yuzhe | Northeastern University |
Shi, Ling | Hong Kong University of Science and Technology |
Keywords: Cyber-Physical Security, Attack Detection, Computer/Network Security
Abstract: This conference paper studies the spoofing attack via false data injections, where the adversarial attacker aims at misleading a cyber-physical system by distorting its sensor data. Such type of attacks has not been explored in the existing work on false data injection attacks. Besides, the existing research usually assumes an adversary with full knowledge of the target system. In this paper, we consider the case that the attacker does not know the systems parameters. More specifically, we construct an adaptive estimator for the adversary and prove its convergence to the plants state estimate under adaptive laws. We also show that the convergence of the adaptive estimator is independent of the adversarys strategy. Based on this separation principle, we propose two false data injection methods to implement online spoofing attacks by solving online linear equations and quadratic programming, respectively, and more can be developed using our proposed adaptive scheme in future research. Finally, we provide a benchmark numerical example of an L-1011 aircraft to illustrate the attack performance.
|
|
13:50-14:10, Paper WeB23.2 | Add to My Program |
Optimal Sequential False Data Injection Attack Scheme: Finite-Time Inverse Convergence (I) |
|
Luo, Xiaoyu | Shanghai Jiao Tong University |
Fang, Chongrong | Shanghai Jiao Tong University |
Zhao, Chengcheng | Zhejiang University |
Cheng, Peng | Zhejiang University |
He, Jianping | Shanghai Jiao Tong University |
Keywords: Cyber-Physical Security, Networked control systems, Linear systems
Abstract: In this paper, we explore the relationship between the injected attack signal and the attack selection strategy in networked control systems where the adversary desires to steer the system state to the expected malicious one. We construct a sequential attack framework, i.e., the injected false data varies with the sampling time in discrete-time systems, and then derive an optimal sequential FDI attack strategy. The optimal sequential FDI attack strategy reveals the strongly coupled relationship between the injected attack signal and the attack selection strategy. Furthermore, we prove the finite-time inverse convergence of the critical parameters in the injected optimal attack signal by discrete-time Lyapunov analysis, which enables the efficient off-line design of the attack strategy and saves computing sources. Extensive simulations are conducted to show the effectiveness of the injected optimal sequential attack and the relationship between the attack signal and the attack selection strategy.
|
|
14:10-14:30, Paper WeB23.3 | Add to My Program |
Resilient State Estimation for Nonlinear Discrete-Time Systems Via Input and State Interval Observer Synthesis (I) |
|
Khajenejad, Mohammad | University of California, San Diego |
Jin, Zeyuan | Arizona State University |
Dinh, Thach N. | CNAM Paris |
Yong, Sze Zheng | Northeastern University |
Keywords: Resilient Control Systems, Observers for nonlinear systems, Attack Detection
Abstract: This paper addresses the problem of resilient state estimation and attack reconstruction for boundederror nonlinear discrete-time systems with nonlinear observations/ constraints, where both sensors and actuators can be compromised by false data injection attack signals/unknown inputs. By leveraging mixed-monotone decomposition of nonlinear functions, as well as affine parallel outer-approximation of the observation functions, along with introducing auxiliary states to cancel out the effect of the attacks/unknown inputs, our proposed observer recursively computes interval estimates that by construction, contain the true states and unknown inputs of the system. Moreover, we provide several semi-definite programs to synthesize observer gains to ensure input-to-state stability of the proposed observer and optimality of the design in the sense of minimum mathcal{H}_{infty} gain.
|
|
14:30-14:50, Paper WeB23.4 | Add to My Program |
Towards Trustworthy AI: Sandboxing AI-Based Unverified Controllers for Safe and Secure Cyber-Physical Systems (I) |
|
Zhong, Bingzhuo | Technical University of Munich |
Liu, Siyuan | KTH Royal Institute of Technology |
Caccamo, Marco | Technical University of Munich |
Zamani, Majid | University of Colorado Boulder |
Keywords: Supervisory control, Formal Verification/Synthesis, Hybrid systems
Abstract: In the past decade, artificial-intelligence-based (AI-based) techniques have been widely applied to design controllers over cyber-physical systems (CPSs) for complex control missions (e.g., motion planning in robotics). Nevertheless, AI-based controllers, particularly those developed based on deep neural networks, are typically very complex and are challenging to be formally verified. To cope with this issue, we propose a secure-by-construction architecture, namely Safe-Sec-visor architecture, to sandbox AI-based unverified controllers. By applying this architecture, the overall safety and security of CPSs can be ensured simultaneously, while formal verification over the AI-based controllers is not required. Here, we consider invariance and opacity properties as the desired safety and security properties, respectively. Accordingly, by leveraging a notion of (augmented) control barrier functions, we design a supervisor to check the control inputs provided by the AI-based controller and decide whether to accept them. At the same time, a safety-security advisor runs in parallel and provides fallback control inputs whenever the AI-based controller is rejected for safety and security reasons. To show the effectiveness of our approaches, we apply them to a case study on a quadrotor controlled by an AI-based controller. Here, the initial state of the quadrotor contains secret information which should not be revealed while the safety of the quadrotor should be ensured.
|
|
14:50-15:10, Paper WeB23.5 | Add to My Program |
Zero-Sum Game Based Secure Tracking Control of UAV against FDI Attacks Using Fixed-Time Convergent Reinforcement Learning (I) |
|
Gong, Zhenyu | Northwestern Polytechnical University |
Yang, Feisheng | Northwestern Polytechnical University |
Wu, Dongrui | Huazhong University of Science and Technology |
Keywords: Cyber-Physical Security, Game theory, Learning
Abstract: In this paper, a fixed-time convergent reinforcement learning (RL) algorithm is developed to realize the secure tracking control of the unmanned aerial vehicle (UAV) via the zero-sum game for the first time. To mitigate FDI attack on actuators that may cause the UAV to deviate from the reference trajectory, a zero-sum differential game framework is built in which the secure controller tries to minimize the common performance function, yet the attacker plays a contrary role. Obtaining the optimal secure tracking controller depends on solving the Hamilton-Jacobi-Issacs (HJI) equation related to the zero-sum game. Therefore, a critic-only online RL algorithm is proposed that can converge in a fixed time interval, with the corresponding convergence proof provided. A simulation example is given to show the effectiveness of the raised method.
|
|
WeB24 Regular Session, Orchid Main 4201AB |
Add to My Program |
Hybrid Systems I |
|
|
Chair: Zaccarian, Luca | LAAS-CNRS and University of Trento |
Co-Chair: Poveda, Jorge I. | University of California, San Diego |
|
13:30-13:50, Paper WeB24.1 | Add to My Program |
A Hybrid Redesign for Robust Stabilization without Unit Input |
|
Ballaben, Riccardo | University of Trento |
Sutulovic, Uros | Università Degli Studi Di Trento |
Invernizzi, Davide | Politecnico Di Milano |
Zaccarian, Luca | LAAS-CNRS |
Keywords: Hybrid systems, Aerospace, Lyapunov methods
Abstract: We redesign a linear stabilizer while avoiding an arbitrary value of the input signal. With a hybrid architecture based on a switching logic with two modes we obtain robust global exponential stability while ensuring that the input never takes the unwanted value, while preserving the nominal closed-loop behaviour in a neighbourhood of the origin. In the case where the minimization problem is too computationally expensive, we provide a simpler, albeit more conservative, way to determine the scaling factor. We also present a nonlinear case study to show that the proposed hybrid redesign can be extended to deal with nonlinear systems. Numerical examples are used to illustrate the theoretical results.
|
|
13:50-14:10, Paper WeB24.2 | Add to My Program |
On Invariants for Open Hybrid Systems and Their Interconnections |
|
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Teel, Andrew R. | Univ. of California at Santa Barbara |
Keywords: Hybrid systems, Network analysis and control, Networked control systems
Abstract: For a broad class of hybrid dynamical systems with inputs, termed open hybrid inclusions, a general interconnection model and solution concept are introduced. This model is employed to certify forward invariance of a set for the interconnection. The forward invariance notion allows for Zeno solutions and solutions that end prematurely namely, maximal solutions that are not complete. Sufficient conditions for forward invariance of a set that are compositional and involve a properly defined scalar-valued barrier function are proposed. An example illustrates the ideas.
|
|
14:10-14:30, Paper WeB24.3 | Add to My Program |
Kalman-Like Observer for Hybrid Systems with Linear Maps and Known Jump Times |
|
Tran, Gia Quoc Bao | Mines Paris, Université PSL |
Bernard, Pauline | Mines Paris - PSL |
Keywords: Hybrid systems, Observers for Linear systems, Kalman filtering
Abstract: We propose a hybrid Kalman-like observer for general hybrid systems with linear (time-varying) dynamics and output maps, where the solutions' jump times are exactly known. After defining a hybrid observability Gramian and the corresponding hybrid uniform complete observability, we show that the estimate provided by this observer converges asymptotically to the system solution if this observability holds together with some boundedness and invertibility conditions along the considered system solution. Then, under additional uniformity and strictness of the forgetting factors, we show exponential stability of the estimation error with an arbitrarily fast rate. The robust stability of this error against input disturbances and measurement noise is also studied. The results are illustrated on several benchmark examples, including switched systems, hybrid systems with discontinuous solutions, and continuous-time systems with multi-rate sporadic outputs.
|
|
14:30-14:50, Paper WeB24.4 | Add to My Program |
A Trajectory Based Optimization Approach for Hybrid Observer Design |
|
Oliva, Federico | University of Rome Tor Vergata |
Mattogno, Simone | University of Rome Tor Vergata |
Tenaglia, Alessandro | University of Rome Tor Vergata |
Masocco, Roberto | University of Rome "Tor Vergata" |
Martinelli, Francesco | Univ. Di Roma Tor Vergata |
Carnevale, Daniele | Universita' Di Roma |
Keywords: Hybrid systems, Optimization, Estimation
Abstract: This paper presents a study on developing a hybrid 3D position observer for a rover with acceleration and relative distance measurements. The observer design utilizes two different methodologies; a Trajectory Based Optimization Design (TBOD) and a Linear Matrix Inequality (LMI) method. We prove that, under the proposed solutions, the boundedness of the estimation error is guaranteed. The performance of the observer is evaluated and compared to a standard EKF using comprehensive Monte Carlo simulations.
|
|
14:50-15:10, Paper WeB24.5 | Add to My Program |
Humanoid Robot Co-Design: Coupling Hardware Design with Gait Generation Via Hybrid Zero Dynamics |
|
Ghansah, Adrian Boedtker | California Institute of Technology |
Kim, Jeeseop | California Institute of Technology |
Tucker, Maegan | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Hybrid systems, Robotics, Optimization
Abstract: Selecting robot design parameters can be challenging since these parameters are often coupled with the performance of the controller and, therefore, the resulting capabilities of the robot. This leads to a time-consuming and often expensive process whereby one iterates between designing the robot and manually evaluating its capabilities. This is particularly challenging for bipedal robots, where it can be difficult to evaluate the behavior of the system due to the underlying nonlinear and hybrid dynamics. Thus, in an effort to streamline the design process of bipedal robots, and maximize their performance, this paper presents a systematic framework for the co-design of humanoid robots and their associated walking gaits. To this end, we leverage the framework of hybrid zero dynamic (HZD) gait generation, which gives a formal approach to the generation of dynamic walking gaits. The key novelty of this paper is to consider both virtual constraints associated with the actuators of the robot, coupled with emph{design} virtual constraints that encode the associated parameters of the robot to be designed. These virtual constraints are combined in an HZD optimization problem which simultaneously determines the design parameters while finding a stable walking gait that minimizes a given cost function. The proposed approach is demonstrated through the design of a novel humanoid robot, ADAM, wherein its thigh and shin are co-designed so as to yield energy efficient bipedal locomotion.
|
|
15:10-15:30, Paper WeB24.6 | Add to My Program |
Averaging in a Class of Stochastic Hybrid Dynamical Systems with Time-Varying Flow Maps |
|
Poveda, Jorge I. | University of California, San Diego |
Keywords: Hybrid systems, Stability of hybrid systems, Adaptive systems
Abstract: We present stability and recurrence results for a class of stochastic hybrid dynamical systems with oscillating flow maps. These results are developed by introducing averaging tools that parallel those already existing for ordinary differential equations and deterministic hybrid dynamical systems. Such tools can be used to examine the stability properties of the original dynamics based on the properties of a simpler dynamical system constructed from the average of the original oscillating vector field. In this work, we focus on a class of systems for which global stability and recurrence results are achievable under suitable smoothness assumptions on the dynamics. By studying the average stochastic hybrid dynamics using Lyapunov-Foster functions, we derive similar stability and recurrence results for the original stochastic hybrid system.
|
|
WeB25 Invited Session, Lotus Junior 4DE |
Add to My Program |
Contraction Theory for Analysis, Synchronization and Regulation II |
|
|
Chair: Astolfi, Daniele | Cnrs - Lagepp |
Co-Chair: Bullo, Francesco | Univ of California at Santa Barbara |
Organizer: Astolfi, Daniele | Cnrs - Lagepp |
Organizer: Bullo, Francesco | Univ of California at Santa Barbara |
|
13:30-13:50, Paper WeB25.1 | Add to My Program |
Stabilization for a Class of Bilinear Systems: A Unified Approach |
|
Nazari Monfared, Morteza | University of Pavia, |
Kawano, Yu | Hiroshima University |
Machado Martínez, Juan Eduardo | Brandenburg University of Technology |
Astolfi, Daniele | Cnrs - Lagepp |
Cucuzzella, Michele | University of Pavia |
Keywords: Stability of nonlinear systems, Lyapunov methods
Abstract: This letter studies nonlinear dynamic control design for a class of bilinear systems to asymptotically stabilize a given equilibrium point while fulfilling constraints on the control input and state. We design a controller based on integral actions on the system input and output. As special cases, the proposed controller contains a dynamic controller with an integral action of either input or output only and a static controller. Stability analysis of the closed-loop system is performed based on a Lyapunov function. Level sets of the Lyapunov function are utilized to estimate a set of initial states and inputs such that the corresponding state and input trajectories are within specified compact sets. Finally, the proposed control technique is applied to a heat exchanger under constraints on the temperature of each cell (state) and the mass flow rate (input), and simulations show the effectiveness of the proposed approach.
|
|
13:50-14:10, Paper WeB25.2 | Add to My Program |
Comparison between Different Continuous-Time Realizations Based on the Tau Method of a Nonlinear Repetitive Control Scheme (I) |
|
Bajodek, Mathieu | LAAS-CNRS |
Astolfi, Daniele | Cnrs - Lagepp |
Keywords: Output regulation, Lyapunov methods, Reduced order modeling
Abstract: This paper presents a new framework to study a continuous-time finite-dimensional approximation of a repetitive control scheme for minimum-phase nonlinear systems. Based on the tau method, three state matrices are proposed to approximate the transport equation, namely tau-Fourier, tau-Legendre and tau-Chebyshev models. Then, the forwarding approach allows us to set up three finite dimensional controllers whose structure is derived from these tau models. In all three cases, we prove that a smooth steady state is reached and that the output converges to zero as the order increases. We also show that, for a fixed order, the tau-Fourier controller leads to the smallest output. Lastly, numerical comparisons are discussed and emphasized our theoretical expectations.
|
|
14:10-14:30, Paper WeB25.3 | Add to My Program |
LMI Conditions for K-Contraction Analysis: A Step towards Design (I) |
|
Zoboli, Samuele | University of Lyon 1 |
Cecilia, Andreu | Universitat Politècnica De Catalunya |
Serres, Ulysse | Université Claude Bernard Lyon 1 |
Astolfi, Daniele | Cnrs - Lagepp |
Andrieu, Vincent | Université De Lyon |
Keywords: Stability of nonlinear systems, Stability of linear systems, Nonlinear systems
Abstract: Recently, k-contraction has been proposed as a generalization of contraction properties for nonlinear time- variant systems. Existing tools for k-contraction analysis exploit complex mathematical tools known as matrix compounds. This prevented the development of related design methodologies. In this paper, we link k-contraction properties to recent partial stability analysis tools. This leads to novel, design-oriented sufficient conditions for k-contraction analysis which do not involve matrix compounds. We also show that such sufficient conditions are necessary for the linear time-invariant framework. Finally, we compare our results to existing methods and highlight their advantages.
|
|
14:30-14:50, Paper WeB25.4 | Add to My Program |
Saturating Integral Control for Infinite-Dimensional Linear Systems (I) |
|
Lorenzetti, Pietro | Tel Aviv University |
Paunonen, Lassi | Tampere University |
Vanspranghe, Nicolas | Tampere University |
Weiss, George | Tel Aviv University |
Keywords: Distributed parameter systems, Constrained control, Output regulation
Abstract: We propose a saturating integrator based controller for infinite-dimensional well-posed linear systems. This prevents the controller state from leaving a desired closed interval. This set usually represents actuator constraints or safety requirements. We use Lyapunov theory to prove closed-loop stability and tracking of a constant reference for a suitable set of feasible references. The performance of the proposed controller is showcased through an application: the boundary control of a string equation with localized viscous damping.
|
|
14:50-15:10, Paper WeB25.5 | Add to My Program |
An Internal Model Principle for Open Systems (I) |
|
Bin, Michelangelo | University of Bologna |
Marconi, Lorenzo | Univ. Di Bologna |
Sepulchre, Rodolphe | University of Cambridge |
Keywords: Output regulation, Nonlinear systems, Behavioural systems
Abstract: The paper explores an extension of the classical internal model principle of Francis and Wonham to cases where the exogeneous signals (references or disturbances) are generated by an open, rather than closed (i.e., autonomous), exosystem. We study this extension both in the linear and nonlinear case, showing that the internal model principle is necessary for robust regulation of a contractive closed-loop system. While preliminary, our results motivate a generalization of nonlinear regulation theory to open exosystems.
|
|
15:10-15:30, Paper WeB25.6 | Add to My Program |
Further Results on Incremental Input-To-State Stability Based on Contraction-Metric Analysis (I) |
|
Giaccagli, Mattia | Tel Aviv University |
Astolfi, Daniele | Cnrs - Lagepp |
Andrieu, Vincent | Université De Lyon |
Keywords: Stability of nonlinear systems, Nonlinear systems
Abstract: In this paper, we study the notion of incremental stability for continuous-time nonlinear systems forced by some external input. Thanks to the notion of Killing vector field, we provide a set of sufficient conditions based on a metric analysis for an input-affine system to be incrementally input-to-state stable and show that this also implies that its lifted system is transversally ISS. We conclude by providing a design achieving incremental ISS properties for the closed-loop system by means of a state-feedback control law that possesses an infinite gain margin property.
|
|
WeB26 Regular Session, Orchid Main 4301AB |
Add to My Program |
Distributed Parameter Systems I |
|
|
Chair: Macchelli, Alessandro | University of Bologna - Italy |
Co-Chair: Le Gorrec, Yann | Cnrs, Ensmm, Femto-St / As2m |
|
13:30-13:50, Paper WeB26.1 | Add to My Program |
Adaptive Estimation of the Pennes Bio-Heat Equation - I: Observer Design |
|
Cristofaro, Andrea | Sapienza University of Rome |
Cappellini, Guglielmo | Sapienza University of Rome |
Staffetti, Ernesto | Universidad Rey Juan Carlos |
Trappolini, Giovanni | Sapienza University of Rome |
Vendittelli, Marilena | Sapienza University of Rome |
Keywords: Distributed parameter systems, Adaptive systems, Healthcare and medical systems
Abstract: In this paper, we propose a multiple-model adaptive estimation setup for a class of uncertain parabolic reaction-diffusion PDEs encompassing the Pennes bio-heat equation, which is a motivating case study from the perspective of biomedical applications such as hyperthermia. The efficacy of the approach in estimating the system solution and recovering the value of the reaction coefficient is validated through numerical simulations in MATLAB. The validation step has highlited some limitations of classical numerical simulation tools that we propose to handle through an implementation of the estimator relying on Deep Learning libraries. This alternative approach is reported in a companion paper (Part II of this work).
|
|
13:50-14:10, Paper WeB26.2 | Add to My Program |
Incrementally Passive Infinite Dimensional Systems with a Constrained State Variable |
|
Singh, Shantanu | Tel Aviv University |
Fueyo, Sebastien | Tel Aviv University |
Weiss, George | Tel Aviv University |
Keywords: Distributed parameter systems, Constrained control, Nonlinear systems
Abstract: In this paper, we show that the passivity property of a linear infinite dimensional system, with respect to a given supply rate, is preserved in the presence of a saturating integrator, which restricts a one dimensional component of the state to a compact interval. The resulting nonlinear system is incrementally passive with the same supply rate. We give an application of our main result to a boundary controlled string equation, where the displacement of the string at some interior point is restricted to a compact interval.
|
|
14:10-14:30, Paper WeB26.3 | Add to My Program |
The Power Function for Adaptive Control in Native Space Embedding |
|
Wang, Haoran | Virginia Tech |
Powell, Nathan | Virginia Tech |
L'Afflitto, Andrea | Virginia Tech |
Kurdila, Andrew J. | Virginia Tech |
Burns, John A | Virginia Tech |
Keywords: Distributed parameter systems, Data driven control, Adaptive control
Abstract: This paper studies how the power function in a reproducing kernel Hilbert space (RKHS) can be used system- atically to design error bounding methods of adaptive estima- tion and control via the native space embedding method. The approach is based on viewing the original system of ordinary differential equations (ODEs) as a type of distributed parameter system (DPS), and subsequently defining realizable controllers by approximating the DPS with scattered bases over a domain of interest. The approach provides rigorous bounds on ultimate performance guarantees for uncertainty classes defined in the native space. One result derives an upper bound on the ultimate performance of the adaptive controller in terms of the power function. Another version of this upper bound shows how the ultimate performance can bounded in terms of a fill distance of centers of approximation in subsets that contain the closed loop trajectory. In contrast to the general theory for error bounding adaptive controllers in Euclidean space, the general approach in this paper works for functional uncertainties in any RKHS.
|
|
14:30-14:50, Paper WeB26.4 | Add to My Program |
Frequency Domain Approach for the Stability Analysis of a Fast Hyperbolic PDE Coupled with a Slow ODE |
|
Arias, Gonzalo | Pontificia Universidad Católica De Chile |
Marx, Swann | LS2N |
Mazanti, Guilherme | Inria, Université Paris-Saclay, CentraleSupélec, CNRS |
Keywords: Distributed parameter systems, Delay systems, Stability of linear systems
Abstract: This paper deals with the exponential stability of systems made of a hyperbolic PDE coupled with an ODE with different time scales, the dynamics of the PDE being much faster than that of the ODE. Such a difference of time scales is modeled though a small parameter varepsilon multiplying the time derivative in the PDE, and our stability analysis relies on the singular perturbation method. More precisely, we define two subsystems: a reduced order system, representing the dynamics of the full system in the limit varepsilon = 0, and a boundary-layer system, which represents the dynamics of the PDE in the fast time scale. Our main result shows that, if both the reduced order and the boundary-layer systems are exponentially stable, then the full system is also exponentially stable for varepsilon small enough, and our strategy is based on a spectral analysis of the systems under consideration. Our main result improves a previous result in the literature, which was proved using a Lyapunov approach and required a stronger assumption on the boundary-layer system to obtain the same conclusion.
|
|
14:50-15:10, Paper WeB26.5 | Add to My Program |
Port-Hamiltonian Control Design for an IPMC Actuated Highly Flexible Endoscope |
|
Macchelli, Alessandro | University of Bologna - Italy |
Wu, Yongxin | FEMTO-ST/ENSMM |
Le Gorrec, Yann | Cnrs, Ensmm, Femto-St / As2m |
Keywords: Distributed parameter systems, Flexible structures, Smart structures
Abstract: This paper deals with modelling and control of an endoscope actuated by Ionic Polymer Metal Composites (IPMC) patches. The endoscope is modelled by a nonlinear partial differential equation (PDE) capable to describe large deformations. The dynamics of the flexible structure and of the IPMC patches are in port-Hamiltonian form, with the actuators interconnected to the mechanical device in power-conserving way. Thus, the complete model is a port-Hamiltonian system in which a PDE with fixed boundary conditions is coupled with a set of ordinary differential equations. The control inputs are the voltages applied to the patches, and the feedback law is designed within the Interconnection and Damping Assignment Passivity-based Control (IDA-PBC) framework. The asymptotic stability of the closed-loop system is proved, and the effectiveness of the design procedure is illustrated by a numerical example.
|
|
15:10-15:30, Paper WeB26.6 | Add to My Program |
Disturbance Attenuation in the Euler-Bernoulli Beam with Viscous and Kelvin-Voigt Damping Via Piezoelectric Actuators |
|
Selivanov, Anton | The University of Sheffield |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Distributed parameter systems, LMIs
Abstract: We design a state-feedback controller, applied via piezoelectric actuators, that suppresses the effect of a distributed disturbance in the Euler-Bernoulli beam with viscous and Kelvin-Voigt damping. The controller is designed to improve performance on a finite number of modes. Its effect on the remaining (infinitely many) modes is analysed by constructing an appropriate Lyapunov functional, whose properties are guaranteed by the feasibility of linear matrix inequalities (LMIs). The LMIs allow us to design suitable controller gain and estimate the induced L2 gain. A numerical example demonstrates how this modal decomposition approach leads to a controller that significantly reduces the L2 gain.
|
|
WeC01 Invited Session, Orchid Main 4202-4306 |
Add to My Program |
Learning, Optimization, and Game Theory I |
|
|
Chair: Doan, Thinh T. | Virginia Tech |
Co-Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Doan, Thinh T. | Virginia Tech |
Organizer: Sayin, Muhammed Omer | Bilkent University |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Zhang, Kaiqing | University of Maryland |
|
16:00-16:20, Paper WeC01.1 | Add to My Program |
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient |
|
Zhang, Xiangyuan | University of Illinois at Urbana-Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Optimal control, Machine learning, Optimization
Abstract: We revisit in this paper the discrete-time linear quadratic regulator (LQR) problem from the perspective of receding-horizon policy gradient (RHPG), a newly developed model-free learning framework for control applications. We provide a fine-grained sample complexity analysis for RHPG to learn a control policy that is both stabilizing and epsilon-close to the optimal LQR solution, and our algorithm does not require knowing a stabilizing control policy for initialization. Combined with the recent application of RHPG in learning the Kalman filter, we demonstrate the general applicability of RHPG in linear control and estimation with streamlined analyses.
|
|
16:20-16:40, Paper WeC01.2 | Add to My Program |
Online Reinforcement Learning in Markov Decision Process Using Linear Programming (I) |
|
Leon, Vincent | University of Illinois at Urbana-Champaign |
Etesami, Rasoul | University of Illinois at Urbana-Champaign |
Keywords: Learning, Markov processes, Optimization algorithms
Abstract: We consider online reinforcement learning in episodic Markov decision process (MDP) with unknown transition function and stochastic rewards drawn from some fixed but unknown distribution. The learner aims to learn the optimal policy and minimize their regret over a finite time horizon through interacting with the environment. We devise a simple and efficient model-based algorithm that achieves O~(LX(TA)^(1/2)) regret with high probability, where L is the episode length, T is the number of episodes, and X and A are the cardinalities of the state space and the action space, respectively. The proposed algorithm, which is based on the concept of "optimism in the face of uncertainty", maintains confidence sets of transition and reward functions and uses occupancy measures to connect the online MDP with linear programming. It achieves a tighter regret bound compared to the existing works that use a similar confidence set framework and improves computational effort compared to those that use a different framework but with a slightly tighter regret bound.
|
|
16:40-17:00, Paper WeC01.3 | Add to My Program |
On the Convergence of Natural Policy Gradient and Mirror Descent-Like Policy Methods for Average-Reward MDPs (I) |
|
Murthy, Yashaswini | University of Illinois, Urbana-Champaign |
Srikant, R | Univ of Illinois, Urbana-Champaign |
Keywords: Learning, Stochastic optimal control, Stochastic systems
Abstract: It is now well known that Natural Policy Gradient (NPG) globally converges for discounted-reward MDPs in the tabular setting, with perfect value function estimates. However, the result cannot be directly used to obtain a corresponding convergence result for average-reward MDPs by letting the discount factor tend to one. In this paper, we prove that NPG also converges for average-reward MDPs in which each policy leads to an irreducible Markov chain. Since NPG can also be interpreted as a mirror descent based policy method, we then discuss extensions to non-tabular settings for mirror descent-based methods.
|
|
17:00-17:20, Paper WeC01.4 | Add to My Program |
Episodic Logit-Q Dynamics for Efficient Learning in Stochastic Teams (I) |
|
Unlu, Onur | Bilkent University |
Sayin, Muhammed Omer | Bilkent University |
Keywords: Game theory, Markov processes, Learning
Abstract: We present new learning dynamics combining (independent) log-linear learning and value iteration for stochastic games within the auxiliary stage game framework. The dynamics presented provably attain the efficient equilibrium (also known as optimal equilibrium) in identical-interest stochastic games, beyond the recent concentration of progress on provable convergence to some (possibly inefficient) equilibrium. The dynamics are also independent in the sense that agents take actions consistent with their local viewpoint to a reasonable extent rather than seeking equilibrium. These aspects can be of practical interest in the control applications of intelligent and autonomous systems. The key challenges are the convergence to an inefficient equilibrium and the non-stationarity of the environment from a single agent's viewpoint due to the adaptation of others. The log-linear update plays an important role in addressing the former. We address the latter through the play-in-episodes scheme in which the agents update their Q-function estimates only at the end of the episodes.
|
|
17:20-17:40, Paper WeC01.5 | Add to My Program |
Gradient Dynamics in Linear Quadratic Network Games with Time-Varying Connectivity and Population Fluctuation (I) |
|
Al Taha, Feras | Cornell University |
Rokade, Kiran | Cornell University |
Parise, Francesca | Cornell University |
Keywords: Game theory, Learning, Large-scale systems
Abstract: In this paper, we consider a learning problem among non-cooperative agents interacting in a time-varying system. Specifically, we focus on repeated linear quadratic network games, in which the network of interactions changes with time and agents may not be present at each iteration. To get tractability, we assume that at each iteration, the network of interactions is sampled from an underlying random network model and agents participate at random with a given probability. Under these assumptions, we consider a gradient-based learning algorithm and establish almost sure convergence of the agents' strategies to the Nash equilibrium of the game played over the expected network. Additionally, we prove, in the large population regime, that the learned strategy is an epsilon-Nash equilibrium for each stage game with high probability. We validate our results over an online market application.
|
|
17:40-18:00, Paper WeC01.6 | Add to My Program |
Policy Gradient Play Over Time-Varying Networks in Markov Potential Games (I) |
|
Aydin, Sarper | Texas A&M University |
Eksin, Ceyhun | Texas A&M University |
Keywords: Game theory, Learning, Optimization algorithms
Abstract: We design a multi-agent and networked policy gradient algorithm in Markov potential games. Each agent has its own rewards and utility as functions of joint actions and a shared state among agents. The state dynamics depend on the joint actions taken. Differentiable Markov potential games are defined based on the existence of a potential (value) function having partial gradients equal to the local gradients of agents' individual value functions. Agents implement continuous parameterized policies defined over the state and other agents' parameters to maximize their utilities against each other. Agents compute their stochastic policy gradients to update their parameters with respect to their local estimates of Q-functions and joint parameters. The updated parameters are shared with neighbors over a time-varying network. We prove the convergence of joint parameters to a first-order stationary point of the potential function in probability for any type of state and action spaces. Numerical results illustrate the potential advantages of using networked policies compared to independent policies.
|
|
WeC02 Invited Session, Melati Main 4001AB-4104 |
Add to My Program |
Data-Driven Distributionally Robust Optimization and Control |
|
|
Chair: Aolaritei, Ioan-Liviu | ETH Zurich |
Co-Chair: Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Organizer: Aolaritei, Liviu | ETH Zurich |
Organizer: Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Organizer: Cherukuri, Ashish | University of Groningen |
|
16:00-16:20, Paper WeC02.1 | Add to My Program |
Ordered Risk Minimization: Learning More from Less Data (I) |
|
Coppens, Peter | KU Leuven |
Patrinos, Panagiotis | KU Leuven |
Keywords: Optimization, Statistical learning, Machine learning
Abstract: We consider the worst-case expectation of a permutation invariant ambiguity set of discrete distributions as a proxy-cost for data-driven expected risk minimization. For this framework, we coin the term ordered risk minimization to highlight how results from order statistics inspired the proxy-cost. Specifically, we show how such costs serve as point-wise high-confidence upper bounds of the expected risk. The confidence level can be determined tightly for any sample size. Conversely we also illustrate how to calibrate the size of the ambiguity set such that the high-confidence upper bound has some user specified confidence. This calibration procedure notably supports phi-divergence based ambiguity sets. Numerical experiments then illustrate how the resulting scheme both generalizes better and is less sensitive to tuning parameters compared to the empirical risk minimization approach.
|
|
16:20-16:40, Paper WeC02.2 | Add to My Program |
Distributionally Robust Differential Dynamic Programming with Wasserstein Distance |
|
Hakobyan, Astghik | Seoul National University |
Yang, Insoon | Seoul National University |
Keywords: Stochastic optimal control, Optimal control
Abstract: Differential dynamic programming (DDP) is a popular technique for solving nonlinear optimal control problems with locally quadratic approximations. However, existing DDP methods are not designed for stochastic systems with unknown disturbance distributions. To address this limitation, we propose a novel DDP method that approximately solves the Wasserstein distributionally robust control (WDRC) problem, where the true disturbance distribution is unknown but a disturbance sample dataset is given. Our approach aims to develop a practical and computationally efficient DDP solution. To achieve this, we use the Kantrovich duality principle to decompose the value function in a novel way and derive closed-form expressions of the distributionally robust control and worst-case distribution policies to be used in each iteration of our DDP algorithm. This characterization makes our method tractable and scalable without the need for numerically solving any minimax optimization problems. The superior out-of-sample performance and scalability of our algorithm are demonstrated through kinematic car navigation and coupled oscillator problems.
|
|
16:40-17:00, Paper WeC02.3 | Add to My Program |
Distributionally Robust Optimal and Safe Control of Stochastic Systems Via Kernel Conditional Mean Embedding (I) |
|
Romao, Licio | University of Oxford |
Hota, Ashish Ranjan | Indian Institute of Technology (IIT), Kharagpur |
Abate, Alessandro | University of Oxford |
Keywords: Stochastic optimal control, Stochastic systems, Machine learning
Abstract: We present a distributionally robust framework for dynamic programming that uses kernel methods to design control policies satisfying both safety and optimality specifications. Specifically, we leverage kernel mean embedding to map the transition probabilities governing state evolution into an associated reproducing kernel Hilbert space. Our key idea lies in combining conditional mean embedding estimated from past data of system trajectories with the maximum mean discrepancy distance to construct an ambiguity set, and then design a robust control policy using techniques from distributionally robust optimization. The main theoretical contribution of this paper is to leverage functional analytical tools to prove that optimal policies for this infinite-dimensional min-max problem are Markovian. Additionally, we discuss approximation schemes based on discretization of inputs to make the approach computationally tractable. We validate the main theoretical findings of the paper in a benchmark control problem involving safe control of thermostatically controlled loads.
|
|
17:00-17:20, Paper WeC02.4 | Add to My Program |
Wasserstein Distributionally Robust Risk-Constrained Iterative MPC for Motion Planning: Computationally Efficient Approximations (I) |
|
Zolanvari, Alireza | University of Groningen |
Cherukuri, Ashish | University of Groningen |
Keywords: Predictive control for nonlinear systems, Iterative learning control, Robotics
Abstract: This paper considers a risk-constrained motion planning problem and aims to find the solution combining the concepts of iterative model predictive control (MPC) and data-driven distributionally robust (DR) risk-constrained optimization. In the iterative MPC, at each iteration, safe states visited and stored in the previous iterations are imposed as terminal constraints. Furthermore, samples collected during the iteration are used in the subsequent iterations to tune the ambiguity set of the DR constraints employed in the MPC. In this method, the MPC problem becomes computationally burdensome when the iteration number goes high. To overcome this challenge, the emphasis of this paper is to reduce the real-time computational effort using two approximations. First one involves clustering of data at the beginning of each iteration and modifying the ambiguity set for the MPC scheme so that safety guarantees still holds. The second approximation considers determining DR-safe regions at the start of iteration and constraining the state in the MPC scheme to such safe sets. We analyze the computational tractability of these approximations and present a simulation example that considers path planning in the presence of randomly moving obstacle.
|
|
17:20-17:40, Paper WeC02.5 | Add to My Program |
Data-Driven Distributionally Robust Coverage Control by Mobile Robots (I) |
|
Boskos, Dimitris | TU Delft |
Cortes, Jorge | University of California, San Diego |
Martinez, Sonia | University of California at San Diego |
Keywords: Autonomous robots, Uncertain systems, Optimization algorithms
Abstract: This paper provides a data-driven solution to the problem of coverage control by which a team of robots aims to optimally deploy in a spatial region where certain event of interest may occur. This event is random and described by a probability density function, which is unknown and can only be learned by collecting data. In this work, we hedge against this uncertainty by designing a distributionally robust algorithm that optimizes the locations of the robots against the worst-case probability density from an ambiguity set. This ambiguity set is constructed from data initially collected by the agents, and contains the true density function with prescribed confidence. However, the objective function that the robots seek to minimize is non-smooth. To address this issue, we employ the so-called gradient sampling algorithm, which approximates the Clarke generalized gradient by sampling the derivative of the objective function at nearby locations and stabilizes the choice of descent directions around points where the function may fail to be differentiable. This enables us to prove that the algorithm converges to a stationary point from any initial location of the robots, in analogy to the well-known Lloyd algorithm for differentiable costs when the spatial density is known.
|
|
17:40-18:00, Paper WeC02.6 | Add to My Program |
Wasserstein Tube MPC with Exact Uncertainty Propagation (I) |
|
Aolaritei, Liviu | ETH Zurich |
Fochesato, Marta | ETH Zurich |
Lygeros, John | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Stochastic systems, Robust control, Optimal control
Abstract: We study model predictive control (MPC) problems for stochastic LTI systems, where the noise distribution is unknown, compactly supported, and only observable through a limited number of i.i.d. noise realizations. Building upon recent results in the literature, which show that distributional uncertainty can be efficiently captured within a Wasserstein ambiguity set, and that such ambiguity sets propagate exactly through the system dynamics, we start by formulating a novel Wasserstein Tube MPC (WT-MPC) problem, with distributionally robust CVaR constraints. We then show that the WT-MPC problem: (1) is a direct generalization of the (deterministic) Robust Tube MPC (RT-MPC) to the stochastic setting; (2) through a scalar parameter, it interpolates between the data-driven formulation based on sample average approximation and the RT-MPC formulation, allowing us to optimally trade between safety and performance; (3) admits a tractable convex reformulation, which grows linearly in the number of available noise samples; and (4) is recursively feasible. Finally, we conclude the paper with a numerical comparison of WT-MPC and RT-MPC.
|
|
WeC03 Invited Session, Melati Junior 4010A-4111 |
Add to My Program |
Cyber-Physical Systems: Resilience |
|
|
Chair: Zhu, Quanyan | New York University |
Co-Chair: Sadabadi, Mahdieh S. | Queen Mary University of London |
Organizer: Sadabadi, Mahdieh S. | University of Manchester |
Organizer: Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Organizer: Selvi, Daniela | Università Di Pisa |
Organizer: Soudjani, Sadegh | Newcastle University |
Organizer: Murguia, Carlos | Eindhoven University of Technology |
Organizer: Chong, Michelle | Eindhoven University of Technology |
Organizer: Ferrari, Riccardo M.G. | Delft University of Technology |
Organizer: Sasahara, Hampei | Tokyo Institute of Technology |
Organizer: Zhu, Quanyan | New York University |
|
16:00-16:20, Paper WeC03.1 | Add to My Program |
Identification of Malicious Activity in Distributed Average Consensus Via Non-Concurrent Checking |
|
Hadjicostis, Christoforos N. | University of Cyprus |
Dominguez-Garcia, Alejandro D. | University of Illinois at Urbana-Champaign |
Keywords: Agents-based systems, Distributed control, Fault diagnosis
Abstract: We consider the problem of average consensus in a network system under a fixed, undirected communication topology, when there are malicious nodes present that may try to influence the average calculation. In the setting considered, the average consensus is performed by the nodes in a distributed fashion using a linear iterative algorithm. We assume malicious nodes can manipulate, in an arbitrary manner, the value of their state in the aforementioned algorithm; the problem is then to check whether or not each node is correctly performing the updates of its state. To address this problem, we propose a distributed algorithm whereby each node is in charge of checking the updates performed by its neighboring nodes based on information that it receives from them and also from the neighbors of its neighbors. The algorithm leverages ideas from non-concurrent error detection schemes and its main advantage is that information from two-hop neighbors is only needed infrequentlya relaxation that significantly reduces the communication overhead associated with the requirement to make such information available.
|
|
16:20-16:40, Paper WeC03.2 | Add to My Program |
Distributed Resilient Observer: Blended Dynamics Theory Meets L1-Minimization Approach |
|
Lee, Donggil | Seoul National University |
Kim, Junsoo | Seoul National University of Science and Technology |
Shim, Hyungbo | Seoul National University |
Keywords: Networked control systems, Observers for Linear systems, Distributed control
Abstract: This paper presents a distributed resilient observer for continuous-time linear time-invariant plants that remains functional even under sensor attacks. The proposed method aims to determine the estimation outcome that matches the majority of sensor measurements, which is formulated as an l1-minimization problem considering all the observable components of each sensor measurement. A distributed observer based on the blended dynamics theory is then proposed to solve the l1-minimization problem in a distributed manner. As a result, the distributed resilient estimation is enabled for a broader class of systems compared to previous works. The design procedure is constructive with parameters obtained from a specified condition that is equivalent to the well-known null-space property.
|
|
16:40-17:00, Paper WeC03.3 | Add to My Program |
Leader-Follower Formations Subject to False Data Injections: A Resilient Distributed Model Predictive Approach (I) |
|
Famularo, Domenico | Università Degli Studi Della Calabria |
Franze, Giuseppe | Universita' Della Calabria |
Tedesco, Francesco | Università Della Calabria |
Venturino, Antonello | Università Della Calabria |
Keywords: Resilient Control Systems, Cyber-Physical Security, Predictive control for nonlinear systems
Abstract: In this paper, resilience issues for platoons of autonomous agents are addressed when false data injections affect the information exchanged among the neighbors via a communication medium. A distributed model predictive control scheme is used for dealing with the overall regulation task. Conversely, the core of this study relies on the design of an efficient anomaly detector and viable attack countermeasures. In particular, it is formally proven that the proposed device is capable to uncover in finite time malicious actions by simple set-containment set-membership conditions arising from the concept of k-step ahead state predictions convex sets. Moreover, the attack countermeasures have a twofold nature: the first one is conceived by exploiting feasibility arguments of the model predictive philosophy; while the second resilient operation takes inspiration from rejuvenation ideas by leading to safe splitting and/or queuing the initial multi-agent formation.
|
|
17:00-17:20, Paper WeC03.4 | Add to My Program |
Resilient Quantized Consensus with Multi-Hop Communication (I) |
|
Yuan, Liwei | Hunan University |
Ishii, Hideaki | Tokyo Institute of Technology |
Keywords: Networked control systems, Cyber-Physical Security, Agents-based systems
Abstract: In this paper, we study the problem of resilient quantized consensus problem where some of the agents may behave maliciously. The network consists of agents taking quantized/integer-valued states with asynchronous updates and time delays in the communication between agents. We propose a quantized weighted mean subsequence reduced (QW-MSR) algorithm where agents are capable to communicate with multi-hop neighbors. We provide necessary and sufficient conditions for our algorithm to achieve resilient quantized consensus for synchronous/asynchronous updates under the malicious attacks. Compared to existing methods in the literature, our algorithm has tighter graph condition and, in particular, we establish that with multi-hop communication, the requirement for achieving resilient quantized consensus is less stringent. Numerical examples are given to verify the efficacy of the proposed algorithm.
|
|
17:20-17:40, Paper WeC03.5 | Add to My Program |
Temporal Logic Resilience for Cyber-Physical Systems (I) |
|
Saoud, Adnane | CentraleSupelec |
Jagtap, Pushpak | Indian Institute of Science |
Soudjani, Sadegh | Newcastle University |
Keywords: Resilient Control Systems, Formal Verification/Synthesis, Hybrid systems
Abstract: We consider the notion of resilience for cyber-physical systems, that is, the ability of the system to withstand adverse events while maintaining acceptable functionality. We use temporal logic to express the requirements on the acceptable functionality and define the resilience metric as the maximum disturbance under which the system satisfies the temporal requirements. We fix a parameterized template for the set of disturbances and form a robust optimization problem under the system dynamics and the temporal specifications to find the maximum value of the parameter. From the computational point of view, we show how this optimization can be solved for linear systems and provide under-approximations of the resilience metric for nonlinear systems using linear programs. The computations are demonstrated on the temperature regulation of buildings and adaptive cruise control.
|
|
17:40-18:00, Paper WeC03.6 | Add to My Program |
Resilient Integral Control for Regulating Systems with Convex Input Constraints (I) |
|
Konstantopoulos, George | University of Patras |
Bechlioulis, Charalampos P. | University of Patras |
Keywords: Nonlinear systems, Output regulation, Constrained control
Abstract: In this paper, a novel integral control that can maintain the control input vector trajectory of a generic ISS linear or nonlinear plant within a prescribed compact and convex set is proposed. During normal operating conditions, the proposed controller can regulate the plant to the desired setpoint, while in the case of abnormal conditions, e.g. sensor faults, unrealistic reference input command, the controller introduces an inherent resilience property by maintaining the entire control input vector of the plant within a desired convex set. The boundedness of the control input vector is analytically proven using invariant set theory and vector field analysis (Nagumos theorem). Opposed to conventional and more advanced integral controllers that either restrict each element of the control input vector independently or bound its Euclidean norm, in this paper, a detailed methodology for designing a resilient integral control to guarantee a generic compact and convex input constraint for a plant with unknown structure or dynamics is presented for the first time. A practical example of an underwater vehicle is investigated to validate the efficiency and resilience of the proposed controller under changes of the reference signal and under sensor faults.
|
|
WeC04 Invited Session, Simpor Junior 4913 |
Add to My Program |
Efficient Communication for Networked Control Systems & Games |
|
|
Chair: Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Co-Chair: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Organizer: Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Organizer: Zaman, Muhammad Aneeq uz | UIUC |
Organizer: Bastopcu, Melih | University of Illinois Urbana Champaign |
Organizer: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
|
16:00-16:20, Paper WeC04.1 | Add to My Program |
Event-Based Admission Control Over Multi-Hop Networks with Self-Interference (I) |
|
Ayan, Onur | Huawei Technologies |
Kutsevol, Polina | Technical University of Munich |
Kellerer, Wolfgang | Technical University of Munich |
An, Xueli | Huawei Technologies |
Keywords: Control of networks, Networked control systems, Control over communications
Abstract: In this work, we investigate the application of event-triggering in a multi-hop networked control scenario with interference constraints. In particular, we consider a line network comprised of H nodes with neighboring nodes affecting the reliability of each other, hence, introducing packet loss and non-negligible end-to-end latency. Having the practical feasibility in mind, we focus on admission control mechanisms at the sensor without assuming a centralized scheduling entity that has the perfect and global knowledge of the entire network. We demonstrate that, if the limitations of the network are neglected, the event-triggering mechanism may lead to low end-to-end reliability causing a significant degradation of the control performance. As a solution, we propose two novel admission control policies that aim to find a minimum inter-event time (MIET) in order to prevent a network congestion followed by a control performance deterioration. While the first policy follows an analytical approach combining the core principles of event-triggering and congestion control, the second policy learns the MIET adaptively without the knowledge of the network model. We show through numerical evaluation that the proposed strategies improve the control performance by more than 20% if the event criterion is selected appropriately.
|
|
16:20-16:40, Paper WeC04.2 | Add to My Program |
A Group Formation Game for Local Anomaly Detection (I) |
|
Ye, Zixin | University of Melbourne |
Alpcan, Tansu | The University of Melbourne |
Leckie, Christopher Andrew | The University of Melbourne |
Keywords: Game theory, Machine learning, Computer/Network Security
Abstract: This paper studies strategic group formation for local anomaly detection with potential applications to Cognitive Radio Networks (CRN) and the Internet-of-Things (IoT). The problem comprises multiple local anomaly detection tasks which use machine learning (ML) models and partial data. We consider a two-layer network structure with anomaly detectors in the lower layer acting as local anomaly detectors and central nodes at the upper layer as data aggregators, which train the ML models used by local anomaly detectors. The problem is addressed using a strategic (non-cooperative) game formulation, where all central nodes and detectors are players. The players interactively learn one or multiple optimal machine learning models for their dynamically identified local anomaly detection problems. The game is next formulated as a successive optimization problem and solved using the player's best responses to compute a Nash equilibrium. Under mild conditions, we prove that this group formation game is also a potential game, and any acquired solution achieving the local optima corresponds to a Nash Equilibrium. Experimental results are consistent with theoretical ones and show fast convergence to the solution.
|
|
16:40-17:00, Paper WeC04.3 | Add to My Program |
Timely Tracking of a Remote Dynamic Source Via Multi-Hop Renewal Updates (I) |
|
Kaswan, Priyanka | University of Maryland |
Ulukus, Sennur | University of Maryland |
Keywords: Communication networks, Stochastic systems, Network analysis and control
Abstract: We study the version age of information in a multi-hop multi-cast cache-enabled network, where updates at the source are marked with incrementing version numbers, and the inter-update times on the links are not necessarily exponentially distributed. We focus on the set of non-arithmetic distributions, which includes continuous probability distributions as a subset, with finite first and second moments for inter-update times. We first characterize the instantaneous version age of information at each node for an arbitrary network. We then explicate the recursive equations for instantaneous version age of information in multi-hop networks and employ semi-martingale representation of renewal processes to derive closed form expressions for the expected version age of information at an end user. We show that the expected age in a multi-hop network exhibits an additive structure. Further, we show that the expected age at each user is proportional to the variance of inter-update times at all links between a user and the source. Thus, end user nodes should request packet updates at constant intervals.
|
|
17:00-17:20, Paper WeC04.4 | Add to My Program |
Communication-Efficient Local SGD for Over-Parametrized Models with Partial Participation (I) |
|
Qin, Tiancheng | University of Illinois at Urbana-Champaign |
Yevale, Jayesh | University of Illinois at Urbana-Champaign |
Etesami, Rasoul | University of Illinois at Urbana-Champaign |
Keywords: Optimization, Distributed parameter systems, Machine learning
Abstract: We analyze the convergence rate of Local stochastic gradient descent (SGD) for over-parameterized models, which is at the core of federated learning. In this model, we allow the server to randomly select a subset of agents and communicate with them at each communication round to optimize a global objective function. This captures the realistic scenarios where the communication link between the server and the agents may break down due to random link failures or adversarial attacks. Under such an elaborate setting, we establish convergence guarantees for smooth objective functions without the convexity assumption that is the first for the regime. We also consider an extension of our results under a different random participation setting over general network structures (rather than a star network) in which an agent participates in the local optimization steps of its neighbors by some edge-dependent probability. We characterize the convergence rate of the proposed algorithm in terms of the number of communication rounds which confirms the communication efficiency of our methods.
|
|
17:20-17:40, Paper WeC04.5 | Add to My Program |
Efficient Communication for Pursuit-Evasion Games with Asymmetric Information (I) |
|
Maity, Dipankar | University of North Carolina at Charlotte |
Keywords: Agents-based systems, Control over communications, Game theory
Abstract: We consider a class of pursuit-evasion differential games in which the evader has continuous access to the pursuers location but not vice-versa. There is an immobile sensor (e.g., a ground radar station) that can sense the evaders location and communicate that information intermittently to the pursuer. Transmitting the information from the sensor to the pursuer is costly and only a finite number of transmissions can happen throughout the entire game. The outcome of the game is determined by the control strategies of the players and the communication strategy between the sensor and the pursuer. We obtain the (Nash) equilibrium control strategies for both the players as well as the optimal communication strategy between the static sensor and the pursuer. We discuss a dilemma for the evader that emerges in this game. We also discuss the emergence of implicit communication where the absence of communication from the sensor can also convey some actionable information to the pursuer.
|
|
17:40-18:00, Paper WeC04.6 | Add to My Program |
A Privacy-Preserving Finite-Time Push-Sum Based Gradient Method for Distributed Optimization Over Digraphs |
|
Chen, Xiaomeng | Hong Kong University of Science and Technology |
Jiang, Wei | Aalto University, Finland |
Charalambous, Themistoklis | University of Cyprus |
Shi, Ling | Hong Kong Un | |