| |
Last updated on November 17, 2022. This conference program is tentative and subject to change
Technical Program for Thursday December 8, 2022
|
ThPL Plenary Session, Tulum Ballroom A-H |
Add to My Program |
Nonlinear Systems with Limited Data: Estimation, Control and
Synchronization |
|
|
Chair: Prieur, Christophe | CNRS |
Co-Chair: Serrani, Andrea | The Ohio State University |
|
08:30-09:30, Paper ThPL.1 | Add to My Program |
Nonlinear Systems with Limited Data: Estimation, Control and Synchronization |
|
Liberzon, Daniel | Univ of Illinois, Urbana-Champaign |
Keywords: Nonlinear systems, Estimation
Abstract: The general focus of this talk is on nonlinear control systems in which only limited amounts of data can be exchanged between different parts of the system. In such systems, a control designer is faced not just with the task of developing a control algorithm, but also with deciding whether a given objective is achievable with the available data, or with characterizing the minimal data needed to meet the objective. In this context, we consider two specific problem settings. We first address state estimation and model detection with finite data rate. Our unifying approach to these two tasks relies on a notion of entropy for dynamical systems. We then discuss observer design with guaranteed robustness to measurement errors. This problem is motivated in part by applications involving synchronization with limited data.
|
|
ThAT01 Regular Session, Tulum Ballroom A |
Add to My Program |
Control and Estimation of Network Systems |
|
|
Chair: Stella, Leonardo | University of Birmingham |
Co-Chair: Franceschelli, Mauro | University of Cagliari |
|
10:00-10:20, Paper ThAT01.1 | Add to My Program |
Opportunistic Wireless Control Over State-Dependent Fading Channels |
|
Wang, Shuling | Shanghai Jiao Tong University |
Li, Peizhe | Shanghai Jiaotong University |
Zhu, Shanying | Shanghai Jiao Tong University |
Chen, Cailian | Shanghai Jiao Tong University |
Keywords: Networked control systems, Boolean control networks and logic networks, Manufacturing systems and automation
Abstract: The heterogeneous system consisting of the wireless control system (WCS) and mobile agent system (MAS) is ubiquitous in Industrial Internet of Things (IIoT) systems. Within this system, the positions of mobile agents may lead to shadow fading on the wireless channel that the WCS is controlled over and can significantly compromise the WCS's performance. This paper focuses on the controller design for the MAS to ensure the performance of WCS in the presence of WCS and MAS coupling. Firstly, the constrained finite field network (FFN) with profile-dependent switching topology is adopted to proceed the operational control for the MAS. By virtue of the algebraic state space representation (ASSR) method, an equivalent form is obtained for the WCS and MAS coupling. A necessary and sufficient condition in terms of constrained set stabilization is then established to ensure the Lyapunov-like performance with expected decay rate. Finally, a graphical method together with the breath-first searching is provided to design state feedback controllers for the MAS. With this method, it is easy to check the constrained set stabilization of MAS and to ensure the performance requirements of WCS in the presence of WCS and MAS coupling. The study of an illustrative example shows the effectiveness of the proposed method.
|
|
10:20-10:40, Paper ThAT01.2 | Add to My Program |
Chaotic Synchronization of Neuronet-Type Stochastic Complex Networks Using Risk Sensitive Optimal Control |
|
Liu, Ziqian | State University of New York Maritime College |
Keywords: Large-scale systems, Lyapunov methods, Neural networks
Abstract: In this paper, we present a risk sensitive control for chaotic synchronization of a class of nonlinear complex dynamic systems. With an eye on a predefined risk sensitive parameter, the goal of this new design approach is to achieve the synchronization of neuronet-type stochastic complex networks toward a chaotic target node. The proposed method is rigorously formulated via stochastic Lyapunov technique, robust inverse optimality, and the Hamilton-Jacobi-Bellman equation, which ensures to accomplish the design objective that includes an achievable meaningful performance index. Furthermore, a numerical example is given to demonstrate the effectiveness of the introduced technique.
|
|
10:40-11:00, Paper ThAT01.3 | Add to My Program |
Control of Misbehaving Multiagent Networks through Driver Nodes: The Directed Graph Case |
|
Yildirim, Emre | University of South Florida |
Yucelen, Tansel | University of South Florida |
Keywords: Cooperative control, Control system architecture
Abstract: In this paper, we study multiagent networks subject to misbehaving nodes (i.e., nodes subject to exogenous disturbances) over fixed, connected, and directed graphs. In contrast to previous studies, which apply feedback control signals to every node in the network to suppress the adverse effect of misbehaving nodes, we propose to apply feedback control signals to a subset of nodes (i.e., driver nodes) in the network due to constraints. In particular, proportional-integral feedback controllers are proposed to be executed by the driver nodes, which guarantee that the overall multiagent network is stable in the sense of input-to-state stability (i.e., they make the resulting closed-loop system matrix Hurwitz). Following that we present a system-theoretical approach to find the steady-state value of each node in the network. In addition, a graph-theoretical approach is utilized to allow users to find steady-state values of critical nodes without requiring the knowledge of the Laplacian matrix of the overall multiagent network. Based on the results presented in this paper, one can gain the understanding how to select the driver nodes to negate the effect of misbehaving nodes on the neighborhood of the critical nodes. Finally, illustrative numerical examples are also presented to demonstrate the efficacy of the proposed approaches.
|
|
11:00-11:20, Paper ThAT01.4 | Add to My Program |
A Reinforcement Learning Approach to Sensing Design in Resource-Constrained Wireless Networked Control Systems |
|
Ballotta, Luca | University of Padova |
Peserico, Giovanni | University of Padova, Autec Srl |
Zanini, Francesco | Università Di Padova |
Keywords: Networked control systems, Machine learning, Kalman filtering
Abstract: In this paper, we consider a wireless network of smart sensors (agents) that monitor a dynamical process and send measurements to a base station that performs global monitoring and decision-making. Smart sensors are equipped with both sensing and computation, and can either send raw measurements or process them prior to transmission. Constrained agent resources raise a fundamental latency-accuracy trade-off. On the one hand, raw measurements are inaccurate but fast to produce. On the other hand, data processing on resource-constrained platforms generates accurate measurements at the cost of non-negligible computation latency. Further, if processed data are also compressed, latency caused by wireless communication might be higher for raw measurements. Hence, it is challenging to decide when and where sensors in the network should transmit raw measurements or leverage time-consuming local processing. To tackle this design problem, we propose a Reinforcement Learning approach to learn an efficient policy that dynamically decides when measurements are to be processed at each sensor. Effectiveness of our proposed approach is validated through a numerical simulation with case study on smart sensing motivated by the Internet of Drones.
|
|
11:20-11:40, Paper ThAT01.5 | Add to My Program |
Mission-Aware Value of Information Censoring for Distributed Filtering |
|
Calvo-Fullana, Miguel | Massachusetts Institute of Technology |
How, Jonathan, P. | MIT |
Keywords: Sensor fusion, Estimation, Sensor networks
Abstract: In this paper, we study the problem of distributed estimation with an emphasis on communication-efficiency. The proposed algorithm is based on a windowed maximum a posteriori (MAP) estimation problem, wherein each agent in the network locally computes a Kalman-like filter estimate that approximates the centralized MAP solution. Information sharing among agents is restricted to their neighbors only, with guarantees on overall estimate consistency provided via logarithmic opinion pooling. The problem is efficiently distributed using the alternating direction method of multipliers (ADMM), whose overall communication usage is further reduced by a value of information (VoI) censoring mechanism, wherein agents only transmit their primal-dual iterates when deemed valuable to do so. The proposed censoring mechanism is mission-aware, enabling a globally efficient use of communication resources while guaranteeing possibly different local estimation requirements. To illustrate the validity of the approach we perform simulations in a target tracking scenario.
|
|
11:40-12:00, Paper ThAT01.6 | Add to My Program |
Linear Stochastic Graphon Systems with Q-Space Noise |
|
Dunyak, Alexander | McGill University |
Caines, Peter E. | McGill University |
Keywords: Stochastic systems, Networked control systems, Subspace methods
Abstract: The modelling and control of systems on large complex networks is intractable in general. One approach is to use graphon theory which provides limit objects for infinite sequences of graphs permitting one to approximate arbitrarily large networks by infinite dimensional operators. Such a formulation was initiated in the work of Gao and Caines (2020, 2021) extending classical linear system control theory to the control of systems on large networks. This paper introduces infinite dimensional stochastic processes called Q-space noise into this framework. First, Brownian motions in Hilbert spaces are defined. Second, stochastic dynamical systems on large graphs using Q-space noise processes are shown to converge in the graph limit in expectation. Third, state-to-state and linear quadratic control of these systems is formulated and the limit approximations are established. Finally, the behavior of these approximations is illustrated numerically.
|
|
ThAT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Distributed Control I |
|
|
Chair: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Co-Chair: Dogan, Kadriye Merve | Embry-Riddle Aeronautical University |
|
10:00-10:20, Paper ThAT02.1 | Add to My Program |
Distributed Constrained Connectivity Keeping Supervision Scheme in the Presence of Static Obstacles |
|
Casavola, Alessandro | Universita' Della Calabria |
D'Angelo, Vincenzo | Università Della Calabria |
El Qemmah, Ayman | Università Della Calabria |
Tedesco, Francesco | Università Della Calabria |
Torchiaro, Franco Angelo | University of Calabria |
Keywords: Distributed control, Predictive control for linear systems, Autonomous systems
Abstract: Connectivity maintenance and safe navigation around obstacles are fundamental challenges for vehicle formations. This paper introduces a novel multi-level supervision distributed architecture aimed at supervising a set of dynamically decoupled agents operating in a 2D space and subject to obstacle avoidance and connectivity- keeping constraints. An existing supervision scheme based on Distributed Command Governor ideas is here extended in order to adequately tackle non-convex obstacle avoidance constraints while maintaining the communication connectivity of the formation, here modeled as a dynamic graph. To this end, the proposed method enforces the existence of a specific minimum spanning tree at each time instant while fulfilling obstacle avoidance constraints by using a separation hyperplane. Conditions that formally prove the feasibility of the proposed strategy are included along with simulations that show the effectiveness of the proposed method.
|
|
10:20-10:40, Paper ThAT02.2 | Add to My Program |
Distributed Suboptimal Model Predictive Control with Minimal Information Exchange |
|
Jané Soneira, Pol | Karlsruher Institute Für Technology |
Gießler, Armin | Karlsruhe Institute of Technology |
Pfeifer, Martin | Institute of Control Systems, Karlsruhe Institute of Technology |
Hohmann, Soeren | KIT |
Keywords: Distributed control, Predictive control for linear systems, Optimization algorithms
Abstract: In this paper, we propose an approach to distributed model predictive control based on distributed optimization and suboptimal model predictive control theory. In the proposed method, the individual subsystems are coupled through the inputs and do not need to exchange information about their system dynamics or objective function. Coordination is achieved by iteratively exchanging an estimation of the subsystem's optimal control sequence only with neighboring subsystems. The subsystems compute iteratively an admissible control sequence, which is asymptotically stabilizing the system. With mild assumptions, a minimum number of iterations for obtaining an admissible control sequence and hence guaranteeing stability of the closed-loop system can be computed independently of the initial overall system state. Furthermore, it is shown that the centrally optimal control sequence is achieved at convergence. The subsystem's practitioner can then choose to apply a suboptimal but stabilizing control sequence or wait for the optimal control sequence when convergence is attained. The approach is illustrated through an academic example.
|
|
10:40-11:00, Paper ThAT02.3 | Add to My Program |
Coordination of Uncertain Multiagent Systems with Non-Identical Actuation Capacities |
|
Kurttisi, Atahan | Embry-Riddle Aeronautical University |
Aly, Islam | Embry-Riddle Aeronautical University |
Dogan, Kadriye Merve | Embry-Riddle Aeronautical University |
Keywords: Distributed control, LMIs, Adaptive control
Abstract: This paper provides a distributed adaptive control architecture for uncertain multiagent systems with nonidentical actuation capacities and unknown control effectiveness to achieve cooperative behaviors for real-world applications. In detail, our approach includes a user-assigned Laplacian matrix for creating cooperative behaviors with multiple agents, a hedging-based reference model to provide correct adaptation that is not affected by the presence of heterogeneous actuator dynamics in the networked system, and a distributed adaptive control architecture to deal with the system anomalies. The stability of the overall multiagent system is showed by utilizing the Lyapunov stability theorem, and allowance actuator bandwidth limits are calculated by using linear matrix inequalities. Results of numerical examples are illustrated to indicate the performance of the proposed control algorithm on a multiagent system that is a fully connected circle graph.
|
|
11:00-11:20, Paper ThAT02.4 | Add to My Program |
Distributed Infinite-Horizon Optimal Control of Discrete-Time Linear Systems Over Networks |
|
d'Angelo, Massimiliano | Sapienza Università Di Roma |
Battilotti, Stefano | Univ. La Sapienza |
Cacace, Filippo | Università Campus Biomedico Di Roma |
Keywords: Distributed control, Control of networks, Stochastic optimal control
Abstract: In this paper we consider the distributed infinite-horizon Linear-Quadratic-Gaussian optimal control problem for discrete-time systems over networks. In particular, the feedback controller is composed of local control stations, which receives some measurement data from the plant process and regulates a portion of the input signal. We provide a solution when the nodes have information on the structural data of the whole network but takes local actions, and also when only local information on the network are available to the nodes. The proposed solution is arbitrarily close to the optimal centralized one (in terms of cost index) when the intermediate consensus steps are sufficiently large
|
|
11:20-11:40, Paper ThAT02.5 | Add to My Program |
A Consistency Constraint-Based Approach to Coupled State Constraints in Distributed Model Predictive Control |
|
Wiltz, Adrian | KTH Royal Institute of Technology |
Chen, Fei | KTH Royal Institute of Technology |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Distributed control, Constrained control, Predictive control for nonlinear systems
Abstract: In this paper, we present a distributed model predictive control (DMPC) scheme for dynamically decoupled systems which are subject to state constraints, coupling state constraints and input constraints. In the proposed control scheme, neighbor-to-neighbor communication suffices and all subsystems solve their local optimization problem in parallel. The approach relies on consistency constraints which define a neighborhood around each subsystem’s reference trajectory where the state of the respective subsystem is guaranteed to stay in. Reference trajectories and consistency constraints are known to neighboring subsystems. Contrary to other relevant approaches, the reference trajectories are improved iteratively. Besides, the presented approach allows the formulation of convex optimization problems even in the presence of non-convex state constraints. The algorithm’s effectiveness is demonstrated with a simulation.
|
|
11:40-12:00, Paper ThAT02.6 | Add to My Program |
Differentially Private Distributed Mismatch Tracking Algorithm for Constraint-Coupled Resource Allocation Problems |
|
Wu, Wenwen | Shanghai Jiao Tong University |
Zhu, Shanying | Shanghai Jiao Tong University |
Liu, Shuai | Nanyang Technological University |
Guan, Xin-Ping | Shanghai Jiao Tong University |
Keywords: Distributed control, Optimization algorithms, Sensor networks
Abstract: This paper considers privacy-concerned distributed constraint-coupled resource allocation problems over an undirected network, where each agent holds a private cost function and obtains the solution via only local communication. With privacy concerns, we mask the exchanged information with independent Laplace noise against a potential attacker with potential access to all network communications. We propose a differentially private distributed mismatch tracking algorithm (diff-DMAC) to achieve cost-optimal distribution of resources while preserving privacy. Adopting constant stepsizes, the linear convergence property of diff-DMAC in mean square is established under the standard assumptions of Lipschitz gradients and strong convexity. Moreover, it is theoretically proven that the proposed algorithm is -differentially private. And we also show the trade-off between convergence accuracy and privacy level. Finally, a numerical example is provided for verification.
|
|
ThAT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Autonomous Systems I |
|
|
Chair: Smith, Stephen L. | University of Waterloo |
Co-Chair: Kurtz, Vincent | University of Notre Dame |
|
10:00-10:20, Paper ThAT03.1 | Add to My Program |
Mixed-Integer Programming for Signal Temporal Logic with Fewer Binary Variables |
|
Kurtz, Vincent | University of Notre Dame |
Lin, Hai | University of Notre Dame |
Keywords: Autonomous systems, Robotics, Optimization
Abstract: Signal Temporal Logic (STL) provides a convenient way of encoding complex control objectives for robotic and cyber-physical systems. The state-of-the-art in trajectory synthesis for STL is based on Mixed-Integer Convex Programming (MICP). The MICP approach is sound and complete, but has limited scalability due to exponential complexity in the number of binary variables. In this letter, we propose a more efficient MICP encoding for STL. Our new encoding is based on the insight that disjunction can be encoded using a logarithmic number of binary variables and conjunction can be encoded without binary variables. We demonstrate in simulation examples that our proposed approach significantly outperforms the state-of-the-art for long and complex specifications.
|
|
10:20-10:40, Paper ThAT03.2 | Add to My Program |
Collective Behavior of Generic Linear Agents Interacting by Dynamic Topology with General Structures |
|
Chen, Lulu | University of Electronic Science and Technology of China |
Cheng, Yuhua | University of Electronic Science and Technology of China |
Shao, Jinliang | University of Electronic Science and Technology of China, Chengd |
Shi, Lei | University of Electronic Science and Technology |
Zheng, Wei Xing | Western Sydney University |
Keywords: Autonomous systems, Network analysis and control
Abstract: Most of the classic collective behavior occurring in multiagent systems (MASs) are built upon the network topology with sufficient connectivity. The MASs that impose no restrictions on the network topology deserves due attention to reveal more general collective behavior, which means unifying consensus and containment control. This paper examines the collective behavior in the network of homogeneous generic linear agents over the dynamic topology without structural constraints. By virtue of the product of nonnegative matrices and the properties of algebraic digraphs, a systematic coordination analysis with proper parameter selection is put forward. It is shown that local consensus is reached by the agents of the same closed strongly connected component, whereas the agents outside the closed strongly connected components are ultimately confined in the convex hull defined by the agents of the closed strongly connected components. The results of the theoretical analysis are supported by computer simulations.
|
|
10:40-11:00, Paper ThAT03.3 | Add to My Program |
Leaderless Affine Formation Maneuvers Over Directed Graphs |
|
Garanayak, Chinmay | Indian Institute of Technology Bombay |
Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Keywords: Autonomous systems, Cooperative control, Decentralized control
Abstract: This paper studies leaderless affine formation maneuver control over a directed sensing topology, in the presence of constant input disturbances. Directed sensing topology is important when limitations in the sensing capabilities of an agent are considered. Unlike traditional leader-follower approaches, in case of leaderless formation maneuvers, there are no leaders. First, control laws for leaderless affine formation maneuvering for single integrator agents are presented, considering a directed sensing graph. Global Asymptotic Stability (GAS) is proved for formation tracking error in presence of constant disturbances. Subsequently, for higher-order systems, control laws are designed using an adaptive back-stepping based approach and GAS is proved for such control laws. Simulations are provided to validate the results.
|
|
11:00-11:20, Paper ThAT03.4 | Add to My Program |
Adding Neural Network Controllers to Behavior Trees without Destroying Performance Guarantees |
|
Sprague, Christopher Iliffe | KTH Royal Institute of Technology |
Ogren, Petter | KTH Royal Institute of Technology |
Keywords: Autonomous systems, Stability of hybrid systems, Switched systems
Abstract: In this paper, we show how Behavior Trees that have performance guarantees, in terms of safety and goal convergence, can be extended with components that were designed using machine learning, without destroying those performance guarantees. Machine learning approaches such as reinforcement learning or learning from demonstration can be very appealing to AI designers that want efficient and realistic behaviors in their agents. However, those algorithms seldom provide guarantees for solving the given task in all different situations while keeping the agent safe. Instead, such guarantees are often easier to find for manually designed model-based approaches. In this paper we exploit the modularity of behavior trees to extend a given design with an efficient, but possibly unreliable, machine learning component in a way that preserves the guarantees. The approach is illustrated with an inverted pendulum example.
|
|
11:20-11:40, Paper ThAT03.5 | Add to My Program |
Scheduling Operator Assistance for Shared Autonomy in Multi-Robot Teams |
|
Cai, Yifan | University of Waterloo |
Dahiya, Abhinav | University of Waterloo |
Wilde, Nils | TU Delft |
Smith, Stephen L. | University of Waterloo |
Keywords: Autonomous systems, Human-in-the-loop control, Optimization algorithms
Abstract: In this paper, we consider the problem of allo- cating human operator assistance in a system with multiple autonomous robots. Each robot is required to complete inde- pendent missions, each defined as a sequence of tasks. While executing a task, a robot can either operate autonomously or be teleoperated by the human operator to complete the task at a faster rate. We formulate our problem as a Mixed Integer Linear Program, which can be used to optimally solve small to moderate sized problem instances. We also develop an anytime algorithm that makes use of the problem structure to provide a fast and high-quality solution of the operator scheduling problem, even for larger problem instances. Our key insight is to identify blocking tasks in greedily-created schedules and iteratively remove those blocks to improve the quality of the solution. Through numerical simulations, we demonstrate the benefits of the proposed algorithm as an efficient and scalable approach that outperforms other greedy methods.
|
|
11:40-12:00, Paper ThAT03.6 | Add to My Program |
Verified Compositions of Neural Network Controllers for Temporal Logic Control Objectives |
|
Wang, Jun | Washington University in St. Louis |
Kalluraya, Samarth | Washington University in St. Louis |
Kantaros, Yiannis | Washington University in St. Louis |
Keywords: Autonomous systems, Neural networks, Automata
Abstract: This paper presents a new approach to design verified compositions of Neural Network (NN) controllers for autonomous systems with tasks captured by Linear Temporal Logic (LTL) formulas. Particularly, the LTL formula requires the system to reach and avoid certain regions in a temporal/logical order. We assume that the system is equipped with a finite set of trained NN controllers. Each controller has been trained so that it can drive the system towards a specific region of interest while avoiding others. Our goal is to check if there exists a temporal composition of the trained NN controllers - and if so, to compute it - that will yield composite system behaviors that satisfy a user-specified LTL task for any initial system state belonging to a given set. To address this problem, we propose a new approach that relies on a novel integration of automata theory and recently proposed reachability analysis tools for NN-controlled systems. We note that the proposed method can be applied to other controllers, not necessarily modeled by NNs, by appropriate selection of the reachability analysis tool. We focus on NN controllers due to their lack of robustness. The proposed method is demonstrated on navigation tasks for aerial vehicles.
|
|
ThAT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Optimization and Learning |
|
|
Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Co-Chair: Doan, Thinh T. | Virginia Tech |
|
10:00-10:20, Paper ThAT04.1 | Add to My Program |
Data-Driven Pole Placement in LMI Regions with Robustness Guarantees |
|
Mukherjee, Sayak | Pacific Northwest National Laboratory |
Hossain, Ramij Raja | Iowa State University |
Keywords: Machine learning, Optimal control, Robust control
Abstract: This paper proposes a data-driven methodology to place the closed-loop poles in desired convex regions in the complex plane with sufficient robustness constraints. We considered the system state and input matrices unknown and only used the measurements of system trajectory. The closed-loop pole placement problem in the linear matrix inequality (LMI) regions is considered a classic robust control problem; however, that requires knowledge about the state and input matrices of the linear system. We bring in ideas from the behavioral system theory and persistency of excitation condition-based fundamental lemma to develop a data-driven counterpart that satisfies multiple closed-loop robustness specifications, such as mathcal{D}-stability and mixed H_2/H_{infty} performance specifications. We provide solutions based on the availability of the disturbance input, both in the controlled and fully uncertain environment, leading to data-driven semi-definite programs (SDPs) with sufficient guarantees. We validate the theoretical results with numerical simulations on a third-order dynamic system.
|
|
10:20-10:40, Paper ThAT04.2 | Add to My Program |
Closed-Form Estimates of the LQR Gain from Finite Data |
|
Celi, Federico | University of California, Riverside |
Baggio, Giacomo | University of Padova, Italy |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Learning, Optimal control, Linear systems
Abstract: When dealing with unknown systems, data can be used to directly learn controllers with desirable features, thus bypassing system identification. In this paper we present strategies to design optimal controls for unknown linear systems directly trough closed-form functions of the data. In particular, when data is sufficiently informative, (i) we find the control input that minimizes a finite-horizon quadratic function of the states and inputs and (ii) we show how these inputs enable the estimate of the static feedback controller that minimizes an infinite-horizon quadratic function, i.e., the Linear Quadratic Regulator. Our formulas are closed-form, making them computationally efficient and of straightforward interpretation.
|
|
10:40-11:00, Paper ThAT04.3 | Add to My Program |
Safe Finite-Time Reinforcement Learning for Pursuit-Evasion Games |
|
Kokolakis, Nick-Marios T. | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Learning, Optimal control, Stability of nonlinear systems
Abstract: In this paper, we develop a safe pursuit-evasion game for enabling finite-time capture and optimal performance. The pursuit-evasion game is formulated as a zero-sum differential game wherein the pursuer seeks to minimize its relative distance to the target while the evader attempts to maximize it. A critic-only reinforcement learning-based algorithm is then proposed for learning online and in finite time the pursuit-evasion policies, thus enabling finite-time capture of the evader. Safety is ensured by means of barrier functions associated with the obstacles, which are integrated into the running cost. Simulation results illustrate the efficacy of the proposed approach.
|
|
11:00-11:20, Paper ThAT04.4 | Add to My Program |
Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes |
|
Zeng, Sihan | Georgia Institute of Technology |
Doan, Thinh T. | Virginia Tech |
Romberg, Justin | Georgia Tech |
Keywords: Machine learning, Optimization, Markov processes
Abstract: We consider a discounted cost constrained Markov decision process (CMDP) policy optimization problem, in which an agent seeks to maximize a discounted cumulative reward subject to a number of constraints on discounted cumulative utilities. To solve this constrained optimization program, we study an online actor-critic variant of a classic primal-dual method where the gradients of both the primal and dual variables are estimated using samples from a single trajectory generated by the underlying time-varying Markov processes. This online primal-dual natural actor-critic algorithm maintains and iteratively updates three variables: a dual variable (or Lagrangian multiplier), a primal variable (or actor), and a critic variable used to estimate the gradients of both primal and dual variables. These variables are updated simultaneously but on different time scales (using different step sizes) and they are all intertwined with each other. Our main contribution is to derive a finite-time analysis for the convergence of this algorithm to the global optimum of a CMDP problem. Specifically, we show that with a proper choice of step sizes the optimality gap and constraint violation converge to zero in expectation at a rate O(1/K^(1/6)), where K is the number of iterations. To our knowledge, this paper is the first to study the finite-time complexity of an online primal-dual actor-critic method for solving a CMDP problem. We also validate the effectiveness of this algorithm through numerical simulations.
|
|
11:20-11:40, Paper ThAT04.5 | Add to My Program |
Data-Driven Optimal Control of Affine Systems: A Linear Programming Perspective |
|
Martinelli, Andrea | ETH Zurich |
Gargiani, Matilde | ETH Zurich |
Draskovic, Marina | ETH Zurich |
Lygeros, John | ETH Zurich |
Keywords: Optimal control
Abstract: In this letter, we discuss the problem of optimal control for affine systems in the context of data-driven linear programming. We first introduce a unified framework for the fixed point characterization of the value function, Q-function and relaxed Bellman operators. Then, in a model-free setting, we show how to synthesize and estimate Bellman inequalities from a small but sufficiently rich dataset. To guarantee exploration richness, we complete the extension of Willems' fundamental lemma to affine systems.
|
|
11:40-12:00, Paper ThAT04.6 | Add to My Program |
On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator |
|
Duan, Jingliang | National University of Singapore |
Cao, Wenhan | Tsinghua University |
Zheng, Yang | University of California San Diego |
Zhao, Lin | National University of Singapore |
Keywords: Learning, Optimization, Linear systems
Abstract: The convergence of policy gradient algorithms in reinforcement learning hinges on the optimization landscape of the underlying optimal control problem. Theoretical insights into these algorithms can often be acquired from analyzing those of linear quadratic control. However, most of the existing literature only considers the optimization landscape for static full-state or output feedback policies (controllers). We investigate the more challenging case of dynamic output-feedback policies for linear quadratic regulation (abbreviated as dLQR), which is prevalent in practice but has a rather complicated optimization landscape. We first show how the dLQR cost varies with the coordinate transformation of the dynamic controller and then derive the optimal transformation for a given observable stabilizing controller. At the core of our results is the uniqueness of the stationary point of dLQR when it is observable, which is in a concise form of an observer-based controller with the optimal similarity transformation. These results shed light on designing efficient algorithms for general decision-making problems with partially observed information.
|
|
ThAT05 Invited Session, Tulum Ballroom E |
Add to My Program |
Learning-Based Control IV: Controller Design and Applications |
|
|
Chair: Muller, Matthias A. | Leibniz University Hannover |
Co-Chair: Zeilinger, Melanie N. | ETH Zurich |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
10:00-10:20, Paper ThAT05.1 | Add to My Program |
On Controller Tuning with Time-Varying Bayesian Optimization |
|
Brunzema, Paul | RWTH Aachen University |
von Rohr, Alexander | RWTH Aachen University, Germany |
Trimpe, Sebastian | RWTH Aachen University |
Keywords: Machine learning, Time-varying systems, Adaptive control
Abstract: Changing conditions or environments can cause system dynamics to vary over time. To ensure optimal control performance, controllers should adapt to these changes. As systems become increasingly complex, first-principal modeling and data-based system identification reach their limits. This motivates approaches such as time-varying Bayesian optimization (TVBO) to tune controllers online by directly modeling the control objective. Many online controller-tuning problems can be characterized by two properties. First, they exhibit incremental and lasting changes of the objective due to changes to the system dynamics through, e.g., wear and tear. Second, the optimization problem is convex in the tuning parameters. Current TVBO methods do not explicitly account for these properties, resulting in poor tuning performance and many unstable controllers through over-exploration of the parameter space. We propose a novel TVBO forgetting strategy using Uncertainty-Injection (UI), thereby retaining relevant information about the control objective over time. The control objective is modeled as a spatio-temporal Gaussian process (GP) with UI through a Wiener process in the temporal domain. Further, we explicitly model convex functions through GP models with linear inequality constraints. In numerical experiments, we show that our model outperforms the state-of-the-art method in TVBO, exhibiting reduced regret and less unstable parameter configurations.
|
|
10:20-10:40, Paper ThAT05.2 | Add to My Program |
Neural Energy Casimir Control for Port-Hamiltonian Systems (I) |
|
Xu, Liang | École Polytechnique Fédérale De Lausanne |
Zakwan, Muhammad | EPFL |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Nonlinear systems, Neural networks, Learning
Abstract: The energy Casimir method is an effective controller design approach to stabilize port-Hamiltonian systems at a desired equilibrium. However, its application relies on the availability of suitable Casimir and Lyapunov functions, whose computation are generally intractable. In this paper, we propose a neural network-based framework to learn these functions. We show how to achieve equilibrium assignment by adding suitable regularization terms in the training cost. We also propose a parameterization of Casimir functions for reducing the training complexity. Moreover, the distance between the equilibrium of the learned Lyapunov function and the desired equilibrium is analyzed, which indicates that for small suboptimality gaps, the distance decreases linearly with respect to the training loss. Our methods are backed up by simulations on a pendulum system.
|
|
10:40-11:00, Paper ThAT05.3 | Add to My Program |
Structured-Policy Q-Learning: An LMI-Based Design Strategy for Distributed Reinforcement Learning (I) |
|
Sforni, Lorenzo | Alma Mater Studiorum - Università Di Bologna |
Camisa, Andrea | University of Bologna |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Distributed control, Optimal control, Machine learning
Abstract: In this paper, we consider a Linear Quadratic optimal control problem with the assumptions that the system dynamics is unknown and that the designed feedback control has to comply with a desired sparsity pattern. An important application where this set-up arises is distributed control of network systems, where the aim is to find an optimal sparse controller matching the communication graph. To tackle the problem, we propose a Reinforcement Learning framework based on a Q-learning scheme preserving a desired policy structure. At each time step the performance of the current candidate feedback is first evaluated through the computation of its Q-function, and then a new sparse feedback matrix, improving on the previous one, is computed. We prove that the scheme produces at each iteration a stabilizing feedback control with the desired sparsity and with non-increasing cost, which in turns indicates that every limit point of the computed feedback matrices is sparse and stabilizing. The algorithm is numerically tested on a distributed control scenario with randomly generated graph and unstable dynamics.
|
|
11:00-11:20, Paper ThAT05.4 | Add to My Program |
Convex Analytic Theory for Convex Q-Learning (I) |
|
Lu, Fan | University of Florida |
Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Meyn, Sean P. | Univ. of Florida |
Neu, Gergely | INRIA Lille -- Nord Europe |
Keywords: Optimal control, Machine learning
Abstract: In recent years there has been a collective research effort to find new formulations of reinforcement learning that are simultaneously more efficient and more amenable to analysis. This paper concerns one approach that builds on the linear programming (LP) formulation of optimal control of Manne. A primal version is called logistic Q-learning, and a dual variant is convex Q-learning. This paper focuses on the latter, while building bridges with the former. The main contributions follow: (i) The dual of convex Q-learning is not precisely Manne's LP or a version of logistic Q-learning, but has similar structure that reveals the need for regularization to avoid over-fitting. (ii) A sufficient condition is obtained for a bounded solution to the Q-learning LP. (iii) Simulation studies reveal numerical challenges when addressing sampled-data systems based on a continuous time model. The challenge is addressed using state-dependent sampling. The theory is illustrated with applications to examples from OpenAI gym. It is shown that convex Q-learning is successful in cases where standard Q-learning diverges, such as the LQR problem.
|
|
11:20-11:40, Paper ThAT05.5 | Add to My Program |
Multi-Agent Deep Reinforcement Learning for Shock Wave Detection and Dissipation Using Vehicle-To-Vehicle Communication (I) |
|
Suriyarachchi, Nilesh | University of Maryland |
Noorani, Erfaun | University of Maryland College Park |
Tariq, Faizan M. | University of Maryland |
Baras, John S. | University of Maryland |
Keywords: Traffic control, Iterative learning control, Autonomous systems
Abstract: Traffic shock waves are a commonly occurring phenomena caused by the delays in reaction times of Human Driven Vehicles (HDVs) resulting in unnecessary congestion in highway networks. Application of a suitable moving bottleneck control using Connected Autonomous Vehicles (CAVs) can result in shock wave mitigation and smoothing of the traffic flow. This traffic control scheme is dependent on accurately predicting shock wave conditions while choosing the best control to apply for the observation available to the CAV. In this work, we propose the use of a multi-agent shared policy reinforcement learning algorithm which leverages communication between CAVs for improved observability of downstream traffic conditions. A key feature of this method is the ability to perform shock wave dissipation control without the need for global information and the applicability of this method to multi-lane mixed traffic highways of arbitrary structure. We use the shared-parameter Proximal Policy Optimization (PPO) reinforcement learning strategy for obtaining the controls for each CAV in the simulation. We also built a custom SUMO-Gym wrapper for the multi-lane highway simulation with custom designed observation space, action space and rewards for each agent. The shock wave dissipation efficiency is evaluated on a three lane circular highway loop using realistic traffic simulation software and low CAV penetration levels.
|
|
11:40-12:00, Paper ThAT05.6 | Add to My Program |
Physically Consistent Learning of Conservative Lagrangian Systems with Gaussian Processes (I) |
|
Evangelisti, Giulio | Technical University of Munich |
Hirche, Sandra | Technische Universität München |
Keywords: Identification for control, Statistical learning, Uncertain systems
Abstract: This paper proposes a physically consistent Gaussian Process (GP) enabling the data-driven modelling of uncertain Lagrangian systems. The function space is tailored according to the energy components of the Lagrangian and the differential equation structure, analytically guaranteeing properties such as energy conservation and quadratic form. The novel formulation of Cholesky decomposed matrix kernels allow the probabilistic preservation of positive definiteness. Only differential input-to-output measurements of the function map are required while Gaussian noise is permitted in torques, velocities, and accelerations. We demonstrate the effectiveness of the approach in numerical simulation.
|
|
ThAT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Linear Estimation |
|
|
Chair: Yong, Sze Zheng | Northeastern University |
Co-Chair: Chong, Edwin K. P. | Colorado State University |
|
10:00-10:20, Paper ThAT06.1 | Add to My Program |
On Distributed Robust Optimal Filter Design with Bounded-Power Disturbances and White Gaussian Noises |
|
Sun, Hongda | Zhejiang University of Technology |
Li, Jiahao | ZheJiang University of Techology |
Feng, Yu | Zhejiang University of Technology |
Keywords: Estimation, Linear systems, Optimization
Abstract: This paper is concerned with the distributed robust optimal filtering problem for discrete-time systems subject to both bounded-power disturbances and white Gaussian noises. Relied on the system level synthesis, an upper bound of the estimation performance is characterized in terms of system responses of the error dynamics and parameters of the disturbances and noises. Based on such characterization, a numerically tractable algorithm is presented for distributed multi-objective filter design. Moreover, the relative estimation performance loss incurred by imperfect modeling uncertainty and the finite truncation is explicitly established. A numerical example is also included to show the effectiveness of the current results.
|
|
10:20-10:40, Paper ThAT06.2 | Add to My Program |
New Bounds for State Transition Matrices |
|
Mazenc, Frederic | Inria Saclay |
Malisoff, Michael | Louisiana State University |
Keywords: Estimation, Linear systems
Abstract: We address the problem of constructing matrix valued interval observers for estimating state transition matrices for time-varying systems. We provide less conservative estimators than those in recent literature. We cover continuous- and discrete-time linear systems, under Metzler or nonnegativity conditions on the coefficient matrices. We show how to satisfy our Metzler conditions after simple changes of coordinates. We illustrate our method using a feedback stabilized underwater marine robotic dynamics with unknown control gains.
|
|
10:40-11:00, Paper ThAT06.3 | Add to My Program |
Data-Driven Sensitivity Analysis of Controllability Measure for Network Systems |
|
Banno, Ikumi | Nagoya University |
Azuma, Shun-ichi | Kyoto University |
Ariizumi, Ryo | Nagoya University |
Asai, Toru | Nagoya University |
Imura, Jun-ichi | Tokyo Institute of Technology |
Keywords: Estimation, Network analysis and control, Linear systems
Abstract: Optimization of controllability, i.e., to optimize the degree of the controllability using the flexibility of a system such as choosing input nodes and designing network structure, is growing in importance to network systems. For example, if a network system has strong controllability for a certain control input, we know that the input is effective for controlling the system even under a limited energy condition. If a mathematical model of the system is available, the controllability can be optimized by a model-based method. However, we often face the difficulty to incorporate useful prior knowledge of the system. In such a case, a data-driven approach is promising. In this paper, as the initial step toward this direction, we address the problem of determining the sensitivity of a controllability measure with respect to an edge weight of a network system in a data-driven manner. By characterizing the sensitivity by two Lyapunov equations, we clarify that the sensitivity is derived from the so-called data-driven Lyapunov equations. Moreover, this result is extended to the case of high-order sensitivity.
|
|
11:00-11:20, Paper ThAT06.4 | Add to My Program |
Primal-Dual Estimator Learning Method with Feasibility and Near-Optimality Guarantees |
|
Cao, Wenhan | Tsinghua University |
Duan, Jingliang | National University of Singapore |
Li, Shengbo Eben | Tsinghua University |
Chen, Chen | Tsinghua University |
Liu, Chang | Cornell University |
Wang, Yu | Tsinghua University |
Keywords: Estimation, Statistical learning, Linear systems
Abstract: This paper proposes a primal-dual framework to learn a stable estimator for linear constrained estimation problems leveraging the moving horizon approach. To avoid the online computational burden in most existing methods, we learn a parameterized function offline to approximate the primal estimate. Meanwhile, a dual estimator is trained to check the suboptimality of the primal estimator during execution time. Both the primal and dual estimators are learned from data using supervised learning techniques, and the explicit sample size is provided, which enables us to guarantee the quality of each learned estimator in terms of feasibility and optimality. This in turn allows us to bound the probability of the learned estimator being infeasible or suboptimal. Furthermore, we analyze the stability of the resulting estimator with a bounded error in the minimization of the cost function. Since our algorithm does not require the solution of an optimization problem during runtime, state estimates can be generated online almost instantly. Simulation results are presented to show the accuracy and time efficiency of the proposed framework compared to online optimization of moving horizon estimation and Kalman filter. To the best of our knowledge, this is the first learning-based state estimator with feasibility and near-optimality guarantees for linear constrained systems.
|
|
11:20-11:40, Paper ThAT06.5 | Add to My Program |
Well-Conditioned Linear Minimum Mean Square Error Estimation |
|
Chong, Edwin K. P. | Colorado State University |
Keywords: Estimation, Computational methods, Filtering
Abstract: Linear minimum mean square error (LMMSE) estimation is often ill-conditioned, suggesting that unconstrained minimization of the mean square error is an inadequate approach to filter design. To address this, we first develop a unifying framework for studying constrained LMMSE estimation problems. Using this framework, we explore an important structural property of constrained LMMSE filters involving a certain prefilter. Optimality is invariant under invertible linear transformations of the prefilter. This parameterizes all optimal filters by equivalence classes of prefilters. We then clarify that merely constraining the rank of the filter does not suitably address the problem of ill-conditioning. Instead, we adopt a constraint that explicitly requires solutions to be well-conditioned in a certain specific sense. We introduce two well-conditioned filters and show that they converge to the unconstrained LMMSE filter as their truncation-power loss goes to zero, at the same rate as the low-rank Wiener filter. We also show extensions to the case of weighted trace and determinant of the error covariance as objective functions. Finally, our quantitative results with historical VIX data demonstrate that our two well-conditioned filters have stable performance while the standard LMMSE filter deteriorates with increasing condition number.
|
|
11:40-12:00, Paper ThAT06.6 | Add to My Program |
System-Level Recurrent State Estimators for Affine Systems Subject to Data Losses Modeled by Automata |
|
Hassaan, Syed, M. | Arizona State University |
Yong, Sze Zheng | Northeastern University |
Keywords: Observers for Linear systems, Estimation, Automata
Abstract: This paper proposes a robust output feedback state estimator for uncertain/bounded-error affine systems subject to data losses modeled by an automaton. Specifically, by introducing a novel property known as recurrent recovery, where the estimation errors are required to be recurrent to some minimum recovery levels at each node of the data loss automata, we design a robust estimator design that guarantees that the state estimation errors remain bounded in a recurrent manner despite worst-case realizations of process and sensor noise/uncertainties in addition to missing data. Our design can directly deal with infinite-horizon missing data specifications modeled by automata by recasting the problem into multiple finite-horizon problems of varying lengths, which results in an optimization-based approach with only a finite number of constraints. Moreover, our design is built upon system-level parameterization and for this purpose, we propose a novel affine output feedback strategy that also contributes to the literature of finite-horizon optimal control.
|
|
ThAT07 Invited Session, Tulum Ballroom G |
Add to My Program |
Multi-Agent Optimization and Games |
|
|
Chair: Franci, Barbara | Maastricht University |
Co-Chair: Herty, Michael | RWTH Aachen University |
Organizer: Grammatico, Sergio | Delft Univ. of Tech |
Organizer: Bianchi, Mattia | Delft University of Technology |
Organizer: Staudigl, Mathias | Maastricht University |
Organizer: Franci, Barbara | Maastricht University |
|
10:00-10:20, Paper ThAT07.1 | Add to My Program |
Mini-Batch Stochastic Three-Operator Splitting for Distributed Optimization |
|
Franci, Barbara | Maastricht University |
Staudigl, Mathias | Maastricht University |
Keywords: Stochastic systems, Optimization
Abstract: We consider a network of agents, each with its own private cost consisting of a sum of two possibly nonsmooth convex functions, one of which is composed with a linear operator. At every iteration each agent performs local calculations and can only communicate with its neighbors. The challenging aspect of our study is that the smooth part of the private cost function is given as an expected value and agents only have access to this part of the problem formulation via a heavy-tailed stochastic oracle. To tackle such sampling-based optimization problems, we propose a stochastic extension of the triangular pre-conditioned primal-dual algorithm. We demonstrate almost sure convergence of the scheme and validate the performance of the method via numerical experiments.
|
|
10:20-10:40, Paper ThAT07.2 | Add to My Program |
A Consensus-Based Algorithm for Multi-Objective Optimization and Its Mean-Field Description (I) |
|
Borghi, Giacomo | RWTH Aachen University |
Herty, Michael | RWTH Aachen University |
Pareschi, Lorenzo | University of Ferrara |
Keywords: Optimization algorithms, Agents-based systems, Stochastic systems
Abstract: We present a multi-agent algorithm for multi-objective optimization problems, which extends the class of consensus-based optimization methods and relies on a scalarization strategy. The optimization is achieved by a set of interacting agents exploring the search space and attempting to solve all scalar sub-problems simultaneously. We show that those dynamics are described by a mean-field model, which is suitable for a theoretical analysis of the algorithm convergence. Numerical results show the validity of the proposed method.
|
|
10:40-11:00, Paper ThAT07.3 | Add to My Program |
Proximal-Like Algorithms for Equilibrium Seeking in Mixed-Integer Nash Equilibrium Problems (I) |
|
Fabiani, Filippo | IMT School for Advanced Studies Lucca |
Franci, Barbara | Maastricht University |
Sagratella, Simone | Sapienza University of Rome |
Schmidt, Martin | Trier University |
Staudigl, Mathias | Maastricht University |
Keywords: Game theory, Optimization algorithms, Agents-based systems
Abstract: We consider potential games with mixed-integer variables, for which we propose two distributed, proximal-like equilibrium seeking algorithms. Specifically, we focus on two scenarios: i) the underlying game is generalized ordinal and the agents update through iterations by choosing an exact optimal strategy; ii) the game admits an exact potential and the agents adopt approximated optimal responses. By exploiting the properties of integer-compatible regularization functions used as penalty terms, we show that both algorithms converge to either an exact or an epsilon-approximate equilibrium. We corroborate our findings on a numerical instance of a Cournot oligopoly model.
|
|
11:00-11:20, Paper ThAT07.4 | Add to My Program |
A Privacy-Preserving Decentralized Algorithm for Distribution Locational Marginal Prices (I) |
|
Bilenne, Olivier | École Des Ponts ParisTech |
Franci, Barbara | Maastricht University |
Jacquot, Paulin | EDF |
Oudjane, Nadia | EDF |
Staudigl, Mathias | Maastricht University |
Wan, Cheng | EDF Lab |
Keywords: Optimization algorithms, Decentralized control, Smart grid
Abstract: A major challenge in today’s electricity system is the management of flexibilities offered by new usages, such as smart home appliances or electric vehicles. By incentivizing energy consumption profiles of individuals, demand response seeks to adjust the power demand to the supply, for increased grid stability and better integration of renewable energies. This optimization of flexibility is typically managed by Load Aggregators, independent entities which aggregate and optimize numerous flexibility providers. The consideration of the underlying distribution network constraints, which couple the different actors, leads to a complex multi-agent problem. To address it, we propose a new decentralized algorithm that solves a convex relaxation of the classical Alternative Current Optimal Power Flow (ACOPF) problem, and which relies on local information only. Each computational step is performed in a privacy-preserving manner, and system-wide coordination is achieved via node-specific distribution locational marginal prices (DLMPs). We demonstrate the efficiency of our approach on a 15-bus radial distribution network.
|
|
11:20-11:40, Paper ThAT07.5 | Add to My Program |
Local Stochastic Factored Gradient Descent for Distributed Quantum State Tomography |
|
Kim, Junhyung Lyle | Rice University |
Toghani, Mohammad Taha | Rice University |
Uribe, Cesar A. | Rice University |
Kyrillidis, Anastasios | Rice University |
Keywords: Optimization algorithms, Quantum information and control, Machine learning
Abstract: We propose a distributed Quantum State Tomography (QST) protocol, named Local Stochastic Factored Gradient Descent (Local SFGD), to learn the low-rank factor of a density matrix over a set of local machines. QST is the canonical procedure to characterize the state of a quantum system, which we formulate as a stochastic non-convex smooth optimization problem. Physically, the estimation of a low-rank density matrix helps characterizing the amount of noise introduced by quantum computation. Theoretically, we prove the local convergence of Local SFGD for a general class of restricted strongly convex/smooth loss functions. Local SFGD converges locally to a small neighborhood of the global optimum at a linear rate with a constant step size, while it locally converges exactly at a sub-linear rate with diminishing step sizes. With a proper initialization, local convergence results imply global convergence. We validate our theoretical findings with numerical simulations of QST on the Greenberger-Horne-Zeilinger (GHZ) state.
|
|
11:40-12:00, Paper ThAT07.6 | Add to My Program |
Optimal Communication and Control Strategies for a Multi-Agent System in the Presence of an Adversary |
|
Kartik, Dhruva | University of Southern California |
Sudhakara, Sagar | University of Southern California |
Jain, Rahul | University of Southern California |
Nayyar, Ashutosh | University of Southern California |
Keywords: Stochastic optimal control, Game theory, Control over communications
Abstract: We consider a multi-agent system in which a decentralized team of agents controls a stochastic system in the presence of an adversary. Instead of committing to a fixed information sharing protocol, the agents can strategically decide at each time whether to share their private information with each other or not. The agents incur a cost whenever they communicate with each other and the adversary may eavesdrop on their communication. Thus, the agents in the team must effectively coordinate with each other while being robust to the adversary's malicious actions. We model this interaction between the team and the adversary as a stochastic zero-sum game where the team aims to minimize a cost while the adversary aims to maximize it. Under some assumptions on the adversary's capabilities, we characterize a min-max control and communication strategy for the team. We supplement this characterization with several structural results that can make the computation of the min-max strategy more tractable.
|
|
ThAT08 Invited Session, Tulum Ballroom H |
Add to My Program |
Robust Distributed Optimization, Estimation, and Coordination in
Multi-Agent Systems |
|
|
Chair: Nedich, Angelia | Arizona State University |
Co-Chair: Srivastava, Priyank | Massachusetts Institute of Technology |
Organizer: Nedich, Angelia | Arizona State University |
Organizer: Gil, Stephanie | Harvard University |
Organizer: Yemini, Michal | Princeton University |
Organizer: Goldsmith, Andrea | Stanford University |
|
10:00-10:20, Paper ThAT08.1 | Add to My Program |
Learning Constant-Gain Stabilizing Controllers for Frequency Regulation under Variable Inertia |
|
Srivastava, Priyank | Massachusetts Institute of Technology |
Hidalgo-Gonzalez, Patricia | University of California, San Diego |
Cortes, Jorge | University of California, San Diego |
Keywords: Power systems, Switched systems, Networked control systems
Abstract: Declines in cost and concerns about the environmental impact of traditional generation have boosted the penetration of renewables and non-conventional distributed energy resources into the power grid. The intermittent availability of these resources causes the inertia of the power system to vary over time. As a result, there is a need to go beyond traditional controllers designed to regulate frequency under the assumption of invariant dynamics. This paper presents a learning-based framework for the design of stable controllers based on imitating datasets obtained from linear-quadratic regulator (LQR) formulations for different switching sequences of inertia modes. The proposed controller is linear with a constant feedback-gain, thereby interpretable, does not require the knowledge of the current operating mode, and is guaranteed to stabilize the switching power dynamics. We show that it is always possible to stabilize the switched system using a communication-free local controller whose implementation only requires each node to use its own state. We illustrate our results on a 12-bus 3-region network.
|
|
10:20-10:40, Paper ThAT08.2 | Add to My Program |
Distributed Optimization for Rank-Constrained Semidefinite Programs |
|
Chaoying, Pei | Purdue University |
You, Sixiong | Purdue University |
Sun, Chuangchuang | Massachusetts Institute of Technology |
Dai, Ran | Purdue University |
Keywords: Optimization, Distributed control, Optimization algorithms
Abstract: This paper develops a distributed optimization framework for solving the rank-constrained semidefinite programs (RCSPs). Since the rank constraint is non-convex and discontinuous, solving an optimization problem with rank constraints is NP-hard and notoriously time-consuming, especially for large-scale RCSPs. In the proposed approach, by decomposing an unknown matrix into a set of submatrices with much smaller sizes, the rank constraint on the original matrix is equivalently transformed into a set of constraints on the decomposed submatrices. The distributed framework allows parallel computation of subproblems while requiring coordination among them to satisfy the coupled constraints. As the scale of every subproblem solved independently is significantly reduced, the decomposition scheme and the distributed framework can be applied to large-scale RCSPs. Moreover, optimality conditions of the proposed distributed optimization algorithm for RCSPs at the converged point are analyzed. Finally, the efficiency and effectiveness of the proposed method are demonstrated via simulation examples for solving the image denoising problem.
|
|
10:40-11:00, Paper ThAT08.3 | Add to My Program |
PARS-Push: Personalized, Asynchronous and Robust Decentralized Optimization |
|
Toghani, Mohammad Taha | Rice University |
Lee, Soomin | Yahoo Research |
Uribe, Cesar A. | Rice University |
Keywords: Machine learning, Optimization algorithms, Sensor networks
Abstract: We study the multi-step Model-Agnostic Meta-Learning (MAML) framework where a group of n agents seeks to find a common point that enables ``few-shot'' learning (personalization) via local stochastic gradient steps on their local functions. We formulate the personalized optimization problem under the MAML framework and propose PARS-Push, a decentralized asynchronous algorithm robust to message failures, communication delays, and directed message sharing. We characterize the convergence rate of PARS-Push under arbitrary multi-step personalization for smooth strongly convex, and smooth non-convex functions. Moreover, we provide numerical experiments showing its performance under heterogeneous data setups.
|
|
11:00-11:20, Paper ThAT08.4 | Add to My Program |
Distributed Statistical Min-Max Learning in the Presence of Byzantine Agents (I) |
|
Adibi, Arman | University of Pennsylvania |
Mitra, Aritra | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Hassani, Hamed | University of Pennsylvania |
Keywords: Statistical learning, Large-scale systems, Optimization algorithms
Abstract: Recent years have witnessed a growing interest in the topic of min-max optimization, owing to its relevance in the context of generative adversarial networks (GANs), robust control and optimization, and reinforcement learning. Motivated by this line of work, we consider a multi-agent min-max learning problem, and focus on the emerging challenge of contending with worst-case Byzantine adversarial agents in such a setup. By drawing on recent results from robust statistics, we design a robust distributed variant of the extra-gradient algorithm - a popular algorithmic approach for min-max optimization. Our main contribution is to provide a crisp analysis of the proposed robust extra-gradient algorithm for smooth convex-concave and smooth strongly convex-strongly concave functions. Specifically, we establish statistical rates of convergence to approximate saddle points. Our rates are near-optimal, and reveal both the effect of adversarial corruption and the benefit of collaboration among the non-faulty agents. Notably, this is the first paper to provide formal theoretical guarantees for large-scale distributed min-max learning in the presence of adversarial agents.
|
|
11:20-11:40, Paper ThAT08.5 | Add to My Program |
Resilience to Malicious Activity in Distributed Optimization for Cyberphysical Systems (I) |
|
Yemini, Michal | Princeton University |
Nedich, Angelia | Arizona State University |
Gil, Stephanie | Harvard University |
Goldsmith, Andrea | Stanford University |
Keywords: Resilient Control Systems, Distributed control, Cyber-Physical Security
Abstract: Enhancing resilience in distributed networks in the face of malicious agents is an important problem for which many key theoretical results and applications require further development and characterization. This work develops a new algorithmic and analytical framework for achieving resilience to malicious agents in distributed optimization problems where a legitimate agent's dynamic is influenced by the values it receives from neighboring agents and its own self-serving target function. We show that by utilizing stochastic values of trust between agents it is possible to recover convergence to the system's global optimal point even in the presence of malicious agents. Additionally, we provide expected convergence rate guarantees in the form of an upper bound on the expected squared distance to the optimal value. Finally, we present numerical results that validate the analytical convergence guarantees we present in this paper even when the malicious agents are the majority of agents in the network.
|
|
11:40-12:00, Paper ThAT08.6 | Add to My Program |
Robust Online and Distributed Mean Estimation under Adversarial Data Corruption (I) |
|
Yao, Tong | Purdue University |
Sundaram, Shreyas | Purdue University |
Keywords: Estimation, Statistical learning, Resilient Control Systems
Abstract: We study robust mean estimation in an online and distributed scenario in the presence of adversarial data attacks. At each time step, each agent in a network receives a potentially corrupted data point, where the data points were originally independent and identically distributed samples of a random variable. We propose online and distributed algorithms for all agents to asymptotically estimate the mean. We provide the error-bound and the convergence properties of the estimates to the true mean under our algorithms. Based on the network topology, we further evaluate each agent's trade-off in convergence rate between incorporating data from neighbors and learning with only local observations.
|
|
ThAT09 Regular Session, Maya Ballroom I |
Add to My Program |
Linear Systems I |
|
|
Chair: Baggio, Giacomo | University of Padova, Italy |
Co-Chair: Zampieri, Sandro | Univ. Di Padova |
|
10:00-10:20, Paper ThAT09.1 | Add to My Program |
Sample-Based Observability of Linear Discrete-Time Systems |
|
Krauss, Isabelle | Leibniz University Hannover |
Lopez, Victor G. | Leibniz University Hannover |
Muller, Matthias A. | Leibniz University Hannover |
Keywords: Linear systems, Observers for Linear systems
Abstract: In this work, sample-based observability of linear discrete-time systems is studied. That is, we consider the case where the system output measurements are not available at every time instance. It is shown that some discrete-time systems exhibit particular behaviors that lead to pathological sampling. Depending on the characteristics of the system, different sampling schemes are developed that allow the system state to be reconstructed.
|
|
10:20-10:40, Paper ThAT09.2 | Add to My Program |
Dynamic Control Allocation for Sampled-Data Closed-Loop Systems |
|
Galeani, Sergio | Università Di Roma Tor Vergata |
Masocco, Roberto | University of Rome "Tor Vergata" |
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Linear systems, Sampled-data control, Optimization
Abstract: In this paper the problem of dynamic control allocation is studied in the context of a continuous-time plant regulated by means of a digital controller, comprising sampling and holding blocks. The proposed architecture consists of an annihilator unit designed on the basis of the sampled-data equivalent description of the plant and of a discrete-time steady-state generator/optimizer driven by the exogenous reference signal. By relying on the feed-forward nature of the design, it is shown that the allocator unit yields the optimal solution in finite time, differently from existing solutions purely in the context of continuous-time plants.
|
|
10:40-11:00, Paper ThAT09.3 | Add to My Program |
LQR Design for Discrete-Time Positive Systems: A First-Order Method |
|
Yang, Nachuan | Hong Kong University of Science and Technology |
Tang, Jiawei | Hong Kong University of Science and Technology |
Li, Yuzhe | Northeastern University |
Shi, Ling | Hong Kong University of Science and Technology |
Keywords: Linear systems, Compartmental and Positive systems
Abstract: This paper fills the literature gap by considering the LQR design for discrete-time positive linear systems, which remains an open problem due to its nonconvexity. We propose a first-order method to solve this problem where the nonconvexity inherited from positivity and stability constraints was tackled by utilizing the special property of positive linear systems. More specifically, two Lyapunov-based methods were proposed to compute the L-2 and L-infinity projections for single-input and multi-input systems, respectively. Based on the projection theorems, a novel projected gradient descent algorithm was developed for computation and a successively improved sequence of suboptimal solutions is obtained. Finally, two numerical examples are provided to verify the effectiveness of our proposed results. The proposed first-order method is likely to be extended to synthesize more general constrained systems.
|
|
11:00-11:20, Paper ThAT09.4 | Add to My Program |
Transient Performance of Linear Systems through Symmetric Polynomials |
|
Taghavian, Hamed | KTH Royal Institute of Technology |
Johansson, Mikael | KTH - Royal Institute of Technology |
Keywords: Linear systems, Optimization, Optimal control
Abstract: Several time-domain constraints for linear systems are characterized using a new representation of the impulse response based on symmetric polynomials. This includes the induced ∞-norm, the peak response and external positivity – performance indices that are complicated to impose in existing LMI frameworks. Each index is described by a convex constraint, explicit in terms of the system poles. Despite their simplicity, numerical examples suggest that these constraints can provide relatively tight performance bounds.
|
|
11:20-11:40, Paper ThAT09.5 | Add to My Program |
Reachable Volume of Large-Scale Linear Network Systems: The Single-Input Case |
|
Baggio, Giacomo | University of Padova, Italy |
Zampieri, Sandro | Univ. Di Padova |
Keywords: Linear systems, Large-scale systems, Network analysis and control
Abstract: The volume of the set of states of a linear system that can be reached using control inputs with bounded energy depends on the determinant of the controllability Gramian. As such, the latter represents a meaningful metric to quantify the degree of controllability of a linear system. In this paper, we conduct a detailed study of this controllability metric for the case of linear network systems controlled by a single node. Our analysis builds on a closed-form expression which uncovers the relation between the determinant of the controllability Gramian and the spectral properties of the underlying system. In particular, we explore the dependence of this metric on the network dimension and provide a characterization of its scaling behavior in terms of exact exponential decay rates. We illustrate and complement our findings with a set of numerical examples.
|
|
11:40-12:00, Paper ThAT09.6 | Add to My Program |
Equivalence between Different Stability Definitions for 2D Linear Discrete Roesser Models |
|
Bachelier, Olivier | Université De Poitiers |
Cluzeau, Thomas | Universite' De Limoges |
Rigaud, Alexandre | University of Poitiers |
Francisco, Silva | Universite De Limoges |
Yeganefar, Nima | LIAS |
Keywords: Linear systems, Stability of linear systems, Iterative learning control
Abstract: In this paper, we review different stability definitions that have been used in the literature concerning 2D Roesser models: structural stability, asymptotic stability, and exponential stability. We clarify the relations between all these definitions: when the considered Roesser model is linear, structural stability and exponential stability are equivalent and both of them imply asymptotic stability. However asymptotic stability does not imply exponential or structural stability. The proof is based on previous results obtained for Fornasini-Marchesini models and requires careful switching between both models.
|
|
ThAT10 Regular Session, Maya Ballroom II |
Add to My Program |
Discrete-Event Systems |
|
|
Chair: Zhang, Kuize | University of Surrey |
Co-Chair: Takai, Shigemasa | Osaka Univ |
|
10:00-10:20, Paper ThAT10.1 | Add to My Program |
Target Control of Boolean Networks with Permanent Edgetic Perturbations |
|
Zeyen, Olivier Georges Rémy | University of Luxembourg |
Pang, Jun | University of Luxembourg |
Keywords: Boolean control networks and logic networks, Genetic regulatory systems
Abstract: Boolean network is a popular and well-established modelling framework for gene regulatory networks. The steady- state behaviour of Boolean networks can be described as attractors, which are hypothesised to characterise cellular phenotypes. In this work, we study the target control problem of Boolean networks, which has important applications for cellular reprogramming. More specifically, we want to reduce the total number of attractors of a Boolean network to a single target attractor. Different from existing approaches to solving control problems of Boolean networks with node perturbations, we aim to develop an approach utilising edgetic perturbations. Namely, our objective is to modify the update functions of a Boolean network such that there remains only one attractor. The design of our approach is inspired by Thomas’ first rule, and we primarily focus on the removal of cycles in the interaction graph of a Boolean network. We further use results in the literature to only remove positive cycles which are responsible for the appearance of multiple attractors. We apply our solution to a number of real-life biological networks modelled as Boolean networks, and the experimental results demonstrate its efficacy and efficiency.
|
|
10:20-10:40, Paper ThAT10.2 | Add to My Program |
A General Intersection-Based Architecture for Decentralized Supervisory Control of Discrete Event Systems |
|
Hayano, Akihito | Osaka University |
Takai, Shigemasa | Osaka Univ |
Keywords: Discrete event systems, Supervisory control
Abstract: A certain intersection-based architecture for decentralized supervisory control of discrete event systems was proposed in the literature. This existing intersection-based architecture can be regarded as an anti-permissive one. In this paper, we propose a dual architecture, named the permissive intersection-based architecture, and develop a general architecture by combining the anti-permissive and permissive ones. Then, we define a general notion of SEI-coobservability as a part of necessary and sufficient conditions under which the specification is achieved in the developed general architecture and show how to verify it.
|
|
10:40-11:00, Paper ThAT10.3 | Add to My Program |
Verification of Strong K-Step Opacity for Discrete-Event Systems |
|
Han, Xiaoguang | Tianjin University of Science and Technology |
Zhang, Kuize | University of Surrey |
Li, Zhiwu | Xidian University |
Keywords: Automata, Discrete event systems
Abstract: In this paper, we revisit the verification of strong K-step opacity (K-SSO) for partially-observed discrete-event systems modeled as nondeterministic finite-state automata. As a stronger version of the standard K-step opacity, K-SSO requires that an intruder cannot make sure whether or not a secret state has been visited within the last K observable steps. To efficiently verify K-SSO, we propose a new concurrent-composition structure, which is a variant of our previously-proposed one. Based on this new structure, we design an algorithm for deciding K-SSO and prove that the proposed algorithm not only reduces the time complexity of the existing algorithms, but also does not depend on the value of K. Furthermore, a new upper bound on the value of K in K-SSO is derived, which also reduces the existing upper bound on K in the literature. Finally, we illustrate the proposed algorithm by a simple example.
|
|
11:00-11:20, Paper ThAT10.4 | Add to My Program |
Fault Diagnosis of Discrete-Event Systems under Non-Deterministic Observations with Output Fairness |
|
Dong, Weijie | Shanghai Jiao Tong University |
Gao, Shang | Shanghai Jiao Tong Uni |
Yin, Xiang | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Automata, Supervisory control
Abstract: In this paper, we revisit the fault diagnosis problem of discrete-event systems (DES) under non-deterministic observations. Non-deterministic observation is a general observation model that includes the case of intermittent loss of observations. In this setting, upon the occurrence of an event, the sensor reading may be non-deterministic such that a set of output symbols are all possible. Existing works on fault diagnosis under non-deterministic observations require to consider all possible observation realizations. However, this approach includes the case where some possible outputs are permanently disabled. In this work, we introduce the concept of output fairness by requiring that, for any output symbols, if it has infinite chances to be generated, then it will indeed be generated infinite number of times. We use an assume-guarantee type of linear temporal logic formulas to formally describe this assumption. A new notion called output-fair diagnosability (OF-diagnosability) is proposed. An effective approach is provided for the verification of OF-diagnosability. We show that the proposed notion of OF-diagnosability is weaker than the standard definition of diagnosability under non-deterministic observations, and it better captures the physical scenario of observation non-determinism or intermittent loss of observations.
|
|
11:20-11:40, Paper ThAT10.5 | Add to My Program |
About Existence of Resilient Supervisors against Smart Sensor Attacks |
|
Su, Rong | Nanyang Technological University |
Keywords: Supervisory control, Resilient Control Systems
Abstract: One key challenge of cybersecurity of discrete event systems (DES) is how to ensure system resilience against sensor and/or actuator attacks, which may tamper data integrity and service availability. In this paper we discuss decidability issues related to smart sensor attacks. We first present a sufficient and necessary condition that ensures the existence of a smart sensor attack, which reveals a novel demand-supply relationship between an attacker and a controlled plant, represented as a set of risky pairs. Each risky pair consists of a damage string desired by the attacker and an observable sequence feasible in the supervisor such that the latter induces a sequence of control patterns, which allows the damage string to happen. It turns out that each risky pair can induce a smart weak sensor attack. Next, we show that, when the plant, supervisor and damage language are regular, it is possible to remove all such risky pairs from the plant behaviour, via a genuine encoding scheme, upon which we establish our key result that the existence of a nonblocking supervisor resilient against all smart sensor attacks is decidable.
|
|
11:40-12:00, Paper ThAT10.6 | Add to My Program |
On the Convergence of the Backward Reachable Sets of Robust Controlled Invariant Sets for Discrete-Time Linear Systems |
|
Liu, Zexiang | University of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Formal Verification/Synthesis, Predictive control for linear systems, Constrained control
Abstract: This paper considers discrete-time linear systems with bounded additive disturbances, and studies the convergence properties of the backward reachable sets of robust controlled invariant sets (RCIS). Under a simple condition, we prove that the backward reachable sets of an RCIS are guaranteed to converge to the maximal RCIS in Hausdorff distance, with an exponential convergence rate. When all sets are represented by polytopes, this condition can be checked numerically via a linear program. We discuss how the developed condition generalizes the existing conditions in the literature for (controlled) invariant sets of systems without disturbances (or without control inputs).
|
|
ThAT11 Regular Session, Maya Ballroom III |
Add to My Program |
Sliding Mode Control |
|
|
Chair: Incremona, Gian Paolo | Politecnico Di Milano |
Co-Chair: Ferrara, Antonella | University of Pavia |
|
10:00-10:20, Paper ThAT11.1 | Add to My Program |
Homogeneous Low-Chattering Sliding Mode Discretization |
|
Hanan, Avi | Tel-Aviv University |
Jbara, Adam | Tel-Aviv University |
Levant, Arie | Tel-Aviv University |
Keywords: Variable-structure/sliding-mode control, Nonlinear output feedback, Sampled-data control
Abstract: A low-chattering bihomogeneous discretization method is proposed for homogeneous sliding mode control, which combines two different homogeneity degrees and coordinate weights distributions. The new method is explicit, always preserves system trajectories and accuracies in simulation and control applications. When properly used, it significantly decreases system vibrations in the absence of sampling noises. Numeric experiments demonstrate the efficacy of the approach.
|
|
10:20-10:40, Paper ThAT11.2 | Add to My Program |
Robust Multi-Model Predictive Control Via Integral Sliding Modes |
|
Galván Guerra, Rosalba | Instituto Politécnico Nacional |
Incremona, Gian Paolo | Politecnico Di Milano |
Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
Ferrara, Antonella | University of Pavia |
Keywords: Variable-structure/sliding-mode control, Uncertain systems
Abstract: This paper presents a novel optimal control approach for systems represented by a multi-model, i.e., a finite set of models, each one corresponding to a different operating point. The proposed control scheme is based on the combined use of model predictive control (MPC) and first order integral sliding mode control. The sliding mode control component plays the important role of rejecting matched uncertainty terms possibly affecting the plant, thus making the controlled equivalent system behave as the nominal multi-model. A min-max multi-model MPC problem is solved using the equivalent system without further robustness oriented add-ons. In addition, the MPC design is performed so as to keep the computational complexity limited, thus facilitating the practical applicability of the proposal. Simulation results show the effectiveness of the proposed control approach.
|
|
10:40-11:00, Paper ThAT11.3 | Add to My Program |
Neural Network Based Practical/Ideal Integral Sliding Mode Control |
|
Sacchi, Nikolas | University of Pavia |
Incremona, Gian Paolo | Politecnico Di Milano |
Ferrara, Antonella | University of Pavia |
Keywords: Variable-structure/sliding-mode control, Uncertain systems
Abstract: This paper deals with the design of a novel neural network based integral sliding mode (NN-ISM) control for nonlinear systems with uncertain drift term and control effectiveness matrix. Specifically, this paper extends the classical integral sliding mode control law to the case of unknown nominal model. The latter is indeed reconstructed by two deep neural networks capable of approximating the unknown terms, which are instrumental to design the so-called integral sliding manifold. In the paper, the ultimate boundedness of the system state is formally proved by using Lyapunov stability arguments, thus providing the conditions to enforce practical integral sliding modes. The possible generation of ideal integral sliding modes is also discussed. Moreover, the effectiveness of the proposed NN-ISM control law is assessed in simulation relying on the classical Duffing oscillator.
|
|
11:00-11:20, Paper ThAT11.4 | Add to My Program |
Comparative Study of Three High Order Sliding Mode Model Based Design for a Floating Wind Turbine Robust Control |
|
Basbas, Hedi | Université De Technologie Belfort-Montbéliard (UTBM) |
Obeid, Hussein | University of Caen Normandy |
Laghrouche, Salah | Université De Technologie Belfort-Montbéliard (UTBM) |
Hilairet, Mickael | Université De Franche-Comté |
Plestan, Franck | Ecole Centrale De Nantes-LS2N |
Keywords: Variable-structure/sliding-mode control, Nonlinear systems
Abstract: This article proposes a comparative study of three high order sliding mode control (HOSMCs) algorithms for the 5MW tensioned-leg platform (TLP) based floating offshore wind turbines (FOWT). The control objectives are to regulate the rotor speed to its nominal value and to reduce the fluctuations of the platform pitch angle. A nonlinear control-oriented model of the 5 MW TLP-based FOWT has been selected from the literature and adapted for nonlinear controller design. Then, a continuous-twisting algorithm (CTA), a quasi-continuous homogeneous algorithm (QCA) and a super-twisting algorithm (STA) have been designed to meet the control objectives in turbulent and high wind speed conditions. Finally, these robust nonlinear controllers have been validated in simulations and compared with each other using OpenFAST code in MATLAB/Simulink.
|
|
11:20-11:40, Paper ThAT11.5 | Add to My Program |
Design and Tuning of the Super-Twisting-Based Synchronous Reference Frame Phase-Locked-Loop |
|
Rueda-Escobedo, Juan G. | Brandenburg University of Technology Cottbus - Senftenberg |
Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
Schiffer, Johannes | Brandenburg University of Technology |
Keywords: Variable-structure/sliding-mode control, Power systems, Power electronics
Abstract: The worldwide transition to climate-friendly energy systems entails the substitution of conventional energy generation based on synchronous generators by renewable energy sources based on power electronics. As a consequence, the overall inertia of the grid decreases, resulting in high volatility of the frequency and posing new challenges for the estimation of the latter quantity. To address this problematic, in the present paper a phase-locked-loop (PLL) based on the super-twisting algorithm is considered for the estimation of the phase angle and time-varying frequency of a symmetric three-phase signal. A rigorous proof of the algorithm's exact convergence in the presence of a fast-varying frequency together with tuning rules for its gains are derived by means of Lyapunov theory. Additionally, an estimate of the region of attraction is provided. The effectiveness of the proposed tuning method is illustrated in numerical simulations, while comparing its performance against a standard synchronous reference frame PLL.
|
|
11:40-12:00, Paper ThAT11.6 | Add to My Program |
Discretization of the Super-Twisting Algorithm Using Variational Integrators |
|
Weissenberger, Florian | TU Ilmenau |
Watermann, Lars | TU Ilmenau |
Koch, Stefan | Graz University of Technology |
Reichhartinger, Markus | Graz University of Technology |
Hettiger, Christina | TU Ilmenau |
Eisenzopf, Lukas | Graz University of Technology |
Kumari, Kiran | Indian Institute of Technology Bombay |
Reger, Johann | TU Ilmenau |
Horn, Martin | Graz University of Technology |
Keywords: Variable-structure/sliding-mode control
Abstract: In this paper, a new framework for the discretization of the super-twisting algorithm is developed. The proposed discretization scheme is not based on the underlying differential equations, but uses the variational formulation of the problem. Discrete-time versions derived in the proposed fashion exhibit a great tracking of the continuous-time energy decay rate and give a good approximation of the continuous-time system. Furthermore, the paper presents a stability proof for an exemplary algorithm, a semi-implicit version of the super-twisting algorithm derived by using variational integrators.
|
|
ThAT12 Invited Session, Maya Ballroom IV |
Add to My Program |
Modeling, Estimation, and Control of Epidemic Processes |
|
|
Chair: Pare, Philip E. | Purdue University |
Co-Chair: She, Baike | Purdue University |
Organizer: She, Baike | University of Florida |
Organizer: Sundaram, Shreyas | Purdue University |
Organizer: Pare, Philip E. | Purdue University |
|
10:00-10:20, Paper ThAT12.1 | Add to My Program |
Logarithmic Dynamics and Aggregation in Epidemics (I) |
|
Blanchini, Franco | Univ. Degli Studi Di Udine |
Bolzern, Paolo | Politecnico Di Milano |
Colaneri, Patrizio | Politecnico Di Milano |
De Nicolao, Giuseppe | Univ. Pavia |
Giordano, Giulia | University of Trento |
Keywords: Biological systems, Systems biology, Modeling
Abstract: We consider a class of epidemiological models with an arbitrary number of infected compartments. We show that the logarithmic derivatives of the infected states converge to a consensus; this property rigorously explains the feature empirically observed in real epidemic data: the logarithms of the state variables associated with infected categories tend to behave as "parallel lines". We introduce and characterise the class of contagion functions, i.e., linear co-positive functions of the state variables that decrease (resp. increase) when the reproduction number is smaller (resp. larger) than 1. Finally, we analyse the generalised epidemiological model by considering the susceptible state variable along with a variable that aggregates all the infected compartments: this leads to an auxiliary planar system, governed by two differential inclusions, which has the same structure as the two-dimensional SI model and whose coefficients are functions of the original variables. We prove that well known properties of the classical SI model still hold in this generalised case.
|
|
10:20-10:40, Paper ThAT12.2 | Add to My Program |
A Class of Nonlinear State Observers for an SIS System Counting Primo-Infections (I) |
|
Fang, Marcel | Sorbonne Université, INRIA |
Bliman, Pierre-Alexandre J | Sorbonne Universités, Inria, UPMC Université Paris 06 |
Efimov, Denis | Inria |
Ushirobira, Rosane | Inria |
Keywords: Biological systems, Observers for nonlinear systems, Lyapunov methods
Abstract: Observation and identification are important issues for the practical use of compartmental models of epidemic dynamics. They are usually evaluated based on the number of infected individuals (the prevalence) or the newly infected cases (the incidence). We are interested in a general question: may the measure of the number of primo-infected individuals and the prevalence improve state estimation? To study this question, we analyze in this paper a simple model of infection with waning immunity and, consequently, the possibility of reinfections. A class of nonlinear observers is built for this model, and tractable sufficient conditions on the gain matrices are established, ensuring asymptotic convergence of the state estimate towards its actual value. Numerical simulations illustrate the method.
|
|
10:40-11:00, Paper ThAT12.3 | Add to My Program |
Observer Design for the State Estimation of Epidemic Processes (I) |
|
Niazi, Muhammad Umar B. | KTH Royal Institute of Technology |
Johansson, Karl H. | Royal Institute of Technology |
Keywords: Biological systems, Observers for nonlinear systems, Nonlinear systems
Abstract: Although an appropriate choice of measured state variables may ensure observability, designing state observers for the state estimation of epidemic models remains a challenging task. Epidemic spread is a nonlinear process, often modeled as the law of mass action, which is of a quadratic form; thus, on a compact domain, its Lipschitz constant turns out to be local and relatively large, which renders the Lipschitz-based design criteria of existing observer architectures infeasible. In this paper, a novel observer architecture is proposed for the state estimation of a class of nonlinear systems that encompasses the deterministic epidemic models. The proposed observer offers extra leverage to reduce the influence of nonlinearity in the estimation error dynamics, which is not possible in other Luenberger-like observers. Algebraic Riccati inequalities are derived as sufficient conditions for the asymptotic convergence of the estimation error to zero under local Lipschitz and generalized Lipschitz assumptions. Equivalent linear matrix inequality formulations of the algebraic Riccati inequalities are also provided. The efficacy of the proposed observer design is illustrated by its application on the celebrated SIDARTHE-V epidemic model.
|
|
11:00-11:20, Paper ThAT12.4 | Add to My Program |
Optimal Mitigation of SIR Epidemics under Model Uncertainty (I) |
|
She, Baike | Purdue University |
Sundaram, Shreyas | Purdue University |
Pare, Philip E. | Purdue University |
Keywords: Emerging control applications, Control applications, Optimal control
Abstract: We study the impact of model parameter uncertainty on optimally mitigating the spread of epidemics. We capture the epidemic spreading process using a susceptible infected-removed (SIR) epidemic model and consider testing for isolation as the control strategy. We use a testing strategy to remove (isolate) a portion of the infected population. Our goal is to keep the daily infected population below a certain level, while minimizing the total number of tests. Distinct from existing works on leveraging optimal control strategies in epidemic spreading, we propose a testing strategy by overestimating the seriousness of the epidemic and study the feasibility of the system under the impact of model parameter uncertainty. Compared to the optimal testing strategy, we establish that the proposed strategy under model parameter uncertainty will flatten the curve effectively but require more tests and a longer time period.
|
|
11:20-11:40, Paper ThAT12.5 | Add to My Program |
Approximate Testing in Uncertain Epidemic Processes |
|
Bi, Xiaoqi | University of Illinois, Urbana-Champaign |
Miehling, Erik | University of Illinois at Urbana-Champaign |
Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Estimation, Simulation, Markov processes
Abstract: Diagnostic tests have proven to be a critical tool in controlling the progression of a virus. In this paper, we formulate the testing of a homogeneous population as an optimal control problem. The population state, given by the distribution of agents' viral states in a compartmental model, is assumed to be unknown. Information regarding the population state is provided via noisy tests, which are allocated from a stockpile whose size is updated via a stochastic process. The objective of the control problem is to allocate tests so as to minimize uncertainty of the underlying population state over a finite horizon. As such, the control problem is cast as a POMDP with a negative entropy reward function. We study various heuristic policies and investigate conditions under which each heuristic performs best.
|
|
11:40-12:00, Paper ThAT12.6 | Add to My Program |
Epidemic Population Games with Nonnegligible Disease Death Rate |
|
Certorio, Jair | University of Maryland |
Martins, Nuno C. | University of Maryland |
La, Richard J. | University of Maryland, College Park |
Keywords: Stability of nonlinear systems, Game theory, Emerging control applications
Abstract: A recent article that combines normalized epidemic compartmental models and population games put forth a system theoretic approach to capture the coupling between a population's strategic behavior and the course of an epidemic. It introduced a payoff mechanism that governs the population's strategic choices via incentives, leading to the lowest endemic proportion of infectious individuals subject to cost constraints. Under the assumption that the disease death rate is approximately zero, it uses a Lyapunov function to prove convergence and formulate a quasi-convex program to compute an upper bound for the peak size of the population's infectious fraction. In this article, we generalize these results to the case in which the disease death rate is nonnegligible. This generalization brings on additional coupling terms in the normalized compartmental model, leading to a more intricate Lyapunov function and payoff mechanism. Moreover, the associated upper bound can no longer be determined exactly, but it can be computed with arbitrary accuracy by solving a set of convex programs.
|
|
ThAT13 Regular Session, Maya Ballroom V |
Add to My Program |
Constrained Control and Optimization I |
|
|
Chair: Zaccarian, Luca | LAAS-CNRS and University of Trento |
Co-Chair: Polyakov, Andrey | Inria, Univ. Lille |
|
10:00-10:20, Paper ThAT13.1 | Add to My Program |
Approximation-Free Adaptive Prescribed Performance Control for Unknown SISO Nonlinear Systems with Input Saturation |
|
Trakas, Panagiotis | University of Patras |
Bechlioulis, Charalampos P. | University of Patras |
Keywords: Constrained control, Nonlinear systems, Robust control
Abstract: A universal approximation-free adaptive prescribed performance control scheme is designed for unknown SISO nonlinear systems with input saturation. The proposed control method introduces a compromising relaxation of output performance specifications depending on the input limitations. Given the conflicting nature between input and output constraints, the stability properties are inevitably local. In this respect, a sufficient stability condition for the closed-loop system is provided through theoretical analysis. Owing to the adopted prescribed performance control technique, the satisfaction of the aforementioned stability properties guarantees the desired trade-off between input and output constraints. Moreover, no hard calculations are needed, neither for the controller nor for the adaptive law, maintaining the complexity of the control algorithm relatively low. Finally, the proposed approach is clarified and verified by various simulation studies.
|
|
10:20-10:40, Paper ThAT13.2 | Add to My Program |
Optimal Planetary Landing with Pointing and Glide-Slope Constraints |
|
Leparoux, Clara | ONERA & ENSTA Paris |
Herisse, Bruno | ONERA - the French Aerospace Lab |
Jean, Frederic | ENSTA Paris |
Keywords: Optimal control, Aerospace
Abstract: This paper studies a vertical powered descent problem in the context of planetary landing, considering glide-slope and thrust pointing constraints and minimizing any final cost. After stating the Max-Min-Max or Max-Singular-Max form of the optimal control deduced from the Pontryagin Maximum Principle, it theoretically analyzes the optimal trajectory for a more specific problem formulation to show that there can be at most one contact or boundary interval with the state constraint on each Max or Min arc.
|
|
10:40-11:00, Paper ThAT13.3 | Add to My Program |
Static Linear Anti-Windup Design with Sign-Indefinite Quadratic Forms |
|
Priuli, Alberto | University of Trento |
Tarbouriech, Sophie | LAAS-CNRS |
Zaccarian, Luca | LAAS-CNRS and University of Trento |
Keywords: Constrained control, Stability of nonlinear systems, Lyapunov methods
Abstract: We design static anti-windup gains to mitigate the effect of input saturation in linear output feedback closed loops. The design is conducted with the help of a non-quadratic Lyapunov function involving sign-indefinite quadratic forms, which allows for additional degrees of freedom to be exploited, for the anti-windup gain design. Synthesis conditions, combining the use of sign-indefinite quadratic forms and several sector bound properties are stated in the form of bilinear matrix inequalities ensuring global exponential stability of the closed-loop system. An iterative design algorithm is then devised, based on the resolution of a sequence of semidefinite programs. The superiority of the proposed technique over classical quadratic constructions is illustrated on an example borrowed from the literature, where quadratic positive definite functions are ineffective.
|
|
11:00-11:20, Paper ThAT13.4 | Add to My Program |
Homogeneous Nonovershooting Stabilizers and Safety Filters Rejecting Matched Disturbances |
|
Polyakov, Andrey | Inria, Univ. Lille |
Krstic, Miroslav | University of California, San Diego |
Keywords: Constrained control, Robust control, Stability of nonlinear systems
Abstract: Non-overshooting stabilization is a form of safe control where the setpoint chosen by the user is at the boundary of the safe set. Exponential non-overshooting stabilization, including suitable extensions to systems with deterministic and stochastic disturbances, has been solved by the second author and his coauthors. In this paper we develop homogeneous feedback laws for fixed-time nonovershooting stabilization for nonlinear systems that are input-output linearizable with a full relative degree, i.e., for systems that are diffeomorphically equivalent to the chain of integrators. These homogeneous feedback laws can also assume the secondary role of `fixed-time safety filters' (FxTSf filters) which keep the system within the closed safe set for all time but, in the case where the user's nominal control commands approach to the unsafe set, allow the system to reach the boundary of the safe set no later than a desired time that is independent of nominal control and independent of the value of the state at the time the nominal control begins to be overridden.
|
|
11:20-11:40, Paper ThAT13.5 | Add to My Program |
Output-Feedback Path Planning with Robustness to State-Dependent Errors |
|
Bahreinian, Mahroo | Boston University |
Tron, Roberto | Boston University |
Keywords: Optimization, Control applications, Linear systems
Abstract: We consider the problem of sample-based feedback motion planning from measurements affected by systematic errors. Our previous work presented output feedback controllers that use measurements from landmarks in the environment to navigate through a cell-decomposable environment using duality, Control Lyapunov and Barrier Functions (CLF, CBF), and Linear Programming. In this paper, we build on this previous work with a novel strategy that allows the use of measurements affected by systematic errors in perceived depth (similarly to what might be generated by vision-based sensors), as opposed to accurate displacements measurements. As a result, our new method has the advantage of providing more robust performance (with quantitative guarantees) when inaccurate sensors are used. We test the proposed algorithm in the simulation to evaluate the performance limits of our approach predicted by our theoretical derivations.
|
|
11:40-12:00, Paper ThAT13.6 | Add to My Program |
Lift, Partition, and Project: Parametric Complexity Certification of Active-Set QP Methods in the Presence of Numerical Errors |
|
Arnström, Daniel | Linköping University |
Axehill, Daniel | Linköping University |
Keywords: Optimization algorithms, Predictive control for linear systems, Fault tolerant systems
Abstract: When Model Predictive Control (MPC) is used in real-time to control linear systems, quadratic programs (QPs) need to be solved within a limited time frame. Recently, several parametric methods have been proposed that certify the number of computations active-set QP solvers require to solve these QPs. These certification methods, hence, ascertain that the optimization problem can be solved within the limited time frame. A shortcoming in these methods is, however, that they do not account for numerical errors that might occur internally in the solvers, which ultimately might lead to optimistic complexity bounds if, for example, the solvers are implemented in single precision. In this paper we propose a general framework that can be incorporated in any of these certification methods to account for such numerical errors.
|
|
ThAT14 Regular Session, Maya Ballroom VI |
Add to My Program |
Energy Systems |
|
|
Chair: Namerikawa, Toru | Keio University |
Co-Chair: Richter, Hanz | Cleveland State University |
|
10:00-10:20, Paper ThAT14.1 | Add to My Program |
Energy Cyclo-Directionality, Average Equipartition and Exergy Efficiency of Multidomain Power Networks |
|
Richter, Hanz | Cleveland State University |
Keywords: Energy systems, Power systems, Control of networks
Abstract: A recent formalization of thermodynamics under a dynamical systems approach introduces an axiom restricting the direction of power transmission between two subsystems, reflecting heat transfer from hot to cold bodies. This axiom enables precise results paralleling the statements of classical thermodynamics, including its second law, which places a limit on the amount of work that may be transferred to across system boundaries beyond that imposed by energy conservation. Systems exhibiting non-diffusive power transfer, including those with Hamiltonian dynamics are ruled out. Given that power networks with Hamiltonian dynamics fail the directionality axiom, are they still subject to limitations on their ability to perform work on the surroundings? This paper shows that such systems can satisfy a version of the above axiom involving averages over periodic regimes, revealing limitations on external power transfer and allowing a definition of second-law efficiency and a cyclic interpretation of energy equipartition. Focus is on a class of linear port-Hamiltonian systems, with frequency-domain methods used to describe the pertinent average quantities. The ability to establish an order relationship between the weighted average kinetic and potential energies has a central role. We show that this can be cast as a generalized eigenvalue problem.
|
|
10:20-10:40, Paper ThAT14.2 | Add to My Program |
Power Demand-Supply Adjustment Via Negawatt Trading Based on Regret Matching |
|
Namerikawa, Toru | Keio University |
Keywords: Energy systems, Power systems, Smart grid
Abstract: This paper deals with a power demand-supply management based on negawatt trading, and the purpose of this research is to keep demand supply balance and to minimize the social cost of negawatt trading, balancing generator, and power flow. In this research, consumer and Independent System Operator participate in the electricity market, and each consumer acts based on Regret Matching which is one of the learning algorithms. Furthermore, as the incentive design of negawatt trading, we consider strategy-proofness and individual rationality using VCG mechanism design. Then, we propose a novel negawatt trading algorithm based on Regret Matching and VCG mechanism design. Finally, simulation results show the effectiveness of the proposed algorithm.
|
|
10:40-11:00, Paper ThAT14.3 | Add to My Program |
Energy-Grade Double Pricing Rule in the Heating Market |
|
Yi, Xinyi | Tsinghua University |
Guo, Ye | Tsinghua University |
Sun, Hongbin | Tsinghua University |
Keywords: Energy systems, Power systems
Abstract: The problem of heat system pricing is considered. A direct extension of locational marginal prices (LMP) in electricity markets to heat systems may lead to revenue inadequate issues. The underlying reason for such a problem is that, unlike electric power, heat has different grades and cannot be considered as homogenized commodity. Accordingly, an energy-grade double pricing rule is proposed in this paper. Heat energy and grade prices are explained as the shadow prices related to the nodal heat balance constraints and temperature requirements constraints at the optimal solution. The resulting merchandise surplus at each dispatch interval can be decomposed into several explainable parts, namely, congestion rent, impact from the last period, and impact from the upcoming period. And the total merchandise surplus over all dispatch intervals can be decomposed into several non-negative interpretable parts, including congestion rent and impact from the initial state, thus guaranteeing the revenue adequacy for the heat system operator. Simulations verify the effectiveness of the proposed mechanism.
|
|
11:00-11:20, Paper ThAT14.4 | Add to My Program |
Quantification of Market Power Mitigation Via Efficient Aggregation of Distributed Energy Resources |
|
Gao, Zuguang | The University of Chicago |
Alshehri, Khaled | King Fahd University of Petroleum and Minerals |
Birge, John | University of Chicago |
Keywords: Energy systems, Power systems
Abstract: Distributed energy resources (DERs) such as solar panels have small supply capacities and cannot be directly integrated into wholesale markets. So, the presence of an intermediary is critical. The intermediary could be a profit-seeking entity (called the aggregator) that buys DER supply from prosumers, and then sells them in the wholesale electricity market. Thus, DER integration has an influence on wholesale market prices, demand, and supply. The purpose of this article is to shed light onto the impact of efficient DER aggregation on the market power of conventional generators. Firstly, under efficient DER aggregation, we quantify the social welfare gap between two cases: when conventional generators are truthful, and when they are strategic. We also do the same when DERs are not present. Secondly, we show that the gap due to market power of generators in the presence of DERs is smaller than the one when there is no DER participation. Finally, we provide explicit expressions of the gaps and conduct numerical experiments to gain deeper insights. The main message of this article is that market power of conventional generators can be mitigated by adopting an efficient DER aggregation model.
|
|
11:20-11:40, Paper ThAT14.5 | Add to My Program |
Learning Task-Aware Energy Disaggregation: A Federated Approach |
|
Liu, Ruohong | The Hong Kong University of Science and Technology (Guangzhou) |
Chen, Yize | Berkeley Lab |
Keywords: Energy systems, Smart grid, Machine learning
Abstract: We consider the problem of learning the energy disaggregation signals for residential load data. Such a task is referred as non-intrusive load monitoring (NILM), and in order to find individual devices' power consumption profiles based on aggregated meter measurements, a machine learning model is usually trained based on large amount of training data coming from a number of residential homes. Yet collecting such residential load datasets requires both huge efforts and customers' approval on sharing metering data, while load data coming from different regions or electricity users may exhibit heterogeneous usage patterns. Both practical and privacy concerns make training a single, centralized NILM model challenging. In this paper, we propose a decentralized and task-adaptive learning scheme for NILM tasks, where nested meta-learning and federated learning steps are designed for learning task-specific models collectively. Simulation results on benchmark dataset validate proposed algorithm's performance in efficiently inferring the appliance-level consumption for a variety of homes and appliances.
|
|
11:40-12:00, Paper ThAT14.6 | Add to My Program |
Online Optimal Energy Management for Building Heating with Thermal Energy Storage |
|
Cao, Mengfan | Southern University of Science and Technology |
Chen, Shibo | Southern University of Science and Technology |
Miao, Haoyu | Southern University of Science and Technology |
Yang, Zaiyue | Southern University of Science and Technology |
Keywords: Energy systems, Building and facility automation, Optimization
Abstract: This paper studies the online optimal energy management problem for the building heating system. This system is driven by electric heat pumps, and equipped with the thermal energy storage (TES) that provides heat flexibility. The temporal coupling introduced by building temperature dynamics and TES energy level dynamics, together with the uncertainties of outdoor temperature and real-time electricity price, pose the optimal energy management problem a computationally challenging one. Hence, based on the Lyapunov optimization techniques, we propose an online algorithm to minimize the long-term time average cost of the heating system. Our algorithm can ensure the building temperature constraints and TES energy level constraints are satisfied all the time. The maximum optimality gap of our algorithm is also rigorously derived. Simulation results show that this algorithm can achieve user comfort and reduce energy cost effectiv
|
|
ThAT15 Regular Session, Maya Ballroom VII |
Add to My Program |
Game Theory IV |
|
|
Chair: Marden, Jason R. | University of California, Santa Barbara |
Co-Chair: Paccagnan, Dario | Imperial College London |
|
10:00-10:20, Paper ThAT15.1 | Add to My Program |
Fictitious Play with Maximin Initialization |
|
Ganzfried, Sam | Ganzfried Research |
Keywords: Game theory
Abstract: Fictitious play has recently emerged as the most accurate scalable algorithm for approximating Nash equilibrium strategies in multiplayer games. We show that the degree of equilibrium approximation error of fictitious play can be significantly reduced by carefully selecting the initial strategies. We present several new procedures for strategy initialization and compare them to the classic approach, which initializes all pure strategies to have equal probability. The best-performing approach, called maximin, solves a nonconvex quadratic program to compute initial strategies and results in a nearly 75% reduction in approximation error compared to the classic approach when 5 initializations are used.
|
|
10:20-10:40, Paper ThAT15.2 | Add to My Program |
A Synchronous Updating Rule for Evolutionary Congestion Games Based on Common Information |
|
Jiang, Kaichen | Dalian University of Technology |
Wang, Jinhuan | Hebei University of Technology |
Wu, Yuhu | Dalian University of Technology |
Keywords: Game theory
Abstract: In this paper, we propose a novel method for designing a synchronous strategy updating rule to guarantee the almost sure convergence of the Nash equilibria in evolutionary congestion games. Different from much literature in which the full information of the game is used, in this paper, the information obtained by each player only contains the resource allocation of the last play which is sufficient to guarantee the convergence. The main idea is embedding the player’s time-varying inertia factor in the traditional myopic best response adjustment rule to improve the evolutionary dynamic. The developed theoretical results are simulated by solving a routing problem.
|
|
10:40-11:00, Paper ThAT15.3 | Add to My Program |
Data-Driven Robust Congestion Pricing |
|
Wang, Yize | ETH Zurich |
Paccagnan, Dario | Imperial College London |
Keywords: Game theory, Transportation networks
Abstract: Whilst overloaded transportation systems bear a significant impact on everyone's welfare, governments strive to improve their performances. Amongst the many solutions proposed, congestion pricing is becoming increasingly popular as it has the potential to reduce congestion by indirectly influencing the drivers' routing choices. Commonly advocated for, the marginal cost mechanism ensures that self-interested decision-making results in optimal system performances. However, such a mechanism suffers from three important drawbacks in that i) it requires levying tolls on every road, ii) it does not allow for upper bounds on the magnitude of the tolls, and iii) it is flow-dependent. In response to these challenges, researchers have introduced the restricted network tolling problem, seeking constant tolls of bounded magnitude that induce equilibria with a small social cost. However, tolls designed through this approach are tailored to a specific traffic demand, resulting in a design that has the potential to exacerbate the very issue it set out to solve, if the demand changes. Our work addresses this issue and aims at infusing robustness guarantees to the restricted network tolling problem. We do so by seeking tolls that have good performance over past demand realizations, and leverage recent results in scenario optimization to equip our design with formal generalization guarantees.
|
|
11:00-11:20, Paper ThAT15.4 | Add to My Program |
Strategic Investments in Multi-Stage General Lotto Games |
|
Chandan, Rahul | University of California, Santa Barbara |
Paarporn, Keith | University of Colorado, Colorado Springs |
Alizadeh, Mahnoosh | University of California Santa Barbara |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory
Abstract: In adversarial interactions, one is often required to make strategic decisions over multiple periods of time, wherein decisions made earlier impact a player's competitive standing as well as how choices are made in later stages. In this paper, we study such scenarios in the context of General Lotto games, which models the competitive allocation of resources over multiple battlefields between two players. We propose a two-stage formulation where one of the players has reserved resources that can be strategically pre-allocated across the battlefields in the first stage. The pre-allocation then becomes binding and is revealed to the other player. In the second stage, the players engage by simultaneously allocating their real-time resources against each other. The main contribution in this paper provides complete characterizations of equilibrium payoffs in the two-stage game, revealing the interplay between performance and the amount of resources expended in each stage of the game. We find that real-time resources are at least twice as effective as pre-allocated resources. We then determine the player's optimal investment when there are linear costs associated with purchasing each type of resource before play begins, and there is a limited monetary budget.
|
|
11:20-11:40, Paper ThAT15.5 | Add to My Program |
Complexity and Efficiency of Nash Equilibria in Noncooperative Simple Platoon Games |
|
Ibrahim, Adrianto Ravi | N/A |
Cetinkaya, Ahmet | National Institute of Informatics |
Kishida, Masako | National Institute of Informatics |
Keywords: Game theory, Transportation networks, Optimization
Abstract: We investigate the problem of platoon matching through the lens of complexity and efficiency. Specifically, we consider a noncooperative game among a number of vehicles that decide to form or not to form a platoon on a single road. To characterize the computational complexity of calculating the Nash equilibria in this game, we obtain a general upper bound for the length of any best response sequence. Then, we completely characterize the Nash equilibrium when the vehicles are interchangeable. Regarding the efficiency, we show that platooning games can be very inefficient in the worst case, as they can always have zero price of anarchy even when the vehicles have the same cost function.
|
|
11:40-12:00, Paper ThAT15.6 | Add to My Program |
A Stochastic Generalized Nash Equilibrium Model for Platforms Competition in the Ride-Hail Market |
|
Fabiani, Filippo | IMT School for Advanced Studies Lucca |
Franci, Barbara | Maastricht University |
Keywords: Game theory, Uncertain systems, Traffic control
Abstract: The inherent uncertainties in the ride-hailing market complicate the pricing strategies of on-demand platforms that compete each other to offer a mobility service while striving to maximize their profit. Looking at this problem as a stochastic generalized Nash equilibrium problem (SGNEP), we design a distributed, stochastic equilibrium seeking algorithm with Tikhonov regularization to find an optimal pricing strategy. The proposed iterative scheme does not require a potentially infinite number of samples of the random variable to perform the stochastic approximation, thus making it appealing from a practical perspective. Moreover, we show that the algorithm returns a Nash equilibrium under mere monotonicity assumptions and a careful choice of the step size sequence, obtained by exploiting the specific structure of the SGNEP at hand.
|
|
ThAT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Nonlinear Output Feedback |
|
|
Chair: Ushirobira, Rosane | Inria |
Co-Chair: Zhu, Guchuan | Ecole Polytechnique De Montreal |
|
10:00-10:20, Paper ThAT16.1 | Add to My Program |
LMI-Based Stubborn and Dead-Zone Redesign in Linear Dynamic Output Feedback |
|
Tarbouriech, Sophie | LAAS-CNRS |
Alessandri, Angelo | University of Genoa |
Astolfi, Daniele | Cnrs - Lagepp |
Zaccarian, Luca | LAAS-CNRS and University of Trento |
Keywords: Nonlinear output feedback, LMIs, Stability of nonlinear systems
Abstract: The redesign of output feedback controllers for linear systems based on adaptive saturation (stubborn) and dead-zone redesign is investigated by showing that input-to-state stability holds in closed loop upon the satisfaction of linear matrix inequalities. Such results are obtained by using sector conditions that are involved in the Lyapunov analysis in order to ensure input-to-state stability. A simulation case study shows the effectiveness of the proposed redesign in denoising and outlier attenuation with increased accuracy and precision.
|
|
10:20-10:40, Paper ThAT16.2 | Add to My Program |
A Hybrid Observer-Based Controller for a Non-Uniformly Observable System |
|
Maghenem, Mohamed Adlene | Gipsa Lab, CNRS, France |
Pasillas-Lepine, William | CNRS |
Loria, Antonio | CNRS |
Aguado-Rojas, Missie | Université Paris-Sud, Université Paris-Saclay |
Keywords: Nonlinear output feedback, Hybrid systems, Observers for nonlinear systems
Abstract: For systems that are not observable at the very equilibrium of interest to be stabilized, output-feedback stabilization is considerably challenging. In this paper we solve this control problem for the case-study of a second-order system that is bilinear and affine, both in the input and the output, but it is unobservable at the target equilibrium. The case-study is representative of a well-studied class of non-uniformly observable systems and stems from automotive control. Our main contribution is a novel certainty-equivalence hybrid controller that achieves asymptotic stabilization semiglobally. The controller relies on a switched observer that estimates the state, provided that the latter is ‘kept away’ from the singular equilibrium. To achieve both competing tasks, stabilization and estimation, the controller also relies on the keen construction of a piece-wise constant, converging, reference. Our main results are illustrated via numerical simulations on a meaningful example.
|
|
10:40-11:00, Paper ThAT16.3 | Add to My Program |
Funnel Control under Hard and Soft Output Constraints |
|
Mehdifar, Farhad | KTH Royal Institute of Technology |
Bechlioulis, Charalampos P. | University of Patras |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Nonlinear systems, Constrained control, Uncertain systems
Abstract: This paper proposes a funnel control method under time-varying hard and soft output constraints. First, an online funnel planning scheme is designed that generates a constraint consistent funnel, which always respects hard (safety) constraints, and soft (performance) constraints are met only when they are not conflicting with the hard constraints. Next, the prescribed performance control method is employed for designing a robust low-complexity funnel-based controller for uncertain nonlinear Euler-Lagrangian systems such that the outputs always remain within the planned constraint consistent funnels. Finally, the results are verified with a simulation example of a mobile robot tracking a moving object while staying in a box-constrained safe space.
|
|
11:00-11:20, Paper ThAT16.4 | Add to My Program |
Synchronization of Oscillators by Nonlinear Measurements with Application to VLC |
|
Ushirobira, Rosane | Inria |
Efimov, Denis | Inria |
Costanzo, Antonio | Inria |
Loscri, Valeria | Inria |
Keywords: Nonlinear systems, Nonlinear output feedback, Control over communications
Abstract: This note studies the problem of master-slave phase synchronization between two oscillators described by Andronov-Hopf equations, where pulse-like signals, which can be heavily corrupted by disturbances, are used for communication. The synchronization conditions are established using the Lyapunov function theory, and the synchronization error bounds are evaluated. The results are motivated and illustrated by designing a synchronization scheme for visible light communication (VLC), where a periodic sequence is usually sent at the beginning of the communication to synchronize the clocks of the transmitter and the receiver.
|
|
11:20-11:40, Paper ThAT16.5 | Add to My Program |
Wienerization of Systems in Nonlinear Control Canonical Normal Form |
|
Engmark, Hans | The Norwegian University of Science and Technology |
Bjørkøy, Håvard | Norwegian University of Science and Technology |
Rasheed, Adil | Norwegian University of Science and Technology |
Varagnolo, Damiano | NTNU - Norwegian University of Science and Technology |
Keywords: Nonlinear systems, Nonlinear output feedback, Modeling
Abstract: We extend the concept of model approximation via wienerization to systems in nonlinear control canonical normal form. We elaborate on the conditions for, and implications of, analytically separating nonlinear input affine dynamical systems in state space form in a static part plus a dynamic one. In doing so, we discuss under which conditions Wiener models may approximate the resulting models well. More precisely, we report that a specific bijective transformation of the original nonlinear model will separate the system into a multidimensional state space structure for which it is possible to compare nonlinear Wiener control against linear control for underactuated nonlinear systems. We finally assess how the former type of control has better closed-loop performance than the latter by means of quantitative examples.
|
|
11:40-12:00, Paper ThAT16.6 | Add to My Program |
L^1-Input-To-State Stability for Nonlinear Systems |
|
Zheng, Jun | Southwest Jiaotong University |
Zhu, Guchuan | Ecole Polytechnique De Montreal |
Keywords: Nonlinear systems, Lyapunov methods, Stability of nonlinear systems
Abstract: In this paper, we consider the L^1-input-to-state stability (L^1-ISS) for nonlinear systems with L^1-inputs. The work is twofold: (i) we provide Lyapunov characterizations of L^1-ISS by using the classical Lyapunov method (CLM); (ii) we introduce a novel method, namely, the approximative Lyapunov method (ALM), for the L^1-ISS analysis of nonlinear systems. In particular, unlike the CLM that mainly focuses on the construction of Lyapunov functions (LFs), the ALM amounts to constructing an approximation of LFs by using a convex function, which provides a new methodology for stability analysis of nonlinear systems. In order to illustrate the difference of the two methods, we provide an example of the L^1-ISS analysis for a class of nonlinear systems with L^1-inputs.
|
|
ThAT17 Regular Session, Acapulco |
Add to My Program |
Optimal Control and Applications |
|
|
Chair: Tron, Roberto | Boston University |
Co-Chair: Scruggs, Jeff | University of Michigan |
|
10:00-10:20, Paper ThAT17.1 | Add to My Program |
Low-Complexity Control of Nonholonomic Mobile Robots with Formation Constraints |
|
Min, Xiao | Southeast University |
Baldi, Simone | Southeast University |
Yu, Wenwu | Southeast University |
Keywords: Nonholonomic systems, Distributed control, Autonomous robots
Abstract: Low-complexity control (i.e. control avoiding complex function approximation and dynamic adaptive laws) is important in small mobile robots, which cannot be equipped with high-end communication and control hardware. However, the literature has shown that attaining low-complexity control is challenging in the presence of formation constraints such as connectivity maintenance, safety and performance constraints. This work proposes a low-complexity control design (in the framework of funnel control) which is able to handle a large set of formation constraints, while merely relying on static nonlinear feedback, without any function approximation nor dynamic adaptation mechanism. A simulation study further illustrates the method as compared to the state of the art.
|
|
10:20-10:40, Paper ThAT17.2 | Add to My Program |
Model Reference Control of Constrained Overactuated Systems with Integral Compensation |
|
Le Goff, Gregoire | LAPLACE, University of Toulouse, CNRS, INPT, UPS |
Bodson, Marc | Univ. of Utah |
Fadel, Maurice | LAPLACE/CNRS/ENSEEIHT |
Keywords: Control system architecture, Optimal control, Power electronics
Abstract: A model reference control allocation (CA) method is modified to become a control allocation method with integrator (CAI), it significantly improves control performance while readily integrating with existing methods. In general, CA methods take advantage of the redundancy of an overactuated system to achieve control objectives while respecting actuator limits. The method of this paper adds integral compensation to a CA method for multivariable model reference control with the special property that the closed-loop behavior remains the same. The transfer function matrix is preserved, yet zero static error results when experiencing small parametric uncertainties and constant disturbances. The application of the concept to the coordinated control of multiple buck converters feeding a common load is considered. A simulation of the system confirms the benefits of the proposed method.
|
|
10:40-11:00, Paper ThAT17.3 | Add to My Program |
Adaptive Sampling-Based Motion Planning with Control Barrier Functions |
|
Ahmad, Ahmad | Boston University |
Belta, Calin | Boston University |
Tron, Roberto | Boston University |
Keywords: Randomized algorithms, Autonomous robots, Lyapunov methods
Abstract: Sampling-based algorithms, such as Rapidly Exploring Random Trees (RRT) and its variants, have been used extensively for motion planning. Control barrier functions (CBFs) have been recently proposed to synthesize controllers for safety-critical systems. In this paper, we combine the effectiveness of RRT-based algorithms with the safety guarantees provided by CBFs in a method called CBF-RRT^ast. CBFs are used for local trajectory planning for RRT^ast, avoiding explicit collision checking of the extended paths. We prove that CBF-RRT^ast preserves the probabilistic completeness of RRT^ast. Furthermore, in order to improve the sampling efficiency of the algorithm, we equip the algorithm with an adaptive sampling procedure, which is based on the cross-entropy method (CEM) for importance sampling (IS). The procedure exploits the tree of samples to focus the sampling in promising regions of the configuration space. We demonstrate the efficacy of the proposed algorithms through simulation examples.
|
|
11:00-11:20, Paper ThAT17.4 | Add to My Program |
An LQG-Inspired Framework for Self-Powered Feedback Control |
|
Scruggs, Jeff | University of Michigan |
Ligeikis, Connor | University of Michigan |
Keywords: Smart structures, Control applications
Abstract: A control system is termed ``self-powered" if it can operate solely using the energy extracted from the response of the plant in which it is embedded. Due to dissipative losses in the control transducers, associated power electronics, and the energy storage subsystem, self-powered control laws must adhere to feasibility conditions that are more stringent than classical passivity. In this paper, we present a two-stage framework for the synthesis of self-powered feedback controllers, assuming a stochastic disturbance model and a mean-square performance objective. In the first stage, we optimize an LTI, colocated feedback law that adheres to the self-powered feasibility constraint. In the second stage, we design a nonlinear, full-state feedback controller that is guaranteed to improve upon the closed-loop performance obtained by the first stage design and maintain feasibility. We explore a numerical example that demonstrates a novel application of self-powered control involving the virtual ``coupling" of two civil structures excited by an earthquake disturbance.
|
|
11:20-11:40, Paper ThAT17.5 | Add to My Program |
Stagewise Newton Method for Dynamic Game Control with Imperfect State Observation |
|
Jordana, Armand | New York University |
Hammoud, Bilal | New York University |
Carpentier, Justin | INRIA |
Righetti, Ludovic | New York University |
Keywords: Optimal control, Game theory
Abstract: In this letter, we study dynamic game optimal control with imperfect state observations and introduce an iterative method to find a local Nash equilibrium. The algorithm consists of an iterative procedure combining a backward recursion similar to minimax differential dynamic programming and a forward recursion resembling a risk-sensitive Kalman smoother. A coupling equation renders the resulting control dependent on the estimation. In the end, the algorithm is equivalent to a Newton step but has linear complexity in the time horizon length. Furthermore, a merit function and a line search procedure are introduced to guarantee convergence of the iterative scheme. The resulting controller reasons about uncertainty by planning for the worst case disturbances. Lastly, the low computational cost of the proposed algorithm makes it a promising method to do output-feedback model predictive control on complex systems at high frequency. Numerical simulations on realistic robotic problems illustrate the risk-sensitive behavior of the resulting controller.
|
|
11:40-12:00, Paper ThAT17.6 | Add to My Program |
Constrained Trajectory Synthesis Via Quasi-Interpolation |
|
Ganguly, Siddhartha | Indian Institute of Technology, Bombay |
Randad, Nakul | Indian Institute of Technology Bombay |
Chatterjee, Debasish | Indian Institute of Technology, Bombay |
Banavar, Ravi N. | Indian Institute of Technology |
Keywords: Optimal control, Constrained control, Numerical algorithms
Abstract: In this article we introduce QuITO - Quasi-Interpolation based Trajectory Optimization, a direct multiple shooting algorithm to solve a class of constrained nonlinear minimum energy optimal control problems. This technique is based on the theory of approximate approximations - a quasi-interpolation scheme. We parameterize the control trajectory using the quasi-interpolation formula, and we discretize the optimal control problem using the collocation points on a uniform cardinal grid, thereby transcribing the optimal control problem (OCP) into a nonlinear program (NLP). Several examples are provided to show the numerical fidelity of the algorithm.
|
|
ThBT01 Invited Session, Tulum Ballroom A |
Add to My Program |
Distributed Optimization and Learning for Networked Systems I |
|
|
Chair: Uribe, Cesar A. | Rice University |
Co-Chair: Yang, Tao | Northeastern University |
Organizer: Uribe, Cesar A. | Rice University |
Organizer: Yang, Tao | Northeastern University |
Organizer: Lu, Jie | ShanghaiTech University |
Organizer: Niu, Xiaochun | Northwestern University |
Organizer: Wei, Ermin | Northwestern Univeristy |
Organizer: Nedich, Angelia | Arizona State University |
|
13:30-13:50, Paper ThBT01.1 | Add to My Program |
Optimal Ordering Policies for Multi-Echelon Supply Networks (I) |
|
Caiza, Jose | Purdue University |
Walter, Ian | Purdue University |
Panchal, Jitesh | Purdue University, School of Mechanical Engineering |
Qin, Junjie | Purdue University |
Pare, Philip E. | Purdue University |
Keywords: Manufacturing systems and automation, Network analysis and control, Stochastic optimal control
Abstract: In this paper, we formulate an optimal ordering policy as a stochastic control problem where each firm decides the amount of input goods to order from their upstream suppliers based on the current inventory level of its output good. For this purpose, we provide a closed-form solution for the optimal request of the raw materials given a fixed production policy. We implement the proposed policy on a 15-firm acyclic network based on a real product supply chain. we first simulate ideal demand situations, and then we implement demand-side shocks (i.e., demand levels outside of those considered in the policy formulation) and supply-side shocks (i.e., halts in production for some suppliers) to evaluate the robustness of the proposed policies.
|
|
13:50-14:10, Paper ThBT01.2 | Add to My Program |
Convergence Rates of Average-Reward Multi-Agent Reinforcement Learning Via Randomized Linear Programming (I) |
|
Koppel, Alec | JP Morgan Chase |
Bedi, Amrit Singh | US Army Research Lab |
Ganguly, Bhargav | Purdue University |
Aggarwal, Vaneet | Purdue University |
Keywords: Learning, Optimization, Markov processes
Abstract: In tabular multi-agent reinforcement learning with average-cost criterion, a team of agents sequentially interacts with the environment and observes local incentives. We focus on the case that the global reward is a sum of local rewards, the joint policy factorizes into agents' marginals, and full state observability. To date, few global optimality guarantees exist even for this simple setting, as most results yield convergence to stationarity for parameterized policies in large/possibly continuous spaces. To solidify the foundations of MARL, we build upon linear programming (LP) reformulations, for which stochastic primal-dual methods yield a model-free approach to achieve emph{optimal sample complexity} in the centralized case. We develop multi-agent extensions, whereby agents solve their local saddle point problems and then perform local weighted averaging. We establish that the sample complexity to obtain near-globally optimal solutions matches tight dependencies on the cardinality of the state and action spaces, and exhibits classical scalings with respect to the network in accordance with multi-agent optimization. Experiments corroborate these results in practice.
|
|
14:10-14:30, Paper ThBT01.3 | Add to My Program |
Social Shaping of Dynamic Multi-Agent Systems Over a Finite Horizon (I) |
|
Salehi, Zeinab | The Australian National University |
Chen, Yijun | University of Sydney |
Petersen, Ian R. | Australian National University |
Ratnam, Elizabeth | The Australian National University |
Shi, Guodong | The University of Sydney |
Keywords: Agents-based systems, Optimal control, Smart grid
Abstract: This paper studies self-sustained dynamic multiagent systems (MAS) for decentralized resource allocation operating at a competitive equilibrium over a finite horizon. The utility of resource consumption, along with the income from resource exchange, forms each agent’s payoff which is aimed to be maximized. Each utility function is parameterized by individual preferences which can be designed by agents independently. By shaping these preferences and proposing a set of utility functions, we can guarantee that the optimal resource price at the competitive equilibrium always remains socially acceptable, i.e., it never violates a given threshold that indicates affordability. First, we show this problem is solvable at the conceptual level under some convexity assumptions. Then, as a benchmark case, we consider quadratic MAS and formulate the associated social shaping problem as a multi-agent LQR problem which enables us to propose explicit utility sets using quadratic programming and dynamic programming. Finally, a numerical algorithm is presented for calculating the range of the preference function parameters which guarantee a socially accepted price. Some illustrative examples are given to examine the effectiveness of the proposed methods.
|
|
14:30-14:50, Paper ThBT01.4 | Add to My Program |
Faster Asynchronous Nonconvex Block Coordinate Descent with Locally Chosen Stepsizes |
|
Ubl, Matthew | University of Florida |
Hale, Matthew | University of Florida |
Keywords: Optimization, Agents-based systems, Large-scale systems
Abstract: Distributed nonconvex optimization problems underlie many applications in learning and autonomy, and such problems commonly face asynchrony in agents’ computations and communications. When delays in these operations are bounded, they are called partially asynchronous. In this paper, we present an uncoordinated stepsize selection rule for partially asynchronous block coordinate descent that only requires local information to implement, and it leads to faster convergence for a class of nonconvex problems than existing stepsize rules, which require global information in some form. The problems we consider satisfy the error bound condition, and the stepsize rule we present only requires each agent to know (i) a certain type of Lipschitz constant of its block of the gradient of the objective and (ii) the communication delays experienced between it and its neighbors. This formulation requires less information to be available to each agent than existing approaches, typically allows for agents to use much larger stepsizes, and alleviates the impact of stragglers while still guaranteeing convergence to a stationary point. Simulation results provide comparisons and validate the faster convergence attained by the stepsize rule we develop.
|
|
14:50-15:10, Paper ThBT01.5 | Add to My Program |
Peer-To-Peer Non-Bayesian Learning in Finite Time with a Finite Amount of Communication (I) |
|
Sundaram, Shreyas | Purdue University |
Mitra, Aritra | University of Pennsylvania |
Keywords: Learning, Networked control systems, Communication networks
Abstract: We consider the problem of distributed non-Bayesian learning (or hypothesis testing) where a group of agents interacts over a peer-to-peer network to identify the true state of the world from a finite set of hypotheses, based on a series of stochastic signals that each agent receives. Prior work on this problem has provided distributed algorithms that guarantee asymptotic learning of the true state, with corresponding efforts to improve the rate of learning. In this paper, we first argue that one can readily modify existing asymptotic learning algorithms to enable learning in finite time, effectively yielding arbitrarily large (asymptotic) rates. Furthermore, we show that such finite-time learning can be achieved via a simple algorithm which only requires the agents to exchange a binary vector (of length equal to the number of possible hypotheses) with their neighbors at each time-step. Finally, we show that if the agents know the diameter of the network, our algorithm can be further modified to allow all agents to learn the true state and stop transmitting to their neighbors after a finite number of time-steps.
|
|
15:10-15:30, Paper ThBT01.6 | Add to My Program |
Stochastic Gradient Tracking Methods for Distributed Personalized Optimization Over Networks (I) |
|
Huang, Yan | Zhejiang University |
Xu, Jinming | Zhejiang University |
Meng, Wenchao | Carleton University |
Wai, Hoi-To | The Chinese University of Hong Kong |
Keywords: Optimization algorithms, Distributed control, Networked control systems
Abstract: In this paper, we consider a distributed optimization problem over a network of nodes whose cost functions depend on a decision variable consisting of two parts: global (shared) part and local (node-specific) part. This problem structure arises in several important scenarios where the global part captures the common feature among nodes and the local part represents the personalized feature. To solve this problem, we develop a new personalized distributed stochastic gradient tracking method, where each node locally updates variables in a stochastic way and communicate the shared part with its neighbors for coordination. Leveraging a proper Lyapunov design, we show that the proposed algorithm converges linearly to a neighborhood of the optimum for smooth and strongly convex objective functions. The obtained rate result shows a clear dependence of the convergence performance on the topology and the properties of the objective functions. Numerical examples illustrate the effectiveness of the proposed algorithm towards mitigating data heterogeneity among nodes.
|
|
ThBT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Distributed Control II |
|
|
Chair: Cucuzzella, Michele | University of Groningen |
Co-Chair: Stankovic, Milos | University Singidunum |
|
13:30-13:50, Paper ThBT02.1 | Add to My Program |
Reaching a Consensus with Limited Information |
|
Zhu, Jingxuan | Stony Brook University |
Lin, Yixuan | Stony Brook University |
Liu, Ji | Stony Brook University |
Morse, A. Stephen | Yale Univ |
Keywords: Distributed control, Cooperative control, Agents-based systems
Abstract: In its simplest form the well known consensus problem for a networked family of autonomous agents is to devise a set of protocols or update rules, one for each agent, which can enable all of the agents to adjust or tune their ``agreement variable'' to the same value by utilizing real-time information obtained from their ``neighbors'' within the network. The aim of this paper is to study the problem of achieving a consensus in the face of limited information transfer between agents. By this it is meant that instead of each agent receiving an agreement variable or real-valued state vector from each of its neighbors, it receives a linear function of each state instead. The specific problem of interest is formulated and provably correct algorithms are developed for a number of special cases of the problem.
|
|
13:50-14:10, Paper ThBT02.2 | Add to My Program |
A Distributed Control Framework for the Optimal Operation of DC Microgrids |
|
Fu, Zao | University of Groningen |
Cucuzzella, Michele | University of Pavia |
Cenedese, Carlo | ETH Zurich |
Yu, Wenwu | Southeast University |
Scherpen, Jacquelien M.A. | University of Groningen |
Keywords: Distributed control, Energy systems, Game theory
Abstract: In this paper we propose an original distributed control framework for DC microgrids. We first formulate the (optimal) control objectives as an aggregative game suitable for the energy trading market. Then, based on duality, we analyze the equivalent distributed optimal condition for the proposed aggregative game and design a distributed control scheme to solve it. By interconnecting the DC microgrid and the designed distributed control system in a power preserving way, we steer the DC microgrid's state to the desired optimal equilibrium, satisfying a predefined set of local and coupling constraints. Finally, based on the singular perturbation system theory, we analyze the convergence of the closed-loop system. The simulation results show excellent performance of the proposed control framework.
|
|
14:10-14:30, Paper ThBT02.3 | Add to My Program |
Multi-Agent Actor-Critic Multitask Reinforcement Learning Based on GTD(1) with Consensus |
|
Stankovic, Milos | University Singidunum |
Beko, Marko | COPELABS, Universidade Lusófona De Humanidades E Tecnologias |
Ilic, Nemanja | University of Belgrade |
Stankovic, Srdjan S. | University of Belgrade |
Keywords: Distributed control, Learning, Agents-based systems
Abstract: In this paper, a new distributed multi-agent off-policy Actor-Critic algorithm for collaborative multitask reinforcement learning is proposed. The Critic stage is based on the distributed gradient temporal difference algorithm GTD(1), while the Actor stage is derived from a predefined global criterion function and consists of a complementary consensus-based exact policy gradient algorithm. A proof that the Feller-Markov properties hold for the derived algorithm at the Actor stage is derived. The weak convergence of the algorithm to the set of stationary points of an attached ODE is proved under mild conditions using the two-time-scale stochastic approximation arguments. An experimental verification of the algorithm properties is given, demonstrating its high efficiency and practical applicability.
|
|
14:30-14:50, Paper ThBT02.4 | Add to My Program |
Multi-Agent Persistent Monitoring Via Time-Inverted Kuramoto Dynamics |
|
Boldrer, Manuel | Delft University of Technology |
Pasqualetti, Fabio | University of California, Riverside |
Palopoli, Luigi | University of Trento |
Fontanelli, Daniele | University of Trento |
Keywords: Distributed control, Network analysis and control, Robotics
Abstract: We present a novel distributed multi-robot coordination strategy to persistently monitor a closed path-like environment. Our monitoring strategy relies on a class of time-inverted Kuramoto dynamics, whose multiple equilibria coincide with different monitoring configurations and allow us to tune the covering time of specific areas based on their priority. We provide a detailed analysis of the equilibria of the considered class of time-inverted Kuramoto dynamics and demonstrate the effectiveness of the proposed monitoring strategy via numerical examples.
|
|
14:50-15:10, Paper ThBT02.5 | Add to My Program |
An Adaptive Distributed Protocol for Finite-Time Supremum or Infimum Dynamic Consensus |
|
Lippi, Martina | Roma Tre University |
Furchì, Antonio | Roma Tre University |
Marino, Alessandro | Universita` Degli Studi Di Cassino E Del Lazio Meridionale |
Gasparri, Andrea | Roma Tre University |
Keywords: Distributed control, Networked control systems, Adaptive control
Abstract: In this paper, the problem of distributively tracking the minimum infimum (or maximum supremum) of a set of time-varying signals in finite-time is addressed. More specifically, each agent has access to a local time-varying exogenous signal, and all the agents are required to follow the minimum infimum (or the maximum supremum) of these signals in a distributed fashion. No assumption is made on the network size nor on the bounds of the exogenous signal derivatives. An adaptive protocol is provided which can provably solve the above problem in finite-time for multi-agent systems with undirected connected network topologies. Numerical simulations are provided to corroborate the theoretical findings.
|
|
15:10-15:30, Paper ThBT02.6 | Add to My Program |
Consensus of Open Multi-Agent Systems Over Dynamic Undirected Graphs with Preserved Connectivity and Collision Avoidance |
|
Restrepo, Esteban | KTH Royal Institute of Technology |
Loria, Antonio | CNRS |
Sarras, Ioannis | ONERA |
Marzat, Julien | ONERA - the French Aerospace Lab |
Keywords: Distributed control, Switched systems, Lyapunov methods
Abstract: We address the consensus problem with collision avoidance for multi-agent systems under limited sensing ranges, in the case where new interconnections and agents may be added at any time. The graph topology is represented by a dynamic undirected graph, assumed to be connected only at an initial time, and the open multi-agent system is modeled via a multidimensional impulsive switched representation. We propose a barrier-Lyapunov-function-based consensus control law that guarantees inter-agent collision-avoidance and connectivity maintenance and, relying on the edge-agreement framework, we establish almost-everywhere asymptotic stability of the consensus manifold. The obtained results are also readily applicable to closed multi-agent systems with edge addition. A numerical simulation illustrates the effectiveness of the proposed approach.
|
|
ThBT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Autonomous Systems II |
|
|
Chair: Smith, Stephen L. | University of Waterloo |
Co-Chair: Ramazi, Pouria | Brock University |
|
13:30-13:50, Paper ThBT03.1 | Add to My Program |
Characterizing Oscillations in Heterogeneous Populations of Coordinators and Anticoordinators |
|
Ramazi, Pouria | Brock University |
Roohi, Mohammad | PhD Student at University of Alberta |
Keywords: Game theory, Autonomous systems
Abstract: Oscillations often take place in populations of decision-making individuals that are either a coordinator, who takes action only if enough others do so, or an anticoordinator, who takes action only if few others do so. Populations consisting of exclusively one of these types are known to reach an equilibrium, where every individual is satisfied with her decision. Yet it remains open whether and when oscillations take place in a population consisting of both types, and if they do, what features they share. We take the first step towards answering this question by simulating a well-mixed population of coordinators and anticoordinator, each associated with a possibly unique non-negative threshold and initialized with the strategy A or B. We take the distribution of the actions A over the thresholds as the state of the population dynamics. The dynamics in our example admit two minimally positively invariant sets, where the solution trajectory oscillates, and an equilibrium. We identify the basic properties of the dynamics, based on which, we introduce a class of sets that are positively invariant. Our results highlight the possibility of non-trivial, complex oscillations in the absence of noise and population structure and shed light on the reported oscillations in decision-making populations.
|
|
13:50-14:10, Paper ThBT03.2 | Add to My Program |
Distributed Offline Reinforcement Learning |
|
Heredia, Paulo | Purdue |
George, Jemin | U.S. Army Research Laboratory |
Mou, Shaoshuai | Purdue University |
Keywords: Autonomous systems, Cooperative control, Learning
Abstract: In this work, we explore the problem of offline reinforcement learning for a multi-agent system. Offline reinforcement learning differs from classical online and off-policy reinforcement learning settings in that agents must learn from a fixed and finite dataset. We consider a scenario where there exists a large dataset produced by interactions between an agent an its environment. We suppose the dataset is too large to be efficiently processed by an agent with limited resources, and so we consider a multi-agent network that cooperatively learns a control policy. We present a distributed reinforcement learning algorithm based on Q-learning and an approach called offline regularization. The main result of this work shows that the proposed algorithm converges in the sense that the norm squared error is asymptotically bounded by a constant, which is determined by the number of samples in the dataset. In the simulation, we have implemented the proposed algorithm to train agents to control both a linear system and a nonlinear system, namely the well-known cartpole system. We provide simulation results showing the performance of the trained agents.
|
|
14:10-14:30, Paper ThBT03.3 | Add to My Program |
A Pursuit Evasion Approach for Avoiding an Inattentive Human in the Presence of a Static Obstacle |
|
Wang, YiFeng | University of Waterloo |
Nielsen, Christopher | University of Waterloo |
Smith, Stephen L. | University of Waterloo |
Keywords: Game theory, Autonomous vehicles, Autonomous robots
Abstract: We consider a mobile robot, modelled as a Dubins' car, following a path. In the vicinity of the path is a human, modelled as an agile point mass, and a static obstacle. The robot's objective is to follow the path unless it is absolutely necessary to deviate so as to avoid collision with the static obstacle or human. We seek to guarantee robot's safety, even if the human is distracted or inattentive, and thus we consider worst-case motions for the human. The resulting problem takes the form of a reversed homicidal chauffeur game, but with the addition of a static obstacle. The static obstacle can interfere with the escape route of the robot, and thus fundamentally changes the form of the game. We propose a navigation algorithm for the robot that provably guarantees safety and that attempts to delay its reaction for as long as possible. We validate the proposed approach in simulation and provide a comparison to existing collision avoidance methods.
|
|
14:30-14:50, Paper ThBT03.4 | Add to My Program |
Worst-Case Scenario Evasive Strategies in a Two-On-One Engagement between Dubins' Vehicles with Partial Information |
|
Nath, Suryadeep | Indian Institute of Science |
Ghose, Debasish | Indian Institute of Science |
Keywords: Game theory, Nonholonomic systems, Optimal control
Abstract: In this paper, we consider the problem of pursuit-evasion between two pursuers against an evader with the agents modelled as Dubins' vehicle and having partial information regarding each other's motion. A novel, two-phase, evasive strategy based on worst-case scenario planning and proximity-based maneuvers is proposed. In the first phase, the evader assumes the pursuers to be holonomic and executes a best response strategy. An analytic and geometric approach is used to obtain the evader's strategy. In the second phase, the evader switches to movement along high-curvature paths to side-step the pursuers when they are in proximity of the evader in an attempt to delay capture. We also propose an approximation to compute the evader strategy in the first phase, which enables it to be applicable against multiple non-holonomic pursuers. Illustrative simulation results are given.
|
|
14:50-15:10, Paper ThBT03.5 | Add to My Program |
Reachability Analysis Using Spectrum of Koopman Operator |
|
Umathe, Bhagyashree | Clemson University |
Tellez Castro, Duvan Andres | Universidad Nacional De Colombia |
Vaidya, Umesh | Clemson University |
Keywords: Autonomous systems, Optimization
Abstract: This paper proposes using the Koopman operator for reachability analysis of an autonomous dynamical system. In particular, we demonstrate the application of spectral analysis of the Koopman operator involving eigenfunctions and eigenvalues in the approximate computation of forward and backward reachable sets for an autonomous dynamical system. The formal guarantees for the approximate reachable sets are provided using the Hausdorff distance between sets that measure how far the approximate reachable set is from the true reachable set. A computational framework based on convex optimization is provided to compute the Koopman spectrum and the approximate reachable set. Finally, we present simulation results to demonstrate the application of the developed framework.
|
|
15:10-15:30, Paper ThBT03.6 | Add to My Program |
Congestion-Aware Path Coordination Game with Markov Decision Process Dynamics |
|
Li, Sarah H.Q. | University of Washington |
Calderone, Dan | University of Washington |
Acikmese, Behcet | University of Washington |
Keywords: Game theory, Markov processes, Optimal control
Abstract: Inspired by the path coordination problem arising from robo-taxis, warehouse management, and mixed-vehicle routing, we model a group of heterogeneous players responding to stochastic demands as a congestion game under Markov decision process dynamics. Players share a common state-action space but have unique transition dynamics, and each player's unique cost is a function of the joint state-action probability distribution. For a class of player cost functions, we formulate the player-specific optimization problem, prove equivalence between the Nash equilibrium and the solution of a potential minimization problem, and derive dynamic programming approaches to solve the Nash equilibrium. We apply this game to model multi-agent path coordination and introduce congestion-based cost functions that enable players to complete individual tasks while avoiding congestion with their opponents. Finally, we present a learning algorithm for finding the Nash equilibrium that has linear complexity in the number of players. We demonstrate our game model on a multi-robot warehouse path coordination problem, in which robots autonomously retrieve and deliver packages while avoiding congested paths.
|
|
ThBT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Learning-Based Systems I |
|
|
Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Co-Chair: Beckers, Thomas | University of Pennsylvania |
|
13:30-13:50, Paper ThBT04.1 | Add to My Program |
Deep Graphic FBSDEs for Opinion Dynamics Control |
|
Chen, Tianrong | Georgia Institute of Technology |
Wang, Ziyi | Georgia Institute of Technology |
Theodorou, Evangelos | Georgia Institute of Technology |
Keywords: Machine learning, Stochastic optimal control, Mean field games
Abstract: In this paper, we present a scalable deep learning approach to solve opinion dynamics stochastic optimal control problems with mean field term coupling in the dynamics and cost function. Our approach relies on the probabilistic representation of the solution of the Hamilton-Jacobi-Bellman partial differential equation. Grounded on the nonlinear version of the Feynman-Kac lemma, the solutions of the Hamilton-Jacobi-Bellman partial differential equation are linked to the solution of Forward-Backward Stochastic Differential Equations which can be solved numerically using a novel deep neural network with architecture tailored to the problem in consideration. The resulting algorithm is tested on a polarized opinion consensus experiment. The large-scale (10K) agents experiment validates the scalability and generalizability of our algorithm. The proposed framework opens up the possibility for future applications on extremely large-scale problems.
|
|
13:50-14:10, Paper ThBT04.2 | Add to My Program |
Behavioral Feedback for Optimal LQG Control |
|
Al Makdah, Abed AlRahman | University of California Riverside |
Krishnan, Vishaal | University of California, Riverside |
Katewa, Vaibhav | Indian Institute of Science Bangalore |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Behavioural systems, Optimal control, Learning
Abstract: In this work, we revisit the Linear Quadratic Gaussian (LQG) optimal control problem from a behavioral perspective. Motivated by the suitability of behavioral models for data-driven control, we begin with a reformulation of the LQG problem in the space of input-output behaviors and obtain a complete characterization of the optimal solutions. In particular, we show that the optimal LQG controller can be expressed as a static behavioral-feedback gain, thereby eliminating the need for dynamic state estimation characteristic of state space methods. The static form of the optimal LQG gain also makes it amenable to its computation by gradient descent, which we investigate via numerical experiments. Furthermore, we highlight the advantage of this approach in the data-driven control setting of learning the optimal LQG controller from expert demonstrations.
|
|
14:10-14:30, Paper ThBT04.3 | Add to My Program |
Learning-Based Balancing of Model-Based and Feedback Control for Second-Order Mechanical Systems |
|
Beckers, Thomas | University of Pennsylvania |
Colombo, Leonardo Jesus | Spanish National Research Council |
Morari, Manfred | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Mechatronics, Learning, Adaptive control
Abstract: High-performance tracking control of mechanical systems typically requires model-based control as it enables to counteract undesirable dynamics in a timely fashion. The quality of the compensation depends on the accuracy of the underlying model. However, the dynamics are often (partially) infeasible for complex systems in a-priori unknown environments. Due to the favorable properties of model-based control, the goal is to apply feedback control for error compensation only when necessary. In this paper, we present an online learning-based balancing (OLBB) framework to adapt the feedback gains such that the controlled system with state-dependent uncertainties satisfies given performance specifications. For this purpose, an oracle predicts the unknown dynamics and an error model adapts the feedback gains. The framework can be easily applied to existing control approaches to improve the safety and performance of the closed-loop.
|
|
14:30-14:50, Paper ThBT04.4 | Add to My Program |
Data-Based Actuator Selection for Optimal Control Allocation (I) |
|
Fotiadis, Filippos | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Jiang, Zhong-Ping | New York University |
Keywords: Learning, Linear systems, Autonomous systems
Abstract: In this work, we consider an actuator redundant system, i.e., a system with more actuators than the number of effective control inputs, and bring together connections between control allocation, actuator selection, and learning. In this kind of systems, the actuator commands can be chosen to meet a given control objective while still having leftover degrees of freedom to use towards minimizing the overall actuation energy. We show that this energy can be further minimized by optimally selecting the actuators themselves, which we perform in two different scenarios; first, in the case where the control objective is not known beforehand; and second, in the case where the control objective is defined to be a stabilizing state feedback controller. To relax the requirement for knowledge of the system's plant matrix, we compose a novel learning mechanism based on policy iteration, which computes the anti-stabilizing solution to an associated algebraic Riccati equation using trajectory data. Simulations are performed that demonstrate our approach.
|
|
14:50-15:10, Paper ThBT04.5 | Add to My Program |
Decentralized Event-Triggered Federated Learning with Heterogeneous Communication Thresholds |
|
Zehtabi, Shahryar | Purdue University |
Hosseinalipour, Seyyedali | Purdue University |
Brinton, Christopher | Purdue University |
Keywords: Communication networks, Machine learning
Abstract: A recent emphasis of distributed learning research has been on federated learning (FL), in which model training is conducted by the data-collecting devices. Existing research on FL has mostly focused on a star topology learning architecture with synchronized (time-triggered) model training rounds, where the local models of the devices are periodically aggregated by a centralized coordinating node. However, in many settings, such a coordinating node may not exist, motivating efforts to fully decentralize FL. In this work, we propose a novel methodology for distributed model aggregations via asynchronous, event-triggered consensus iterations over the network graph topology. We consider heterogeneous communication event thresholds at each device that weigh the change in local model parameters against the available local resources in deciding the benefit of aggregations at each iteration. Through theoretical analysis, we demonstrate that our methodology achieves asymptotic convergence to the globally optimal learning model under standard assumptions in distributed learning and graph consensus literature, and without restrictive connectivity requirements on the underlying topology. Subsequent numerical results demonstrate that our methodology obtains substantial improvements in communication requirements compared with FL baselines.
|
|
15:10-15:30, Paper ThBT04.6 | Add to My Program |
Adaptive Decision Method of Directional Drilling Tool Face Based on Model Free Online Learning |
|
Hao, Jiasheng | University of Electronic Science and Technology of China |
Ma, Dongwei | University of Electronic Science and Technology of China(UESTC) |
Liu, Wei | Ccdc Drilling&production Technology Research Institute of Cnpc |
Peng, Zhinan | University of Electronic Science and Technology of China |
Chen, Dong | Drilling and Production Engineering Technology Research Institut |
Wang, Yang | University of Electronic Science and Technology of China |
Keywords: Intelligent systems, Machine learning, Control applications
Abstract: Tool face adjustment in oil and gas drilling are vital issues that affect the efficiency and safety of directional drilling. The existing tool face adjustment methods mainly rely on manual real-time intervention for continuous adjustment. Affected by manual experience, the effect is unstable and the labor cost is high. With rising energy consumption, the need for intelligent directional drilling is becoming more pressing. However, establishing an autonomous adjustment approach for the tool face remains challenging because of the variety of complex downhole environments encountered during actual drilling operations. This paper proposes a model-free online learning adaptive decision strategy for cross well intelligent adjustment and stability of tool face. A reward function embedded with expert operating experience is designed to learn the directional policy from the driller's corrective actions. Further, to improve the efficiency of online learning, a priority-based experience playback algorithm is developed. A data-driven directional drilling simulation environment is proposed to realize the accurate simulation of the directional drilling process and pre-training of directional strategy. Simulations are carried out to validate the efficacy of the proposed method. The outcomes of field application suggest that the proposed strategy can achieve decision-making goals in a short period.
|
|
ThBT05 Invited Session, Tulum Ballroom E |
Add to My Program |
Recent Advances in Learning and Control I |
|
|
Chair: Azizan, Navid | MIT |
Co-Chair: Wierman, Adam | California Institute of Technology |
Organizer: Qu, Guannan | Carnegie Mellon University |
Organizer: Wierman, Adam | California Institute of Technology |
Organizer: Zhang, Kaiqing | MIT |
|
13:30-13:50, Paper ThBT05.1 | Add to My Program |
Revisiting PGD Attacks for Stability Analysis of High-Dimensional Nonlinear Systems and Perception-Based Control |
|
Havens, Aaron | University of Illinois at Urbana-Champaign |
Keivan, Darioush | University of Illinois at Urbana Champaign |
Seiler, Peter | University of Michigan, Ann Arbor |
Dullerud, Geir E. | Univ of Illinois, Urbana-Champaign |
Hu, Bin | University of Illinois at Urbana-Champaign |
Keywords: Stability of nonlinear systems, Optimization algorithms, Neural networks
Abstract: Many existing region-of-attraction (ROA) analysis tools find difficulty in addressing feedback systems with large-scale neural network (NN) policies and/or high-dimensional sensing modalities such as cameras. In this paper, we tailor the projected gradient descent (PGD) attack method as a general-purpose ROA analysis tool for high-dimensional nonlinear systems and end-to-end perception-based control. We show that the ROA analysis can be approximated as a constrained maximization problem such that PGD-based iterative methods can be directly applied. In the model-based setting, we show that the PGD updates can be efficiently performed using back-propagation. In the model-free setting, we propose a finite-difference PGD estimate which is general and only requires a black-box simulator for generating the trajectories of the closed-loop system given any initial state. Finally, we demonstrate the scalability and generality of our analysis tool on several numerical examples with large state dimensions or complex image observations.
|
|
13:50-14:10, Paper ThBT05.2 | Add to My Program |
Achieving Logarithmic Regret Via Hints in Online Learning of Noisy LQR Systems (I) |
|
Akbari, Mohammad | Queen's University |
Gharesifard, Bahman | University of California, Los Angeles |
Linder, Tamas | Queen's University |
Keywords: Adaptive control, Iterative learning control, Identification for control
Abstract: We consider the problem of online adaptive control of a linear-quadratic system, where the true system transition parameters (matrices A and B) are unknown. The objective is to design and analyze algorithms that generate control policies with sublinear “regret”, defined as the difference between the cumulative cost of the policies generated by the algorithm and the cumulative cost of the optimal policy. Recent studies show that when the system parameters are fully unknown, for any algorithm that only uses data from the past system trajectory, there is a choice of system parameters such that the algorithm at best achieves a square root regret, providing a hard fundamental limit on the achievable regret in general. However, it is known that (poly)-logarithmic regret is achievable when only matrix A or only matrix B is unknown. We prove a result, encompassing both of these scenarios, showing that (poly)logarithmic regret is achievable when both of these matrices are unknown, but a hint about them is given to the learner over time.
|
|
14:10-14:30, Paper ThBT05.3 | Add to My Program |
Privacy-Preserving Reinforcement Learning Beyond Expectation |
|
Rajabi, Arezoo | University of Washington |
Ramasubramanian, Bhaskar | Western Washington University |
Maruf, Abdullah Al | University of Washington |
Poovendran, Radha | University of Washington |
Keywords: Learning, Agents-based systems, Markov processes
Abstract: Cyber and cyber-physical systems equipped with machine learning algorithms such as autonomous cars share environments with humans. In such a setting, it is important to align system (or agent) behaviors with the preferences of one or more human users. We consider the case when an agent has to learn behaviors in an unknown environment. Our goal is to capture two defining characteristics of humans: i) a tendency to assess and quantify risk, and ii) a desire to keep decision making hidden from external parties. We incorporate cumulative prospect theory (CPT) into the objective of a reinforcement learning (RL) problem for the former. For the latter, we use differential privacy. We design an algorithm to enable an RL agent to learn policies to maximize a CPT-based objective in a privacy-preserving manner, and establish guarantees on the privacy of value functions learned by the algorithm when the rewards are sufficiently close. This is accomplished through adding a calibrated noise using a Gaussian process mechanism at each step. Through empirical evaluations, we highlight a privacy-utility tradeoff and demonstrate that the RL agent is able to learn behaviors that are aligned with that of a human user in the same environment in a privacy-preserving manner.
|
|
14:30-14:50, Paper ThBT05.4 | Add to My Program |
Model-Free Learning of Regions of Attraction Via Recurrent Sets (I) |
|
Shen, Yue | The Johns Hopkins University |
Bichuch, Maxim | Johns Hopkins University |
Mallada, Enrique | Johns Hopkins University |
Keywords: Learning, Stability of nonlinear systems, Autonomous systems
Abstract: We consider the problem of learning an inner approximation of the region of attraction (ROA) of an asymptotically stable equilibrium point without an explicit model of the dynamics. Rather than leveraging approximate models with bounded uncertainty to find a (robust) invariant set contained in the ROA, we propose to learn sets that satisfy a more relaxed notion of containment known as recurrence. We define a set to be tau-recurrent (resp. k-recurrent) if every trajectory that starts within the set, returns to it after at most tau seconds (resp. k steps). We show that under mild assumptions a tau-recurrent set containing a stable equilibrium must be a subset of its ROA. We then leverage this property to develop algorithms that compute inner approximations of the ROA using counter-examples of recurrence that are obtained by sampling finite-length trajectories. Our algorithms process samples sequentially, which allows them to continue being executed even after an initial offline training stage. We further provide an upper bound on the number of counter-examples used by the algorithm, and almost sure convergence guarantees.
|
|
14:50-15:10, Paper ThBT05.5 | Add to My Program |
One-Pass Learning Via Bridging Orthogonal Gradient Descent and Recursive Least-Squares (I) |
|
Min, Youngjae | MIT |
Ahn, Kwangjun | MIT |
Azizan, Navid | MIT |
Keywords: Machine learning, Estimation, Optimization
Abstract: While deep neural networks are capable of achieving state-of-the-art performance in various domains, their training typically requires iterating for many passes over the dataset. However, due to computational and memory constraints and potential privacy concerns, storing and accessing all the data is impractical in many real-world scenarios where the data arrives in a stream. In this paper, we investigate the problem of one-pass learning, in which a model is trained on sequentially arriving data without retraining on previous datapoints. Motivated by the increasing use of overparameterized models, we develop Orthogonal Recursive Fitting (ORFit), an algorithm for one-pass learning which seeks to perfectly fit every new datapoint while changing the parameters in a direction that causes the least change to the predictions on previous datapoints. By doing so, we bridge two seemingly distinct algorithms in adaptive filtering and machine learning, namely the recursive least-squares (RLS) algorithm and orthogonal gradient descent (OGD). Our algorithm uses the memory efficiently by exploiting the structure of the streaming data via an incremental principal component analysis (IPCA). Further, we show that, for overparameterized linear models, the parameter vector obtained by our algorithm is what stochastic gradient descent (SGD) would converge to in the standard multi-pass setting. Finally, we generalize the results to the nonlinear setting for highly overparameterized models, relevant for deep learning. Our experiments show the effectiveness of the proposed method compared to the baselines.
|
|
15:10-15:30, Paper ThBT05.6 | Add to My Program |
Behavioral Uncertainty Quantification for Data-Driven Control (I) |
|
Padoan, Alberto | ETH Zürich |
Coulson, Jeremy | ETH Zürich |
van Waarde, Henk J. | University of Groningen |
Lygeros, John | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Behavioural systems, Robust control, Uncertain systems
Abstract: This paper explores the problem of uncertainty quantification in the behavioral setting for data-driven control. Building on classical ideas from robust control, the problem is regarded as that of selecting a metric which is best suited to a data-based description of uncertainties. Leveraging on Willems’ fundamental lemma, restricted behaviors are viewed as subspaces of fixed dimension, which may be represented by data matrices. Consequently, metrics between restricted behaviors are defined as distances between points on the Grassmannian, i.e., the set of all subspaces of equal dimension in a given vector space. A new metric is defined on the set of restricted behaviors as a direct finite-time counterpart of the classical gap metric. The metric is shown to capture parametric uncertainty for the class of autoregressive (AR) models. Numerical simulations illustrate the value of the new metric with a data-driven mode recognition and control case study.
|
|
ThBT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Nonlinear Estimation |
|
|
Chair: Chong, Michelle | Eindhoven University of Technology |
Co-Chair: Efimov, Denis | Inria |
|
13:30-13:50, Paper ThBT06.1 | Add to My Program |
Approximation of Koopman Operators: Irregular Domains and Positive Orbits |
|
Powell, Nathan | Virginia Tech |
Liu, Bowei | Virginia Tech |
Kurdila, Andrew J. | Virginia Tech |
Guo, Jia | Georgia Institute of Technology |
Burns, John A | Virginia Tech |
Estes, Leonard Boone | Virginia Tech |
Paruchuri, Sai Tej | Lehigh University |
Keywords: Estimation, Nonlinear systems, Machine learning
Abstract: In this paper we derive rates of convergence of a class of approximations of the Koopman operator that are obtained using the extended domain decomposition (EDMD) method. The paper extends the previous approaches that consider the case in which samples become dense in or approach a smooth Riemannian manifold or a regular subset of RR^d. Here, we extend these results to the case where samples become dense in some compact subset of the state space without requiring the common regularity properties. Rates of convergence of approximations of the Koopman operator are described in terms of approximation spaces, spectral approximation spaces, and in terms of the power function of the kernel of a reproducing kernel Hilbert space sH. A numerical example is presented to demonstrate the results.
|
|
13:50-14:10, Paper ThBT06.2 | Add to My Program |
Entropy for Incremental Stability of Nonlinear Systems under Disturbances |
|
Chong, Michelle | Eindhoven University of Technology |
Keywords: Nonlinear systems, Estimation, Quantized systems
Abstract: Entropy notions for varepsilon-incremental practical stability and incremental stability of deterministic nonlinear systems under disturbances are introduced. The entropy notions are constructed via a set of points in state space which induces the desired stability properties, called an approximating set. We provide conditions on the system which ensures that the approximating set is finite. Lower and upper bounds for the two estimation entropies are computed. The construction of the finite approximating sets induces a robust state estimation algorithm for systems under disturbances using quantized and time-sampled measurements.
|
|
14:10-14:30, Paper ThBT06.3 | Add to My Program |
On Simple Design of a Robust Finite-Time Observer |
|
Nekhoroshikh, Artem | ITMO University |
Efimov, Denis | Inria |
Polyakov, Andrey | Inria, Univ. Lille |
Perruquetti, Wilfrid | Ecole Centrale De Lille |
Furtat, Igor | Institute of Problems of Mechanical Engineering Russian Academy |
Keywords: Estimation, Observers for Linear systems, Stability of nonlinear systems
Abstract: This paper presents a novel observer that estimates the state of a linear system in the observer canonical form in finite time. Due to the specially selected dynamical gain, the proposed observer is easy to tune and, at the same time, it is robust to external perturbations (e.g., state disturbances and measurement noises). The tuning algorithm consists in solving a simple system of linear matrix inequalities with only two adjustable parameters. Theoretical results and a comparison with homogeneous and prescribed-time observers are illustrated by numerical simulation.
|
|
14:30-14:50, Paper ThBT06.4 | Add to My Program |
Almost Finite-Time Observers for a Family of Nonlinear Continuous-Time Systems |
|
Mazenc, Frederic | Inria Saclay |
Malisoff, Michael | Louisiana State University |
Keywords: Observers for nonlinear systems, Estimation, Stability of nonlinear systems
Abstract: We provide a new class of observers for a class of nonlinear systems which are not required to be affine in the unmeasured states. Our observers ensure exponential convergence of the observation errors to zero. The rate of exponential convergence of the observation errors converges to infinity, as the growth rate of the nonlinear state-dependent part of the dynamics converges to zero. Therefore, we call the observers almost finite-time observers. Under global Lipschitz conditions on the state-dependent part of the dynamics, we obtain a global result that ensures convergence of the observers, for all initial states. Then, for cases where the nonlinearity is of order two at the origin, we provide local results ensuring exponential convergence of the observation errors to zero, when the initial state is small enough. We apply our global result to a model of a pendulum with friction, and our local result to aclass of dynamics with Lotka-Volterrra types of nonlinearities
|
|
14:50-15:10, Paper ThBT06.5 | Add to My Program |
Invariant Smoothing with Low Process Noise |
|
Chauchat, Paul | CentraleSupelec |
Bonnabel, Silvere | Mines ParisTech |
Barrau, Axel | Safran |
Keywords: Estimation, Robotics, Observers for nonlinear systems
Abstract: In this paper we address smoothing (that is, optimisation-based) estimation techniques for localisation problems in the case where motion sensors are very accurate. Our mathematical analysis focuses on the difficult limit case where motion sensors are infinitely precise, resulting in the absence of process noise. Then the formulation degenerates, as the dynamical model that serves as a soft constraint becomes an equality constraint, and conventional smoothing methods are not able to fully respect it. By contrast, once an appropriate Lie group embedding has been found, we prove theoretically that invariant smoothing gracefully accommodates this limit case in that the estimates tend to be consistent with the induced constraints when the noise tends to zero. Simulations on the important problem of initial alignement in inertial navigation show that, in a low noise setting, invariant smoothing may favorably compare to state-of-the-art smoothers when using precise inertial measurements units (IMU).
|
|
15:10-15:30, Paper ThBT06.6 | Add to My Program |
LMI-Based Observer Design for Non-Globally Lipschitz Systems Using Kirszbraun-Valentine Extension Theorem |
|
Zemouche, Ali | CRAN UMR CNRS 7039 & Inria: EPI-DISCO |
Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Observers for nonlinear systems, Estimation, LMIs
Abstract: This paper deals with observer design for a class of nonlinear systems. The paper makes two notable contributions. First, we propose a solution to design an observer for systems without global Lipschitz conditions by extending the nonlinearities to globally Lipschitz functions. Secondly, we provide a novel Linear Matrix Inequality (LMI) condition ensuring asymptotic convergence of the observer. The extension of the nonlinearities is achieved by exploiting old mathematical tools on Kirszbraun-Valentine Lipschitz extensions. Once the structure of the observer, based on the extended functions, is well-posed, we propose a new LMI technique, which is more general and more convenient than those existing in the literature.
|
|
ThBT07 Invited Session, Tulum Ballroom G |
Add to My Program |
Distributionally Robust Optimization and Control |
|
|
Chair: Cherukuri, Ashish | University of Groningen |
Co-Chair: Boskos, Dimitris | TU Delft |
Organizer: Cherukuri, Ashish | University of Groningen |
Organizer: Boskos, Dimitris | TU Delft |
|
13:30-13:50, Paper ThBT07.1 | Add to My Program |
Data-Driven Distributionally Robust Optimization Over a Network Via Distributed Semi-Infinite Programming (I) |
|
Cherukuri, Ashish | University of Groningen |
Zolanvari, Alireza | University of Groningen |
Banjac, Goran | ETH Zurich |
Hota, Ashish | Indian Institute of Technology (IIT), Kharagpur |
Keywords: Optimization, Large-scale systems, Optimization algorithms
Abstract: This paper focuses on solving a data-driven distributionally robust optimization problem over a network of agents. The agents aim to minimize the worst-case expected cost computed over a Wasserstein ambiguity set that is centered at the empirical distribution. The samples of the uncertainty are distributed across the agents. Our approach consists of reformulating the problem as a semi-infinite program and then designing a distributed algorithm that solves a generic semi-infinite problem that has the same information structure as the reformulated problem. In particular, the decision variables consist of both local ones that agents are free to optimize over and global ones where they need to agree on. Our distributed algorithm is an iterative procedure that combines the notions of distributed ADMM and the cutting-surface method. We show that the iterates converge asymptotically to a solution of the distributionally robust problem to any pre-specified accuracy. Simulations illustrate our results.
|
|
13:50-14:10, Paper ThBT07.2 | Add to My Program |
Uncertain Uncertainty in Data-Driven Stochastic Optimization: Towards Structured Ambiguity Sets (I) |
|
Chaouach, Lotfi Mustapha | TU Delft |
Boskos, Dimitris | TU Delft |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Statistical learning, Optimization, Uncertain systems
Abstract: Ambiguity sets of probability distributions are a prominent tool to hedge against distributional uncertainty in stochastic optimization. The aim of this paper is to build tight Wasserstein ambiguity sets for data-driven optimization problems. The method exploits independence between the distribution components to introduce structure in the ambiguity sets and speed up their shrinkage with the number of collected samples. Tractable reformulations of the stochastic optimization problems are derived for costs that are expressed as sums or products of functions that depend only on the individual distribution components. The statistical benefits of the approach are theoretically analyzed for compactly supported distributions and demonstrated in a numerical example.
|
|
14:10-14:30, Paper ThBT07.3 | Add to My Program |
Distributionally Robust Optimization Via Haar Wavelet Ambiguity Sets (I) |
|
Boskos, Dimitris | TU Delft |
Cortes, Jorge | University of California, San Diego |
Martinez, Sonia | University of California at San Diego |
Keywords: Statistical learning, Optimization, Uncertain systems
Abstract: This paper introduces a spectral parameterization of ambiguity sets to hedge against distributional uncertainty in stochastic optimization problems. We build an ambiguity set of probability densities around a histogram estimator, which is constructed by independent samples from the unknown distribution. The densities in the ambiguity set are determined by bounding the distance between the coefficients of their Haar wavelet expansion and the expansion of the histogram estimator. This representation facilitates the computation of expectations, leading to tractable minimax problems that are linear in the parameters of the ambiguity set, and enables the inclusion of additional constraints that can capture valuable prior information about the unknown distribution.
|
|
14:30-14:50, Paper ThBT07.4 | Add to My Program |
Data-Driven Distributionally Robust MPC for Systems with Uncertain Dynamics (I) |
|
Micheli, Francesco | ETH Zurich |
Summers, Tyler H. | University of Texas at Dallas |
Lygeros, John | ETH Zurich |
Keywords: Predictive control for linear systems, Uncertain systems, Stochastic optimal control
Abstract: We present a novel data-driven distributionally robust Model Predictive Control formulation for unknown discrete-time linear time-invariant systems affected by unknown and possibly unbounded additive uncertainties. We use off-line collected data and an approximate model of the dynamics to formulate a finite-horizon optimization problem. To account for both the uncertainty related to the dynamics and the disturbance acting n the system, we resort to a distributionally robust formulation that optimizes the cost expectation while satisfying Conditional Value-at-Risk constraints with respect to the worst-case probability distributions of the uncertainties within an ambiguity set defined using the Wasserstein metric. Using results from the distributionally robust optimization literature we derive a tractable finite-dimensional convex optimization problem with finite-sample guarantees for the class of convex piecewise affine cost and constraint functions. The performance of the proposed algorithm is demonstrated in closed-loop simulation on a simple numerical example.
|
|
14:50-15:10, Paper ThBT07.5 | Add to My Program |
Partially Observable Markov Decision Subject to Total Variation Distance Ambiguity (I) |
|
Tzortzis, Ioannis | University of Cyprus |
Charalambous, Charalambos D. | University of Cyprus |
Keywords: Markov processes, Stochastic optimal control, Uncertain systems
Abstract: In this paper we consider such Markov decision problems on finite spaces, when the probability distribution of the output conditioned on the sufficient statistic, belongs to a ball, with respect to the total variation distance metric, centered around a nominal conditional distribution. We formulate the Markov decision problem, as a minimax optimization problem in which the minimization is over the admissible set of control strategies, while the maximization is over the conditional distributions which lie in the total variation distance ball. Then we present a two-step procedure to determine the optimal control strategies. In step 1, we determine the maximizing conditional distribution, as a function of the nominal conditional distribution and the radius of the ball; it is defined on a finite partition of the observation space of aggregating the output states to form new states. In step 2, the minimax problem is transformed into a minimization problem over the control strategies, and a new dynamic programming equation is derived, where the conditional expectation is taken with respect to the maximizing conditional distribution. Surprisingly, although the current paper deals with partially observable Markov decision problems, the mathematical analysis is tractable.
|
|
15:10-15:30, Paper ThBT07.6 | Add to My Program |
Wasserstein Distributionally Robust Control of Partially Observable Linear Systems: Tractable Approximation and Performance Guarantee (I) |
|
Hakobyan, Astghik | Seoul National University |
Yang, Insoon | Seoul National University |
Keywords: Stochastic optimal control, Stochastic systems, Optimal control
Abstract: Wasserstein distributionally robust control (WDRC) is an effective method for addressing inaccurate distribution information about disturbances in stochastic systems. It provides various salient features, such as an out-of-sample performance guarantee, while most of the existing methods use full-state observations. In this paper, we develop a computationally tractable WDRC method for discrete-time partially observable linear-quadratic (LQ) control problems. The key idea is to reformulate the WDRC problem as a novel minimax control problem with an approximate Wasserstein penalty. We derive a closed-form expression of the optimal control policy of the approximate problem using a nontrivial Riccati equation. We further show the guaranteed cost property of the resulting controller and identify a provable bound for the optimality gap. Finally, we evaluate the performance of our method through numerical experiments using both Gaussian and non-Gaussian disturbances.
|
|
ThBT08 Regular Session, Tulum Ballroom H |
Add to My Program |
Secure Control Systems |
|
|
Chair: Garg, Kunal | University of Michigan-Ann Arbor |
Co-Chair: Gupta, Abhishek | The Ohio State University |
|
13:30-13:50, Paper ThBT08.1 | Add to My Program |
Control Barrier Function Based Attack-Recovery with Provable Guarantees |
|
Garg, Kunal | University of Michigan-Ann Arbor |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Cardenas, Alvaro A. | University of California, Santa Cruz |
Keywords: Attack Detection, Cyber-Physical Security, Constrained control
Abstract: This paper studies provable security guarantees for cyber-physical systems (CPS) under actuator attacks. Specifically, we consider safety for CPS and propose a new attack-detection mechanism based on a zeroing control barrier function (ZCBF) condition. To reduce the conservatism in its implementation, we design an adaptive recovery mechanism based on how close the state is to violating safety. We show that the attack-detection mechanism is sound, i.e., there are no false negatives for adversarial attacks. Finally, we use a Quadratic Programming (QP) approach for online recovery (and nominal) control synthesis. We demonstrate the effectiveness of the proposed method in a case study involving a quadrotor with an attack on its motors.
|
|
13:50-14:10, Paper ThBT08.2 | Add to My Program |
Change Detection of Markov Kernels with Unknown Pre and Post Change Kernel |
|
Chen, Hao | The Ohio Sate University |
Tang, Jiacheng | Ohio State University |
Gupta, Abhishek | The Ohio State University |
Keywords: Attack Detection, Markov processes, Cyber-Physical Security
Abstract: In this paper, we develop a new change detection algorithm for detecting a change in the Markov kernel over a metric space in which the post-change kernel is unknown. Under the assumption that the pre- and post-change Markov kernel is uniformly ergodic, we derive an upper bound on the mean delay and a lower bound on the mean time between false alarms. A numerical simulation is provided to demonstrate the effectiveness of our method.
|
|
14:10-14:30, Paper ThBT08.3 | Add to My Program |
Co-Design of Watermarking and Robust Control for Security in Cyber-Physical Systems |
|
Goyal, Raman | Palo Alto Research Center |
Somarakis, Christoforos | Palo Alto Research Center |
Noorani, Erfaun | University of Maryland College Park |
Rane, Shantanu | Palo Alto Research Center |
Keywords: Attack Detection, Optimization, Cyber-Physical Security
Abstract: This work discusses a novel framework for simultaneous synthesis of optimal watermarking signal and robust controllers in cyber-physical systems to minimize the loss in performance due to added watermarking signal and to maximize the detection rate of the attack. A general dynamic controller is designed to improve system performance with respect to the mathcal H_2 norm, while a watermarking signal is added to improve security performance concerning the detection rate of replay attacks. The attack model considered in the paper is a replay attack, a natural attack mode when the dynamics of the system is unknown to the attacker. The paper first generalizes the existing result on the detection rate of chi^2 detector from a static-LQR controller to a general dynamic controller. The design improvements on both robustness and security fronts are obtained by iteratively solving the convex subsets of the formulated non-convex problem in terms of the controller and watermarking signal. A semi-definite programming optimization is formulated using Linear Matrix Inequality (LMI) results to solve the larger system-level design optimization problem. We highlight the effectiveness of our method over a simplified three-tank chemical system.
|
|
14:30-14:50, Paper ThBT08.4 | Add to My Program |
Fragility-Aware Stealthy Attack Strategy for Multi-Robot Systems against Multi-Hop Wireless Networks |
|
Gao, Yun | Shanghai Jiao Tong University |
Luo, Kai | Shanghai Jiao Tong University |
Fang, Chongrong | Shanghai Jiao Tong University |
He, Jianping | Shanghai Jiao Tong University |
Keywords: Computer/Network Security, Cyber-Physical Security, Cooperative control
Abstract: This paper proposes a fragility-aware stealthy attack for multiple mobile robots to secretly learn the accurate model of a multi-hop wireless network (MHWN). The stealthy attack scenario consists of three steps: active detection, node hijacking, and network eavesdropping. Our goal is to design distributed motion control strategies for naive attackers, for obtaining the connectivity information to evaluate the fragility degree of each node in MHWN. In order to achieve this goal, a switching control strategy is employed, which includes (i) move-to-fragile-neighbors-barycenter (MFNB) law and (ii) move-to-undetected-region-centroid (MURC) law. This strategy can drive mobile attackers to detect, hijack and eavesdrop on the MHWN’s nodes as quickly as possible. As a result, the fragility degree map of MHWN can be secretly learned to paralyze the whole network. Finally, the stability of the multi-robot system is proved and the effectiveness of the proposed strategy is verified.
|
|
14:50-15:10, Paper ThBT08.5 | Add to My Program |
Synthesizing Attack-Aware Control and Active Sensing Strategies under Reactive Sensor Attacks |
|
Udupa, Sumukha | University of Florida |
Kulkarni, Abhishek | University of Florida at Gainesville |
Han, Shuo | University of Illinois at Chicago |
Leslie, Nandi | U.S. Army Research Laboratory |
Kamhoua, Charles | U.S. Army Research Laboratory |
Fu, Jie | University of Florida |
Keywords: Discrete event systems, Game theory, Markov processes
Abstract: We consider the probabilistic planning problem for a defender (P1) who can jointly query the sensors and take control actions to reach a set of goal states while being aware of possible sensor attacks by an adversary (P2) who has perfect observations. To synthesize a provably-correct, attack-aware joint control and active sensing strategy for P1, we construct a stochastic game on graph with augmented states that include the actual game state (known only to the attacker), the belief of the defender about the game state (constructed by the attacker based on his knowledge of the defender's observations). We present an algorithm to compute a belief-based, randomized strategy for P1 to satisfy the reachability objective with probability one, under the worst-case sensor attacks carried out by an informed P2. We prove the correctness of the algorithm and illustrate it using an example.
|
|
15:10-15:30, Paper ThBT08.6 | Add to My Program |
Sensor Deception Attacks against Initial-State Privacy in Supervisory Control Systems |
|
Yao, Jingshi | Shanghai Jiao Tong University |
Yin, Xiang | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Supervisory control, Automata
Abstract: This paper investigates the problem of synthesizing sensor deception attackers against privacy in the context of supervisory control of discrete-event systems (DES). We consider a DES plant controlled by a supervisor, which is subject to sensor deception attacks. Specifically, we consider an active attacker that can tamper with the observations received by the supervisor by, e.g., hacking on the communication channel between the sensors and the supervisor. The privacy requirement of the supervisory control system is to maintain initial-state opacity, i.e., it does not want to reveal the fact that it was initiated from a secret state during its operation. On the other hand, the attacker aims to deceive the supervisor, by tampering with its observations, such that initial-state opacity is violated due to incorrect control actions. In this work, we investigate from the attacker's point of view by presenting an effective approach for synthesizing sensor attack strategies threatening the privacy of the system. To this end, we propose the All Attack Structure (AAS) that records state estimates for both the supervisor and the attacker. This structure serves as a basis for synthesizing a sensor attack strategy. We also discuss how to simplify the synthesis complexity by leveraging the structural property of the initial-state privacy requirement. A running academic example is provided to illustrate the synthesis procedure.
|
|
ThBT09 Regular Session, Maya Ballroom I |
Add to My Program |
Linear Systems II |
|
|
Chair: Rodrigues, Luis | Concordia University |
Co-Chair: van Waarde, Henk J. | University of Groningen |
|
13:30-13:50, Paper ThBT09.1 | Add to My Program |
Data-Driven Criteria for Detectability and Observer Design for LTI Systems |
|
Mishra, Vikas Kumar | Technische Universitat Kaiserlautern |
van Waarde, Henk J. | University of Groningen |
Bajcinca, Naim | University of Kaiserslautern |
Keywords: Linear systems, Observers for Linear systems, LMIs
Abstract: We study the problems of determining the de- tectability and designing a state observer for linear time- invariant systems from measured data. First, we establish algebraic criteria to verify the detectability of the system from noise-free data. Then, we formulate data-driven linear matrix inequality-based conditions for observer design. Finally, we give conditions to infer the detectability of the system from noisy data.
|
|
13:50-14:10, Paper ThBT09.2 | Add to My Program |
A Data-Driven Approach to Distributed Modal Consensus and Synchronization |
|
Monti, Andrea | University of Rome Tor Vergata |
Galeani, Sergio | Università Di Roma Tor Vergata |
Possieri, Corrado | Università Degli Studi Di Roma "Tor Vergata" |
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Linear systems, Output regulation, Sampled-data control
Abstract: In this paper, we propose a data-driven control strategy to solve the distributed modal consensus and synchronization problems. The proposed solution relies only on input/output data and does not require any knowledge of the dynamics of each agent. Furthermore, it is shown that the synchronization task requires to address the problem of transferring, in finite time, the state of a system from an initial state to a terminal one in a completely data-driven framework, which is therefore tackled and solved here. The above concepts are then illustrated via a multi-agent system consisting of RC circuits.
|
|
14:10-14:30, Paper ThBT09.3 | Add to My Program |
Competitive-Ratio and Regret-Optimal Control with General Weights |
|
Sabag, Oron | Caltech |
Lale, Sahin | Caltech |
Hassibi, Babak | Caltech |
Keywords: Linear systems, Optimal control, Stochastic optimal control
Abstract: Motivated by the online policy design approaches in learning theory, new controller design paradigms such as competitive-ratio and regret-optimal control have been recently proposed as alternatives to the classical H2 and H∞ controllers. These metrics respectively consider the performances against a clairvoyant controller, which has access to future disturbances, in terms of ratio and difference. Even though the prior works on regret-optimal control provide its exact solution, in the competitive-ratio setting the solution is only provided for the suboptimal problem. In this work, we give the first exact solution to the optimal competitive-ratio control problem and present an explicit construction of the optimal competitive-ratio controller. A key technique that underpins our explicit solution is a reduction of the competitive-ratio control problem to the Nehari extension problem (similar to the regret-optimal control setting). The resulting optimal competitive-ratio controller is given by an explicit state space and the state-feedback law that is inherited from the H2 controller. Inspired by this explicit solution, we generalize regret-optimal control to have weight functions on the state, input, and noise sequences and show that competitive-ratio control is an instance of this general framework. The utilization of weight functions allow penalization of particular sequences, but still enjoying the explicit and optimal solution for the regret-optimal control problem.
|
|
14:30-14:50, Paper ThBT09.4 | Add to My Program |
Discrete-Time Output Feedback under Bounded Actuators: Single and Multi-Agent Problems |
|
Xu, Yicheng | University of California Irvine |
Jabbari, Faryar | Univ. of California at Irvine |
Keywords: Agents-based systems, Linear systems, Discrete event systems
Abstract: Synthesis results are presented for a general discrete-time dynamic output feedback controller and its anti-windup augmentation for operation under bounded actuation. The approach is then adopted for the leader-follower tracking problem for a multi-agent system, relying on the information shared among agents, through the network. For performance, we consider the l_2 gain from the disturbances to a general output, and obtain upper bounds for such gains. The scheme developed includes sufficient conditions, in the form of LMIs, for both controller design and anti-windup synthesis. Numerical examples are presented to show the effectiveness.
|
|
14:50-15:10, Paper ThBT09.5 | Add to My Program |
Two Double Recursive Block Macaulay Matrix Algorithms to Solve Multiparameter Eigenvalue Problems |
|
Vermeersch, Christof | KU Leuven |
De Moor, Bart L.R. | Katholieke Universiteit Leuven |
Keywords: Numerical algorithms, Linear systems, Identification
Abstract: We present two double recursive block Macaulay matrix algorithms to solve multiparameter eigenvalue problems (MEPs). In earlier work, we have developed a non-recursive approach that finds the solutions of an MEP via a multidimensional realization problem in the null space of the block Macaulay matrix constructed from the coefficient matrices of that MEP. However, this approach requires an iterative increase of the degree of the block Macaulay matrix: in order to determine whether the null space contains all the (affine) solutions of the MEP, we need to compute a basis matrix of the null space for every degree and check its dimension or rank structure. In this letter, we employ a recursive/sparse technique to compute a basis matrix of the null space of the block Macaulay matrix and a recursive technique to perform the necessary rank checks. We provide two system identification examples to show our improvements in computation time and memory usage.
|
|
15:10-15:30, Paper ThBT09.6 | Add to My Program |
From LQR to Static Output Feedback: A New LMI Approach |
|
Rodrigues, Luis | Concordia University |
Keywords: LMIs, Linear systems, Computer-aided control design
Abstract: This paper proposes a new Linear Matrix Inequality (LMI) for static output feedback control assuming that a Linear Quadratic Regulator (LQR) has been previously designed for the system. The main idea is to use a quadratic candidate Lyapunov function for the closedloop system parameterized by the unique positive definite matrix that solves the Riccati equation. A converse result will also be proved guaranteeing the existence of matrices satisfying the LMI if the system is static output feedback stabilizable. The static output feedback includes the LQR solution as a special case when the state is available, which is a desirable property. The examples show that the method is successful and works well in practice.
|
|
ThBT10 Regular Session, Maya Ballroom II |
Add to My Program |
Markov Processes |
|
|
Chair: Coogan, Samuel | Georgia Institute of Technology |
Co-Chair: Mahajan, Aditya | McGill University |
|
13:30-13:50, Paper ThBT10.1 | Add to My Program |
Polya Decision Processes: A New History-Dependent Framework for Reinforcement Learning |
|
Kohjima, Masahiro | NTT Corporation |
Keywords: Markov processes, Learning, Stochastic optimal control
Abstract: We propose a new framework for sequential decision making, Polya Decision Processes (PDP); it can express the agent's history-dependent transitions by using the Polya urn model. We show that PDP can be converted into a new type of Belief-MDP, whose belief update equation requires only urn model parameters. We introduce its theory, value iteration algorithm, and reinforcement learning algorithm for PDP using the belief state representation. Their effectiveness is confirmed by numerical experiments.
|
|
13:50-14:10, Paper ThBT10.2 | Add to My Program |
Policy Gradient Primal-Dual Mirror Descent for Constrained MDPs with Large State Spaces |
|
Ding, Dongsheng | University of Southern California |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Markov processes, Machine learning, Optimization algorithms
Abstract: We study constrained sequential decision-making problems modeled by constrained Markov decision processes with potentially infinite state spaces. We propose a Bregman distance-based direct policy search method -- policy gradient primal-dual mirror descent -- which includes the natural policy primal-dual method and the projected policy primal-dual method as two special cases. When the exact gradient is known, we prove dimension-free global convergence with a sublinear rate in both optimality gap and constraint violation. When the exact gradient is not available, we instantiate our algorithm in the linear function approximation setting and establish sample complexity guarantees. The introduction of the Bregman-distance regularizers enjoys the dimension-free property with applicability to large-scale spaces, the first of its kind in the constrained RL literature.
|
|
14:10-14:30, Paper ThBT10.3 | Add to My Program |
Partially Observable Restless Bandits with Restarts: Indexability and Computation of Whittle Index |
|
Akbarzadeh, Nima | McGill University |
Mahajan, Aditya | McGill University |
Keywords: Markov processes, Stochastic systems, Large-scale systems
Abstract: We consider restless bandits with restarts, where the state of the active arms resets according to a known probability distribution while the state of the passive arms evolves in a Markovian manner. We assume that the state of the arm is observed after it is reset but not observed otherwise. We show that the model is indexable and propose an efficient algorithm to compute the Whittle index by exploiting the qualitative properties of the optimal policy. A detailed numerical study of machine repair models shows that Whittle index policy outperforms myopic policy and is close to optimal policy.
|
|
14:30-14:50, Paper ThBT10.4 | Add to My Program |
Learning Gaussian Hidden Markov Models from Aggregate Data |
|
Singh, Rahul | Georgia Institute of Technology, Atlanta, GA |
Chen, Yongxin | Georgia Institute of Technology |
Keywords: Markov processes, Stochastic systems, Statistical learning
Abstract: We consider system identification (learning) problems for Gaussian hidden Markov models (GHMMs). We propose an algorithm to tackle the cases where the data is recorded in aggregate (collective) form generated by a large population of individuals following a certain dynamics. Our parameter learning algorithm is built upon the expectation-maximization algorithm with a novel expectation step proposed recently known as the collective Gaussian forward-backward algorithm. The proposed learning algorithm generalizes the traditional Baum-Welch learning algorithm for GHMMs as it naturally reduces to the latter in case of individual observations.
|
|
14:50-15:10, Paper ThBT10.5 | Add to My Program |
Safe Learning for Uncertainty-Aware Planning Via Interval MDP Abstraction |
|
Jiang, Jesse | Georgia Institute of Technology |
Zhao, Ye | Georgia Tech |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Markov processes, Automata, Hybrid systems
Abstract: We study the problem of refining satisfiability bounds for partially-known switched stochastic systems against planning specifications defined using syntactically co-safe Linear Temporal Logic (scLTL). We propose an abstraction-based approach that iteratively generates high-confidence Interval Markov Decision Process (IMDP) abstractions of the system from high-confidence bounds on the unknown component of the dynamics obtained via Gaussian process regression. In particular, we develop a synthesis strategy to sample the unknown dynamics by finding paths which avoid specification-violating states using a product IMDP formulation. We further provide a heuristic to choose among various candidate paths to maximize the information gain. Finally, we propose an iterative algorithm to synthesize a satisfying control policy for the product IMDP system. We demonstrate our work with a case study on mobile robot navigation.
|
|
15:10-15:30, Paper ThBT10.6 | Add to My Program |
Online Planning of Uncertain MDPs under Temporal Tasks and Safe-Return Constraints |
|
Zhang, Yuyang | Peking University |
Guo, Meng | Peking University |
Keywords: Markov processes, Formal Verification/Synthesis, Human-in-the-loop control
Abstract: This paper addresses the online motion planning problem of mobile robots under complex high-level tasks. The robot motion is modeled as an uncertain Markov Decision Process (MDP) due to limited initial knowledge, while the task is specified as Linear Temporal Logic (LTL) formulas. The proposed framework enables the robot to explore and update the system model in a Bayesian way, while simultaneously optimizing the asymptotic costs of satisfying the complex temporal task. Theoretical guarantees are provided for the synthesized outgoing policy and safety policy. More importantly, instead of greedy exploration under the classic ergodicity assumption, a safe-return requirement is enforced such that the robot can always return to home states with a high probability. The overall methods are validated by numerical simulations.
|
|
ThBT11 Regular Session, Maya Ballroom III |
Add to My Program |
Uncertain Systems I |
|
|
Chair: Sznaier, Mario | Northeastern University |
Co-Chair: Silvestre, Daniel | ISR |
|
13:30-13:50, Paper ThBT11.1 | Add to My Program |
Data-Driven Superstabilizing Control of Error-In-Variables Discrete-Time Linear Systems |
|
Miller, Jared | Northeastern University |
Dai, Tianyu | Northeastern University |
Sznaier, Mario | Northeastern University |
Keywords: Uncertain systems, Robust control, Optimization
Abstract: This paper proposes a method to find super-stabilizing controllers for discrete-time linear systems that are consistent with a set of corrupted observations. The L-infinity bounded measurement noise introduces a bilinearity between the unknown plant parameters and noise terms. A super-stabilizing controller may be found by solving a feasibility problem involving a set of polynomial nonnegativity constraints in terms of the unknown plant parameters and noise terms. A sequence of sum-of-squares (SOS) programs in rising degree will yield a super-stabilizing controller if such a controller exists. Unfortunately, these SOS programs exhibit very poor scaling as the degree increases. A theorem of alternatives is employed to yield equivalent, convergent (under mild conditions), and more computationally tractable SOS programs.
|
|
13:50-14:10, Paper ThBT11.2 | Add to My Program |
On Discretization Methods for Indirect Adaptive Sliding Mode Control |
|
Hettiger, Christina | TU Ilmenau |
Watermann, Lars | TU Ilmenau |
Kumari, Kiran | Indian Institute of Technology Bombay |
Eisenzopf, Lukas | Graz University of Technology |
Weissenberger, Florian | TU Ilmenau |
Horn, Martin | Graz University of Technology |
Koch, Stefan | Graz University of Technology |
Reger, Johann | TU Ilmenau |
Reichhartinger, Markus | Graz University of Technology |
Keywords: Variable-structure/sliding-mode control, Indirect adaptive control
Abstract: In this paper, discrete-time versions of the first-order indirect adaptive sliding mode control (SMC) algorithm are developed and analyzed. In particular, a scalar linear system with parametric uncertainty linear in the state and with bounded input disturbance is considered. The discretization of continuous-time first-order indirect adaptive SMC algorithm may not preserve continuous-time properties like asymptotic stability of the origin. In order to ensure boundedness, knowledge about where the actual parameter may be located is used in this work. Then, for each proposed discretization scheme, conditions on the sampling time are derived that guarantee convergence into a bounded set. The theoretical results are illustrated with simulation examples and the proposed discretization schemes are compared with regard to the condition on sampling time and precision.
|
|
14:10-14:30, Paper ThBT11.3 | Add to My Program |
Learning Random Feature Dynamics for Uncertainty Quantification |
|
Agudelo-España, Diego | Max Planck Institute for Intelligent Systems |
Nemmour, Yassine | Max Planck Institute for Intelligent Systems |
Schölkopf, Bernhard | MPI for Biological Cybernetics |
Zhu, Jia-Jie | Max Planck Institute for Intelligent Systems |
Keywords: Uncertain systems, Learning, Optimal control
Abstract: An inherent challenge of learning-based control tasks is posed by uncertainty due to finite training datasets. Even though there are principled tools to obtain confidence bounds for pointwise evaluation of learned dynamics models, it remains a challenging task to quantify the induced uncertainty in downstream quantities of interest due to the intrinsic recursive structure of dynamic systems. In this paper, we view the unknown one-step dynamics as a smooth function in a reproducing kernel Hilbert space and leverage random features for an approximate but highly structured parameterization of pointwise confidence bounds. As a result, we obtain downstream confidence bounds through an optimal control formulation under an uncertainty-aware random feature dynamics model. Our model is effectively a shallow neural network, which enables us to view the corresponding dynamic system as a deep neural network. Exploiting this perspective, we show that a Pontryagin’s minimum principle solution is equivalent to using the Frank-Wolfe algorithm on the induced neural network. Various numerical experiments on dynamics learning showcase the capacity of our methodology.
|
|
14:30-14:50, Paper ThBT11.4 | Add to My Program |
Approximate Information States for Worst-Case Control of Uncertain Systems |
|
Dave, Aditya | University of Delaware |
Senthil Kumar, Nishanth Venkatesh | University of Delaware |
Malikopoulos, Andreas A. | University of Delaware |
Keywords: Uncertain systems, Robust control, Markov processes
Abstract: In this paper, we investigate a worst-case-scenario control problem with a partially observed state. We consider a non-stochastic formulation, where noises and disturbances in our dynamics are uncertain variables which take values in finite sets. In such problems, the optimal control strategy can be derived using a dynamic program (DP) with respect to the memory. The computational complexity of this DP can be improved using a conditional range of the state instead of the memory. We present a more general definition of an information state which is sufficient to construct a DP without loss of optimality, and show that the conditional range is an example of an information state. Next, we extend this notion to define an approximate information state and an approximate DP. We also bound the maximum loss of optimality when using an approximate DP to derive the control strategy. Finally, we illustrate our results in a numerical example.
|
|
14:50-15:10, Paper ThBT11.5 | Add to My Program |
Certifying the Intersection of Reach Sets of Integrator Agents with Set-Valued Input Uncertainties |
|
Haddad, Shadi | University of California, Santa Cruz |
Halder, Abhishek | University of California, Santa Cruz |
Keywords: Uncertain systems, Variational methods, Computational methods
Abstract: We consider the problem of verifying safety for a pair of identical integrator agents in continuous time with compact set-valued input uncertainties. We encode this verification problem as that of certifying or falsifying the intersection of their reach sets. We transcribe the same into a variational problem, namely that of minimizing the support function of the difference of the two reach sets over the unit sphere. We illustrate the computational tractability of the proposed formulation by developing two cases in detail, viz. when the inputs have time-varying norm-bounded and generic hyperrectangular uncertainties. We show that the latter case allows distributed certification via second order cone programming.
|
|
15:10-15:30, Paper ThBT11.6 | Add to My Program |
Accurate Guaranteed State Estimation for Uncertain LPVs Using Constrained Convex Generators |
|
Silvestre, Daniel | NOVA University of Lisbon |
Keywords: Uncertain systems, Autonomous systems, Estimation
Abstract: Guaranteed state estimation for autonomous vehicles in GPS-denied areas that resort to landmarks detection and onboard sensors requires set-membership techniques that are capable of representing heterogeneous bounds using hyper-planes and ellipsoids. Recently, in the literature, the concept of Convex Constrained Generators (CCGs) has been introduced for the case where the dynamical system can be represented by a Linear Parameter-Varying (LPV) model. However, in practical applications, dynamics have uncertain parameters caused by noise-corrupted measurements of quantities of interest such as mass or orientation angles. In this paper, we first explore a closed-form solution for the convex hull of polytopes to showcase the main challenges of guaranteed state estimation for uncertain LPVs. We then propose the use of CCGs to have low conservatism when in the presence of distance measurements and avoid the exponential growth of the generators used in the state representation by performing an approximation using ray-shooting. Simulations illustrate the ability of CCGs to accurately model distance measurements with the corresponding decrease in volume without adding additional constraints.
|
|
ThBT12 Tutorial Session, Maya Ballroom IV |
Add to My Program |
Modelling and Control of Epidemics across Scales |
|
|
Chair: Giordano, Giulia | University of Trento |
Co-Chair: Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
|
13:30-13:31, Paper ThBT12.1 | Add to My Program |
Modelling and Control of Epidemics across Scales (I) |
|
Hernandez-Vargas, Esteban Abelardo | University of Idaho |
González, Alejandro H. | CONICET-Universidad Nacional Del Litoral |
Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
Bi, Xiaoqi | University of Illinois, Urbana-Champaign |
Calà Campana, Francesca | University of Trento |
Giordano, Giulia | University of Trento |
Keywords: Biological systems, Network analysis and control, Systems biology
Abstract: When facing the global health threat posed by an infectious disease, predictive mathematical models are crucial not only to understand and forecast the epidemic evolution, but also to plan effective control strategies that contrast the disease and its spread in the population. This tutorial aims to give a broad overview of the fundamental developments enabled by systems-and-control methodologies in modelling and controlling epidemiological dynamics across scales, from infection dynamics within hosts to contagion dynamics between hosts. The first part is focused on modelling and control of infectious diseases in the host, capturing the dynamic interplay between pathogens and the immune system, and discussing control strategies to design tailored therapies and treatments to optimally clear the infection. The second part deals with the spread of contagion between hosts: epidemic dynamics are modelled resorting to networked systems where the nodes represent individuals and the links represent interactions that can lead to contagion, and a comparison to compartmental models is carried out. The third part surveys multi-scale models and multi-pronged approaches to contrast the spread of infectious diseases: a holistic perspective is adopted, including behavioural and socio-economic aspects along with public health issues, to discuss optimal epidemic control across scales.
|
|
13:31-14:10, Paper ThBT12.2 | Add to My Program |
Modelling and Control of Infectious Diseases in the Host (I) |
|
Hernandez-Vargas, Esteban Abelardo | University of Idaho |
González, Alejandro H. | CONICET-Universidad Nacional Del Litoral |
Keywords: Biological systems, Biomolecular systems, Cellular dynamics
Abstract: Control of infectious diseases is a multidisciplinary field linking the application of engineering principles, mathematical modelling, medicine, and biology for healthcare purposes. This talk aims to present interdisciplinary tools to tackle infectious diseases at the host level. Dissecting detailed contributions of key players of the immune system to infectious diseases as well as their respective interactions will be discussed. Parameter fitting procedures to adjust parameters based on experimental data will be developed. Consequently, mathematical models will serve to perform stability analysis and control strategies for personalized therapies based on in-hosts models for several viral infections. Concepts as the critical fraction of susceptible/non-infected cells (under which the infection can no longer increase) can be fully understood and used in more general control objectives if put in terms of the equilibrium sets and their stability. Based on classical Lyapunov methods, a full characterization of the dynamical behaviour of the target-cell models under control actions will be discussed. Furthermore, based on the concept of virus spreadability, antiviral effectiveness thresholds are determined to establish whether a given treatment will be able to clear the infection. Also, it is shown how to simultaneously minimize the total fraction of infected cells while maintaining the virus load under a given level. Several examples for parameter fitting, modelling, and control applications are presented.
|
|
14:10-14:50, Paper ThBT12.3 | Add to My Program |
Analyzing and Controlling Epidemic Process Dynamics Over Networks (I) |
|
Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
Bi, Xiaoqi | University of Illinois, Urbana-Champaign |
Keywords: Biological systems, Network analysis and control, Markov processes
Abstract: The study of epidemic processes has been of interest over a wide range of fields for the past century, including in mathematical systems, biology, physics, computer science, social sciences and economics. There has been renewed interest in the study of epidemic processes focused on the spread of viruses over networks, motivated not only by recent outbreaks of infectious diseases, but also by the rapid spread of opinions over social networks, and the security threats posed by computer viruses. Most of the models considered in recent studies have been focused on network models with static network structures, however many of the systems being considered have inherently dynamic structures. In this talk we will discuss data-informed modeling and equilibria analysis results for epidemic processes over both static and time-varying networks, with the goal being to elucidate the behavior of such spread processes. Multi-strain models, and issues arising from the use of data from ongoing viral outbreaks, will also be discussed as time allows. Simulation results and potential mitigation actions will be reviewed to conclude the talk.
|
|
14:50-15:30, Paper ThBT12.4 | Add to My Program |
Holistic Models and Multi-Pronged Interventions to Contrast Epidemics (I) |
|
Calà Campana, Francesca | University of Trento |
Giordano, Giulia | University of Trento |
Keywords: Biological systems, Systems biology, Network analysis and control
Abstract: Models are crucial to unveil the mechanisms of epidemic phenomena and forecast their evolution based on the available data. The tutorial will present a holistic viewpoint on epidemics that spans across multiple scales and includes opinion dynamics and socio-economic aspects along with public health issues. Multi-scale models capture both in-host infection (individual-level) and between-host contagion (population-level) dynamics, and consider the interplay between immunological (individual) and epidemiological (population) phenomena. Understanding infection and contagion dynamics through models enables the rigorous design of mathematical control approaches for effective suppression and mitigation. Multi-pronged strategies to control epidemics across scales leverage both pharmaceutical interventions, such as drugs and vaccines, and non-pharmaceutical interventions, including hygiene, use of personal protective equipment, physical distancing, travel bans. Public measures, essential to contain the contagion, need to be carefully planned to maximize their effectiveness and account for opinion-driven adherence to guidelines, treatments and vaccination. The interplay between contagion and opinion-driven behaviours can be captured by coupling epidemic dynamics with opinion dynamics describing how the attitude towards responsible behaviour evolves in the population, thus affecting the spread of the infection.
|
|
ThBT13 Regular Session, Maya Ballroom V |
Add to My Program |
Constrained Control and Optimization II |
|
|
Chair: Athanasopoulos, Nikolaos | Queen's University Belfast |
Co-Chair: Saoud, Adnane | CentraleSupelec |
|
13:30-13:50, Paper ThBT13.1 | Add to My Program |
A Dynamic Grid-Based Q-Learning for Noise Covariance Adaptation in EKF and Its Application in Navigation |
|
Dai, Xiang | CentraleSupélec |
Fourati, Hassen | University Grenoble Alpes |
Prieur, Christophe | CNRS |
Keywords: Kalman filtering, Machine learning, Stochastic systems
Abstract: The process and measurement noise covariance matrices significantly impact the Extended Kalman Filter (EKF) performance and are often hand-tuned in practice, which usually entails a tedious task. Q-learning, a well-known method in reinforcement learning, has been applied recently to better adapt the noise covariance matrices for EKF thanks to its simplicity and capability in handling uncertain environments. Typically, some heuristics are involved in designing the Q-learning-based EKF (QLEKF), such as the tuning of grid size and covariance matrices values of each state, which inevitably degrade the estimation performance when the heuristics are not suitable. We propose a dynamic grid-based Q-learning EKF (DG-QLEKF) to overcome that drawback, which brings two novelties, an updated epsilon-greedy algorithm and a dynamic grid strategy. The proposed algorithm and strategy can thoroughly exploit arbitrary search scope and find appropriate values of noise covariance matrices. The effectiveness of DG-QLEKF, applied in navigation for attitude and bias estimation, is validated through the Monte Carlo method and real flight data from an unmanned aerial vehicle. The DG-QLEKF leads to much more improved state estimation than the QLEKF and traditional EKF.
|
|
13:50-14:10, Paper ThBT13.2 | Add to My Program |
Characterizations and Computation of Controlled Invariants for Monotone Dynamical Systems |
|
Saoud, Adnane | CentraleSupelec |
Arcak, Murat | University of California, Berkeley |
Keywords: Formal Verification/Synthesis, Hybrid systems
Abstract: In this paper, we consider the problem of computing robust controlled invariants for discrete-time monotone dynamical systems. We consider different classes of monotone systems depending on whether the sets of states, control inputs and disturbances respect a given partial order. Then, we present set-based and trajectory-based characterizations of robust controlled invariants for the considered class of systems. Based on these characterizations, we propose an algorithmic approach to the computation of controlled invariants. Finally, we illustrate the proposed approach on an adaptive cruise control problem.
|
|
14:10-14:30, Paper ThBT13.3 | Add to My Program |
Polytope Shaping While Preserving Invariance |
|
Athanasopoulos, Nikolaos | Queen's University Belfast |
Keywords: Constrained control, Linear systems, Hybrid systems
Abstract: We present a framework on how to dynamically change the shape of invariant polytopes via elementary operations, namely, by adding a vertex or a linear inequality in their description, and at the same time maintain invariance. We work on the face lattice of the polytope, and introduce the cone lattice which embeds in an efficient way the geometric and combinatorial structure of the polytope. Combining the above, we can characterize a priori the complexity of the set resulting from either elementary operation. Importantly, using order-theoretic arguments, we identify the parts of the face and cone lattice that must be recalculated in every operation. Finally, we recall and extend equivalent algebraic conditions for preserving invariance of polytopes for a few families of dynamical systems.
|
|
14:30-14:50, Paper ThBT13.4 | Add to My Program |
Maximal Ellipsoid Method for Guaranteed Reachability of Unknown Fully Actuated Systems |
|
Shafa, Taha | University of Illinois Urbana-Champaign |
Ornik, Melkior | University of Illinois Urbana-Champaign |
Keywords: Optimization, Uncertain systems, Autonomous systems
Abstract: In the face of an adverse event, autonomous systems may undergo abrupt changes in their dynamics. In such an event, systems should be able to determine their continuing capabilities to then execute a provably completable task. This paper focuses on the scenario of a change in the system dynamics following an adverse event, aiming to determine the system's guaranteed performance capabilities by finding a set of states that are provably reachable by the system. While it is obviously impossible to exactly determine the reachable set without full knowledge of the system dynamics, we present a method of determining its under-approximation while assuming only partial knowledge of the system structure. Our technical approach relies on showing that an intersection of infinitely many ellipsoids --- available velocity sets for each system consistent with the partial knowledge of the dynamics --- is the same as an intersection of some finite collection of ellipsoids. This result enables us to find a maximal ellipsoid lying in such an intersection, yielding a set of velocities that the system is provably able to pursue regardless of its exact dynamics.
|
|
14:50-15:10, Paper ThBT13.5 | Add to My Program |
Bounding the Distance of Closest Approach to Unsafe Sets with Occupation Measures |
|
Miller, Jared | Northeastern University |
Sznaier, Mario | Northeastern University |
Keywords: Optimization, Algebraic/geometric methods, Nonlinear systems
Abstract: This paper presents a method to lower-bound the distance of closest approach between points on an unsafe set and points along system trajectories. Such a minimal distance is a quantifiable and interpretable certificate of safety of trajectories, as compared to prior art in barrier and density methods which offers a binary indication of safety/unsafety. The distance estimation problem is converted into a infinite-dimensional linear program in occupation measures based on existing work in peak estimation and optimal transport. The moment-SOS hierarchy is used to obtain a sequence of lower bounds obtained through solving semidefinite programs in increasing size, and these lower bounds will converge to the true minimal distance as the degree approaches infinity under mild conditions (e.g. Lipschitz dynamics, compact sets).
|
|
15:10-15:30, Paper ThBT13.6 | Add to My Program |
Solving Least Squares Problems on Partially Ordered Sets |
|
Amice, Alexandre | MIT |
Parrilo, Pablo A. | Massachusetts Institute of Technology |
Keywords: Optimization algorithms, Hierarchical control, Linear systems
Abstract: We study a general class of least-squares problems structured according to a partially ordered set (poset). This is a fundamental optimization problem underlying the design of structured controllers on directed acyclic graphs or posets. We show that the optimality conditions of this problem yield a structured linear system, with sparsity pattern determined by a derived poset known as the poset of intervals. In general, this system could be relatively dense, and thus standard sparse linear algebra techniques may fail to provide significant reduction in computational complexity. Nonetheless, for a broad class of posets called multitrees identified in cite{nayyar_structural_2015} we show that performing elimination according to an order defined by the poset intervals progressively decouples variables, reducing the arithmetic complexity of solving the problem.
|
|
ThBT14 Regular Session, Maya Ballroom VI |
Add to My Program |
Dimensioning and Operation of Energy Systems |
|
|
Chair: Zanvettor, Giovanni Gino | Universita' Di Siena |
Co-Chair: Qin, Junjie | Purdue University |
|
13:30-13:50, Paper ThBT14.1 | Add to My Program |
Spatio-Temporal Thermal Monitoring for Lithium-Ion Batteries Via Kriged Kalman Filtering |
|
Tu, Hao | University of Kansas |
Wang, Yebin | Mitsubishi Electric Research Labs |
Li, Xianglin | Washington University, St. Louis |
Fang, Huazhen | University of Kansas |
Keywords: Energy systems, Estimation, Statistical learning
Abstract: Thermal monitoring plays an essential role in ensuring safe, efficient and long-lasting operation of lithium-ion batteries (LiBs). Existing methods in the literature mostly rely on physics-based thermal models. However, an accurate physical thermal model is practically hard to obtain due to various uncertainties such as uncaptured dynamics, parameter errors, and unknown cooling conditions. Motivated by this problem, this paper considers a data-driven approach named Kriged Kalman filter to estimate the temperature field of LiBs. First, we demonstrate that the evolution of a pouch-type LiB cell's temperature field can be formulated as a spatio-temporal random field in a physically consistent manner. Then, we leverage the Kriged Kalman filter to update and reconstruct the random temperature field sequentially through time using sensor data. Our simulations show that the proposed approach can accurately reconstruct the LiB cell's temperature field with a small number of sensors.
|
|
13:50-14:10, Paper ThBT14.2 | Add to My Program |
An Optimal Battery Sizing Co-Design Approach for Electric Racing Cars |
|
Riva, Giorgio | Politecnico Di Milano |
Radrizzani, Stefano | Politecnico Di Milano |
Panzani, Giulio | Politecnico Di Milano |
Corno, Matteo | Politecnico Di Milano |
Savaresi, Sergio M. | Politecnico Di Milano |
Keywords: Optimization, Automotive control, Mechatronics
Abstract: The electrification trend in automotive industry is having a huge impact also in racing contexts. Indeed, Formula E, the most important competition for electric vehicles (EVs), is currently evolving to its third generation, developing more powerful cars. With respect to fuel-based vehicles, batteries represent the main shortcoming in EVs, due to lower energy and power densities compared to fuel tanks. For this reason, battery sizing in EVs is a real challenge, especially in racing applications, where its size significantly affects vehicle performance. In this letter, the battery sizing for electric racing cars is formulated as a co-design problem, tackled through a double-layer nested optimization approach. The external layer optimizes the battery pack design parameters and the internal one computes its optimal use, mainly the battery power request, to achieve the minimum race time. Finally, the proposed approach is applied to a meaningful case study, that is the Formula E battery pack sizing for the 2021 Rome and Valencia ePrix.
|
|
14:10-14:30, Paper ThBT14.3 | Add to My Program |
Optimization of Spatial Infrastructure for EV Charging |
|
Paganini, Fernando | Universidad ORT Uruguay |
Espindola, Emiliano | Universidad ORT Uruguay |
Marvid, Diego | Universidad ORT Uruguay |
Ferragut, Andres | Universidad ORT, Uruguay |
Keywords: Optimization, Smart grid, Automotive systems
Abstract: We consider the problem of deploying a spatial supply infrastructure to serve a distributed demand, motivated by Electrical Vehicle charging facilities. We present a series of optimization problems, which include the global transport cost from demand points to supply stations with bounded capacity, and also model demand elasticity. When supply locations are fixed, linear programs of the class of the Monge-Kantorovich problem apply; here our focus is showing that integer solutions that respect the indivisibility of demand units can be found. If locations are part of the design, the problem is not convex; we study iterative methods that generalize the clustering literature. Also, we investigate the issue of sparsity in the allocation, invoking mixed-integer linear program formulations of the facility location problem. The features and tractability of these methods are demonstrated in illustrative simulations. The paper ends by outlining a methodology through which these tools may be used jointly for the progressive deployment and operation of an EV charging infrastructure.
|
|
14:30-14:50, Paper ThBT14.4 | Add to My Program |
A Data-Driven Dynamic Pricing Scheme for EV Charging Stations with Price-Sensitive Customers |
|
Zanvettor, Giovanni Gino | Universita' Di Siena |
Fochesato, Marta | ETH Zurich |
Casini, Marco | Universita' Di Siena |
Vicino, Antonio | Univ. Di Siena |
Keywords: Optimization, Stochastic systems, Energy systems
Abstract: The increasing adoption of electric vehicles has forced power network providers to deal with open issues both in terms of grid stability and electricity market design. On the latter direction, a challenging problem is represented by the development of probabilistic algorithms capable of computing optimal time-varying price profiles for EV charging stations to induce a desired aggregative behavior. Here, the inclusion of demand elasticity represents a key feature to provide usable schemes for real-world cases. In this paper, we propose an ``estimate-then-optimize'' framework for optimal dynamic pricing computation in the presence of price-sensitive customers. It consists of an {estimation} step based on nonparametric kernel estimation to infer about the demand elasticity, followed by an {optimization} step to maximize the expected daily profit. We describe the charging process via a probabilistic framework and the benefits of the proposed formulation are shown through extensive numerical experiments.
|
|
14:50-15:10, Paper ThBT14.5 | Add to My Program |
Scheduling and Pricing Non-Preemptive Electric Loads: A Convex Approach |
|
Chen, Mingyu | Huazhong University of Science and Technology |
Qin, Junjie | Purdue University |
Keywords: Smart grid, Power systems, Energy systems
Abstract: This paper studies the problem of scheduling and pricing a large collection of non-preemptive electric loads. For the scheduling problem, we formulate the problem as a mixed integer program and propose an algorithm for solving it based on its convex relaxation. We establish that the admissible schedule produced by our algorithm is near optimal for any finite number of loads, and asymptotically optimal in a per-load cost sense when the number of loads grows to infinite. For the pricing problem, we analyze a natural marginal pricing policy defined for the convex relaxation. We show that the pricing policy is approximately budget adequate, and self-scheduling. Furthermore, we demonstrate that the pricing policy provides proper incentives for each flexible load to report its true flexibility window when the maximum duration of all loads is one. When there are some loads spanning multiple time slots, we show that this flexibility revealing property fails to hold. Our results suggest that while it is possible to achieve most of the aforementioned desirable properties by scheduling and pricing non-preemptive loads via convex relaxation, the marginal pricing rule needs to be adjusted to incentivize flexibility revealing for general non-preemptive loads.
|
|
15:10-15:30, Paper ThBT14.6 | Add to My Program |
Learning Local Volt/Var Controllers towards Efficient Network Operation with Stability Guarantees |
|
Cavraro, Guido | National Renewable Energy Laboratory |
Yuan, Zhenyi | University of California, San Diego |
Singh, Manish Kumar | University of Minnesota |
Cortes, Jorge | University of California, San Diego |
Keywords: Smart grid, Decentralized control, Learning
Abstract: This paper considers the problem of voltage regulation in distribution networks. The primary motivation is to keep voltages within preassigned operating limits by commanding the reactive power output of distributed energy resources (DERs) deployed in the grid. We develop a framework for developing local Volt/Var control that comprises two main steps. In the first, by exploiting historical data and for each DER, we learn a function representing the desirable equilibrium points for the power network. These points approximate solutions of an Optimal Power Flow (OPF) problem. In the second, we propose a control scheme for steering the network towards these favorable configurations. Theoretical conditions are derived to formally guarantee the stability of the developed control scheme, and numerical simulations illustrate the effectiveness of the proposed approach.
|
|
ThBT15 Regular Session, Maya Ballroom VII |
Add to My Program |
Game Theory V |
|
|
Chair: Brown, Philip N. | University of Colorado, Colorado Springs |
Co-Chair: Yildiz, Yildiray | Bilkent University |
|
13:30-13:50, Paper ThBT15.1 | Add to My Program |
Hierarchical Decompositions of Stochastic Pursuit-Evasion Games |
|
Guan, Yue | Georgia Institute of Technology |
Afshari, Mohammad | McGill University |
Zhang, Qifan | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Hierarchical control, Game theory
Abstract: In this work we present a hierarchical framework for solving discrete stochastic pursuit-evasion games (PEGs) in large grid worlds. Given a partition of the grid world into superstates, the proposed approach creates a two-resolution decision-making process, which consists of a set of local PEGs at the original state level and an aggregated PEG at the superstate level. With a much smaller state space, both the local games and the aggregated game can be easily solved to a Nash equilibrium. Through numerical simulations, we show that the proposed hierarchical framework significantly reduces the computation overhead, while still maintaining a satisfactory performance level when competing against the flat Nash policies.
|
|
13:50-14:10, Paper ThBT15.2 | Add to My Program |
Driver Modeling Using Continuous Reasoning Levels: A Game Theoretical Approach |
|
Yaldiz, Cem Okan | Georgia Institute of Technology |
Yildiz, Yildiray | Bilkent University |
Keywords: Modeling, Game theory, Machine learning
Abstract: The focus of this paper is designing a game theoretical method using a continuous policy space for modeling human driver interactions on highway traffic. The proposed method is based on Gaussian Processes and developed as an enrichment to the hierarchical decision-making concept called "level-k reasoning". This concept conventionally assigns discrete levels of behaviors to agents. Although shown to be an effective modeling tool, the level-k reasoning approach may pose undesired constraints for predicting human decision making due to a limited number (usually 2 or 3) of driver policies it provides. The proposed approach is put forward to fill this gap in the literature by introducing a continuous domain framework that enables an infinite policy space. By using the approach presented in this paper, more accurate driver models are obtained, which can be employed for creating high fidelity simulation platforms for the validation of autonomous vehicle control algorithms. The proposed method is validated on a traffic dataset and compared with the conventional level-k approach to demonstrate its contributions and implications.
|
|
14:10-14:30, Paper ThBT15.3 | Add to My Program |
Self-Organized Set Cover Via Nash Equilibrium Learning and Selection |
|
Sun, Changhao | Qian Xuesen Laboratory of Space Technology, China Academy of Spa |
Zhou, Qingrui | Qian Xuesen Laboratory of Space Technology, China Academy of Spa |
Ma, Xiaowei | Nanjing Mobile Communication & Computing Innovation Institute |
Qiu, Huaxin | Beihang University |
Feng, Yuting | China Academy of Space Technology |
Liu, Jiaxin | China Academy of Space Technology |
Keywords: Optimization algorithms, Game theory, Agents-based systems
Abstract: This paper focuses on the weighted set cover problem in networking systems and presents a fully distributed algorithm from the perspective of Nash equilibrium learning and selection. By viewing each set as an agent, we recast the problem as a networked ordinal potential game and classify the resulting Nash equilibrium into two categories. We show that each inferior Nash equilibrium (INE) could always be improved via local action exchange and better approximations could be achieved via self-organized selection among superior Nash equilibria (SNEs). By showing the existence of an improvement path that leads any action profile to an SNE, we prove that our algorithm converges in finite time to a conventional Nash equilibrium, where the joint action is a selected SNE. Comparison experiments with typical methods demonstrate the superiority to the state of the art.
|
|
14:30-14:50, Paper ThBT15.4 | Add to My Program |
Nash Equilibrium Seeking under Partial Decision Information: Monotonicity, Smoothness and Proximal-Point Algorithms |
|
Bianchi, Mattia | Delft University of Technology |
Grammatico, Sergio | Delft Univ. of Tech |
Keywords: Optimization algorithms, Game theory, Variational methods
Abstract: We address Nash equilibrium problems in a partial-decision information scenario, where each agent can only exchange information with some neighbors, while its cost function possibly depends on the strategies of all agents. We characterize the relation between several monotonicity and smoothness conditions postulated in the literature. Furthermore, we prove convergence of a preconditioned proximal-point algorithm, under a restricted monotonicity property that allows for a non-Lipschitz, non-continuous game mapping.
|
|
14:50-15:10, Paper ThBT15.5 | Add to My Program |
Viability and Exponentially Stable Trajectories for Differential Inclusions in Wasserstein Space |
|
Frankowska, Helene | CNRS and Sorbonne University, Campus Pierre Et Marie Curie |
Bonnet, Benoît | Laas - Cnrs |
Keywords: Constrained control, Mean field games, Lyapunov methods
Abstract: In this article, we prove a general viability theorem for continuity inclusions in Wasserstein spaces, and provide an application thereof to the existence of exponentially stable trajectories obtained via the second method of Lyapunov.
|
|
15:10-15:30, Paper ThBT15.6 | Add to My Program |
A Safe Pricing Mechanism for Distributed Resource Allocation with Bandit Feedback |
|
Hutchinson, Spencer | University of California, Santa Barbara |
Turan, Berkay | University of California Santa Barbara |
Alizadeh, Mahnoosh | University of California Santa Barbara |
Keywords: Optimization, Agents-based systems, Human-in-the-loop control
Abstract: Pricing mechanisms are commonly advocated as a main tool to shape customers’ demand in societal scale networked infrastructure such as power or transportation systems. Given that the price response function of each user is generally considered private and unknown, most existing algorithms rely on protocols that explicitly or implicitly solicit this information in order to design prices. However, approaches that rely solely on learning the price response through repeated interactions are more practical and gaining traction. In this paper, we model each customer’s price response by an unknown parameter vector and we design a resource pricing mechanism to manage demand in order to maximize total welfare while ensuring that a set of linear constraints on the consumption are satisfied at all time steps with high probability. We propose an algorithm to address this problem that utilizes the well known principle of optimism in the face of uncertainty (OFU), while simultaneously being pessimistic with respect to constraint violation. Our analysis of this algorithm shows that, with high probability, it will not violate the constraints and will achieve O(log(T)sqrt{T}) regret. Numerical experiments validate these results and demonstrate how our algorithm can be applied to demand response management in power distribution systems.
|
|
ThBT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Lyapunov Methods I |
|
|
Chair: Dotta, Daniel | University of Camp |
Co-Chair: Peet, Matthew M. | Arizona State University |
|
13:30-13:50, Paper ThBT16.1 | Add to My Program |
Lyapunov-Based Transient Stability Analysis |
|
Gao, Jianli | Imperial College London |
Chaudhuri, Balarko | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Lyapunov methods, Stability of nonlinear systems, Power systems
Abstract: The paper presents an analytical control solution to the problem of transient stabilization of lossy multi-machine power systems. Firstly, a new form of control Lyapunov function candidates with a flexible potential-energy-like term is proposed. This is achieved mainly by introducing an auxiliary state that contributes to the derivation of a cross-term. Based on the Lyapunov function candidates, a new control law ensuring asymptotic stability of the desired closed-loop operating equilibrium is proposed. Finally, a case study on the model of a two-machine system to illustrate the effectiveness of the proposed control solution is presented.
|
|
13:50-14:10, Paper ThBT16.2 | Add to My Program |
Estimation of Order of Settling-Time Using Strict Lyapunov Function for Finite-Time PD Control |
|
Fukui, Yoshiro | Kyushu Institute of Technology |
Keywords: Lyapunov methods, Stability of nonlinear systems, PID control
Abstract: This paper proposes a design method of an upper bound of the settling-time for the closed-loop system of the second-ordered system and the finite-time PD control. The proposed method provides the upper bound and its order with respect to P-gain and D-gain, where the order means that when the order is negative (resp. positive), the larger the gains, the faster (resp. slower) the convergence. Through analysis using the strict Lyapunov function, the proposed method provides a better order of the upper bound than the conventional one. In addition, we show a sufficient condition of the gains that achieve the negative order.
|
|
14:10-14:30, Paper ThBT16.3 | Add to My Program |
Converging Approximations of Attractors Via Almost Lyapunov Functions and Semidefinite Programming |
|
Schlosser, Corbinian | CNRS-LAAS |
Keywords: Lyapunov methods, Optimization, Stability of nonlinear systems
Abstract: In this paper we combine the approaches from previous work and an article by Morgan and Peet for approximating global attractors. In the first mentioned method the global attractors is arbitrarily well approximated by sets that are not necessarily positively invariant. On the contrary, the second mentioned method provides supersets of the global attractor which are positively invariant but not necessarily converging. In this paper we marry both approaches by combining their techniques and get converging outer approximations of the global attractor consisting of positively invariant sets. Because both the underlying methods are based on convex optimization via sum-of-squares techniques the same is true for our proposed method. The method is easy to use and numerical examples illustrate the procedure.
|
|
14:30-14:50, Paper ThBT16.4 | Add to My Program |
Combining Trajectory Data with Analytical Lyapunov Functions for Improved Region of Attraction Estimation |
|
Lugnani Fernandes, Lucas | University of Campinas |
Jones, Morgan | Arizona State University |
Alberto, Luis Fernando Costa | University of Sao Paulo |
Peet, Matthew M. | Arizona State University |
Dotta, Daniel | University of Camp |
Keywords: Lyapunov methods, Power systems, Stability of nonlinear systems
Abstract: The increasing uptake of inverter based resources (IBRs) has resulted in many new challenges for power system operators around the world. The high level of complexity of IBR generators makes accurate classical model-based stability analysis a difficult task. This paper proposes a novel methodology for solving the problem of estimating the Region of Attraction (ROA) of a nonlinear system by combining classical model based methods with modern data driven methods. Our method yields certifiable inner approximations of the ROA, typical to that of model based methods, but also harnesses trajectory data to yield an improved accurate ROA estimation. The method is carried out by using analytical Lyapunov functions, such as energy functions, in combination with data that is used to fit a converse Lyapunov function. Our methodology is independent of the function fitting method used. In this work, for implementation purposes, we use Bernstein polynomials to function fit. Several numerical examples of ROA estimation are provided, including the Single Machine Infinite Bus (SMIB) system, a three machine system and the Van-der-Pol system.
|
|
14:50-15:10, Paper ThBT16.5 | Add to My Program |
Koopman-Based Neural Lyapunov Functions for General Attractors |
|
Deka, Shankar | University of California, Berkeley |
Marco Valle, Alonso | University of California Berkeley |
Tomlin, Claire J. | UC Berkeley |
Keywords: Lyapunov methods, Neural networks, Stability of nonlinear systems
Abstract: Koopman spectral theory has grown in the past decade as a powerful tool for dynamical systems analysis and control. In this paper, we show how recent data-driven techniques for estimating Koopman-Invariant subspaces with neural networks can be leveraged to extract Lyapunov certificates for the underlying system. In our work, we specifically focus on systems with a limit-cycle, beyond just an isolated equilibrium point, and use Koopman eigenfunctions to efficiently parameterize candidate Lyapunov functions to construct forward-invariant sets under some (unknown) attractor dynamics. Additionally, when the dynamics are polynomial and when neural networks are replaced by polynomials as a choice of function approximators in our approach, one can further leverage Sum-of-Squares programs and/or nonlinear programs to yield provably correct Lyapunov certificates. In such a polynomial case, our Koopman-based approach for constructing Lyapunov functions uses significantly fewer decision variables compared to directly formulating and solving a Sum-of-Squares optimization problem.
|
|
15:10-15:30, Paper ThBT16.6 | Add to My Program |
Practical Stability and Attractors of Systems with Bounded Perturbations |
|
Colotti, Alessandro | Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004 |
Goldsztejn, Alexandre | LS2N - Ecole Centrale De Nantes |
Keywords: Lyapunov methods, Uncertain systems, Stability of nonlinear systems
Abstract: The classical Lyapunov analysis of stable fixed points is extended to perturbed dynamical systems that may not have any fixed point due to perturbations. Practical stability is meant here to assess the convergence of such systems. This is achieved by investigating a parametric optimization problem encoding some worst-case Lie derivative. Key properties of this parametric optimization problem are formulated. The proposed framework is finally applied to a class of perturbed linear systems tracking a highly nonlinear reference.
|
|
ThBT17 Regular Session, Acapulco |
Add to My Program |
Quantum Information and Control I |
|
|
Chair: Pozzoli, Eugenio | CNRS, Université Bourgogne Franche-Comté |
Co-Chair: Rouchon, Pierre | Mines ParisTech |
|
13:30-13:50, Paper ThBT17.1 | Add to My Program |
Single-Input Perturbative Control of a Quantum Symmetric Rotor |
|
Pozzoli, Eugenio | CNRS, Université Bourgogne Franche-Comté |
Chambrion, Thomas | Université De Bourgogne |
Keywords: Quantum information and control, Distributed parameter systems, Algebraic/geometric methods
Abstract: We consider the Schrödinger partial differential equation of a rotating symmetric rigid molecule (symmetric rotor) driven by a z-linearly polarized electric field, as prototype of degenerate infinite-dimensional bilinear control system. By introducing an abstract perturbative criterium, we classify its simultaneous approximate controllability; based on this insight, we numerically perform an orientational selective transfer of rotational population.
|
|
13:50-14:10, Paper ThBT17.2 | Add to My Program |
Data-Driven Spectral Analysis of Quantum Spin Networks with Limited Access Using Hankel Dynamic Mode Decomposition |
|
Kato, Yuzuru | Future University Hakodate |
Nakao, Hiroya | Tokyo Institute of Technology |
Keywords: Quantum information and control, Nonlinear systems identification
Abstract: Dynamic mode decomposition (DMD) is an equation-free, data-driven method for the prediction and control of complex nonlinear dynamical systems. A DMD method for data-driven quantum control was proposed recently and numerically demonstrated in a single spin system where time series of a complete orthonormal set of Hamiltonian is available. In quantum spin networks, it is generally difficult to access all spins but only a small set of spins is practically accessible. In this paper, we formulate a Hankel-DMD method for open quantum systems and extend the applicability of the DMD framework to quantum spin networks with limited access. We demonstrate that Hankel DMD can precisely evaluate the eigenvalues and decompose the dynamics into the respective oscillatory eigenmodes from the observed data. In particular, it can reveal the decoherence-free dynamics in spin networks possessing eigenvalues on the imaginary axis.
|
|
14:10-14:30, Paper ThBT17.3 | Add to My Program |
Exponential Convergence of a Dissipative Quantum System towards Finite-Energy Grid States of an Oscillator |
|
Sellem, Lev-Arcady | Laboratoire De Physique De l’Ecole Normale Supérieure, Mines Par |
Campagne-Ibarcq, Philippe | Laboratoire De Physique De l’Ecole Normale Supérieure, Mines Par |
Mirrahimi, Mazyar | INRIA Paris-Rocquencourt |
Sarlette, Alain | Laboratoire De Physique De l’Ecole Normale Supérieure, Inria, C |
Rouchon, Pierre | Mines ParisTech |
Keywords: Quantum information and control, Lyapunov methods
Abstract: Based on the stabilizer formalism underlying Quantum Error Correction (QEC), the design of an original Lindblad master equation for the density operator of a quantum harmonic oscillator is proposed. This Lindblad dynamics stabilizes exactly the finite-energy grid states introduced in 2001 by Gottesman, Kitaev and Preskill for quantum computation. Stabilization results from an exponential Lyapunov function with an explicit lower-bound on the convergence rate. Numerical simulations indicate the potential interest of such autonomous QEC in presence of non-negligible photon-losses.
|
|
14:30-14:50, Paper ThBT17.4 | Add to My Program |
Minimal Resources for Exact Simulation of Quantum Walks |
|
Grigoletto, Tommaso | University of Padova |
Ticozzi, Francesco | Università Di Padova |
Keywords: Quantum information and control, Linear systems, Model/Controller reduction
Abstract: Quantum walks are stochastic processes generated by a quantum evolution mechanism, allowing for speed-up in spreading and hitting-time performance with respect to their classical counterparts. Investigating the role of the memory effects for these models, we address the problem of finding the minimal linear system that exactly reproduces the evolution of the output distribution of a quantum walk. After adapting the classical approach to our setting, we investigate analytically and numerically the structural complexity of quantum walks, both in general and in particular examples. Lastly, we focus on Grover's algorithm, a quantum computing search algorithm that can be re-framed as a quantum walk, showing that it allows for a dramatic reduction in its representation.
|
|
14:50-15:10, Paper ThBT17.5 | Add to My Program |
Parameter Estimation for Quantum Trajectories: Convergence Result |
|
Bompais, Mael | University Paris Saclay |
Amini, Nina H. | CNRS, L2S, CentraleSupelec |
Pellegrini, Clement | Université Paul Sabatier, Toulouse |
Keywords: Quantum information and control, Estimation, Stochastic systems
Abstract: A quantum trajectory describes the evolution of a quantum system undergoing indirect measurement. In the discrete-time setting, the state of the system is updated by applying Kraus operators according to the measurement results. From an experimental perspective, these Kraus operators can depend on unknown physical parameters p. An interesting and powerful method has been proposed in [1] to estimate a parameter in a finite set; however, complete results of convergence were lacking. This article fills this gap by rigorously showing the consistency of the method, whereas there was only numerical evidence so far. When the parameter belongs to a continuous set, we propose an algorithm to approach its value and show simulation results.
|
|
15:10-15:30, Paper ThBT17.6 | Add to My Program |
NMR Pulse Design Using Moment Dynamical Systems |
|
Ning, Nancy | Washington University in St.Louis |
Paes de Lima, André Luiz | Washington University in St. Louis |
Li, Jr-Shin | Washington University in St. Louis |
Keywords: Quantum information and control, Large-scale systems, Control applications
Abstract: We investigate pulse design problems arising in diverse applications in quantum science and technology. In modern approaches, pulse design is cast as an ensemble control problem involving the control of a continuum of nuclear spin systems, which, however, is typically challenging to solve. In this paper, we present a new pulse design paradigm by introducing moment representations of the spin ensemble system and transforming the ensemble control problem associated to pulse design to a moment control problem. We show that feasible and optimal pulses can be effectively designed using the moment system with performance guarantees across the entire ensemble. We also illustrate the versatility and robustness of our moment-based approach by designing uniform and selective pulses essential to enable prominent applications in magnetic resonance.
|
|
ThCT01 Invited Session, Tulum Ballroom A |
Add to My Program |
Distributed Optimization and Learning for Networked Systems II |
|
|
Chair: Lessard, Laurent | Northeastern University |
Co-Chair: Uribe, Cesar A. | Rice University |
Organizer: Uribe, Cesar A. | Rice University |
Organizer: Yang, Tao | Northeastern University |
Organizer: Lu, Jie | ShanghaiTech University |
Organizer: Niu, Xiaochun | Northwestern University |
Organizer: Wei, Ermin | Northwestern Univeristy |
Organizer: Nedich, Angelia | Arizona State University |
|
16:00-16:20, Paper ThCT01.1 | Add to My Program |
A Universal Decomposition for Distributed Optimization Algorithms |
|
Van Scoy, Bryan | Miami University |
Lessard, Laurent | Northeastern University |
Keywords: Optimization algorithms, Networked control systems
Abstract: In the distributed optimization problem for a multi-agent system, each agent knows a local function and must find a minimizer of the sum of all agents' local functions by performing a combination of local gradient evaluations and communicating information with neighboring agents. We prove that every distributed optimization algorithm can be factored into a centralized optimization method and a second-order consensus estimator, effectively separating the "optimization" and "consensus" tasks. We illustrate this fact by providing the decomposition for many recently proposed distributed optimization algorithms. Conversely, we prove that any optimization method that converges in the centralized setting can be combined with any second-order consensus estimator to form a distributed optimization algorithm that converges in the multi-agent setting. Finally, we describe how our decomposition may lead to a more systematic algorithm design methodology.
|
|
16:20-16:40, Paper ThCT01.2 | Add to My Program |
A Zeroth-Order Momentum Method for Risk-Averse Online Convex Games |
|
Wang, Zifan | KTH Royal Institute of Technology |
Shen, Yi | Duke University |
Bell, Zachary I. | Air Force |
Nivison, Scott | Johns Hopkins University Applied Physics Lab |
Zavlanos, Michael M. | Duke University |
Johansson, Karl H. | Royal Institute of Technology |
Keywords: Optimization algorithms, Agents-based systems, Game theory
Abstract: We consider risk-averse learning in repeated unknown games where the goal of the agents is to minimize their individual risk of incurring significantly high cost. Specifically, the agents use the conditional value at risk (CVaR) as a risk measure and rely on bandit feedback in the form of the cost values of the selected actions at every episode to estimate their CVaR values and update their actions. A major challenge in using bandit feedback to estimate CVaR is that the agents can only access their own cost values, which, however, depend on the actions of all agents. To address this challenge, we propose a new risk-averse learning algorithm with momentum that utilizes the full historical information on the cost values. We show that this algorithm achieves sub-linear regret and matches the best known algorithms in the literature. We provide numerical experiments for a Cournot game that show that our method outperforms existing methods.
|
|
16:40-17:00, Paper ThCT01.3 | Add to My Program |
Resource Allocation in Open Multi-Agent Systems: An Online Optimization Analysis (I) |
|
Vizuete, Renato | CentraleSupélec |
Monnoyer de Galland de Carnières, Charles | UCLouvain |
Hendrickx, Julien M. | UCLouvain |
Panteley, Elena | CNRS |
Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
Keywords: Optimization, Agents-based systems, Discrete event systems
Abstract: The resource allocation problem consists of the optimal distribution of a budget between agents in a group. We consider such a problem in the context of open systems, where agents can be replaced at some time instances. These replacements lead to variations in both the budget and the total cost function that hinder the overall network's performance. For a simple setting, we analyze the performance of the Random Coordinate Descent algorithm (RCD) using tools similar to those commonly used in online optimization. In particular, we study the accumulated errors that compare solutions issued from the RCD algorithm and the optimal solution or the non-collaborating selfish strategy and we derive some bounds in expectation for these accumulated errors.
|
|
17:00-17:20, Paper ThCT01.4 | Add to My Program |
Automated Performance Estimation for Decentralized Optimization Via Network Size Independent Problems (I) |
|
Colla, Sebastien | UCLouvain |
Hendrickx, Julien M. | UCLouvain |
Keywords: Optimization, Agents-based systems
Abstract: We develop a novel formulation of the Performance Estimation Problem (PEP) for decentralized optimization whose size is independent of the number of agents in the network. The PEP approach allows computing automatically the worst-case performance and worst-case instance of first-order optimization methods by solving an SDP. Unlike previous work, the size of our new PEP formulation is independent of the network size. For this purpose, we take a global view of the decentralized problem and we also decouple the consensus subspace and its orthogonal complement. We apply our methodology to different decentralized methods such as DGD, DIGing and EXTRA and obtain numerically tight performance guarantees that are valid for any network size.
|
|
17:20-17:40, Paper ThCT01.5 | Add to My Program |
Decentralized Federated Learning for Over-Parameterized Models (I) |
|
Qin, Tiancheng | University of Illinois at Urbana-Champaign |
Etesami, S. Rasoul | University of Illinois at Urbana-Champaign |
Uribe, Cesar A. | Rice University |
Keywords: Optimization algorithms, Distributed parameter systems, Machine learning
Abstract: Modern machine learning features models that are often highly expressive and over-parameterized. Such models have been shown to interpolate the data and drive the empirical loss close to zero. We analyze the convergence rate of decentralized stochastic gradient descent (SGD), which is at the core of decentralized federated learning (DFL), for over-parameterized models. Our analysis covers the setting of decentralized SGD with time-varying networks, local updates, and heterogeneous data. We show provable convergence guarantees for convex and non-convex functions that either improve upon the existing literature or are the first for their corresponding regime.
|
|
17:40-18:00, Paper ThCT01.6 | Add to My Program |
Byzantine-Robust Federated Linear Bandits (I) |
|
Jadbabaie, Ali | MIT |
Li, Haochuan | Massachusetts Institute of Technology |
Qian, Jian | MIT |
Tian, Yi | MIT |
Keywords: Distributed control, Fault tolerant systems, Machine learning
Abstract: In this paper, we study a linear bandit optimization problem in a federated setting where a large collection of distributed agents collaboratively learn a common linear bandit model. Standard federated learning algorithms applied to this setting are vulnerable to Byzantine attacks on even a small fraction of agents. We propose a novel algorithm with a robust aggregation oracle that utilizes the geometric median. We prove that our proposed algorithm is robust to Byzantine attacks on fewer than half of agents and achieves a sublinear tilde{mathcal{O}}({T^{3/4}}) regret with mathcal{O}(sqrt{T}) steps of communication in T steps. Moreover, we make our algorithm differentially private via a tree-based mechanism. Finally, if the level of corruption is known to be small, we show that using the geometric median of mean oracle for robust aggregation further improves the regret bound.
|
|
ThCT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Decentralized Control |
|
|
Chair: Cucuzzella, Michele | University of Pavia |
Co-Chair: Matni, Nikolai | University of Pennsylvania |
|
16:00-16:20, Paper ThCT02.1 | Add to My Program |
Decentralized Unified Position-Attitude Control of Nonlinear UAVs |
|
Zhang, Boyang | Duke University |
Gavin, Henri P. | Duke University |
Keywords: Decentralized control, Constrained control, Nonlinear systems
Abstract: In this paper, we propose a fully decentralized, nonlinear feedback control law to maneuver multiple nonlinear quadrotor UAVs with interagent collision avoidance and natural deadlock resolution. Most existing work on this problem adopts a cascaded position-attitude control scheme and utilizes linearized dynamics in controller synthesis. In this work, each UAV controls its position and attitude simultaneously in one unified step, with no dynamics linearization involved at any stage. The proposed scheme is based on a generalization of Gauss’s principle of least constraint that allows constrained systems of any order and any type and that identifies, differentiates, stabilizes, partitions, and incorporates the active constraints at each time instant. The control actions result from asymptotically stabilizing the active constraints by user-specified natural frequencies and damping ratios according to a generalized constraint stabilization. Two numerical examples are used to demonstrate the effectiveness of the present method, whose performance on collision avoidance and deadlock resolution is sufficiently close to that of a centralized method.
|
|
16:20-16:40, Paper ThCT02.2 | Add to My Program |
A Dual Accelerated Method for a Class of Distributed Optimization Problems: From Consensus to Decentralized Policy Evaluation |
|
Zhang, Sheng | Georgia Institute of Technology |
Pananjady, Ashwin | Georgia Institute of Technology |
Romberg, Justin | Georgia Tech |
Keywords: Decentralized control, Distributed control, Cooperative control
Abstract: Motivated by decentralized sensing and policy evaluation problems, we consider a particular type of distributed stochastic optimization problem over a network, called the online stochastic distributed averaging problem. We design a dual-based method for this distributed consensus problem with Polyak--Ruppert averaging and analyze its behavior. We show that the proposed algorithm attains an accelerated deterministic error depending optimally on the condition number of the network, and also that it has an order-optimal stochastic error. This improves on the guarantees of state-of-the-art distributed stochastic optimization algorithms when specialized to this setting, and yields---among other things---corollaries for decentralized policy evaluation. Our proofs rely on explicitly studying the evolution of several relevant linear systems, and may be of independent interest.
|
|
16:40-17:00, Paper ThCT02.3 | Add to My Program |
Decentralized Signal Temporal Logic Control for Perturbed Interconnected Systems Via Assume-Guarantee Contract Optimization |
|
Ghasemi, Kasra | Boston University |
Sadraddini, Sadra | Dexai Robotics |
Belta, Calin | Boston University |
Keywords: Decentralized control, Large-scale systems, Uncertain systems
Abstract: We develop a decentralized control method for a network of perturbed linear systems with dynamical couplings subject to Signal Temporal Logic (STL) specifications. We first transform the STL requirements into set containment problems, then we develop controllers to solve these problems. Our approach is based on treating the couplings between subsystems as disturbances, which are bounded sets that the subsystems negotiate in the form of parametric assume-guarantee contracts. The set containment requirements and parameterized contracts are added to the subsystems' constraints. We introduce a centralized optimization problem to derive the contracts, reachability tubes, and decentralized closed-loop control laws. We show that, when the STL formula is separable with respect to the subsystems, the centralized optimization problem can be solved in a distributed way, which scales to large systems. We present formal theoretical guarantees on robustness of STL satisfaction. The effectiveness of the proposed method is demonstrated via a power network case study.
|
|
17:00-17:20, Paper ThCT02.4 | Add to My Program |
Voltage Control of DC Microgrids: Robustness for Unknown ZIP-Loads |
|
Cucuzzella, Michele | University of Pavia |
Kosaraju, Krishna Chaitanya | University of Notre Dame |
Scherpen, Jacquelien M.A. | University of Groningen |
Keywords: Decentralized control, Stability of nonlinear systems, Control applications
Abstract: In this letter we propose a new passivity-based control technique for Buck converter based DC microgrids comprising ZIP-loads, i.e., loads with the parallel combination of constant impedance (Z), current (I) and power (P). More precisely, we propose a novel passifying input and a storage function based on the mixed potential function introduced by Brayton and Moser, relaxing restrictive (sufficient) conditions on Z, P and the voltage reference, which are usually assumed to be satisfied in the literature. Consequently, we develop a new passivity-based controller that is robust with respect to uncertain ZIP-loads.
|
|
17:20-17:40, Paper ThCT02.5 | Add to My Program |
On Infinite-Horizon System Level Synthesis Problems |
|
Yu, Jing | California Institute of Technology |
Kjellqvist, Olle | Lund University |
Keywords: Distributed control, Large-scale systems, Decentralized control
Abstract: System level synthesis is a promising approach that formulates structured optimal controller synthesis problems as convex problems. This work solves the distributed linear-quadratic regulator problem under communication constraints directly in infinite-dimensional space, without the finite-impulse response relaxation common in related work. Our method can also be used to construct optimal distributed Kalman filters with limited information exchange. We combine the distributed Kalman filter with state-feedback control to perform localized LQG control with communication constraints. We provide agent-level implementation details for the resulting output-feedback state-space controller.
|
|
17:40-18:00, Paper ThCT02.6 | Add to My Program |
Distributed Optimal Control of Graph Symmetric Systems Via Graph Filters |
|
Yang, Fengjun | University of Pennsylvania |
Gama, Fernando | Rice University |
Sojoudi, Somayeh | UC Berkeley |
Matni, Nikolai | University of Pennsylvania |
Keywords: Distributed control, Decentralized control, Large-scale systems
Abstract: Designing distributed optimal controllers subject to communication constraints is a difficult problem unless structural assumptions are imposed on the underlying dynamics and information exchange structure, e.g., sparsity, delay, or spatial invariance. In this paper, we borrow ideas from graph signal processing and define and analyze a class of emph{Graph Symmetric Systems} (GSSs), which are systems that are symmetric with respect to an underlying graph topology. We show that for linear quadratic problems subject to dynamics defined by a GSS, the optimal centralized controller is given by a novel class of graph filters with transfer function valued filter taps and can be implemented via distributed message passing. We then propose several methods for approximating the optimal centralized graph filter by a distributed controller only requiring communication with a small subset of neighboring subsystems. We further provide stability and suboptimality guarantees for the resulting distributed controllers. Finally, we empirically demonstrate that our approach allows for a principled tradeoff between communication cost and performance while guaranteeing stability. Our results can be viewed as a first step towards bridging the fields of distributed optimal control and graph signal processing.
|
|
ThCT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Autonomous Vehicles |
|
|
Chair: Ma, Yao | Texas Tech University |
Co-Chair: Alonso-Mora, Javier | Delft University of Technology |
|
16:00-16:20, Paper ThCT03.1 | Add to My Program |
A Robust Receding-Horizon Collision Avoidance Strategy for Constrained Unmanned Ground Vehicles Moving in Shared Planar Environments |
|
Savehshemshaki, Shima | Concordia University |
Lucia, Walter | Concordia University |
Keywords: Autonomous vehicles, Traffic control, Predictive control for linear systems
Abstract: This paper deals with the reference tracking and collision avoidance control problems for constrained unmanned ground vehicles moving in shared planar environments. The proposed solution improves the strategy developed in [1] by minimizing the number of vehicle's full stops required to avoid collisions. This is achieved through a modified traffic manager algorithm that can exploit, in a receding horizon fashion, a preview of the successive vehicle's waypoints. Such information is properly used to speed up or speed down the vehicles and minimize the chances of future collisions and vehicle's full stops. The proposed control solutions enjoys recursive feasibility regardless of the waypoint prediction horizon.
|
|
16:20-16:40, Paper ThCT03.2 | Add to My Program |
Online Multi-Robot Task Assignment with Stochastic Blockages |
|
Wilde, Nils | TU Delft |
Alonso-Mora, Javier | MIT |
Keywords: Autonomous robots, Stochastic systems, Transportation networks
Abstract: In this paper we study the multi-robot task assignment problem with tasks that appear online and need to be serviced within a fixed time window in an uncertain environment. For example, when deployed in dynamic, human-centered environments, the team of robots may not have perfect information about the environment. Parts of the environment may temporarily become blocked and blockages may only be observed on location. While numerous variants of the Canadian Traveler Problem describe the path planning aspect of this problem, few work has been done on multi-robot task allocation (MRTA) under this type of uncertainty. In this paper, we introduce and theoretically analyze the problem of MRTA with recoverable online blockages. Based on a stochastic blockage model, we compute offline tours using the expected travel costs for the online routing problem. The cost of the offline tours is used in a greedy task assignment algorithm. In simulation experiments we highlight the performance benefits of the proposed method under various settings.
|
|
16:40-17:00, Paper ThCT03.3 | Add to My Program |
Collision Avoidance for Dynamic Obstacles with Uncertain Predictions Using Model Predictive Control |
|
Nair, Siddharth | University of California, Berkeley |
Tseng, H. Eric | Ford Motor Company |
Borrelli, Francesco | Unversity of California at Berkeley |
Keywords: Autonomous vehicles, Predictive control for linear systems, Uncertain systems
Abstract: We propose a Model Predictive Control (MPC) for collision avoidance between an autonomous agent and dynamic obstacles with uncertain predictions. The collision avoidance constraints are imposed by enforcing positive distance between convex sets representing the agent and the obstacles, and tractably reformulating them using Lagrange duality. This approach allows for smooth collision avoidance constraints even for polytopes, which otherwise require mixed-integer or non-smooth constraints. We consider three widely used descriptions of the uncertain obstacle position: 1) Arbitrary distribution with polytopic support, 2) Gaussian distributions and 3) Arbitrary distribution with first two moments known. For each case we obtain deterministic reformulations of the collision avoidance constraints. The proposed MPC formulation optimizes over feedback policies to reduce conservatism in satisfying the collision avoidance constraints. The proposed approach is validated using simulations of traffic intersections in CARLA.
|
|
17:00-17:20, Paper ThCT03.4 | Add to My Program |
Lane Change in Automated Driving: An Explicit Coordination Strategy |
|
Falsone, Alessandro | Politecnico Di Milano |
Melani, Beatrice | Politecnico Di Milano |
Prandini, Maria | Politecnico Di Milano |
Keywords: Autonomous vehicles, Computational methods
Abstract: We address a multi-vehicle automated driving scenario, where a vehicle has to change lane and merge in a platoon in a one-way roadway with two lanes. We focus on the coordination phase of the lane change, where vehicles in the platoon need to create a gap for the merging vehicle to enter safely following a pre-computed optimal trajectory. The goal is pre-computing also the multi-vehicle coordination strategy, so as to limit the computational and communication effort involved in its online implementation. This is achieved by considering the platoon as if it was composed of an infinite number of vehicles and solving a multi-parametric optimization program providing the coordination strategy as an explicit function of position and velocity of the ego vehicle, integrating a multi-class classifier to identify the best merging position. Numerical simulations show that the resulting performance degradation when implementing the strategy on a finite platoon is limited to boundary effects at its head and tail.
|
|
17:20-17:40, Paper ThCT03.5 | Add to My Program |
Trust-Aware Control of Automated Vehicles in Car-Following Interactions with Human Drivers |
|
Ozkan, Mehmet | Texas Tech University |
Ma, Yao | Texas Tech University |
Keywords: Autonomous vehicles, Automotive control
Abstract: Trust is essential for automated vehicles (AVs) to promote and sustain technology acceptance in human-dominated traffic scenarios. However, computational trust dynamic models describing the interactive relationship between the AVs and surrounding human drivers in traffic rarely exist. This paper aims to fill this gap by developing a quantitative trust dynamic model of the human driver in the car-following interaction with the AV and incorporating the proposed trust dynamic model into the AV’s control design. The human driver’s trust level is modeled as a plan evaluation metric that measures the explicability of the AV’s plan from the human driver’s perspective, and the explicability score of the AV’s plan is integrated into the AV’s decision-making process. With the proposed approach, trust-aware AVs generate explicable plans by optimizing both predefined plans and explicability of the plans in the car-following interactions with the following human driver. The results collectively demonstrate that the trust-aware AV can generate more explicable plans and achieve a higher trust level for the human driver compared to trust-unaware AV in human-AV interactions.
|
|
17:40-18:00, Paper ThCT03.6 | Add to My Program |
Simultaneous Lane-Keeping and Obstacle Avoidance by Combining Model Predictive Control and Control Barrier Functions |
|
Brüggemann, Sven | University of California, San Diego |
Steeves, Drew | University of California, San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Autonomous vehicles, Nonlinear systems, Predictive control for linear systems
Abstract: We combine Model Predictive Control (MPC) and Control Barrier Function (CBF) design methods to create a hierarchical control law for simultaneous lane-keeping (LK) and obstacle avoidance (OA): at the low level, MPC performs LK via trajectory tracking during nominal operation; and at the high level, various CBF-based safety filters that ensure LK and OA are designed and compared across practical scenarios. In particular, we show that Exponential Safety (ESf) and Prescribed-Time Safety (PTSf) filters, which override the MPC control when necessary, result in feasible Quadratic Programs when OA-safety is prioritized. We additionally investigate control designs subject to input constraints by using Input-Constrained-CBFs. Finally, we compare the performance of combinations of ESf, PTSf, and their input-constrained counterparts with respect to the LK and OA goals in two simulation studies for early- and late-detected obstacle scenarios.
|
|
ThCT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Learning-Based Systems II |
|
|
Chair: Parise, Francesca | Cornell University |
Co-Chair: Barton, Kira | University of Michigan, Ann Arbor |
|
16:00-16:20, Paper ThCT04.1 | Add to My Program |
Collaborative Learning Model Predictive Control for Repetitive Tasks |
|
Chanfreut, Paula | University of Seville |
Maestre, Jose Maria (Pepe) | University of Seville |
Camacho, Eduardo F. | Univ. of Sevilla |
Borrelli, Francesco | Unversity of California at Berkeley |
Keywords: Iterative learning control, Predictive control for linear systems, Agents-based systems
Abstract: This paper presents a cloud-based learning model predictive controller that integrates three interacting components: a set of agents, which must learn to perform a finite set of tasks with the minimum possible local cost; a coordinator, which assigns the tasks to the agents; and the cloud, which stores data to facilitate the agents’ learning. The tasks consist in traveling repeatedly between a set of target states while satisfying input and state constraints. In turn, the state constraints may change in time for each of the possible tasks. To deal with it, different modes of operation, which establish different restrictions, are defined. The agents’ inputs are found by solving local model predictive control (MPC) problems where the terminal set and cost are defined from previous trajectories. The data collected by each agent is uploaded to the cloud and made accessible to all their peers. Likewise, similarity between tasks is exploited to accelerate the learning process. The applicability of the proposed approach is illustrated by simulation results.
|
|
16:20-16:40, Paper ThCT04.2 | Add to My Program |
Data-Driven Approximations of Chance Constrained Programs in Nonstationary Environments |
|
Yan, Shuhao | Cornell University |
Parise, Francesca | Cornell University |
Bitar, Eilyan | Cornell University |
Keywords: Randomized algorithms, Optimization, Machine learning
Abstract: We study sample average approximations (SAA) of chance constrained programs. SAA methods typically approximate the actual distribution in the chance constraint using an empirical distribution constructed from random samples assumed to be independent and identically distributed according to the actual distribution. In this paper, we consider a nonstationary variant of this problem, where the random samples are assumed to be independently drawn in a sequential fashion from an unknown and possibly time-varying distribution. This nonstationarity may be driven by changing environmental conditions present in many real-world applications. To account for the potential nonstationarity in the data generation process, we propose a novel robust SAA method exploiting information about the Wasserstein distance between the sequence of data-generating distributions and the actual chance constraint distribution. As a key result, we obtain distribution-free estimates of the sample size required to ensure that the robust SAA method will yield solutions that are feasible for the chance constraint under the actual distribution with high confidence.
|
|
16:40-17:00, Paper ThCT04.3 | Add to My Program |
Reinforcement Learning Enabled Autonomous Manufacturing Using Transfer Learning and Probabilistic Reward Modeling |
|
Alam, Md Ferdous | The Ohio State University |
Shtein, Max | University of Michigan |
Barton, Kira | University of Michigan, Ann Arbor |
Hoelzle, David | Ohio State University |
Keywords: Manufacturing systems and automation, Autonomous systems, Machine learning
Abstract: Here we propose a reinforcement learning enabled physical autonomous manufacturing system (AMS) that is capable of learning the manufacturing process parameters to autonomously fabricate a complex-geometry artifact with desired performance characteristics. The poor sample efficiency of traditional RL algorithms challenges real-world manufacturing decision making due to a high variable cost from raw material, machine utilization, and labor costs. To make decision making sample efficient, we propose to leverage a first-principles based source task for training, transfer effective representations from trained knowledge, and then use these representations to interact with the physical system to learn a probabilistic model of the target reward function. We deploy this idea to a novel dataset obtained from a custom physical AMS machine that can autonomously manufacture phononic crystals, a complex geometry artifact with spectral response as performance characteristic. We demonstrate that our method uses as low as 25 artifacts to model the interesting part of the target reward function and find an artifact with high reward. This task typically requires manual design of phononic crystals and extensive empirical iterations on the order of hundreds.
|
|
17:00-17:20, Paper ThCT04.4 | Add to My Program |
Data-Driven Learning of Safety-Critical Control with Stochastic Control Barrier Functions |
|
Wang, Chuanzheng | University of Waterloo |
Meng, Yiming | University of Waterloo |
Smith, Stephen L. | University of Waterloo |
Liu, Jun | University of Waterloo |
Keywords: Stochastic optimal control, Uncertain systems, Lyapunov methods
Abstract: Control barrier functions are widely used to synthesize safety-critical controls. The existence of Gaussian-type noise may lead to unsafe actions and result in severe consequences. While studies are widely done in safety-critical control for stochastic systems, in many real-world applications, we do not have the knowledge of the stochastic component of the dynamics. In this paper, we study safety-critical control of stochastic systems with an unknown diffusion part and propose a data-driven method to handle these scenarios. More specifically, we propose a data-driven stochastic control barrier function (DDSCBF) framework and use supervised learning to learn the unknown stochastic dynamics via the DDSCBF scheme. Under some reasonable assumptions, we provide guarantees that the DDSCBF scheme can approximate the It^{o} derivative of the stochastic control barrier function (SCBF) under partially unknown dynamics using the universal approximation theorem. We also show that we can achieve the same safety guarantee using the DDSCBF scheme as with SCBF in previous work without requiring the knowledge of stochastic dynamics. We use two non-linear stochastic systems to validate our theory in simulations.
|
|
17:20-17:40, Paper ThCT04.5 | Add to My Program |
End-To-End Imitation Learning with Safety Guarantees Using Control Barrier Functions |
|
Cosner, Ryan | California Institute of Techno |
Yue, Yisong | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Lyapunov methods, Machine learning
Abstract: Imitation learning (IL) is a learning paradigm which can be used to synthesize controllers for complex systems that mimic behavior demonstrated by an expert (user or control algorithm). Despite their popularity, IL methods generally lack guarantees of safety, which limits their utility for complex safety-critical systems. In this work we consider safety, formulated as set-invariance, and the associated formal guarantees endowed by Control Barrier Functions (CBFs). We develop conditions under which robustly-safe expert controllers, utilizing CBFs, can be used to learn end-to-end controllers (which we refer to as CBF-Compliant controllers) that have safety guarantees. These guarantees are presented from the perspective of input-to-state safety (ISSf) which considers safety in the context of disturbances, wherein it is shown that IL using robustly safe expert demonstrations results in ISSf with the disturbance directly related to properties of the learning problem. We demonstrate these safety guarantees in simulated vision-based end-to-end control of an inverted pendulum and a car driving on a track.
|
|
17:40-18:00, Paper ThCT04.6 | Add to My Program |
Fitting an Immersed Submanifold to Data Via Sussmann's Orbit Theorem |
|
Hanson, Joshua | University of Illinois at Urbana-Champaign |
Raginsky, Maxim | University of Illinois at Urbana-Champaign |
Keywords: Statistical learning, Nonlinear systems, Algebraic/geometric methods
Abstract: This paper describes an approach for fitting an immersed submanifold of a finite-dimensional Euclidean space to random samples. The reconstruction mapping from the ambient space to the desired submanifold is implemented as a composition of an encoder that maps each point to a tuple of (positive or negative) times and a decoder given by a composition of flows along finitely many vector fields starting from a fixed initial point. The encoder supplies the times for the flows. The encoder-decoder map is obtained by empirical risk minimization, and a high-probability bound is given on the excess risk relative to the minimum expected reconstruction error over a given class of encoder-decoder maps. The proposed approach makes fundamental use of Sussmann's orbit theorem, which guarantees that the image of the reconstruction map is indeed contained in an immersed submanifold.
|
|
ThCT05 Invited Session, Tulum Ballroom E |
Add to My Program |
Recent Advances in Learning and Control II |
|
|
Chair: Nakahira, Yorie | Carnegie Mellon University |
Co-Chair: Wierman, Adam | California Institute of Technology |
Organizer: Qu, Guannan | Carnegie Mellon University |
Organizer: Wierman, Adam | California Institute of Technology |
Organizer: Zhang, Kaiqing | MIT |
|
16:00-16:20, Paper ThCT05.1 | Add to My Program |
Escaping High-Order Saddles in Policy Optimization for Linear Quadratic Gaussian (LQG) Control (I) |
|
Zheng, Yang | University of California San Diego |
Sun, Yue | University of Washington |
Fazel, Maryam | University of Washington |
Li, Na | Harvard University |
Keywords: Optimal control, Optimization, Optimization algorithms
Abstract: First-order policy optimization has been widely used in reinforcement learning. It guarantees to find the optimal policy for the state-feedback linear quadratic regulator (LQR). However, the performance of policy optimization remains unclear for the linear quadratic Gaussian (LQG) control where the LQG cost has spurious suboptimal stationary points. In this paper, we introduce a novel perturbed policy gradient (PGD) method to escape a large class of bad stationary points (including high-order saddles). In particular, based on the specific structure of LQG, we introduce a novel reparameterization procedure that converts the iterate from a high-order saddle to a strict saddle, from which standard random perturbations in PGD can escape efficiently. We further characterize a class of high-order saddles that can be escaped by our algorithm.
|
|
16:20-16:40, Paper ThCT05.2 | Add to My Program |
Source Seeking for Planar Underactuated Vehicles by Surge Force Tuning (I) |
|
Wang, Bo | Villanova University |
Nersesov, Sergey | Villanova University |
Ashrafiuon, Hashem | Villanova University |
Naseradinmousavi, Peiman | San Diego State University |
Krstic, Miroslav | University of California, San Diego |
Keywords: Adaptive systems, Autonomous vehicles, Nonholonomic systems
Abstract: We extend source seeking algorithms, in the absence of position and velocity measurements, and with tuning of the surge input, from velocity-actuated (unicycle) kinematic models to force-actuated generic Euler-Lagrange dynamic underactuated models. In the design and analysis, we employ a symmetric product approximation, averaging, passivity, and partial-state stability theory. The proposed control law requires only real-time measurement of the source signal at the current position of the vehicle and ensures semi-global practical uniform asymptotic stability (SPUAS) with respect to the linear motion coordinates for the closed-loop system. The performance of our source seeker with surge force tuning is illustrated with numerical simulations of an underactuated surface vessel.
|
|
16:40-17:00, Paper ThCT05.3 | Add to My Program |
Probabilistic Safety Certificate for Multi-Agent Systems (I) |
|
Jing, Haoming | Carnegie Mellon University |
Nakahira, Yorie | Carnegie Mellon University |
Keywords: Distributed control, Decentralized control, Intelligent systems
Abstract: This paper focuses on the multi-agent safe control problem for stochastic systems. We propose a probabilistic certificate for safety and performance specifications and use it to construct a distributed algorithm. The certificate integrates the reachability- and invariance-based (barrier-function-based) approaches via a new notion of forward invariance defined on the long-term probability. The proposed method has two features. First, it can guarantee a long-term probability of safety and performance satisfaction using myopic evaluation. Second, each agent can collaboratively ensure system-wide specifications even if each does not have sufficient information to evaluate the specifications. The effectiveness of the proposed method is tested using numerical experiments.
|
|
17:00-17:20, Paper ThCT05.4 | Add to My Program |
Reachable Set Approximation As a Non-Cooperative Multi-Agent Coverage Game (I) |
|
Rajab, Fat-hy Omar | University of Illinois at Urbana-Champaign |
Shamma, Jeff S. | University of Illinois at Urbana-Champaign |
Keywords: Agents-based systems, Game theory, Randomized algorithms
Abstract: We estimate the reachable set of a dynamical system by posing reachable set construction as a multi-agent coverage problem. As the terminology implies, the reachable set is the set of all states that can be reached within a specified time, using exogenous inputs with a specified bound, and starting from a specified initial condition. In multi-agent coverage, mobile agents self-deploy in an online manner to cover a region that is unknown a priori. The mapping between the two settings is the unknown region being the reachable set. Using time discretization and randomized spatial discretization, the proposed algorithm simultaneously generates a finite graph contained within the true reachable set and deploys the agents to optimally cover the graph. The utilized game-theoretic methods assure that, asymptotically, the agents self-deploy in a manner that provides optimal coverage with high probability. The accuracy of the approximation of the reachable set depends on the temporal and spacial discretization. The proposed algorithm is illustrated on different dynamical systems, where the performance is compared to related scenario-based approaches to reachable set estimation.
|
|
17:20-17:40, Paper ThCT05.5 | Add to My Program |
Connectivity of the Feasible and Sublevel Sets of Dynamic Output Feedback Control with Robustness Constraints |
|
Hu, Bin | University of Illinois at Urbana-Champaign |
Zheng, Yang | University of California San Diego |
Keywords: Robust control, Optimization, Output regulation
Abstract: This paper considers the optimization landscape of linear dynamic output feedback control with H-infinity robustness constraints. We consider the feasible set of all the stabilizing full-order dynamical controllers that satisfy an additional H-infinity robustness constraint. We show that this H-infinity-constrained set has at most two path-connected components that are diffeomorphic under a mapping defined by a similarity transformation. Our proof technique utilizes a classical change of variables in H-infinity control to establish a surjective mapping from a set with a convex projection to the H-infinity-constrained set. This proof idea can also be used to establish the same topological properties of strict sublevel sets of linear quadratic Gaussian (LQG) control and optimal H-infinity control. Our results bring positive news for gradient-based policy search on robust control problems.
|
|
17:40-18:00, Paper ThCT05.6 | Add to My Program |
Online Greedy Identification of Linear Dynamical Systems |
|
Blanke, Matthieu | INRIA-ENS |
Lelarge, Marc | INRIA-ENS |
Keywords: Identification, Learning, Linear systems
Abstract: This work addresses the problem of exploration in an unknown environment. For multi-input multi-output, linear time-invariant dynamical systems, we use an experimental design framework and introduce an online greedy policy where the control maximizes the information of the next step. We evaluate our approach experimentally and compare it with more elaborate gradient-based methods. In a setting with a limited number of observations, our algorithm has low complexity and shows competitive performances.
|
|
ThCT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Estimation and Filtering |
|
|
Chair: Muller, Matthias A. | Leibniz University Hannover |
Co-Chair: Berntorp, Karl | Mitsubishi Electric Research Labs |
|
16:00-16:20, Paper ThCT06.1 | Add to My Program |
A Statistical Decision-Theoretical Perspective on the Two-Stage Approach to Parameter Estimation |
|
Lakshminarayanan, Braghadeesh | KTH Royal Institute of Technology |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Keywords: Estimation, Identification, Statistical learning
Abstract: One of the most important problems in system identification and statistics is how to estimate the unknown parameters of a given model. Optimization methods and specialized procedures, such as Empirical Minimization (EM) can be used in case the likelihood function can be computed. For situations where one can only simulate from a parametric model, but the likelihood is difficult or impossible to evaluate, a technique known as the Two-Stage (TS) Approach can be applied to obtain reliable parametric estimates. Unfortunately, there is currently a lack of theoretical justification for TS. In this paper, we propose a statistical decision-theoretical derivation of TS, which leads to Bayesian and Minimax estimators. We also show how to apply the TS approach on models for independent and identically distributed samples, by computing quantiles of the data as a first step, and using a linear function as the second stage. The proposed method is illustrated via numerical simulations.
|
|
16:20-16:40, Paper ThCT06.2 | Add to My Program |
On the Accuracy of the One-Step UKF and the Two-Step UKF |
|
Goel, Ankit | University of Maryland Baltimore County |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Estimation, Kalman filtering, Nonlinear systems
Abstract: The most accurate version of the unscented Kalman filter (UKF) involves the construction of two ensembles. To reduce computational cost, however, UKF is often implemented without the second ensemble. This simplification comes at a price, however, since, for linear systems, the one-step variation of the two-step UKF does not specialize to the classical Kalman filter, with an associated loss of accuracy. This paper remedies this drawback by developing a modified one-step UKF that recovers the classical Kalman filter for linear systems. Numerical examples show that the modified one-step UKF also recovers the accuracy of the two-step UKF in nonlinear systems with linear outputs.
|
|
16:40-17:00, Paper ThCT06.3 | Add to My Program |
Testing for Coincidences between Time Varying Poisson Processes |
|
Pasha, Syed Ahmed | Air University |
Solo, Victor | University of New South Wales |
Keywords: Estimation, Stochastic systems
Abstract: Multivariate point processes have a long history of application to areas such as queueing theory, neural coding, high frequency finance but the dominant models exclude coincidences i.e. of events in two different point processes occurring simultaneously. Here we consider testing for simultaneous occurrence of events between two point processes. Though this kind of problem occurs widely in practice it has been largely ignored. We develop a simple test based on cross covariance type statistics and illustrate it with simulations and on real data.
|
|
17:00-17:20, Paper ThCT06.4 | Add to My Program |
Stability Conditions for Remote State Estimation of Multiple Systems Over Semi-Markov Fading Channels |
|
Liu, Wanchun | The University of Sydney |
Quevedo, Daniel E. | Queensland University of Technology |
Vucetic, Branka | The University of Sydney |
Li, Yonghui | The University of Sydney |
Keywords: Control over communications, Estimation, Kalman filtering
Abstract: This letter studies remote state estimation of multiple linear time-invariant systems over shared wireless time-varying communication channels. We model the channel states by a semi-Markov process which captures both the random holding period of each channel state and the state transitions. The model is sufficiently general to be used in both fast and slow fading scenarios. We derive necessary and sufficient stability conditions of the multi-sensor-multi-channel system in terms of the system parameters. We further investigate how the delay of the channel state information availability and the holding period of channel states affect the stability. In particular, we show that, from a system stability perspective, fast fading channels may be preferable to slow fading ones.
|
|
17:20-17:40, Paper ThCT06.5 | Add to My Program |
A Simple Suboptimal Moving Horizon Estimation Scheme with Guaranteed Robust Stability |
|
Schiller, Julian D. | Leibniz University Hannover |
Wu, Boyang | Leibniz University Hannover, Institute of Automatic Control |
Muller, Matthias A. | Leibniz University Hannover |
Keywords: Observers for nonlinear systems, Estimation, Lyapunov methods
Abstract: We propose a suboptimal moving horizon estimation (MHE) scheme for a general class of nonlinear systems. To this end, we consider an MHE formulation that optimizes over the trajectory of a robustly stable observer. Assuming that the observer admits a Lyapunov function, we show that this function is an M-step Lyapunov function for suboptimal MHE. The presented sufficient conditions can be easily verified in practice. We illustrate the practicability of the proposed suboptimal MHE scheme with a standard nonlinear benchmark example. Here, performing a single iteration is sufficient to significantly improve the observer's estimation results under valid theoretical guarantees.
|
|
17:40-18:00, Paper ThCT06.6 | Add to My Program |
Distributed Kalman Filtering: When to Share Measurements |
|
Greiff, Marcus Carl | Lund University |
Berntorp, Karl | Mitsubishi Electric Research Labs |
Keywords: Kalman filtering, Estimation, Filtering
Abstract: This paper considers the problem of designing distributed Kalman filters (DKFs) when the sensor measurement noise is correlated. To this end, we analyze several existing methods in terms of their Bayesian Crameer-Rao bounds (BCRB), and insights from the analysis motivates a departure from the conventional estimate-sharing frameworks in favor of measurement-sharing. We demonstrate that if the communication bandwidth and computational resources permit, the minimum mean-square error (MMSE) estimator is implementable under measurement-sharing protocols. Furthermore, such approaches may use less communication bandwidth than standard consensus methods for smaller estimation problems. The developments are verified in several numerical examples, including comparisons against previously reported methods.
|
|
ThCT07 Invited Session, Tulum Ballroom G |
Add to My Program |
Estimation and Control of Infinite-Dimensional Systems I |
|
|
Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Co-Chair: Burns, John A | Virginia Tech |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Burns, John A | Virginia Tech |
|
16:00-16:20, Paper ThCT07.1 | Add to My Program |
In-Domain Damping Assignment of a Timoshenko-Beam Using State Feedback Boundary Control |
|
Redaud, Jeanne | Université Paris-Saclay, Inria, CentraleSupélec |
Auriol, Jean | CNRS |
Le Gorrec, Yann | Ensmm, Femto-St / As2m |
Keywords: Distributed parameter systems, Distributed control, Control applications
Abstract: n this paper, we combine the backstepping methodology and the port Hamiltonian framework to design a boundary full-state feedback controller that modifies the closed-loop in-domain damping of a Timoshenko beam. The beam under consideration is clamped in one end of its spatial domain and actuated at the opposite one. The port Hamiltonian formulation is used to derive several boundedly invertible transformations that map the original system into an exponentially stable closed-loop target system with additional in-domain damping terms. The proposed methodology allows the introduction of tuning parameters with clear physical interpretations for achievable closed-loop behavior. Simulations illustrate the performances of the controller.
|
|
16:20-16:40, Paper ThCT07.2 | Add to My Program |
Adaptive RKHS-Based Functional Estimation of Structurally Perturbed Second Order Infinite Dimensional Systems (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems, Estimation
Abstract: This paper proposes a new approach for the adaptive functional estimation of second order infinite dimensional systems with structured perturbations. First, the proposed observer is formulated in the natural second order setting thus ensuring the time derivative of the estimated position is the estimated velocity, and therefore called natural adaptive observer. Assuming that the system does not yield a positive real system when placed in first order form, then the next step in deriving parameter adaptive laws is to assume a form of input-output collocation. Finally, to estimate structured perturbations taking the form of functions of the position and/or velocity outputs, the parameter space is not identified by a finite dimensional Euclidean space but instead is considered in a Reproducing Kernel Hilbert Space. Such a setting allows one not to be restricted by a priori assumptions on the dimension of the parameter spaces. Convergence of the position and velocity errors in their respective norms is established via the use of a parameter-dependent Lyapunov function, specifically formulated for second order infinite dimensional systems that include appropriately defined norms of the functional errors in the reproducing kernel Hilbert spaces. Boundedness of the functional estimates immediately follow and via an appropriate definition of a persistence of excitation condition for functional estimation, a functional convergence follows. When the system is governed by vector second order dynamics, all abstract spaces for the state evolution collapse to a Euclidean space and the natural adaptive observer results simplify. Numerical results of a second order PDE and a multi-degree of freedom finite dimensional mechanical system are presented.
|
|
16:40-17:00, Paper ThCT07.3 | Add to My Program |
Robust Output Regulation for a Wave Equation Via Adaptive Internal Model (I) |
|
Guo, Bao-Zhu | North China Electric Power University |
Zhao, Ren-Xi | Academy of Mathematics and Systems Science |
Keywords: Distributed parameter systems, Linear systems, Robust control
Abstract: This paper considers output regulation for a 1-d wave equation where the disturbances generated from an unknown finite-dimensional exosystem. By solving a Sylvester equation, the original system is decoupled into a PDE-ODE cascade system. An adaptive observer approach is adopted to estimate all possible unknown frequencies that have entered into a transformed system which has disturbance in tracking error only. By the estimates of the unknown frequencies, we are able to design a tracking error based feedback control to achieve output regulation and disturbance rejection for this PDEs where the tracking error is solely used and the asymptotic convergence is achieved.
|
|
17:00-17:20, Paper ThCT07.4 | Add to My Program |
Machine Learning Accelerated PDE Backstepping Observers (I) |
|
Shi, Yuanyuan | University of California San Diego |
Li, Zongyi | Caltech |
Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Steeves, Drew | University of California, San Diego |
Anandkumar, Animashree | California Institute of Technology |
Krstic, Miroslav | University of California, San Diego |
Keywords: Distributed parameter systems, Machine learning, Fluid flow systems
Abstract: State estimation is important for a variety of tasks, from forecasting to substituting for unmeasured states in feedback controllers. Performing real-time state estimation for PDEs using provably and rapidly converging observers, such as those based on PDE backstepping, is computationally expensive and in many cases prohibitive. We propose a framework for accelerating PDE observer computations using learning-based approaches that are much faster while maintaining accuracy. In particular, we employ the recently-developed Fourier Neural Operator (FNO) to learn the functional mapping from the initial observer state and boundary measurements to the state estimate. By employing backstepping observer gains for previously-designed observers with particular convergence rate guarantees, we provide numerical experiments that evaluate the increased computational efficiency gained with FNO. We consider the state estimation for three benchmark PDE examples motivated by applications: first, for a reaction-diffusion (parabolic) PDE whose state is estimated with an exponential rate of convergence; second, for a parabolic PDE with exact prescribed-time estimation; and, third, for a pair of coupled first-order hyperbolic PDEs that modeling traffic flow density and velocity. The ML-accelerated observers trained on simulation data sets for these PDEs achieves up to three orders of magnitude improvement in computational speed compared to classical methods. This demonstrates the attractiveness of the ML-accelerated observers for real-time state estimation and control.
|
|
17:20-17:40, Paper ThCT07.5 | Add to My Program |
Leak Detection, Size Estimation and Localization in Water Distribution Networks Containing Loops (I) |
|
Wilhelmsen, Nils Christian Aars | Norwegian University of Science and Technology |
Aamo, Ole Morten | NTNU |
Keywords: Distributed parameter systems, Fluid flow systems, Adaptive systems
Abstract: We consider the problem of leak detection, size estimation and localization of leaks in a water distribution network with ring topology, with flow and pressure measurements taken at junctions only. The pipeline system is mapped into an interconnection (cascade) of 2X2 linear hyperbolic systems with an unknown scalar additive boundary parameter coming into the far end of the last PDE system. Using an output signal from the opposite end of the interconnection constructed from known signals, an adaptive observer for estimating the PDE states as well as the unknown additive boundary parameter is designed. The pipeline in which the leakage has occurred is then identified and the total leak size and leak distribution are estimated independently. Provided they occur sufficiently spaced in time, any number of point leaks can be located in the network.
|
|
17:40-18:00, Paper ThCT07.6 | Add to My Program |
Infinite-Dimensional Adaptive Boundary Observer for Inner-Domain Temperature Estimation of 3D Electrosurgical Processes Using Surface Thermography Sensing (I) |
|
El-Kebir, Hamza | University of Illinois at Urbana-Champaign |
Ran, Junren | University of Illinois at Urbana-Champaign |
Ostoja-Starzewski, Martin | University of Illinois at Urbana-Champaign |
Berlin, Richard | University of Illinois at Urbana-Champaign |
Bentsman, Joseph | University of Illinois at Urbana-Champaign |
Chamorro, Leonardo | University of Illinois at Urbana-Champaign |
Keywords: Distributed parameter systems, Estimation, Biomedical
Abstract: We present a novel 3D adaptive observer framework for use in the determination of subsurface organic tissue temperatures in electrosurgery. The observer structure leverages pointwise 2D surface temperature readings obtained from a real-time infrared thermographer for both parameter estimation and temperature field observation. We introduce a novel approach to decoupled parameter adaptation and estimation, wherein the parameter estimation can run in realtime, while the observer loop runs on a slower time scale. To achieve this, we introduce a novel parameter estimation method known as attention-based noise-robust averaging, in which surface thermography time series are used to directly estimate the tissue’s diffusivity. Our observer contains a realtime parameter adaptation component based on this diffusivity adaptation law, as well as a Luenberger-type corrector based on the sensed surface temperature. In this work, we also present a novel model structure adapted to the setting of robotic surgery, wherein we model the electrosurgical heat distribution as a compactly supported magnitude- and velocity-controlled heat source involving a new nonlinear input mapping. We demonstrate satisfactory performance of the adaptive observer in simulation, using real-life experimental ex vivo porcine tissue data.
|
|
ThCT08 Regular Session, Tulum Ballroom H |
Add to My Program |
Privacy and Security |
|
|
Chair: Rudie, Karen | Queen's Univ |
Co-Chair: Vasconcelos, Marcos M. | Virginia Tech |
|
16:00-16:20, Paper ThCT08.1 | Add to My Program |
Opacity and Its Trade-Offs with Security in Linear Systems |
|
John, Varkey | Indian Institute of Science, Bangalore |
Katewa, Vaibhav | Indian Institute of Science Bangalore |
Keywords: Control Systems Privacy, Cyber-Physical Security, Attack Detection
Abstract: Opacity and attack detectability are important properties for any system as they allow the states to remain private and malicious attacks to be detected, respectively. In this paper, we show that a fundamental trade-off exists between these properties for a linear dynamical system, in the sense that one cannot have an opaque system without making it vulnerable to undetectable attacks. We first characterize the opacity conditions for the system in terms of its weakly unobservable subspace (WUS) and show that the number of opaque states is proportional to the size of the WUS. Further, we establish conditions under which increasing the opaque sets also increases the set of undetectable attacks. This highlights a fundamental trade-off between security and privacy. We demonstrate our results on a team of delivery UAVs.
|
|
16:20-16:40, Paper ThCT08.2 | Add to My Program |
Enforcing Degree of Opacity with Supervisory Control |
|
Schonewille, Bryony H. | Queen's University |
Moulton, Richard Hugh | Queen's University |
Rudie, Karen | Queen's Univ |
Keywords: Control Systems Privacy, Discrete event systems, Autonomous vehicles
Abstract: The opacity-enforcement problem in discrete-event systems is one of ensuring that an adversarial observer cannot determine a given system's secret. Although the problem has been well studied, existing formulations enforce opacity as a binary notion: a system is either opaque or it isn't. Here we introduce a more general measure of a system's opacity called degree of opacity. This measure can be used to assess the security of systems with respect to different criteria through the quantification of the level of security. We demonstrate how the subobserver property allows supervisory control to produce a closed-loop system whose overall degree of opacity meets a desired threshold. We demonstrate that degree of opacity can be used in place of the traditional, binary definition and argue that many applications will benefit from this nuanced view.
|
|
16:40-17:00, Paper ThCT08.3 | Add to My Program |
Securing Signal-Free Intersections against Strategic Jamming Attacks: A Macroscopic Approach |
|
Bai, Yumeng | Shanghai Jiao Tong University |
Amin, Saurabh | Massachusetts Institute of Technology |
Wang, Xudong | Shanghai Jiao Tong University |
Jin, Li | Shanghai Jiao Tong University |
Keywords: Control Systems Privacy, Game theory, Traffic control
Abstract: We consider the security-by-design of a signal-free intersection for connected and autonomous vehicles in the face of strategic jamming attacks. We use a fluid model to characterize macroscopic traffic flow through the intersection, where the saturation rate is derived from a vehicle coordination algorithm. We model jamming attacks as sudden increase in communication latency induced on vehicle-to-infrastructure connectivity; such latency triggers the safety mode for vehicle coordination and thus reduces the intersection saturation rate. A strategic attacker selects the attacking rate, while a system operator selects key design parameters, either the saturation rate or the recovery rate. Both players' actions induce technological costs and jointly determine the mean travel delay. By analyzing the equilibrium of the security game, we study the preferable level of investment in the intersection's nominal discharging capability or recovery capability, for balance between hardware/infrastructure cost and security-by-design.
|
|
17:00-17:20, Paper ThCT08.4 | Add to My Program |
Predictability of Stochastic Dynamical System: A Probabilistic Perspective |
|
Xu, Tao | Shanghai Jiao Tong University |
He, Jianping | Shanghai Jiao Tong University |
Keywords: Control Systems Privacy, Stochastic systems, Information theory and control
Abstract: In this paper, to characterize the predictability of stochastic dynamical system with infinite states and use predictability to explain the probability of making accurate predictions, we have proposed a new performance metric.This metric quantifies the asymptotic exponential decaying rate for the probability that prediction errors never exceed a fixed. The novelties of this predictability metric lies in that it is proposed from an asymptotic probabilistic perspective, which i)minimizes the exponential decaying rate of prediction probability,ii) holds an elegant approximated expression deeply related with differential entropy within an error of O(epsilon), and iii) possesses a fast converging speed from finite decaying rate of O(e^{−K}) in the sense of probability. Numerical examples are given to illustrate the efficiency of the obtained results.
|
|
17:20-17:40, Paper ThCT08.5 | Add to My Program |
Robust Remote Estimation Over the Collision Channel in the Presence of an Intelligent Jammer |
|
Zhang, Xu | Chinese Academy of Sciences |
Vasconcelos, Marcos M. | Virginia Tech |
Keywords: Cyber-Physical Security, Sensor networks, Game theory
Abstract: We consider a sensor-receiver pair communicating over a wireless channel in the presence of a jammer who may launch a denial-of-service attack. We formulate a zero-sum game between a coordinator that jointly designs the transmission and estimation policies, and the jammer. We consider two cases depending on whether the jammer can sense the channel or not. We characterize a saddle-point equilibrium for the class of symmetric and unimodal probability density functions when the jammer cannot sense the channel. If the jammer can sense if the channel is being used, we provide an efficient algorithm that alternates between iterations of Projected Gradient Ascent and the Convex-Concave Procedure to find approximate First-order Nash-Equilibria. Our numerical results show that in certain cases the jammer may decide to launch a denial-of-service attack with the goal of deceiving the receiver even when the sensor decides not to transmit.
|
|
17:40-18:00, Paper ThCT08.6 | Add to My Program |
Optimal Myopic Attacks on Nonlinear Estimation |
|
Hallyburton, R. Spencer | Duke University |
Khazraei, Amir | Duke University |
Pajic, Miroslav | Duke University |
Keywords: Cyber-Physical Security
Abstract: Prior works have analyzed the security of estimation and control (E&C) for linear, time-invariant systems; however, there are few analyses of nonlinear systems despite their broad safety-critical use. We define two attack objectives on nonlinear E&C and illustrate that realizing the optimal attacks against the widely-adopted extended Kalman filter with industry-standard anomaly detection is equivalent to solving convex quadratically-constrained quadratic programs. Although these require access to the true state of the system, we provide practical relaxations on the optimal attacks to allow for execution at runtime given a specified amount of attacker knowledge. We show that the difference between the optimal and relaxed attacks is bounded by the attacker knowledge.
|
|
ThCT09 Regular Session, Maya Ballroom I |
Add to My Program |
Stability of Linear Systems |
|
|
Chair: Gomes da Silva Jr, Joao Manoel | Universidade Federal Do Rio Grande Do Sul (UFRGS) |
Co-Chair: Fiacchini, Mirko | CNRS, Univ. Grenoble Alpes |
|
16:00-16:20, Paper ThCT09.1 | Add to My Program |
A Lagrange Subspace Approach to Dissipation Inequalities |
|
van der Schaft, Arjan | Univ. of Groningen |
Mehrmann, Volker | Technische Universität Berlin |
Keywords: Stability of linear systems, Differential-algebraic systems, Numerical algorithms
Abstract: The standard dissipation inequality for passivity is extended by replacing storage functions by general Lagrange subspaces. This leads to some interesting consequences, including loss of controllability. A classical fundamental factorization result for passive systems is extended to this generalized case, making use of the newly defined concept of the Hamiltonian lift of a differential-algebraic equation (DAE) system.
|
|
16:20-16:40, Paper ThCT09.2 | Add to My Program |
Inner-Outer Decomposition for Strictly Proper Linear Time Invariant Systems and Non-Minimum Phase Performance Limitations |
|
Spirito, Mario | University of Bologna |
Marconi, Lorenzo | Univ. Di Bologna |
Keywords: Stability of linear systems, Linear systems
Abstract: In this work, we deal with an equivalent input-output description of non-minimum phase multi-input multi-output Linear Time Invariant systems and its use in the output feedback stabilisation problem. We first present a state-space inner-outer decomposition of the original system, with the original system shown to be input-output equivalent to the cascade of a right invertible minimum phase (outer) system feeding and an all-pass non-minimum phase (inner) one. By assuming the existence of a state feedback stabiliser for the outer minimum-phase system, we then show the existence of an output feedback stabiliser for the original system such that its output behaviour is arbitrarily close to the output of the controlled outer system (e.g., reducing the size of undershoot in the system response), except that in an initial time interval that can be kept arbitrarily short, provided that the dynamics of the controlled outer system are sufficiently slow.
|
|
16:40-17:00, Paper ThCT09.3 | Add to My Program |
Convex Parameterization of Stabilizing Controllers and Its LMI-Based Computation Via Filtering |
|
de Oliveira, Mauricio | University of California, San Diego |
Zheng, Yang | University of California San Diego |
Keywords: Stability of linear systems, Linear systems, LMIs
Abstract: Various new implicit parameterizations for stabilizing controllers that allow one to impose structural constraints on the controller have been proposed lately. They are convex but infinite-dimensional, formulated in the frequency~domain with no available efficient methods for computation. In this paper, we introduce a kernel version of the Youla parameterization to characterize the set of stabilizing controllers. It features a single affine constraint, which allows us to recast the controller parameterization as a novel robust filtering problem. This makes it possible to derive the first efficient Linear Matrix Inequality (LMI) implicit parametrization of stabilizing controllers. Our LMI characterization not only admits efficient numerical computation, but also guarantees a full-order stabilizing dynamical controller that is efficient for practical deployment. Numerical experiments demonstrate that our LMI can be orders of magnitude faster to solve than the existing closed-loop parameterizations.
|
|
17:00-17:20, Paper ThCT09.4 | Add to My Program |
Harmonic Pole Placement |
|
Riedinger, Pierre | Université De Lorraine / CNRS UMR7039 |
Daafouz, Jamal | Université De Lorraine, CRAN, CNRS |
Keywords: Stability of linear systems, Time-varying systems, Electrical machine control
Abstract: In this paper, we propose a method to design state feedback harmonic control laws that assign the closed loop poles of a linear harmonic model to some desired locations. The procedure is based on the solution of an infinite-dimensional harmonic Sylvester equation under an invertibility constraint. We provide a sufficient condition to ensure this invertibility and show how this infinite-dimensional Sylvester equation can be solved up to an arbitrary small error. The results are illustrated on an unstable linear periodic system. We also provide a counter-example to illustrate the fact that, unlike the classical finite dimensional case, the solution of the Sylvester equation may not be invertible in the infinite dimensional case even if an observability condition is satisfied.
|
|
17:20-17:40, Paper ThCT09.5 | Add to My Program |
Implementation-Oriented Filtered PID Control: Robustness Margins |
|
Mao, Qi | City University of Hong Kong |
Xu, Yong | Guangdong University of Technology |
Chen, Jie | City University of Hong Kong |
Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Stability of linear systems, PID control, Robust control
Abstract: In this paper, we investigate the gain and phase robustness of PID control in stabilizing linear time-invariant (LTI) systems subject to gain and phase variations. We consider specifically filtered PID controllers, out of the necessity in practical implementation of PID controllers. We examine first-order unstable systems and seek to find analytical expressions of the maximal gain and phase margins achievable by filtered PID control, where the maximal gain and phase margins are referred to as the largest ranges of gain and phase variations within which the system is guaranteed to be stabilizable. We also investigate the gain and phase maximization problems subject to steady-state tracking performance constraint. Our results show how in a practically implementable PID control scheme the gain and phase margins may be confined by the plant characteristics. The results also shed light into the tradeoff between performance and robustness of PID controllers.
|
|
17:40-18:00, Paper ThCT09.6 | Add to My Program |
Necessary and Sufficient Convex Condition for the Stabilization of Linear Sampled-Data Systems under Poisson Sampling Process |
|
Denardi Huff, Daniel | Université Grenoble Alpes |
Fiacchini, Mirko | CNRS, Univ. Grenoble Alpes |
Gomes da Silva Jr, Joao Manoel | Universidade Federal Do Rio Grande Do Sul (UFRGS) |
Keywords: Sampled-data control, Stability of linear systems, Stochastic systems
Abstract: This work presents a control design method for linear sampled-data systems whose random sampling intervals form a Poisson process. Unlike a previous result in the literature, the proposed stabilization conditions, based on linear feedbacks of both the state and the past input values, are necessary and sufficient for the mean exponential stability of the system. Moreover, such non-conservative conditions correspond to linear matrix inequalities, implying then that the stabilization problem can be efficiently addressed through semidefinite programming. As a second contribution, the characterization and optimization of the mean exponential convergence rate of the closed-loop system is given in form of a generalized eigenvalue problem. A numerical example illustrates the theoretical results.
|
|
ThCT10 Regular Session, Maya Ballroom II |
Add to My Program |
Mean Field Games |
|
|
Chair: Caines, Peter E. | McGill University |
Co-Chair: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
|
16:00-16:20, Paper ThCT10.1 | Add to My Program |
Markowitz Portfolio Optimization Extended Quadratic Mean-Field Games Approach |
|
Saude, Joao | McGill University |
Caines, Peter E. | McGill University |
Keywords: Mean field games, Finance, Modeling
Abstract: In this paper, we study the Markowitz dynamic portfolio problem, where individual agents seek to maximize their expected return while minimizing the variance of the return (risk). We model a market with a large number of similar risk-averse agents. For the first time, we incorporate the impact of the agents' aggregated trades using the extended mean-field games framework. We illustrate our results with simulations.
|
|
16:20-16:40, Paper ThCT10.2 | Add to My Program |
Master Equation for Discrete-Time Stackelberg Mean Field Games with Single Leader |
|
Vasal, Deepanshu | University of Michigan, Ann Arbor |
Berry, Randall A. | Northwestern University |
Keywords: Mean field games, Game theory, Stochastic optimal control
Abstract: In this paper, we consider a discrete-time Stackelberg mean field game with a leader and an infinite number of followers. The leader and the followers each observe types privately that evolve as conditionally independent controlled Markov processes. The leader commits to a dynamic policy and the followers best respond to that policy and each other. Knowing that the followers would play a mean field game based on her policy, the leader chooses a policy that maximizes her reward. We refer to the resulting outcome as a Stackelberg mean field equilibrium (SMFE). In this paper, we provide a master equation of this game that allows one to compute all SMFE. Based on our framework, we consider two numerical examples. First, we consider an epidemic model where the followers get infected based on the mean field population. The leader chooses subsidies for a vaccine to maximize social welfare and minimize vaccination costs. In the second example, we consider a technology adoption game where the followers decide to adopt a technology or a product and the leader decides the cost of one product that maximizes his returns, which are proportional to the people adopting that technology.
|
|
16:40-17:00, Paper ThCT10.3 | Add to My Program |
How Does a Rational Agent Act in an Epidemic? (I) |
|
Olmez, Sukru Yagiz | University of Illinois at Urbana-Champaign |
Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Kim, Jin Won | University of Potsdam |
Miehling, Erik | University of Illinois at Urbana-Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
West, Matthew | Univ of Illinois, Urbana-Champaign |
Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Keywords: Mean field games, Stochastic optimal control, Game theory
Abstract: The evolution of a disease in a large population is a function of the top-down policy measures from a centralized planner and the self-interested decisions (to be socially active) of individual agents in a large heterogeneous population. This paper is concerned with understanding the latter based on a mean-field type optimal control model. Specifically, the model is used to investigate the role of partial information on an agent's decision-making and study the impact of such decisions by a large number of agents on the spread of the virus in the population. The motivation comes from the presymptomatic and asymptomatic spread of the COVID-19 virus, where an agent unwittingly spreads the virus. We show that even in a setting with fully rational agents, limited information on the viral state can result in epidemic growth.
|
|
17:00-17:20, Paper ThCT10.4 | Add to My Program |
Carbon Emission-Aware Storage Control Via Mean Field Game Coordination |
|
Zhang, Yaoyu | Tsinghua University |
Kang, Yuhan | University of Houston |
Wu, Chenye | The Chinese University of Hong Kong, Shenzhen |
Shi, Jian | University of Houston |
Wang, Dan | The Hong Kong Polytechnic University |
Han, Zhu | University of Houston |
Keywords: Mean field games, Optimal control, Smart grid
Abstract: The uncertainties in the renewable outputs are challenging the reliable operation of the grid. Integrating more storage systems is considered a promising solution as they provide the most urgently needed flexibility to the grid. However, it is challenging to coordinate a large number of storage systems due to their heterogeneous requirements. In this paper, we utilize the notion of the mean field game (MFG) to tackle this challenge. Specifically, we consider the carbon emission-aware storage control in an electricity market with carbon tax. The storage systems are allowed to arbitrage, and yet their behaviors are coupled through the market, yielding the MFG formulation. We first show the existence and uniqueness of the MFG equilibrium, and then design a dynamic programming algorithm to achieve the MFG equilibrium. Numerical studies further demonstrate the proposed MFG algorithm's effectiveness in terms of both benefiting the storage systems and reducing carbon emissions.
|
|
17:20-17:40, Paper ThCT10.5 | Add to My Program |
Embedded Vertexon-Graphons and Embedded GMFG Systems (I) |
|
Caines, Peter E. | McGill University |
Keywords: Mean field games, Large-scale systems, Stochastic systems
Abstract: An (embedded) vertexon in a connected compact set M in R^{m} is defined to be the vertex set of a graph together with an asymptotically dense partition hierarchy of M. It is shown that sequences of vertexons have subsequential vertexon limits measures in M independent of the particular partition hierarchy employed within a large class of hierarchies. Consequently, the differentiation of functions on vertexon limits with open support is well defined. Further, along these sequences the associated sequence of graphs have subsequential graph edge limits termed embedded graphons; the resulting embedded vertexon-graphon limit (measure) pair permits the definition of critical nodes and other features involving differentiation with respect to node location. This is significant for the analysis of Nash value functions for Embedded Graphon Mean Field Game (EGMFG) systems, that is to say, GMFG systems with nodes lying in a limiting vertexon in M subset R^{m} with open support and associated embedded graphons defined on M^{2}. A further feature of vertexon limits is that they provide a form of node population density which extends the local agent population aspect of embedded GMFG systems.
|
|
17:40-18:00, Paper ThCT10.6 | Add to My Program |
Optimal Network Location in Infinite Horizon LQG Graphon Mean Field Games (I) |
|
Foguen Tchuendom, Rinel | McGill University |
Gao, Shuang | UC Berkeley, McGill University |
Huang, Minyi | Carleton University |
Caines, Peter E. | McGill University |
Keywords: Mean field games, Large-scale systems
Abstract: We propose to study a class of infinite horizon linear quadratic Gaussian Graphon Mean Field Games (GMFGs) inspired by the infinite horizon Mean Field Games in [1]. Graphon Mean Field Games (GMFGs) are non-uniform generalizations of Mean Field Games where the non-uniformity of agents is characterized by the nodes on which they are located in a network. Under mild conditions, we obtain for almost every node, an analytical expression for the cost at GMFG equilibrium, and propose a necessary and sufficient condition under which a particular node in the network is associated with the minimal cost at GMFG equilibrium.
|
|
ThCT11 Regular Session, Maya Ballroom III |
Add to My Program |
Uncertain Systems II |
|
|
Chair: Formentin, Simone | Politecnico Di Milano |
Co-Chair: Oliveira, Vilma A. | Universidade De Sao Paulo |
|
16:00-16:20, Paper ThCT11.1 | Add to My Program |
Robust Safe Control Synthesis with Disturbance Observer-Based Control Barrier Functions |
|
Das, Ersin | Caltech |
Murray, Richard M. | California Inst. of Tech |
Keywords: Supervisory control, Lyapunov methods, Uncertain systems
Abstract: In a complex real-time operating environment, external disturbances and uncertainties adversely affect the safety, stability, and performance of dynamical systems. This paper presents a robust stabilizing safety-critical controller synthesis framework with control Lyapunov functions (CLFs) and control barrier functions (CBFs) in the presence of disturbance. A high-gain input observer method is adapted to estimate the time-varying unmodelled dynamics of the CBF with an error bound using the first-order time derivative of the CBF. This approach leads to an easily tunable low-order disturbance estimator structure with a design parameter as it utilizes only the CBF constraint. The estimated unknown input and associated error bound are used to ensure robust safety and exponential stability by formulating a CLF-CBF quadratic program. The proposed method is applicable to both relative degree one and higher relative degree CBF constraints. The efficacy of the proposed approach is demonstrated using a numerical simulations of an adaptive cruise control system and a Segway platform with an external disturbance.
|
|
16:20-16:40, Paper ThCT11.2 | Add to My Program |
Kelly-Based Stock Trading Via Feedback Control |
|
Dettù, Federico | Politecnico Di Milano |
Abbracciavento, Francesco | Politecnico Di Milano |
Formentin, Simone | Politecnico Di Milano |
Keywords: Finance, Adaptive systems, Identification
Abstract: In stock trading, Kelly betting has recently gained much popularity as a methodologically sound approach to the computation of optimal investment fractions. In this work, by extending Kelly’s original idea to time series, we develop a novel closed-loop trading investment strategy. The proposed algorithm first estimates a probabilistic model of the price dynamics from the available data, then employs the identified model to compute the investment fraction, resorting to Kelly betting theory. Experimental validation of the algorithm on real-world market quotations shows promising results against state-of-the-art solutions.
|
|
16:40-17:00, Paper ThCT11.3 | Add to My Program |
Correct-By-Design Control of Parametric Stochastic Systems |
|
Schön, Oliver | Newcastle University |
van Huijgevoort, Birgit | TU Eindhoven |
Haesaert, Sofie | Eindhoven University of Technology |
Soudjani, Sadegh | Newcastle University |
Keywords: Formal Verification/Synthesis, Stochastic systems, Uncertain systems
Abstract: This paper addresses the problem of computing controllers that are correct by design for safety-critical systems and can provably satisfy (complex) functional requirements. We develop new methods for models of systems subject to both stochastic and parametric uncertainties. We provide for the first time novel simulation relations for enabling correct-by-design control refinement, that are founded on coupling uncertainties of stochastic systems via sub-probability measures. Such new relations are essential for constructing abstract models that are related to not only one model but to a set of parameterized models. We provide theoretical results for establishing this new class of relations and the associated closeness guarantees for both linear and nonlinear parametric systems with additive Gaussian uncertainty. The results are demonstrated on a linear model and the nonlinear model of the Van der Pol Oscillator.
|
|
17:00-17:20, Paper ThCT11.4 | Add to My Program |
Handling Uncertanity in Decision Support Systems Based on Pythagorean Fuzzy Sets |
|
Kizielewicz, Bartłomiej | National Institute of Telecommunications |
Shekhovtsov, Andrii | National Institute of Telecommunications |
Sałabun, Wojciech | West Pomeranian University of Tehchnology in Szczecin |
Keywords: Fuzzy systems, Computational methods, Uncertain systems
Abstract: One of the branches related to artificial intelligence is decision-making based on expert knowledge. Unfortunately, extracting this knowledge is very difficult and complex. Additionally, it is accompanied by the coming up of uncertain data. In order to model human knowledge more accurately, fuzzy sets and their generalizations have been developed. Today's challenges are mainly related to handling uncertain data in a decision support system. Therefore, this paper uses Pythagorean fuzzy sets (PFS) as the popular generalization used to model human knowledge. We propose a new approach to solving problems using the COMET method and the PFS score function. As a result, we obtain an interesting hybrid approach of the MABAC and COMET methods to identify the decision model and give reliable decisions. The proposed solution combines the best features of these methods. Finally, we compare our proposition with the PFS TOPSIS method to show the superiority and limitations of our proposition.
|
|
17:20-17:40, Paper ThCT11.5 | Add to My Program |
Extension of the SPOTIS Method for the Rank Reversal Free Decision-Making under Fuzzy Environment |
|
Shekhovtsov, Andrii | National Institute of Telecommunications |
Paradowski, Bartosz | West Pomeranian University of Tehchnology in Szczecin |
Więckowski, Jakub | National Institute of Telecommunications |
Kizielewicz, Bartłomiej | National Institute of Telecommunications |
Sałabun, Wojciech | West Pomeranian University of Tehchnology in Szczecin |
Keywords: Fuzzy systems, Computational methods, Uncertain systems
Abstract: The growing popularity of the usability of MCDA (Multi-Criteria Decision Analysis) methods has resulted in many studies verifying their applicability and quality of performance. Due to the possibility of different data representation in multi-criteria problems, the issues addressed may be in the space of crisp values or an uncertain environment. In addition, due to the quality of the guaranteed results, methods that provide resistance to the ranking reversal phenomenon are desirable. This paper proposes an extension of the SPOTIS (Stable Preference Ordering Towards Ideal Solution) method, free of the rank reversal phenomenon in multi-criteria problems under a fuzzy environment. The proof of concept was presented based on two numerical examples, presenting the method's performance and usefulness in decision-making. Obtained results showed that the proposed approach could be effectively used when handling the uncertain conditions of multi-criteria problems.
|
|
17:40-18:00, Paper ThCT11.6 | Add to My Program |
State Feedback Design for TS Fuzzy Systems Subject to Persistent Bounded Disturbances |
|
Magossi, Rafael | Solar21 |
Elias, Leandro J. | Instituto Federal De Educação, Ciência E Tecnologia De São Paulo |
Faria, Flávio A. | UNESP - Univ Estadual Paulista |
Oliveira, Vilma A. | Universidade De Sao Paulo |
Keywords: Fuzzy systems, LMIs, Nonlinear systems
Abstract: In this research, a state feedback design for a class of continuous-time nonlinear systems under persistent bounded disturbance is designed considering a L1 performance index. The state feedback design is based on a switched fuzzy PDC controller. The conservativeness of the sufficient conditions given in terms of LMIs is decreased by the use of a switched Lyapunov-like function and an alternative description for the time derivative of membership functions. Numerical examples illustrate the efficiency of the proposed state feedback design.
|
|
ThCT12 Regular Session, Maya Ballroom IV |
Add to My Program |
Modeling and Control of Epidemics |
|
|
Chair: Como, Giacomo | Politecnico Di Torino |
Co-Chair: Nowzari, Cameron | George Mason University |
|
16:00-16:20, Paper ThCT12.1 | Add to My Program |
Bi-SIS Epidemics on Graphs - Quantitative Analysis of Coexistence Equilibria |
|
Doshi, Vishwaraj | North Carolina State University |
Hu, Jie | North Carolina State University |
Eun, Do Young | North Carolina State University |
Keywords: Biological systems, Communication networks
Abstract: We consider a system in which two viruses of the Susceptible-Infected-Susceptible (SIS) type compete over general, overlaid graphs. While such systems have been the focus of many recent works, they have mostly been studied in the sense of convergence analysis, with no existing results quantifying the non-trivial coexistence equilibria (CE) - that is, when both competing viruses maintain long term presence over the network. In this paper, we prove monotonicity of the CE with respect to effective infection rates of the two viruses, and provide the first quantitative analysis of such equilibria in the form of upper bounds involving spectral radii of the underlying graphs, as well as positive equilibria of related single-virus systems. Our results provide deeper insight into how the long term infection probabilities are affected by system parameters, which we further highlight via numerical results.
|
|
16:20-16:40, Paper ThCT12.2 | Add to My Program |
Multiple Peaks in Network SIR Models |
|
Alutto, Martina | Politecnico Di Torino |
Cianfanelli, Leonardo | Politecnico Di Torino |
Como, Giacomo | Politecnico Di Torino |
Fagnani, Fabio | Politecnico Di Torino |
Keywords: Network analysis and control, Nonlinear systems
Abstract: We study network SIR (Susceptible - Infected - Recovered) epidemic models in the case of two interacting populations. We analyze the dynamics behavior of the fractions of infected individuals in the two populations. In contrast to the classical scalar SIR epidemic model, where the fraction of infected individuals is known to have an unimodal behavior (either decreasing throughout time or initially increasing, until reaching a peak and decreasing everafter), we show the possible occurrence of a novel multimodal behaviors in the network SIR model. Specifically, we show that the curve of the fraction of infected individuals in a population may incur in a change of monotonicity even when it starts with a decreasing trend. Our analysis focuses on a homogeneous mixing model, whereby all contacts have unitary frequency. We study the initial conditions and network characteristics sufficient for the aforementioned multimodal behavior to emerge and those that instead guarantee the classical unimodal behavior.
|
|
16:40-17:00, Paper ThCT12.3 | Add to My Program |
Optimal Capacity-Constrained COVID-19 Vaccination for Heterogeneous Populations |
|
Arghal, Raghu | University of Pennsylvania |
Bidokhti, Shirin Saeedi | University of Pennsylvania |
Sarkar, Saswati | University of Pennsylvania |
Keywords: Network analysis and control, Optimal control
Abstract: COVID-19 and the ensuing vaccine capacity constraints have emphasized the importance of proper prioritization during vaccine rollout. This problem is complicated by heterogeneity in risk levels, contact rates, and network topology which can dramatically and unintuitively change the efficacy of vaccination and must be taken into account when allocating resources. This paper proposes a general model to capture a wide array of network heterogeneity while maintaining computational tractability and formulates vaccine prioritization as an optimal control problem. Pontryagin's Maximum Principle is used to derive properties of optimal, potentially highly dynamic, allocation policies, providing significant reductions in the set of candidate policies. Extensive numerical simulations of COVID-19 vaccination are used to corroborate these findings and further illicit optimal policy characteristics and the effects of various system, disease, and population parameters.
|
|
17:00-17:20, Paper ThCT12.4 | Add to My Program |
Individual Non-Pharmaceutical Intervention Strategies for Stochastic Networked Epidemics |
|
Mubarak, Mohammad | George Mason University |
Berneburg, James | George Mason University |
Nowzari, Cameron | George Mason University |
Keywords: Networked control systems, Markov processes, Stochastic optimal control
Abstract: Unlike the vast majority of works that study the spread of epidemic processes in search of top-down policy recommendations or resource allocation strategies in eradicating an unwanted virus, in this paper we instead consider how the virus spreads at the person-to-person level. More specifically, based on the locally available information to a particular person, how should that person make use of this information to better protect themself? How can that person socialize as much as possible while ensuring some desired level of safety? More formally, the solution to this problem requires a rigorous understanding of the trade-off between socializing with potentially infected individuals and the increased risk of infection. We set this up as a finite time optimal control problem using a well known exact Markov chain compartmental Susceptible-Infected-Removed (SIR) model. Unfortunately, the problem set up is intractable and requires a relaxation. Leveraging results from the literature, we employ a commonly used mean-field approximation (MFA) technique to relax the problem. We find that the optimal solution of the problem to be a form of threshold on the chance of infection of the neighbors of that person. Simulations illustrate our results.
|
|
17:20-17:40, Paper ThCT12.5 | Add to My Program |
Impulsive Neural Control to Schedule Antivirals and Immunomodulators for COVID-19 |
|
Hernandez-Mejia, Gustavo | Institute of Epidemiology and Social Medicine, University of Mü |
Sanchez, Edgar N. | CINVESTAV |
Victor, Chan | CINVESTAV |
Hernandez-Vargas, Esteban Abelardo | University of Idaho |
Keywords: Biomedical, Biological systems, Systems biology
Abstract: New SARS-CoV-2 variants escaping the effect of vaccines are an eminent threat. The use of antivirals to inhibit the viral replication cycle or immunomodulators to regulate host immune responses can help to tackle the viral infection at the host level. To evaluate the potential use of these therapies, we propose the application of an inverse optimal neural controller to a mathematical model that represents SARS-CoV-2 dynamics in the host. Antiviral effects and immune responses are considered as the control actions. The variability between infected hosts can be large, thus, the host infection dynamics are identified based on a Recurrent High-Order Neural Network (RHONN) trained with the Extended Kalman Filter (EKF). The performance of the control strategies is tested by employing a Monte Carlo analysis. Simulation results present different scenarios where potential antivirals and immunomodulators could reduce the viral load.
|
|
17:40-18:00, Paper ThCT12.6 | Add to My Program |
Robust State Estimation of Nonlinear Systems Using High-Gain Transmissibility (I) |
|
Khalil, Abdelrahman | Memorial University of Newfoundland |
Boker, Almuatazbellah | Virginia Tech |
Al Janaideh, Mohammad | Memorial University of Newfoundland |
Keywords: Estimation, Observers for nonlinear systems
Abstract: Transmissibility operators are mathematical relations that connect unknown system outputs. In previous work, transmissibility operators showed robustness against unknown nonlinear system dynamics. Inspired by the high-gain observers, this paper extends transmissibility operators to the form of high-gain transmissibility. High-gain transmissibility is then used for the robust estimation of the system states. No direct measurements of the system states are available. The system model in this work can follow any non-canonical form. We show that the high-gain transmissibility estimator is able to robustly estimate the system states in the presence of unknown system nonlinearities that affect all states, and part of the output measurement is corrupted by noise. The potential of the proposed estimator is demonstrated through a simulation example.
|
|
ThCT13 Regular Session, Maya Ballroom V |
Add to My Program |
Applications of Optimization |
|
|
Chair: Jungers, Raphaël M. | University of Louvain |
Co-Chair: Charalambous, Themistoklis | University of Cyprus |
|
16:00-16:20, Paper ThCT13.1 | Add to My Program |
Distributed Resource Allocation Via ADMM Over Digraphs |
|
Jiang, Wei | Aalto University, Finland |
Doostmohammadian, Mohammadreza | Aalto University |
Charalambous, Themistoklis | University of Cyprus |
Keywords: Optimization algorithms, Networked control systems, Agents-based systems
Abstract: In this paper, we solve the resource allocation problem over a network of agents, with edges as communication links that can be unidirectional. The goal is to minimize the sum of allocation cost functions subject to a coupling constraint in a distributed way by using the finite-time consensus-based alternating direction method of multipliers (ADMM) technique. In contrast to the existing gradient descent (GD) based distributed algorithms, our approach can be applied to non-differentiable cost functions. Also, the proposed algorithm is initialization-free and converges at a rate of O(1/k), where k is the optimization iteration counter. The fast convergence performance related to iteration counter k compared to state-of-the-art GD based algorithms is shown via a simulation example.
|
|
16:20-16:40, Paper ThCT13.2 | Add to My Program |
Distributionally Robust Decision Making Leveraging Conditional Distributions |
|
Chen, Yuxiao | Nvidia Corporation |
Kim, Jip | Columbia University |
Anderson, James | Columbia University |
Keywords: Optimization, Optimization algorithms, Power systems
Abstract: Distributionally robust optimization (DRO) is a powerful tool for decision-making under uncertainty. It is particularly appealing because of its ability to leverage existing data. However, many practical problems call for decision making with some auxiliary information, and DRO in the context of conditional distribution is not straightforward. We propose a conditional kernel distributionally robust optimization (CKDRO) method that enables robust decision making under conditional distributions through kernel DRO and the conditional mean operator in the reproducing kernel Hilbert space (RKHS). In particular, we consider problems where there is a correlation between the unknown variable y and an auxiliary observable variable x. Given past data of the two variables and a queried auxiliary variable, CKDRO represents the conditional distribution P(y|x) as the conditional mean operator in the RKHS space and quantifies the ambiguity set in the RKHS as well, which depends on the size of the dataset as well as the query point. To justify the use of RKHS, we demonstrate that the ambiguity set defined in RKHS can be viewed as a ball under a metric that is similar to the Wasserstein metric. The DRO is then dualized and solved via a finite-dimensional convex program. The proposed CKDRO approach is applied to a generation scheduling problem and shows that the result of CKDRO is superior to common benchmarks in terms of quality and robustness.
|
|
16:40-17:00, Paper ThCT13.3 | Add to My Program |
Maximum Mean Discrepancy Distributionally Robust Nonlinear Chance-Constrained Optimization with Finite-Sample Guarantee |
|
Nemmour, Yassine | Max Planck Institute for Intelligent Systems |
Kremer, Heiner | Max Planck Institute for Intelligent Systems |
Schölkopf, Bernhard | MPI for Biological Cybernetics |
Zhu, Jia-Jie | Max Planck Institute for Intelligent Systems |
Keywords: Optimization, Stochastic optimal control, Robust control
Abstract: This paper is motivated by addressing open questions in distributionally robust chance-constrained programs (DRCCP) using the popular Wasserstein ambiguity sets. Specifically, the computational techniques for those programs typically place restrictive assumptions on the constraint functions. The size of the Wasserstein ambiguity sets is often set using the costly cross-validation (CV) procedures or conservative measure concentration bounds. In contrast, we propose a practical DRCCP algorithm using kernel maximum mean discrepancy (MMD) ambiguity sets, which we term MMD-DRCCP, to treat general nonlinear constraints without using ad-hoc reformulation techniques. MMD-DRCCP can handle general non-linear and non-convex constraints with a proven finite-sample constraint satisfaction guarantee of a dimension-independent O(1/sqrt(N)) rate, achievable by a practical algorithm. We further propose an efficient bootstrap scheme for constructing sharp MMD ambiguity sets in practice without resorting to CV. Our algorithm is validated numerically on a portfolio optimization problem and a tube-based distributionally robust model predictive control problem with non-convex constraints.
|
|
17:00-17:20, Paper ThCT13.4 | Add to My Program |
Probabilistic Guarantees on the Objective Value for the Scenario Approach Via Sensitivity Analysis |
|
Wang, Zheming | Université Catholique De Louvain |
Jungers, Raphaël M. | University of Louvain |
Keywords: Optimization, Optimization algorithms, Randomized algorithms
Abstract: This paper is concerned with objective value performance of the scenario approach for robust convex optimization. A novel method is proposed to derive probabilistic bounds for the objective value from scenario programs with a finite number of samples. This method relies on a max-min reformulation and the concept of complexity of robust optimization problems. With additional continuity and regularity conditions, via sensitivity analysis, we also provide explicit bounds which outperform an existing result in the literature. To illustrate the improvements of our results, we also provide a numerical example.
|
|
17:20-17:40, Paper ThCT13.5 | Add to My Program |
Safe Bayesian Optimization Using Interior-Point Methods - Applied to Personalized Insulin Dose Guidance |
|
Krishnamoorthy, Dinesh | TU Eindhoven |
Doyle III, Francis J. | Harvard University |
Keywords: Optimization, Machine learning, Healthcare and medical systems
Abstract: This paper considers the problem of Bayesian optimization for systems with safety-critical constraints, where both the objective function and the constraints are unknown, but can be observed by querying the system. In safety-critical applications, querying the system at an infeasible point can have catastrophic consequences. Such systems require a safe learning framework, such that the performance objective can be optimized while satisfying the safety-critical constraints with high probability. In this paper we propose a safe Bayesian optimization framework that ensures that the points queried are always in the interior of the partially revealed safe region, thereby guaranteeing constraint satisfaction with high probability. The proposed interior-point Bayesian optimization framework can be used with any acquisition function, making it broadly applicable. The performance of the proposed method is demonstrated using a personalized insulin dosing application for patients with type 1 diabetes.
|
|
17:40-18:00, Paper ThCT13.6 | Add to My Program |
Explicit Solutions for Safety Problems Using Control Barrier Functions |
|
Wang, Han | University of Oxford |
Papachristodoulou, Antonis | University of Oxford |
Margellos, Kostas | University of Oxford |
Keywords: Optimization, Nonlinear systems, Constrained control
Abstract: The control barrier function approach has been widely used for safe controller synthesis. By solving an online convex quadratic programming problem, an optimal safe controller can be synthesized implicitly. Since the solution is unique, the mapping from the state-space to the control inputs is injective, thus enabling us to evaluate the underlying relationship. In this paper we aim at explicitly synthesizing a safe control law as a function of the state for nonlinear control-affine systems with limited control ability. We transform the online quadratic programming problem into an offline parameterized optimisation problem which considers states as parameters. The obtained explicit safe controller is shown to be a piece-wise Lipschitz continuous function over the partitioned state space if the program is feasible. We address the infeasible cases by solving a parameterized adaptive control barrier function-based quadratic programming problem. Extensive simulation results show the state-space partition and the controller properties.
|
|
ThCT14 Invited Session, Maya Ballroom VI |
Add to My Program |
Trends in Optimization for Power Systems |
|
|
Chair: Engelmann, Alexander | TU Dortmund University |
Co-Chair: Jiang, Yuning | EPFL |
Organizer: Engelmann, Alexander | TU Dortmund University |
Organizer: Du, Xu | Shanghaitech |
Organizer: Jiang, Yuning | EPFL |
Organizer: Houska, Boris | ShanghaiTech University |
Organizer: Faulwasser, Timm | TU Dortmund University |
|
16:00-16:20, Paper ThCT14.1 | Add to My Program |
Scheduling Delays and Curtailment for Household Appliances with Deterministic Load Profiles Using MPC |
|
Lian, Yingzhao | EPFL |
Jiang, Yuning | EPFL |
Jones, Colin N. | EPFL |
Opila, Daniel F. | United States Naval Academy |
Keywords: Smart cities/houses, Predictive control for linear systems
Abstract: Smart home appliances can time-shift and curtail their power demand to assist demand side management or allow operation with limited power, as in an off-grid application. This paper proposes a scheduling process to start appliances with time-varying deterministic load profiles. Self-triggered model predictive control is used to limit the household net power demand below a given threshold. Meanwhile, deterministic load profiles are more difficult to schedule compared to variable charging or thermal loads because system failure will occur once power demand is not satisfied. The proposed scheme formulates the decision of the load shifting time as a continuous optimization problem, and an inhomogeneous time grid system is introduced to handle the optimization of different appliances and their consensus at this resolution. The efficacy of the proposed scheme is studied by numerical comparison with a mixed-integer MPC controller and by a case study of three home appliances and an interruptible washing machine.
|
|
16:20-16:40, Paper ThCT14.2 | Add to My Program |
Approximations for Optimal Experimental Design in Power System Parameter Estimation (I) |
|
Du, Xu | ShanghaiTech University |
Engelmann, Alexander | TU Dortmund University |
Faulwasser, Timm | TU Dortmund University |
Houska, Boris | ShanghaiTech University |
Keywords: Estimation, Optimization, Power systems
Abstract: This paper is about computationally tractable methods for power system parameter estimation and Optimal Experiment Design (OED). The main motivation of OED is to increase the accuracy of power system parameter estimates for a given number of batches. One issue in OED, however, is that solving the OED problem for larger power grids turns out to be computationally expensive and, in many cases, computationally intractable. Therefore, the present paper proposes three numerical approximation techniques, which increase the computational tractability of OED for power systems. These approximation techniques are benchmarked on a 5-bus and a 14-bus case study.
|
|
16:40-17:00, Paper ThCT14.3 | Add to My Program |
Optimal Control Configuration in Distribution Network Via an Exact OPF Relaxation Method (I) |
|
Sekhavatmanesh, Hossein | FHNW |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Mastellone, Silvia | University of Applied Science Northwestern Switzerland FHNW |
Keywords: Decentralized control, Optimal control, Power systems
Abstract: In the last decade, the integration of Renewable Energy Sources (RESs) in distribution networks has been constantly increasing due to their many technical, economical, and environmental benefits. However, the large-scale penetration of RESs is limited by the grid security constraints, e.g., voltage and current limits. The control of inverter-based RESs can guarantee compliance with those constraints while preserving the RESs performance. However, the installation of additional controllable converter units introduces additional investment costs and has therefore to be limited. In this paper, a Mixed-Integer Second-Order Cone (MISOCP) optimization problem is developed to optimally configure the layout of the Q-V and P-V droop controllers for the RES converters. The controllers’ layout is optimized to minimize investment, maintenance, and energy purchase costs subject to the system constraints. To provide an accurate and convex model of these constraints, we adapt an augmented relaxation method, recently proposed to address the optimal power flow problem in radial distribution networks. Our method is tested on a standard IEEE 34-bus network and the results are compared to those provided by standard approaches.
|
|
17:00-17:20, Paper ThCT14.4 | Add to My Program |
Sharing the Load: Considering Fairness in De-Energization Scheduling to Mitigate Wildfire Ignition Risk Using Rolling Optimization (I) |
|
Kody, Alyssa | Argonne National Laboratory |
West, Amanda | Georgia Institute of Technology |
Molzahn, Daniel | Georgia Institute of Technology |
Keywords: Power systems, Optimization, Energy systems
Abstract: Wildfires are a threat to public safety and have increased in both frequency and severity due to climate change. To mitigate wildfire ignition risks, electric power system operators proactively de-energize high-risk power lines during "public safety power shut-off" (PSPS) events. Line de-energizations can cause communities to lose power, which may result in negative economic, health, and safety impacts. Furthermore, the same communities may repeatedly experience power shutoffs over the course of a wildfire season, which compounds these negative impacts. However, there may be many combinations of power lines whose de-energization will result in about the same reduction of system-wide wildfire risk, but the associated power outages affect different communities. Therefore, one may raise concerns regarding the fairness of de-energization decisions. Accordingly, this paper proposes a framework to select lines to de-energize in order to balance wildfire risk reduction, total load shedding, and fairness considerations. The goal of the framework is to prevent a small fraction of communities from disproportionally being impacted by PSPS events, and to instead more equally share the burden of power outages. For a geolocated test case in the southwestern United States, we use actual California demand data as well as real wildfire risk forecasts to simulate PSPS events during the 2021 wildfire season and compare the performance of various methods for promoting fairness. Our results demonstrate that the proposed formulation can provide significantly more fair outcomes with limited impacts on system-wide performance.
|
|
17:20-17:40, Paper ThCT14.5 | Add to My Program |
Towards Optimal Kron-Based Reduction of Networks (Opti-KRON) for the Electric Power Grid (I) |
|
Chevalier, Samuel | MIT |
Almassalkhi, Mads | University of Vermont |
Keywords: Power systems, Optimization algorithms, Optimization
Abstract: For fast timescales or long prediction horizons, the AC optimal power flow (OPF) problem becomes a computational challenge for large-scale, realistic AC networks. To overcome this challenge, this paper presents a novel network reduction methodology that leverages an efficient mixed-integer linear programming (MILP) formulation of a Kron-based reduction that is optimal in the sense that it balances the degree of the reduction with resulting modeling errors in the reduced network. The method takes as inputs the full AC network and a pre-computed library of AC load flow data and uses the graph Laplacian to constraint nodal reductions to only be feasible for neighbors of non-reduced nodes. This results in a highly effective MILP formulation which is embedded within an iterative scheme to successively improve the Kron-based network reduction until convergence. The resulting optimal network reduction is, thus, grounded in the physics of the full network. The accuracy of the network reduction methodology is then explored for a 100+ node medium-voltage radial distribution feeder example across a wide range of operating conditions. It is finally shown that a network reduction of 25-85% can be achieved within seconds and with worst-case voltage magnitude deviation errors within any super node cluster of less than 0.01pu. These results illustrate that the proposed optimization-based approach to Kron reduction of networks is viable for larger networks and suitable for use within various power system applications.
|
|
17:40-18:00, Paper ThCT14.6 | Add to My Program |
Time-Domain Generalization of Kron Reduction |
|
Singh, Manish Kumar | University of Minnesota |
Dhople, Sairaj | University of Minnesota |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Giannakis, Georgios B. | University of Minnesota |
Keywords: Model/Controller reduction, Power systems, Network analysis and control
Abstract: Kron reduction is a network-reduction method that eliminates nodes with zero current injections from electrical networks operating in sinusoidal steady state. In the time domain, the state-of-the-art application of Kron reduction has been in networks with transmission lines that have constant R/L ratios. In contrast, this paper considers the generalized setting of RL networks without such restriction and puts forth a provably exact time-domain version of Kron reduction. Exemplifying empirical tests on a wye-delta network are provided to validate the analytical results.
|
|
ThCT15 Regular Session, Maya Ballroom VII |
Add to My Program |
Game-Theoretic Methods in Agent-Based Systems |
|
|
Chair: Ramazi, Pouria | Brock University |
Co-Chair: Brown, Philip N. | University of Colorado, Colorado Springs |
|
16:00-16:20, Paper ThCT15.1 | Add to My Program |
Pseudo-Gradient Dynamics with Cognitive Predictions in Noncooperative Dynamical Systems |
|
Yan, Yuyue | Tokyo Institute of Technology |
Hayakawa, Tomohisa | Tokyo Institute of Technology |
Keywords: Game theory, Agents-based systems, Stability of nonlinear systems
Abstract: A framework of pseudo-gradient-based noncooperative systems with Level-k thinking under bounded depth of reasoning is proposed. In the characterized framework, each agent believes that he/she is the most sophisticated person in the noncooperative system and is allowed to base their decisions on the predictions about the likely actions of other agents. Depending on a knowledge network of payoff functions, the agents may be able to reasoning the other agents' best-response states and use these predicted states in the pseudo-gradient dynamics. Some sufficient conditions are presented to guarantee stability of a Nash equilibrium with uncertain sensitivity parameters or uncertain payoff information exchanging network. We present a numerical example to illustrate the efficacy of our approch.
|
|
16:20-16:40, Paper ThCT15.2 | Add to My Program |
Bandit Learning with Regularized Second-Order Mirror Descent |
|
Gao, Bolin | University of Toronto |
Pavel, Lacra | University of Toronto |
Keywords: Game theory, Agents-based systems, Learning
Abstract: Many recent game-theoretic applications can benefit from relaxed assumptions on the players’ informational requirements as well as structural properties of the game. Bandit information represents one of the weakest possible environments for which convergence towards Nash equilibrium can be shown. Currently, most results on multi-agent bandit learning only consider games with strict monotonicity or strict variationally stable states. In this work, we propose a novel second-order variant of the bandit mirror descent algorithm and show that it can converge in games with mere VSS, which is a broader class of games compared to the ones studied in the existing literature. Aside from the incorporation of second-order learning, this convergence is also enabled through a Tikhonov regularization term. Furthermore, we show that our algorithm converges in representative games and the adjustment of the regularization term is consistent with our prediction.
|
|
16:40-17:00, Paper ThCT15.3 | Add to My Program |
Valid Utility Games with Information Sharing Constraints |
|
Grimsman, David | Brigham Young University |
Brown, Philip N. | University of Colorado, Colorado Springs |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Agents-based systems, Cooperative control
Abstract: The use of game theoretic methods for control in multiagent systems has been an important topic in recent research. Valid utility games in particular have been used to model real-world problems; such games have the convenient property that the value of any decision set which is a Nash equilibrium of the game is guaranteed to be within 1/2 of the value of the optimal decision set. However, an implicit assumption in this guarantee is that each agent is aware of the decisions of all other agents. In this work, we first describe how this guarantee degrades as agents are only aware of a subset of the decisions of other agents. We then show that this loss can be mitigated by restriction to a relevant subclass of games.
|
|
17:00-17:20, Paper ThCT15.4 | Add to My Program |
Evolutionary Dynamics of Mixed Rings of Coordinators and Anticoordinators |
|
Saeedi, Niloufar | Isfahan University of Technology |
Richard, Dan | University of Alberta |
Ramazi, Pouria | Brock University |
Keywords: Game theory, Agents-based systems, Network analysis and control
Abstract: Decision-making based on peers' actions often results in one of the following two groups: coordinators who tend to blend in with the population and make trending decisions, and anticoordinators who act against the majority. Only mixed networks of both coordinators and anticoordinators are capable of not reaching an equilibrium state, where every individual is satisfied with her choice. However, the conditions for non-equilibration, and more challengingly, the characterization of the non-equilibrium limit set remain concealed. We answer these problems for ring networks. We show that a mixed ring of coordinators and anticoordinators equilibrates if and only if it does not contain a particular arrangement of consecutive agents, and never equilibrates if and only if it contains a particular arrangement of consecutive agents with particular actions. As a result, a ring may admit both equilibrium and non-equilibrium limit sets. We further investigate the stability of the equilibrium states of the resulting network dynamics.
|
|
17:20-17:40, Paper ThCT15.5 | Add to My Program |
Multi-Agent Motion Planning Using Differential Games with Lexicographic Preferences |
|
Miller, Kristina | University of Illinois Urbana-Champaign |
Mitra, Sayan | University of Illinois |
Keywords: Game theory, Formal Verification/Synthesis, Agents-based systems
Abstract: Multi-player games with lexicographic cost functions can capture a variety of driving and racing scenarios and are known to have pure-strategy Nash Equilibria (NE) under certain conditions. The standard Iterated Best Response (IBR) procedure for finding such equilibria can be slow because computing the best response for each agent generally involves solving a non-convex optimization problem. In this paper, we introduce a type of game which uses a lexicographic cost function. We show that for this class of games, the best responses can be effectively computed through piece-wise linear approximations. This enables us to approximate the NE using a linearized version of IBR. We show that the gap between the linear approximations returned by our linearized IBR and the true best response drops asymptotically. We implement the algorithm and show that it can find approximate NE for a handful of agents driving in realistic scenarios in under 10 seconds.
|
|
17:40-18:00, Paper ThCT15.6 | Add to My Program |
Asymmetric Battlefield Uncertainty in General Lotto Games |
|
Paarporn, Keith | University of Colorado, Colorado Springs |
Chandan, Rahul | University of California, Santa Barbara |
Alizadeh, Mahnoosh | University of California Santa Barbara |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Agents-based systems
Abstract: How to strategically allocate resources against opponents is a central component in adversarial decision-making. Moreover, informational asymmetries often exist between competitors and can significantly impact outcomes. In this paper, we study General Lotto games under varying levels of informational asymmetry between two players. The General Lotto game is a popular model of competitive resource allocation, in which opposing players compete over a set of valuable battlefields. We consider information here as knowledge of the battlefield values, which are randomly drawn from a finite distribution. First, we consider a class of games in which one player observes all realized values, and the opponent does not observe any. We then consider scenarios where the opponent observes a subset of the realized values. In both settings, we completely characterize equilibrium payoffs and strategies. We then provide improvement factors that result from acquiring information, demonstrating that information can significantly improve one's competitive performance.
|
|
ThCT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Lyapunov Methods II |
|
|
Chair: Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
Co-Chair: Angeli, David | Imperial College |
|
16:00-16:20, Paper ThCT16.1 | Add to My Program |
Ruling Out Positive Lyapunov Exponents by Using the Jacobian's Second Additive Compound Matrix |
|
Martini, Davide | University of Florence |
Angeli, David | Imperial College |
Innocenti, Giacomo | Universita' Di Firenze |
Tesi, Alberto | Univ. Di Firenze |
Keywords: Lyapunov methods, LMIs, Stability of nonlinear systems
Abstract: Second additive compound matrices of the system's Jacobian are used to formulate sufficient conditions to rule out existence of attractors with positive Lyapunov exponents. The criteria are expressed in terms of Lyapunov dissipation inequalities or Linear Matrix Inequalities amenable to analytic verification. The results extend applicability of previous existing conditions formulated to discard periodic and almost periodic oscillations. An example of the technique to rule out chaos in certain parameters region of the Lorenz system is discussed.
|
|
16:20-16:40, Paper ThCT16.2 | Add to My Program |
Global Asymptotic Stabilization of Single-Input Bilinear Discrete Time-Invariant Complex Systems |
|
Zaitsev, Vasilii | Udmurt State University |
Zaitsev, Egor | Udmurt State University |
Keywords: Lyapunov methods, Nonlinear systems, Stability of nonlinear systems
Abstract: The problem of global asymptotic stabilization by state feedback is considered for bilinear autonomous discrete-time control systems with the single input in the complex space. For such systems, the possibility of applying the second Lyapunov method is proved, which is not valid for general nonlinear complex systems. The approach uses the Krasovsky-La Salle theorem on global asymptotic stability for discrete-time systems. Sufficient conditions for global asymptotic stabilization of a bilinear complex system by real state feedback are obtained. Finally, an example of using the obtained results is presented.
|
|
16:40-17:00, Paper ThCT16.3 | Add to My Program |
Safe Backstepping with Control Barrier Functions |
|
Taylor, Andrew | California Institute of Technology |
Molnar, Tamas G. | California Institute of Technology |
Ong, Pio | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Lyapunov methods, Nonlinear systems, Stability of nonlinear systems
Abstract: Complex control systems are often described in a layered fashion, represented as higher-order systems where the inputs appear after a chain of integrators. While Control Barrier Functions (CBFs) have proven to be powerful tools for safety-critical controller design of nonlinear systems, their application to higher-order systems adds complexity to the controller synthesis process---it necessitates dynamically extending the CBF to include higher order terms, which consequently modifies the safe set in complex ways. We propose an alternative approach for addressing safety of higher-order systems through Control Barrier Function Backstepping. Drawing inspiration from the method of Lyapunov backstepping, we provide a constructive framework for synthesizing safety-critical controllers and CBFs for higher-order systems from a top-level dynamics safety specification and controller design. Furthermore, we integrate the proposed method with Lyapunov backstepping, allowing the tasks of stability and safety to be expressed individually but achieved jointly. We demonstrate the efficacy of this approach in simulation.
|
|
17:00-17:20, Paper ThCT16.4 | Add to My Program |
Inverse Optimal Control with Discount Factor for Continuous and Discrete-Time Control-Affine Systems and Reinforcement Learning |
|
Rodrigues, Luis | Concordia University |
Keywords: Lyapunov methods, Optimal control, Learning
Abstract: This paper addresses the inverse optimal control problem of finding the state weighting function that leads to a quadratic value function when the cost on the input is fixed to be quadratic. The paper focuses on a class of infinite horizon discrete-time and continuous-time optimal control problems whose dynamics are control-affine and whose cost is quadratic in the input. The optimal control policy for this problem is the projection of minus the gradient of the value function onto the space formed by all feasible control directions. This projection points along the control direction of steepest decrease of the value function. For discrete-time systems and a quadratic value function the optimal control law can be obtained as the solution of a regularized least squares program, which corresponds to a receding horizon control with a single step ahead. For the single input case and a quadratic value function the solution for small weights in the control energy is interpreted as a control policy that at each step brings the trajectories of the system as close as possible to the origin, as measured by an appropriate norm. Conditions under which the optimal control law is linear are also stated. Additionally, the paper offers a mapping of the optimal control formulation to an equivalent reinforcement learning formulation. Examples show the application of the theoretical results.
|
|
17:20-17:40, Paper ThCT16.5 | Add to My Program |
An Analysis of the Convergence Properties of Finite-Time Homogeneous Controllers through Its Implementation in a Flexible-Joint Robot |
|
Mendoza-Avila, Jesus | INRIA Lille-Nord Europe |
Efimov, Denis | Inria |
Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
Keywords: Lyapunov methods, Stability of nonlinear systems, Variable-structure/sliding-mode control
Abstract: A study of the stability of interconnected homogeneous systems, affected by singular perturbations, is presented by means of the implementation of finite-time composite control in a single-link flexible-joint robot. Previous results suggest that the implementation of finite-time convergent controllers causes the arising of chattering. Now, for the case under study, we show that the design of a controller making the whole system homogeneous avoids the undesired chattering and recovers the ideal finite-time convergence properties. Regrettably, in most cases, information on the states of the fast dynamics is not available, then the presented strategy is not applicable at all but it is still interesting because it allows the expansion of the panorama for a better understanding of the causes of chattering, and contributes to the development of chattering reduction techniques which has attracted a lot of attention, nowadays.
|
|
17:40-18:00, Paper ThCT16.6 | Add to My Program |
Finite-Time Integral Control of Nonlinear Planar Systems Subject to Mismatched Disturbances |
|
Chen, Xiandong | Shandong University |
Qian, Chunjiang | University of Texas at San Antonio |
Zhang, Xianfu | Shandong University |
Keywords: Lyapunov methods, Uncertain systems, Nonlinear systems
Abstract: This paper investigates the finite-time bounded control of nonlinear planar systems subject to mismatched disturbances, where the perturbed uncertainties are relaxed to nonlinear functions with low-order terms. By revamping the technique of adding a power integrator and introducing new coordinates, a systematic Lyapunov method is proposed. This is achieved by three major mechanisms: (i) for the purpose of finite-time convergence, a lower-order integral dynamic is first constructed; (ii) a new structural controller is constructed to handle the mismatched disturbances; and (iii) a new structural Lyapunov function is established to provide an effective estimating tool for analyzing the finite-time boundedness of the considered systems. A simulation example is given to verify the effectiveness of the proposed controller.
|
|
ThCT17 Regular Session, Acapulco |
Add to My Program |
Quantum Information and Control II |
|
|
Chair: Pozzoli, Eugenio | CNRS, Université Bourgogne Franche-Comté |
Co-Chair: Deutschmann-Olek, Andreas | TU Wien |
|
16:00-16:20, Paper ThCT17.1 | Add to My Program |
Iterative Shaping of Optical Potentials for One-Dimensional Bose-Einstein Condensates |
|
Deutschmann-Olek, Andreas | TU Wien |
Tajik, Mohammadamin | TU Wien |
Calzavara, Martino | Forschungszentrum Jülich GmbH |
Schmiedmayer, Jörg | TU Wien |
Calarco, Tommaso | Forschungszentrum Jülich GmbH |
Kugi, Andreas | TU Wien |
Keywords: Iterative learning control, Quantum information and control, Distributed parameter systems
Abstract: The ability to manipulate clouds of ultra-cold atoms is crucial for modern experiments on quantum manybody systems and quantum thermodynamics as well as future metrological applications of Bose-Einstein condensate. While optical manipulation offers almost arbitrary flexibility, the precise control of the resulting dipole potentials and the mitigation of unwanted disturbances is quite involved and only heuristic algorithms with rather slow convergence rates are available up to now. This paper thus suggests the application of iterative learning control (ILC) methods to generate fine-tuned effective potentials in the presence of uncertainties and external disturbances. Therefore, the given problem is reformulated to obtain a one-dimensional tracking problem by using a quasicontinuous input mapping which can be treated by established ILC methods. Finally, the performance of the proposed concept is illustrated in a simulation scenario.
|
|
16:20-16:40, Paper ThCT17.2 | Add to My Program |
Two-Stage Solution of Quantum Process Tomography in the Natural Basis |
|
Xiao, Shuixin | Shanghai Jiao Tong University |
Wang, Yuanlong | Chinese Academy of Sciences |
Dong, Daoyi | University of New South Wales |
Zhang, Jun | Shanghai Jiao Tong University |
Keywords: Quantum information and control
Abstract: Quantum process tomography (QPT) is a critical task for characterizing the dynamics of quantum systems and achieving precise quantum control. In this paper, we propose an effective two-stage solution (TSS) for QPT in the natural basis. Our algorithm has O(d^6) computational complexity where d is the dimension of the quantum system. We establish an error bound O(frac{d^5}{sqrt{N}}) where N is the copy number for each output state. The numerical examples compared with a convex optimization algorithm demonstrate the effectiveness of our algorithm.
|
|
16:40-17:00, Paper ThCT17.3 | Add to My Program |
Applying Classical Control Techniques to Quantum Systems: Entanglement versus Stability Margin and Other Limitations |
|
Weidner, Carrie Ann | University of Bristol |
Shermer, Sophie | Swansea University |
Langbein, Frank C. | Cardiff University |
Jonckheere, Edmond | University of Southern California |
Keywords: Quantum information and control, Robust control, Uncertain systems
Abstract: Development of robust quantum control has been challenging and there are numerous obstacles to applying classical robust control to quantum system including bilinearity, marginal stability, state preparation errors, nonlinear figures of merit. The requirement of marginal stability, while not satisfied for closed quantum systems, can be satisfied for open quantum systems where Lindbladian behavior leads to non-unitary evolution, and allows for nonzero classical stability margins, but it remains difficult to extract physical insight when classical robust control tools are applied to these systems. We consider a straightforward example of the entanglement between two qubits dissipatively coupled to a lossy cavity and analyze it using the classical stability margin and structured perturbations. We attempt, where possible, to extract physical insight from these analyses. Our aim is to highlight where classical robust control can assist in the analysis of quantum systems and identify areas where more work needs to be done to develop specific methods for quantum robust control.
|
|
17:00-17:20, Paper ThCT17.4 | Add to My Program |
A J-Spectral Factorization Condition for the Phyiscal Realizability of a Transfer Function Matrix with Only Direct Feedthrough Quantum Noise |
|
Thien, Rebbecca Tze Yean | Australian National University |
Vuglar, Shanon Leigh | Princeton University |
Petersen, Ian R. | Australian National University |
Keywords: Algebraic/geometric methods, Quantum information and control
Abstract: This paper gives a J-spectral factorization condition for the implementation of a strictly proper transfer function matrix as a physically realizable quantum system using only direct feedthrough quantum noise. A necessary frequency response condition is also presented. Examples are included to illustrate the main results.
|
|
17:20-17:40, Paper ThCT17.5 | Add to My Program |
Information Transfer in Spintronics Networks under Worst Case Uncertain Parameter Errors |
|
Rompokos, Athanasios | University of Southern California |
Langbein, Frank C. | Cardiff University |
Jonckheere, Edmond | University of Southern California |
Keywords: Quantum information and control, Optimization, Robust control
Abstract: A novel quantum landscape optimization with respect to bias field control inputs is developed with the goal of achieving optimal transfer fidelity subject to robustness against bias field, spin couplings and other uncertainties. This objective is achieved by minimization of a convex combination of fidelity error and worst-case perturbation of fidelity error under directional perturbation of uncertain parameters. The novelty is that the end-point perturbations of the parameters are points of a random uniform sampling of the sphere centered at the nominal values of the parameters. This reveals that the previously developed perfect state transfer with zero sensitivity solution keeps high fidelity and robustness under large rather than differential perturbations.
|
|
17:40-18:00, Paper ThCT17.6 | Add to My Program |
A Quantum Optimal Control Problem with State Constrained Preserving Coherence |
|
Binandeh Dehaghani, Nahid | University of Porto |
Lobo Pereira, Fernando | Porto University |
Aguiar, A. Pedro | Faculty of Engineering, University of Porto |
Keywords: Quantum information and control, Optimal control, Optimization algorithms
Abstract: In this work, we address the problem of maximizing fidelity in a quantum state transformation process controlled in such a way as to keep decoherence within given bounds. We consider a three-level Lambda-type atom subjected to Markovian decoherence characterized by non-unital decoherence channels. We introduce fidelity as the performance index for the quantum state transformation process since the goal is to maximize the similarity of the final state density operator with the one of the desired target state. We formulate the quantum optimal control problem with state constraints where the latter reflect the fact that the decoherence level remains within a pre-defined bound. Optimality conditions of Pontryagin's Maximum Principle in Gamkrelidze's form are given. These provide a complete set of relations enabling the computation of the optimal control strategy. This is a novel approach in the context of quantum systems.
|
| |