| |
Last updated on November 17, 2022. This conference program is tentative and subject to change
Technical Program for Wednesday December 7, 2022
|
WePL Plenary Session, Tulum Ballroom A-H |
Add to My Program |
How Do We Learn to Use Learning in Manufacturing Systems |
|
|
Chair: Serrani, Andrea | The Ohio State University |
Co-Chair: Valcher, Maria Elena | Universita' Di Padova |
|
08:30-09:30, Paper WePL.1 | Add to My Program |
How Do We Learn to Use Learning in Manufacturing Systems |
|
Barton, Kira | University of Michigan, Ann Arbor |
Keywords: Manufacturing systems and automation, Learning
Abstract: Manufacturing has undergone significant changes over the past five-ten years thanks to technological advancements that have been leveraged to meet a diverse set of customer requirements driven by global and societal needs. Conventional manufacturing control strategies were typically designed for robustness and speed within a controlled and well-regulated environment. However, recent demands for customization and agility coupled with big data investments have provided an opportunity for more learning-based methods to be introduced. Data driven strategies have long provided a means of harnessing information to enhance the performance of these complex systems. This talk is motivated by real-world interest from industry in understanding how to combine data-based learning and experiential knowledge to make intelligent decisions that can save time, money, and resources. In this talk, we examine which aspects of manufacturing processes lend themselves to learning strategies and which bring additional challenges. We also explore cases in which learning has been applied in different ways to additive manufacturing processes in order to improve process knowledge and performance. Opportunities for additional integration of learning into the manufacturing domain will be discussed and open research questions for control-theoretic advancements will be highlighted.
|
|
WeAT01 Regular Session, Tulum Ballroom A |
Add to My Program |
Hybrid Systems I |
|
|
Chair: Sanfelice, Ricardo G. | University of California at Santa Cruz |
Co-Chair: Jain, Neera | Purdue University |
|
10:00-10:20, Paper WeAT01.1 | Add to My Program |
A Mixed Integer Approach to Solve Hybrid Model Predictive Control Problems |
|
Nodozi, Iman | University of California, Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Hybrid systems, Optimal control
Abstract: This paper presents an algorithm for solving the optimization problem associated with hybrid model predictive control for a class of discretized hybrid control systems. The proposed approach consists of reformulating the optimal control problem as a mixed integer quadratic problem (MIQP), which can be solved using well-established algorithms in the literature. Specifically, the given discretized hybrid control system is transformed into a mixed logical dynamical (MLD) system that, for the class of discretized hybrid control systems considered, gives rise to an MIQP. The MLD model is obtained through an intermediate step that transforms the discretized hybrid control system into a discrete-time control system.
|
|
10:20-10:40, Paper WeAT01.2 | Add to My Program |
Robust Successor and Precursor Sets of Hybrid Systems Using Hybrid Zonotopes |
|
Siefert, Jacob | Pennsylvania State University |
Bird, Trevor J. | Purdue University |
Koeln, Justin | University of Texas at Dallas |
Jain, Neera | Purdue University |
Pangborn, Herschel | Pennsylvania State University |
Keywords: Hybrid systems, Predictive control for linear systems
Abstract: This paper presents identities for calculating robust successor and precursor sets of discrete-time linear hybrid systems. The proposed technique relies on generating a set containing all possible state transitions of a dynamic system over a region of interest, named the state-update set. Forward and backward reachability can then be performed using only projection, intersection, and Minkowski difference set operations with the state-update set. It is shown how state-update sets may be defined as hybrid zonotopes for mixed logical dynamical systems and linear systems in closed loop with model predictive control
|
|
10:40-11:00, Paper WeAT01.3 | Add to My Program |
A Negative Imaginary Approach to Hybrid Integrator-Gain System Control |
|
Shi, Kanghong | Australian National University |
Nikooienejad, Nastaran | University of Texas at Dallas |
Petersen, Ian R. | Australian National University |
Moheimani, S.O. Reza | University of Texas at Dallas |
Keywords: Hybrid systems, Nonlinear systems, Robust control
Abstract: In this paper, we show that a hybrid integrator-gain system (HIGS) is a nonlinear negative imaginary (NNI) system. We prove that the positive feedback interconnection of a linear negative imaginary (NI) system and a HIGS is asymptotically stable. We apply the HIGS to a MEMS nanopositioner, as an example of a linear NI system, in a single-input single-output framework. We analyze the stability and the performance of the closed-loop interconnection in both time and frequency domains through simulations and demonstrate the applicability of HIGS as an NNI controller to a linear NI system.
|
|
11:00-11:20, Paper WeAT01.4 | Add to My Program |
Observer Design Based on Observability Decomposition for Hybrid Systems with Linear Maps and Known Jump Times |
|
Tran, Gia Quoc Bao | Mines Paris, Université PSL |
Bernard, Pauline | MINES ParisTech, Université PSL |
Di Meglio, Florent | MINES ParisTech |
Marconi, Lorenzo | Univ. Di Bologna |
Keywords: Hybrid systems, Observers for Linear systems, Lyapunov methods
Abstract: We propose an observer design method for hybrid systems with linear maps and known jump times based on decomposing the state into parts exhibiting different kinds of observability properties. Using a series of transformations depending on the time elapsed since the previous jump, the state may be decomposed into up to three parts, where the first one is instantaneously observable during flows from the flow output, the second one detectable at jumps from the jump output thanks to the combination of flows and jumps, and the remaining part naturally detectable at jumps still thanks to this combination of flows and jumps but implicitly from the flow output. An observer is then designed to estimate each part, relying on a flow-based Kalman-like observer exploiting the flow output for the first part, a jump-based observer exploiting the jump output for the second, and a jump-based observer exploiting a fictitious output for the third. Global exponential stability of the estimation error is proven using Lyapunov analysis.
|
|
11:20-11:40, Paper WeAT01.5 | Add to My Program |
On Infinitesimal Contraction Analysis for Hybrid Systems |
|
Burden, Samuel A. | University of Washington |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Hybrid systems, Stability of hybrid systems
Abstract: Infinitesimal contraction analysis, wherein global convergence results are obtained from properties of local dynamics, is a powerful analysis tool. In this paper, we generalize infinitesimal contraction analysis to hybrid systems in which state-dependent guards trigger transitions defined by reset maps between modes that may have different norms and need not be of the same dimension. In contrast to existing literature, we do not restrict mode sequence or dwell time. We work in settings where the hybrid system flow is differentiable almost everywhere and its derivative is the solution to a jump-linear-time-varying differential equation whose jumps are defined by a saltation matrix determined from the guard, reset map, and vector field. Our main result shows that if the vector field is infinitesimally contracting, and if the saltation matrix is non-expansive, then the intrinsic distance between any two trajectories decreases exponentially in time. When bounds on dwell time are available, our approach yields a bound on the intrinsic distance between trajectories regardless of whether the dynamics are expansive or contractive. We illustrate our results using wo examples: a constrained mechanical system and an electrical circuit with an ideal diode.
|
|
11:40-12:00, Paper WeAT01.6 | Add to My Program |
A Generalization of Synergistic Hybrid Feedback Control with Application to Maneuvering Control of Ships |
|
Schmidt-Didlaukies, Henrik M. | Norwegian University of Science and Technology |
Basso, Erlend Andreas | Norwegian University of Science and Technology |
Sorensen, Asgeir Johan | Norwegian Univ of Sci and Technology |
Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Keywords: Hybrid systems, Maritime control, Stability of hybrid systems
Abstract: This paper generalizes results on synergistic hybrid feedback control. Specifically, we propose a generalized definition of synergistic Lyapunov functions and feedbacks which allows the logic variable in traditional synergistic control, denoted the synergy variable, to be vector-valued and change during flows. Moreover, we introduce synergy gaps relative to components of product sets, which enables us to define jump conditions in the form of synergy gaps for different components of the synergy variable. In particular, this enables us to formulate existing hybrid output feedback control schemes within the synergistic control framework. Furthermore, we show that our generalized definition is amenable to backstepping. Finally, we give an example of how traditional synergistic control can be combined with ship maneuvering control with discrete path dynamics.
|
|
WeAT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Adaptive Control IV |
|
|
Chair: Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Co-Chair: Lamperski, Andrew | University of Minnesota |
|
10:00-10:20, Paper WeAT02.1 | Add to My Program |
Practically Safe Extremum Seeking |
|
Williams, Alan | UCSD |
Krstic, Miroslav | University of California, San Diego |
Scheinker, Alexander | Los Alamos National Lab |
Keywords: Adaptive control, Optimization algorithms, Constrained control
Abstract: We introduce an algorithm that we call Practically Safe Extremum Seeking (PSfES), which seeks to minimize an unknown objective function while avoiding unsafe regions of state space, except for a possible minor violation of the safety boundaries, which can be arbitrarily reduced using the algorithm's design parameters, such as the perturbation frequency and amplitude. We allow the metric of safety---a barrier function---to be functionally unknown. Only the value of the barrier function is assumed measured. We introduce dynamic filters which emulate, in an average and singularly perturbed sense, the feedback law of a standard quadratic programming (QP) and control barrier function (CBF) based safety filter, acting on a nominal extremum seeking (ES) controller. These filters have the effect of enabling the use of the averaging and singular perturbation theorems, which enable us to guarantee convergence (practical) to near a point in the safe set and safety (practical) during the transient, both in a local sense. We present a design for multiple dimensions but provide an analysis in one dimension. Finally, the behavior of our controller is demonstrated in simulation for two examples.
|
|
10:20-10:40, Paper WeAT02.2 | Add to My Program |
Data-Driven Adaptive Model Predictive Control for Wind Farms: A Koopman-Based Online Learning Approach |
|
Dittmer, Antje | German Aerospace Center |
Sharan, Bindu | Hamburg University of Technology |
Werner, Herbert | Hamburg University of Technology |
Keywords: Adaptive control, Nonlinear systems identification, Power systems
Abstract: A novel adaptive Koopman based model predictive control (MPC) algorithm for wind farm control is presented. Using the data-driven Koopman approach the highly non-linear wake effects governing wind farm dynamics can be efficiently modelled. An update rule is presented to enable online learning only when new information is available. Moreover, to provide sufficient excitation of all relevant model frequencies in closed loop, a small test signal is superimposed on the control input while the Koopman model is updated. Simulation studies in the WFSim environment illustrate excellent accuracy for wind speed estimation. In closed-loop, the adaptive online-update strategy tracks reference farm yield well, considerably outperforming recently presented non-adaptive schemes.
|
|
10:40-11:00, Paper WeAT02.3 | Add to My Program |
Optimal Adaptive Output Regulation of Uncertain Nonlinear Discrete-Time Systems Using Lifelong Concurrent Learning |
|
Moghadam, Rohollah | California State University, Sacramento |
Farzanegan, Behzad | Missouri University S&T |
Jagannathan, Sarangapani | Missouri Univ of Science & Tech |
Natarajan, Pappa | MIT Campus, Anna University |
Keywords: Adaptive control, Optimal control
Abstract: This paper addresses neural network (NN) based optimal adaptive regulation of uncertain nonlinear discrete-time systems in affine form using output feedback via lifelong concurrent learning. First, an adaptive NN observer is introduced to estimate both the state vector and control coefficient matrix, and its NN weights are adjusted using both output error and concurrent learning term to relax the persistency excitation (PE) condition. Next, by utilizing an actor-critic framework for estimating the value functional and control policy, the critic network weights are tuned via both temporal different error and concurrent learning schemes through a replay buffer. The actor NN weights are tuned using control policy errors. To attain lifelong learning for performing effectively during multiple tasks, an elastic weight consolidation term is added to the critic NN weight tuning law. The state estimation, regulation, and the weight estimation errors of the observer, actor and critic NNs are demonstrated to be bounded when performing tasks by using Lyapunov analysis. Simulation results are carried out to verify the effectiveness of the proposed approach on a Vander Pol Oscillator. Finally, extension to optimal tracking is given briefly.
|
|
11:00-11:20, Paper WeAT02.4 | Add to My Program |
Global Asymptotic Position and Heading Tracking for Multirotors Using Tuning Function-Based Adaptive Hybrid Feedback |
|
Basso, Erlend Andreas | Norwegian University of Science and Technology |
Schmidt-Didlaukies, Henrik M. | Norwegian University of Science and Technology |
Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Keywords: Flight control, Adaptive control, Control applications
Abstract: This letter considers the problem of global asymptotic position and heading tracking for multirotors. We propose a hybrid adaptive feedback control law that globally asymptotically tracks a position and heading reference in the presence of unknown constant disturbances in both the translational and rotational dynamics. By employing a tuning function-based backstepping approach, the number of parameter estimates are minimized. Moreover, we propose a novel control law for the translational subsystem, which leads to a simpler virtual control law when backstepping. Global asymptotic heading tracking is achieved through a novel construction of the desired rotation matrix. The theory is verified through experiments on a quadrotor.
|
|
11:20-11:40, Paper WeAT02.5 | Add to My Program |
Decentralized Adaptive Stabilization of Infinite Networks of Switched Nonlinear Systems with Unknown Control Directions |
|
Pavlichkov, Svyatoslav | University of Groningen |
Bajcinca, Naim | University of Kaiserslautern |
Keywords: Lyapunov methods, Adaptive control, Large-scale systems
Abstract: We solve the problem of decentralized adaptive stabilization for infinite networks of hierarchically intercon- nected switched systems in strict-feedback form with unknown switching signals and with unknown control directions. We assume that each node is interconnected with only finite set of its neighbors and the number of neighbors of each node has a uniform upper boundary. The trajectories of the closed-loop system with our switching-independent adaptive feedback law converge to the origin with respect to the l∞-norm, and this property holds uniformly with respect to the unknown switching signals.
|
|
11:40-12:00, Paper WeAT02.6 | Add to My Program |
Sufficient Conditions for Persistency of Excitation with Step and ReLU Activation Functions |
|
Lekang, Tyler | University of Minnesota, Twin Cities |
Lamperski, Andrew | University of Minnesota |
Keywords: Machine learning, Adaptive control, Identification
Abstract: This paper defines geometric criteria which are then used to establish sufficient conditions for persistency of excitation with vector functions constructed from single hidden-layer neural networks with step or ReLU activation functions. We show that these conditions hold when employing reference system tracking, as is commonly done in adaptive control. We demonstrate the results numerically on a system with linearly parameterized activations of this type and show that the parameter estimates converge to the true values with the sufficient conditions met.
|
|
WeAT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Robotics IV |
|
|
Chair: Ryll, Markus | Technical University Munich |
Co-Chair: Freitas, Gustavo | Universidade Federal De Minas Gerais |
|
10:00-10:20, Paper WeAT03.1 | Add to My Program |
Vector Field for Curve Traversal with Obstacle Avoidance |
|
Dias Nunes, Arthur Henrique | Federal University of Minas Gerais |
Rezende, Adriano | UFMG |
Pereira da Cruz Júnior, Gilmar | Universidade Federal De Minas Gerais |
Freitas, Gustavo | Universidade Federal De Minas Gerais |
Mariano, Vinicius | UFMG |
Pimenta, Luciano C. A. | Universidade Federal De Minas Gerais |
Keywords: Robotics, Autonomous robots, Autonomous vehicles
Abstract: In this work, we extend a recently proposed methodology to construct artificial vector fields for robot navigation in n-dimensional spaces to track and circulate time-varying curves. Now, we incorporate the ability to deviate from obstacles that might be static or dynamic by constructing a collision-free vector field. It can be considered a reactive approach in which the obstacles can be locally sensed and then circumnavigated at a fixed distance. The novel vector field that allows the circumnavigation of obstacles and the traversal of a target curve is built upon the consideration of the closest points on the obstacles, which can assume a generic shape, and the closest point on the curve. In order to validate our method, we present ROS-based computational simulations and real world experiments with a wheeled robot and an UAV quadcopter.
|
|
10:20-10:40, Paper WeAT03.2 | Add to My Program |
Fast Replanning of a Lower-Limb Exoskeleton Trajectories for Rehabilitation |
|
Brunet, Maxime | Wandercraft |
Pétriaux, Marine | Wandercraft |
Di Meglio, Florent | MINES ParisTech |
Petit, Nicolas | MINES Paris, PSL University |
Keywords: Robotics, Healthcare and medical systems, Optimal control
Abstract: The paper addresses the rehabilitation of disabled patients using a lower-limb fully-actuated exoskeleton. We propose a numerical method to replan the current step without jeopardizing stability. Stability is evaluated in the light of a simple linear time-invariant surrogate model. The method's core is the analysis of an input-constrained optimal control problem with state specified at an unspecified terminal time. A detailed study of the extremals given by Pontryagin Maximum Principle is sufficient to characterize its feasibility. This allows a fast replanning strategy. The efficiency of the numerical algorithm (resolution time below 1,ms) yields responsiveness to the patient's request. Realistic simulations on a full-body model of the patient-exoskeleton system stress that cases of practical interest for physiotherapists are well-addressed.
|
|
10:40-11:00, Paper WeAT03.3 | Add to My Program |
Spatial Motion Planning with Pythagorean Hodograph Curves |
|
Arrizabalaga, Jon | Technical University of Munich (TUM) |
Ryll, Markus | Technical University Munich |
Keywords: Robotics, Predictive control for nonlinear systems
Abstract: This paper presents a two-stage prediction-based control scheme for embedding the environment's geometric properties into a collision-free Pythagorean Hodograph spline, and subsequently finding the optimal path within the parameterized free space. The ingredients of this approach are twofold: First, we present a novel spatial path parameterization applicable to any arbitrary curve without prior assumptions in its adapted frame. Second, we identify the appropriateness of Pythagorean Hodograph curves for a compact and continuous definition of the path-parametric functions required by the presented spatial model. This dual-stage formulation results in a motion planning approach, where the geometric properties of the environment arise as states of the prediction model. Thus, the presented method is attractive for motion planning in dense environments. The efficacy of the approach is evaluated according to an illustrative example.
|
|
11:00-11:20, Paper WeAT03.4 | Add to My Program |
Task-Invariant Centroidal Momentum Shaping for Lower-Limb Exoskeletons |
|
Yu, Miao | Clemson University |
Lv, Ge | Clemson University |
Keywords: Robotics, Nonlinear systems, Biomedical
Abstract: Task-invariant approaches are desirable in exoskeleton control design as they have the potential of providing consistent assistance across locomotor tasks. Different from traditional trajectory-tracking approaches that are specific to tasks and users, task-invariant control approaches do not replicate normative joint kinematics, which could eliminate the need for task detection and allow more flexibility for human users. In this paper, we propose a task-invariant control paradigm for lower-limb exoskeletons to alter the human user’s centroidal momentum, i.e., a sum of projected limb momenta onto the human’s center of mass. We design a virtual reference model based on human user’s self-selected gaits to provide a reference centroidal momentum for the exoskeleton to track and make it adaptable to changes in gait patterns. Mathematically, the proposed approach reduces the control design problem into a lower-dimensional space. With the number of actuators being greater than the dimension of the centroidal momentum vector, we can guarantee the existence of a centroidal momentum shaping law for underactuated systems through optimization. Simulation results on a human-like biped show that the proposed shaping strategy can produce beneficial results on assisting human locomotion, such as metabolic cost reduction.
|
|
11:20-11:40, Paper WeAT03.5 | Add to My Program |
Multi-Robot-Assisted Human Crowd Evacuation Using Navigation Velocity Fields |
|
Zheng, Tongjia | University of Notre Dame |
Yuan, Zhenyuan | Pennsylvania State University |
Nayyar, Mollik | The Pennsylvania State University |
Wagner, Alan | The Pennsylvania State University |
Zhu, Minghui | Pennsylvania State University |
Lin, Hai | University of Notre Dame |
Keywords: Robotics, Large-scale systems, Distributed parameter systems
Abstract: This work studies a robot-assisted crowd evacuation problem where we control a small group of robots to guide a large human crowd to safe locations. The challenge lies in how to model human-robot interactions and design robot controls to indirectly control a human population that significantly outnumbers the robots. To address the challenge, we treat the crowd as a continuum and formulate the evacuation objective as driving the crowd density to target locations. We propose a novel mean-field model which consists of a family of microscopic equations that explicitly model how human motions are locally guided by the robots and an associated macroscopic equation that describes how the crowd density is controlled by the navigation velocity fields generated by all robots. Then, we design density feedback controllers for the robots to dynamically adjust their states such that the generated navigation velocity fields drive the crowd density to a target density. Stability guarantees of the proposed controllers are proven. Agent-based simulations are included to evaluate the proposed evacuation algorithms.
|
|
11:40-12:00, Paper WeAT03.6 | Add to My Program |
Distributed Quadratic Programming-Based Nonlinear Controllers for Periodic Gaits on Legged Robots |
|
Kamidi, Vinay R. | Virginia Tech |
Kim, Jeeseop | Virginia Tech |
Fawcett, Randall | Virginia Tech |
Ames, Aaron D. | California Institute of Technology |
Akbari Hamed, Kaveh | Virginia Tech |
Keywords: Robotics, Distributed control, Stability of nonlinear systems
Abstract: Quadratic programming (QP)-based nonlinear controllers have gained increasing popularity in the legged locomotion community. This paper presents a formal foundation to systematically decompose QP-based centralized nonlinear controllers into a network of lower-dimensional local QPs, with application to legged locomotion. The proposed approach formulates a feedback structure between the local QPs and assumes a one-step communication delay protocol. The properties of local QPs are analyzed, wherein it is established that their steady-state solutions on periodic orbits (representing gaits) coincide with that of the centralized QP. The asymptotic convergence of local QPs' solutions to the steady-state solution is studied via Floquet theory. The effectiveness of the analytical results is evaluated through rigorous numerical simulations and various experiments on a quadrupedal robot, with the result being robust locomotion on different terrains and in the presence of external disturbances. The paper shows that the proposed distributed QPs have considerably less computation time and reduced noise propagation sensitivity than the centralized QP.
|
|
WeAT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Neural Networks I |
|
|
Chair: Zhang, Fumin | Georgia Institute of Technology |
Co-Chair: Meyer, Pierre-Jean | Univ Gustave Eiffel |
|
10:00-10:20, Paper WeAT04.1 | Add to My Program |
Comparative Analysis of Interval Reachability for Robust Implicit and Feedforward Neural Networks |
|
Davydov, Alexander | University of California, Santa Barbara |
Jafarpour, Saber | Georgia Institute of Technology |
Abate, Matthew | Georgia Institute of Technology |
Bullo, Francesco | Univ of California at Santa Barbara |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Neural networks, Machine learning, Large-scale systems
Abstract: Implicit neural networks (INNs) are a class of learning models that use implicit algebraic equations as layers and have been shown to exhibit several notable benefits over traditional feedforward neural networks (FFNNs). In this paper, we use interval reachability analysis to study robustness of INNs and compare them with FFNNs. We first introduce the notion of tight inclusion function and use it to provide the tightest rectangular over-approximation of the neural network's input-output map. We also show that tight inclusion functions lead to sharper robustness guarantees than the well-studied robustness measures of Lipschitz constants. Like exact Lipschitz constants, tight inclusions functions are computationally challenging to obtain, and thus we develop a framework based upon mixed monotonicity and contraction theory to estimate the tight inclusion functions for INNs. We show that our approach performs at least as well as, and generally better than, state-of-the-art interval-bound propagation methods for INNs. Finally, we design a novel optimization problem for training robust INNs and we provide empirical evidence that suitably-trained INNs can be more robust than comparably-trained FFNNs.
|
|
10:20-10:40, Paper WeAT04.2 | Add to My Program |
Neural ODE Control for Trajectory Approximation of Continuity Equation |
|
Elamvazhuthi, Karthik | University of California, Riverside |
Gharesifard, Bahman | University of California, Los Angeles |
Bertozzi, Andrea L. | University of California Los Angeles |
Osher, Stanley | University of California, Los Angeles |
Keywords: Machine learning, Neural networks, Distributed parameter systems
Abstract: We consider the controllability problem for the continuity equation, corresponding to neural ordinary differential equations (ODEs), which describes how a probability measure is pushedforward by the flow. We show that the controlled continuity equation has very strong controllability properties. Particularly, a given solution of the continuity equation corresponding to a bounded Lipschitz vector field defines a trajectory on the set of probability measures. For this trajectory, we show that there exist piecewise constant training weights for a neural ODE such that the solution of the continuity equation corresponding to the neural ODE is arbitrarily close to it. As a corollary to this result, we establish that the continuity equation of the neural ODE is approximately controllable on the set of compactly supported probability measures that are absolutely continuous with respect to the Lebesgue measure.
|
|
10:40-11:00, Paper WeAT04.3 | Add to My Program |
Reachability Analysis of Neural Networks Using Mixed Monotonicity |
|
Meyer, Pierre-Jean | Univ Gustave Eiffel |
Keywords: Neural networks, Uncertain systems
Abstract: This paper presents a new reachability analysis approach to compute interval over-approximations of the output set of feedforward neural networks with input uncertainty. We adapt to neural networks an existing mixed-monotonicity method for the reachability analysis of dynamical systems and apply it to each partial network within the main network. This ensures that the intersection of the obtained results is the tightest interval over-approximation of the output of each layer that can be obtained using mixed-monotonicity on any partial network decomposition. Unlike other tools in the literature focusing on small classes of piecewise-affine or monotone activation functions, the main strength of our approach is its generality: it can handle neural networks with any Lipschitz-continuous activation function. In addition, the simplicity of our framework allows users to very easily add unimplemented activation functions, by simply providing the function, its derivative and the global argmin and argmax of the derivative. Our algorithm is compared to five other interval-based tools (Interval Bound Propagation, ReluVal, Neurify, VeriNet, CROWN) on both existing benchmarks and two sets of small and large randomly generated networks for four activation functions (ReLU, TanH, ELU, SiLU).
|
|
11:00-11:20, Paper WeAT04.4 | Add to My Program |
Lyapunov-Net: A Deep Neural Network Architecture for Lyapunov Function Approximation |
|
Gaby, Nathan | Georgia State University |
Zhang, Fumin | Georgia Institute of Technology |
Ye, Xiaojing | Georgia State University |
Keywords: Machine learning, Lyapunov methods, Stability of nonlinear systems
Abstract: We develop a versatile deep neural network architecture, called Lyapunov-Net, to approximate Lyapunov functions of dynamical systems in high dimensions. Lyapunov-Net guarantees positive definiteness, and thus can be easily trained to satisfy the negative orbital derivative condition, which only renders a single term in the empirical risk function in practice. This significantly simplifies parameter tuning and results in greatly improved convergence during network training and approximation quality. We also provide comprehensive theoretical justifications on the approximation accuracy and certification guarantees of Lyapunov-Nets. We demonstrate the efficiency of the proposed method on nonlinear dynamical systems in high dimensional state spaces, and show that the proposed approach significantly outperforms the state-of-the-art methods.
|
|
11:20-11:40, Paper WeAT04.5 | Add to My Program |
Neural Lyapunov Differentiable Predictive Control |
|
Mukherjee, Sayak | Pacific Northwest National Laboratory |
Drgona, Jan | Pacific Northwest National Laboratory |
Tuor, Aaron | Pacific Northwest National Laboratory |
Halappanavar, Mahantesh | Pacific Northwest National Laboratory |
Vrabie, Draguna | Pacific Northwest National Laboratory |
Keywords: Learning, Predictive control for nonlinear systems, Lyapunov methods
Abstract: We present a learning-based predictive control methodology using the differentiable programming framework with probabilistic Lyapunov-based stability guarantees. The neural Lyapunov differentiable predictive control (NLDPC) learns the policy by constructing a computational graph encompassing the system dynamics, state and input constraints, and the necessary Lyapunov certification constraints, and thereafter using the automatic differentiation to update the neural policy parameters. In conjunction, our approach jointly learns a Lyapunov function that certifies the regions of state-space with stable dynamics. We also provide a sampling-based statistical guarantee for the training of NLDPC from the distribution of initial conditions. Our offline training approach provides a computationally efficient and scalable alternative to classical explicit model predictive control solutions. We substantiate the advantages of the proposed approach with simulations to stabilize the double integrator model and on an example of controlling an aircraft model.
|
|
11:40-12:00, Paper WeAT04.6 | Add to My Program |
Neural Network-Based KKL Observer for Nonlinear Discrete-Time Systems |
|
Peralez, Johan | LAGEP |
Nadri, Madiha | Universite Claude Bernard Lyon 1 |
Astolfi, Daniele | Cnrs - Lagepp |
Keywords: Learning, Observers for nonlinear systems, Numerical algorithms
Abstract: For non-autonomous multivariable discrete-time nonlinear systems, we address the state estimation problem using a Kazantzis-Kravaris-Luenberger (KKL) observer. We aim to build a mapping that transforms a nonlinear dynamics into a stable linear system modulo an output injection and to design an asymptotic observer. However, this mapping is difficult to compute and its numerical approximation may be badly conditioned during the transient phase. We propose an algorithm based on ensemble learning techniques to improve the numerical approximation of the mapping and its extension in the transient phase. This ensures a good asymptotic convergence of the observer and avoids peaking phenomena. The algorithm demonstrates good performance in high-dimensional and multi-input-multi-output examples.
|
|
WeAT05 Invited Session, Tulum Ballroom E |
Add to My Program |
Learning-Based Control I: Data-Driven Predictive Control |
|
|
Chair: Garatti, Simone | Politecnico Di Milano |
Co-Chair: Solowjow, Friedrich | RWTH Aachen University |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
10:00-10:20, Paper WeAT05.1 | Add to My Program |
Data-Enabled Predictive Control with Instrumental Variables: The Direct Equivalence with Subspace Predictive Control (I) |
|
van Wingerden, Jan-Willem | Delft University of Technology |
Mulders, Sebastiaan Paul | Delft University of Technology |
Dinkla, Rogier | Delft University of Technology |
Oomen, Tom | Eindhoven University of Technology |
Verhaegen, Michel | Delft University of Technology |
Keywords: Subspace methods, Predictive control for linear systems, Identification for control
Abstract: Direct data-driven control has attracted substantial interest since it enables optimization-based control without the need for a parametric model. This paper presents a new Instrumental Variable (IV) approach to Data-enabled Predictive Control (DeePC) that results in favorable noise mitigation properties, and demonstrates the direct equivalence between DeePC and Subspace Predictive Control (SPC). The methodology relies on the derivation of the characteristic equation in DeePC along the lines of subspace identification algorithms. A particular choice of IVs is presented that is uncorrelated with future noise, but at the same time highly correlated with the data matrix. A simulation study demonstrates the improved performance of the proposed algorithm in the presence of process and measurement noise.
|
|
10:20-10:40, Paper WeAT05.2 | Add to My Program |
Data-Driven Multiple Shooting for Stochastic Optimal Control |
|
Ou, Ruchuan | Technische Universität Dortmund |
Pan, Guanru | TU Dortmund |
Faulwasser, Timm | TU Dortmund University |
Keywords: Predictive control for linear systems, Stochastic optimal control, Optimal control
Abstract: The implementation of data-driven predictive control schemes based on Willems' fundamental lemma often relies on a single-shooting approach, i.e., it uses one large Hankel matrix to cover the entire optimization horizon. However, the numerical solution is fostered by the use of multiple segmented horizons which require less data in smaller Hankel matrices. This paper extends the segmentation idea towards multiple shooting for data-driven optimal control of stochastic LTI systems. Using a stochastic variant of the fundamental lemma and polynomial chaos expansions, we propose a multiple-shooting formulation which combines trajectory segmentation and moment matching. We show that, for LTI systems subject to Gaussian noise of finite variance, our formulation is without loss of optimality while it allows for a significant reduction of the problem dimension in Gaussian and non-Gaussian settings. We draw upon a numerical example to compare the proposed framework to the usual single-shooting approach.
|
|
10:40-11:00, Paper WeAT05.3 | Add to My Program |
An Offset-Free Nonlinear MPC Scheme for Systems Learned by Neural NARX Models (I) |
|
Bonassi, Fabio | Politecnico Di Milano |
Xie, Jing | Politecnico Di Milano |
Farina, Marcello | Politecnico Di Milano |
Scattolini, Riccardo | Politecnico Di Milano |
Keywords: Predictive control for nonlinear systems, Neural networks, Output regulation
Abstract: This paper deals with the design of nonlinear MPC controllers that provide offset-free setpoint tracking for models described by Neural Nonlinear AutoRegressive eXogenous (NNARX) networks. The NNARX model is identified from input-output data collected from the plant, and can be given a state-space representation with known measurable states made by past input and output variables, so that a state observer is not required. In the training phase, the Incremental Input-to-State Stability (δISS) property can be forced when consistent with the behavior of the plant. The δISS property is then leveraged to augment the model with an explicit integral action on the output tracking error, which allows to achieve offset-free tracking capabilities to the designed control scheme. The proposed control architecture is numerically tested on a water heating system and the achieved results are compared to those scored by another popular offset-free MPC method, showing that the proposed scheme attains remarkable performances even in presence of disturbances acting on the plant.
|
|
11:00-11:20, Paper WeAT05.4 | Add to My Program |
Data-Driven Nonlinear Model Reduction Using Koopman Theory: Integrated Control Form and NMPC Case Study |
|
Schulze, Jan Christoph | RWTH Aachen University |
Mitsos, Alexander | RWTH Aachen University |
Keywords: Machine learning, Predictive control for nonlinear systems, Chemical process control
Abstract: We use Koopman theory for data-driven model reduction of nonlinear dynamical systems with controls. We propose generic model structures combining delay-coordinate encoding of measurements and full-state decoding to integrate reduced Koopman modeling and state estimation. We present a deep-learning approach to train the proposed models. A case study demonstrates that our approach provides accurate control models and enables real-time capable nonlinear model predictive control of a high-purity cryogenic distillation column.
|
|
11:20-11:40, Paper WeAT05.5 | Add to My Program |
Compression at the Service of Learning: A Case Study for the Guaranteed Error Machine (I) |
|
Garatti, Simone | Politecnico Di Milano |
Campi, Marco | University of Brescia |
Keywords: Pattern recognition and classification, Statistical learning, Machine learning
Abstract: The scenario approach is a technique for data-driven decision making that has found application in a variety of fields including systems and control design. Although initially conceived in the context of worst-case optimization, the scenario approach has progressively evolved into a general methodology that allows one to keep control on the risk of solutions designed from data according to complex decision processes. In a recent contribution, the theory of compression schemes (a paradigm that plays a fundamental role in statistical learning theory) has been deeply revisited in the wake of the scenario approach, which has led to unprecedentedly sharp generalization and risk quantification results. In this paper, we build on these achievements to gain insight on a classification paradigm called Guaranteed Error Machine (GEM). First, by leveraging the theory of reproducing kernels Hilbert spaces, we introduce a new, more flexible, GEM algorithm, which allows for complex classification geometries. The proposed scheme is then shown to fit into the new compression theory, from which new sharp results for the probability of GEM misclassification are derived in a distribution-free context.
|
|
11:40-12:00, Paper WeAT05.6 | Add to My Program |
Learning Functions and Uncertainty Sets Using Geometrically Constrained Kernel Regression (I) |
|
Fiedler, Christian | RWTH Aachen University |
Scherer, Carsten W. | University of Stuttgart |
Trimpe, Sebastian | RWTH Aachen University |
Keywords: Machine learning, Uncertain systems, Robust control
Abstract: Learning-based control offers the potential to tackle challenging control problems and hence receives a lot of attention, both from the control and machine learning communities. To provide rigorous control-theoretic guarantees like stability or other safety-related properties, many of these approaches require a quantification of the %remaining uncertainty. uncertainty that is inevitable when learning from real data. Unfortunately, many existing methods rely on unrealistic assumptions to derive such uncertainty bounds, preventing the real-world usage of learning-based control. We focus on the important regression setting and propose a new approach that only needs assumptions that can be derived from reasonable engineering knowledge. In order to achieve this goal, we combine the recently introduced Hard Shape Constrained Kernel Machines with geometric assumptions expressing prior model knowledge. The resulting algorithms can compute both nominal predictions with prescribed properties and rather tight uncertainty sets. Numerical experiments, including an illustrative control example, demonstrate the feasibility and performance of our approach.
|
|
WeAT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Parameter Identification |
|
|
Chair: Cucuzzella, Michele | University of Pavia |
Co-Chair: Ferrara, Antonella | University of Pavia |
|
10:00-10:20, Paper WeAT06.1 | Add to My Program |
Two Identification Algorithms for State Space Modeling from Binary Output Measurements |
|
Mestrah, Ali | Université De Caen |
Pouliquen, Mathieu | Normandie Univ, UNICAEN, ENSICAEN |
Pigeon, Eric | University of CAEN |
Keywords: Identification
Abstract: Two identification algorithms for state space modeling from binary output measurements are presented in this paper. The first algorithm is based on the estimation of a high order FIR model used to recover the high resolution output signal. The second algorithm is based on an iterative scheme allowing the construction of state sequence and then the recovering of the high resolution output signal. Implementation conditions are discussed and illustrated. Algorithms are shown to be efficient by numerical simulations.
|
|
10:20-10:40, Paper WeAT06.2 | Add to My Program |
Robust Online Joint State/input/parameter Estimation of Linear Systems |
|
Brouillon, Jean-Sébastien | EPFL |
Moffat, Keith | UC Berkeley |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Identification, Uncertain systems, Statistical learning
Abstract: This paper presents a method for jointly estimating the state, input, and parameters of linear systems in an online fashion. The method is specially designed for measurements that are corrupted with non-Gaussian noise or outliers, which are commonly found in engineering applications. In particular, it combines recursive, alternating, and iteratively-reweighted least squares into a single, one-step algorithm, which solves the estimation problem online and benefits from the robustness of least-deviation regression methods. The convergence of the iterative method is formally guaranteed. Numerical experiments show the good performance of the estimation algorithm in presence of outliers and in comparison to state-of-the-art methods.
|
|
10:40-11:00, Paper WeAT06.3 | Add to My Program |
Concurrent Learning in High-Order Tuners for Parameter Identification |
|
Le, Justin H. | Univ. of California at Santa Barbara |
Teel, Andrew R. | Univ. of California at Santa Barbara |
Keywords: Identification, Time-varying systems, Lyapunov methods
Abstract: High-order tuners are algorithms that show promise in achieving greater efficiency than classic gradient-based algorithms in identifying the parameters of parametric models and/or in facilitating the progress of a control or optimization algorithm whose adaptive behavior relies on such models. For high-order tuners, robust stability properties, namely uniform global asymptotic (and exponential) stability, currently rely on a persistent excitation (PE) condition. In this work, we establish such stability properties with a novel analysis based on a Matrosov theorem and then show that the PE requirement can be relaxed via a concurrent learning technique driven by sampled data points that are sufficiently rich. We show numerically that concurrent learning may greatly improve efficiency. We incorporate reset methods that preserve the stability guarantees while providing additional improvements that may be relevant in applications that demand highly accurate parameter estimates at relatively low additional cost in computation.
|
|
11:00-11:20, Paper WeAT06.4 | Add to My Program |
Moving Horizon Estimation with Adaptive Regularization for Ill-Posed State and Parameter Estimation Problems |
|
Baumgärtner, Katrin | University of Freiburg |
Reiter, Rudolf | University of Freiburg |
Diehl, Moritz | University of Freiburg |
Keywords: Closed-loop identification, Nonlinear systems identification, Estimation
Abstract: We investigate the usage of Moving Horizon Estimation (MHE) for state and parameter estimation for partially non-detectable systems with measurements corrupted by outliers. We propose a GGN-based arrival cost update formula and illustrate how it can be generalized to nonconvex loss functions that can be effectively used for outlier rejection. Moreover, we propose an adaptive regularization scheme for the arrival cost which introduces forgetting as well as additional pseudo-measurements to the arrival cost update. We illustrate the performance of the proposed algorithms on a longitudinal vehicle state and parameter estimation problem.
|
|
11:20-11:40, Paper WeAT06.5 | Add to My Program |
Direct Closed-Loop Identification of Continuous-Time Systems Using Fixed-Pole Observer Model |
|
Maruta, Ichiro | Kyoto University |
Sugie, Toshiharu | Osaka University |
Keywords: Closed-loop identification, Identification for control
Abstract: This paper provides a method for obtaining a continuous-time model of a target system in closed-loop from input-output data alone, in the case where no knowledge of the controllers nor excitation signals is available and I/O data may suffer from unknown offsets. The proposed method is based on a fixed-pole observer model, which is a reasonable continuous-time version corresponding to the innovation model in discrete-time and allows the identification of unstable target systems. Furthermore, it is shown that the proposed method can be attributed to a convex optimization problem by fixing the observer poles. The method is within the framework of the stabilized output error method and shares usability advantages such as robustness to noise with complex dynamics and applicability to a wide class of models. The effectiveness of the method is illustrated through numerical examples.
|
|
11:40-12:00, Paper WeAT06.6 | Add to My Program |
Finite Time Output Parameter Estimation for a Class of Nonlinear Systems |
|
Rinaldi, Gianmario | University of Exeter |
Machado Martínez, Juan Eduardo | University of Groningen |
Cucuzzella, Michele | University of Pavia |
Menon, Prathyush P | University of Exeter |
Ferrara, Antonella | University of Pavia |
Scherpen, Jacquelien M.A. | University of Groningen |
Keywords: Observers for nonlinear systems, Variable-structure/sliding-mode control, Control applications
Abstract: In this letter, a novel scheme is proposed to identify in finite time the value of an unknown output parameter for a class of nonlinear dynamical systems. Inspired by the Super-Twisting Sliding Mode Algorithm (STA), the identification problem is solved in an innovative way, consisting of two steps. An STA observer is firstly designed to track in finite time the system output. Exploiting the observer behaviour during the sliding motion, a second scheme, still inspired by the STA, is used to estimate the unknown value of the output parameter, thus solving the identification problem in finite time. The numerical simulations based on district heating systems validate the effectiveness of the proposed identification method.
|
|
WeAT07 Invited Session, Tulum Ballroom G |
Add to My Program |
Modular Design and Verification of Control Systems |
|
|
Chair: Johansson, Karl H. | Royal Institute of Technology |
Co-Chair: Nuzzo, Pierluigi | University of Southern California |
Organizer: Besselink, Bart | University of Groningen |
Organizer: Girard, Antoine | CNRS |
Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
|
10:00-10:20, Paper WeAT07.1 | Add to My Program |
Compositional Synthesis of Signal Temporal Logic Tasks Via Assume-Guarantee Contracts |
|
Liu, Siyuan | Technical University of Munich |
Saoud, Adnane | CentraleSupelec |
Jagtap, Pushpak | Indian Institute of Science |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Zamani, Majid | University of Colorado Boulder |
Keywords: Large-scale systems, Hybrid systems, Network analysis and control
Abstract: In this paper, we focus on the problem of compositional synthesis of controllers enforcing signal temporal logic (STL) tasks over a class of continuous-time nonlinear interconnected systems. By leveraging the idea of funnel-based control, we show that a fragment of STL specifications can be formulated as assume-guarantee contracts. A new concept of contract satisfaction is then defined to establish our compositionality result, which allows us to guarantee the satisfaction of a global contract by the interconnected system when all subsystems satisfy their local contracts. Based on this compositional framework, we then design closed-form continuous-time feedback controllers to enforce local contracts over subsystems in a decentralized manner. Finally, we demonstrate the effectiveness of our results on a numerical example.
|
|
10:20-10:40, Paper WeAT07.2 | Add to My Program |
Invariant Sets for Assume-Guarantee Contracts (I) |
|
Girard, Antoine | CNRS |
Iovine, Alessio | CNRS |
Benberkane, Sofiane | Université Paris-Saclay |
Keywords: Formal Verification/Synthesis
Abstract: Contract theory is a powerful tool to reason on systems that are interacting with an external environment, possibly made of other systems. Formally, a contract is usually given by assumptions and guarantees, which specify the expected behavior of the system (the guarantees) in a certain context (the assumptions). In this work, we present a verification framework for discrete-time dynamical systems with assume-guarantee contracts. We first introduce a class of assume-guarantee contracts with their satisfaction semantics parameterized by a time-horizon over which assumptions are evaluated. We then show that the problem of verifying whether such contracts are satisfied is equivalent to show the existence of a positive invariant set for an auxiliary system. This allows us to leverage the extensive literature on invariant set computation. A simple illustrative example is provided to show the effectiveness of our approach.
|
|
10:40-11:00, Paper WeAT07.3 | Add to My Program |
Task-Driven Modular Co-Design of Vehicle Control Systems (I) |
|
Zardini, Gioele | ETH Zürich |
Suter, Zelio | ETH Zürich |
Censi, Andrea | MIT |
Frazzoli, Emilio | ETH Zürich |
Keywords: Autonomous robots, Robotics, Autonomous systems
Abstract: When designing autonomous systems, we need to consider multiple trade-offs at various abstraction levels, and the choices of single (hardware and software) components need to be studied jointly. In this work we consider the problem of designing the control algorithm as well as the platform on which it is executed. In particular, we focus on vehicle control systems, and formalize state-of-the-art control schemes as monotone feasibility relations. We then show how, leveraging a monotone theory of co-design, we can study the embedding of control synthesis problems into the task-driven co-design problem of a robotic platform. The properties of the proposed approach are illustrated by considering urban driving scenarios. We show how, given a particular task, we can efficiently compute Pareto optimal design solutions.
|
|
11:00-11:20, Paper WeAT07.4 | Add to My Program |
Series Composition of Simulation-Based Assume-Guarantee Contracts for Linear Dynamical Systems (I) |
|
Shali, Brayan M. | University of Groningen |
Heidema, Marieke | University of Groningen |
van der Schaft, Arjan | Univ. of Groningen |
Besselink, Bart | University of Groningen |
Keywords: Formal Verification/Synthesis, Linear systems, Large-scale systems
Abstract: We present assume-guarantee contracts for continuous-time linear dynamical systems with inputs and outputs. These contracts are used to express specifications on the dynamic behaviour of a system. Contrary to existing approaches, we use simulation to compare the dynamic behaviour of two systems. This has the advantage of being supported by efficient numerical algorithms for verification as well as being related to the rich literature on (bi)simulation based techniques for verification and control, such as those based on (discrete) abstractions. Using simulation, we define contract implementation and a notion of contract refinement. We also define a notion of series composition for contracts, which allows us to reason about the series interconnection of systems on the basis of the contracts on its components. Together, the notions of refinement and composition allow contracts to be used for modular design and analysis of interconnected systems.
|
|
11:20-11:40, Paper WeAT07.5 | Add to My Program |
Data-Adaptive Retrofit Control for Power System Stabilizer Design (I) |
|
Ishizaki, Takayuki | Tokyo Institute of Technology |
Ito, Masahiro | Tokyo Institute of Technology |
Kawaguchi, Takahiro | Gunma University |
Chakrabortty, Aranya | North Carolina State University |
Keywords: Power systems, Decentralized control, Network analysis and control
Abstract: In this paper, we propose a design procedure of data-adaptive power system stabilizers (PSSs) in the framework of retrofit control. The proposed procedure is modular in the sense that both design and implementation processes of PSSs can be performed using only a local subsystem model and local measurement. In particular, we consider online identification of a dynamical feedback effect between the states of a generator of interest and the main grid to make the PSS adaptive to the variation of grid characteristics depending on power flow distributions. The main theoretical contribution is to show that the same retrofit controller as that developed for linear systems in the literature works properly even for nonlinear power systems where an operating point of interest varies depending on power flow distributions. In addition, we propose an online identification algorithm of the grid characteristics that makes use of the physical structure of power systems. We demonstrate the efficacy of the proposed data-adaptive PSS by a numerical simulation on the IEEE 9-bus test power system.
|
|
11:40-12:00, Paper WeAT07.6 | Add to My Program |
Contract-Based Control Synthesis with Barrier Functions for Vehicular Mission Planning (I) |
|
Waqas, Muhammad | University of Southern California |
Naik, Nikhil Vijay | University of Southern California |
Ioannou, Petros A. | Univ. of Southern California |
Nuzzo, Pierluigi | University of Southern California |
Keywords: Computer-aided control design, Automotive control
Abstract: We present a compositional control synthesis method based on assume-guarantee contracts with application to correct-by-construction design of vehicular mission plans. In our approach, a mission-level specification expressed in a fragment of signal temporal logic (STL) is decomposed into predicates defined on non-overlapping time intervals. The STL predicates are then mapped to an aggregation of contracts defined via piecewise continuously differentiable time-varying control barrier functions. The barrier functions are used to constrain the lower-level control synthesis problem, which is solved via quadratic programming. Our approach can avoid the conservatism of previous methods for task-driven control based on under-approximations. We illustrate its effectiveness on a case study motivated by vehicular mission planning under safety constraints as well as constraints imposed by traffic regulations, vehicle-to-vehicle, and vehicle-to-infrastructure communication.
|
|
WeAT08 Regular Session, Tulum Ballroom H |
Add to My Program |
Resilient Control Systems |
|
|
Chair: Clark, Andrew | Washington University in St. Louis |
Co-Chair: Panagou, Dimitra | University of Michigan, Ann Arbor |
|
10:00-10:20, Paper WeAT08.1 | Add to My Program |
Trust-Based Rate-Tunable Control Barrier Functions for Non-Cooperative Multi-Agent Systems |
|
Parwana, Hardik | University of Michigan |
Mustafa, Aquib | University of Michigan, Ann Arbor |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Resilient Control Systems, Agents-based systems, Autonomous robots
Abstract: For efficient and robust task accomplishment in multi-agent systems, an agent must be able to distinguish cooperative agents from non-cooperative (i.e., uncooperative and adversarial) agents. In this paper, we first develop a trust metric based on which each agent forms its own belief of how cooperative the other agents are, i.e., of how much the other agents contribute to maintaining safety. With safety encoded as Control Barrier Functions (CBFs), the trust metric is in turn used to adjust the rate at which the CBFs allow the system trajectories to approach the boundary of the safe set. This is achieved via a novel form of a CBF, called the Rate-Tunable CBF, which yields less conservative performance compared to an identity-agnostic implementation, where cooperative and non-cooperative agents are treated similarly. The proposed adaptation and control method is evaluated via simulations on heterogeneous multi-agent systems including non-cooperative agents.
|
|
10:20-10:40, Paper WeAT08.2 | Add to My Program |
Data-Driven Characterization of Recovery Energy in Controlled Dynamical Systems Using Koopman Operator |
|
Ramachandran, Thiagarajan | Pacific Northwest National Laboratory |
Nandanoori, Sai Pushpak | Pacific Northwest National Laboratory |
Sinha, Subhrajit | Pacific Northwest National Laboratory |
Bakker, Craig | Pacific Northwest National Laboratory |
Keywords: Resilient Control Systems, Optimal control, Power systems
Abstract: A key aspect of a resilient dynamical system is its ability to recover from large disruptions. In this paper, we define and characterize recovery energy as a quantitative measure of the minimum effort required by a control dynamical system to return to a safe operating region after large disruptions. The problem of steering the system state to a pre-specified safe region with minimum recovery energy is posed as an optimization problem which turns out to be a nonconvex due to the nonlinear nature of the system dynamics. To this end, we use the Koopman operator theoretic framework to obtain a convex reformulation in a lifted observable space. Finally, we demonstrate the approach on an IEEE 123-bus test feeder and quantify its ability to recover from disruptions.
|
|
10:40-11:00, Paper WeAT08.3 | Add to My Program |
Adaptive Malicious Robot Detection in Dynamic Topologies (I) |
|
Cavorsi, Matthew | Harvard University |
Jadhav, Ninad | Harvard University |
Saldana, David | Lehigh University |
Gil, Stephanie | Harvard University |
Keywords: Resilient Control Systems, Robotics, Networked control systems
Abstract: We consider a class of problems where robots gather observations of each other to assess the legitimacy of their peers. Previous works propose accurate detection of malicious robots when robots are able to extract observations of each other for a long enough time. However, they often consider static networks where the set of neighbors a robot observes remains the same. Mobile robots experience a dynamic set of neighbors as they move, making the acquisition of adequate observations more difficult. We design a stochastic policy that enables the robots to periodically gather observations of every other robot, while simultaneously satisfying a desired robot distribution over an environment modeled by sites. We show that with this policy, any pre-existing or new malicious robot in the network will be detected in a finite amount of time, which we minimize and also characterize. We derive bounds on the time needed to obtain the desired number of observations for a given topological map and validate these bounds in simulation. We also show and verify in a hardware experiment that the team is able to successfully detect malicious robots, and thus estimate the true distribution of cooperative robots per site, in order to converge to the desired robot distribution over sites.
|
|
11:00-11:20, Paper WeAT08.4 | Add to My Program |
Quantized and Distributed Subgradient Optimization Method with Malicious Attack |
|
Emiola, Iyanuoluwa | University of Central Florida |
Enyioha, Chinwendu | University of Central Florida |
Keywords: Optimization, Communication networks, Optimization algorithms
Abstract: This paper considers a distributed optimization problem in a multi-agent system where a fraction of the agents is compromised and acts in a malicious manner. Specifically, the compromised agents steer the network of agents away from the optimal solution by sending relaying false information to their neighbors and consume significant bandwidth in the available communication channels. We propose a distributed gradient-based optimization algorithm in which the non-malicious agents exchange quantized information with one another. We prove convergence of the solution to a neighborhood of the optimal solution, explore the convergence attributes of the algorithm and characterize the solutions obtained under the communication-constrained environment and presence of malicious agents. Numerical simulations to illustrate the results are also presented.
|
|
11:20-11:40, Paper WeAT08.5 | Add to My Program |
Inference Attack in Distributed Optimization Via Interpolation and Manipulation |
|
Xu, Jisheng | Shanghai Jiao Tong University |
He, Zhiyu | Shanghai Jiao Tong University |
Fang, Chongrong | Shanghai Jiao Tong University |
He, Jianping | Shanghai Jiao Tong University |
Peng, Yunfeng | Shanghai Jiao Tong University |
Keywords: Optimization algorithms, Optimization, Network analysis and control
Abstract: We study the problem of inference attack in distributed optimization, with adversarial agents aiming to obtain the sensitive information of some critical agent in a network. Different from existing privacy-preserving and resilient distributed optimization algorithms, we propose inference algorithms from the perspective of launching well-designed attacks to help infer the sensitive local information. The key idea is that by utilizing the critical agent's neighborhood information and the predefined update protocol, adversarial agents can not only interpolate the gradient of its local objective function, but also manipulate it to converge to its own local minimizer. The proposed algorithms can thus obtain the approximation of the gradient or the minimizer of the local objective of the critical agent. We characterize the performance through interpolation errors, as well as distances to the optimal value and optimal point of the local objective. Numerical simulations are presented to verify the effectiveness of these algorithms.
|
|
11:40-12:00, Paper WeAT08.6 | Add to My Program |
Barrier Certificate Based Safe Control for LiDAR-Based Systems under Sensor Faults and Attacks |
|
Zhang, Hongchao | Worcester Polytechnic Institute |
Cheng, Shiyu | Worcester Polytechnic Institute |
Niu, Luyao | University of Washington |
Clark, Andrew | Washington University in St. Louis |
Keywords: Vision-based control, Resilient Control Systems, Cyber-Physical Security
Abstract: Autonomous Cyber-Physical Systems (CPS) fuse proprioceptive sensors such as GPS and exteroceptive sensors including Light Detection and Ranging (LiDAR) and cameras for state estimation and environmental observation. It has been shown that both types of sensors can be compromised by malicious attacks, leading to unacceptable safety violations. We study the problem of safety-critical control of a LiDAR-based system under sensor faults and attacks. We propose a framework consisting of fault tolerant estimation and fault tolerant control. The former reconstructs a LiDAR scan with state estimations, and excludes the possible faulty estimations that are not aligned with LiDAR measurements. We also verify the correctness of LiDAR scans by comparing them with the reconstructed ones and removing the possibly compromised sector in the scan. Fault tolerant control computes a control signal with the remaining estimations at each time step. We prove that the synthesized control input guarantees system safety using control barrier certificates. We validate our proposed framework using a UAV delivery system in an urban environment. We show that our proposed approach guarantees safety for the UAV whereas a baseline fails.
|
|
WeAT09 Regular Session, Maya Ballroom I |
Add to My Program |
Cooperative Control |
|
|
Chair: Rivera, Phillip | The Johns Hopkins University Applied Physics Laboratory |
Co-Chair: Hayashi, Naoki | Osaka University |
|
10:00-10:20, Paper WeAT09.1 | Add to My Program |
A Barrier-Certified Optimal Coordination Framework for Connected and Automated Vehicles |
|
Chalaki, Behdad | Honda Research Institute |
Malikopoulos, Andreas A. | University of Delaware |
Keywords: Cooperative control, Emerging control applications, Traffic control
Abstract: In this paper, we extend a framework that we developed earlier for coordination of connected and automated vehicles (CAVs) at a signal-free intersection by integrating a safety layer using control barrier functions. First, in our motion planning module, each CAV computes the optimal control trajectory using simple vehicle dynamics. The trajectory does not make any of the state, control, and safety constraints active. A vehicle-level tracking controller employs a combined feedforward-feedback control law to track the resulting optimal trajectory from the motion planning module. Then, a barrier-certificate module, acting as a middle layer between the vehicle-level tracking controller and physical vehicle, receives the control law from the vehicle-level tracking controller and using realistic vehicle dynamics ensures that none of the state, control, and safety constraints becomes active. The latter is achieved through a quadratic program, which can be solved efficiently in real time. We demonstrate the effectiveness of our extended framework through a numerical simulation.
|
|
10:20-10:40, Paper WeAT09.2 | Add to My Program |
Contraction Analysis of Multi-Agent Control for Guaranteed Capture of a Faster Evader |
|
Rivera, Phillip | The Johns Hopkins University Applied Physics Laboratory |
Frommer, Andrew | University of Maryland |
Diaz-Mercado, Yancy | University of Maryland |
Keywords: Cooperative control
Abstract: This work presents a verifiable condition for the selection of a sufficient number of pursuers to capture a faster evader. The condition is based on the tracking performance of a multi-agent control scheme. Trajectory tracking results are provided for both the effects of the multi-agent control topology and its execution by the pursuers in the context of input saturation. To that end, nonlinear contraction theory is leveraged because it provides a unifying framework for the analysis of systems subject to bounded disturbances. Simulation experiments are performed to validate the proposed condition for sufficient pursuers selection.
|
|
10:40-11:00, Paper WeAT09.3 | Add to My Program |
Resilient Synchronization of Heterogeneous MAS against Correlated Sensor Attacks |
|
Zuo, Shan | University of Connecticut |
Wang, Yichao | University of Connecticut |
Zhang, Yi | University of Connecticut |
Keywords: Cooperative control, Fault tolerant systems, Resilient Control Systems
Abstract: Accurate state measurement is important to ensure the reliable operation of distributed multi-agent systems (MAS). Existing fault-tolerant control methods generally assume the sensor faults to be bounded and uncorrelated. In this paper, we study the ramifications of allowing the sensor attack injections to be unbounded and correlated. Concerted malicious behaviors may bypass existing attack-detection methods and threaten the synchronization performance and even system stability of the distributed networked MAS. In particular, we consider networked heterogeneous MAS in the face of correlated sensor attacks and unforeseen actuator faults. We propose a distributed attack-resilient synchronization control framework to guarantee the uniform ultimate boundedness of the overall dynamical system. A simulation example illustrates the efficacy of the proposed attack-resilient method.
|
|
11:00-11:20, Paper WeAT09.4 | Add to My Program |
Distributed Inequality Constrained Online Optimization for Unbalanced Digraphs Using Row Stochastic Property |
|
Tada, Keishin | Osaka University |
Hayashi, Naoki | Osaka University |
Takai, Shigemasa | Osaka Univ |
Keywords: Cooperative control, Optimization, Distributed control
Abstract: In this study, we discuss a primal-dual distributed algorithm for online convex optimization with a time-varying coupled constraint on unbalanced directed graphs. A group of agents exchanges the estimation variable for the dual optimizer and the scaling variable, which are used for compensating the unbalanced information flow. Then, each agent updates the primal and dual variables using the projected subgradient methods. We confirm that the regret of the cost function and the cumulative error of the constraint violation achieve sublinearity. A numerical example of a distributed economic dispatch problem demonstrates that the estimation of each agent approaches the optimal strategy under the coupled inequality constraint.
|
|
11:20-11:40, Paper WeAT09.5 | Add to My Program |
On Asymptotic Stability of Leader-Follower Multi-Agent Systems under Transient Constraints |
|
Restrepo, Esteban | KTH Royal Institute of Technology |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Cooperative control, Lyapunov methods, Stability of nonlinear systems
Abstract: We address the agreement-based coordination of first-order multi-agent systems interconnected over arbitrary connected undirected graphs and under transient and steady-state constraints. The system is in a leader-follower configuration where only a part of the agents, the leaders, are directly controlled via an external control input, in addition to the agreement protocol. We propose a control law for the leaders, based on the gradient of a potential function, that achieves consensus and guarantees that the trajectories of the inter-agent distances of the entire system remain bounded by a performance function. Relying on the edge-agreement framework and Lyapunov’s first method, we establish strong stability results in the sense of asymptotic stability of the consensus manifold and, in the leaderless case, nonuniform-in-time input-to-state stability with respect to additive disturbances. A numerical simulation illustrates the effectiveness of the proposed approach.
|
|
11:40-12:00, Paper WeAT09.6 | Add to My Program |
Global Max-Tracking of Multiple Time-Varying Signals Using a Distributed Protocol |
|
Sen, Arijit | IISER Bhopal |
Sahoo, Soumya Ranjan | Indian Institute of Technology Kanpur |
Singh, Bhavana | Indian Institute of Technology Kanpur |
Keywords: Cooperative control, Distributed control
Abstract: This paper presents a distributed protocol for a heterogeneous MAS to track the global maximum of multiple time-varying references. We define this as the global max-tracking problem. Compared to the existing max-consensus problems, the global max-tracking problem has more challenges, as the references are time-varying, and no agent has any knowledge about their global maximum a priori. Under the proposed protocol, each agent tracks the global maximum of continuously time-varying references irrespective of their time-derivatives. Thus, the current protocol works for a wide range of time-varying references. A discussion with the numerical simulations exemplifies the efficacy of the proposed global max-tracking protocol.
|
|
WeAT10 Regular Session, Maya Ballroom II |
Add to My Program |
Stochastic Systems IV |
|
|
Chair: Vidyasagar, Mathukumalli | Indian Institute of Technology Hyderabad |
Co-Chair: Burdick, Joel W. | California Inst. of Tech |
|
10:00-10:20, Paper WeAT10.1 | Add to My Program |
Near Optimality of Finite Memory Policies for POMPDs with Continuous Spaces |
|
Kara, Ali Devran | University of Michigan |
Bayraktar, Erhan | University of Michigan |
Yuksel, Serdar | Queen's University |
Keywords: Stochastic optimal control, Filtering, Learning
Abstract: We study an approximation method for partially observed Markov decision processes (POMDPs) with continuous spaces. Belief MDP reduction, which has been the standard approach to study POMDPs, which, due to its uncountable state space and strict regularity properties however, requires rigorous approximation methods for practical applications. In this work, we focus on an approximation procedure via discretizing the observation space and constructing a fully observed finite MDP model using a finite length history of the discrete observations and control actions. We show that the resulting policy is nearly optimal under some regularity assumptions on the channel, and under certain controlled filter stability requirements for the hidden state process. We also provide a Q learning algorithm that uses a finite memory of discretized information variables, and prove its convergence to the optimality equation of the finite fully observed MDP constructed using the approximation method.
|
|
10:20-10:40, Paper WeAT10.2 | Add to My Program |
Risk-Averse Reinforcement Learning Via Dynamic Time-Consistent Risk Measures |
|
Yu, Xian | University of Michigan |
Shen, Siqian | University of Michigan |
Keywords: Stochastic optimal control, Markov processes, Learning
Abstract: Traditional reinforcement learning (RL) aims to maximize the expected total reward, while the risk of uncertain outcomes needs to be controlled to ensure reliable performance in a risk-averse setting. In this paper, we consider the problem of maximizing dynamic risk of a sequence of rewards in infinite-horizon Markov Decision Processes (MDPs). We adapt the Expected Conditional Risk Measures (ECRMs) to the infinite-horizon risk-averse MDP and prove its time consistency. Using a convex combination of expectation and conditional value-at-risk (CVaR) as a special one-step conditional risk measure, we reformulate the risk-averse MDP as a risk-neutral counterpart with augmented action space and manipulation on the immediate rewards. We further prove that the related Bellman operator is a contraction mapping, which guarantees the convergence of any value-based RL algorithms. Accordingly, we develop a risk-averse deep Q-learning framework, and our numerical studies based on two simple MDPs show that the risk-averse setting can reduce the variance and enhance robustness of the results.
|
|
10:40-11:00, Paper WeAT10.3 | Add to My Program |
Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning |
|
Gupta, Piyush | Michigan State University |
Srivastava, Vaibhav | Michigan State University |
Keywords: Stochastic systems, Machine learning, Learning
Abstract: We propose Deterministic Sequencing of Exploration and Exploitation (DSEE) algorithm with interleaving exploration and exploitation epochs for model-based RL problems that aim to simultaneously learn the system model, i.e., a Markov decision process (MDP), and the associated optimal policy. During exploration, DSEE explores the environment and updates the estimates for expected reward and transition probabilities. During exploitation, the latest estimates of the expected reward and transition probabilities are used to obtain a robust policy with high probability. We design the lengths of the exploration and exploitation epochs such that the cumulative regret grows as a sub-linear function of time.
|
|
11:00-11:20, Paper WeAT10.4 | Add to My Program |
A New Converse Lyapunov Theorem for Global Exponential Stability and Applications to Stochastic Approximation |
|
Vidyasagar, Mathukumalli | Indian Institute of Technology Hyderabad |
Keywords: Stochastic systems, Randomized algorithms, Machine learning
Abstract: In this paper, we give a simple and direct proof of the convergence of the stochastic approximation algorithm under suitable conditions. The main result here can be com- pared to that in Borkar and Meyn (2000), which is based on the ODE method, that is, showing that the sample paths of the algorithm converge towards the deterministic trajectories of an associated ODE. In contrast, the present proof is based on martingale theory, first proposed in Gladyshev (1965). Consequently, there are fewer assumptions here compared to previous papers. An important part of the proof is a new converse Lyapunov theorem for global exponential stability. Aside from its application to stochastic approximation theory, this new converse Lyapunov theorem would be useful for researchers in stability theory.
|
|
11:20-11:40, Paper WeAT10.5 | Add to My Program |
Moving Obstacle Avoidance: A Data-Driven Risk-Aware Approach |
|
Wei, Skylar | Caltech |
Dixit, Anushri | Caltech |
Tomar, Shashank | California Institute of Technology |
Burdick, Joel W. | California Inst. of Tech |
Keywords: Stochastic optimal control, Statistical learning, Uncertain systems
Abstract: This paper proposes a new structured method for a moving agent to predict the paths of dynamically moving obstacles and avoid them using a risk-aware model predictive control (MPC) scheme. Given noisy measurements of the a priori unknown obstacle trajectory, a bootstrapping technique predicts a set of obstacle trajectories. The bootstrapped predictions are incorporated in the MPC optimization using a risk-aware methodology so as to provide probabilistic guarantees on obstacle avoidance. We validate our methods using simulations of a multi-rotor drone that avoids various moving obstacles.
|
|
11:40-12:00, Paper WeAT10.6 | Add to My Program |
Nonparametric, Nonasymptotic Confidence Bands with Paley-Wiener Kernels for Band-Limited Functions |
|
Csáji, Balázs Cs. | SZTAKI |
Horváth, Bálint | SZTAKI |
Keywords: Statistical learning, Machine learning, Estimation
Abstract: The paper introduces a method to construct confidence bands for bounded, band-limited functions based on a finite sample of input-output pairs. The approach is distribution-free w.r.t. the observation noises and only the knowledge of the input distribution is assumed. It is nonparametric, that is, it does not require a parametric model of the regression function and the regions have non-asymptotic guarantees. The algorithm is based on the theory of Paley-Wiener reproducing kernel Hilbert spaces. The paper first studies the fully observable variant, when there are no noises on the observations and only the inputs are random; then it generalizes the ideas to the noisy case using gradient-perturbation methods. Finally, numerical experiments demonstrating both cases are presented.
|
|
WeAT11 Regular Session, Maya Ballroom III |
Add to My Program |
Robust Control II |
|
|
Chair: Sojoudi, Somayeh | UC Berkeley |
Co-Chair: Heinlein, Moritz | TU Dortmund University |
|
10:00-10:20, Paper WeAT11.1 | Add to My Program |
Chance-Constrained Trajectory Planning with Multimodal Environmental Uncertainty |
|
Ren, Kai | University of British Columbia |
Ahn, Heejin | University of British Columbia |
Kamgarpour, Maryam | EPFL |
Keywords: Robust control, Sampled-data control, Stochastic optimal control
Abstract: We tackle safe trajectory planning under Gaussian mixture model (GMM) uncertainty. Specifically, we use a GMM to model the multimodal behaviors of obstacles' uncertain states. Then, we develop a mixed-integer conic approximation to the chance-constrained trajectory planning problem with deterministic linear systems and polyhedral obstacles. When the GMM moments are estimated via finite samples, we develop a tight concentration bound to ensure the chance constraint with a desired confidence. Moreover, to limit the amount of constraint violation, we develop a Conditional Value-at-Risk (CVaR) approach corresponding to the chance constraints and derive a tractable approximation for known and estimated GMM moments. We verify our methods with state-of-the-art trajectory prediction algorithms and autonomous driving datasets.
|
|
10:20-10:40, Paper WeAT11.2 | Add to My Program |
Fragility Margin of PWA Control Laws Using a Hyperplane Based Binary Search Tree |
|
Yang, Songlin | CentraleSupele, Paris Saclay University |
Olaru, Sorin | CentraleSupélec |
Rodriguez-Ayerbe, Pedro | Supelec |
Dorea, Carlos E.T. | Universidade Federal Do Rio Grande Do Norte |
Keywords: Robust control, Nonlinear systems, Constrained control
Abstract: This paper concentrates on the fragility margins of discrete-time Piecewise Affine (PWA) closed-loop dynamics. Starting from the case where the nominal trajectories are controlled by a PWA controller using a positioning mechanism within a binary search tree (BST), we are interested in preserving the properties of the nominal dynamics, particularly the positive invariance in the presence of perturbations in the control law representation. As the main result, we define (and effectively construct) the fragility margin for a hyperplane defining the partition of the nominal PWA controller. This hyperplane will be identified through a node in a BST, and the proposed margin characterizes the degrees of freedom in the perturbation of its coefficients as it might result from a quantization operation. From the mathematical standpoint, the fragility margin is a set in the coefficients’ space, and the properties of this set are formally described.
|
|
10:40-11:00, Paper WeAT11.3 | Add to My Program |
LQR Control with Sparse Adversarial Disturbances |
|
Pfrommer, Samuel | University of California, Berkeley |
Sojoudi, Somayeh | UC Berkeley |
Keywords: Robust control, Optimal control, Linear systems
Abstract: Recent developments in cyber-physical systems and event-triggered control have led to an increased interest in the impact of sparse disturbances on dynamical processes. We study Linear Quadratic Regulator (LQR) control under sparse disturbances by analyzing three distinct policies: the blind online policy, the disturbance-aware policy, and the optimal offline policy. We derive the two-dimensional recurrence structure of the optimal disturbance-aware policy, under the assumption that the controller has information about future disturbance values with only a probabilistic model of their locations in time. Under mild conditions, we show that the disturbance-aware policy converges to the blind online policy if the number of disturbances grows sublinearly in the time horizon. Finally, we provide a finite-horizon regret bound between the blind online policy and optimal offline policy, which is proven to be quadratic in the number of disturbances and in their magnitude. This provides a useful characterization of the suboptimality of a standard LQR controller when confronted with unexpected sparse perturbations.
|
|
11:00-11:20, Paper WeAT11.4 | Add to My Program |
Robust MPC Approaches for Monotone Systems |
|
Heinlein, Moritz | TU Dortmund University |
Subramanian, Sankaranarayanan | TU Dortmund |
Molnar, Marco | Technische Universität Berlin |
Lucia, Sergio | TU Dortmund University |
Keywords: Robust control, Predictive control for nonlinear systems, Predictive control for linear systems
Abstract: This paper exploits the efficient computation of reachable sets for monotone systems to formulate model predictive control approaches with guaranteed constraint satisfaction and recursive feasibility in the presence of uncertainty. To include all possible realizations of the uncertainties, the reachable sets are approximated as hyperrectangles in the problem formulation. The presented approach is extended to include recourse, i.e., the knowledge about the presence of further measurement information in the prediction horizon. By dividing the reachable sets, multiple regions are obtained, for which the future inputs can be chosen separately. The applicability of the proposed approaches are shown by the means of a temperature control example.
|
|
11:20-11:40, Paper WeAT11.5 | Add to My Program |
Nu-Analysis: A New Notion of Robustness for Large Systems with Structured Uncertainties |
|
Kjellqvist, Olle | Lund University |
Doyle, John C. | Caltech |
Keywords: Robust control, Uncertain systems, Large-scale systems
Abstract: We present a new, scalable alternative to the structured singular value, which we call nu, provide a convex upper bound, study their properties and compare them to ell_1 robust control. The analysis relies on a novel result on the relationship between robust control of dynamical systems and non-negative constant matrices.
|
|
11:40-12:00, Paper WeAT11.6 | Add to My Program |
Convex Robust Performance with Structured Uncertainties in System Level Synthesis |
|
Sarma, Anish | California Institute of Technology |
Doyle, John C. | Caltech |
Keywords: Robust control, Uncertain systems, Large-scale systems
Abstract: We develop convex parameterizations for L1-robust controllers in the System Level Synthesis (SLS) framework, allowing us to find robust decentralized controllers that preserve design constraints on sparsity, locality, and delay. The parameterizations have interpretable relationships with the set of nominally stabilizing SLS controllers. We show that the state feedback robust performance SLS problem can be posed exactly and end-to-end as a linear program applicable to large-scale distributed control. We generalize these results to unstructured uncertainties in the output feedback setting, providing new decentralized robust performance guarantees in this setting.
|
|
WeAT12 Invited Session, Maya Ballroom IV |
Add to My Program |
Learning and Resilience in Event-Triggered Control |
|
|
Chair: Nowzari, Cameron | George Mason University |
Co-Chair: Hirche, Sandra | Technische Universität München |
Organizer: Nowzari, Cameron | George Mason University |
Organizer: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
Organizer: Hirche, Sandra | Technische Universität München |
|
10:00-10:20, Paper WeAT12.1 | Add to My Program |
Event-Based Robust Stabilization: An Operator-Theoretic Approach (I) |
|
Zhang, Shiqi | Peking Univeristy |
Li, Zhongkui | Peking University |
Keywords: Linear systems, Robust control
Abstract: This paper considers the robustness of event-triggered control of general linear systems against additive frequency-domain uncertainties. It is revealed that in the event-triggered mechanisms, the sampling errors are images of affine operators acting on the sampled outputs. Though not belonging to mathcal {RH}_{infty}, these operators are finite-gain mathcal L_2 stable with operator-norm depending on the triggering conditions and the norm bound of the uncertainties. Moreover, the robust event-triggered controller design problem can then be transformed into a standard H_{infty} synthesis problem of a linear system having the same order as the controlled plant. Algorithms are also provided to construct the event-triggered robust dynamic output-feedback controllers.
|
|
10:20-10:40, Paper WeAT12.2 | Add to My Program |
Event-Based Communication in Distributed Q-Learning (I) |
|
Jarne Ornia, Daniel | Delft University of Technology |
Mazo Jr., Manuel | Delft University of Technology |
Keywords: Networked control systems, Discrete event systems, Learning
Abstract: We present an approach to reduce the communication of information needed on a Distributed Q-Learning system inspired by Event Triggered Control (ETC) techniques. We consider a baseline scenario of a Distributed Q-Learning problem on a Markov Decision Process (MDP). Following an event-based approach, N agents sharing a value function explore the MDP and compute a trajectory-dependent triggering signal which they use distributedly to decide when to communicate information to a central learner in charge of computing updates on the action-value function. These decision functions form an Event Based distributed Q learning system (EBd-Q), and we derive convergence guarantees resulting from the reduction of communication. We then apply the proposed algorithm to a cooperative path planning problem, and show how the agents are able to learn optimal trajectories communicating a fraction of the information. Additionally, we discuss what effects (desired and undesired) these event-based approaches have on the learning processes studied, and how they can be applied to more complex multi-agent systems.
|
|
10:40-11:00, Paper WeAT12.3 | Add to My Program |
Self-Triggered Ternary Control for Resilient Consensus against Mobile Adversarial Agents (I) |
|
Matsume, Hiroki | Tokyo Institute of Technology |
Wang, Yuan | KTH Royal Institute of Technology |
Ishii, Hideaki | Tokyo Institute of Technology |
Defago, Xavier | Tokyo Institute of Technology |
Keywords: Resilient Control Systems, Agents-based systems, Cyber-Physical Security
Abstract: In this paper, we consider the problem of multiagent consensus in the presence of mobile adversaries. Faulty agents try to prevent the coordination among the regular agents and moreover are mobile in the sense that they can change their identities over time. Our approach towards resilient consensus is to extend the so-called mean subsequence reduced (MSR) algorithms to reduce the necessary communication based on two measures: The information exchanged by the agents takes the form of ternary data in each message and furthermore selftriggered method is used to keep the transmission frequency limited. Certain features are introduced to address issues specific to the mobile nature of the adversarial agents. We verify the effectiveness of the proposed algorithm by means of a numerical example.
|
|
11:00-11:20, Paper WeAT12.4 | Add to My Program |
Asynchronous Bayesian Learning Over a Network (I) |
|
Bhar, Kinjal | Oklahoma State University |
Bai, He | Oklahoma State University |
George, Jemin | U.S. Army Research Laboratory |
Busart, Carl | US Army DEVCOM ARL |
Keywords: Machine learning, Sensor networks, Statistical learning
Abstract: We present a practical asynchronous data fusion model for networked agents to perform distributed Bayesian learning without sharing raw data. Our algorithm employs unadjusted Langevin dynamics with a gossip-based protocol for sampling, coupled with an event-triggered mechanism to further reduce communication between gossiping agents. These mechanisms drastically reduce communication overhead and help avoid bottlenecks commonly experienced with distributed algorithms. In addition, the algorithm is expected to increase resilience to occasional link failure. We establish mathematical guarantees for our algorithm and demonstrate its effectiveness via a numerical experiment.
|
|
11:20-11:40, Paper WeAT12.5 | Add to My Program |
Adaptive Sampling and Control for POMDPs: Application to Precision Agriculture (I) |
|
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Beumer, Ruben | Eindhoven University of Technology (TU/e) |
Molengraft, René van de | Eindhoven University of Technology |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Optimal control, Markov processes, Discrete event systems
Abstract: Given a partially observable Markov decision process (POMDP) with finite state, input and measurement spaces, and costly measurements and control, we consider the problem of when to sample and actuate. Both sampling and actuation are modeled as control actions in a framework encompassing estimation and intervention problems. The process evolves freely between two consecutive control action times. Control actions are assumed to reset the conditional distribution of the state given the measurements to one of a finite number of distributions. We tackle the problem of deciding when control actions should occur in order to minimize an average cost that penalizes states and the rate of control actions. The problem is first shown to boil down to a stopping time problem. While the latter can be solved optimally, the complexity of the optimal policy is intractable. Thus, we propose two approximate methods. The first is inspired by relaxed dynamic programming, and it is within an additive cost factor of the optimal policy. The second is inspired by consistent event-triggered control and ensures that the cost is smaller than that of periodic control for the same control rate. We conclude that the latter policy can deal with large dimensional problems, as demonstrated in the context of precision agriculture.
|
|
11:40-12:00, Paper WeAT12.6 | Add to My Program |
Backstepping Tracking Control Using Gaussian Processes with Event-Triggered Online Learning |
|
Jiao, Junjie | Technical University of Munich |
Capone, Alexandre | Technical University of Munich |
Hirche, Sandra | Technische Universität München |
Keywords: Machine learning, Uncertain systems, Adaptive control
Abstract: In this paper, we present a trajectory tracking control law for a class of partially unknown nonlinear systems that combines backstepping and event-triggered online learning. We employ Gaussian processes to learn the unknown system model using measurement data collected online, while the proposed control law is active. Our approach uses an efficient event-triggered online learning scheme that exclusively collects informative data to update the estimated model used for control. The resulting control law guarantees that the tracking error is globally uniformly ultimately bounded. The inter-event time is shown to be lower-bounded by a positive constant. Moreover, we also discuss how to obtain a trade-off between the cardinality of the collected training data and the size of the ultimate tracking error bound. In a simulation example, our approach is shown to outperform a state-of-the-art offline learning-based approach both in terms of tracking performance and data efficiency.
|
|
WeAT13 Regular Session, Maya Ballroom V |
Add to My Program |
Predictive Control for Nonlinear Systems I |
|
|
Chair: Lermusiaux, Pierre F. J. | Massachusetts Institute of Technology |
Co-Chair: Ferrari, Riccardo M.G. | Delft University of Technology |
|
10:00-10:20, Paper WeAT13.1 | Add to My Program |
Nonlinear MPC and MHE of a Gasoline Controlled Auto-Ignition Engine Based on Reduced-Order Differential Equation Models |
|
De Schutter, Jochem | ALU Freiburg |
Nuss, Eugen | Institute of Automatic Control |
Baumgärtner, Katrin | University of Freiburg |
Abel, Dirk | RWTH Aachen University |
Diehl, Moritz | University of Freiburg |
Keywords: Predictive control for nonlinear systems, Automotive control, Switched systems
Abstract: Gasoline Controlled Auto-Ignition (GCAI) is a novel combustion technology that promises higher efficiency and lower pollutant emissions than conventional combustion engines. These advantages come at the cost of a high sensitivity to disturbances and instabilities in combustion phasing. Closed-loop control methods such as linear model predictive control are able to stabilize the process, although the operating range remains limited. Nonlinear model predictive control and moving horizon estimation are able to optimize the fully nonlinear system model and promise higher prediction and estimation accuracy. Nonlinear models based on differential equations offer high physical detail and flexibility, however they pose a high computational burden on the control and estimation algorithms. This paper proposes an efficient, real-time capable MPC and MHE scheme based on the Real Time Iteration (RTI) for a continuous-time GCAI model. We achieve a feedback delay of below 1 ms and RTI preparation time of below 18 ms in numerical simulations on the embedded hardware of an existing GCAI engine test setup. Thereby the proposed scheme satisfies the computational constraints imposed by the engine speed by a safety factor of 4.
|
|
10:20-10:40, Paper WeAT13.2 | Add to My Program |
Navigating Underactuated Agents by Hitchhiking Forecast Flows |
|
Wiggert, Marius | UC Berkeley |
Doshi, Manan | Massachusetts Institute of Technology |
Tomlin, Claire J. | UC Berkeley |
Lermusiaux, Pierre F. J. | Massachusetts Institute of Technology |
Keywords: Predictive control for nonlinear systems, Maritime control, Autonomous systems
Abstract: Underactuated agents can achieve energy-efficient navigation in the air and oceans by hitchhiking complex flows. However, in real flows the forecast error can be larger than the actuation of the agent which poses a challenge for reliable navigation. In this paper, we propose a closed-loop control schema in the spirit of Model Predictive Control which allows time-optimal replanning at every step with one computation per forecast. We us the recent Multi-Time Hamilton-Jacobi Reachability formulation to obtain a value function which is used for closed-loop control. We evaluate the reliability of our method empirically over a large set of multi-day start-target missions in the ocean currents of the Gulf of Mexico with realistic forecast errors. Our method outperforms the baselines significantly and achieves high reliability, measured as the success rate of navigating from start to target within a maximum allowed time, at low computational cost.
|
|
10:40-11:00, Paper WeAT13.3 | Add to My Program |
An Economic Model Predictive Control Approach for Load Mitigation on Multiple Tower Locations of Wind Turbines |
|
Feng, Zhixin | TU Delft |
Gallo, Alexander J. | TU Delft |
Liu, Yichao | Delft University of Technology |
Pamososuryo, Atindriyo Kusumo | Delft University of Technology |
Ferrari, Riccardo M.G. | Delft University of Technology |
van Wingerden, Jan-Willem | Delft University of Technology |
Keywords: Predictive control for nonlinear systems, Flexible structures, Power generation
Abstract: The current trend in the evolution of wind turbines is to increase their rotor size in order to capture more power. This leads to taller, slender and more flexible towers, which thus experience higher dynamical loads due to the turbine rotation and environmental factors. It is hence compelling to deploy advanced control methods that can dynamically counteract such loads, especially at tower positions that are more prone to develop cracks or corrosion damages. Still, to the best of the authors’ knowledge, little to no attention has been paid in the literature to load mitigation at multiple tower locations. Furthermore, there is a need for control schemes that can balance load reduction with optimization of power production. In this paper, we develop an Economic Model Predictive Control (eMPC) framework to address such needs. First, we develop a linear modal model to account for the tower flexural dynamics. Then we incorporate it into an eMPC framework, where the dynamics of the turbine rotation are expressed in energy terms. This allows us to obtain a convex formulation, that is computationally attractive. Our control law is designed to avoid the “turn-pike” behavior and guarantee recursive feasibility. We demonstrate the performance of the proposed controller on a 5MW reference WT model: the results illustrate that the proposed controller is able to reduce the tower loads at multiple locations, without significant effects to the generated power.
|
|
11:00-11:20, Paper WeAT13.4 | Add to My Program |
Model Predictive Control of a Tandem-Rotor Helicopter with a Non-Uniformly Spaced Prediction Horizon |
|
Ahmed, Faraaz | McGill University |
Sobiesiak, Ludwik Andrew | NGC Aerospace Ltd |
Forbes, James Richard | McGill University |
Keywords: Predictive control for nonlinear systems, Optimal control, Aerospace
Abstract: This paper considers model predictive control of a tandem-rotor helicopter. The error is formulated using the matrix Lie group SE2(3). A reference trajectory to a target is calculated using a quartic guidance law, leveraging the differentially flat properties of the system, and refined using a finite-horizon linear quadratic regulator. The nonlinear system is linearized about the reference trajectory enabling the formulation of a quadratic program with control input, attitude keep-in zone, and attitude error constraints. A non-uniformly spaced prediction horizon is leveraged to capture the multi-timescale dynamics while keeping the problem size tractable. Monte-Carlo simulations demonstrate robustness of the proposed control structure to initial conditions, model uncertainty, and environmental disturbances.
|
|
11:20-11:40, Paper WeAT13.5 | Add to My Program |
EigenMPC: An Eigenmanifold-Inspired Model-Predictive Control Framework for Exciting Efficient Oscillations in Mechanical Systems |
|
Coelho, Andre | Dextrous Robotics Inc |
Albu-Schaeffer, Alin | German Aerospace Center (DLR) |
Sachtler, Arne | Technical University of Munich (TUM) |
Mishra, Hrishik | German Aerospace Center (DLR) |
Bicego, Davide | LAAS-CNRS |
Ott, Christian | TU Wien |
Franchi, Antonio | University of Twente |
Keywords: Predictive control for nonlinear systems, Robotics, Algebraic/geometric methods
Abstract: This paper proposes a Nonlinear Model-Predictive Control (NMPC) method capable of finding and converging to energy-efficient regular oscillations, which require no control action to be sustained. The approach builds up on the recently developed Eigenmanifold theory, which defines the sets of line-shaped oscillations of a robot as an invariant two-dimensional submanifold of its state space. By defining the control problem as a nonlinear program (NLP), the controller is able to deal with constraints in the state and control variables and be energy-efficient not only in its final trajectory but also during the convergence phase. An initial implementation of this approach is proposed, analyzed, and tested in simulation.
|
|
11:40-12:00, Paper WeAT13.6 | Add to My Program |
Hamilton-Jacobi Multi-Time Reachability |
|
Doshi, Manan | Massachusetts Institute of Technology |
Bhabra, Manmeet S | MIT |
Wiggert, Marius | UC Berkeley |
Tomlin, Claire J. | UC Berkeley |
Lermusiaux, Pierre F. J. | Massachusetts Institute of Technology |
Keywords: Predictive control for nonlinear systems, Computational methods, Fluid flow systems
Abstract: For the analysis of dynamical systems, it is fundamental to determine all states that can be reached at any given time. In this work, we obtain and apply new governing equations for reachability analysis over multiple start and terminal times all at once, and for systems operating in time-varying environments with dynamic obstacles and any other relevant dynamic fields. The theory and schemes are developed for both backward and forward reachable tubes with time-varying target and start sets. The resulting value functions elegantly capture not only the reachable tubes but also time-to-reach and time-to-leave maps as well as start time vs. duration plots and other useful secondary quantities for optimal control. We discuss the numerical schemes and computational efficiency. We first verify our results in an environment with a moving target and obstacle where reachability tubes can be analytically computed. We then consider the Dubin’s car problem extended with a moving target and obstacle. Finally, we showcase our multi-time reachability in a non-hydrostatic bottom gravity current system. Results highlight the novel capabilities of exact multi-time reachability in dynamic environments.
|
|
WeAT14 Invited Session, Maya Ballroom VI |
Add to My Program |
Mechatronic Systems |
|
|
Chair: Flores, Gerardo | Center for Research in Optics |
Co-Chair: Heertjes, Marcel | Eindhoven University of Technology |
Organizer: Flores, Gerardo | Center for Research in Optics |
Organizer: Rakotondrabe, Micky | ENIT Tarbes, INPT, University of Toulouse |
Organizer: Khadraoui, Sofiane | University of Sharjah |
Organizer: Oomen, Tom | Eindhoven University of Technology |
Organizer: Heertjes, Marcel | Eindhoven University of Technology |
Organizer: Al Janaideh, Mohammad | Memorial University |
|
10:00-10:20, Paper WeAT14.1 | Add to My Program |
Chetaev Instability Analysis of Kinetostatic Compliance-Based Protein Unfolding |
|
Mohammadi, Alireza | University of Michigan, Dearborn |
Spong, Mark W. | University of Texas at Dallas |
Keywords: Emerging control applications, Modeling, Biomolecular systems
Abstract: Understanding the process of protein unfolding plays a crucial role in various applications such as design of folding-based protein engines. Using the well-established kinetostatic compliance (KCM)-based method for modeling of protein conformation dynamics and a recent nonlinear control theoretic approach to KCM-based protein folding, this paper formulates protein unfolding as a destabilizing control analysis/synthesis problem. In light of this formulation, it is shown that the Chetaev instability framework can be used to investigate the KCM-based unfolding dynamics. In particular, a Chetaev function for analysis of unfolding dynamics under the effect of optical tweezers and a class of control Chetaev functions for synthesizing control inputs that elongate protein strands from their folded conformations are presented. Based on the presented control Chetaev function, an unfolding input is derived from the Artstein-Sontag universal formula and the results are compared against optical tweezer-based unfolding.
|
|
10:20-10:40, Paper WeAT14.2 | Add to My Program |
Nonlinear Proportional-Integral Disturbance Observers for Motion Control of Permanent Magnet Synchronous Motors |
|
Jeong, Yong Woo | Hanyang University |
Chung, Chung Choo | Hanyang University |
Keywords: Mechatronics, Observers for nonlinear systems, Control applications
Abstract: In this paper, we present nonlinear proportional integrator (N-PI) disturbance observers (DOBs) for enhancing the motion tracking performance of a surface-mounted permanent magnet synchronous motor in rapidly varying speed regions. By presenting an N-PI-DOB for load torque estimation with a torque modulation technique, we show that the electromechanical system, including the motion controller, disturbance observers, and inverter controller, can be presented in a three-cascaded form, in which the stability analysis of the closed-loop system can be easily performed. After analyzing the disturbances of the current tracking error dynamics, we designed another N-PI-DOB and a Lyapunov-based nonlinear controller for the current loop to enhance the motion tracking performance. With these N-PI-DOBs and the motion controller, we analyzed the stability of the motion tracking error dynamics and estimation error dynamics. We experimentally performed a comparative study with and without N-PI-DOB to validate the effectiveness of the proposed method under the conditions of unknown load torque and rapid speed variation.
|
|
10:40-11:00, Paper WeAT14.3 | Add to My Program |
Stabilization of Underactuated Systems of Degree One Via Neural Interconnection and Damping Assignment -- Passivity Based Control |
|
Sanchez-Escalonilla, Santiago | University of Groningen |
Reyes-Baez, Rodolfo | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Keywords: Neural networks, Machine learning, Nonlinear output feedback
Abstract: In this work, we show the potential of the universal approximation property of neural networks in the design of interconnection and damping assignment passivity-based controllers (IDA-PBC) for stabilizing nonlinear underactuated mechanical systems of degree one. Towards this end, we refor- mulate the IDA-PBC design methodology as a neural supervised learning problem that approximates the solution of the partial differential matching equations, which fulfills the equilibrium assignment and stability conditions. The output of the neural learning process has clear physical and control-theoretic inter- pretations in terms of energy, passivity and Lyapunov stability. The proposed approach is numerically evaluated in two well- known underactuated systems: the inverted pendulum on a cart and inertial wheel pendulum, whose analytic IDA-PBC solutions are non-trivial to obtain.
|
|
11:00-11:20, Paper WeAT14.4 | Add to My Program |
Finite-Time Stabilization of the Generalized Bouc-Wen Model for Piezoelectric Systems |
|
Flores, Gerardo | Center for Research in Optics |
Rakotondrabe, Micky | ENIT Tarbes, INPT, University of Toulouse |
Keywords: Mechatronics, Robotics, Robust control
Abstract: When designing controllers for piezoelectric systems with hysteresis, usually simplified models are used. This can lead to inaccuracies in the closed-loop system response. In this paper, we pose the problem of tracking stabilization of piezoelectric systems using the generalized Bouc-Wen model, a highly non-linear system rarely used to design controllers. Besides, we consider only partial knowledge of one hysteresis system parameter and external disturbances in all the system states. We propose an interconnected control composed of three parts for the solution: an observer, a virtual hysteresis control, and an actuator control. It is demonstrated that the closed-loop system converges in finite time. Simulation experiments were carried out, demonstrating the effectiveness of our approach despite exogenous and unknown disturbances.
|
|
11:20-11:40, Paper WeAT14.5 | Add to My Program |
Unifying Model-Based and Neural Network Feedforward: Physics-Guided Neural Networks with Linear Autoregressive Dynamics (I) |
|
Kon, Johan | Eindhoven University of Technology |
Bruijnen, Dennis | Philips Engineering Solutions |
van de Wijdeven, Jeroen | ASML Netherlands B.V |
Heertjes, Marcel | Eindhoven University of Technology |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Mechatronics, Grey-box modeling, Machine learning
Abstract: Unknown nonlinear dynamics often limit the tracking performance of feedforward control. The aim of this paper is to develop a feedforward control framework that can compensate these unknown nonlinear dynamics using universal function approximators. The feedforward controller is parametrized as a parallel combination of a physics-based model and a neural network, where both share the same linear autoregressive (AR) dynamics. This parametrization allows for efficient output-error optimization through Sanathanan-Koerner (SK) iterations. Within each SK-iteration, the output of the neural network is penalized in the subspace of the physics-based model through orthogonal projection-based regularization, such that the neural network captures only the unmodelled dynamics, resulting in interpretable models.
|
|
11:40-12:00, Paper WeAT14.6 | Add to My Program |
Output-Feedback Control of Electromagnetic Actuated Micropositioning System with Uncertain Nonlinearities and Unknown Gap Variation (I) |
|
Al Saaideh, Mohammad | Memorial University of Newfoundland |
Boker, Almuatazbellah | Virginia Tech |
Al Janaideh, Mohammad | Memorial University of Newfoundland |
Keywords: Mechatronics
Abstract: This paper investigates the output feedback tracking control of the motion system driven by an electromagnetic actuator with uncertain nonlinearities and unknown gap variation, considering only the measured output position. The output feedback control is based on two cascade high-gain observers combined with a full state feedback control that is based on the backstepping approach. In this work, the observer of two cascaded high-gain observers with different speeds is considered; the faster one estimates the output position and velocity of the motion system and feeds a virtual nonlinear output to estimate the magnetic flux and nonlinear hysteresis of the reluctance actuator. We show that the equilibrium point of the full state feedback control system under full knowledge of the system information is globally asymptotically stable. The simulation results show that the output feedback control achieves the tracking control objective and recovers the performance of the state feedback control. Also, the simulation results show the robustness of the output feedback control for unknown variations in the air gap and unknown external disturbances.
|
|
WeAT15 Regular Session, Maya Ballroom VII |
Add to My Program |
Game Theory I |
|
|
Chair: Bopardikar, Shaunak D. | Michigan State University |
Co-Chair: Cenedese, Carlo | ETH Zurich |
|
10:00-10:20, Paper WeAT15.1 | Add to My Program |
A Stackelberg Game for Incentive-Based Demand Response in Energy Markets |
|
Fochesato, Marta | ETH Zurich |
Cenedese, Carlo | ETH Zurich |
Lygeros, John | ETH Zurich |
Keywords: Game theory, Hierarchical control, Smart grid
Abstract: In modern buildings renewable energy generators and storage devices are spreading, and consequently the role of the users in the power grid is shifting from passive to active. We design a demand response scheme that exploits the prosumers' flexibility to provide ancillary services to the main grid. We propose a hierarchical scheme to coordinate the interactions between the distribution system operator and a community of smart prosumers. The framework inherits characteristics from price-based and incentive-based schemes and it retains the advantages of both. We cast the problem as a Stackelberg game with the prosumers as followers and the distribution system operator as leader. We solve the resulting bilevel optimization program via a KKT reformulation, proving the existence and the convergence to a local Stackelberg equilibrium.
|
|
10:20-10:40, Paper WeAT15.2 | Add to My Program |
Game-Theoretic Steady-State Control |
|
Romano, Andrew | University of Toronto |
Pavel, Lacra | University of Toronto |
Keywords: Game theory, Distributed control, Optimization algorithms
Abstract: We consider a set of autonomous agents with LTI dynamics subject to constant external disturbance. Each agent seeks to minimize a coupled cost function in steady-state subject to equality constraints. We propose a novel control methodology to solve the problem that captures much the past work done on Nash equilibrium seeking for dynamic agents. We show that using our methodology, the problem reduces to one of designing a set of decentralized stabilizing controllers. Example controller designs are provided for two cases.
|
|
10:40-11:00, Paper WeAT15.3 | Add to My Program |
Priority Based Synchronization for Faster Learning in Games |
|
Koochakzadeh, Abbasali | University of Minnesota |
Yazicioglu, Yasin | University of Minnesota |
Keywords: Game theory, Cooperative control, Decentralized control
Abstract: Learning in games has been widely used to solve many cooperative multi-agent problems such as coverage control, consensus, self-reconfiguration or vehicle-target assignment. One standard approach in this domain is to formulate the problem as a potential game and to use an algorithm such as log-linear learning to achieve the stochastic stability of globally optimal configurations. Standard versions of such learning algorithms are asynchronous, i.e., only one agent updates its action at each round of the learning process. To enable faster learning, we propose a synchronization strategy based on decentralized random prioritization of agents, which allows multiple agents to change their actions simultaneously when they do not affect each other's utility or feasible actions. We show that the proposed approach can be integrated into any standard asynchronous learning algorithm to improve the convergence speed while maintaining the limiting behavior (e.g., stochastically stable configurations). We support our theoretical results with simulations in a coverage control scenario.
|
|
11:00-11:20, Paper WeAT15.4 | Add to My Program |
FlipDyn: A Game of Resource Takeovers in Dynamical Systems |
|
Banik, Sandeep | Michigan State University |
Bopardikar, Shaunak D. | Michigan State University |
Keywords: Game theory, Cyber-Physical Security, Resilient Control Systems
Abstract: We introduce a game in which two players with opposing objectives seek to repeatedly takeover a common resource. The resource is modeled as a discrete time dynamical system over which a player can gain control after spending a state-dependent amount of energy at each time step. We use a FlipIT-inspired deterministic model that governs which player is in control at every time step. A player's policy is the probability with which it should spend energy to gain control of the resource at a given time step. Our main results are three-fold. First, we present analytic expressions for the cost-to-go as a function of the hybrid state of the system, i.e., the physical state of the dynamical system and the binary FlipDyn state for any general system with arbitrary costs. These expressions are exact when the physical state is also discrete and has finite cardinality. Second, for a continuous physical state with linear dynamics and quadratic costs, we derive expressions for Nash equilibrium (NE). For scalar physical states, we show that the NE depends only on the parameters of the value function and costs, and is independent of the state. Third, we derive an approximate value function for higher dimensional linear systems with quadratic costs. Finally, we illustrate our results through a numerical study on the problem of controlling a linear system in a given environment in the presence of an adversary.
|
|
11:20-11:40, Paper WeAT15.5 | Add to My Program |
On the Role of Social Identity in the Market for (Mis)information |
|
Hebbar, Vijeth | University of Illinois Urbana-Champaign |
Langbort, Cedric | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Decentralized control, Control over communications
Abstract: Motivated by recent works in the communication and psychology literature, we model and study the role social identity -- a person's sense of belonging to a group -- plays in human information consumption. A hallmark of Social Identity Theory (SIT) is the notion of `status', i.e., an individual's desire to enhance their and their `in-group's' utility textit{relative} to that of an `out-group'. In the context of belief formation, this comes off as a desire to believe positive news about the in-group and negative news about the out-group, which has been empirically shown to support belief in misinformation and false news. We model this phenomenon as a Stackelberg game being played over an information channel between a news-source (sender) and news-consumer (receiver), with the receiver incorporating the `status' associated with social identity in their utility, in addition to accuracy. We characterize the strategy that must be employed by the sender to ensure that its message is trusted by receivers of all identities while maximizing accuracy of information. We show that, as a rule, this optimal quality of information at equilibrium decreases when a receiver's sense of identity increases. Our work supports the perspective that identity based reasoning among receivers is a motivating factor in encouraging media slant. We further demonstrate how extensions of our model can be used to quantitatively estimate the level of importance given to identity in a population.
|
|
11:40-12:00, Paper WeAT15.6 | Add to My Program |
Convergence Bounds of Decentralized Fictitious Play Around a Single Nash Equilibrium in Near-Potential Games |
|
Aydin, Sarper | Texas A&M University |
Arefizadeh, Sina | Arizona State University |
Eksin, Ceyhun | Texas A&M University |
Keywords: Game theory, Distributed control, Networked control systems
Abstract: We analyze convergence of decentralized fictitious play (DFP) in near-potential games, where agents participate in a game in which the change in utility functions are closely aligned with a potential function. In DFP, agents take actions that maximize their expected utilities computed based on local estimates of empirical frequencies of other agents. These local estimates are updated by averaging estimates received from neighbors in a time-varying communication network. Given near-potential games with finitely many Nash equilibria that are distant enough from each other, we show that the empirical frequencies converge near a single Nash Equilibrium. This result establishes that DFP maintains the properties of standard fictitious play (FP) in near-potential games.
|
|
WeAT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Output Regulation |
|
|
Chair: Simpson-Porco, John W. | University of Toronto |
Co-Chair: Borri, Alessandro | CNR-IASI |
|
10:00-10:20, Paper WeAT16.1 | Add to My Program |
An Internal Model for Tracking a Sinusoidal Reference with Unknown Frequency and Uncertain Plant |
|
Lorenzetti, Pietro | Tel Aviv University |
Reißner, Florian Andreas | Tel Aviv University |
Weiss, George | Tel Aviv University |
Keywords: Output regulation, Adaptive control, Power systems
Abstract: We propose a novel internal model-based controller to solve the reference tracking problem for an uncertain plant and a sinusoidal reference of unknown frequency. A third order self-synchronizing synchronverter model is used as internal model, leading to remarkable results. We only require that the plant mathbf{P} is stable and linear, and that an ``initial guess'' omega_n on the reference frequency omega_r is available. Under these assumptions, we ensure tracking for omega_rin[0.25omega_n,4omega_n]. Moreover, among other features, our controller is able to withstand (at steady-state) large step jumps in omega_r (approx 300% frequency jumps) and in the dynamics of mathbf{P} (e.g., mathbf{P} is allowed to change sign during operation). Extensive numerical experiments are provided to illustrate these impressive features.
|
|
10:20-10:40, Paper WeAT16.2 | Add to My Program |
Adaptive Nonlinear Regulation Via Gaussian Processes |
|
Gentilini, Lorenzo | Universitŕ Di Bologna |
Bin, Michelangelo | Imperial College London |
Marconi, Lorenzo | Univ. Di Bologna |
Keywords: Output regulation, Robust adaptive control, Flight control
Abstract: The paper deals with the problem of adaptive output regulation of nonlinear systems. Building on the design technique recently proposed in [1], we present a new approach where the regulator’s internal model is adapted online via Gaussian Processes. Adaptation is performed by following an "event-triggered" logic and hybrid tools are used to analyse the resulting closed-loop system. Unlike the approach of [1], where the steady-state control policy is supposed to belong to a specific finite-dimensional model set, here we only require smoothness. The paper also presents numerical simulations showing how the proposed method outperforms previous approaches.
|
|
10:40-11:00, Paper WeAT16.3 | Add to My Program |
Adaptive Dynamic Programming and Data-Driven Cooperative Optimal Output Regulation with Adaptive Observers |
|
Qasem, Omar | Florida Institute of Technology |
Jebari, Khalid | Florida Institute of Technology |
Gao, Weinan | Florida Institute of Technology |
Keywords: Optimal control, Cooperative control, Output regulation
Abstract: In this paper, a novel adaptive optimal control strategy is proposed to achieve the cooperative optimal output regulation of continuous-time linear multi-agent systems based on adaptive dynamic programming (ADP). The proposed method is different from those in the existing literature of ADP and cooperative output regulation in the sense that the knowledge of the exosystem dynamics is not required in the design of the exostate observers for those agents with no direct access to the exosystem. Moreover, an optimal control policy is obtained without the prior knowledge of the modeling information of any agent while achieving the cooperative output regulation. Instead, we use the state/input information along the trajectories of the underlying dynamical systems and the estimated exostates to learn the optimal control policy. Simulation results show the efficacy of the proposed algorithm, where both estimation errors of exosystem matrix and exostates, and the tracking errors converge to zero in an optimal sense, which solves the cooperative optimal output regulation problem.
|
|
11:00-11:20, Paper WeAT16.4 | Add to My Program |
Unknown Input Observer-Based Output Regulation for Uncertain Minimum Phase Linear Systems Affected by a Periodic Disturbance |
|
Wang, Yang | Shanghai Technology Unversity |
Gong, Yizhou | ShanghaiTech University |
Pin, Gilberto | Electrolux |
Zhu, Fanglai | Tongji University |
Serrani, Andrea | The Ohio State University |
Parisini, Thomas | Imperial College & Univ. of Trieste |
Keywords: Output regulation, Uncertain systems
Abstract: This paper deals with the problem of disturbance rejection for uncertain LTI SISO systems perturbed by an emph{unmeasurable} external disturbance under the framework of output regulation. The system is assumed to be minimum phase and internally stable, but the model parameters are completely unknown. In addition, no knowledge of the external disturbance, including frequency, amplitude and phase is required to be known in advance. A novel high-order sliding mode-based Unknown Input Observer(UIO) is developed to stabilize the system and reconstruct the external disturbance. The main feature distinguishing the proposed method from the existing ones is that we do not need to integrate a frequency estimator into the adaptive controller or update the frequency estimation in a hybrid manner. Instead, the disturbance is directly duplicated by the aforementioned unknown input observer. The boundedness of states and asymptotic convergence properties are rigorously proved. Finally, the effectiveness of the proposed technique is illustrated by a numerical example.
|
|
11:20-11:40, Paper WeAT16.5 | Add to My Program |
Low-Gain Stabilizers for Linear-Convex Optimal Steady-State Control |
|
Simpson-Porco, John W. | University of Toronto |
Keywords: Output regulation, Optimization algorithms, Constrained control
Abstract: We consider the problem of designing a feedback controller which robustly regulates an LTI system to an optimal operating point in the presence of unmeasured disturbances. A general design framework based on so-called optimality models was previously put forward for this class of problems, effectively reducing the problem to that of stabilization of an associated nonlinear plant. This paper presents several simple and fully constructive stabilizer designs to accompany the optimality model designs from [1]. The designs are based on a low-gain integral control approach, which enforces time-scale separation between the exponentially stable plant and the controller. We provide explicit formulas for controllers and gains, along with LMI-based methods for the computation of robust/optimal gains. The results are illustrated via an academic example and an application to power system frequency control.
|
|
11:40-12:00, Paper WeAT16.6 | Add to My Program |
Prioritized Control |
|
An, Sang-ik | University of Seoul |
Park, Gyunghoon | University of Seoul |
Lee, Dongheui | Technische Universität Wien (TU Wien) |
Keywords: Feedback linearization, Output regulation, Optimization
Abstract: We discuss a control strategy to handle singularity of input gain matrix in the input-output linearization of a dynamical system with multiple outputs. The key idea is to prioritize the outputs and designate available control inputs to the outputs in the order of priority, by which tracking control for higher priority outputs is possibly maintained even in the presence of the singularity. This strategy of prioritization (called the prioritized control) allows to successfully deal with critical control objectives (such as stability and/or collision avoidance) by assigning higher priority to the relevant outputs. In this paper, we first explain how the prioritization can be applied to the process of input-output linearization and provide a definition of the prioritized control law on the framework of the multi-objective optimization with the lexicographical ordering. Then, a simple case study is performed for a two-link manipulator along with simulation results, where we further discuss an intrinsic degenerate property of prioritization that the prioritized control law can be discontinuous.
|
|
WeAT17 Regular Session, Acapulco |
Add to My Program |
Healthcare and Medical Systems |
|
|
Chair: Hahn, Jin-Oh | University of Maryland |
Co-Chair: Leth, John | Aalborg University |
|
10:00-10:20, Paper WeAT17.1 | Add to My Program |
Enabling Anticipatory Response in Multi-Stage MPC Formulation for Fully Automated Artificial Pancreas System |
|
Colmegna, Patricio Hernan | University of Virginia |
Diaz Castańeda, Jenny Lorena | Institut De Robňtica I Informŕtica Industrial, CSIC-UPC |
Garcia Tirado, Jose Fernando | University of Virginia |
Breton, Marc | University of Virginia |
Keywords: Healthcare and medical systems, Biomedical, Metabolic systems
Abstract: Controlling postprandial glucose excursions in type 1 diabetes (T1D) still remains a challenge for most automated insulin delivery (AID) algorithms. The main limiting factor has been the slow absorption of current insulin analogs that prevents the controller from responding aggressively enough to significant glucose excursions without increasing the risk for insulin stacking and, consequently, late hypoglycemia. Faster insulin analogs represent an appealing solution to this problem, because they would enable better alignment between meal and insulin rates of appearance. In this paper, we discuss about another way to overcome this fundamental performance limitation that consists in allowing the controller to properly anticipate likely disturbances through a robust model predictive control (MPC) strategy.
|
|
10:20-10:40, Paper WeAT17.2 | Add to My Program |
An Online Stochastic Optimization Approach for Insulin Intensification in Type 2 Diabetes with Attention to Pseudo-Hypoglycemia |
|
Al Ahdab, Mohamad | Aalborg University |
Knudsen, Torben | Aalborg University, Denmark |
Stoustrup, Jakob | Aalborg University |
Leth, John | Aalborg University |
Keywords: Healthcare and medical systems, Emerging control applications, Human-in-the-loop control
Abstract: In this paper, we present a model free approach to calculate long-acting insulin doses for Type 2 Diabetic (T2D) subjects in order to bring their blood glucose (BG) concentration to be within a safe range. The proposed strategy tunes the parameters of a proposed control law by using a zeroth-order online stochastic optimization approach for a defined cost function. The strategy uses gradient estimates obtained by a Recursive Least Square (RLS) scheme in an adaptive moment estimation based approach named AdaBelief. Additionally, we show how the proposed strategy with a feedback rating measurement can accommodate for a phenomena known as relative hypoglycemia or pseudo-hypoglycemia (PHG) in which subjects experience hypoglycemia symptoms depending on how quick their BG concentration is lowered. The performance of the insulin calculation strategy is demonstrated and compared with current insulin calculation strategies using simulations with three different models.
|
|
10:40-11:00, Paper WeAT17.3 | Add to My Program |
Estimating a Personalized Basal Insulin Dose from Short-Term Closed-Loop Data in Type 2 Diabetes |
|
Engell, Sarah Ellinor | Technical University of Denamrk |
Aradóttir, Tinna Björk | Technical University of Denmark |
Ritschel, Tobias K. S. | Technical University of Denmark |
Bengtsson, Henrik | Novo Nordisk A/S |
Jorgensen, John Bagterp | Technical University of Denmark |
Keywords: Healthcare and medical systems, Identification for control, Simulation
Abstract: In type 2 diabetes (T2D) treatment, finding a safe and effective basal insulin dose is a challenge. The dose-response is highly individual and to ensure safety, people with T2D “titrate” by slowly increasing the daily insulin dose to meet treatment targets. This titration can take months. To ease and accelerate the process, we use short-term artificial pancreas (AP) treatment tailored for initial titration and apply it as a diagnostic tool. Specifically, we present a method to automatically estimate a personalized daily dose of basal insulin from closed-loop data collected with an AP. Based on AP-data from a stochastic simulation model, we employ the continuous-discrete extended Kalman filter and a maximum likelihood approach to estimate parameters in a simple dose-response model for 100 virtual people. With the identified model, we compute a daily dose of basal insulin to meet treatment targets for each individual. We test the personalized dose and evaluate the treatment outcomes against clinical reference values. In the tested simulation setup, the proposed method is feasible. However, more extensive tests will reveal whether it can be deemed safe for clinical implementation.
|
|
11:00-11:20, Paper WeAT17.4 | Add to My Program |
A Kalman Filter-Based Hybrid Model Predictive Control Algorithm for Mixed Logical Dynamical Systems: Application to Optimized Interventions for Physical Activity |
|
Khan, Owais | Arizona State University |
El Mistiri, Mohamed | Arizona State University |
Rivera, Daniel E. | Arizona State Univ |
Martin, Cesar A. | Escuela Superior Politecnica Del Litoral (ESPOL) |
Hekler, Eric | UC San Diego |
Keywords: Healthcare and medical systems, Predictive control for linear systems, Process Control
Abstract: Hybrid Model Predictive Control (HMPC) is presented as a decision-making tool for novel behavioral interventions to increase physical activity in sedentary adults, such as Just Walk. A broad-based HMPC formulation for mixed logical dynamical (MLD) systems relevant to problems in behavioral medicine is developed and illustrated on a representative participant model arising from the Just Walk study. The MLD model is developed based on the requirement of granting points for meeting daily step goals and categorical input variables. The algorithm features three degrees-of-freedom tuning for setpoint tracking, measured and unmeasured disturbance rejection that facilitates controller robustness; disturbance anticipation further improves performance for upcoming events such as weekends and weather forecasts. To avoid the corresponding mixed-integer quadratic problem (MIQP) from becoming infeasible, slack variables are introduced in the objective function. Simulation results indicate that the proposed HMPC scheme effectively manages hybrid dynamics, setpoint tracking, disturbance rejection, and the transition between the two phases of the intervention (initiation and maintenance) and is suitable for evaluation in clinical trials.
|
|
11:20-11:40, Paper WeAT17.5 | Add to My Program |
IMU-Based Transparency Control of Exoskeletons Driven by Series Elastic Actuator |
|
dos Santos, Leonardo Felipe | University of Săo Paulo |
Escalante, Felix M. | University of Săo Paulo |
Siqueira, Adriano A G | Univ. of Sao Paulo |
Boaventura, Thiago | University of Săo Paulo |
Keywords: Human-in-the-loop control, Healthcare and medical systems, Robotics
Abstract: Wearable devices such as rehabilitation robots, power augmentation exoskeletons, and haptic manipulators feature operation modes that need to mirror the user's action as naturally as possible. By tracking such motions, the human-robot physical interaction is minimized, or, in other words, the device displays effusive mechanical transparency. The present work applies a transparency control framework based on inertial measurement units fixed on the user to a series elastic exoskeleton joint. This approach guarantees the robot's transparent behavior without explicitly relying on force or human-robot interaction sensors. Instead, we used a state estimator to provide the human-related quantities needed for feedforward and feedback control. The proposed controller considers the dynamics of the series elastic actuator for two motor driver configurations: torque-based and velocity-based low-level control. Using a novel definition of the human-robot transparency impedance, we designed and experimentally evaluated the performance of both controllers.
|
|
11:40-12:00, Paper WeAT17.6 | Add to My Program |
Hemodynamic Monitoring Via Model-Based Extended Kalman Filtering: Hemorrhage Resuscitation and Sedation Case Study |
|
Yin, Weidi | University of Maryland |
Tivay, Ali | University of Maryland |
Hahn, Jin-Oh | University of Maryland |
Keywords: Healthcare and medical systems, Biomedical, Control applications
Abstract: This paper investigates the potential of model-based extended Kalman filtering (EKF) for hemodynamic monitoring in a hemorrhage resuscitation-sedation case study. To the best of our knowledge, it may be the first model-based state estimation study conducted in the context of hemodynamic monitoring. Built upon a grey-box mathematical model with parametric uncertainty as process noise, the EKF can estimate cardiac output (CO) and total peripheral resistance (TPR) continuously from mean arterial pressure (AP) measurements against inter-individual physiological and pharmacological variability. Its unique practical strengths include: it does not require AP waveform as in existing AP-based pulse-contour CO (PCCO) monitors; and it can estimate CO and TPR with explicit account for the effect of sedative drugs. The efficacy of the EKF-based hemodynamic monitoring was evaluated based on a large number of plausible virtual patients generated using a collective inference algorithm, which demonstrated that it has significant advantage over open-loop pure prediction, and that its accuracy is comparable to PCCO.
|
|
WeBT01 Regular Session, Tulum Ballroom A |
Add to My Program |
Hybrid Systems II |
|
|
Chair: Zahn, Frederik | Karlsruhe Institute of Technology |
Co-Chair: Zamani, Majid | University of Colorado Boulder |
|
13:30-13:50, Paper WeBT01.1 | Add to My Program |
Data-Driven Synthesis of Symbolic Abstractions with Guaranteed Confidence |
|
Lavaei, Abolfazl | Newcastle University |
Frazzoli, Emilio | ETH Zürich |
Keywords: Hybrid systems, Sampled-data control, Discrete event systems
Abstract: In this work, we propose a data-driven approach for the construction of finite abstractions (a.k.a., symbolic models) for discrete-time deterministic control systems with unknown dynamics. We leverage notions of so-called alternating bisimulation functions (ABF), as a relation between each unknown system and its symbolic model, to quantify the mismatch between state behaviors of two systems. Accordingly, one can employ our proposed results to perform formal verification and synthesis over symbolic models and then carry the results back over unknown original systems. In our data-driven setting, we first cast the required conditions for constructing ABF as a robust optimization program (ROP). Solving the provided ROP is not tractable due to the existence of unknown models in the constraints of ROP. To tackle this difficulty, we collect finite numbers of data from trajectories of unknown systems and propose a scenario optimization program (SOP) corresponding to the original ROP. By establishing a probabilistic relation between optimal values of SOP and ROP, we formally construct ABF between unknown systems and their symbolic models based on the number of data and a required confidence level. We verify the effectiveness of our data-driven results over two physical case studies with unknown models including (i) a DC motor and (ii) a nonlinear jet engine compressor. We construct symbolic models from data as appropriate substitutes of original systems and synthesize policies maintaining states of unknown systems in a safe set within infinite time horizons with some guaranteed confidence levels.
|
|
13:50-14:10, Paper WeBT01.2 | Add to My Program |
Assessing the Combination of Differential Flatness and Deterministic Automata for Controllable Hybrid Systems |
|
Zahn, Frederik | Karlsruhe Institute of Technology |
Kleinert, Tobias | RWTH Aachen University |
Hagenmeyer, Veit | Karlsruhe Institute of Technology (KIT) |
Keywords: Hybrid systems, Modeling, Automata
Abstract: Hybrid automata are a powerful model class to analyze and control technical systems with nonlinear continuous dynamics and discrete-event dynamics. However, due to their complexity, controllability is generally hard to achieve in hybrid automata, although it is a desirable property of engineered systems. In the present paper, we assess a novel model class of controllable deterministic hybrid automata, which we call Flat Hybrid Automata. This model class is based on the concept of differential flatness for the nonlinear continuous dynamics and on the property of determinism and controllability for the discrete event dynamics. We review the formal definition and we give the respective conditions and a proof for its controllability. An electrical network example illustrates the methodology and shows its potential regarding modelling, control and system design.
|
|
14:10-14:30, Paper WeBT01.3 | Add to My Program |
How Do We Walk? Using Hybrid Holonomy to Approximate Non-Holonomic Systems |
|
Oprea, Maria | Cornell University |
Clark, William | Cornell University |
Keywords: Hybrid systems, Nonholonomic systems, Robotics
Abstract: Why do we move forward when we walk? Our legs undergo periodic motion and thus possess no net change in position; however, our bodies do possess a net change in position and we are propelled forward. From a geometric perspective, this phenomenon of periodic input producing non-periodic output is holonomy. To obtain non-zero holonomy and propel forward, we must alternate which leg is in contact with the ground; a non-zero net motion can be obtained by concatenating arcs that would individually produce no net motion. We develop a framework for computing the holonomy group of hybrid systems and analyze their behavior in the limit as the number of impacts goes to infinity.
|
|
14:30-14:50, Paper WeBT01.4 | Add to My Program |
A Rapidly-Exploring Random Trees Motion Planning Algorithm for Hybrid Dynamical Systems |
|
Wang, Nan | University of California, Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Hybrid systems
Abstract: This paper proposes a rapidly-exploring random trees (RRT) algorithm to solve the motion planning problem for hybrid systems. At each iteration, the proposed algorithm, called HyRRT, randomly picks a state sample and extends the search tree by flow or jump, which is also chosen randomly when both regimes are possible. Through a definition of concatenation of functions defined on hybrid time domains, we show that HyRRT is probabilistically complete, namely, the probability of failing to find a motion plan approaches zero as the number of iterations of the algorithm increases. This property is guaranteed under mild conditions on the data defining the motion plan, which include a relaxation of the usual positive clearance assumption imposed in the literature of classical systems. The motion plan is computed through the solution of two optimization problems, one associated with the flow and the other with the jumps of the system. The proposed algorithm is applied to a walking robot so as to highlight its generality and computational features.
|
|
14:50-15:10, Paper WeBT01.5 | Add to My Program |
Constructing MDP Abstractions Using Data with Formal Guarantees |
|
Lavaei, Abolfazl | Newcastle University |
Soudjani, Sadegh | Newcastle University |
Frazzoli, Emilio | ETH Zürich |
Zamani, Majid | University of Colorado Boulder |
Keywords: Hybrid systems, Stochastic systems, Sampled-data control
Abstract: This paper is concerned with a data-driven technique for constructing finite Markov decision processes (MDPs) as finite abstractions of discrete-time stochastic control systems with unknown dynamics while providing formal closeness guarantees. The proposed scheme is based on notions of stochastic bisimulation functions (SBF) to capture the probabilistic distance between state trajectories of an unknown stochastic system and those of finite MDP. In our proposed setting, we first reformulate corresponding conditions of SBF as a robust convex program (RCP). We then propose a scenario convex program (SCP) associated to the original RCP by collecting a finite number of data from trajectories of the system. We ultimately construct an SBF between the data-driven finite MDP and the unknown stochastic system with a given confidence level by establishing a probabilistic relation between optimal values of the SCP and the RCP. We also propose two different approaches for the construction of finite MDPs from data. We illustrate the efficacy of our results over a nonlinear jet engine compressor with unknown dynamics. We construct a data-driven finite MDP as a suitable substitute of the original system to synthesize controllers maintaining the system in a safe set with some probability of satisfaction and a desirable confidence level.
|
|
15:10-15:30, Paper WeBT01.6 | Add to My Program |
Estimation of Infinitesimal Generators for Unknown Stochastic Hybrid Systems Via Sampling: A Formal Approach |
|
Nejati, Ameneh | Technical University of Munich (TUM) |
Lavaei, Abolfazl | Newcastle University |
Soudjani, Sadegh | Newcastle University |
Zamani, Majid | University of Colorado Boulder |
Keywords: Hybrid systems, Stochastic systems, Sampled-data control
Abstract: In this work, we develop a data-driven framework with formal confidence bounds for the estimation of infinitesimal generators for continuous-time stochastic hybrid systems with unknown dynamics. The proposed approximation scheme employs both time discretization and sampling from the solution process, and estimates the infinitesimal generator of the solution process via a set of data collected from trajectories of systems. We assume some mild continuity assumptions on the dynamics of the system and quantify the closeness between the infinitesimal generator and its approximation while ensuring an a-priori guaranteed confidence bound. To provide a reasonable closeness precision, we discuss significant roles of both time discretization and number of data in our approximation scheme. In particular, for a fixed number of data, variance of the estimation converges to infinity when the time discretization goes to zero. The proposed approximation framework guides us how to jointly select a suitable data size and a time discretization parameter to cope with this counter-intuitive behavior. We demonstrate the effectiveness of our proposed results by applying them to a nonlinear jet engine compressor with unknown dynamics.
|
|
WeBT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Adaptive Systems |
|
|
Chair: Landau, Ioan Dore | CNRS GIPSA-LAB |
Co-Chair: Dogan, Kadriye Merve | Embry-Riddle Aeronautical University |
|
13:30-13:50, Paper WeBT02.1 | Add to My Program |
Recursive Averaging with Application to Bio-Inspired 3D Source Seeking |
|
Abdelgalil, Mahmoud | University of California, Irvine |
Taha, Haithem | University of California, Irvine |
Keywords: Adaptive systems, Biologically-inspired methods, Time-varying systems
Abstract: We analyze a class of high-frequency, high-amplitude oscillatory systems in which periodicity occurs on two distinct time scales and establish the convergence of its trajectories to a suitably averaged system by recursively applying the averaging theorem. Moreover, we introduce a novel bio-inspired 3D source seeking algorithm for rigid bodies with a collocated sensor and prove its practical stability under typical assumptions on the source signal strength field by combining our averaging results with singular perturbation.
|
|
13:50-14:10, Paper WeBT02.2 | Add to My Program |
Does a General Structure Exist for Adaptation/Learning Algorithms? |
|
Landau, Ioan Dore | CNRS GIPSA-LAB |
Airimitoaie, Tudor-Bogdan | University of Bordeaux |
Keywords: Adaptive systems, Estimation, Learning
Abstract: There are many parameter adaptation/learning algorithms (PALA) used in adaptive control, system identification and neural networks (Nesterov, Conjugate gradients, Momentum back propagation, Averaged gradient, Integral+ proportional+derivative, ...). For most of these algorithms unfortunately there are no results available for the choice of the various coefficients (weights) allowing to guarantee the stability of the estimator for any value of the learning rate and for any initial conditions. All these algorithms are in fact particular cases of a general structure for PALA which is introduced in this paper. This structure is characterized by the presence of an embedded ARMA (poles-zeros) filter. Taking into account the inherent feedback structure of these adaptation/learning algorithms, passivity approach is used for addressing the stability issue. Conditions which will assure the stability of this general structure will be provided and then particularized for the specific algorithms described in the paper. The impact of the MA and AR terms of the embedded filter upon the performance of the algorithms will be emphasized through simulation.
|
|
14:10-14:30, Paper WeBT02.3 | Add to My Program |
A Projection Operator-Based Discrete-Time Adaptive Architecture for Control of Uncertain Dynamical Systems with Actuator Dynamics |
|
Dogan, Kadriye Merve | Embry-Riddle Aeronautical University |
Kurttisi, Atahan | Embry-Riddle Aeronautical University |
Yucelen, Tansel | University of South Florida |
Koru, Ahmet Taha | University of Texas at Arlington |
Keywords: Lyapunov methods, Stability of linear systems, Adaptive systems
Abstract: Stability analyses of discrete-time adaptive control algorithms are generally predicated on quadratic Lyapunov-based frameworks that result in unavoidable complexity due to the resulting terms in the Lyapunov difference equations. This prevents generalizations of valuable continuous-time adaptive control results to their discrete-time settings. To this end, one important generalization is the consideration of actuator dynamics, which is present in any uncertain dynamical system. To address this problem, we propose a novel discrete-time adaptive control architecture predicated on the hedging method and a new projection operator. A logarithmic Lyapunov function is used for proving the asymptotic stability of the error between uncertain dynamical system states and hedging-based reference model states. An illustrative numerical example is then presented to demonstrate the efficacy of the proposed architecture.
|
|
14:30-14:50, Paper WeBT02.4 | Add to My Program |
Two Simple Model-Free Extremum Seeking Strategies with Convergence Acceleration through Hessian Estimation |
|
Dewasme, Laurent | Université De Mons |
Vande Wouwer, Alain | Université De Mons |
Keywords: Direct adaptive control, Biological systems, Optimization
Abstract: Recently, several Newton-based extremum seeking strategies have been proposed, which provide fast convergence. In this study, we focus attention on perturbation-based proportional-integral extremum seeking (PIES) and block-oriented model recursive least squares extremum seeking (BOM-RLSES), and we discuss stability and convergence, highlighting the impact of the quality of the Hessian estimation. A bioprocess application is used to compare the performance of PI and RLS Newton-based strategies with respect to the blockoriented RLSES. The results confirm the faster convergence of the Newton-based formulations for an input range contained in the region of attraction of the extremum.
|
|
14:50-15:10, Paper WeBT02.5 | Add to My Program |
Direct Data-Driven Stabilization of Nonlinear Affine Systems Via the Koopman Operator |
|
Fu, Xingyun | Tsinghua University |
You, Keyou | Tsinghua University |
Keywords: Direct adaptive control, Stability of nonlinear systems
Abstract: In this work, we are concerned with the design of direct data-driven controllers for nonlinear affine systems without explicit dynamical models. To this end, we adopt the Koopman operator to approximately reformulate the nonlinear systems into bilinear forms, based on which we propose a “simple” static controller in the linear form of the preset function dictionary. Then, we show how to obtain the feedback gain matrix and establish the stability condition of the closed- loop system using only the off-line collected data. Finally, numerical results demonstrate the effectiveness and robustness of this data-driven controller.
|
|
15:10-15:30, Paper WeBT02.6 | Add to My Program |
Parameter-Dependent Input Normalization: Direct-Adaptive Control with Uncertain Control Direction |
|
Pin, Gilberto | Electrolux |
Serrani, Andrea | The Ohio State University |
Wang, Yang | Shanghai Technology Unversity |
Keywords: Direct adaptive control, Uncertain systems
Abstract: In this work, we propose a direct-adaptive MRAC for relative-degree-unity SISO systems with unknown control direction. The proposed scheme, employing an original construction of the control law and the use of an adaptive observer, achieves the long-searched objective of injecting, through the input, the unmeasurable derivative of the output error. The output derivative injection is performed by a smart construction of the control input that features a Parameter-dependent Input Normalization (PIN). The PIN scheme does not make use of Nussbaum functions usually invoked in the direct-adaptive setting, does not require persistence of excitation of indirect adaptive schemes does not require switching between multiple models does not suffer from singularities, and does not require to know a-priori bounds on the norm of the high-frequency gain and on the parameters. Effectiveness of the algorithm is illustrated by a numerical example.
|
|
WeBT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Robotics V |
|
|
Chair: Baldi, Simone | Southeast University |
Co-Chair: Fagiolini, Adriano | University of Palermo |
|
13:30-13:50, Paper WeBT03.1 | Add to My Program |
Flow Sensing-Based Underwater Target Detection Using Distributed Mobile Sensors |
|
Wang, Jun | Peking University |
Shen, Tongsheng | National Innovation Institute of Defense Technology |
Zhao, Dexin | National Innovation Institute of Defense Technology |
Zhang, Feitian | Peking University |
Keywords: Robotics, Estimation, Biologically-inspired methods
Abstract: This paper presents a novel bioinspired underwater target detection method with distributed mobile sensors. Inspired by the fish's lateral line system, distributed mobile sensors are proposed to measure their respective local flow pressure and collectively estimate the parameters (e.g., position and dimension) of underwater targets with dynamic sensor trajectory planning. This paper formulates an optimal trajectory planning problem for a group of mobile sensors using information entropy as the cost function, with an aim to maximize the confidence level in the estimated target parameters. This paper in particular investigates the detection of a circular target in a uniform flow. A Bayesian filter is adopted to assimilate the distributed pressure measurements dynamically to estimate the position and radius of given circular targets. To verify the effectiveness of the proposed method, simulation is conducted with both fixed and moving circular targets in uniform flow. Simulation results are presented and analyzed.
|
|
13:50-14:10, Paper WeBT03.2 | Add to My Program |
Tracking Control for Motion Constrained Robotic System Via Dynamic Event-Sampled Intelligent Learning Method |
|
Peng, Zhinan | University of Electronic Science and Technology of China |
Hu, Jiangping | University of Electronic Science and Technology of China |
Cheng, Hong | University of Electronic Science and Technology of China |
Huang, Rui | University of Electronic Science and Technology of China |
Luo, Rui | University of Electronic Science and Technology of China |
Zhao, Pengbo | Northwestern University |
Ghosh, Bijoy | Texas Tech University |
Keywords: Robotics, Adaptive control, Learning
Abstract: This paper introduces a new event-sampled intelligent learning method to solve tracking control for motion constrained robotic system. In many applications, motion constraints on joint movements of a robot are needed for its safe operation or to accommodate limitations arising from its mechanical structures. To handle the control design problem, a state-transformation method is employed to guarantee motion constraints of the robot and obtain a corresponding unconstrained tracking error system. Next, the constrained control problem is transformed into a general optimal tracking control problem for the unconstrained error system. Finally, we implement neural networks based learning structures to obtain the optimal solution. An important highlight of our proposed algorithm is that we have integrated a dynamic event-sampled approach to the learning-based controller design, thus reducing the system state sampling times. Using the proposed event-sampled learning control method, the stability of the closed-loop system and the convergence of the weights of neural networks are both established. Lastly, the effectiveness of the proposed intelligent control method is verified via simulation.
|
|
14:10-14:30, Paper WeBT03.3 | Add to My Program |
Analytic Estimation of Region of Attraction of an LQR Controller for Torque Limited Simple Pendulum |
|
Gross, Lukas | DFKI |
Maywald, Lasse | Deutsches Forschungszentrum Für Künstliche Intelligenz |
Kumar, Shivesh | German Research Center for Artificial Intelligence (DFKI GmbH) |
Kirchner, Frank | Robotics Innovation Center, DFKI and Department of Mathematics |
Lüth, Christoph | Deutsches Forschungszentrum Für Künstliche Intelligenz (DFKI) |
Keywords: Stability of nonlinear systems, Robotics, Formal Verification/Synthesis
Abstract: Linear-quadratic regulators (LQR) are a well known and widely used tool in control theory for both linear and nonlinear dynamics. For nonlinear problems, an LQR-based controller is usually only locally viable, thus, raising the problem of estimating the region of attraction (ROA). The need for good ROA estimations becomes especially pressing for underactuated systems, as a failure of controls might lead to unsafe and unrecoverable system states. Known approaches based on optimization or sampling, while working well, might be too slow in time critical applications and are hard to verify formally. In this work, we propose a novel approach to estimate the ROA based on the analytic solutions to linear ODEs for the torque limited simple pendulum. In simulation and physical experiments, we compared our approach to a Lyapunov-sampling baseline approach and found that our approach was faster to compute, while yielding ROA estimations of similar phase space area.
|
|
14:30-14:50, Paper WeBT03.4 | Add to My Program |
PDE-Based Dynamic Control and Estimation of Soft Robotic Arms |
|
Zheng, Tongjia | University of Notre Dame |
Lin, Hai | University of Notre Dame |
Keywords: Distributed parameter systems, Robotics, Filtering
Abstract: Compared with traditional rigid-body robots, soft robots not only exhibit unprecedented adaptation and flexibility but also present novel challenges in their modeling and control because of their infinite degrees of freedom. Most of the existing approaches have mainly relied on approximated models so that the well-developed finite-dimensional control theory can be exploited. However, this may bring in modeling uncertainty and performance degradation. Hence, we propose to exploit infinite-dimensional analysis for soft robotic systems. Our control design is based on the increasingly adopted Cosserat rod model, which describes the kinematics and dynamics of soft robotic arms using nonlinear partial differential equations (PDE). We design infinite-dimensional state feedback control laws for the Cosserat PDE model to achieve trajectory tracking (consisting of position, rotation, linear and angular velocities) and prove their uniform tracking convergence. We also design an infinite-dimensional extended Kalman filter on Lie groups for the PDE system to estimate all the state variables (including position, rotation, strains, curvature, linear and angular velocities) using only position measurements. The proposed algorithms are evaluated using simulations.
|
|
14:50-15:10, Paper WeBT03.5 | Add to My Program |
Adaptive Single-Stage Control for Uncertain Nonholonomic Euler-Lagrange Systems |
|
Tao, Tian | Delft University of Technology |
Roy, Spandan | IIIT HYDERABAD |
Baldi, Simone | Southeast University |
Keywords: Nonholonomic systems, Adaptive control, Robotics
Abstract: This work introduces a new single-stage adaptive controller for Euler-Lagrange systems with nonholonomic constraints. The proposed mechanism provides a simpler design philosophy compared to double-stage mechanisms (that address kinematics and dynamics in two steps), while achieving analogous stability properties, i.e. stability of both original and internal states. Meanwhile, we do not require direct access to the internal states as required in state-of-the-art single-stage mechanisms. The proposed approach is studied via Lyapunov analysis, validated numerically on wheeled mobile robot dynamics and compared to a standard double-stage approach.
|
|
15:10-15:30, Paper WeBT03.6 | Add to My Program |
A Rigid Body Observer (BObs) Considering Pfaffian Constraints with a Pose Regulation Framework |
|
Rothammer, Michael | Technical University of Munich |
Coelho, Andre | Dextrous Robotics Inc |
Mishra, Hrishik | German Aerospace Center (DLR) |
Ott, Christian | TU Wien |
Franchi, Antonio | University of Twente |
Albu-Schaeffer, Alin | German Aerospace Center (DLR) |
Keywords: Observers for nonlinear systems, Stability of nonlinear systems, Robotics
Abstract: Pfaffian (velocity) constraints are encountered commonly in mechanical systems. In many cases, the measurements for feedback control are obtained from an exteroceptive sensor system, which is not only of low rate, but also suffers from physical discontinuities. This negatively affects controller performance and places severe limitations on the choice of control parameters. To this end, a novel framework comprised of a rigid Body Observer (BObs) and a pose regulator is proposed. During the inter-sampling periods, the observer propagates the state based on an internal model to provide continuous estimates, which are exploited by the pose regulator to stabilize equilibria. We prove uniform asymptotic stability for the closed loop. Furthermore, we validate the proposed framework through simulation and also the BObs experimentally.
|
|
WeBT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Neural Networks II |
|
|
Chair: Lahijanian, Morteza | University of Colorado Boulder |
Co-Chair: Sojoudi, Somayeh | UC Berkeley |
|
13:30-13:50, Paper WeBT04.1 | Add to My Program |
Formal Control Synthesis for Stochastic Neural Network Dynamic Models |
|
Adams, Steven | TU Delft |
Lahijanian, Morteza | University of Colorado Boulder |
Laurenti, Luca | TU Delft |
Keywords: Neural networks, Markov processes, Switched systems
Abstract: Neural networks (NNs) are emerging as powerful tools to represent the dynamics of control systems with complicated physics or black-box components. Due to complexity of NNs, however, existing methods are unable to synthesize complex behaviors with guarantees for NN dynamic models (NNDMs). This work introduces a control synthesis framework for stochastic NNDMs with performance guarantees. The focus is on specifications expressed in linear temporal logic interpreted over finite traces (LTLf), and the approach is based on finite abstraction. Specifically, we leverage recent techniques for convex relaxation of NNs to formally abstract a NNDM into an interval Markov decision process (IMDP). Then, a strategy that maximizes the probability of satisfying a given specification is synthesized over the IMDP and mapped back to the underlying NNDM. We show that the process of abstracting NNDMs to IMDPs reduces to a set of convex optimization problems, hence guaranteeing efficiency. We also present an adaptive refinement procedure that makes the framework scalable. On several case studies, we illustrate the our framework is able to provide non-trivial guarantees of correctness for NNDMs with architectures of up to 5 hidden layers and hundreds of neurons per layer.
|
|
13:50-14:10, Paper WeBT04.2 | Add to My Program |
Using Euler’s Method to Prove the Convergence of Neural Networks |
|
Jerray, Jawher | University Telecom-Paris |
Saoud, Adnane | CentraleSupelec |
Fribourg, Laurent | CNRS |
Keywords: Neural networks, Optimization algorithms, Machine learning
Abstract: It was shown in the literature that, for a fully connected neural network (NN), the gradient descent algorithm converges to zero. Motivated by that work, we provide here general conditions under which we can derive the convergence of the gradient descent algorithm from the convergence of the gradient flow, in the case of NNs, in a systematic way. Our approach is based on an analysis of the error in Euler's method in the case of NNs, and relies on the concept of local strong convexity. Unlike existing approaches in the literature, our approach allows to provide convergence guarantees without making any assumptions on the number of hidden nodes of the NN or the number of training data points. A numerical example is proposed, showing the merits of our approach.
|
|
14:10-14:30, Paper WeBT04.3 | Add to My Program |
Neural Network Training under Semidefinite Constraints |
|
Pauli, Patricia | University of Stuttgart |
Funcke, Niklas | University of Stuttgart |
Gramlich, Dennis | RWTH Aachen |
Msalmi, Mohamed Amine | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Neural networks, Machine learning, LMIs
Abstract: This paper is concerned with the training of neural networks (NNs) under semidefinite constraints, which allows for NN training with robustness and stability guarantees. In particular, we focus on Lipschitz bounds for NNs. Exploiting the banded structure of the underlying matrix constraint, we set up an efficient and scalable training scheme for NN training problems of this kind based on interior point methods. Our implementation allows to enforce Lipschitz constraints in the training of large-scale deep NNs such as Wasserstein generative adversarial networks (WGANs) via semidefinite constraints. In numerical examples, we show the superiority of our method and its applicability to WGAN training.
|
|
14:30-14:50, Paper WeBT04.4 | Add to My Program |
Safety Verification of Neural Feedback Systems Based on Constrained Zonotopes |
|
Zhang, Yuhao | University of Wisconsin-Madison |
Xu, Xiangru | University of Wisconsin-Madison |
Keywords: Neural networks, Machine learning, Optimization
Abstract: Artificial neural networks have recently been utilized in many feedback control systems and introduced new challenges regarding the safety of such systems. This paper considers the safe verification problem for a dynamical system with a given feedforward neural network as the feedback controller by using a constrained zonotope-based approach. A novel set-based method is proposed to compute both exact and over-approximated reachable sets for neural feedback systems with linear models, and linear program-based sufficient conditions are presented to verify whether the trajectories of such a system can avoid unsafe regions represented as constrained zonotopes. The results are also extended to neural feedback systems with nonlinear models. The computational efficiency and accuracy of the proposed method are demonstrated by two numerical examples where a comparison with state-of-the-art methods is also provided.
|
|
14:50-15:10, Paper WeBT04.5 | Add to My Program |
Reachability Analysis of Neural Feedback Loops Using Sparse Polynomial Optimisation |
|
Newton, Matthew | University of Oxford |
Papachristodoulou, Antonis | University of Oxford |
Keywords: Neural networks, Nonlinear systems, Optimization
Abstract: Neural networks have seen a recent increased use in control feedback systems. However, providing robustness guarantees on these feedback systems has proven challenging and to combat these issues, there is a significant amount of current research. One of the biggest shortcoming of neural networks is how sensitive they are to adversarial inputs. Given that feedback systems are usually subject to external perturbations, this issue surrounding neural networks must be overcome before they can be used in safety-critical applications. One method to tackle this problem is to compute outer-approximations of the reachable sets, through bounding the activation functions in the neural network controller. Our approach is to use these bounds in a sparse polynomial optimisation framework in conjunction with the Positivstellensatz. The sparsity property is able to exploit the natural cascading structure of the neural network to allow for tractable solve times. The Positivstellensatz is able to provide more accurate bounds over similar methods by asserting the emptiness of a semi-algebraic set. We show through examples that our method can provide tighter bounds over similar methods, with reasonable computational time. Our approach is also able to deal with non-linear polynomial dynamics due to the polynomial optimisation framework we use, while other methods need to use alternative work-around solutions to incorporate the non-linearities.
|
|
15:10-15:30, Paper WeBT04.6 | Add to My Program |
A Sequential Greedy Approach for Training Implicit Deep Models |
|
Gautam, Tanmay | University of California, Berkeley |
Anderson, Brendon G. | University of California, Berkeley |
Sojoudi, Somayeh | UC Berkeley |
El Ghaoui, Laurent | Univ. of California at Berkeley |
Keywords: Neural networks, Optimization, Optimization algorithms
Abstract: Recent works in deep learning have demonstrated impressive performance using “implicit deep models,” wherein conventional architectures composed of forward-propagating, differentiable parametric layers are replaced by more expressive models composed of an implicitly defined fixed-point equation together with a prediction equation. Methods for training implicit deep models are currently restricted to end-to-end optimization, which relies on solving a matrix-variable fixed-point equation to compute the gradient and an expensive projection step at every iteration. In this work, we extend the idea of greedy layer-wise training, an approach found to yield state-of-the-art performance in conventional deep learning, to a sequential greedy training algorithm for implicit deep models with a strictly upper block triangular structure. We show that such implicit models can be regarded as generalized dense block modules of Dense Convolutional Networks (DenseNets), and thus inherit the underlying parameter efficiency property. For models trained with the Euclidean loss, we develop an alternating minimization subroutine for our sequential optimization algorithm, which consists of alternating between efficiently solvable least squares problems and single hidden-layer training problems. Furthermore, we theoretically prove that training a non-strictly upper triangular ReLU implicit model is equivalent to training a strictly upper block triangular one, allowing for the application of our algorithm to even more general models. Experiments on smooth and nonsmooth function interpolation, and on MNIST and Fashion-MNIST classification tasks, show that our algorithm consistently converges to models that out-perform state-of-the-art end-to-end implicit learning.
|
|
WeBT05 Invited Session, Tulum Ballroom E |
Add to My Program |
Learning-Based Control II: Analysis, Estimation and Control |
|
|
Chair: Müller, Matthias A. | Leibniz University Hannover |
Co-Chair: Solowjow, Friedrich | RWTH Aachen University |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
13:30-13:50, Paper WeBT05.1 | Add to My Program |
On a Continuous-Time Version of Willems' Lemma |
|
Lopez, Victor G. | Leibniz University Hannover |
Muller, Matthias A. | Leibniz University Hannover |
Keywords: Uncertain systems, Linear systems
Abstract: In this paper, a method to represent every input-output trajectory of a continuous-time linear system in terms of previously collected data is presented. This corresponds to a continuous-time version of the well-known Willems' lemma. The result is obtained by sampling the continuous signals at regular intervals, and constructing Hankel-like structures that closely resemble their discrete-time counterparts. Then, it is shown how to use measured persistently excited data to design a time-varying vector of parameters that allows the generation of arbitrary piecewise differentiable trajectories. A class of input signals that satisfies the conditions for persistence of excitation is also provided.
|
|
13:50-14:10, Paper WeBT05.2 | Add to My Program |
Neural System Level Synthesis: Learning Over All Stabilizing Policies for Nonlinear Systems |
|
Furieri, Luca | EPFL |
Galimberti, Clara Lucía | École Polytechnique Fédérale De Lausanne (EPFL) |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Optimal control, Neural networks
Abstract: We address the problem of designing stabilizing control policies for nonlinear systems in discrete-time, while minimizing an arbitrary cost function. When the system is linear and the cost is convex, the System Level Synthesis (SLS) approach offers an effective solution based on convex programming. Beyond this case, a globally optimal solution cannot be found in a tractable way, in general. In this paper, we develop a parametrization of all and only the control policies stabilizing a given time-varying nonlinear system in terms of the combined effect of 1) a strongly stabilizing base controller and 2) a stable SLS operator to be freely designed. Based on this result, we propose a Neural SLS (Neur-SLS) approach guaranteeing closed-loop stability during and after parameter optimization, without requiring any constraints to be satisfied. We exploit recent Deep Neural Network (DNN) models based on Recurrent Equilibrium Networks (RENs) to learn over a rich class of nonlinear stable operators, and demonstrate the effectiveness of the proposed approach in numerical examples.
|
|
14:10-14:30, Paper WeBT05.3 | Add to My Program |
Data-Driven Input Reconstruction and Experimental Validation |
|
Shi, Jicheng | École Polytechnique Fédérale De Lausanne |
Lian, Yingzhao | EPFL |
Jones, Colin N. | EPFL |
Keywords: Estimation, Lyapunov methods, Observers for Linear systems
Abstract: This paper proposes a data-driven input reconstruction method from outputs (IRO) based on the Willems' Fundamental Lemma. Given only output measurements, the unknown inputs estimated recursively by the IRO asymptotically converge to the true input without knowing the initial conditions. A recursive IRO and a moving-horizon IRO are developed based respectively on Lyapunov conditions and Luenberger-observer-type feedback, and their asymptotic convergence properties are studied. An experimental study is presented demonstrating the efficacy of the moving-horizon IRO for estimating the occupancy of a building on the EPFL campus via measured carbon dioxide levels.
|
|
14:30-14:50, Paper WeBT05.4 | Add to My Program |
Linear Observer Learning by Temporal Difference (I) |
|
Menchetti, Stefano | IMT School for Advanced Studies Lucca |
Zanon, Mario | IMT Institute for Advanced Studies Lucca |
Bemporad, Alberto | IMT School for Advanced Studies Lucca |
Keywords: Observers for Linear systems, Learning, Estimation
Abstract: This paper proposes a method for learning optimal state estimators from input/output data for linear discrete-time stochastic systems. We show that this problem can be expressed in the reinforcement learning framework, suitably adapted to the peculiar problem structure. In particular, we introduce the specific Bellman equation for the state estimation problem and use temporal differences to solve it. We show in simulations that the resulting data-driven method for state estimation converges to the optimal observer.
|
|
14:50-15:10, Paper WeBT05.5 | Add to My Program |
Data-Driven Analysis and Design Beyond Common Lyapunov Functions (I) |
|
van Waarde, Henk J. | University of Groningen |
Camlibel, M. Kanat | University of Groningen |
Trentelman, Harry L. | Univ. of Groningen |
Keywords: Identification for control, Robust control, LMIs
Abstract: In this paper we investigate data-based analysis and design problems without making the prevailing assumption that there exists a common Lyapunov function for all systems unfalsified by data. In particular, we provide necessary and sufficient conditions under which a given set of state data are informative for stability, in the sense that all systems explaining the data are stable. These conditions are derived by making use of the celebrated Kalman-Yakubovich-Popov lemma. We also explain the potential of extending these results to other analysis problems like data-based stabilizability, and to the design of stabilizing controllers. The results are applied to analyze the stability of a large-scale network, which highlights the tractability of the provided framework in comparison to previous conditions involving linear matrix inequalities.
|
|
15:10-15:30, Paper WeBT05.6 | Add to My Program |
Finite-Sample Guarantees for State-Space System Identification under Full State Measurements (I) |
|
Baggio, Giacomo | University of Padova, Italy |
Care', Algo | University of Brescia |
Pillonetto, Gianluigi | University of Padova |
Keywords: Identification, Identification for control, Statistical learning
Abstract: Complementing data-driven models of dynamic systems with certificates of reliability and safety is of critical importance in many applications, such as in the design of robust control policies for unknown or uncertain systems. In this paper, we propose an efficient method to construct finite-sample confidence regions for the parameters of unknown linear systems in state-space form. The proposed procedure builds on the Sign-Perturbed Sums (SPS) paradigm and returns regions that are provably exact, i.e., contain the true parameters with the desired probability, using finite data and under minimal assumptions on the noise distribution. In particular, the noise distribution is only required to be symmetric about the origin, thus encompassing as special cases commonly studied noise settings, such as zero-mean Gaussian or Laplacian noise. The performance of our procedure is tested across different noise scenarios and compared to that of alternative approaches.
|
|
WeBT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Identification for Control |
|
|
Chair: Formentin, Simone | Politecnico Di Milano |
Co-Chair: van Haren, Max | Eindhoven University of Technology |
|
13:30-13:50, Paper WeBT06.1 | Add to My Program |
Frequency Domain Identification of Multirate Systems: A Lifted Local Polynomial Modeling Approach |
|
van Haren, Max | Eindhoven University of Technology |
Blanken, Lennart | Eindhoven University of Technology |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Identification for control, Closed-loop identification, Sampled-data control
Abstract: Frequency-domain representations of multirate systems are essential for controller design and performance evaluation of multirate systems and sampled-data control. The aim of this paper is to develop a time-efficient closed-loop identification approach for multirate systems in the frequency domain. The developed method utilizes local polynomial modeling for lifted representations of LPTV systems, which enables direct identification of closed-loop multirate systems in a single identification experiment. Unlike LTI identification techniques, the developed method does not suffer from bias due to ignored LPTV dynamics. The developed approach is demonstrated on a multirate example, resulting in accurate and fast identification in the frequency domain.
|
|
13:50-14:10, Paper WeBT06.2 | Add to My Program |
Inverse Stochastic Optimal Control for Linear-Quadratic Gaussian and Linear-Quadratic Sensorimotor Control Models |
|
Karg, Philipp | Karlsruhe Institute of Technology (KIT) |
Stoll, Simon | Karlsruhe Institute of Technology (KIT) |
Rothfuß, Simon | Karlsruhe Institute of Technology (KIT) |
Hohmann, Soeren | KIT |
Keywords: Identification, Stochastic optimal control, Optimization
Abstract: In this paper, we define and solve the Inverse Stochastic Optimal Control (ISOC) problem of the linear-quadratic Gaussian (LQG) and the linear-quadratic sensorimotor (LQS) control model. These Stochastic Optimal Control (SOC) models are state-of-the-art approaches describing human movements. The LQG ISOC problem consists of finding the unknown weighting matrices of the quadratic cost function and the covariance matrices of the additive Gaussian noise processes based on ground truth trajectories observed from the human in practice. The LQS ISOC problem aims at additionally finding the covariance matrices of the signal-dependent noise processes characteristic for the LQS model. We propose a solution to both ISOC problems which iteratively estimates cost function and covariance matrices via two bi-level optimizations. Simulation examples show the effectiveness of our developed algorithm. It finds parameters that yield trajectories matching mean and variance of the ground truth data.
|
|
14:10-14:30, Paper WeBT06.3 | Add to My Program |
Finite-Sample Analysis of Identification of Switched Linear Systems with Arbitrary or Restricted Switching |
|
Shi, Shengling | Delft University of Technology |
Mazhar, Othmane | KTH Royal Institute of Technology |
De Schutter, Bart | Delft University of Technology |
Keywords: Identification, Switched systems, Identification for control
Abstract: For the identification of switched systems with measured states and a measured switching signal, this work aims to analyze the effect of switching strategies on the estimation error. The data is assumed to be collected from globally asymptotically or marginally stable switched systems under switches that are arbitrary or subject to an average dwell time constraint. Then the switched system is estimated by the least-squares (LS) estimator. To capture the effect of the parameters of the switching strategies on the LS estimation error, finite-sample error bounds are developed in this work. The obtained error bounds show that the estimation error is logarithmic of the switching parameters when there are only stable modes; however, when there are unstable modes, the estimation error bound can increase linearly as the switching parameter changes. This suggests that in the presence of unstable modes, the switching strategy should be properly designed to avoid the significant increase of the estimation error.
|
|
14:30-14:50, Paper WeBT06.4 | Add to My Program |
Physics-Guided and Energy-Based Learning of Interconnected Systems: From Lagrangian to Port-Hamiltonian Systems |
|
Bao, Yajie | The University of Georgia |
Thesma, Vaishnavi | University of Georgia |
Kelkar, Atul | Clemson University |
Mohammadpour Velni, Javad | Clemson University |
Keywords: Neural networks, Identification for control, Nonlinear systems identification
Abstract: This paper presents a framework for physics-informed energy-based neural network (NN) design to learn models of interconnected systems under the port-Hamiltonian (pH) formalism. In particular, this paper focuses on mechanical systems and incorporates the physical knowledge of Lagrangians into the neural networks to facilitate learning of equations of motion from the data. Moreover, the transformation from the Lagrangian mechanics to the Hamiltonian mechanics is incorporated into the NN architecture and learned from the data such that the learned model is compatible with the pH framework. Then, the structure of input-state-output pH models is imposed on the NN, which guarantees the dissipativity of the learned model. Furthermore, modeling interconnected systems is facilitated by the compositionality property of the pH systems. Additionally, the consistency between the Hamiltonian and Lagrangian is employed for the energy estimation to enable energy-based control. The proposed approach is shown to be computationally more efficient than the existing Lagrangian-based NN design approaches. Furthermore, the learned models with energy estimation are employed for energy-based model predictive control (MPC) design purpose. Experimental results using single (and double) inverted pendulum on carts show that the proposed learning-based approach can achieve an improved performance of model identification compared to the Lagrangian neural networks, accurate estimation of energies and strong control performance.
|
|
14:50-15:10, Paper WeBT06.5 | Add to My Program |
Data-Driven Explicit Predictive Control |
|
Sassella, Andrea | Politecnico Di Milano |
Breschi, Valentina | Politecnico Di Milano |
Formentin, Simone | Politecnico Di Milano |
Keywords: Identification for control, Predictive control for linear systems
Abstract: In this paper, we propose a novel approach to map experimental measurements directly onto explicit predictive control laws, without the need of deriving the system matrices first. To this aim, we resort to Willems’ fundamental lemma, through which we obtain the explicit formulas by suitably elaborating the resulting constrained optimization problem. To tackle the pervading presence of noise in real data, we propose a noise handling strategy guaranteeing to asymptotically recover the noiseless performance. The effectiveness of the proposed method is supported by numerical simulations on two benchmark examples.
|
|
15:10-15:30, Paper WeBT06.6 | Add to My Program |
Dead-Beat Identification for Model Reference Adaptive Control |
|
Kashani, Ali | University of New Mexico |
Kalhor, Ahmad | University of Tehran |
Araabi, Babak N. | University of Tehran |
Danielson, Claus | University of New Mexico |
Keywords: Identification, Indirect adaptive control, Robust adaptive control
Abstract: In this work, we propose a novel parameter identifier for linear time-invariant systems, which has the fastest possible convergence in principal (i.e., dead-beat). The identifier applies where states and derivatives of states can be measured. The identifier is applied to the certainty-equivalence (indirect) model reference adaptive control that relaxes the conventional assumptions about open-loop stability and persistent excitation in MRAC. Furthermore, in conventional MRAC the parameter convergence rate is proportional to the tracking error, whereas our proposed adaptation law has dead-beat dynamics. This method is previously applied to a Delta robot. In this paper, we present a theoretical analysis of the method by solving the identifier dynamics to get its closed-form solution and examine the effect of perturbations such as white additive noise, uncertainty, and disturbance. The linearized model of an actuated inverted pendulum with unknown parameters is used as a benchmark example.
|
|
WeBT07 Invited Session, Tulum Ballroom G |
Add to My Program |
Online Learning, Optimization, and Game Theory I |
|
|
Chair: Doan, Thinh T. | Virginia Tech |
Co-Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Doan, Thinh T. | Virginia Tech |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Zhang, Kaiqing | MIT |
|
13:30-13:50, Paper WeBT07.1 | Add to My Program |
Independent Natural Policy Gradient Methods for Potential Games: Finite-Time Global Convergence with Entropy Regularization (I) |
|
Cen, Shicong | Carnegie Mellon University |
Chen, Fan | Peking University |
Chi, Yuejie | Carnegie Mellon University |
Keywords: Game theory, Optimization algorithms
Abstract: A major challenge in multi-agent systems is that the system complexity grows dramatically with the number of agents as well as the size of their action spaces, which is typical in real world scenarios such as autonomous vehicles, robotic teams, network routing, etc. It is hence in imminent need to design decentralized or independent algorithms where the update of each agent is only based on their local observations without the need of introducing complex communication/coordination mechanisms. In this work, we study the finite-time convergence of independent entropy-regularized natural policy gradient (NPG) methods for potential games, where the difference in an agent's utility function due to unilateral deviation matches exactly that of a common potential function. The proposed entropy-regularized NPG method enables each agent to deploy symmetric, decentralized, and multiplicative updates according to its own payoff. We show that the proposed method converges to the quantal response equilibrium (QRE)---the equilibrium to the entropy-regularized game---at a sublinear rate, which is independent of the size of the action space and grows at most sublinearly with the number of agents. Appealingly, the convergence rate further becomes independent with the number of agents for the important special case of identical-interest games, leading to the first method that converges at a dimension-free rate. Our approach can be used as a smoothing technique to find an approximate Nash equilibrium (NE) of the unregularized problem without assuming that stationary policies are isolated.
|
|
13:50-14:10, Paper WeBT07.2 | Add to My Program |
Inverse Reinforcement Learning Control for Linear Multiplayer Games (I) |
|
Lian, Bosen | The University of Texas at Arlington |
Donge, Vrushabh | University of Texas at Arlington |
Lewis, Frank L. | University of Texas at Arlington |
Chai, Tianyou | Northeastern University |
Davoudi, Ali | University of Texas-Arlington |
Keywords: Adaptive control, Optimal control, Linear systems
Abstract: This paper proposes model-based and model-free inverse reinforcement learning (RL) control algorithms for multiplayer game systems described by linear continuous-time differential equations. Both algorithms find the learner the same optimal control policies and trajectories as the expert, by inferring the unknown expert players' cost functions from the expert's trajectories. This paper first discusses a model-based inverse RL policy iteration that consists of 1) policy evaluation for cost matrices using a Lyapunov equation, 2) state-reward weight improvement using inverse optimal control (IOC), and 3) policy improvement using optimal control. Based on the model-based algorithm, an online data-driven inverse RL algorithm is proposed without knowing system dynamics or expert control gains. Rigorous convergence and stability analysis of these algorithms are provided. Finally, a simulation example verifies our approach.
|
|
14:10-14:30, Paper WeBT07.3 | Add to My Program |
Independent Learning and Subjectivity in Mean-Field Games (I) |
|
Yongacoglu, Bora | Queen's University |
Arslan, Gurdal | University of Hawaii at Manoa |
Yuksel, Serdar | Queen's University |
Keywords: Mean field games, Learning, Large-scale systems
Abstract: Independent learners naively employ single-agent learning algorithms in multi-agent systems, oblivious to the effect of other strategic agents present in their environment. This paper studies partially observed N-player mean-field games from a decentralized learning perspective with two primary objectives: (i) to study the convergence properties of independent learners, and (ii) to identify structural properties that can guide algorithm design. Toward the first objective, we study the learning iterates obtained by independent learners, and find that these iterates converge under mild conditions. We then present a notion of subjective equilibrium suitable for analyzing independent learners. Toward the second objective, we study policy updating processes subject to a so-called epsilon-satisficing condition: agents who are subjectively epsilon-best-responding at a given joint policy do not change their policy. After establishing structural results for such processes, we develop an independent learning algorithm for N-player mean-field games. Exploiting the aforementioned structural results, we give guarantees of convergence to subjective epsilon-equilibrium under self-play.
|
|
14:30-14:50, Paper WeBT07.4 | Add to My Program |
On the Global Convergence of Stochastic Fictitious Play in Stochastic Games with Turn-Based Controllers (I) |
|
Sayin, Muhammed Omer | Bilkent University |
Keywords: Game theory, Markov processes, Learning
Abstract: This paper presents a learning dynamic with almost sure convergence guarantee for any stochastic game with turn-based controllers (on state transitions) as long as stage-payoffs have stochastic fictitious-play-property. For example, two-player zero-sum and n-player potential strategic-form games have this property. Note also that stage-payoffs for different states can have different structures such as they can sum to zero in some states and be identical in others. The dynamics presented combines the classical stochastic fictitious play with value iteration for stochastic games. There are two key properties: (i) players play finite horizon stochastic games with increasing lengths within the underlying infinite-horizon stochastic game, and (ii) the turn-based controllers ensure that the auxiliary stage-games (induced from the continuation payoff estimated) have the stochastic fictitious-play-property.
|
|
14:50-15:10, Paper WeBT07.5 | Add to My Program |
On Confident Policy Evaluation for Factored Markov Decision Processes with Node Dropouts (I) |
|
Fiscko, Carmel | Carnegie Mellon University |
Kar, Soummya | Carnegie Mellon University |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Markov processes, Machine learning, Agents-based systems
Abstract: In this work we investigate an importance sampling approach for evaluating policies for a structurally time-varying factored Markov decision process (MDP), i.e. the policy's value is estimated with a high-probability confidence interval. In particular, we begin with a multi-agent MDP controlled by a known policy but with unknown transition dynamics. One agent is then removed from the system - i.e. the system experiences node dropout - forming a new MDP of the remaining agents, with a new state space, action space, and new transition dynamics. We assume that the effect of removing an agent corresponds to the marginalization of its factor in the transition dynamics. The reward function may likewise be marginalized, or it may be entirely redefined for the new system. Robust policy importance sampling is then used to evaluate candidate policies for the new system, and estimated values are presented with probabilistic confidence bounds. This computation is completed with no observations of the new system, meaning that a safe policy may be found before dropout occurs. The utility of this approach is demonstrated in simulation and compared to Monte Carlo simulation of the new system.
|
|
15:10-15:30, Paper WeBT07.6 | Add to My Program |
Inducing Social Optimality in Games Via Adaptive Incentive Design (I) |
|
Maheshwari, Chinmay | University of California Berkeley |
Kulkarni, Kshitij | University of California, Berkeley |
Wu, Manxi | University of California, Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Game theory, Learning, Emerging control applications
Abstract: How can a social planner adaptively incentivize selfish agents who are learning in a strategic environment to induce a socially optimal outcome in the long run? We propose a two-timescale learning dynamics to answer this question in games. In our learning dynamics, players adopt a class of learning rules to update their strategies at a faster timescale, while a social planner updates the incentive mechanism at a slower timescale. In particular, the update of the incentive mechanism is based on each player's externality, which is evaluated as the difference between the player's marginal cost and the society's marginal cost in each time step. We show that any fixed point of our learning dynamics corresponds to the optimal incentive mechanism such that the corresponding Nash equilibrium also achieves social optimality. We also provide sufficient conditions for the learning dynamics to converge to a fixed point so that the adaptive incentive mechanism eventually induces a socially optimal outcome. Finally, as an example, we demonstrate that the sufficient conditions for convergence are satisfied in Cournot competition with finite players.
|
|
WeBT08 Regular Session, Tulum Ballroom H |
Add to My Program |
Formal Verification and Synthesis |
|
|
Chair: Lahijanian, Morteza | University of Colorado Boulder |
Co-Chair: Vasile, Cristian Ioan | Lehigh University |
|
13:30-13:50, Paper WeBT08.1 | Add to My Program |
Logic for Timed Agent Network Topologies |
|
Eberhart, Clovis | National Institute of Informatics |
Haydon, James Henri | National Institute of Informatics |
Dubut, Jérémy | National Institute of Informatics |
Cetinkaya, Ahmet | National Institute of Informatics |
Pruekprasert, Sasinee | National Institute of Advanced Industrial Science and Technology |
Keywords: Formal Verification/Synthesis, Agents-based systems, Network analysis and control
Abstract: We define μTGL, a spatio-temporal logic with fixed points and first-order agent quantification whose expressive power allows the definition of topological properties of networks of communicating agents. The existence of temporal operators and fixed points requires particular care when defining its semantics. We demonstrate the logic’s usefulness on an example, where we monitor a complex property that ensures resilient consensus.
|
|
13:50-14:10, Paper WeBT08.2 | Add to My Program |
Planning for Modular Aerial Robotic Tools with Temporal Logic Constraints |
|
Cardona, Gustavo A. | Lehigh University |
Saldana, David | Lehigh University |
Vasile, Cristian Ioan | Lehigh University |
Keywords: Formal Verification/Synthesis, Autonomous robots
Abstract: Modular robots are highly versatile due to their ability to reconfigure and change their mechanical properties. This ability makes them optimal for scenarios that require different types of tasks. However, task allocation and cooperation become a combinatorial problem when the number of modules increases. To tackle this problem, we propose a high-level planner for reconfigurable robots with heterogeneous capabilities, e.g., aerial motion and tool operation. Modules can attach and detach to create configurations that manipulate tools satisfying temporal and logic-constrained tasks. The mission is specified using Metric Temporal Logic (MTL) which offers the capacity to not only account for where and who needs to satisfy a task but also when and for how long. We model the problem using a Mixed Integer Linear Problem (MILP) approach, capturing cost for reconfiguration, satisfying a task, and motion in the environment in a specific configuration. Additionally, we consider that not all configurations can satisfy every task. We find trajectories for modular robots that guarantee mission satisfaction. Finally, we show the performance in simulations with multiple tasks and requirements in an environment.
|
|
14:10-14:30, Paper WeBT08.3 | Add to My Program |
Pareto Optimal Strategies for Event-Triggered Estimation |
|
Theurkauf, Anne | University of Colorado, Boulder |
Ahmed, Nisar | University of Colorado Boulder |
Lahijanian, Morteza | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis, Estimation, Autonomous systems
Abstract: This work considers the problem of resource-performance trade-off analysis for a system consisting of an active agent and a sensor network. Specifically, we focus on Event-triggered (ET) estimation, which allows communication of measurements only when deemed useful, e.g., when Kalman filter innovations exceed some threshold. We introduce a framework for ET-threshold strategy synthesis that optimally trades off resource consumption with task performance. Our approach is based on a novel belief space discretization technique that abstracts a continuous-space dynamics model for ET estimation to a finite Markov decision process. We show this abstraction method is more efficient than alternatives and also benefits from interpretability. Then, we leverage off-the-shelf tools to compute the set of all optimal trade-offs between resource consumption and task performance. Given a choice from this set, we synthesize an ET-threshold strategy that achieves the desirable behavior. Simulated results show our approach identifies non-trivial trade-offs between performance and energy savings, with only modest computational effort.
|
|
14:30-14:50, Paper WeBT08.4 | Add to My Program |
Temporal Relaxation of Signal Temporal Logic Specifications for Resilient Control Synthesis |
|
Buyukkocak, Ali Tevfik | University of Minnesota |
Aksaray, Derya | Northeastern University |
Keywords: Formal Verification/Synthesis, Optimal control, Autonomous systems
Abstract: We introduce a metric that can quantify the temporal relaxation of Signal Temporal Logic (STL) specifications and facilitate resilient control synthesis in the face of infeasibilities. The proposed metric quantifies a cumulative notion of relaxation among the subtasks, and minimizing it yields to structural changes in the original STL specification by i) modifying time-intervals, ii) removing subtasks entirely if needed. To this end, we formulate an optimal control problem that extracts state and input sequences by minimally violating the temporal requirements while achieving the desired predicates. We encode this problem in the form of a computationally efficient mixed-integer program. We show some theoretical results on the properties of the new metric. Finally, we present a case study of a robot that minimally violates the time constraints of desired tasks in the face of an infeasibility.
|
|
14:50-15:10, Paper WeBT08.5 | Add to My Program |
Backward Reachability Analysis for Neural Feedback Loops |
|
Rober, Nicholas | MIT |
Everett, Michael | MIT |
How, Jonathan, P. | MIT |
Keywords: Formal Verification/Synthesis, Neural networks, Uncertain systems
Abstract: The increasing prevalence of neural networks (NNs) in safety-critical applications calls for methods to certify their behavior and guarantee safety. This paper presents a backward reachability approach for safety verification of neural feedback loops (NFLs), i.e., closed-loop systems with NN control policies. While recent works have focused on forward reachability as a strategy for safety certification of NFLs, backward reachability offers advantages over the forward strategy, particularly in obstacle avoidance scenarios. Prior works have developed techniques for backward reachability analysis for systems without NNs, but the presence of NNs in the feedback loop presents a unique set of problems due to the nonlinearities in their activation functions and because NN models are generally not invertible. To overcome these challenges, we use existing forward NN analysis tools to find affine bounds on the control inputs and solve a series of linear programs (LPs) to efficiently find an approximation of the backprojection (BP) set, i.e., the set of states for which the NN control policy will drive the system to a given target set. We present an algorithm to iteratively find BP set estimates over a given time horizon and demonstrate the ability to reduce conservativeness in the BP set estimates by up to 88% with some extra computational cost. We use numerical results from a double integrator model to verify the efficacy of these algorithms and demonstrate the ability to certify safety for a linearized ground robot model in a collision avoidance scenario where forward reachability fails.
|
|
15:10-15:30, Paper WeBT08.6 | Add to My Program |
NNSynth: Neural Network Guided Abstraction-Based Controller Synthesis for Stochastic Systems |
|
Sun, Xiaowu | University of California, Irvine |
Shoukry, Yasser | University of California, Irvine |
Keywords: Formal Verification/Synthesis, Neural networks
Abstract: In this paper, we introduce NNSynth, a new framework that uses machine learning techniques to guide the design of abstraction-based controllers with correctness guarantees. NNSynth utilizes neural networks (NNs) to guide the search over the space of controllers. The trained neural networks are ``projected'' and used for constructing a ``local'' abstraction of the system. An abstraction-based controller is then synthesized from such ``local'' abstractions. If a controller that satisfies the specifications is not found, then the best found controller is ``lifted'' to a neural network for further training. Our experiments show that this neural network-guided synthesis leads to more than 50x or even 100x speedup in high dimensional systems compared to the state-of-the-art.
|
|
WeBT09 Regular Session, Maya Ballroom I |
Add to My Program |
Networked Control Systems I |
|
|
Chair: Leonow, Sebastian | Ruhr-Universität Bochum |
Co-Chair: D'Innocenzo, Alessandro | University of L'Aquila |
|
13:30-13:50, Paper WeBT09.1 | Add to My Program |
A Fully Homomorphic Encryption Scheme for Real-Time Safe Control |
|
Stobbe, Pieter | Delft University of Technology |
Keijzer, Twan | Delft University of Technology |
Ferrari, Riccardo M.G. | Delft University of Technology |
Keywords: Networked control systems, Cyber-Physical Security, Embedded systems
Abstract: Fully Homomorphic Encryption (FHE) has made it possible to perform addition and multiplication operations on encrypted data. Using FHE in control thus has the advantage that control effort for a plant can be calculated remotely without ever decrypting the exchanged information. FHE in its current form is however not practically applicable for real-time control as its computational load is very high compared to traditional encryption methods. In this paper a reformulation of the Gentry FHE scheme is proposed and applied on an FPGA to solve this problem. It is shown that the resulting FHE scheme can be implemented for real-time stabilization of an inverted double pendulum using discrete time control.
|
|
13:50-14:10, Paper WeBT09.2 | Add to My Program |
Secure Collaborative Control on Lean Embedded Hardware |
|
Leonow, Sebastian | Ruhr-Universität Bochum |
Monnigmann, Martin | Ruhr-Universität Bochum |
Keywords: Networked control systems, Control over communications, Cooperative control
Abstract: We use the homomorphic properties of the Paillier cryptosystem to implement decoupling control over a wireless network without disclosing any information of a control agents state to neighboring agents. The results from a hardware in the loop implementation reveal the primary bottlenecks regarding the computational load on the local agents. Nevertheless they demonstrate a basic usability of Micropython for encrypted control with viable key sizes.
|
|
14:10-14:30, Paper WeBT09.3 | Add to My Program |
Modular Computation of Restoration Entropy for Networks of Systems: A Dissipativity Approach |
|
Tomar, Mahendra Singh | University of Colorado Boulder |
Zamani, Majid | University of Colorado Boulder |
Keywords: Networked control systems, Control over communications, Estimation
Abstract: The problem of state estimation based on information received over a finite bit-rate channel gives rise to the study of minimal bit rate above which state can be estimated with any desired accuracy. In the past few years, researchers have studied the minimal average bit rate which is sufficient enough for state estimation such that the estimation error stays within a given factor of its initial value. The notion of restoration entropy characterizes this type of bit rate. Recent results proposed numerical schemes to estimate restoration entropy by the computation of singular values of the linearized systems. Such schemes are either complex to implement or suffer severely from computational complexity and the size of the state dimension. In this paper, we describe a modular approach to compute an upper bound of the restoration entropy of a large network by decomposing the network to an interconnection of smaller subsystems. Then, we formulate a distributed optimization problem which is solved for each subsystem separately and then their optimization results are composed to get an upper bound of the restoration entropy for the overall network. We illustrate the effectiveness of our results by two examples.
|
|
14:30-14:50, Paper WeBT09.4 | Add to My Program |
Location Secrecy Enhancement in Adversarial Networks Via Trajectory Control |
|
Khojasteh, Mohammad Javad | Massachusetts Institute of Technology |
Saucan, Augustin | MIT |
Liu, Zhenyu | Massachusetts Institute of Technology |
Conti, Andrea | University of Ferrara |
Win, Moe Z. | Massachusetts Institute of Technology (MIT) |
Keywords: Networked control systems, Communication networks, Linear systems
Abstract: In networked environments, adversaries may exploit location information to perform carefully crafted attacks on cyber-physical systems. To prevent such security breaches, this letter develops a network localization and navigation (NLN) paradigm that accounts for network secrecy in the control of mobile agents. We consider a scenario in which a mobile agent is tasked with maneuvering through an adversarial network, based on a nominal control policy, and we aim to reduce the ability of the adversarial network to infer the mobile agent’s position. Specifically, the Fisher information of the agent’s position obtained by the adversarial network is adopted as a secrecy metric. We propose a new control policy that results from an optimization problem and achieves a compromise between maximizing location secrecy and minimizing the deviation from the nominal control policy. Results show that the proposed optimization-based control policy significantly improves the secrecy of the mobile agent.
|
|
14:50-15:10, Paper WeBT09.5 | Add to My Program |
Secure State Estimation Over Markov Wireless Communication Channels |
|
Impicciatore, Anastasia | University of L'Aquila |
Tsiamis, Anastasios | University of Pennsylvania |
Zacchia Lun, Yuriy | IMT School for Advanced Studies Lucca |
D'Innocenzo, Alessandro | University of L'Aquila |
Pappas, George J. | University of Pennsylvania |
Keywords: Networked control systems, Cyber-Physical Security, Estimation
Abstract: This note studies state estimation in wireless networked control systems with secrecy against eavesdropping. Specifically, a sensor transmits a system state information to the estimator over a legitimate user link, and an eavesdropper overhears these data over its link independent of the user link. Each connection may be affected by packet losses and is modeled by a finite-state Markov channel (FSMC), an abstraction widely used to design wireless communication systems. This paper presents a novel concept of optimal mean square expected secrecy over FSMCs and delineates the design of a secrecy parameter requiring the user mean square estimation error (MSE) to be bounded and eavesdropper MSE unbounded. We illustrate the developed results on an example of an inverted pendulum on a cart whose parameters are estimated remotely over a wireless link exposed to an eavesdropper.
|
|
15:10-15:30, Paper WeBT09.6 | Add to My Program |
A Model-Free Online False Data Injection Attack Strategy in Networked Control Systems |
|
Luo, Xiaoyu | Shanghai Jiao Tong University |
Fang, Chongrong | Shanghai Jiao Tong University |
Zhao, Chengcheng | Zhejiang University |
He, Jianping | Shanghai Jiao Tong University |
Keywords: Networked control systems, Cyber-Physical Security, Optimization
Abstract: The data-driven attack strategies recently have received much attention when the full knowledge of the system model is unknown or difficult to be obtained for the adversary. Note that despite the critical parameters of the system model being unavailable for the adversary, the existing data-driven attack methods still depend on the linearity of the unknown system model. In this paper, we design a completely model-free online attack strategy where the adversary with limited capability aims to compromise state variables such that the output value follows the expected trajectory. Specifically, we first construct a zeroth-order feedback optimization framework and uninterruptedly use probing signals for real-time measurements. Then, we iteratively update the attack signals along the composite direction of the gradient estimates of the objective function evaluations and the projected gradients. These objective function evaluations can be obtained only by real-time measurements. Furthermore, we characterize the optimality of these solutions via the optimality gap, which is affected by the dimensions of the attack signal, the iterations of solutions, and the convergence rate of the system. Extensive simulations are conducted to show the effectiveness of the proposed attack strategy.
|
|
WeBT10 Regular Session, Maya Ballroom II |
Add to My Program |
Stochastic Optimal Control I |
|
|
Chair: Kishida, Masako | National Institute of Informatics |
Co-Chair: Chen, Yongxin | Georgia Institute of Technology |
|
13:30-13:50, Paper WeBT10.1 | Add to My Program |
Solving Mission-Wide Chance-Constrained Optimal Control Using Dynamic Programming |
|
Wang, Kai | Norwegian University of Science and Technology |
Gros, Sebastien | NTNU |
Keywords: Stochastic optimal control, Optimal control, Markov processes
Abstract: This paper aims to provide a Dynamic Programming (DP) approach to solve the Mission-Wide Chance-Constrained Optimal Control Problems (MWCC-OCP). The mission-wide chance constraint guarantees that the probability that the entire state trajectory lies within a constraint/safe region is higher than a prescribed level, and is different from the stage-wise chance constraints imposed at individual time steps. The control objective is to find an optimal policy sequence that achieves both (i) satisfaction of a mission-wide chance constraint, and (ii) minimization of a cost function. By transforming the stage-wise chance-constrained problem into an unconstrained counterpart via Lagrangian method, standard DP can then be deployed. Yet, for MWCC-OCP, this methods fails to apply, because the mission-wide chance constraint cannot be easily formulated using stage-wise chance constraints due to the time-correlation between the latter (individual states are coupled through the system dynamics). To fill this gap, firstly, we detail the conditions required for a classical DP solution to exist for this type of problem; secondly, we propose a DP solution to the MWCC-OCP through state augmentation by introducing an additional functional state variable.
|
|
13:50-14:10, Paper WeBT10.2 | Add to My Program |
Schrödinger Meets Kuramoto Via Feynman-Kac: Minimum Effort Distribution Steering for Noisy Nonuniform Kuramoto Oscillators |
|
Nodozi, Iman | University of California, Santa Cruz |
Halder, Abhishek | University of California, Santa Cruz |
Keywords: Stochastic optimal control, Stochastic systems, Optimal control
Abstract: We formulate and solve the problem of finite horizon minimum control effort steering of the state probability distribution between prescribed endpoint joints for a finite population of networked noisy nonuniform Kuramoto oscillators. We consider both the first and second order stochastic Kuramoto models. For numerical solution of the associated stochastic optimal control, we propose combining certain measure-valued proximal recursions and the Feynman-Kac path integral computation. We illustrate the proposed framework via numerical examples.
|
|
14:10-14:30, Paper WeBT10.3 | Add to My Program |
Risk-Aware Event and Self-Triggered Controls by Worst-Case Conditional Value-At-Risk |
|
Kishida, Masako | National Institute of Informatics |
Keywords: Stochastic optimal control, Stochastic systems, Linear systems
Abstract: After introducing a risk-aware stability definition, this paper investigates the problems of the event- and self-triggered control for discrete-time linear stochastic systems in the view of risk. In particular, the risk is quantified by the worst-case Conditional Value-at-Risk (CVaR) using the first two moments of the uncertainty probability distributions. This allows us to design triggering mechanisms considering the risk that the estimated worst-case tail behaviors of the system are worse than a given threshold. Numerical examples are included to illustrate the performances of the proposed approaches.
|
|
14:30-14:50, Paper WeBT10.4 | Add to My Program |
Policy Iteration for Multiplicative Noise Output Feedback Control |
|
Gravell, Benjamin | The University of Texas at Dallas |
Gargiani, Matilde | ETH Zurich |
Lygeros, John | ETH Zurich |
Summers, Tyler H. | University of Texas at Dallas |
Keywords: Stochastic optimal control, Stochastic systems, Linear systems
Abstract: We propose a policy iteration algorithm for solving the multiplicative noise linear quadratic output feedback design problem. The algorithm solves a set of coupled Riccati equations for estimation and control arising from a partially observable Markov decision process (POMDP) under a class of linear dynamic control policies. We show in numerical experiments far faster convergence than a value iteration algorithm, formerly the only known algorithm for solving this class of problem. The results suggest promising future research directions for policy optimization algorithms in more general POMDPs, including the potential to develop novel approximate data-driven approaches when model parameters are not available.
|
|
14:50-15:10, Paper WeBT10.5 | Add to My Program |
Geometry of Finite-Time Thermodynamic Cycles with Anisotropic Thermal Fluctuations |
|
Movilla Miangolarra, Olga | University of Calfornia, Irvine |
Taghvaei, Amirhossein | University of Washington Seattle |
Chen, Yongxin | Georgia Institute of Technology |
Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Stochastic optimal control, Optimal control, Stochastic systems
Abstract: In contrast to the classical concept of a Carnot engine that alternates contact between heat baths of different temperatures, naturally occurring processes usually harvest energy from anisotropy, being exposed simultaneously to chemical and thermal fluctuations of different intensities. In these cases, the enabling mechanism responsible for transduction of energy is the presence of a non-equilibrium steady state (NESS). A suitable stochastic model for such a phenomenon is the Brownian gyrator -- a two-degree of freedom stochastically driven system that exchanges energy and heat with the environment. In this context, we present a geometric view of the energy harvesting mechanism, from a stochastic control perspective, that entails a forced periodic trajectory of the system state on the thermodynamic manifold. Dissipation and work output are expressed accordingly as path integrals of a controlled process, and fundamental limitations on power and efficiency are expressed in geometric terms via a relationship to an isoperimetric problem. The theory is presented for high-order systems far from equilibrium and beyond the linear response regime.
|
|
15:10-15:30, Paper WeBT10.6 | Add to My Program |
Quantile Formulation for Optimization under a Qualitative Risk Constraint |
|
Vázquez-Abad, Felisa J. | University of Melbourne |
Shetler, Oliver | Columbia University Irving Medical Center |
Soto, Pedro | CUNY Graduate Center |
Keywords: Stochastic optimal control, Optimization algorithms, Constrained control
Abstract: This paper focuses on the pathologies of common gradient-based algorithms for solving optimization problems under probability constraints. These problems are both important and difficult to handle. We provide a thorough analysis of what we believe is the main pathology of the distributional formulation of the problem, using both examples and rigorous proofs. With this knowledge, we then present an alternative formulation that we call the quantile formulation of the problem and prove that this formulation is exempt of the numerical pathologies of the original formulation. Our theoretical framework is developed for univariate distributions. To complete the work, we present generalizations to a class of multivariate problems motivated by the design of a robotic medical device to guide micro-surgeries, which must satisfy a specific structural reliability
|
|
WeBT11 Regular Session, Maya Ballroom III |
Add to My Program |
Robust Control III |
|
|
Chair: Rantzer, Anders | Lund University |
Co-Chair: Zorzi, Mattia | University of Padova |
|
13:30-13:50, Paper WeBT11.1 | Add to My Program |
H-Infinity Control with Nearly Symmetric State Matrix |
|
Vladu, Emil | Lund University |
Rantzer, Anders | Lund University |
Keywords: Robust control
Abstract: In this paper, we give an upper bound on the deviation from H-infinity optimality of a class of controllers as a function of the deviation from symmetry in the state matrix. We further suggest a scalar measure of symmetry which is shown to be directly relevant for estimating nearness to optimality. In connection to this, we give a simple analytical solution to a class of Lyapunov equations for two dimensional state matrices. Finally, we demonstrate how a well-chosen symmetric part for nearly symmetric state matrices may lead not only to near-optimality, but also to controller sparsity, a desirable property for large-scale systems. In the special case that the state matrix is symmetric and Hurwitz, our main result simplifies to give an H-infinity optimal controller with several benefits, a result which has recently appeared in the literature. In this sense, the above is a significant generalization which considers a much wider class of systems, yet allows one to retain the benefits of symmetric state matrices, while offering means of quantifying the effect of this on the H-infinity norm.
|
|
13:50-14:10, Paper WeBT11.2 | Add to My Program |
Performance Improvement of Output-Feedback Tracking Control in an Augmented ADRC System with a Fixed-Bandwidth LESO |
|
Michalek, Maciej, M. | Poznan University of Technology (PUT) |
Adamski, Wojciech | Poznan University of Technology, VAT Number: PL 777-00-03-699 |
Keywords: Robust control, Uncertain systems, Output regulation
Abstract: Robust output-feedback with active disturbance rejection control (ADRC) proved to be a very effective control paradigm for highly uncertain perturbed systems. The main practical limitation of the ADRC results from an application of a conventional high-gain observer which may cause significant amplification of a high-frequency noise corrupting the output measurements. We show that this main limitation can be significantly reduced by augmenting a conventional ADRC system with an auxiliary on-line estimator of the closed-loop tracking error dynamics which contains key residual information, useful for improving a resultant tracking control performance in the augmented ADRC system. It is shown that the tracking control improvement, resulting from an application of the proposed augmentation mechanism, prevents an excessive amplification of feedback noises and keeps a control cost on an acceptable level, especially for systems with higher-order dynamics. The proposed concept has been illustrated by selected results of extensive numerical simulations.
|
|
14:10-14:30, Paper WeBT11.3 | Add to My Program |
Robust Stabilization of Inverter-Based Resources Using Virtual Resistance-Based Control |
|
Anubi, Olugbenga, M | Florida State University |
Ameli, Sina | Florida State University |
Keywords: Robust control, Uncertain systems, Power systems
Abstract: This paper proposes a virtual resistance-based nonlinear control to stabilize and robustify the current layer of inverter-based resources, subject to the grid voltage disturbances, and the grid parameter uncertainties. A class of virtual resistances is proposed and analyzed using concepts from dissipative systems theory. Moreover, specific nonlinear virtual resistance-based controllers are derived, with their corresponding performance analytically bounded. The theoretical and simulation results show that the proposed nonlinear virtual resistance-based controllers significantly reduce the mathcal{L}_2 gain of the closed-loop error system to grid voltage variation and parametric uncertainties. Significant improvements of the transient and steady-state current responses are also demonstrated over linear virtual resistance counterparts.
|
|
14:30-14:50, Paper WeBT11.4 | Add to My Program |
A New Perspective on Robust Performance for LQG Control Problems |
|
Falconi, Lucia | University of Padova |
Ferrante, Augusto | Universita' Di Padova |
Zorzi, Mattia | University of Padova |
Keywords: Robust control, Uncertain systems, Stochastic optimal control
Abstract: This paper considers a class of discrete time, linear, stochastic uncertain systems defined in terms of a nominal Gaussian state-space model; the uncertainty is described by a relative entropy tolerance for each time increment of the dynamic model. For this class of systems, a problem of worstcase robust performance analysis with respect to a quadratic cost functional is solved. The solution takes the form of a risk-sensitive cost with a time-varying risk-sensitive parameter. Finally, a numerical example is presented to illustrate the methodology.
|
|
14:50-15:10, Paper WeBT11.5 | Add to My Program |
Generating Minimal Controller Sets for Mixing MMAC |
|
Provan, Gregory | University College Cork |
Quinones-Grueiro, Marcos | Vanderbilt University |
Sohege, Yves | Insight-Centre for Data Analytics |
Keywords: Robust adaptive control, Fault tolerant systems, Learning
Abstract: Multiple model adaptive control (MMAC) is an adaptive control method designed for plant parameter uncertainty given both linear and non-linear plant models. For a system subject to varying operating conditions, the number of controllers necessary to guarantee stable control under nominal-plant uncertainty, or under multiple operating conditions, are both unknown. We propose a learning-based controller synthesis approach that can guarantee stability of a system subject to varying operating conditions. We adopt a convex hull (CH)-based multiple-model controller estimation algorithm that only requires N+M+1 controllers, where N is the dimension of a compact nominal uncertainty parameter set, and M is the number of additive faults. We empirically validate this result for a quadcopter, which is subject to faults in rotors and sensors as well as to adverse wind conditions.
|
|
15:10-15:30, Paper WeBT11.6 | Add to My Program |
Barrier Function-Based Stabilization of a Perturbed Chain of Integrators with a Predefined Upperbound of the Settling Time |
|
Estrada, Manuel A. | Facultad De Ingeniería, Universidad Nacional Autónoma De México |
Cruz-Ancona, Christopher D. | Universidad Nacional Autónoma De México, Facultad De Ingeniería |
Ovalle, Luis | TecNM/Instituto Tecnológico De La Laguna |
Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
Keywords: Robust adaptive control, Variable-structure/sliding-mode control
Abstract: This paper propose an adaptive time-varying approach for stabilization for a class of perturbed chains of integrators with unknown control coefficient. To drive the trajectories initiating far from the origin, a time-varying feedback with adaptation is used, ensuring that the closed-loop system’s solutions reach a prescribed vicinity of the origin within a finite-time, which is upper bounded by a prescribed time moment. Once the prescribed neighborhood is reached, a barrier function-based controller guarantees ultimate boundedness of the system’s solutions despite the presence of perturbations with unknown upper bound. Numerical simulations on the trajectory tracking problem of a 1-DOF torsional system and a stabilization problem of a third-order academic example show the effectiveness of the approach.
|
|
WeBT12 Tutorial Session, Maya Ballroom IV |
Add to My Program |
Cybersecurity and Supervisory Control: Synthesis of Attacks and of Defense
Strategies Using Discrete Event Systems Techniques |
|
|
Chair: Lafortune, Stephane | Univ. of Michigan |
Co-Chair: Hadjicostis, Christoforos N. | University of Cyprus |
|
13:30-13:31, Paper WeBT12.1 | Add to My Program |
Cybersecurity and Supervisory Control: A Tutorial on Robust State Estimation, Attack Synthesis, and Resilient Control (I) |
|
Hadjicostis, Christoforos N. | University of Cyprus |
Lafortune, Stephane | Univ. of Michigan |
Lin, Feng | Wayne State Univ |
Su, Rong | Nanyang Technological University |
Keywords: Discrete event systems, Supervisory control, Cyber-Physical Security
Abstract: This tutorial paper studies the effect of deception attacks on compromised sensors and actuators at the supervisory control layer of cyber-physical control systems. The problem is modeled and analyzed in the framework of the theories of state estimation, diagnosability, and supervisory control of discrete event systems, where discrete transition models are used. Both attacks and defense against attacks are considered. First, robust estimation and diagnosis in the presence of sensor attacks is analyzed. Next, the problem of synthesizing covert attacks is formulated and its solution is discussed in different contexts. Then, necessary and sufficient conditions on the existence of resilient supervisors are presented in the context of a general attack model. Finally, the problem of synthesizing supervisors that are resilient to covert attacks on sensors and actuators is studied. This paper accompanies the tutorial session on this topic presented at the IEEE CDC 2022.
|
|
13:31-14:10, Paper WeBT12.2 | Add to My Program |
Tamper-Tolerant State Estimation and Fault Diagnosis in Discrete Event Systems (I) |
|
Hadjicostis, Christoforos N. | University of Cyprus |
Keywords: Discrete event systems, Automata, Fault diagnosis
Abstract: The talk starts by discussing recursive algorithms for state estimation and event inference, both of which are key tasks for monitoring and control of discrete event systems. We also discuss various pertinent properties of interest and, in particular, diagnosability (i.e., the ability to detect within finite time the occurrence/type of a fault). We then analyze how such state estimation and event inference tasks might be compromised when a malicious attacker utilizes its knowledge of the underlying system model and its tampering capabilities (such as deletions, insertions, or substitutions of observed symbols) to influence the sequence of observations generated by the system. Assuming that each tampering action by the attacker is associated with a positive cost, we discuss approaches to describe matching sequences of observations, as well as to systematically obtain corresponding (``robustified") state estimates. We also develop techniques for verifying tamper-tolerant diagnosability under constraints on the total number of deletions, insertions, and substitutions (or, more generally, the total budget) that the attacker has at its disposal.
|
|
14:10-14:50, Paper WeBT12.3 | Add to My Program |
Sensor Deception Attacks in Supervisory Control: Modeling and Synthesis (I) |
|
Lafortune, Stephane | Univ. of Michigan |
Keywords: Discrete event systems, Supervisory control, Cyber-Physical Security
Abstract: We start with a brief review of relevant results from the theory of supervisory control of discrete event systems. We then describe modeling techniques that account for potential deception attacks on sensors in the feedback loop of supervisory control. Next, we consider the point of view of the attacker. How can the attacker inflict damage to the system without being detected? Finally, we outline a methodology for synthesizing person-in-the-middle attacks on communication protocols.
|
|
14:50-15:30, Paper WeBT12.4 | Add to My Program |
An Introduction to Resilient Supervisory Control against Stealthy Sensor and Actuator Attacks (I) |
|
Su, Rong | Nanyang Technological University |
Keywords: Discrete event systems, Supervisory control, Cyber-Physical Security
Abstract: We provide an intuitive introduction to a few recent theories about resilient supervisory control against stealthy attacks. By analyzing the closed-loop control mechanism in the Ramadge-Wonham (RW) supervisory control paradigm, we explain how a stealthy attack may exploit weakness of the RW paradigm. Upon such intuition, we introduce two defense strategies against stealthy sensor attacks and stealthy actuator attacks, respectively. We first identify the minimum ``risk'' information in a given plant model that ensures the existence of a stealthy sensor attack. Then we explain how to eliminate such risk information from the plant model, upon which we show that the existence of a resilient supervisor against all possible stealthy sensor attacks is decidable under some mild assumptions on the plant model. Considering the decidability of the existence of a resilient supervisor against all possible stealthy actuator attacks is still open, we introduce a heuristic supervisor obfuscation strategy, which relies on the fact that, given a plant and a supervisor, the existence of a stealthy actuator attack is decidable. We explain how to obfuscate control information in the supervisor via supervisor reduction so that a resulting control-equivalent obfuscated supervisor may become resilient against all stealthy actuator attacks. Simple examples are provided to illustrate introduced ideas.
|
|
WeBT13 Regular Session, Maya Ballroom V |
Add to My Program |
Predictive Control for Nonlinear Systems II |
|
|
Chair: Findeisen, Rolf | TU Darmstadt |
Co-Chair: Jayawardhana, Bayu | University of Groningen |
|
13:30-13:50, Paper WeBT13.1 | Add to My Program |
Difference of Convex Functions in Robust Tube MPC |
|
Doff-Sotta, Martin | University of Oxford |
Cannon, Mark | University of Oxford |
Keywords: Predictive control for nonlinear systems, Robust control, Optimal control
Abstract: We propose a robust tube-based Model Predictive Control (MPC) paradigm for nonlinear systems whose dynamics can be expressed as a difference of convex functions. The approach exploits the convexity properties of the system model to derive convex conditions that govern the evolution of robust tubes bounding predicted trajectories. These tubes allow an upper bound on a performance cost to be minimised subject to state and control constraints as a convex program, the solution of which can be used to update an estimate of the optimal state and control trajectories. This process is the basis of an iteration that solves a sequence of convex programs at each discrete time step. We show that the algorithm is recursively feasible, converges asymptotically to a fixed point of the iteration and ensures closed loop stability. The algorithm can be terminated after any number of iterations without affecting stability or constraint satisfaction. A case study is presented to illustrate an application of the algorithm.
|
|
13:50-14:10, Paper WeBT13.2 | Add to My Program |
Robust Predictive Output-Feedback Safety Filter for Uncertain Nonlinear Control Systems |
|
Brunke, Lukas | University of Toronto |
Zhou, Siqi | University of Toronto |
Schoellig, Angela P | University of Toronto |
Keywords: Predictive control for nonlinear systems, Uncertain systems
Abstract: In real-world applications, we often require reliable decision making under dynamics uncertainties using noisy high-dimensional sensory data. Recently, we have seen an increasing number of learning-based control algorithms developed to address the challenge of decision making under dynamics uncertainties. These algorithms often make assumptions about the underlying unknown dynamics and, as a result, can provide safety guarantees. This is more challenging for other widely used learning-based decision making algorithms such as reinforcement learning. Furthermore, the majority of existing approaches assume access to state measurements, which can be restrictive in practice. In this paper, inspired by the literature on safety filters and robust output-feedback control, we present a robust predictive output-feedback safety filter (RPOF-SF) framework that provides safety certification to an arbitrary controller applied to an uncertain nonlinear control system. The proposed RPOF-SF combines a robustly stable observer that estimates the system state from noisy measurement data and a predictive safety filter that renders an arbitrary controller safe by (possibly) minimally modifying the controller input to guarantee safety. We show in theory that the proposed RPOF-SF guarantees constraint satisfaction despite disturbances applied to the system. We demonstrate the efficacy of the proposed RPOF-SF algorithm using an uncertain mass-spring-damper system.
|
|
14:10-14:30, Paper WeBT13.3 | Add to My Program |
Iterative Learning and Model Predictive Control for Repetitive Nonlinear Systems Via Koopman Operator Approximation |
|
Saltik, Muhammed Bahadir | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Cherukuri, Ashish | University of Groningen |
Keywords: Process Control, Predictive control for nonlinear systems, Iterative learning control
Abstract: This paper presents an iterative way of computing a control algorithm with the aim of enabling reference tracking for an unknown nonlinear system. The method consists of three blocks: iterative learning control (ILC), robust model predictive control (MPC), and a linear approximation of the Koopman operator. The method proceeds in iterations, where at the end of an iteration, two steps are performed. First, the trajectories of the system obtained from previous iterations are used to build the linear approximation of the Koopman operator. Second, the linear model is used to compute the ILC signal. While these steps are executed in an offline manner, during an iteration, the control actions are computed online using the robust tube-based MPC. The tubes are defined by constraint tightening sets that compensate for the discrepancy between the true dynamics and its linear approximation. We demonstrate our method on the reference tracking for a 4 tank system.
|
|
14:30-14:50, Paper WeBT13.4 | Add to My Program |
Non-Diverging Neural Networks Supported Tube Model Predictive Control |
|
Zieger, Tim | Otto-Von-Guericke University Magdeburg |
Nguyen, Hoang Hai | Otto-Von-Guericke University Magdeburg |
Schulz, Erik | IAV GmbH |
Oehlschlägel, Thimo | IAV |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for nonlinear systems, Predictive control for linear systems, Machine learning
Abstract: Machine learning techniques such as neural networks bear the potential to improve the performance and applicability of model predictive control to real-world systems. However, they also bear the danger of erratic-unpredictable behavior and malfunctioning of machine learning approaches. Neural networks might fail to predict system behavior as it is often impossible to provide strict performance or uncertainty bounds. While this challenge can be tackled using robust model predictive control approaches that span a safety net around the machine learning supported predictions, this can lead to significant performance degradation and infeasibility. To tackle this challenge, a safe neural network-supported learning tube model predictive control scheme is proposed, which allows to bound the worst-case performance in case of a malfunctioning of the machine learning component, yet enables decreasing the conservatism. The basic idea is to enforce the neural network to stay in the vicinity of a presumed given nominal model with the error dynamics directly incorporated into the neural network output function. Therefore, the error dynamics do not require an additional control input, resulting in the omission of input constraint tightening. Constraint fulfillment is guaranteed, robust set stability for a particular learning function class is established, and an upper bound for the performance of a malfunctioning neural network is given. The method is evaluated in simulations considering a rover operating in an uncertain environment.
|
|
14:50-15:10, Paper WeBT13.5 | Add to My Program |
Nonlinear Dual-Mode Model Predictive Control Using Koopman Eigenfunctions |
|
Krolicki, Alexander | Clemson University |
Tellez Castro, Duvan Andres | Universidad Nacional De Colombia |
Vaidya, Umesh | Clemson University |
Keywords: Predictive control for nonlinear systems, Computational methods, Identification for control
Abstract: We present an analytical and computational framework using the theory of Koopman operator to design dual-mode model predictive control (MPC) for nonlinear control systems. Dual-mode MPC incorporates stability and performance constraints in the controller and remains a challenging problem to solve for systems involving nonlinear dynamics. We exploit the spectral properties of the Koopman operator to provide a systematic approach for the design of optimal control, guaranteeing the stability of the feedback control system. The optimal cost function is also used for characterizing the maximal invariant set, which serves as the terminal set in the design of finite-horizon MPC. The finite horizon MPC relies on lifting nonlinear system dynamics using Koopman eigenfunctions to provide a linear approach for the design of MPC with state and input constraints.
|
|
15:10-15:30, Paper WeBT13.6 | Add to My Program |
Funnel MPC with Feasibility Constraints for Nonlinear Systems with Arbitrary Relative Degree |
|
Berger, Thomas | Universität Paderborn |
Dennstädt, Dario | Technische Universität Ilmenau |
Keywords: Predictive control for nonlinear systems, Constrained control
Abstract: We study tracking control for nonlinear systems with known relative degree and stable internal dynamics by the recently introduced technique of Funnel MPC. The objective is to achieve the evolution of the tracking error within a prescribed performance funnel. We propose a novel stage cost for Funnel MPC, extending earlier designs to the case of arbitrary relative degree, and show that the control objective as well as initial and recursive feasibility are always achieved - without requiring any terminal conditions or a sufficiently long prediction horizon. We only impose an additional feasibility constraint in the optimal control problem.
|
|
WeBT14 Regular Session, Maya Ballroom VI |
Add to My Program |
Control Applications IV |
|
|
Chair: Ferrari, Riccardo M.G. | Delft University of Technology |
Co-Chair: Gatsis, Nikolaos | The University of Texas at San Antonio |
|
13:30-13:50, Paper WeBT14.1 | Add to My Program |
Robust Nonlinear Control for the Fully-Actuated Hexa-Rotor: Theory and Experiments |
|
Flores, Gerardo | Center for Research in Optics |
Montes de Oca Rebolledo, Andres | Centro De Investigaciones En Optica |
Flores, Alejandro | Centro De Investigaciones En Óptica A.C |
Keywords: Aerospace, Mechatronics, Robotics
Abstract: This work addresses a fully-actuated Hexa-rotor's control under exogenous position and attitude disturbances. For that aim, we propose a nonlinear and robust control based on a backstepping technique for position dynamics and a geometric control for attitude dynamics. We demonstrate that the closed-loop system is globally exponentially stable. To validate the reached stability, we present simulation results using Matlab/Simulink and software in the loop simulations using the well-known PX4 firmware in a realistic scenario on a virtual environment in Gazebo. Finally, to ultimately demonstrate the effectiveness of our algorithm, we implement it on an own-developed fully-actuated Hexa-rotor platform. Our contributions can be summarized as 1) the design of a robust controller for the fully-actuated Hexa-rotor not presented in the literature, 2) the publication of the controller's code for the PX4 autopilot software, and 3) drone's construction and presentation of real flight experiments.
|
|
13:50-14:10, Paper WeBT14.2 | Add to My Program |
Space Lander Descent Control System Design Considering Fuel Sloshing under Microgravity |
|
Tanaka, Yuji | Keio University |
Baba, Mitsuhisa | Apan Aerospace Exploration Agency |
Otsuki, Masatsugu | Japan Aerospace Exploration Agency |
Fujita, Kazuhisa | Japan Aerospace Exploration Agency |
Himeno, Takehiro | University of Tokyo |
Maeda, Takao | Tokyo University of Agriculture and Technology |
Takahashi, Masaki | Keio University |
Keywords: Aerospace, Control applications, Simulation
Abstract: This study proposes a descent control system for a space lander with a large amount of fuel, to protect the lander from overturning when landing on microgravity objects. As sloshing is a major problem during descent, it is desirable to construct a control system that considers the sloshing dynamics. We derive a model of the lander considering fuel dynamics, determine the fuel-optimal trajectory for guidance, design a fuel state estimator for navigation, and design a tracking controller. The effectiveness of the proposed method is evaluated using a physical simulator. The proposed method achieves simultaneous stabilization of the lander and fuel, and is shown to be effective in preventing overturning.
|
|
14:10-14:30, Paper WeBT14.3 | Add to My Program |
Assessment of Quadrotor PID Control Algorithms Using Six-Degrees of Freedom CFD Simulations |
|
Carreńo Ruiz, Manuel | Politecnico Di Torino |
Bloise, Nicoletta | Politecnico Di Torino |
Capello, Elisa | Politecnico Di Torino, CNR-IEIIT |
D'Ambrosio, Domenic | Politecnico Di Torino - DIMEAS |
Guglieri, Giorgio | Politecnico Di Torino |
Keywords: Aerospace, Computational methods, PID control
Abstract: The evolution of technology has made increasingly advantageous the introduction of Unmanned Aerial Systems (UASs) in various applications, especially by exploiting their ability for autonomous flight. This paper presents an innovative approach to simulating UAS maneuvers that integrates a Computational Fluid Dynamics (CFD) model and a closed-loop control algorithm for both position and attitude dynamics. We chose the Proportional-Integrative-Derivative (PID) controller for this preliminary research activity because of its simple implementation and widespread employment in commercial autopilot systems. The numerical simulation of the UAS aerodynamics allows for performing an accurate analysis in critical situations. These include, for example, ground effect or wind gusts scenarios, which require an enhanced propulsive model to capture the interaction between vehicle dynamics, aerodynamics, and environmental conditions. The coupled CFD/PID framework can be a virtual testing environment for UAS platforms. Here we report on its validation. The paper compares such an innovative in-the-loop CFD approach and a classical simplified propulsive model that adopts constant thrust and torque coefficients.
|
|
14:30-14:50, Paper WeBT14.4 | Add to My Program |
Image-Based Visual Servoing Via Nonsingular Fast Terminal Adaptive Sliding Mode Control for a Quadrotor UAV Subjected to Wind Fields |
|
Miranda-Moya, Armando | Tecnologico De Monterrey |
Castaneda, Herman | Tecnologico De Monterrey |
Wang, Hesheng | Shanghai Jiao Tong University |
Keywords: Visual servo control, Flight control, Autonomous vehicles
Abstract: This paper presents the design of an image-based visual servoing developed under a nonsingular fast terminal adaptive sliding mode strategy. The approach considers a quadrotor UAV executing a dynamic target tracking against wind turbulence. The image projection of the target is extracted through a virtual camera method to establish an image features vector that allows controlling the position and heading of the rotorcraft through a nonsingular fast terminal adaptive sliding mode controller, which provides robustness against model uncertainties and bounded external disturbances; and chattering reduction due to the nonoverestimation of its gain. Furthermore, a Lyapunov stability analysis guarantees the practical finite-time convergence of the closed-loop system. Finally, simulation results demonstrate the feasibility and advantages of the proposed visual-based control scheme.
|
|
14:50-15:10, Paper WeBT14.5 | Add to My Program |
Convex Model Predictive Control for Down-Regulation Strategies on Wind Turbines |
|
Gonzalez Silva, Jean | Delft University of Technology |
Ferrari, Riccardo M.G. | Delft University of Technology |
van Wingerden, Jan-Willem | Delft University of Technology |
Keywords: Power generation, Predictive control for nonlinear systems, Fluid flow systems
Abstract: Wind turbine (WT) controllers are often geared towards maximum power extraction, while suitable operating constraints should be guaranteed such that WT components are protected from failures. Control strategies can be also devised to reduce the generated power, for instance to track a power reference provided by the grid operator. They are called down- regulation strategies and allow to balance power generation and grid loads, as well as to provide ancillary grid services, such as frequency regulation. Although this balance is limited by the wind availability and grid demand, the quality of wind energy can be improved by introducing down-regulation strategies that make use of the kinetic energy of the turbine dynamics. This paper shows how the kinetic energy in the rotating components of turbines can be used as an additional degree-of-freedom by different down-regulation strategies. In particular we explore the power tracking problem based on convex model predictive control (MPC) at a single wind turbine. The use of MPC allows us to introduce a further constraint that guarantees flow stability and avoids stall conditions. Simulation results are used to illustrate the performance of the developed down- regulation strategies. Notably, by maximizing rotor speeds, and thus kinetic energy, the turbine can still temporarily guarantee tracking of a given power reference even when occasional saturation of the available wind power occurs. In the study case we proved that our approach can guarantee power tracking in saturated conditions for 10 times longer than with traditional down-regulation strategies.
|
|
15:10-15:30, Paper WeBT14.6 | Add to My Program |
Assessing the Optimality of LinDist3Flow for Optimal Tap Selection of Step Voltage Regulators in Unbalanced Distribution Networks |
|
Ayyagari, Krishna Sandeep | The University of Texas at San Antonio |
Abraham, Sherin Ann | National Renewable Energy Laboratory |
Yao, Yiyun | National Renewable Energy Laboratory |
Ghosh, Shibani | National Renewable Energy Laboratory |
Flores-Espino, Francisco | National Renewable Energy Laboratory |
Nagarajan, Adarsh | National Renewable Energy Laboratory |
Gatsis, Nikolaos | The University of Texas at San Antonio |
Keywords: Power systems, Smart grid
Abstract: The adoption of distributed energy resources such as photovoltaics (PVs) has increased dramatically during the previous decade. The increased penetration of PVs into distribution networks (DNs) can cause voltage fluctuations that have to be mitigated. One of the key utility assets employed to this end are step-voltage regulators (SVRs). It is desirable to include tap selection of SVRs in optimal power flow (OPF) routines, a task that turns out to be challenging because the resultant OPF problem is nonconvex with added complexities stemming from accurate SVR modeling. While several convex relaxations based on semi-definite programming (SDP) have been presented in the literature for optimal tap selection, SDP based schemes do not scale well and are challenging to implement in large-scale planning or operational frameworks. This paper deals with the optimal tap selection (OPTS) problem for wye-connected SVRs using linear approximations of power flow equations. Specifically, the LinDist3Flow model is adopted and the effective SVR ratio is assumed to be continuous–enabling the formulation of a problem called LinDist3Flow-OPTS, which amounts to a linear program. The scalability and optimality gap of LinDist3Flow-OPTS are evaluated with respect to existing SDP-based and nonlinear programming techniques for optimal tap selection in three standard feeders, namely, the IEEE 13-bus, 123-bus, and 8500-node DNs. For all DNs considered, LinDist3Flow-OPTS achieves an optimality gap of approximately 1% or less while significantly lowering the computational burden.
|
|
WeBT15 Regular Session, Maya Ballroom VII |
Add to My Program |
Game Theory II |
|
|
Chair: Eksin, Ceyhun | Texas A&M University |
Co-Chair: Govaert, Alain | Lund University |
|
13:30-13:50, Paper WeBT15.1 | Add to My Program |
Population Games on Dynamic Community Networks |
|
Govaert, Alain | Lund University |
Zino, Lorenzo | University of Groningen |
Tegling, Emma | Lund University |
Keywords: Game theory, Large-scale systems, Network analysis and control
Abstract: IIn this letter, we deal with evolutionary game-theoretic learning processes for population games on networks with dynamically evolving communities. Specifically, we propose a novel mathematical framework in which a deterministic, continuous-time replicator equation on a community network is coupled with a closed dynamic flow process between communities, in turn governed by an environmental feedback mechanism. When such a mechanism is independent of the game-theoretic learning process, a closed-loop system of differential equations is obtained. Through a direct analysis of the system, we study its asymptotic behavior. Specifically, we prove that, if the learning process converges, it converges to a (possibly restricted) Nash equilibrium of the game, even when the dynamic flow process does not converge. Moreover, for a class of population games —two-strategy matrix games— a Lyapunov argument is employed to establish an evolutionary folk theorem that guarantees convergence to a subset of Nash equilibria, that is, the evolutionary stable states of the game. Numerical simulations are provided to illustrate and corroborate our findings.
|
|
13:50-14:10, Paper WeBT15.2 | Add to My Program |
Learning in Games with Quantized Payoff Observations |
|
Lotidis, Kyriakos | Stanford University |
Mertikopoulos, Panayotis | French National Center for Scientific Research (CNRS) |
Bambos, Nicholas | Stanford University |
Keywords: Game theory, Learning, Optimization algorithms
Abstract: This paper investigates the impact of feedback quantization on multi-agent learning. In particular, we analyze the equilibrium convergence properties of the well-known “follow the regularized leader” (FTRL) class of algorithms when players can only observe a quantized (and possibly noisy) version of their payoffs. In this information-constrained setting, we show that coarser quantization triggers a qualitative shift in the convergence behavior of FTRL schemes. Specifically, if the quantization error lies below a threshold value (which depends only on the underlying game and not on the level of uncertainty entering the process or the specific FTRL variant under study), then (i) FTRL is attracted to the game’s strict Nash equilibria with arbitrarily high probability; and (ii) the algorithm’s asymptotic rate of convergence remains the same as in the non-quantized case. Otherwise, for larger quantization levels, these convergence properties are lost altogether: players may fail to learn anything beyond their initial state, even with full information on their payoff vectors. This is in contrast to the impact of quantization in continuous optimization problems, where the quality of the obtained solution degrades smoothly with the quantization level.
|
|
14:10-14:30, Paper WeBT15.3 | Add to My Program |
Social Learning with a Self-Interested Coordinator |
|
Wei, Xupeng | University of Michigan |
Anastasopoulos, Achilleas | University of Michigan |
Keywords: Game theory, Learning, Control of networks
Abstract: Social learning refers to the process by which networked strategic agents learn an unknown state of the world by observing state-related private signals as well as other agents' actions. In the classic study of social learning by Bikhchandani, Hirshleifer, and Welch, it was shown that in this setting, information cascades occur, in which agents blindly imitate others' behavior and as a result learning stops for the whole community. Several proposals have been forwarded to mitigate this detrimental phenomenon. In this paper we consider the introduction of an information coordinator to mitigate information cascades. The coordinator commits to a contract and agents choose to enter the mechanism or not. If they enter they pay a fee and inform the coordinator of their private information (not necessarily truthfully). The coordinator, in turn, suggests an action to the agents based on his cumulative knowledge. We study a class of mechanisms that possess properties such as individual rationality for agents (i.e., agents want to enter the mechanism), truth telling, and profit maximization for the coordinator. We show the existence of such a mechanism which strictly improves social welfare, and results in strictly positive profit for the coordinator, so that agents and the coordinator are willing to adopt this approach. Furthermore, we analyze the performance of this mechanism and show significant gains on both aforementioned metrics.
|
|
14:30-14:50, Paper WeBT15.4 | Add to My Program |
Diffusion of Innovation under Limited-Trust Equilibrium |
|
Leon, Vincent | University of Illinois at Urbana-Champaign |
Etesami, S. Rasoul | University of Illinois at Urbana-Champaign |
Nagi, Rakesh | University of Illinois, Urbana-Champaign, Department of Industri |
Keywords: Game theory, Network analysis and control, Simulation
Abstract: We consider the diffusion of innovation in social networks using a game-theoretic approach. In our model, each individual plays a coordination game with its neighbors and decides what alternative product to adopt to maximize its payoff. As products are used in conjunction with others and through repeated interactions, individuals are more interested in their long-term benefits and often tend to show trustworthiness to others in order to maximize their long-term payoffs. To capture such trustworthy behavior, we deviate from the expected utility theory and use a new notion of rationality based on the limited-trust equilibrium. By incorporating such rational behavior into the product diffusion model, we analyze the convergence of emerging dynamics to their equilibrium points using a mean-field approximation. In particular, we study the convergence rate of the diffusion process using the absorption probability and the expected absorption time of a reduced-size Markov chain. Simulation shows that when individuals behave trustworthily, their long-term payoffs will increase significantly compared to the case when they are solely self-interested.
|
|
14:50-15:10, Paper WeBT15.5 | Add to My Program |
Approximate Submodularity of Maximizing Anticoordination in Network Games |
|
Das, Soham | Texas A&M University |
Eksin, Ceyhun | Texas A&M University |
Keywords: Game theory, Optimization algorithms, Network analysis and control
Abstract: We consider decentralized learning dynamics for agents in an anti-coordination network game. In the anti-coordination network game, there is a preferred action in the absence of neighbors' actions, and the utility an agent receives from the preferred action decreases as more of its neighbors select the preferred action, potentially causing the agent to select a less desirable action. The decentralized dynamics that is based on the iterated elimination of dominated strategies converge for the considered game. Given a convergent action profile, we measure anti-coordination by the number of edges in the underlying graph that have at least one agent in either end of the edge not taking the preferred action. The maximum anti-coordination (MAC) problem seeks to find an optimal set of agents to control under a finite budget so that the overall network disconnect is maximized on game convergence as a result of the dynamics. In this paper we show that the MAC is approximately submodular in line networks for any realization of the utility constants in the population. Utilizing this result, we provide a performance guarantee for the greedy agent selection algorithm for MAC. Finally, we use a computational study to show the effectiveness of greedy node selection strategies to solve MAC on general bipartite networks.
|
|
15:10-15:30, Paper WeBT15.6 | Add to My Program |
Information Preferences of Individual Agents in Linear-Quadratic-Gaussian Network Games |
|
Sezer, Furkan | Texas A&M University |
Eksin, Ceyhun | Texas A&M University |
Keywords: Game theory, Network analysis and control, Agents-based systems
Abstract: We consider linear-quadratic-Gaussian (LQG) network games in which agents have quadratic payoffs that depend on their individual and neighbors' actions, and an unknown payoff-relevant state. An information designer determines the fidelity of information revealed to the agents about the payoff state to maximize the social welfare. Prior results show that full information disclosure is optimal under certain assumptions on the payoffs, i.e., it is beneficial for the average individual. In this paper, we provide conditions for general network structures based on the strength of the dependence of payoffs on neighbors' actions, i.e., competition, under which a rational agent is expected to benefit, i.e., receive higher payoffs, from full information disclosure. We find that all agents benefit from information disclosure for the star network structure when the game is homogeneous. We also identify that the central agent benefits more than a peripheral agent from full information disclosure unless the competition is strong and the number of peripheral agents is small enough. Despite the fact that all agents expect to benefit from information disclosure ex-ante, a central agent can be worse-off from information disclosure in many realizations of the payoff state under strong competition, indicating that a risk-averse central agent can prefer uninformative signals ex-ante.
|
|
WeBT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Control Techniques |
|
|
Chair: Komaee, Arash | Southern Illinois University |
Co-Chair: Brown, Philip N. | University of Colorado, Colorado Springs |
|
13:30-13:50, Paper WeBT16.1 | Add to My Program |
On the Active Disturbance Rejection Control for the Sensorless Maximum Power Point Tracking Task of a Variable Speed Wind Turbine |
|
Aguilar-Orduńa, Mario Andrés | CINVESTAV |
Sira-Ramirez, Hebertt | CINVESTAV |
Gomez Leon, Brian | Centro De Investigación Y De Estudios Avanzados |
Keywords: Electrical machine control, Power generation, Robust control
Abstract: Wind Energy Conversion Systems (WECS) are complex mechatronic systems with elaborate control schemes interacting to ensure the overall performance and safe operation. Often, mechanical sensors are eliminated either to reduce costs, lighten the weight, or for other design purposes. In lack of a position sensor, the Maximum Power Point Tracking (MPPT) strategies require estimates of the position and the velocity of the WECS. Commonly, measurement of the generator phase currents is used for the needed estimation process; nevertheless, high-frequency components, due to the switching nature of the converter, distort the measured currents. This paper introduces an improved Active Disturbance Rejection Control (ADRC) scheme for systems with output measurement noise to solve the sensorless MPPT problem of a Permanent Magnet Synchronous Generator (PMSG) based direct drive scheme in WECS.
|
|
13:50-14:10, Paper WeBT16.2 | Add to My Program |
Lyapunov Based Stochastic Stability of Human-Machine Interaction: A Quantum Decision System Approach |
|
Snow, Luke | Cornell University |
Jain, Shashwat | Cornell University |
Krishnamurthy, Vikram | Cornell University |
Keywords: Human-in-the-loop control, Markov processes
Abstract: In mathematical psychology, decision makers are modeled using the Lindbladian equations from quantum mechanics to capture important human-centric features such as order effects and violation of the sure thing principle. We consider human-machine interaction involving a quantum decision maker (human) and a controller (machine). Given a sequence of human decisions over time, how can the controller dynamically provide input messages to adapt these decisions so as to converge to a specific decision? We show via novel stochastic Lyapunov arguments how the Lindbladian dynamics of the quantum decision maker can be controlled to converge to a specific decision asymptotically. Our methodology yields a useful mathematical framework for human-sensor decision making. The stochastic Lyapunov results are also of independent interest as they generalize recent results in the literature.
|
|
14:10-14:30, Paper WeBT16.3 | Add to My Program |
Quasistatic Control of Dynamical Systems |
|
Komaee, Arash | Southern Illinois University |
Keywords: Nonlinear systems, Output regulation, Linear systems
Abstract: This paper investigates a control strategy in which the state of a dynamical system is driven slowly along a trajectory of stable equilibria. This trajectory is a continuum set of points in the state space, each one representing a stable equilibrium of the system under some constant control input. Along the continuous trajectory of such constant control inputs, a slowly varying control is then applied to the system, aimed to create a stable quasistatic equilibrium that slowly moves along the trajectory of equilibria. As a stable equilibrium attracts the state of system within its vicinity, by moving the equilibrium slowly along the trajectory of equilibria, the state of system travels near this trajectory alongside the equilibrium. Despite the disadvantage of being slow, this control strategy is attractive for certain applications, as it can be implemented based only on partial knowledge of the system dynamics. This feature is in particular important for the complex systems for which detailed dynamical models are not available.
|
|
14:30-14:50, Paper WeBT16.4 | Add to My Program |
Privacy and Optimality of Distributed Schemes for Secondary Frequency Regulation in Power Networks |
|
Khan, Kanwal Hasan | University of Cyprus |
Kasis, Andreas | University of Cyprus |
Polycarpou, Marios M. | University of Cyprus |
Timotheou, Stelios | University of Cyprus |
Keywords: Power systems, Control Systems Privacy, Network analysis and control
Abstract: The increasing participation of local generation and controllable demand units within the power network motivates the use of distributed schemes for their control. Simultaneously, it raises two issues; achieving an optimal power allocation among these units, and securing the privacy of the generation/demand profiles. This study considers the problem of designing distributed optimality schemes that preserve the privacy of the generation and controllable demand units within the secondary frequency control timeframe. We propose a consensus scheme that includes the generation/demand profiles within its dynamics, providing guarantees that those cannot be inferred from the communicated signals, even when eavesdroppers possess knowledge of the underlying system dynamics. For the proposed scheme, we provide analytic stability and optimality guarantees and show that the secondary frequency control objectives are satisfied. The presented scheme is distributed, locally verifiable and applicable to arbitrary network topologies. Our analytic results are verified with simulations on a 9-bus system, where we demonstrate that the proposed scheme enables an optimal power allocation and preserves the privacy of the generation/demand and the stability of the power network.
|
|
14:50-15:10, Paper WeBT16.5 | Add to My Program |
All Low-Quality Equilibria Are Unstable in Submodular Maximization with Communication-Denied Agents |
|
Seaton, Joshua | University of Colorado at Colorado Springs |
Brown, Philip N. | University of Colorado, Colorado Springs |
Keywords: Game theory, Distributed control, Networked control systems
Abstract: This paper considers the robustness of game-theoretic approaches to distributed submodular maximization problems, which have been used to model a wide variety of applications such as competitive facility location, distributed sensor coverage, and routing in transportation networks. Recent work showed that in this class of games, if k agents suffer a technical fault and cannot observe the actions of other agents, Nash equilibria are still guaranteed to be within a factor of k+2 of optimal. However, our paper shows that this worst-case guarantee is fragile in the sense that very low-quality equilibria are very "close" to optimal allocations. Specifically, at a very low-quality Nash equilibrium, the total payoffs of compromised agents are very close to the payoffs they would receive at an optimal allocation. At the extreme worst-case equilibria, all agents are perfectly indifferent between their equilibrium and optimal actions. Conversely, we show that if agents' equilibrium payoffs are much higher than their optimal-allocation payoffs (i.e., the equilibrium is "stable"), then this ensures that the equilibrium must be of relatively high quality. To demonstrate how this phenomenon may be exploited algorithmically, we perform simulations using the log-linear learning algorithm and show that average performance on worst-case instances is far better even than our improved analytical guarantees.
|
|
15:10-15:30, Paper WeBT16.6 | Add to My Program |
Analyzing and Mitigating of Time Delay Attack (TDA) by Using Fractional Filter Based IMC-PID with Smith Predictor |
|
Kumar, Vivek | Indian Institute of Technology, Roorkee, Uttarakhand, 247667 |
Hote, Yogesh Vijay | Indian Institute of Technology, Roorkee |
Keywords: Control applications, Cyber-Physical Security, PID control
Abstract: In this era, with a great extent of automation and connection, modern production processes are highly prone to cyber-attacks. The sensor-controller chain becomes an obvious target for attacks because sensors are commonly used to regulate production facilities. In this research, we introduce a new control configuration for the system, which is sensitive to time delay attacks (TDA), in which data transfer from the sensor to the controller is intentionally delayed. The attackers want to disrupt and damage the system by forcing controllers to use obsolete data about the system status. In order to improve the accuracy of delay identification and prediction, as well as erroneous limit and estimation for control, a new control structure is developed by an Internal Model Control (IMC) based Proportional-Integral-Derivative (PID) scheme with a fractional filter. An additional concept is included to mitigate the effect of time delay attack, i.e., the smith predictor. Simulation studies of the established control framework have been implemented with two numerical examples. The performance assessment of the proposed method has been done based on integral square error (ISE), integral absolute error (IAE) and total variation (TV).
|
|
WeBT17 Regular Session, Acapulco |
Add to My Program |
Systems Biology |
|
|
Chair: Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Co-Chair: Nieto, Cesar | University of Delaware |
|
13:30-13:50, Paper WeBT17.1 | Add to My Program |
Design of a Long-Term Memory Genetic Toggle Switch Inspired by Chromatin Modification Circuits |
|
Kwon, Ukjin | MIT |
Huang, Hsin-Ho | Massachusetts Institute of Technology |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Modeling, Markov processes, Biomolecular systems
Abstract: A genetic toggle switch, a bistable gene-regulatory network, has many biotechnology applications, from environmental sensing to therapeutics. In order for a toggle switch to be practically useful, it should be able to maintain either of its states for a sufficiently long time. While a number of bistable circuit designs have appeared, it remains a challenge to control the duration of memory of the two states due to the presence of noise. To address this problem, we propose a bacterial toggle switch design that is inspired by a chromatin modification circuit ubiquitous in mammalian systems. We specifically propose a bacterial implementation based on two DNA invertases, in which each invertase is auto-catalyzing its own expression while also catalyzing the other invertase’s repression. We perform a mathematical analysis of the time to memory loss of the circuit’s stable states in a simplified stochastic model of the system. Our analysis shows that we can increase the time to memory loss by increasing the expression rates of the invertases, allowing to design the circuit for long-term memory. As a comparison, we also analyze two additional designs based on invertases, a published one, and a simpler version of our design. We demonstrate that for these circuits, there is no design parameter that allows to extend the time to memory loss, thereby highlighting structural properties of our design necessary for long-term memory. We validate the theoretical findings by stochastic simulations of the full set of reactions describing the circuits. More broadly, our results provide criteria for designing long-term memory toggle switches in bacteria.
|
|
13:50-14:10, Paper WeBT17.2 | Add to My Program |
Differential Equation Model for the Population-Level Dynamics of a Toggle Switch with Growth-Feedback |
|
Hirsch, Dylan | Massachusetts Institute of Technology |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Biomolecular systems, Systems biology, Markov processes
Abstract: Bistable genetic circuits that can be toggled between two states have been engineered in bacterial cells for a variety of applications. These circuits often impose state-dependent resource loads on the cell, creating growth feedback. In the context of a population of cells, each with a copy of the genetic circuit, cells in either circuit state grow at different rates, thereby affecting the emergent population-level dynamics. It is generally difficult to predict how this growth heterogeneity will affect the composition of the population over time. In this work, we consider an ODE population model and evaluate its ability to predict the transient dynamics of the fraction of cells in either state. These dynamics are driven by two processes. The first is due to the difference in growth rate between the cells in the two states, while the second process arises from the probability that the circuit switches state. For the latter, we compute switching rates for the toggle switch using a Markov chain two-dimensional model and exploit the system's structure for efficient computation. We demonstrate via simulations that the ODE model well approximates the dynamics of the system obtained by a published population simulation algorithm for sufficiently large molecular counts and population sizes. The ability to approximate via ODEs the population-level dynamics of cells engineered with multi-stable circuits will be especially relevant to forward engineer such circuits for desired population dynamics.
|
|
14:10-14:30, Paper WeBT17.3 | Add to My Program |
Optimal Drug Treatment for Reducing Long-Term Drug Resistance |
|
Asnaashari, Tina | Oregon Health & Science University |
Chang, Young Hwan | Oregon Health and Science University |
Keywords: Biological systems, Systems biology, Biomedical
Abstract: The maximum-tolerated dose principle, the highest possible drug dose in the shortest possible time period, has been the standard care for cancer treatment. Although it is appealing in a homogeneous tumor settings, tumor heterogeneity and adaptation play a significant role in driving treatment failure. They are still major obstacles in cancer treatments despite great advances in modeling and cancer therapy using optimal control theory. To address this, we first generalize two population models and examine the long-term effects of differential selective treatment strategies. Second, we take into account different drug-imposed selective pressure into designing optimal treatment strategies. Numerical examples demonstrate that the proposed treatment strategy decreases long-term tumor burden by decreasing the rate of tumor adaptation.
|
|
14:30-14:50, Paper WeBT17.4 | Add to My Program |
Cell Size Control Shapes Fluctuations in Colony Population |
|
Nieto, Cesar | University of Delaware |
Vargas-Garcia, Cesar A. | Fundación Universitaria Konrad Lorenz |
Pedraza, Juan | Universidad De Los Andes |
Singh, Abhyudai | University of Delaware |
Keywords: Biological systems, Hybrid systems, Biomolecular systems
Abstract: Exponentially growing cells regulate their size by controlling their timing of division. Since two daughter cells are born as a result of this cell splitting, cell size regulation has a direct connection with cell proliferation dynamics. Recent models found more clues about this connection by suggesting that division occurs at a size-dependent rate. In this article, we propose a framework that couples the stochastic transient dynamics of both the cell size and the number of cells in the initial expansion of a single-cell-derived colony. We describe the population from the two most common perspectives. The first is known as Single Lineage: where only one cell is followed in each colony, and the second is Population Snapshots: where all cells in different colonies are followed. At a low number of cells, we propose a third perspective; Single Colony, where one tracks only cells with a common ancestor. We observe how the statistics of these three approaches are different at low numbers and how the Single Colony perspective tends to Population Snapshots at high numbers. We analyze random fluctuations from colony to colony in the number of cells. If cell division occurs at a size-dependent rate, the extent of this variability first increases with time and then decreases to near zero when the population is high. In contrast, in classical proliferation models, where cell division occurs based not on cell size but on a pure timing mechanism, fluctuations in cell number increase monotonically over time to approach a non-zero value. We systematically study these differences and the convergence speed using different size control strategies.
|
|
14:50-15:10, Paper WeBT17.5 | Add to My Program |
Model of Bacteria Mutualism in a Chemostat: Analysis and Optimization with Interval Detector |
|
Arceo, Juan Carlos | Université Nice Côte d’Azur, Inria BIOCORE, BP93, 06902 Sophia-A |
Bernard, Olivier | Inria |
Gouze, Jean-Luc | INRIA |
Keywords: Biological systems, Identification for control, Optimization
Abstract: This work focuses on a model of two bacteria growing and exchanging nutrients in a chemostat. Bacterial uptake rate is described via Michaelis-Menten equations; we assume a constant yield and metabolite production directly proportional to bacterial growth. We analyse the model via a reduced order system, then, conditions to determine existence and global stability of the equilibria are given in terms of the dilution rate. Finally, bacterial productivity is maximized; an interval detector is designed to estimate this productivity and an optimization strategy for periodic dilution rates is proposed.
|
|
15:10-15:30, Paper WeBT17.6 | Add to My Program |
Model-Based Feedforward Control of an Intra and Interspecific Competitive Population System |
|
Kurth, Anna-Carina | Institute for System Dynamics, University of Stuttgart |
Veil, Carina | University of Stuttgart |
Sawodny, Oliver | University of Stuttgart |
Keywords: Systems biology, Control applications, Distributed parameter systems
Abstract: Intra- and interspecific competitive population systems are relevant for a variety of applications, such as bioreactors or wastewater treatment plants. These systems are described by coupled hyperbolic semilinear integro partial differential equations with non-local integral boundary conditions. This type of population system has not been considered in the context of control theory in the literature to date. It is assumed that it is possible to measure both species separately, but only one control input is available, namely the dilution rate. A system analysis allows for the determination of infinitely many, but uniquely determined steady-states that are used to derive nonlinear input-output dynamics via the relation of hyperbolic partial differential equations to integral delay equations. A model inversion yields a feedforward control to control the two different outputs, which are a weighted integral over the population density. This results in a restriction in the choice of reference trajectories due to the undercount of inputs. Simulations show the great potential that can be achieved with model-based feedforward control in the context of population systems.
|
|
WeCT01 Regular Session, Tulum Ballroom A |
Add to My Program |
Hybrid Systems III |
|
|
Chair: Berger, Guillaume O. | CU Boulder |
Co-Chair: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
|
16:00-16:20, Paper WeCT01.1 | Add to My Program |
Handling Disjunctions in Signal Temporal Logic Based Control through Nonsmooth Barrier Functions |
|
Wiltz, Adrian | KTH Royal Institute of Technology |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Formal Verification/Synthesis, Hybrid systems, Constrained control
Abstract: For a class of spatio-temporal tasks defined by a fragment of Signal Temporal Logic (STL), we construct a nonsmooth time-varying control barrier function (CBF) and develop a controller based on a set of simple optimization problems. Each of the optimization problems invokes constraints that allow to exploit the piece-wise smoothness of the CBF for optimization additionally to the common gradient constraint in the context of CBFs. In this way, the conservativeness of the control approach is reduced in those points where the CBF is nonsmooth. Thereby, nonsmooth CBFs become applicable to time-varying control tasks. Moreover, we overcome the problem of vanishing gradients for the considered class of constraints which allows us to consider more complex tasks including disjunctions compared to approaches based on smooth CBFs. As a well-established and systematic method to encode spatiotemporal constraints, we define the class of tasks under consideration as an STL-fragment. The results are demonstrated in a relevant simulation example.
|
|
16:20-16:40, Paper WeCT01.2 | Add to My Program |
Automaton-Guided Control Synthesis for Signal Temporal Logic Specifications |
|
Ho, Qi Heng | University of Colorado Boulder |
Ilyes, Roland | University of Colorado Boulder |
Sunberg, Zachary | University of Colorado |
Lahijanian, Morteza | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis, Hybrid systems
Abstract: This paper presents an algorithmic framework for control synthesis of continuous dynamical systems subject to signal temporal logic (STL) specifications. We propose a novel algorithm to obtain a time-partitioned finite automaton from an STL specification, and introduce a multi-layered framework that utilizes this automaton to guide a sampling-based search tree both spatially and temporally. Our approach is able to synthesize a controller for nonlinear dynamics and polynomial predicate functions. We prove the correctness and probabilistic completeness of our algorithm, and illustrate the efficiency and efficacy of our framework on several case studies. Our results show an order of magnitude speedup over the state of the art.
|
|
16:40-17:00, Paper WeCT01.3 | Add to My Program |
Learning Fixed-Complexity Polyhedral Lyapunov Functions from Counterexamples |
|
Berger, Guillaume O. | CU Boulder |
Sankaranarayanan, Sriram | University of Colorado, Boulder |
Keywords: Stability of hybrid systems, Lyapunov methods, Learning
Abstract: We study the problem of synthesizing polyhedral Lyapunov functions for hybrid linear systems. Such functions are defined as convex piecewise linear functions, with a finite number of pieces. We first prove that deciding whether there exists an m-piece polyhedral Lyapunov function for a given hybrid linear system is NP-hard. We then present a counterexample-guided algorithm for solving this problem. The algorithm alternates between choosing a candidate polyhedral function based on a finite set of counterexamples and verifying whether the candidate satisfies the Lyapunov conditions. If the verification fails, we find a new counterexample that is added to our set. We prove that if the algorithm terminates, it discovers a valid Lyapunov function or concludes that no such Lyapunov function exists. However, our initial algorithm can be non-terminating. We modify our algorithm to provide a terminating version based on the so-called cutting-plane argument from nonsmooth optimization. We demonstrate our algorithm on numerical examples.
|
|
17:00-17:20, Paper WeCT01.4 | Add to My Program |
Hybrid Concurrent Learning for Hybrid Linear Regression |
|
Johnson, Ryan S. | University of California, Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Hybrid systems, Estimation, Identification
Abstract: We consider the problem of estimating a vector of unknown constant parameters for a linear regression model whose input and output signals are hybrid -- that is, they exhibit both continuous (flow) and discrete (jump) evolution. Using a hybrid systems framework, we propose a hybrid algorithm capable of operating during both flows and jumps, that utilizes current measurements alongside stored data for adaptation. We show that our algorithm guarantees exponential convergence of the parameter estimate to the true value under a new notion of excitation that relaxes both the classical continuous-time and discrete-time persistence of excitation conditions and a recently proposed hybrid persistence of excitation condition. Simulation results show the merits of our proposed approach.
|
|
17:20-17:40, Paper WeCT01.5 | Add to My Program |
Deep Neural Network-Based Adaptive FES-Cycling Control: A Hybrid Systems Approach (I) |
|
Griffis, Emily | University of Florida |
Isaly, Axton | University of Florida |
Le, Duc M. | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Lyapunov methods, Adaptive control, Hybrid systems
Abstract: Functional electrical stimulation (FES)-cycling is a rehabilitation method for restoration of motor function in individuals with neuromuscular disorders. FES-cycling control faces challenges due to the nonlinear behavior and unstructured uncertainty of the rider’s muscle dynamics. Moreover, there exist regions in the crank cycle at which it is not kinematically efficient to stimulate certain muscles. To maximize torque output, the stimulation pattern is designed such that the FES control input switches between the different muscle groups. The resulting continuous-time and discrete-time characteristics motivate a hybrid systems approach. In this paper, a deep neural network (DNN)-based adaptive closed-loop hybrid control scheme is developed for the rider-cycle system. A hybrid system is formulated to model the hybrid behavior of both the FES-cycling system and adaptive DNN updates. Unknown dynamics of the system are approximated using a feedforward DNN estimate in the developed motor input controller, and a Lyapunov stability-derived weight adaptation law is developed for real-time estimation of the DNN outer-layer weights. Asymptotic convergence of the position and cadence tracking errors is guaranteed with a hybrid systems analysis using Lyapunov-based techniques.
|
|
17:40-18:00, Paper WeCT01.6 | Add to My Program |
Sufficient Conditions for Optimality in Finite-Horizon Two-Player Zero-Sum Hybrid Games |
|
J. Leudo, Santiago | University of California, Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Hybrid systems, Game theory, Optimal control
Abstract: A finite-horizon two-player zero-sum game under dynamic constraints given in terms of hybrid dynamical systems is formulated in this paper. Sufficient conditions that consist of a hybrid version of the Hamilton-Jacobi-Isaacs equations are provided to guarantee that a pure strategy is a saddle- point equilibrium for the game. It is shown that when the players select the optimal strategy, the value function can be evaluated without needing computing solutions. Using this framework, a finite-horizon optimal control problem under an adversarial action with decision-making agents exhibiting hybrid dynamics is addressed.
|
|
WeCT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Linear Parameter-Varying Systems |
|
|
Chair: Sename, Olivier | Grenoble INP / GIPSA-Lab |
Co-Chair: Torres, Lizeth | Universidad Nacional AutÓnoma De MÉxico |
|
16:00-16:20, Paper WeCT02.1 | Add to My Program |
Deep-Learning-Based Identification of LPV Models for Nonlinear Systems |
|
Verhoek, Chris | Eindhoven University of Technology |
Beintema, Gerben Izaak | Eindhoven University of Technology |
Haesaert, Sofie | Eindhoven University of Technology |
Schoukens, Maarten | Eindhoven University of Technology |
Tóth, Roland | Eindhoven University of Technology |
Keywords: Linear parameter-varying systems, Machine learning, Nonlinear systems identification
Abstract: The Linear Parameter-Varying (LPV) framework provides a modeling and control design toolchain to address nonlinear (NL) system behavior via linear surrogate models. Despite major research effort on LPV data-driven modeling, a key shortcoming of the current identification theory is that often the scheduling variable is assumed to be a given measured signal in the data set. In case of identifying an LPV model of a NL system, the selection of the scheduling map, which describes the relation to the measurable scheduling signal, is put on the users' shoulder, with only limited supporting tools available. This choice however greatly affects the usability and complexity of the resulting LPV model. This paper presents a deep-learning-based approach to provide joint estimation of a scheduling map and an LPV state-space model of a NL system from input-output data, and has consistency guarantees under general innovation-type noise conditions. Its efficiency is demonstrated on a realistic identification problem.
|
|
16:20-16:40, Paper WeCT02.2 | Add to My Program |
On Minimal LPV State-Space Representations in Innovation Form: An Algebraic Characterization |
|
Rouphael, Elie | University of Lille |
Petreczky, Mihaly | UMR CNRS 9189, Ecole Centrale De Lille |
Belkoura, Lotfi | Université De Lille |
Keywords: Linear parameter-varying systems, Subspace methods, Stochastic systems
Abstract: In this paper we will propose a definition of the concept of minimal state-space representations in innovation form for LPV. We also present algebraic conditions for a stochastic LPV state-space representation to be minimal in forward innovation form and discuss an algorithm for transforming any stochastic LPV state-space representation to a minimal one in innovation form.
|
|
16:40-17:00, Paper WeCT02.3 | Add to My Program |
L2 Observer Design for Singular Lypschitz Linear Parameter-Varying (S-LPV) Systems |
|
Do, Manh-Hung | CNRS GIPSA-Lab |
Koenig, Damien | Grenoble-INP |
Theilliol, Didier | Universite De Lorraine |
Torres, Lizeth | Universidad Nacional AutÓnoma De MÉxico |
Keywords: Linear parameter-varying systems, Uncertain systems, Optimization
Abstract: The main contribution of this paper is an observer design based on mathcal{L}_2 optimization for a class of Singular Lypschitz Linear Parameter-Varying (S-LPV). Thanks to a strict LMI solution, the observer not only relaxes the existing UI-decoupling constraint in the conventional UI observer but also ensures the stability of estimation dynamics under the presence of Lipschitz nonlinearity and time-varying parameters. This proposed synthesis allows to extend the results of the UI Observer for a large class of system (singular, nonlinearity, parameter variant) and by relaxing the UI decoupling condition. Finally, a numerical example is illustrated to highlight the advantages and drawbacks of the proposed design.
|
|
17:00-17:20, Paper WeCT02.4 | Add to My Program |
Conversion from Unstructured LPV Controllers to Observer-Structured LPV Controllers |
|
Sato, Masayuki | Japan Aerospace Exploration Agency |
Sebe, Noboru | Kyushu Institute of Technology |
Keywords: Observers for Linear systems, Linear parameter-varying systems, Uncertain systems
Abstract: This note addresses the conversion problem from unstructured Linear Parameter-Varying (LPV) controllers, which are designed {it a priori} for LPV plant systems possessing parametric uncertainties as well as scheduling parameters, to observer-structured LPV controllers. In general, LPV controllers designed using parametrically dependent Linear Matrix Inequalities (LMIs) have no special structures. On this issue, we propose ``observer-structured LPV controllers'' which can be defined even for LPV plant systems without knowledge of the nominal state-space matrices of the LPV plant systems, and we address the conversion problem from {it a priori} designed unstructured LPV controllers to observer-structured LPV controllers in order to embed observer property without changing controllers' input-to-output properties. To this end, we first parametrize the state-space matrices of observer-structured LPV controllers using those of the unstructured LPV controllers, single free matrices and constant state transformation matrices, and then give a formulation in terms of parametrically multi-affine LMIs to obtain the optimal constant state transformation matrices with respect to induced L_2/l_2 norm from external input to plant state estimation errors. A numerical example is introduced to demonstrate the usefulness of our propositions.
|
|
17:20-17:40, Paper WeCT02.5 | Add to My Program |
Combined LPV and Ultra-Local Model-Based Control Design Approach for Autonomous Vehicles |
|
Fenyes, Daniel | Institute for Computer Science and Control (SZTAKI) |
Hegedűs, Tamás | Budapest University of Technology and Economics |
Nemeth, Balazs | SZTAKI Institute for Computer Science and Control |
Szabo, Zoltan | SZTAKI |
Gaspar, Peter | SZTAKI |
Keywords: Robust adaptive control, Autonomous vehicles, Nonlinear systems
Abstract: The paper presents a novel approach for controlling highly nonlinear systems using the combination of LPV framework and ultra-local model-based solution. Firstly, a new formulation of the ultra-local model is presented, by which the implementation-related issues can be overcome. Secondly, an extended state-space representation is introduced, which includes the nominal model of the considered nonlinear system and the effect of the ultra-local model. This extended state-space representation serves as the basis of LPV-based control design. In this way, the stability of the closed-loop system can be guaranteed, while the variation of the tuning parameter alpha of the ultra-local model can also be handled. The effectiveness and the operation of the proposed control strategy are demonstrated through a vehicle-oriented control problem.
|
|
17:40-18:00, Paper WeCT02.6 | Add to My Program |
Unified Generalized H2 Nonlinear Parameter Varying Observer: Application to Automotive Suspensions |
|
Tran, Gia Quoc Bao | Mines Paris, Université PSL |
Pham, Thanh-Phong | The University of Danang-University of Technology and Education |
Sename, Olivier | Grenoble INP / GIPSA-Lab |
Keywords: Linear parameter-varying systems, Observers for Linear systems, Automotive control
Abstract: This paper extends the unified observer design problem to the class of Nonlinear Parameter Varying (NLPV) systems with parameter dependence in both the dynamics and the control input matrices. First, parameterization of the observer matrices, herein generalized for the NLPV case, allows us to decouple the input disturbance from the estimation error. Then, the vanishing disturbance caused by the nonlinearity is bounded by the Lipschitz property and the effect of measurement noise on the error is minimized using the generalized H2 condition. Both objectives are combined into a single framework thanks to the S-procedure. Furthermore, the asymptotic stability of the error is tackled using a parameter-dependent Lyapunov function, then a grid-based Linear Matrix Inequalities (LMIs) solution is provided, which reduces conservatism. The efficiency of this observer is illustrated and compared with an LPV observer through the damper force estimation problem, a crucial topic in semi-active suspensions.
|
|
WeCT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Autonomous Robots |
|
|
Chair: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Co-Chair: Cristofaro, Andrea | Sapienza University of Rome |
|
16:00-16:20, Paper WeCT03.1 | Add to My Program |
Autonomous Navigation of Interconnected Tethered Drones in a Partially Known Environment with Obstacles |
|
Bolognini, Michele | Politecnico Di Milano |
Saccani, Danilo | Politecnico Di Milano |
Cirillo, Fabrizio | Politecnico Di Milano University |
Fagiano, Lorenzo | Politecnico Di Milano |
Keywords: Autonomous systems, Optimization, Autonomous vehicles
Abstract: Systems of tethered multicopters are multi-copter drones connected one to the other by an electric tether providing energy and communication. They are capturing the interest of researchers and industry due to their versatility and prolonged flight time. The tether enables continuous power supply from ground and fast, reliable, all-to-all communication. Nevertheless, it couples the vehicles, introducing range limitations and challenging control and navigation problems, in particular in presence of obstacles that are not fully known a-priori. To solve these challenges, a new approach based on a combination of off-line and real-time optimization is proposed. An off-line mission planning method is used to find optimal configurations for these peculiar systems in the nominal environment, guaranteeing safety with regards to the presence of the tether and obstacles. Furthermore, an on-line, real-time, reactive tracking algorithm based on Model Predictive Control is presented to bring the system to the aforementioned optimal configuration, coping with the presence of unexpected static obstacles. Such procedure also ensures collision avoidance, both for the vehicles and for the tether connecting them. The mentioned contributions are validated through simulation with realistic models of the drones and tethers.
|
|
16:20-16:40, Paper WeCT03.2 | Add to My Program |
Safe Robot Navigation in a Crowd Combining NMPC and Control Barrier Functions |
|
Vulcano, Veronica | Sapienza Universitŕ Di Roma |
Tarantos, Spyridon | Sapienza University of Rome |
Ferrari, Paolo | Sapienza University of Rome |
Oriolo, Giuseppe | Universita Di Roma |
Keywords: Autonomous robots, Robotics, Predictive control for nonlinear systems
Abstract: We propose a sensor-based scheme for safe robot navigation in a crowd of moving humans. It consists of two modules, i.e., the crowd prediction and motion generation module, which run sequentially during every sampling interval. Using information acquired online by an on-board sensor, the crowd prediction module foresees the future motion of the humans in the robot surroundings. Based on such prediction, the motion generation module produces feasible commands to safely drive the robot among the humans by combining a nonlinear Model Predictive Control (NMPC) algorithm with collision avoidance constraints formulated via discrete-time Control Barrier Functions (CBFs). We show the effectiveness of the proposed approach via simulations obtained in CoppeliaSim on the Pioneer 3-DX mobile robot in scenarios of different complexity.
|
|
16:40-17:00, Paper WeCT03.3 | Add to My Program |
Safe Trajectory Tracking Using Closed-Form Controllers Based on Control Barrier Functions |
|
Cristofaro, Andrea | Sapienza University of Rome |
Ferro, Marco | Sapienza University of Roma |
Vendittelli, Marilena | Univ. of Rome La Sapienza |
Keywords: Autonomous systems, Robotics, Constrained control
Abstract: This paper considers the problem of guaranteeing avoidance of critical state space regions during tracking of reference trajectories for systems with dynamics equivalent to r-th order decoupled integrators. The necessity to avoid those critical regions during trajectory tracking may arise during the transient phase or because the reference trajectory was planned without taking into account the presence of those critical regions. A typical problem in mobile robotics, taken as reference in this paper, is the avoidance of obstacles in the robot workspace during tracking of reference state space trajectories. The proposed controller ensures a safety clearance from the forbidden regions by filtering out, when appropriate, the component of the tracking command that would eventually lead the system to enter the critical region. The method relies on the construction of first-order control barrier functions and closed-form controllers, with formal proof of safety and stability, and its effective application to wheeled mobile robots and quadrotors is demonstrated through simulation.
|
|
17:00-17:20, Paper WeCT03.4 | Add to My Program |
Interpretable Stochastic Model Predictive Control Using Distributional Reinforced Estimation for Quadrotor Tracking Systems |
|
Wang, Yanran | Imperial College London |
O'Keeffe, James | Imperial College |
Qian, Qiuchen | Imperial College London |
Boyle, David | Imperial College London |
Keywords: Autonomous robots, Stochastic optimal control, Iterative learning control
Abstract: This paper presents a novel trajectory tracker for autonomous quadrotor navigation in dynamic and complex environments. The proposed framework integrates a distributional Reinforcement Learning (RL) estimator for unknown aerodynamic effects into a Stochastic Model Predictive Controller (SMPC) for trajectory tracking. Aerodynamic effects derived from drag forces and moment variations are difficult to model directly and accurately. Most current quadrotor tracking systems therefore treat them as simple `disturbances' in conventional control approaches. We propose Quantile-approximation-based Distributional Reinforced-disturbance-estimator, an aerodynamic disturbance estimator, to accurately identify disturbances, i.e., uncertainties between the true and estimated values of aerodynamic effects. Simplified Affine Disturbance Feedback is employed for control parameterization to guarantee convexity, which we then integrate with a SMPC to achieve sufficient and non-conservative control signals. We demonstrate our system to improve the cumulative tracking errors by at least 66% with unknown and diverse aerodynamic forces compared with recent state-of-the-art. Concerning traditional Reinforcement Learning's non-interpretability, we provide convergence and stability guarantees of Distributional RL and SMPC, respectively, with non-zero mean disturbances.
|
|
17:20-17:40, Paper WeCT03.5 | Add to My Program |
Data-Driven Damage Detection and Control Adaptation for an Autonomous Underwater Vehicle |
|
Özkahraman, Özer | KTH Royal Institute of Technology |
Tajvar, Pouria | KTH, Royal Institute of Technology |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Ogren, Petter | KTH Royal Institute of Technology |
Keywords: Autonomous vehicles, Fault accomodation, Computer-aided control design
Abstract: Underwater robotic exploration missions typically involve traveling long distances without any human contact. The robots that go on such missions risk getting damaged by the unknown environment, accruing great costs and missed opportunities. Thus it is important for the robot to be able to accommodate unknown changes to its dynamics as much as possible and attempt to finish the given mission, or at the very least move itself to a retrievable position. In this paper, we show how we can detect physical changes to the robot reliably and then incorporate these changes through adapting the model to the data followed by automated control redesign. We adopt a piecewise-affine (PWA) modelling of the dynamics that is well suited for low data regime learning of the dynamics and provides a structure for computationally efficient control synthesis. We demonstrate the effectiveness of the proposed method on a combination of real robot data and simulated scenarios.
|
|
17:40-18:00, Paper WeCT03.6 | Add to My Program |
Robust Trajectory Tracking for Underactuated Quadrotors with Prescribed Performance |
|
Lapandić, Dženan | KTH Royal Institute of Technology |
Verginis, Christos | Uppsala University |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Wahlberg, Bo | KTH Royal Institute of Technology |
Keywords: Autonomous vehicles, Constrained control, Robust control
Abstract: We propose a control protocol based on the prescribed performance control (PPC) methodology for a quadrotor unmanned aerial vehicle (UAV). Quadrotor systems belong to the class of underactuated systems for which the original PPC methodology cannot be directly applied. We introduce the necessary design modifications to stabilize the considered system with prescribed performance. The proposed control protocol does not use any information of dynamic model parameters or exogenous disturbances. Furthermore, the stability analysis guarantees that the tracking errors remain inside of designer-specified time-varying functions, achieving prescribed performance independent from the control gains' selection. Finally, simulation results verify the theoretical results.
|
|
WeCT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Neural Networks III |
|
|
Chair: Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Co-Chair: Lamperski, Andrew | University of Minnesota |
|
16:00-16:20, Paper WeCT04.1 | Add to My Program |
Sampling Matters: SGD Smoothing through Importance Sampling |
|
Zancato, Luca | University of Padova |
Chiuso, Alessandro | Univ. Di Padova |
Keywords: Neural networks, Statistical learning, Randomized algorithms
Abstract: Many authors have suggested studying the loss landscape of Deep Neural Networks as a tool to understand their generalisation capabilities, the performance of optimisation algorithms as well as to tailor acceleration schemes. Nonetheless, a complete understanding of the entire learning process is still missing. This is mainly due to the complexity of the loss landscape, characterised by a rich structure with many local minima and saddle points. The purpose of this paper is to study the stochastic nature of Stochastic Gradient Descent and its link with the loss function as well as to propose a data sampling scheme which favours smoothing of the loss landscape to accelerate convergence speed. We validate our sampling scheme on AlexNet and ResNet applied to the CIFAR-10 dataset.
|
|
16:20-16:40, Paper WeCT04.2 | Add to My Program |
Neural Network Independence Properties with Applications to Adaptive Control |
|
Lamperski, Andrew | University of Minnesota |
Keywords: Neural networks, Adaptive control, Machine learning
Abstract: Neural networks form a general purpose architecture for machine learning and parameter identification. The simplest neural network consists of a single hidden layer connected to a linear output layer. It is often assumed that the components of the hidden layer correspond to linearly independent functions, but proofs of this are only known for a few specialized classes of network activation functions. This paper shows that for a wide class of activation functions, including most of the commonly used activation functions in neural network libraries, almost all choices of hidden layer parameters lead to linearly independent functions. These linear independence properties are then used to derive sufficient conditions for persistence of excitation, a condition commonly used to ensure parameter convergence in adaptive control.
|
|
16:40-17:00, Paper WeCT04.3 | Add to My Program |
Robust Classification Using Contractive Hamiltonian Neural ODEs |
|
Zakwan, Muhammad | EPFL |
Xu, Liang | École Polytechnique Fédérale De Lausanne |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Neural networks, Pattern recognition and classification, Time-varying systems
Abstract: Deep neural networks can be fragile and sensitive to small input perturbations that might cause a significant change in the output. In this paper, we employ contraction theory to improve the robustness of neural ODEs (NODEs). A dynamical system is contractive if all solutions with different initial conditions converge to each other asymptotically. As a consequence, perturbations in initial conditions become less and less relevant over time. Since in NODEs, the input data corresponds to the initial condition of dynamical systems, we show contractivity can mitigate the effect of input perturbations. More precisely, inspired by NODEs with Hamiltonian dynamics, we propose a class of contractive Hamiltonian NODEs (CH-NODEs). By properly tuning a scalar parameter, CH-NODEs ensure contractivity by design and can be trained using standard backpropagation and gradient descent algorithms. Moreover, CH-NODEs enjoy built-in guarantees of non-exploding gradients, which ensures a well-posed training process. Finally, we demonstrate the robustness of CH-NODEs on the MNIST image classification problem with noisy test datasets.
|
|
17:00-17:20, Paper WeCT04.4 | Add to My Program |
Super Resolution for Turbulent Flows in 2D: Stabilized Physics Informed Neural Networks |
|
Zayats, Mykhaylo | IBM Research |
Zimon, Malgorzata | IBM Research |
Yeo, Kyongmin | IBM Research |
Zhuk, Sergiy | IBM |
Keywords: Neural networks, Chaotic systems, Estimation
Abstract: We propose a new design of a neural network for solving a zero shot super resolution problem for turbulent flows. We embed Luenberger-type observer into the network's architecture to inform the network of the physics of the process, and to provide error correction and stabilization mechanisms. In addition, to compensate for decrease of observer's performance due to the presence of unknown destabilizing forcing, the network is designed to estimate the contribution of the unknown forcing implicitly from the data over the course of training. By running a set of numerical experiments, we demonstrate that the proposed network does recover unknown forcing from data and is capable of predicting turbulent flows in high resolution from low resolution noisy observations.
|
|
17:20-17:40, Paper WeCT04.5 | Add to My Program |
Safe-By-Repair: A Convex Optimization Approach for Repairing Unsafe Two-Level Lattice Neural Network Controllers |
|
Santa Cruz Leal, Ulices | University of California Irvine |
Ferlez, James | University of California, Irvine |
Shoukry, Yasser | University of California, Irvine |
Keywords: Neural networks, Formal Verification/Synthesis, Computer-aided control design
Abstract: In this paper, we consider the problem of repairing a data-trained Rectified Linear Unit (ReLU) Neural Network (NN) controller for a discrete-time, input-affine system. That is, we assume such a NN controller is given, and seek to repair unsafe closed-loop behavior at one known "counterexample" state, without violating closed-loop safety on a separate set of states. Our main result is an algorithm that can systematically and efficiently perform such repair, assuming that the controller has a Two-Level Lattice (TLL) architecture. In particular, we show sufficient conditions for the TLL repair problem can be formulated as two separate, but largely decoupled convex optimization problems: one of essentially local scope and one of essentially global scope. Furthermore, we use our algorithm to repair a TLL controller trained for a four-wheel-car dynamical model.
|
|
17:40-18:00, Paper WeCT04.6 | Add to My Program |
Chordal Sparsity for Lipschitz Constant Estimation of Deep Neural Networks |
|
Xue, Anton | University of Pennsylvania |
Lindemann, Lars | University of Pennsylvania |
Robey, Alexander | University of Pennsylvania |
Hassani, Hamed | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Alur, Rajeev | U Penn |
Keywords: Neural networks, LMIs, Optimization
Abstract: Computing Lipschitz constants of neural networks allows for robustness guarantees in image classification, safety in controller design, and generalization beyond the training data. As calculating Lipschitz constants of neural networks is NP-hard, techniques for estimating Lipschitz constants must navigate the trade-off between scalability and accuracy. In this work, we significantly push the scalability frontier of a semidefinite programming technique known as LipSDP while achieving zero accuracy loss. We first show that LipSDP has chordal sparsity, which allows us to derive a chordally sparse formulation that we call Chordal-LipSDP. The key benefit is that the main computational bottleneck of LipSDP, a large linear matrix inequality, can be decomposed into an equivalent collection of smaller ones — allowing Chordal-LipSDP to outperform LipSDP particularly as the network depth grows. Moreover, our formulation uses a tunable sparsity parameter that enables one to gain tighter estimates without incurring a significant computational cost. We illustrate the scalability of our approach through extensive numerical experiments.
|
|
WeCT05 Invited Session, Tulum Ballroom E |
Add to My Program |
Learning-Based Control III: Robustness and Safety |
|
|
Chair: Muller, Matthias A. | Leibniz University Hannover |
Co-Chair: Zeilinger, Melanie N. | ETH Zurich |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
16:00-16:20, Paper WeCT05.1 | Add to My Program |
A System Level Approach to Regret Optimal Control |
|
Didier, Alexandre | ETH Zurich |
Sieber, Jerome | ETH Zurich |
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Predictive control for linear systems, Constrained control, Optimal control
Abstract: We present an optimisation-based method for synthesising a dynamic regret optimal controller for linear systems with potentially adversarial disturbances and known or adversarial initial conditions. The dynamic regret is defined as the difference between the true incurred cost of the system and the cost which could have optimally been achieved under any input sequence having full knowledge of all future disturbances for a given disturbance energy. This problem formulation can be seen as an alternative to classical H 2- or H ∞-control. The proposed controller synthesis is based on the system level parametrisation, which allows reformulating the dynamic regret problem as a semi-definite problem. This yields a new framework that allows to consider structured dynamic regret problems, which have not yet been considered in the literature. For known pointwise ellipsoidal bounds on the disturbance, we show that the dynamic regret bound can be improved compared to using only a bounded energy assumption and that the optimal dynamic regret bound differs by at most a factor of 2/pi from the computed solution. Furthermore, the proposed framework allows guaranteeing state and input constraint satisfaction.
|
|
16:20-16:40, Paper WeCT05.2 | Add to My Program |
Improved Rates for Derivative Free Gradient Play in Strongly Monotone Games |
|
Drusvyatskiy, Dmitriy | University of Washington |
Fazel, Maryam | University of Washington |
Ratliff, Lillian J. | University of Washington |
Keywords: Game theory, Optimization algorithms
Abstract: The influential work of Bravo et al.~2018 shows that derivative free gradient play in strongly monotone games has complexity O(d^2/epsilon^3), where epsilon is the target accuracy on the expected squared distance to the solution. This paper shows that the efficiency estimate is actually O(d^2/epsilon^2), which reduces to the known efficiency guarantee for the method in unconstrained optimization. The argument we present simply interprets the method as stochastic gradient play on a slightly perturbed strongly monotone game to achieve the improved rate.
|
|
16:40-17:00, Paper WeCT05.3 | Add to My Program |
Safe Reinforcement Learning Via Confidence-Based Filters (I) |
|
Curi, Sebastian Martin | ETH Zürich |
Lederer, Armin | Technical University of Munich |
Hirche, Sandra | Technische Universität München |
Krause, Andreas | ETH Zurich |
Keywords: Machine learning, Constrained control, Uncertain systems
Abstract: Ensuring safety is a crucial challenge when deploying reinforcement learning (RL) to real-world systems. We develop confidence-based safety filters, a control-theoretic approach for certifying state safety constraints for nominal policies learnt via standard RL techniques, based on probabilistic dynamics models. Our approach is based on a reformulation of state constraints in terms of cost functions, reducing safety verification to a standard RL task. By exploiting the concept of hallucinating inputs, we extend this formulation to determine a ``backup’’ policy which is safe for the unknown system with high probability. The nominal policy is minimally adjusted at every time step during a roll-out towards the backup policy, such that safe recovery can be guaranteed afterwards. We provide formal safety guarantees, and empirically demonstrate the effectiveness of our approach.
|
|
17:00-17:20, Paper WeCT05.4 | Add to My Program |
Performance-Robustness Tradeoffs in Adversarially Robust Linear-Quadratic Control (I) |
|
Lee, Bruce | University of Pennsylvania |
Zhang, Thomas | University of Pennsylvania |
Hassani, Hamed | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Keywords: Robust control
Abstract: While Hinfinity methods can introduce robustness against worst-case perturbations, their nominal performance under conventional stochastic disturbances is often drastically reduced. Though this fundamental tradeoff between nominal performance and robustness is known to exist, it has not been quantitatively characterized. Toward addressing this issue, we borrow from the increasingly ubiquitous notion of adversarial training from machine learning to construct a class of controllers which are optimized for disturbances consisting of mixed stochastic and worst-case components. We find that this problem admits a stationary optimal controller that has a simple analytic form closely related to suboptimal Hinfinity solutions. We then provide a quantitative performance-robustness tradeoff analysis, in which system-theoretic properties such as controllability and stability explicitly manifest in an interpretable manner. This provides practitioners with general guidance for determining how much robustness to incorporate based on a priori system knowledge. We empirically validate our results by comparing the performance of our controller against standard baselines, and plotting tradeoff curves.
|
|
17:20-17:40, Paper WeCT05.5 | Add to My Program |
Improving the Performance of Robust Control through Event-Triggered Learning (I) |
|
von Rohr, Alexander | RWTH Aachen University, Germany |
Solowjow, Friedrich | RWTH Aachen University |
Trimpe, Sebastian | RWTH Aachen University |
Keywords: Robust control, Machine learning, Time-varying systems
Abstract: Robust controllers ensure stability in feedback loops designed under uncertainty but at the cost of performance. Model uncertainty in time-invariant systems can be reduced by recently proposed learning-based methods, which improve the performance of robust controllers using data. However, in practice, many systems also exhibit uncertainty in the form of changes over time, e.g., due to weight shifts or wear and tear, leading to decreased performance or instability of the learning-based controller. We propose an event-triggered learning algorithm that decides when to learn in the face of uncertainty in the LQR problem with rare or slow changes. Our key idea is to switch between robust and learned controllers. For learning, we first approximate the optimal length of the learning phase via Monte-Carlo estimations using a probabilistic model. We then design a statistical test for uncertain systems based on the moment-generating function of the LQR cost. The test detects changes in the system under control and triggers re-learning when control performance deteriorates due to system changes. We demonstrate improved performance over a robust controller baseline in a numerical example.
|
|
17:40-18:00, Paper WeCT05.6 | Add to My Program |
Scalable Estimation of Invariant Sets for Mixed-Integer Nonlinear Systems Using Active Deep Learning (I) |
|
Bonzanini, Angelo Domenico | UC Berkeley |
Paulson, Joel | The Ohio State University |
Makrygiorgos, Georgios | University of California, Berkeley |
Mesbah, Ali | University of California, Berkeley |
Keywords: Learning, Nonlinear systems, Constrained control
Abstract: Set invariance is a crucial property for ensuring safe and feasible performance of closed-loop systems under state and input constraints. Classical set-theoretic methods for constructing reachable and invariant sets are generally inadequate in handling complex system dynamics and may not be scalable to high-dimensional systems. This paper presents a sample-efficient approach for data-driven estimation of invariant sets for constrained nonlinear systems that can exhibit a mixture of continuous, discrete, and/or switching-mode behavior. The approach relies on learning an oracle that verifies if a given system state is feasible. Thus, the invariant set construction problem is converted to a classification problem that can be effectively solved with deep learning. We also present an active learning algorithm to improve the sample efficiency of deep learning-based estimation of the feasibility oracle. Randomized verification is then used to provide probabilistic guarantees for set invariance. The proposed approach does not impose any assumptions on the structure of system dynamics, and is particularly suitable when the feasibility test for control invariance requires solving (expensive) mixed-integer nonlinear programs. The approach is illustrated on a benchmark problem.
|
|
WeCT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Estimation and Input Design |
|
|
Chair: Gharesifard, Bahman | University of California, Los Angeles |
Co-Chair: Bazanella, Alexandre S. | Univ. Federal Do Rio Grande Do Sul |
|
16:00-16:20, Paper WeCT06.1 | Add to My Program |
Application-Oriented Input Design with Low Coherence Constraint |
|
Parsa, Javad | KTH Royal Inst. of Tech |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Hjalmarsson, Hĺkan | KTH Royal Inst. of Tech |
Keywords: Identification, Estimation, Linear systems
Abstract: In optimal input design input sequences are typically generated without paying attention to the correlations between the regressors of the model to be estimated. In fact, in many cases high correlations are beneficial. This is in contrast to the requirements in sparse estimation. Mutual coherence is the maximum of these correlations, and in case the parameter vector is known to be sparse, we need a low mutual coherence in order to estimate it accurately. This contribution proposes adding a constraint on the mutual coherence to the optimal input design problem to improve the accuracy of estimated sparse models. The proposed method can be combined with any sparse estimation algorithm to estimate the parameters of a model. However, we focus in particular on the bound on the mutual coherence required for Orthogonal Matching Pursuit (OMP), a well-known algorithm in sparse estimation. Furthermore, we analyze the effect of the proposed method on the required input power. Finally, we evaluate, in a numerical study, the performance of the proposed method compared to state-of-the-art algorithms for input design.
|
|
16:20-16:40, Paper WeCT06.2 | Add to My Program |
Uniqueness of Induction Machine Parameters Estimated from Data - Identifiability, Priors and Prediction Error Identification |
|
Bazanella, Alexandre S. | Univ. Federal Do Rio Grande Do Sul |
Eckhard, Diego | Universidade Federal Do Rio Grande Do Sul |
Pereira, Luis Alberto | Universidade Federal Do RIo Grande Do Sul |
Perin, Matheus | Instituto Federal Do Rio Grande Do Sul |
Keywords: Identification, Electrical machine control, Nonlinear systems identification
Abstract: Unique identification of all the electrical and mechanical parameters of an induction machine is not possible with typical experimental data. Methods for the experimental estimation of these parameters usually cope with this limitation, which is inherent to the model's structure, by adding some prior information on the parameters or renouncing to the ambition of estimating all the parameters. We present a formal identifiability analysis that shows: i) exactly what can not be identified with each different set of measurements that can be conceived for an induction machine; ii) what is needed as additional information (i.e. prior) to uniquely determine the parameter values for each set of measurements. Then we propose to perform the identification with the standard prediction error formulation plus the inclusion of the required prior(s) and illustrate, by means of an example, that this can be done with data collected in only one standard no-load startup transient.
|
|
16:40-17:00, Paper WeCT06.3 | Add to My Program |
Excitation Allocation for Generic Identifiability of Linear Dynamic Networks with Fixed Modules |
|
Dreef, H.J. | Eindhoven University of Technology |
Shi, Shengling | Delft University of Technology |
Cheng, Xiaodong | University of Cambridge |
Donkers, M.C.F. | Eindhoven University of Technology |
Van den Hof, Paul M.J. | Eindhoven University of Technology |
Keywords: Identification, Network analysis and control
Abstract: Identifiability of linear dynamic networks requires the presence of a sufficient number of external excitation signals. The problem of allocating a minimal number of external signals for guaranteeing generic network identifiability has been recently addressed in the literature. Here we will extend that work by explicitly incorporating the situation that some network modules are known, and thus are fixed in the parametrized model set. The graphical approach introduced earlier is extended to this situation, showing that the presence of fixed modules reduces the required number of external signals. An algorithm is presented that allocates the external signals in a systematic fashion.
|
|
17:00-17:20, Paper WeCT06.4 | Add to My Program |
A Teacher-Student Markov Decision Process-Based Framework for Online Correctional Learning |
|
Lourenço, Inęs | KTH Royal Institute of Technology |
Winqvist, Rebecka | KTH Royal Institute of Technology |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Wahlberg, Bo | KTH Royal Institute of Technology |
Keywords: Identification, Statistical learning, Estimation
Abstract: A classical learning setting typically concerns an agent/student who collects data, or observations, from a system in order to estimate a certain property of interest. Correctional learning is a type of cooperative teacher-student framework where a teacher, who has partial knowledge about the system, has the ability to observe and alter (correct) the observations received by the student in order to improve the accuracy of its estimate. In this paper, we show how the variance of the estimate of the student can be reduced with the help of the teacher. We formulate the corresponding online problem – where the teacher has to decide, at each time instant, whether or not to change the observations due to a limited budget – as a Markov decision process, from which the optimal policy is derived using dynamic programming. We validate the framework in numerical experiments, and compare the optimal online policy with the one from the batch setting
|
|
17:20-17:40, Paper WeCT06.5 | Add to My Program |
LiDAR Point Cloud Registration with Formal Guarantees |
|
Marchi, Matteo | University of California, Los Angeles |
Bunton, Jonathan | University of California, Los Angeles |
Gharesifard, Bahman | University of California, Los Angeles |
Tabuada, Paulo | University of California at Los Angeles |
Keywords: Estimation, Control applications, Robust control
Abstract: In recent years, LiDAR sensors have become pervasive in the solutions to localization tasks for autonomous systems. One key step in using LiDAR data for localization is the alignment of two LiDAR scans taken from different poses, a process called scan-matching or point cloud registration. Most existing algorithms for this problem are heuristic in nature and local, meaning they may not produce accurate results under poor initialization. Moreover, existing methods give no guarantee on the quality of their output, which can be detrimental for safety-critical tasks. In this paper, we analyze a simple algorithm for point cloud registration, termed PASTA. This algorithm is global and does not rely on point-to-point correspondences, which are typically absent in LiDAR data. Moreover, and to the best of our knowledge, we offer the first point cloud registration algorithm with provable error bounds. Finally, we illustrate the proposed algorithm and error bounds in simulation on a simple trajectory tracking task.
|
|
17:40-18:00, Paper WeCT06.6 | Add to My Program |
On the Uniform Observability of the Relative Pose Estimation Problem Using Bearing Measurements and Epipolar Constraints |
|
Gintrand, Pierre | I3S Laboratory IMR 7271 UCA-CNRS |
Hua, Minh-Duc | I3s Uca-Cnrs Umr7271 |
Hamel, Tarek | Université De Nice Sophia Antipolis |
Varra, Guillaume | Airbus Helicopters |
Keywords: Observers for nonlinear systems, Estimation, Sensor fusion
Abstract: This paper proposes a comprehensive observability analysis of the relative pose estimation of a monocular camera (moving in three-dimensional space) from bearing measurements and epipolar constraints. It extends our previous work on observer design for the particular case of 3-source points with unknown 3D coordinates. The paper addresses the observability analysis of the more general case of n-source points ( n ≥ 3) using persistence of excitation of the translational motion and bearing references (or equivalently, the position of the origin of the reference frame with respect to the source points). The key contribution of this work is to show that the persistence of excitation is not enough to guarantee uniform observability. In particular, we show that uniform observability also depends on bearing references and the number of observed source points.
|
|
WeCT07 Invited Session, Tulum Ballroom G |
Add to My Program |
Online Learning, Optimization, and Game Theory II |
|
|
Chair: Dall'Anese, Emiliano | University of Colorado Boulder |
Co-Chair: Doan, Thinh T. | Virginia Tech |
Organizer: Doan, Thinh T. | Virginia Tech |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Zhang, Kaiqing | MIT |
|
16:00-16:20, Paper WeCT07.1 | Add to My Program |
Accelerated Gradient Approach for Neural Network Adaptive Control of Nonlinear Systems (I) |
|
Le, Duc M. | University of Florida |
Patil, Omkar Sudhir | University of Florida |
Nino, Cristian F. | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Lyapunov methods, Adaptive control, Neural networks
Abstract: Recent connections in the adaptive control literature to continuous-time analogues of Nesterov's accelerated gradient method have led to the development of new real-time adaptation laws based on accelerated gradient methods. However, previous results assume the system's uncertainties are linear-in-the-parameters (LIP). In this paper, a new NN-based accelerated gradient adaptive controller is developed to achieve trajectory tracking in general nonlinear systems subject to unstructured uncertainties that do not satisfy the LIP assumption. Higher-order accelerated gradient-based adaptation laws are developed to generate real-time estimates of both the unknown ideal output- and hidden-layer weights of a NN. A nonsmooth Lyapunov-based method is used to guarantee the closed-loop error system achieves global asymptotic tracking. Simulations are conducted to demonstrate the improved performance from the developed method. Results show the higher-order adaptation outperforms the standard gradient-based NN adaptation by 32.3% in terms of the root mean squared function approximation error.
|
|
16:20-16:40, Paper WeCT07.2 | Add to My Program |
Constant Costate Iterations for Finite-Horizon Optimal Control with Nonlinear Dynamics (I) |
|
Tarantino, Lorenzo | Universitŕ Degli Studi Di Roma Tor Vergata |
Sassano, Mario | University of Rome, Tor Vergata |
Galeani, Sergio | Universitŕ Di Roma Tor Vergata |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Optimal control, Nonlinear systems, Iterative learning control
Abstract: A class of nonlinear finite-horizon optimal control problems is studied. We propose a solution based on an iterative strategy that relies on the linearization of the nonlinear dynamics and on the construction of the corresponding time-varying Hamiltonian dynamics. Differently from existing methods that hinge upon similar tools, the proposed strategy hinges upon the solution to a linear initial value problem and does not require at each step the (numerical) solution of a two-point boundary value problem or of a time-varying Riccati equation. The result is achieved by exploiting a time-varying change of coordinates with the objective of obtaining a constant optimal costate in the transformed coordinates.
|
|
16:40-17:00, Paper WeCT07.3 | Add to My Program |
Deep Residual Neural Network (ResNet)-Based Adaptive Control: A Lyapunov-Based Approach (I) |
|
Patil, Omkar Sudhir | University of Florida |
Le, Duc M. | University of Florida |
Griffis, Emily | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Neural networks, Lyapunov methods, Adaptive control
Abstract: Deep Neural Network (DNN)-based controllers have emerged as a tool to compensate for unstructured uncertainties in nonlinear dynamical systems. A recent breakthrough in the adaptive control literature provides a Lyapunov-based approach to derive weight adaptation laws for each layer of a fully-connected feedforward DNN-based adaptive controller. However, deriving weight adaptation laws from a Lyapunov-based analysis remains an open problem for deep residual neural networks (ResNets). This paper provides the first result on Lyapunov-derived adaptation laws for the weights of each layer of a ResNet-based adaptive controller. A nonsmooth Lyapunov-based analysis is provided to guarantee global asymptotic tracking error convergence.
|
|
17:00-17:20, Paper WeCT07.4 | Add to My Program |
Convergence Rates of Asynchronous Policy Iteration for Zero-Sum Markov Games under Stochastic and Optimistic Settings (I) |
|
Brahma, Sarnaduti | Virginia Tech |
Bai, Yitao | Virginia Tech |
Do, Duy Anh | Virginia Tech |
Doan, Thinh T. | Virginia Tech |
Keywords: Game theory, Markov processes, Mean field games
Abstract: We consider an asynchronous policy iteration method for solving the problem of sequential zero-sum Markov games. This method can be viewed as a recursive iteration for finding the fixed point of a contractive Bellman operator. Our focus in this paper is to derive the convergence rates of this method under two scenarios, namely, stochastic and optimistic settings. In the first scenario, we assume that we only have access to an unbiased estimate of the underlying Bellman operator, resulting in a stochastic variant of the policy iteration method. In the second, we consider the popular optimistic setting of the policy iteration method, where the policy improvement steps are approximately estimated. In both scenarios, we show that the asynchronous policy iteration method converges with a sublinear rate O(1/k), where k is the number of iterations. We also provide numerical simulations to illustrate our theoretical results.
|
|
17:20-17:40, Paper WeCT07.5 | Add to My Program |
Online Stochastic Gradient Methods under Sub-Weibull Noise and the Polyak-Lojasiewicz Condition (I) |
|
Kim, Seunghyun | University of Colorado Boulder |
Madden, Liam | University of Colorado Boulder |
Dall'Anese, Emiliano | University of Colorado Boulder |
Keywords: Optimization, Optimization algorithms
Abstract: This paper focuses on the online gradient and proximal-gradient methods with stochastic gradient errors. In particular, we examine the performance of the online gradient descent method when the cost satisfies the Polyak-Lojasiewicz (PL) inequality. We provide bounds in expectation and in high probability (that hold iteration-wise), with the latter derived by leveraging a sub-Weibull model for the errors affecting the gradient. The convergence results show that the instantaneous regret converges linearly up to an error that depends on the variability of the problem and the statistics of the sub-Weibull gradient error. Convergence results are then provided for the online proximal-gradient method, under the assumption that the composite cost satisfies the proximal-PL condition. In the case of static costs, we provide new bounds for the regret incurred by these methods when the gradient errors are modeled as sub-Weibull random variables. Illustrative simulations are provided to corroborate the technical findings.
|
|
17:40-18:00, Paper WeCT07.6 | Add to My Program |
Source Seeking with a Torque-Controlled Unicycle |
|
Suttner, Raik | University of Wuerzburg |
Krstic, Miroslav | University of California, San Diego |
Keywords: Autonomous vehicles, Nonholonomic systems, Stability of nonlinear systems
Abstract: We propose a novel source seeking method for a torque-controlled unicycle. Our control law leads to a positive constant forward velocity and an oscillatory rotational velocity with non-zero mean. Under suitable assumptions, we prove that the unicycle tends to a circular motion around the source. An implementation of the proposed method only requires real-time measurements of the unknown scalar signal. The feedback law for the torque involves a periodic large-amplitude high-frequency perturbation signal. A suitable averaging argument for mechanical systems under vibrational control reveals that the closed-loop system approximates the behavior of a certain averaged system. The motion of the averaged system is determined by a symmetric product of a vector field from the approximating closed-loop system. This symmetric product causes a torque which is given by the gradient of the unknown scalar signal. The gradient-based torque can lead to asymptotic stability for the averaged system, which in turn implies practical asymptotic stability for the approximating closed-loop system.
|
|
WeCT08 Regular Session, Tulum Ballroom H |
Add to My Program |
Social and Financial Networks |
|
|
Chair: Touri, Behrouz | University of California San Diego |
Co-Chair: Tegling, Emma | Lund University |
|
16:00-16:20, Paper WeCT08.1 | Add to My Program |
A Social Power Game for the Concatenated Friedkin-Johnsen Model |
|
Wang, Lingfei | Academy of Systems Science, Chinese Academy of Sciences |
Chen, Guanpu | Academy of Mathematics and Systems Science, Chinese Academy of S |
Hong, Yiguang | Chinese Academy of Sciences |
Shi, Guodong | The University of Sydney |
Altafini, Claudio | Linkoping University |
Keywords: Network analysis and control, Agents-based systems
Abstract: If a concatenated Friedkin-Johnsen model is used to describe the evolution of the opinions of stubborn agents in a sequence of discussion events, then the social power achieved by the agents at the end of the discussions depends from the stubbornness coefficients adopted by the agents through the sequence of events. In this paper we assume that the agents are free to choose their stubbornness profiles, and ask ourselves what strategy should an agent follow in order to maximize its social power. Formulating the problem as a strategic game, we show that choosing the highest possible values of stubbornness in the early discussions leads to the highest possible social power.
|
|
16:20-16:40, Paper WeCT08.2 | Add to My Program |
A Random Adaptation Perspective on Distributed Averaging |
|
Parasnis, Rohit Yashodhar | University of California San Diego |
Verma, Ashwin | University of California San Diego |
Franceschetti, Massimo | UCSD |
Touri, Behrouz | University of California San Diego |
Keywords: Networked control systems, Network analysis and control, Agents-based systems
Abstract: We propose a random adaptation variant of time-varying distributed averaging dynamics in discrete time. We show that this leads to novel interpretations of fundamental concepts in distributed averaging, opinion dynamics, and distributed learning. Namely, we show that the ergodicity of a stochastic chain is equivalent to the almost sure (a.s.) finite-time agreement attainment in the proposed random adaptation dynamics. Using this result, we provide a new interpretation for the absolute probability sequence of an ergodic chain. We then modify the base-case dynamics into a time-reversed inhomogeneous Markov chain, and we show that in this case ergodicity is equivalent to the uniqueness of the limiting distributions of the Markov chain. Finally, we introduce and study a time-varying random adaptation version of the Friedkin-Johnsen model and a rank-one perturbation of the base-case dynamics.
|
|
16:40-17:00, Paper WeCT08.3 | Add to My Program |
Multi-Dimensional Extensions of the Hegselmann-Krause Model |
|
De Pasquale, Giulia | University of Padova |
Valcher, Maria Elena | Universita' Di Padova |
Keywords: Agents-based systems, Network analysis and control, Cooperative control
Abstract: In this paper we consider two multi-dimensional Hagselmann-Krause (HK) models for opinion dynamics, by this meaning two bounded-confidence models describing how individuals' opinions on a set of topics evolve over time. The models differ in the criterion according to which individuals decide whom they want to be influenced by. In the first model, agents are influenced only by those individuals whose opinion on each single topic does not differ more than a prefixed tolerance from their own. In the second model, on the other hand, the comparison is based on the agents' "average opinions" on the various topics. While the former is a special case of a model proposed in the past and whose convergence properties have already been explored, the latter is original. In both cases we prove that the global range of opinions and the range of opinions on each single topic are non-increasing, and we show that under certain conditions the models enjoy the order-preservation property topic-wise. Finally, for the average-based model we prove that either consensus or clustering are reached in a finite number of steps.
|
|
17:00-17:20, Paper WeCT08.4 | Add to My Program |
Achieving Consensus in Networks of Increasingly Stubborn Voters |
|
Ohlin, David | Lund University |
Bencherki, Fethi | Lund |
Tegling, Emma | Lund University |
Keywords: Cooperative control, Distributed control, Agents-based systems
Abstract: We study opinion evolution in networks of stubborn agents discussing a sequence of issues, modeled through the so called concatenated Friedkin-Johnsen (FJ) model. It is concatenated in the sense that agents' opinions evolve for each issue, and the final opinion is then taken as a starting point for the next issue. We consider the scenario where agents also take a vote at the end of each issue and propose a feedback mechanism from the result (based on the median voter) to the agents' stubbornness. Specifically, agents become increasingly stubborn during issue s+1 the more they disagree with the vote at the end of issue s. We analyze this model for a number of special cases and provide sufficient conditions for convergence to consensus stated in terms of permissible initial opinion and stubbornness. In the opposite scenario, where agents become less stubborn when disagreeing with the vote result, we prove that consensus is achieved, and we demonstrate the faster convergence of opinions compared to constant stubbornness.
|
|
17:20-17:40, Paper WeCT08.5 | Add to My Program |
Control of Dynamic Financial Networks |
|
Calafiore, Giuseppe C. | Politecnico Di Torino |
Fracastoro, Giulia | Politecnico Di Torino |
Proskurnikov, Anton V. | Politecnico Di Torino |
Keywords: Finance, Control of networks, Control applications
Abstract: The current global financial system forms a highly interconnected network where a default in one of its nodes can propagate to many other nodes, causing a catastrophic avalanche effect. In this paper we consider the problem of reducing the financial contagion by introducing some targeted interventions that can mitigate the cascaded failure effects. We consider a multi-step dynamic model of clearing payments and introduce an external control term that represents corrective cash injections made by a ruling authority. The proposed control model can be cast and efficiently solved as a linear program. We show via numerical examples that the proposed approach can significantly reduce the default propagation by applying small targeted cash injections.
|
|
17:40-18:00, Paper WeCT08.6 | Add to My Program |
Feedback Control-Based Publisher Revenue Maximization in Online Advertising |
|
Karlsson, Niklas | Amazon |
Keywords: Emerging control applications, Information technology systems, Optimization algorithms
Abstract: Online publishers often monetize traffic on their webpages by selling ad inventory to advertisers via real-time bidding in open exchanges. Typically they leverage a supply-side platform (SSP) to sell the ad inventory. An SSP is a technology platform that enables publishers to manage their advertising space inventory, fill it with ads, and receive revenue. A state-of-the-art SSP solves an optimization problem, which in this article is defined as publisher revenue maximization subject to a lower bound on the SSP margin. The optimal bidding mechanism is derived and it is shown how the solution can be implemented as two separate subsystems, publisher control and bid shading optimization. Feedback control plays an important role to make the optimization scalable and adaptive. A proof of concept control system is designed and illustrated in simulations.
|
|
WeCT09 Regular Session, Maya Ballroom I |
Add to My Program |
Networked Control Systems II |
|
|
Chair: Lucia, Walter | Concordia University |
Co-Chair: Siami, Milad | Northeastern University |
|
16:00-16:20, Paper WeCT09.1 | Add to My Program |
Connectivity-Preserving Formation Tracking of Multiple Double-Integrator Systems |
|
Pan, Zini | The Chinese University of Hong Kong |
Chen, Ben M. | Chinese University of Hong Kong |
Keywords: Networked control systems, Distributed control, Cooperative control
Abstract: In this paper, we study the distributed formation tracking problem of multiple double-integrator systems with connectivity preservation over a state-dependent communication network. In particular, we modify the existing potential function to accommodate offsets between the leader and the vehicles and design a potential-function-based distributed control law that can achieve formation tracking while preserving the connectivity of the communication network. Compared with the existing results, our result does not rely on the knowledge of the bound of the initial condition of the closed-loop system.
|
|
16:20-16:40, Paper WeCT09.2 | Add to My Program |
An Approach to Distributed Estimation of Time-Varying Signals by Multi-Agent Systems |
|
Caccavale, Fabrizio | Universita Degli Studi Della Basilicata |
Pierri, Francesco | Universita Degli Studi Della Basilicata |
Keywords: Networked control systems, Sensor networks, Estimation
Abstract: In this paper, a novel distributed estimation scheme for multi-agent systems is devised, where each agent communicates only with a subset of neighbouring mates. It is assumed that a given time-varying signal is measured or computed by only a subset of agents, named sources, while the other agents, the users, are required to estimate this signal in a distributed fashion. To the purpose, each user agent runs an estimator of the signal and its derivatives up to a given order, based on a dynamic consensus scheme. Stability and performance are analyzed, simulations are run to show the effectiveness of the approach and assess its performance.
|
|
16:40-17:00, Paper WeCT09.3 | Add to My Program |
Stochastic Aperiodic Control of Networked Systems with I.i.d. Time-Varying Communication Delays |
|
Hosoe, Yohei | Kyoto University |
Keywords: Networked control systems, Stochastic systems, LMIs
Abstract: This paper studies stochastic aperiodic stabilization of a networked control system (NCS) consisting of a continuous-time plant and a discrete-time controller. The plant and the controller are assumed to be connected by communication channels with i.i.d. (independent and identically distributed) time-varying delays. The delays are theoretically not required to be bounded even when the plant is unstable in the deterministic sense. In our NCS, the sampling interval is supposed to be determined directly by such communication delays. A necessary and sufficient inequality condition is presented for designing a state-feedback controller stabilizing the NCS at sampling points in a stochastic sense. The results are also illustrated numerically.
|
|
17:00-17:20, Paper WeCT09.4 | Add to My Program |
Sensor and Actuator Scheduling in Bilinear Dynamical Networks |
|
Castello Branco de Oliveira, Arthur | Northeastern University |
Siami, Milad | Northeastern University |
Sontag, Eduardo | Northeastern University |
Keywords: Networked control systems, Time-varying systems, Agents-based systems
Abstract: In this paper, we investigate the problem of finding a sparse sensor and actuator (S/A) schedule that minimizes the approximation error between the input-output behavior of the fully sensed/actuated bilinear system and the system with the scheduling. The quality of this approximation is measured by an H2-like metric, which is defined for a bilinear (time-varying) system with S/A scheduling based on the discrete Laplace transform of its Volterra kernels. First, we discuss the difficulties of designing S/A schedules for bilinear systems, which prevented us from finding a polynomial time algorithm for solving the problem. We then propose a polynomial-time S/A scheduling heuristic that selects a fraction of sensors and node actuators at each time step while maintaining a small approximation error between the input-output behavior of the fully sensed/actuated system and the one with S/A scheduling in this H2-based sense. Numerical experiments illustrate the good approximation quality of our proposed methods.
|
|
17:20-17:40, Paper WeCT09.5 | Add to My Program |
Encrypted Cloud-Based Set-Theoretic Model Predictive Control |
|
Naseri, Amir Mohammad | Concordia University |
Lucia, Walter | Concordia University |
Youssef, Amr | Concordia University |
Keywords: Networked control systems
Abstract: We propose an encrypted set-theoretic model predictive control (ST-MPC) strategy for cloud-based networked control systems. In particular, we consider a scenario where the plant is subject to state and input constraints, and a curious but honest cloud provider is available to implement the control logic remotely. We address the inherent privacy issue by jointly using an additive homomorphic cryptosystem and a modified version of the ST-MPC algorithm, which is tailored to run on encrypted data. We show that, by leveraging a family of zonotopic inner approximations of robust one-step controllable sets and a half-space projection algorithm, we can design the unavoidable computational load on the smart actuator's side to be real-time affordable by the available hardware compared to other existing solutions. A simulation experiment, considering a two-tank water system, is presented to verify the effectiveness of the proposed approach.
|
|
17:40-18:00, Paper WeCT09.6 | Add to My Program |
Compressed Cluster Sensing in Multiagent IoT Control |
|
Uzhva, Denis | Saint Petersburg State University |
Granichin, Oleg | Saint Petersburg State University |
Granichina, Olga | Herzen State Pedagogical University |
Keywords: Agents-based systems, Networked control systems, Hierarchical control
Abstract: Traditional multiagent system control relies primarily on inter-agent local communications. In large-scale IoT systems it may appear hard to synthesize local control actions in a simple manner. Cluster control possibilities are thus thoroughly discussed, with possible control goals stated. The relation between multiagent state sparsity and cluster patterns is illustrated, for further utilization of sparsity for cluster control. Consequently, the problem of cluster identification is stated, and a possible solution is proposed in the form of the compressed cluster sensing algorithm. The algorithm utilizes compressed sensing methodology for a compact representation of agent states, which is then used to synthesize compact control actions in a low-dimensional space, without requiring to specify cluster locations.
|
|
WeCT10 Regular Session, Maya Ballroom II |
Add to My Program |
Stochastic Optimal Control II |
|
|
Chair: Tanaka, Takashi | University of Texas at Austin |
Co-Chair: Pakniyat, Ali | University of Alabama |
|
16:00-16:20, Paper WeCT10.1 | Add to My Program |
A Convex Duality Approach to Assigning Probability Distributions for Nonlinear Stochastic Systems |
|
Pakniyat, Ali | University of Alabama |
Keywords: Stochastic optimal control, Stochastic systems, Optimal control
Abstract: In order to optimally assign a desired probability distribution to the state of a nonlinear stochastic system, a convex duality approach is proposed to arrive at the associated optimality conditions. For a general class of stochastic systems governed by controlled Ito differential equations and subject to constraints on the probability distribution of the state at a fixed terminal time, a measure theoretic formulation is presented and it is shown that the original problem is embedded in a convex linear program on the space of Radon measures and that the embedding is tight, i.e., the optimal solution of both the original and the convex relaxation problems are equal. By exploiting the duality relationship between the space of continuous functions and that of measures, the associated optimality conditions are identified in the form of Hamilton-Jacobi problems where the optimization objective, in addition to the value function evaluation at the initial conditions, includes an extra term which is the integral of the product of the value function at the terminal time and the desired probability distribution. Numerical examples are provided to illustrate the results.
|
|
16:20-16:40, Paper WeCT10.2 | Add to My Program |
Non-Gaussian Chance-Constrained Trajectory Control Using Gaussian Mixtures and Risk Allocation |
|
Boone, Spencer | University of Colorado Boulder |
McMahon, Jay | University of Colorado |
Keywords: Stochastic optimal control, Aerospace
Abstract: Standard chance-constrained trajectory control algorithms typically rely on the assumption that the vehicle state uncertainties obey Gaussian distributions. While this is a valid assumption for many systems, this paper considers the class of real-life systems for which this assumption does not hold - for example, systems with highly nonlinear dynamics such as spacecraft maneuver planning problems. This paper extends the chance-constrained control formulation to consider a non-Gaussian distribution by approximating the distribution as a mixture of Gaussian distributions. The original chance constraint is then approximated as a conjunction of weighted individual chance constraints on each of the distributions in the mixture. Iterative risk allocation is used in a two-stage optimization procedure to minimize the degree of conservatism in this approximation. The method is applied to a simple impulsive stochastic spacecraft maneuver targeting problem in two-body dynamics. The resulting algorithm accurately computes control parameters that satisfy the probabilistic bounds on the non-Gaussian distribution.
|
|
16:40-17:00, Paper WeCT10.3 | Add to My Program |
Chance-Constrained Stochastic Optimal Control Via Path Integral and Finite Difference Methods |
|
Patil, Apurva | The University of Texas at Austin |
Duarte, Alfredo | The University of Texas at Austin |
Smith, Aislinn | The University of Texas at Austin |
Bisetti, Fabrizio | The University of Texas at Austin |
Tanaka, Takashi | University of Texas at Austin |
Keywords: Stochastic optimal control, Autonomous systems, Numerical algorithms
Abstract: This paper addresses a continuous-time continuous-space chance-constrained stochastic optimal control (SOC) problem via a Hamilton-Jacobi-Bellman (HJB) partial differential equation (PDE). Through Lagrangian relaxation, we convert the chance-constrained (risk-constrained) SOC problem to a risk-minimizing SOC problem, the cost function of which possesses the time-additive Bellman structure. We show that the risk-minimizing control synthesis is equivalent to solving an HJB PDE whose boundary condition can be tuned appropriately to achieve a desired level of safety. Furthermore, it is shown that the proposed risk-minimizing control problem can be viewed as a generalization of the problem of estimating the risk associated with a given control policy. Two numerical techniques are explored, namely the path integral and the finite difference method (FDM), to solve a class of risk-minimizing SOC problems whose associated HJB equation is linearizable via the Cole-Hopf transformation. Using a 2D robot navigation example, we validate the proposed control synthesis framework and compare the solutions obtained using path integral and FDM.
|
|
17:00-17:20, Paper WeCT10.4 | Add to My Program |
Risk-Averse Sequential Decision Problems with Time-Consistent Stochastic Dominance Constraints (I) |
|
Dentcheva, Darinka | Stevens Institute of Technology |
Ye, Mingsong | Stevens Institute of Technology |
Yi, Yunxuan | Stevens Institute of Technology |
Keywords: Stochastic optimal control, Uncertain systems, Variational methods
Abstract: We discuss ways of comparison of two random sequences and their application in multistage stochastic optimization problems. While various comparisons of stochastic sequences have been proposed in the literature, their integration in a sequential decision problem is non-trivial and usually results in a time-inconsistent evaluations and inconsistent decisions. We propose a framework for constructing stochastic orderings that enable time-consistent comparisons at any stage of the decision process and ensure the dominance property of the optimal policy. We use the comparisons as constraints in multi-stage stochastic problems. Particular attention is paid to constraints based on stochastic dominance of the second-order imposed conditionally; it reflects risk-averse preferences and results in problems amenable to numerical treatment. We derive optimality condition for the new problems in a special case and establish relations to the expected utility theory. Additionally, we propose a numerical method for solving the new problems.
|
|
17:20-17:40, Paper WeCT10.5 | Add to My Program |
Data-Driven Distributionally Robust Bounds for Stochastic Model Predictive Control |
|
Fochesato, Marta | ETH Zurich |
Lygeros, John | ETH Zurich |
Keywords: Stochastic systems, Stochastic optimal control, Predictive control for linear systems
Abstract: We present a distributionally robust stochastic model predictive control scheme for linear discrete-time systems subject to unbounded additive disturbance. We consider joint chance constraints over the task horizon for both the states and inputs. For settings where distributional information is unavailable and only few samples of the disturbance are accessible, we devise a tube MPC formulation where we synthesize ambiguous tubes in the Wasserstein metric. These tubes are used for constraint tightening around the nominal system and are based on the synthesis of bounds that encompass a given probability mass of the error distribution despite distributional ambiguity. The method is tested on a building temperature control problem.
|
|
17:40-18:00, Paper WeCT10.6 | Add to My Program |
Ergodic Control of a Heterogeneous Population and Application to Electricity Pricing |
|
Jacquet, Quentin | EDF R&D, INRIA, CMAP, Ecole Polytechnique |
van Ackooij, Wim Stefanus | EDF R&D |
Alasseur, Clemence | EDF R&D |
Gaubert, Stephane | INRIA and Ecole Polytechnique |
Keywords: Optimal control, Stochastic optimal control, Mean field games
Abstract: We consider a control problem for a heterogeneous population composed of customers able to switch at any time between different contracts, depending not only on the tariff conditions but also on the characteristics of each individual. A provider aims to maximize an average gain per time unit, supposing that the population is of infinite size. This leads to an ergodic control problem for a ``mean-field" MDP in which the state space is a product of simplices, and the population evolves according to a controlled linear dynamics. By exploiting contraction properties of the dynamics in Hilbert's projective metric, we show that the ergodic eigenproblem admits a solution. This allows us to obtain optimal strategies, and to quantify the gap between steady-state strategies and optimal ones. We illustrate this approach on examples from electricity pricing, and show in particular that the optimal policies may be cyclic --alternating between discount and profit taking stages.
|
|
WeCT11 Regular Session, Maya Ballroom III |
Add to My Program |
Robust Control IV |
|
|
Chair: Ramírez , Adrián | IPICYT |
Co-Chair: Selmic, Rastko | Concordia University |
|
16:00-16:20, Paper WeCT11.1 | Add to My Program |
Robust Formation Control of Nonlinear Agents with Distance Constraints |
|
Babazadeh, Reza | Concordia University |
Selmic, Rastko | Concordia University |
Keywords: Robust control, Autonomous systems, Optimal control
Abstract: This paper studies robustness of the distance-based formation control problem for a set of nonlinear agents with additive uncertainties. Directed graph theory is used to model the desired formation topology, and the task of controlling an edge is given to only one of its incident agents. We proposed a distributed, robust control method for the distance-based formation control of uncertain nonlinear agents. We designed a distributed optimal controller for the nominal system. We then modified the nominal controller for the uncertain system by combining it with an integral sliding mode control (ISMC) controller. A rigorous Lyapunov stability analysis is carried out to show the asymptotic convergence of each agent to its desired formation. Then, using the stability theory of cascaded interconnected systems, and the concept of mathematical induction, the stability of the overall formation is proven. Simulations results illustrate the effectiveness of the proposed method in two- and three-dimensional spaces.
|
|
16:20-16:40, Paper WeCT11.2 | Add to My Program |
Inverse-Inverse Reinforcement Learning. How to Hide Strategy from an Adversarial Inverse Reinforcement Learner? |
|
Pattanayak, Kunal | Cornell University |
Krishnamurthy, Vikram | Cornell University |
Berry, Christopher M | Lockheed Martin Advanced Technologies Laboratory |
Keywords: Robust control, Autonomous systems, Adaptive control
Abstract: Inverse reinforcement learning (IRL) deals with estimating an agent's utility function from its actions. In this paper, we consider how an agent can hide its strategy and mitigate an adversarial IRL attack; we call this inverse IRL (I-IRL). How should the decision maker choose its response to ensure a poor reconstruction of its strategy by an adversary performing IRL to estimate the agent's strategy? This paper comprises four results: First, we present an adversarial IRL algorithm that estimates the agent's strategy while controlling the agent's utility function. Second, we propose an I-IRL result that mitigates the IRL algorithm used by the adversary. Our I-IRL results are based on revealed preference theory in micro-economics. The key idea is for the agent to deliberately choose sub-optimal responses so that its true strategy is sufficiently masked. Third, we give a sample complexity result for our main I-IRL result when the agent has noisy estimates of the adversary-specified utility function. Finally, we illustrate our I-IRL scheme in a radar problem where a meta-cognitive radar is trying to mitigate an adversarial target.
|
|
16:40-17:00, Paper WeCT11.3 | Add to My Program |
An Integral Sliding-Mode Robust Regulation for Constrained Three-Wheeled Omnidirectional Mobile Robots |
|
Gutiérrez, Ariana | Tecnológico Nacional De México/I.T. De La Laguna |
Mera, Manuel | Esime Upt Ipn |
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Keywords: Robust control, Constrained control, Autonomous robots
Abstract: This paper proposes a solution to the regulation problem for a three–wheeled omnidirectional mobile robot (OMR) with input saturation, state constraints, parameter uncertainties, and external disturbances. The proposed robust control is composed of a linear and a nonlinear part, which can be designed independently due to the use of an integral sliding–mode control approach. The linear control deals with the input saturation, the state constraints, and the parameter uncertainties, while the nonlinear part is also saturated and can compensate the effect of some matched disturbances. Also, a safe set where the system trajectories do not transgress the state constraints and input saturation is provided. The proposed scheme guarantees asymptotic convergence to zero of the regulation error coping with the system constraints and disturbances. A constructive and simple method, based on linear matrix inequalities (LMIs), is proposed to compute the controller gains. Some simulation results illustrate the feasibility of the proposed scheme.
|
|
17:00-17:20, Paper WeCT11.4 | Add to My Program |
LMI Conditions for Stability and Hinf Control of Discrete-Time Multi-Mode Multi-Dimensional Systems |
|
Medero, Ariel | Université Grenoble Alpes (Gipsa-LAB) / Universitat Politčcnica |
Sename, Olivier | Grenoble INP / GIPSA-Lab |
Puig, Vicenc | Universitat Politčcnica De Catalunya |
Keywords: Robust control, Stability of linear systems, Switched systems
Abstract: This paper deals with stability and state feedback control of discrete-time Multi-Mode Multi-Dimensional (M3D) linear systems. The M3D switch dynamics are modeled through a state mapping describing the mode transitions. This M3D model framework then allows to consider poly-quadratic Lyapunov functions to obtain Linear Matrix Inequalities conditions for stability proof and for the synthesis of state-feedback controllers under Hinf performance. A numerical example illustrates the improvement of the controller synthesis conditions here introduced for discrete-time M3D systems over independent point-wise mode solutions.
|
|
17:20-17:40, Paper WeCT11.5 | Add to My Program |
Robust H-Infinity Delay-Based Control of a Fuel-Cell System with Time-Varying Norm-Bounded Uncertainties |
|
Ramírez, Adrián | IPICYT |
Leyva-Ramos, Jesus | Instituto Potosino De Investigacion Cientifica Y Tecnologica |
Keywords: Robust control, Delay systems, Power electronics
Abstract: Robustness and performance are key properties that can contribute to the widespread dissemination of Fuel-Cell Systems (FCS) as power sources, and a precise output voltage can serve as a reliable criterion to evaluate these properties. In this paper, we propose to utilize intentional time delays as part of controllers to achieve proper output voltage regulation in an FCS subject to norm-bounded time-varying uncertainties associated with the electrical load. For the nominal FCS, a derivative-dependent controller is first proposed and then approximated using artificial delays with the goal of obtaining a realizable controller. With the approximated controller at hand, a tuning strategy to optimize system's response is adopted and the closed-loop system is presented as an uncertain time-delay system for which we derive simple sufficient stability conditions, based on Lyapunov-Krasovskii methods, so as to achieve robustness with prescribed H-infinity performance guarantees.
|
|
17:40-18:00, Paper WeCT11.6 | Add to My Program |
Robust Passivity-Based Control of Underactuated Systems Via Neural Approximators and Bayesian Inference |
|
Ashenafi, Nardos Ayele | Boise State University |
Sirichotiyakul, Wankun | Boise State University |
Satici, Aykut C | Boise State University |
Keywords: Robotics, Machine learning, Robust control
Abstract: Inspired by passivity-based control (PBC) techniques, we propose a data-driven approach in order to learn a neural net parameterized storage function of underactuated mechanical systems. First, the PBC problem is cast as an optimization problem that searches for point estimates of the neural net parameters. Then, we improve the robustness properties of this deterministic framework against system parameter uncertainties and measurement error by injecting techniques from Bayesian inference. In the Bayesian framework, the neural net parameters are samples drawn from a posterior distribution learned via Variational Inference. We demonstrate the performance of the deterministic and Bayesian trainings on the swing-up task of an inertia wheel pendulum in simulation and real-world experiment.
|
|
WeCT12 Regular Session, Maya Ballroom IV |
Add to My Program |
Sampled-Data Control and Estimation |
|
|
Chair: Borri, Alessandro | CNR-IASI |
Co-Chair: Simpson-Porco, John W. | University of Toronto |
|
16:00-16:20, Paper WeCT12.1 | Add to My Program |
Data-Driven Model Predictive Control for Linear Time-Periodic Systems |
|
Li, Ruiqi | University of Waterloo |
Simpson-Porco, John W. | University of Toronto |
Smith, Stephen L. | University of Waterloo |
Keywords: Behavioural systems, Predictive control for linear systems, Time-varying systems
Abstract: We consider the problem of data-driven predictive control for an unknown discrete-time linear time-periodic (LTP) system of known period. Our proposed strategy generalizes both Data-enabled Predictive Control (DeePC) and Subspace Predictive Control (SPC), which are established data-driven control techniques for linear time-invariant (LTI) systems. The approach is supported by an extensive theoretical development of behavioral systems theory for LTP systems, culminating in a generalization of the fundamental lemma. Our algorithm produces results identical to standard Model Predictive Control (MPC) for deterministic LTP systems. Robustness of the algorithm to noisy data is illustrated via simulation of a regularized version of the algorithm applied to a stochastic multi-input multi-output LTP system.
|
|
16:20-16:40, Paper WeCT12.2 | Add to My Program |
Controllability of Linear Time-Varying Systems with Quantized Controls and Finite Data-Rate |
|
Scabin Vicinansa, Guilherme | University of Illinois at Urbana-Champaign |
Liberzon, Daniel | Univ of Illinois, Urbana-Champaign |
Keywords: Quantized systems, Control over communications, Time-varying systems
Abstract: In this paper, we define a notion of controllability that is suitable for digital systems, i.e., with sampling, quantization, and operating with a finite data-rate. In particular, we study that notion for linear time-varying systems by proving a necessary condition and a sufficient condition for such systems to be controllable with quantized controls and finite data-rate.
|
|
16:40-17:00, Paper WeCT12.3 | Add to My Program |
Sampled-Data Glucose Regulation with Reach-And-Stay Specifications through Time-Varying Contracts (I) |
|
Di Loreto, Ilaria | University of L'Aquila |
Borri, Alessandro | CNR-IASI |
Di Benedetto, Maria Domenica | University of L'Aquila |
Keywords: Biological systems, Control applications, Sampled-data control
Abstract: The complexity of the glucose-insulin system makes the glucose control problem a hard task to accomplish. In this context, a decentralized approach can be of help, through the exploitation of contracts theory, which allows to formalize the fulfillment of safety/invariance specifications over a system in terms of set of assumptions and guarantees over the composing subsystems. We here take a compositional model-based approach considering, as a first attempt, simplified scalar glucose and insulin subsystems. Assumptions and guarantees sets are piecewise-constant time-varying intervals, computed at sampling times, on the basis of the glucose measurements, so they are not completely known a priori. Updating the intervals may lead to temporary violation of the contracts, according to their classical definition, until the system reaches the new target set. By exploiting the property of monotonicity of the involved subsystems, we define a minimum-time reachability problem, which is solved in closed form to minimize the worst-case contract time violation, and such that the insulin subsystem is steered to a controlled invariant set (reach-and-stay specification). Simulations performed in a non-ideal scenario confirm the potential of the proposed approach.
|
|
17:00-17:20, Paper WeCT12.4 | Add to My Program |
Fixed-Point Uniform Quantization Analysis for Numerical Controllers |
|
Susca, Mircea | Technical University of Cluj-Napoca |
Mihaly, Vlad Mihai | Technical University of Cluj-Napoca |
Dobra, Petru | Technical University of Cluj |
Keywords: Quantized systems, Linear systems
Abstract: The current paper proposes a set of theoretical results regarding the closed-loop control of continuous linear time-invariant (LTI) physical processes using fixed-point discrete-time LTI regulators, interfaced using uniform data converters, by providing necessary and sufficient conditions to quantify the closed-loop stability up to a worst-case computable bound. Additionally, the results are then gathered in an algorithmic manner such that the control engineer can specify the desired performance for the numerical control system, starting from a satisfactory discrete-time regulator, and return the minimum requirements of the necessary microprocessor-based system, including state and output computation tolerances, along with analog-to-digital (ADC) and digital-to-analog (DAC) converter resolutions. An end-to-end approach is illustrated on a numerical case study through simulation, with discussions encompassing both theoretical and practical implications alike.
|
|
17:20-17:40, Paper WeCT12.5 | Add to My Program |
Latent State Space Modeling of High-Dimensional Time Series with a Canonical Correlation Objective |
|
Yu, Jiaxin | CityU of Hong Kong |
Qin, S. Joe | City University of Hong Kong |
Keywords: Statistical learning, Subspace methods, Chemical process control
Abstract: High-dimensional time series are commonly encountered in modern control systems, especially in autonomous systems. In this work, a novel parsimonious latent state space (LaSS) model is proposed to characterize the latent dynamics, achieving general latent dynamic modeling with dimension reduction. The LaSS model is optimized by alternating estimations of the dimension reduction projection and the latent state space model. Precisely, the latent state dynamics are estimated by stochastic subspace identification methods. Furthermore, the canonical correlation analysis (CCA) objective is employed to acquire the optimal predictability for the extracted latent variables. The proposed LaSS-CCA algorithm is tested on a real industrial case for its effectiveness.
|
|
17:40-18:00, Paper WeCT12.6 | Add to My Program |
Control of a Wind Energy Conversion Systems by a Novel Data-Driven Model-Free Adaptive Algorithm |
|
Corradini, Maria Letizia | Universitŕ Di Camerino |
Ippoliti, Gianluca | Universitŕ Politecnica Delle Marche (UNIVPM) |
Orlando, Giuseppe | Universitŕ Politecnica Delle Marche |
Keywords: Sampled-data control, Energy systems, Control applications
Abstract: This paper explores the possibility of applying non model-based controllers to wind energy conversion systems (WECS), and proposes the application of a novel data-driven control algorithm based on Model-Free Adaptive Control. The proposed technique makes use of an equivalent dynamic linearization model obtained adopting a dynamic linearization technique based on pseudo-partial derivatives. A stability proof of convergence of the closed loop system is provided, proving the asymptotical vanishing of the tracking error. The proposed approach has been applied to the problem of efficiency maximization of a 5MW wind turbine operating in the region of medium wind speed using the recognized high-fidelity simulation tool FAST by NREL. The obtained realistic validation data seem to support the theoretical development and confirm the potential interest on data driven controllers for WECS, in view of their flexibility, effectiveness, low cost and possible interoperability with smart production lines.
|
|
WeCT13 Regular Session, Maya Ballroom V |
Add to My Program |
Predictive Control for Nonlinear Systems III |
|
|
Chair: Chen, Wen-Hua | Loughborough University |
Co-Chair: Beckenbach, Lukas | Technische Universität Chemnitz |
|
16:00-16:20, Paper WeCT13.1 | Add to My Program |
Simplified Nonlinear Programs for NMPC Based on Active Set Construction |
|
Dyrska, Raphael | Ruhr-Universität Bochum |
Monnigmann, Martin | Ruhr-Universität Bochum |
Keywords: Predictive control for nonlinear systems, Optimal control, Constrained control
Abstract: The nonlinear program arising in nonlinear model predictive control can be simplified by constructing candidate active sets for the successor state. Instead of modifying the optimization algorithm directly, we use these active sets to anticipate the relevant constraints for the next time step and to solve a simplified nonlinear program. Since active sets are in general valid for a set of initial states, an inherent robustness with respect to additive disturbances results.
|
|
16:20-16:40, Paper WeCT13.2 | Add to My Program |
A Penalty Function Approach to Constrained Pontryagin-Based Nonlinear Model Predictive Control |
|
Pagone, Michele | Politecnico Di Torino |
Boggio, Mattia | Politecnico Di Torino |
Novara, Carlo | Politecnico Di Torino |
Proskurnikov, Anton V. | Politecnico Di Torino |
Calafiore, Giuseppe C. | Politecnico Di Torino |
Keywords: Predictive control for nonlinear systems, Optimal control, Constrained control
Abstract: A Pontryagin-based approach to solve a class of constrained Nonlinear Model Predictive Control problems is proposed, which employs the method of penalty functions for dealing with the state constraints. Unlike the existing works in literature, the proposed method is able to cope with nonlinear input and state constraints without any significant modification of the optimization algorithm. Theoretical results are tested and confirmed by numerical simulations on the Lotka-Volterra prey/predator nonlinear system.
|
|
16:40-17:00, Paper WeCT13.3 | Add to My Program |
Approximate Infinite-Horizon Predictive Control |
|
Beckenbach, Lukas | Technische Universität Chemnitz |
Streif, Stefan | Technische Universität Chemnitz |
Keywords: Predictive control for nonlinear systems, Optimal control
Abstract: Predictive control is frequently used for control problems involving constraints. Being an optimization based technique utilizing a user specified so-called stage cost, performance properties, i.e., bounds on the infinite horizon accumulated stage cost, aside closed-loop stability are of interest. To achieve good performance and to influence the region of attraction associated with the prediction horizon, the terminal cost of the predictive controller's optimization objective is a key design factor. Approximate dynamic programming refers to one particular approximation paradigm that pursues iterative cost adaptation over a state domain. Troubled by approximation errors, the associated approximate optimal controller is, in general, not necessarily stabilizing nor is its performance quantifiable on the entire approximation domain. Using a parametric terminal cost trained via approximate dynamic programming, a stabilizing predictive controller is proposed whose performance can directly be related to cost approximation errors. The controller further ensures closed-loop asymptotic stability beyond the training domain of the approximate optimal controller associated to the terminal cost.
|
|
17:00-17:20, Paper WeCT13.4 | Add to My Program |
Model Predictive Control with Preview: Recursive Feasibility and Stability |
|
Fang, Xing | Jiangnan University |
Chen, Wen-Hua | Loughborough University |
Keywords: Predictive control for nonlinear systems, Stability of nonlinear systems
Abstract: This paper proposes a stabilising model predictive control (MPC) scheme with preview information of disturbance for nonlinear systems. The proposed MPC algorithm is able to not only reject disturbance by making use of disturbance preview information as necessary, but also take advantage of the disturbance if it is good for a control task. This is realised by taking into account both the task (e.g. reference trajectory) and disturbance preview in the prediction horizon when performing online optimisation. Conditions are established to ensure recursive feasibility and stability under disturbance. First the disturbance within the horizon is augmented with the state to form a new composite system and then the stage cost function is modified accordingly. With the help of input-to-state stability theory, a terminal cost and a terminal constraint are constructed and added to the MPC algorithm with preview to guarantee its recursive feasibility and stability under a prebounded disturbance. Numerical simulation results demonstrate the effectiveness of the proposed MPC algorithm.
|
|
17:20-17:40, Paper WeCT13.5 | Add to My Program |
Event-Triggered Cloud-Based Nonlinear Model Predictive Control with Neighboring Extremal Adaptations |
|
Vahidi-Moghaddam, Amin | Miichigan State University |
Li, Zhaojian | Michigan State University |
Li, Nan | Auburn University |
Zhang, Kaixiang | Michigan State University |
Wang, Yan | Ford Research and Advanced Engineerintg, Ford Motor Company |
Keywords: Predictive control for nonlinear systems, Optimal control
Abstract: Model predictive control (MPC) is a popular optimal control approach that can explicitly deal with system safety constraints. However, its high computational cost motivates researches on various approximation methods that make MPC more practical for systems with fast dynamics and/or limited on-board computing power. Neighboring extremal (NE) is one such method that can be used to adapt to a nominal open-loop MPC sequence based on state perturbations so that it alleviates the online computational burden and extends the applicability of MPC. In this paper, a cloud-based nonlinear MPC (NMPC) framework is considered where an NMPC is first performed on the cloud by taking advantage of its powerful computation and storage capabilities. To save the computational power, instead of solving the NMPC problem at every time step, the open-loop control sequence is downloaded for on-board implementation, and the NE is employed to systematically update the control sequence based on the deviation of the measured/estimated state from the nominal state obtained by the cloud-based NMPC. However, due to request-response communication delays between the plant and the cloud, the cloud-based NMPC may not show a reasonable performance for the plant even with the NE when the deviation is large. Therefore, we develop an event-trigger scheme based on a cost criteria to strike a balance between performance and cost. Finally, the efficacy of the proposed control synthesis is demonstrated with simulations on the application of the cart inverted pendulum.
|
|
17:40-18:00, Paper WeCT13.6 | Add to My Program |
Multi-Rate Planning and Control of Uncertain Nonlinear Systems: Model Predictive Control and Control Lyapunov Functions |
|
Csomay-Shanklin, Noel | California Institute of Technology |
Taylor, Andrew | California Institute of Technology |
Rosolia, Ugo | Caltech |
Ames, Aaron | California Institute of Technology |
Keywords: Predictive control for nonlinear systems, Lyapunov methods, Nonlinear systems
Abstract: Modern control systems must operate in increasingly complex environments subject to safety constraints and input limits, and are often implemented in a hierarchical fashion with different controllers running at multiple time scales. Yet traditional constructive methods for nonlinear controller synthesis typically ``flatten'' this hierarchy, focusing on a single time scale, and thereby limited the ability to make rigorous guarantees on constraint satisfaction that hold for the entire system. In this work we seek to address the stabilization of constrained nonlinear systems through a multi-rate control architecture. This is accomplished by iteratively planning continuous reference trajectories for a nonlinear system using a linearized model and Model Predictive Control (MPC), and tracking said trajectories using the full-order nonlinear model and Control Lyapunov Functions (CLFs). Connecting these two levels of control design in a way that ensures constraint satisfaction is achieved through the use of Bezier curves, which enable planning continuous trajectories respecting constraints by planning a sequence of discrete points. Our framework is encoded via convex optimization problems which may be efficiently solved, as demonstrated in simulation.
|
|
WeCT14 Regular Session, Maya Ballroom VI |
Add to My Program |
Aerospace |
|
|
Chair: Nikolakopoulos, George | Luleĺ University of Technology |
Co-Chair: Ghosh, Satadal | Indian Institute of Technology Madras |
|
16:00-16:20, Paper WeCT14.1 | Add to My Program |
Right-Of-Way-Based Probabilistic Acceleration Velocity Obstacle |
|
Gorantla, Balaji | Indian Institute of Technology Madras |
Ghosh, Satadal | Indian Institute of Technology Madras |
Keywords: Aerospace, Autonomous systems, Autonomous vehicles
Abstract: A novel second order online local reactive motion planner, named `Right-of-Way-based Probabilistic Acceleration Velocity Obstacle' (R-PAVO), is developed in this paper in a probabilistic set-up, in which right-of-way rules posed by regulatory authorities are also embedded. The developed motion planning algorithm is capable of generating a collision-free, dynamically feasible, and goal-oriented trajectory for each unmanned vehicle (a.k.a. agent) in a multi-agent environment in the presence of uncertainties associated with information on other agents' motion and occluded regions. The considered right-of-way rules facilitate an implicit coordination among the agents that also help in avoiding any reciprocal oscillation in the agents' planned trajectories. Extensive simulation studies are carried out to demonstrate the effectiveness of the developed algorithm in terms of relevant measures of effectiveness indicating real-time implementability of the developed algorithm.
|
|
16:20-16:40, Paper WeCT14.2 | Add to My Program |
Safe Autonomous Docking Maneuvers for a Floating Platform Based on Input Sharing Control Barrier Functions |
|
Saradagi, Akshit | Indian Institute of Technology Madras |
Banerjee, Avijit | Luleĺ University of Technology |
Satpute, Sumeet | Lulea University of Technology |
Nikolakopoulos, George | Luleĺ University of Technology |
Keywords: Aerospace, Robotics, Control applications
Abstract: In this article, we present a control strategy for the problem of safe autonomous docking for a planar floating platform (Slider) that emulates the movement of a satellite. Employing the proposed strategy, Slider approaches a docking port with the right orientation, maintaining a safe distance, while always keeping a visual lock on the docking port throughout the docking maneuver. Control barrier functions are designed to impose the safety, direction of approach and visual locking constraints. Three control inputs of the Slider are shared among three barrier functions in enforcing the constraints. It is proved that the control inputs are shared in a conflict-free manner in rendering the sets defining safety and visual locking constraints forward invariant and in establishing finite-time convergence to the visual locking mode. The conflict-free input-sharing ensures the feasibility of a quadratic program that generates minimally-invasive corrections for a nominal controller, that is designed to track the docking port, so that the barrier constraints are respected throughout the docking maneuver. The efficacy of the proposed control design approach is validated through various simulations.
|
|
16:40-17:00, Paper WeCT14.3 | Add to My Program |
Gradient Direction Turn Switching Strategy for Source Localization |
|
Gowda S P, Manoj | Indian Institute of Technology Madras |
Ghosh, Satadal | Indian Institute of Technology Madras |
Keywords: Aerospace, Autonomous systems, Autonomous vehicles
Abstract: This paper addresses the problem of localizing the source of an unknown scalar field distribution using a constant-speed UAV with on-board sensors in a planar environment. The scalar field intensity is assumed to decrease monotonically in all directions from the source. The UAV can sense the signal strength of the scalar field at its position at every time-instant. A novel source localization algorithm, named 'Gradient Direction Turn Switching' (GDTS), is developed in this paper, following which the UAV is made to follow circular trajectory-segments (loops) with a nominal constant turn rate and switch its turn direction as it traverses a loop and crosses the last updated gradient estimate direction. Thus, the UAV moves towards the source by estimating the gradient direction of the field and realizing a gradient ascent mechanism facilitating it to reach within a threshold neighborhood of the source. Simulation studies of the proposed algorithm in different scenarios like stationary, moving, and maneuvering sources establish its effectiveness and real-time-implementability. Updated gradient estimates over nominal loops help in improving the performance of the GDTS under noisy sensor data compared to other switched turn strategies in existing literature, which is also validated with extensive simulations studies.
|
|
17:00-17:20, Paper WeCT14.4 | Add to My Program |
Exogenous Disturbance Estimation for Autonomous Navigation Around Small Celestial Bodies |
|
Kottayam Viswanathan, Vignesh | Lulea University of Technology |
Papadimitriou, Andreas | Luleĺ University of Technology |
Banerjee, Avijit | Luleĺ University of Technology |
Mansouri, Sina Sharif | Luleĺ University of Technology |
Nikolakopoulos, George | Luleĺ University of Technology |
Keywords: Aerospace, Autonomous systems
Abstract: In this paper, we propose the implementation of a Nonlinear Moving Horizon Estimation (NMHE) framework to estimate exogenous disturbances acting on a spacecraft for autonomous navigation around Small Celestial Bodies~(SCBs). The estimation framework is coupled with a Nonlinear Model Predictive Control (NMPC) to promote robust autonomous operations in Space. The NMHE based exogenous disturbance estimation formulates a finite horizon optimization problem while incorporating the lumped disturbances as an additional augmented state vector. Next, the estimated disturbance is utilized by the NMPC controller in a feed-forward manner. Numerous closed-loop simulations have been conducted to assess the validity of the proposed estimation and disturbance rejection framework by considering: (a) Inertial hovering around two different asteroid bodies namely 433 Eros and Ryugu and (b) unavailability of two primary accurate asteroid characterization data, the asteroid rotation rate "n_a" and the gravitational parameter "mu". The presented results are compared with an Extended Kalman Filter (EKF) and NMPC formulation to validate the efficacy of the proposed framework.
|
|
17:20-17:40, Paper WeCT14.5 | Add to My Program |
Observability Analysis of Receiver Localization Via Pseudorange Measurements from a Single LEO Satellite |
|
Kassas, Zaher | University of California, Irvine |
Sabbagh, Ralph | American University of Beirut |
Keywords: Estimation, Sensor fusion, Aerospace
Abstract: This letter presents an observability analysis for terrestrial receiver localization via pseudorange measurements extracted from a single low Earth orbit (LEO) satellite. It is shown that a stationary receiver with an unknown state (position and time) can localize itself with a LEO satellite with a known state (position, velocity, and time). In addition, bounds on the determinant of the l-step observability matrix are derived and geometric interpretations are presented indicating directions of poor observability. The implications of the analysis on observability-aided LEO satellite selection are discussed. Experimental results are presented showcasing the conclusions of the observability analysis for a receiver localizing itself with two different LEO satellites: one Starlink and one Orbcomm.
|
|
17:40-18:00, Paper WeCT14.6 | Add to My Program |
Embedding Adaptive Features in the ArduPilot Control Architecture for Unmanned Aerial Vehicles |
|
Li, Peng | Southeast University |
Liu, Di | Southeast University |
Xia, Xin | Southeast University |
Baldi, Simone | Southeast University |
Keywords: Autonomous vehicles, Adaptive control, Flight control
Abstract: The operation of Unmanned Aerial Vehicles (UAVs) is often subject to state-dependent alterations and unstructured uncertainty factors, such as unmodelled dynamics, environmental weather disturbances, aerodynamics gradients, or changes in inertia and mass due to payloads. While a large number of autopilot solutions have been proposed to operate UAVs, none of these solutions is able to counteract the effects of state-dependent and unstructured uncertainties online by parameter estimation and adaptive control techniques. This work presents a systematic integration of adaptive control into ArduPilot, a popular open-source autopilot suite maintained by a large community of UAV developers. Adaptation features are embedded in the ArduPilot control structure without altering the original architecture, to allow users to use the autopilot suite as usual. Tests show that the proposed adaptive ArduPilot provides consistent improved performance in several uncertain flight conditions. The source code of the proposed adaptive ArduPilot is released at https://github.com/Friend-Peng/Adaptive-ArduPilot-Autopilot.
|
|
WeCT15 Regular Session, Maya Ballroom VII |
Add to My Program |
Game Theory III |
|
|
Chair: Marden, Jason R. | University of California, Santa Barbara |
Co-Chair: Belgioioso, Giuseppe | ETH Zürich |
|
16:00-16:20, Paper WeCT15.1 | Add to My Program |
Avoiding Unintended Consequences: How Incentives Aid Information Provisioning in Bayesian Congestion Games |
|
Ferguson, Bryce L. | University of California, Santa Barbara |
Brown, Philip N. | University of Colorado, Colorado Springs |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Transportation networks, Behavioural systems
Abstract: When users lack specific knowledge of various system parameters, their uncertainty may lead them to make undesirable deviations in their decision making. To alleviate this, an informed system operator may elect to signal information to uninformed users with the hope of persuading them to take more preferable actions. In this work, we study public and truthful signalling mechanisms in the context of Bayesian congestion games on parallel networks. We provide bounds on the possible benefit a signalling policy can provide with and without the concurrent use of monetary incentives. We find that though revealing information can reduce system cost in some settings, it can also be detrimental and cause worse performance than not signalling at all. However, by utilizing both signalling and incentive mechanisms, the system operator can guarantee that revealing information does not worsen performance while offering similar opportunities for improvement. These findings emerge from the closed form bounds we derive on the benefit a signalling policy can provide. We provide a numerical example which illustrates the phenomenon that revealing more information can degrade performance when incentives are not used and improves performance when incentives are used.
|
|
16:20-16:40, Paper WeCT15.2 | Add to My Program |
Distributed Nash Equilibrium Seeking for Decomposable Pseudo-Gradient |
|
Gadjov, Dian | University of Toronto |
Pavel, Lacra | University of Toronto |
Keywords: Game theory, Optimization, Distributed control
Abstract: In this paper, we consider distributed Nash equilibrium seeking in partial-decision information setting. We analyze games that have a pseudo-gradient that can be decomposed into two components, where one component is the gradient of a convex function. Two classes of games that have this decomposition are potential games and cocoercive-related games. We consider standard gradient-play with consensus dynamics and show that they converge in both classes of games in continuous-time and discrete-time.
|
|
16:40-17:00, Paper WeCT15.3 | Add to My Program |
Receding Horizon Games with Coupling Constraints for Demand-Side Management |
|
Hall, Sophie | ETH |
Belgioioso, Giuseppe | ETH Zürich |
Liao-McPherson, Dominic | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Game theory, Predictive control for nonlinear systems, Smart grid
Abstract: Distributed energy storage and flexible loads are essential tools for ensuring stable and robust operation of the power grid in spite of the challenges arising from the integration of volatile renewable energy generation and increasing peak loads due to widespread electrification. This paper proposes a demand-side management policy to coordinate self-interested energy prosumers based on Receding Horizon Games, i.e., a closed-loop receding-horizon implementation of game-theoretic day-ahead planning. Practical stability and recursive constraint satisfaction of the proposed feedback control policy is proven under symmetric pricing assumptions using tools from game theory and economic model predictive control. Our numerical studies show that the proposed approach is superior to standard open-loop day-head implementations in terms of peak-shaving, disturbance rejection, and control performance.
|
|
17:00-17:20, Paper WeCT15.4 | Add to My Program |
Nash Equilibria for Scalar LQ Games: Iterative and Data-Driven Algorithms |
|
Nortmann, Benita Alessandra Lucia | Imperial College London |
Monti, Andrea | University of Rome Tor Vergata |
Mylvaganam, Thulasi | Imperial College London |
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Game theory, Numerical algorithms, Linear systems
Abstract: Determining Nash equilibrium solutions of nonzero-sum dynamic games is generally challenging. In this paper, we propose four different iterative algorithms for finding Nash equilibrium strategies of discrete-time scalar linear quadratic games, with strategy updates based on the solution of either Lyapunov or Riccati equations. Local convergence criteria are discussed. Motivated by the fact that in many practical scenarios each player in the game may have access to different (incomplete) information, we introduce purely data-driven implementations of the algorithms. This allows the players to reach a Nash equilibrium solution of the game via scheduled experiments and without knowledge of each other's performance criteria or of the system dynamics. The efficacy of the presented algorithms is illustrated via a numerical example.
|
|
17:20-17:40, Paper WeCT15.5 | Add to My Program |
Optimal Information Provision for Strategic Hybrid Workers |
|
Shah, Sohil | Massachusetts Institute of Technology |
Amin, Saurabh | Massachusetts Institute of Technology |
Jaillet, Patrick | Massachusetts Institute of Technology |
Keywords: Game theory, Stochastic systems, Modeling
Abstract: We study the problem of information provision by a strategic central planner who can publicly signal about an uncertain infectious risk parameter. Signalling leads to an updated public belief over the parameter, and agents then make equilibrium choices on whether to work remotely or in-person. The planner maintains a set of desirable outcomes for each realization of the uncertain parameter and seeks to maximize the probability that agents choose an acceptable outcome for the true parameter. We distinguish between stateless and stateful objectives. In the former, the set of desirable outcomes does not change as a function of the risk parameter, whereas in the latter it does. For stateless objectives, we reduce the problem to maximizing the probability of inducing mean beliefs that lie in intervals computable from the set of desirable outcomes. We derive the optimal signalling mechanism and show that it partitions the parameter domain into at most two intervals with the signals generated according to an interval-specific distribution. For the stateful case, we consider a practically relevant situation in which the planner seeks to enforce in-person work capacity limits that progressively get more stringent as the risk parameter increases. We show that the optimal signalling mechanism in this case can be obtained by solving a linear program. We numerically verify the improvement in achieving desirable outcomes using our information design relative to the no-information and full-information benchmarks.
|
|
17:40-18:00, Paper WeCT15.6 | Add to My Program |
Gradient-Free Nash Equilibrium Seeking in N-Cluster Games with Uncoordinated Constant Step-Sizes |
|
Pang, Yipeng | Nanyang Technological University |
Hu, Guoqiang | Nanyang Technological University, Singapore |
Keywords: Game theory, Optimization algorithms, Distributed control
Abstract: This work investigates a problem of simultaneous global cost minimization and Nash equilibrium seeking, which commonly exists in N-cluster non-cooperative games. Specifically, the agents in the same cluster collaborate to minimize a global cost function, being a summation of their individual cost functions, and jointly play a non-cooperative game with other clusters as players. For the problem settings, we suppose that the explicit analytical expressions of the agents' local cost functions are unknown, but the function values can be measured. We propose a gradient-free Nash equilibrium seeking algorithm by a synthesis of Gaussian smoothing techniques and gradient tracking. Furthermore, instead of using the uniform coordinated step-size, we allow the agents across different clusters to choose different constant step-sizes. When the largest step-size is sufficiently small, we prove a linear convergence of the agents' actions to a neighborhood of the unique Nash equilibrium under a strongly monotone game mapping condition, with the error gap being propotional to the largest step-size and the smoothing parameter. The performance of the proposed algorithm is validated by numerical simulations.
|
|
WeCT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Nonlinear Systems |
|
|
Chair: Polyakov, Andrey | Inria, Univ. Lille |
Co-Chair: Efimov, Denis | Inria |
|
16:00-16:20, Paper WeCT16.1 | Add to My Program |
Consistent Discretization of a Homogeneous Finite-Time Control for a Double Integrator |
|
Polyakov, Andrey | Inria, Univ. Lille |
Efimov, Denis | Inria |
Ping, Xubin | Xidian University |
Keywords: Nonlinear systems, Stability of nonlinear systems, Computational methods
Abstract: A discretization of a homogeneous controller for a double integrator is developed. It preserves the finite-time stability property even in the case of the sampled-time implementation of the control law. Theoretical results are supported by numerical simulations.
|
|
16:20-16:40, Paper WeCT16.2 | Add to My Program |
Making Nonlinear Systems Negative Imaginary Via State Feedback |
|
Shi, Kanghong | Australian National University |
Petersen, Ian R. | Australian National University |
Vladimirov, Igor G. | Australian National University |
Keywords: Nonlinear systems, Robust control, Uncertain systems
Abstract: This paper provides a state feedback stabilization approach for nonlinear systems of relative degree less than or equal to two by rendering them nonlinear negative imaginary (NI) systems. Conditions are provided under which a nonlinear system can be made a nonlinear NI system or a nonlinear output strictly negative imaginary (OSNI) system. Roughly speaking, an affine nonlinear system that has a normal form with relative degree less than or equal to two, after possible output transformation, can be rendered nonlinear NI and nonlinear OSNI. In addition, if the internal dynamics of the normal form are input-to-state stable, then there exists a state feedback input that stabilizes the system. This stabilization result is then extended to achieve stability for systems with a nonlinear NI uncertainty.
|
|
16:40-17:00, Paper WeCT16.3 | Add to My Program |
On Generalized Homogeneous Leader-Following Consensus |
|
Li, Min | Inria |
Polyakov, Andrey | Inria, Univ. Lille |
Zheng, Gang | INRIA |
Keywords: Nonlinear systems, Cooperative control, Variable-structure/sliding-mode control
Abstract: The Multi-agent System (MAS) with agents being linear single input plants is considered. The problem of leader-following (generalized homogeneous) consensus control protocol is studied under the assumption that a control input of the leader is unknown. It is shown that the required control protocol can be obtained as an “upgrade” of the existing linear consensus control protocol.
|
|
17:00-17:20, Paper WeCT16.4 | Add to My Program |
Outer-Approximating Controlled Reach-Avoid Sets for Polynomial Systems |
|
Zhao, Changyuan | Institute of Software |
Fan, Chuchu | Massachusetts Institute of Technology |
Xue, Bai | Institute of Software, Chinese Academy of Sciences |
Keywords: Formal Verification/Synthesis, Nonlinear systems, Stability of nonlinear systems
Abstract: In this paper we propose a semi-definite programming method for computing outer-approximations (i.e., supersets) of controlled reach-avoid sets of discrete-time polynomial systems subject to control inputs. The controlled reach-avoid set is a set of all initial states that there exists at least one control policy which steers the system starting from each of them to enter a specified target set in finite time while avoiding a given safe set till the target hit. First, a Bellman type equation, whose unique bounded solution can characterize the exact controlled reach-avoid set, is derived. By relaxing this equation, a set of quantified inequalities for outer-approximating the controlled reach-avoid set is obtained. Via comparing to a set of constraints in state-of-the-art methods on occupation measures, we find that each has its own strengths and can complement each other in outer-approximating controlled reach-avoid sets. As a consequence, we integrate them and obtain a new set of constraints, which is weaker and thus facilitates the gain of tighter outer-approximations. The resulting set of constraints can be encoded into a semi-definite program via the sum-of-squares decomposition for multivariate variables, which can be addressed efficiently via interior point methods in polynomial time. Finally, several examples demonstrate the benefits of our semi-definite programming method over existing methods on outer-approximating controlled reach-avoid sets.
|
|
17:20-17:40, Paper WeCT16.5 | Add to My Program |
Nonlinear System Level Synthesis for Polynomial Dynamical Systems |
|
Conger, Lauren | California Institute of Technology |
Li, Jing Shuang | California Institute of Technology |
Mazumdar, Eric | California Institute of Technology |
Brunton, Steven L. | University of Washington |
Keywords: Nonlinear systems, Feedback linearization
Abstract: This work introduces a controller synthesis method via system level synthesis for nonlinear systems characterized by polynomial dynamics. The resulting framework yields finite impulse response, time-invariant, closed-loop transfer functions with guaranteed disturbance cancellation. Our method generalizes feedback linearization to enable partial feedback linearization, where the cancellation of the nonlinearity is spread across a finite-time horizon. This provides flexibility to use the system dynamics to attenuate disturbances before cancellation via control, reducing the cost of control compared with feedback linearization while maintaining guarantees about disturbance rejection. This approach is illustrated on a benchmark example and on a common model for fluid flow control.
|
|
17:40-18:00, Paper WeCT16.6 | Add to My Program |
Spectral Analysis of Koopman Operator and Nonlinear Optimal Control |
|
Vaidya, Umesh | Clemson University |
Keywords: Optimal control, Stability of nonlinear systems, Robust control
Abstract: In this paper, we present an approach based on the spectral analysis of the Koopman operator for the approximate solution of the Hamilton Jacobi equation that arises while solving the optimal control problem. It is well-known that one can associate a Hamiltonian dynamical system with the Hamilton Jacobi equation. Furthermore, the Lagrangian submanifold of the Hamiltonian dynamical system play a fundamental role in solving the Hamilton Jacobi equation. We show that the principal eigenfunctions of the Koopman operator associated with the Hamiltonian dynamical system can be used in constructing the Lagrangian submanifold, thereby approximating the solution of the Hamilton Jacobi equation. We present simulation results to verify the main findings of the paper.
|
|
WeCT17 Invited Session, Acapulco |
Add to My Program |
Systems and Synthetic Biology |
|
|
Chair: d'Angelo, Massimiliano | Sapienza Universitŕ Di Roma |
Co-Chair: Palumbo, Pasquale | University of Milano-Bicocca |
Organizer: d'Angelo, Massimiliano | Sapienza Universitŕ Di Roma |
Organizer: Palumbo, Pasquale | University of Milano-Bicocca |
Organizer: Singh, Abhyudai | University of Delaware |
|
16:00-16:20, Paper WeCT17.1 | Add to My Program |
A Coarse-Grain Model for Cellular Growth Accounting for Ribosome Synthesis |
|
d'Angelo, Massimiliano | Sapienza Universitŕ Di Roma |
Palumbo, Pasquale | University of Milano-Bicocca |
Busti, Stefano | University of Milano-Bicocca, Department of Biotechnology and Bi |
Vanoni, Marco | Universitŕ Di Milano Bicocca |
Keywords: Systems biology, Modeling, Biological systems
Abstract: Protein synthesis in eukaryotes is carried out by ribosomes, large RNA–protein complexes consisting of a small and a large subunit. In this work we present a new mathematical model for cellular growth comprising both protein production and ribosome synthesis, properly accounting for both small and large subunits dynamics. The model suitably extends previous results recently exploited to deal with a modular whole-cell model working as a scaffold to connect principal cellular activities such as metabolism, growth and cycle. The qualitative analysis of the model is carried out according to a simplifying assumption on the proportion of the two ribosomal subunits in stationary growth conditions; such hypothesis is based on a reasonable biological ground. Conditions are given on the model parameters in order to ensure exponential growth and the corresponding growth rate is straightforwardly computed from the model parameters. These results are validated by numerical simulations carried out according to a set of biologically meaningful model parameters.
|
|
16:20-16:40, Paper WeCT17.2 | Add to My Program |
Continuous-Time and Sampled-Data Optimal Control of Linear Stochastic Reaction Networks (I) |
|
Briat, Corentin | ETH Zürich |
Khammash, Mustafa H. | ETH Zurich |
Keywords: Optimal control, Markov processes, Biomolecular systems
Abstract: Stochastic reaction networks form a powerful class of models for the representation of a wide variety of population models including those arising in biochemistry. The control of such networks has important implications for the control of biological systems and has therefore been a subject of recent interest. The optimal control of stochastic reaction networks, however, has been relatively little studied until now. Here, the continuous-time finite-horizon optimal control problem for linear reaction networks is formulated and solved in the Dynamic Programming framework. The results are formulated through the solution of a non-standard Riccati differential equation. The problem of the optimal sampled-data control of such networks is addressed next and solved using Hybrid Dynamic Programming. In this case, however, the solution is expressed in terms of the solution of coupled Lyapunov differential and Riccati difference equations. An example is given for illustration. The shortcomings of the approach are also discussed.
|
|
16:40-17:00, Paper WeCT17.3 | Add to My Program |
Analytical and Computational Study of the Stochastic Behavior of a Chromatin Modification Circuit (I) |
|
Bruno, Simone | Massachusetts Institute of Technology |
Williams, Ruth J. | Univ. of California at San Diego |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Biological systems, Model/Controller reduction, Markov processes
Abstract: The property of multicellular organisms that allows cells with the same genetic code to maintain distinct identities for the entire life of the organism is known as epigenetic cell memory (ECM). Recently, chromatin modifications have appeared to have a key role in ECM. Here, we conduct a stochastic analysis of a chromatin modification circuit to determine the effect of time scale separation among principal system processes on the extent to which the system can keep a stable steady state in the face of noise. To this end, we first obtain a reduced circuit model and determine an analytical expression for both the stationary probability distribution and the switching time between repressed and active chromatin states. Then, we validate these analytical results with stochastic simulations of the original full set of reactions. Our results show that when the basal decay of all chromatin marks is sufficiently slower with respect to the auto and cross-catalysis and the recruited erasure of all the marks, the stationary distribution shows bimodality, with two concentrated peaks in correspondence of the active and repressed states, but biased towards the repressed state. In accordance with these results, slower basal decay extends the memory of the active and repressed states, suggesting, more broadly, a critical design principle for long-lasting memory of gene expression states.
|
|
17:00-17:20, Paper WeCT17.4 | Add to My Program |
Fluctuation-Based Approaches to Infer Kinetics of Cell-State Switching (I) |
|
Saint-Antoine, Michael | University of Delaware |
Grima, Ramon | University of Edinburgh |
Singh, Abhyudai | University of Delaware |
Keywords: Systems biology, Cellular dynamics, Stochastic systems
Abstract: In the noisy cellular environment, RNAs and proteins are subject to considerable stochastic fluctuations in copy numbers over time. As a consequence, single cells within the same isoclonal population can differ in their expression profile and reside in different phenotypic states. Here we propose a fluctuation-test approach to infer the kinetics of transitions between cell states. More specifically, single cells are randomly drawn from the population and grown into cell colonies. After growth for a fixed number of generations, the number of cells residing in different states is assayed for each colony. In a simple system with reversible switching between two cell states, our analysis shows that the extent of colony-to-colony fluctuations in the fraction of cells in a given state is monotonically related to the switching kinetics. Several closed-form formulas for inferring the switching rates from experimentally quantified fluctuations are presented. This is especially important for scenarios where a measurement involves killing the cell (for example, performing single-cell RNA-seq or assaying whether a microbial/cancer cell is in a drug-sensitive or drug-tolerant state), and hence the state of the same cell cannot be measured at different time points.
|
|
17:20-17:40, Paper WeCT17.5 | Add to My Program |
Limits on Inferring Gene Regulatory Networks from Single-Cell Measurements of Unstable mRNA Levels (I) |
|
Mahajan, Tarun | University of Illinois at Urbana-Champaign |
Saint-Antoine, Michael | University of Delaware |
Dar, Roy | University of Illinois at Urbana-Champaign |
Singh, Abhyudai | University of Delaware |
Keywords: Genetic regulatory systems, Systems biology, Stochastic systems
Abstract: Gene regulatory network inference from single-cell expression data, such as single-cell RNA sequencing, is a popular problem in computational biology. Despite diverse methods spanning information theory, machine learning, and statistics, it is unsolved. This is attributable to measurement errors, lack of perturbation data, or difficulty in causal inference. Yet the role of kinetic parameters of gene expression is unknown. We show how the relative stability of mRNA and protein hampers inference. We use a simple model of gene expression to show that a more stable protein than mRNA deteriorates correlation between the mRNA of a transcription factor and its target gene. This can also happen when mRNA and protein are on the same timescale. Relative difference in timescales affects true interactions more strongly than false positives. Besides correlation, we find that information-theoretic nonlinear measures are also prone to this problem.
|
|
17:40-18:00, Paper WeCT17.6 | Add to My Program |
Building Molecular Band-Pass Filters Via Molecular Sequestration (I) |
|
Zhang, Yichi | Hyperfine Inc |
Cuba Samaniego, Christian | University of California Los Angeles |
Carleton, Katelyn | UCLA |
Qian, Yili | University of Wisconsin-Madison |
Giordano, Giulia | University of Trento |
Franco, Elisa | University of California a Los Angeles |
Keywords: Biomolecular systems, Cellular dynamics, Genetic regulatory systems
Abstract: Engineered genetic circuits with tailored functions that mimic how cells process information in changing environments (e.g. cell fate decision, chemotaxis, immune response) have great applications in biomedicine and synthetic biology. Although there is a lot of progress toward the design of gene circuits yielding desired steady states (e.g. logic-based networks), building synthetic circuits for dynamic signal processing (e.g. filters, frequency modulation, and controllers) is still challenging. Here, we provide a model-based approach to build gene networks that can operate as band-pass filters by taking advantage of molecular sequestration. {By suitably approximating} the dynamics of molecular sequestration, we analyze an Incoherent Feed-Forward Loop (IFFL) and a Negative Feedback (NF) circuit and illustrate how they can achieve band-pass filter behavior. Computational analysis shows that a circuit that incorporates both IFFL and NF motifs improves the filter performance. Our approach facilitates the design of sequestration-based filters, and may {support the synthesis of molecular controllers with desired specifications
|
| |