| |
Last updated on November 11, 2021. This conference program is tentative and subject to change
Technical Program for Monday December 13, 2021
|
MoA01 Tutorial Session, Coordinated Universal Time (UTC) |
Add to My Program |
Fundamentals of Data-Driven Modeling and Control |
|
|
Chair: Majji, Manoranjan | Texas A&M University |
Co-Chair: Fasel, Urban | University of Washington-Seattle Campus |
Organizer: Majji, Manoranjan | Texas A&M University |
Organizer: Balas, Mark | Embry-Riddle Aeronautical University |
Organizer: Nanda, Aditya | Vanderbilt University |
Organizer: Brunton, Steven L. | University of Washington |
Organizer: Singla, Puneet | The Pennsylvania State University |
|
13:30-13:45, Paper MoA01.1 | Add to My Program |
Application of State Estimation Methods to Low-Temperature Plasma Dynamics (I) |
|
Greve, Christine | Texas A&M University |
Hara, Ken | Stanford University |
Majji, Manoranjan | Texas A&M University |
Keywords: Kalman filtering, Estimation
Abstract: State estimation techniques are applied to two different plasma dynamical systems: spacecraft electric propulsion and plasma processing devices. Physical constraints are accounted for in the extended Kalman filter by adapting the process and measurement noise covariances to ensure that the model remains consistent with the physical processes. This modified filter is first used to investigate discharge current oscillations in a Hall effect thruster. Then, the filter is applied to an argon global model of a plasma discharge to estimate the absorbed power input from experimental data. Both applications demonstrate that plasma density fluctuations can be used to estimate the dynamics of other plasma properties in the system, such as electron temperature and reaction rate coefficients.
|
|
13:45-14:00, Paper MoA01.2 | Add to My Program |
Applications of Gaussian Process Regression in the Aero-Thermo-Servo-Elastic Analysis towards Integrated Hypersonic Flight Dynamic Analysis (I) |
|
Huang, Daning | Pennsylvania State University |
Keywords: Aerospace, Reduced order modeling, Computational methods
Abstract: This tutorial paper presents the surrogate modeling based on Gaussian process regression (GPR) and its application in the hypersonic aero-thermo-servo-elastic analysis, a key ingredient in the design of hypersonic vehicles as well as its guidance, navigation and control. First, the basic formulations of GPR and practices of model training are presented. Next, the existing applications of GPR in hypersonic problems are reviewed. Subsequently, a pedagogical example and an applied example of GPR are presented. Finally, the pros and cons of the GPR modeling approach for general and hypersonic-specific applications are summarized.
|
|
14:00-14:15, Paper MoA01.3 | Add to My Program |
SINDy with Control: A Tutorial (I) |
|
Fasel, Urban | University of Washington-Seattle Campus |
Kaiser, Eurika | University of Washington |
Kutz, J. Nathan | University of Washington |
Brunton, Bingni | University of Washington |
Brunton, Steven L. | University of Washington |
Keywords: Nonlinear systems identification, Identification for control, Machine learning
Abstract: Many dynamical systems of interest are nonlinear, with examples in turbulence, epidemiology, neuroscience, and finance, making them difficult to control using linear approaches. Model predictive control (MPC) is a powerful model-based optimization technique that enables the control of such nonlinear systems with constraints. However, modern systems often lack computationally tractable models, motivating the use of system identification techniques to learn accurate and efficient models for real-time control. In this tutorial article, we review emerging data-driven methods for model discovery and how they are used for nonlinear MPC. In particular, we focus on the sparse identification of nonlinear dynamics (SINDy) algorithm and show how it may be used with MPC on an infectious disease control example. We compare the performance against MPC based on a linear dynamic mode decomposition (DMD) model. Code is provided to run the tutorial examples and may be modified to extend this data-driven control framework to arbitrary nonlinear systems.
|
|
14:15-14:30, Paper MoA01.4 | Add to My Program |
Advances in System Identification: Theory and Applications (I) |
|
Guého, Damien | The Pennsylvania State University |
Singla, Puneet | The Pennsylvania State University |
Majji, Manoranjan | Texas A&M University |
Juang, Jer-Nan | NASA Langley Research Ctr |
Keywords: Identification, Linear parameter-varying systems, Time-varying systems
Abstract: This paper describes the main features and the most recent developments of system identification in the sense of data-driven modeling of dynamical systems. A brief summary of discrete time-invariant system identification techniques is provided, from the modern work of Gilbert, Kalman and Ho and the introduction of state-space realization, to the most recent developments of the identification of discrete time-varying and nonlinear systems. Important concepts of state-space realization, controllability and observability for linear systems are introduced along with more advanced methods to identify nonlinear dynamics. Numerical examples of varying complexity are considered to demonstrate the capability of the different approaches presented in this paper.
|
|
14:30-15:00, Paper MoA01.5 | Add to My Program |
Data-Based Modeling and Control of Dynamical Systems: Parameter Estimation (I) |
|
Guého, Damien | The Pennsylvania State University |
Majji, Manoranjan | Texas A&M University |
Singla, Puneet | The Pennsylvania State University |
Keywords: Identification, Identification for control, Machine learning
Abstract: Parameter estimation methods to provide data-based models to control complex dynamical systems are reviewed. Starting from least square minimization of the equation error, the tutorial provides an overview of how different perspectives of parameter estimation lead to various algorithms that are used in diverse contexts. Both statistical and deterministic approaches are discussed, and the utility of model inferences are explained. The discussions provide a context and review relevant background with respect to three application papers involving recent advances in Gaussian Process Regression (GPR), state estimation approaches and data-driven modeling.
|
|
MoA02 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Machine Learning I |
|
|
Chair: Lavaei, Javad | UC Berkeley |
Co-Chair: Yu, Wen | CINVESTAV-IPN |
|
13:30-13:45, Paper MoA02.1 | Add to My Program |
Progressive Graph Partitioning Based on Information Diffusion |
|
Mavridis, Christos | University of Maryland, College Park |
Baras, John S. | University of Maryland |
Keywords: Machine learning, Communication networks, Network analysis and control
Abstract: We propose an online deterministic annealing algorithm for progressive graph partitioning based on the spectral information of the underlying graph Laplacian matrix. Online deterministic annealing is a prototype-based unsupervised learning algorithm that progressively adjusts the number of prototypes used with respect to a performance-complexity trade-off. Due to the online nature of the proposed learning algorithm, the structure of the graph need not be known a priori. In this regard, we construct a distributed approximation algorithm to estimate the spectral information of the graph Laplacian, bypassing the exact computation of its eigenvectors. By propagating an impulse through the graph via a diffusion equation, we show that each node can construct a local learning representation which can be used for spectral clustering. As a result, the proposed approach is suitable for large graphs, requires minimal hyper-parameter tuning, and provides online control over the complexity-accuracy trade-off. We illustrate the properties and evaluate the performance of the proposed methodology in graph partition and image segmentation applications.
|
|
13:45-14:00, Paper MoA02.2 | Add to My Program |
Imitation Learning from Inconcurrent Multi-Agent Interactions |
|
Zhang, Xin | Worcester Polytechnic Institute |
Huang, Weixiao | WPI |
Li, Yanhua | Worcester Polytechnic Institute (WPI) |
Liao, Renjie | University of Toronto |
Zhang, Ziming | Worcester Polytechnic Institute |
Keywords: Machine learning, Game theory
Abstract: Multi-agent imitation learning (MA-IL) aims to inversely learn policies for all agents using demonstrations collected from an expert group. However, this problem has only been studied in the setting of Markov games (MGs) allowing participants for concurrent actions, and do not work for general MGs, with agents inconcurrently making decisions in different turns. In this work, we propose iMA-IL, a novel multi-agent imitation learning framework for general (inconcurrent) Markov games. The learned policies are proven to guarantee subgame perfect equilibrium (SPE), a stronger equilibrium than Nash equilibrium (NE). The experiment results demon- strate that compared to state-of-the-art baselines, our iMA-IL model can better infer the policy of each expert agent using their demonstration data collected from inconcurrent decision- making scenarios.
|
|
14:00-14:15, Paper MoA02.3 | Add to My Program |
Adversarial Attacks on Computation of the Modified Policy Iteration Method |
|
Yekkehkhany, Ali | University of California, Berkeley |
Feng, Han | University of California, Berkeley |
Lavaei, Javad | UC Berkeley |
Keywords: Machine learning, Markov processes, Statistical learning
Abstract: Adversarial attacks on Markov decision processes (MDPs) and reinforcement learning (RL) have been studied in the literature in the context of robust learning and adversarial game theory. In this paper, we introduce a new notion of adversarial attacks on MDP and RL computation that is motivated by the emergence of edge computing. The large-scale computation of MDP and RL models in the form of value/policy iteration and Q-learning is being offloaded from agents to distributed servers, giving rise to edge reinforcement learning. By the inherently distributed nature of edge RL, the MDP/RL computation can be prone to adversarial attacks in different forms. We analyze a probabilistic model of adversarial attacks on the computation of the modified policy iteration method in which the principal contraction property of the Bellman operator is undermined with a certain probability in iterations of the policy evaluation step of the aforementioned method. This can result in luring the agent to search among sub-optimal policies without improving the true values of policies. We prove that under certain conditions, the attacked modified policy iteration method can still converge to the vicinity of the optimal policy with high probability if the number of policy evaluation iterations is larger than a threshold that is logarithmic in the inverse of a desired precision. We also provide an upper bound on the number of iterations needed for the attacked modified policy iteration method to terminate, which holds with an associated confidence level.
|
|
14:15-14:30, Paper MoA02.4 | Add to My Program |
Human-Behavior Learning for Infinite-Horizon Optimal Tracking Problems of Robot Manipulators |
|
Perrusquía, Adolfo | Cranfield University |
Yu, Wen | CINVESTAV-IPN |
Keywords: Machine learning, Optimal control, Neural networks
Abstract: In this paper, a human-behavior learning approach for optimal tracking control of robot manipulators is proposed. The approach is a generalization of the reinforcement learning control problem which merges the capabilities of different intelligent and control techniques in order to solve the tracking task. Three cognitive models are used: robot and reference dynamics and neural networks. The convergence of the algorithm is achieved under a persistent exciting and experience replay fulfillment. The algorithm learns online the optimal decision making controller according to the proposed cognitive models. Simulations were carry out to verify the approach using a 2-DOF planar robot.
|
|
14:30-14:45, Paper MoA02.5 | Add to My Program |
Client Scheduling for Federated Learning Over Wireless Networks: A Submodular Optimization Approach |
|
Ye, Lintao | University of Notre Dame |
Gupta, Vijay | University of Notre Dame |
Keywords: Machine learning, Optimization, Communication networks
Abstract: Federated Learning (FL) has recently been proposed as a distributed optimization framework under resource constraints and privacy concerns. We study the problem of client scheduling for FL, where the goal is to optimize the performance of FL under certain resource constraints on the FL setup. We show that this problem is a special instance of the general problem of maximizing a submodular function subject to a submodular upper bound constraint. We propose a greedy algorithm to solve this general problem, and provide theoretical approximation guarantees to characterize its performance. The greedy algorithm proposed for the general problem is then applied to solve the FL client scheduling problem with the approximation guarantee. We evaluate the performance of the algorithm using experiments.
|
|
14:45-15:00, Paper MoA02.6 | Add to My Program |
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games |
|
Navabi, Shiva | University of Southern California |
Osoba, Osonde | RAND Corporation |
Keywords: Machine learning, Agents-based systems, Game theory
Abstract: We consider a pursuit-evasion game played between two agents, `Blue' (the pursuer) and `Red' (the evader), over T time steps. Red aims to attack Blue's territory. Blue's objective is to intercept Red at time T and thereby limit the success of Red's attack. Blue must plan its pursuit trajectory by choosing parameters that determine its course of movement (speed and angle in our setup) such that it intercepts Red at time T. We show that Blue's path-planning problem in pursuing Red, can be posed as a sequential decision making problem under uncertainty. Blue's unawareness of Red's action policy renders the analytic dynamic programming approach intractable for finding the optimal action policy for Blue. In this work, we are interested in exploring data-driven approaches to the policy optimization problem that Blue faces. We apply generative machine learning (ML) approaches to learn optimal action policies for Blue. This highlights the ability of generative ML models to learn the relevant implicit representations for the dynamics of simulated pursuit-evasion games. We demonstrate the effectiveness of our modeling approach via extensive statistical assessments. This work represents a preliminary step towards the further development of generative modeling approaches for policy optimization problems that arise in the context of multi-agent learning and planning.
|
|
MoA03 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Learning-Based Control and Sweeping Processes I |
|
|
Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Co-Chair: Malisoff, Michael | Louisiana State University |
Organizer: Cao, Tan | SUNY Korea |
Organizer: Mordukhovich, Boris | Wayne State Univ |
Organizer: Malisoff, Michael | Louisiana State University |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
|
13:30-13:45, Paper MoA03.1 | Add to My Program |
Data-Driven Adaptive Optimal Control of Mixed-Traffic Connected Vehicles in a Ring Road (I) |
|
Liu, Tong | New York University |
11201, Leilei Cui | New York University |
Pang, Bo | New York University |
Jiang, Zhong-Ping | New York University |
Keywords: Traffic control, Optimal control, Learning
Abstract: This paper studies the issue of data-driven optimal control design for connected and autonomous vehicles (CAVs) in a mixed-traffic environment. More specifically, we investigate the controllability of a string of vehicles composed of multiple CAVs and heterogeneous human-driven vehicles in a ring road. We use the classical Popov-Belevitch-Hautus (PBH) test to single out the uncontrollable mode, and identify the controllable subspace, based on which we obtain an explicit transformation matrix for Kalman controllable decomposition. Combining with the decomposition result, we formulate a linear quadratic regulator problem with constrained initial states and employ the adaptive dynamic programming method to solve it without relying on the exact knowledge of system parameters. The convergence of the data-driven algorithm has been proved rigorously. The simulation result shows that our theoretical analysis is effective and the proposed data-driven controller yields desirable performance for regulating the mixed-traffic flow.
|
|
13:45-14:00, Paper MoA03.2 | Add to My Program |
Learning-Based, Safety-Constrained Control from Scarce Data Via Reciprocal Barriers (I) |
|
Verginis, Christos | University of Texas at Austin |
Djeumou, Franck | University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Uncertain systems, Learning, Robust control
Abstract: We develop a control algorithm for the safety of a control-affine system with unknown nonlinear dynamics in the sense of confinement in a given safe set. The algorithm leverages robust nonlinear feedback control laws integrated with on- the-fly, data-driven approximations to output a control signal that guarantees the boundedness of the closed-loop system in the given set. More specifically, it first computes estimates of the dynamics based on differential inclusions constructed from data obtained online from a single finite-horizon trajectory. It then computes a novel feedback safety control law that renders the system forward invariant with respect to the safe set, given an accurate enough estimate, using reciprocal barriers. An extension of the algorithm is capable of coping with the controllability loss incurred by the control matrix along the safe set. The algorithm removes a series of common and limiting assumptions considered in the related literature since it does not require global boundedness, growth conditions, or a priori approximations of the unknown dynamics’ terms.
|
|
14:00-14:15, Paper MoA03.3 | Add to My Program |
Learning-Based Actuator Placement for Uncertain Systems (I) |
|
Fotiadis, Filippos | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Learning, Uncertain systems, Identification
Abstract: In this paper, we develop an online, data-based framework for optimal actuator placement of uncertain systems. In particular, a set of actuators is chosen to maximize a metric of controllability of the system, but without full knowledge of the system's dynamics, and by only measuring the system's trajectories in an online manner. The metric of controllability is associated with the trace of a discounted Gramian, which satisfies a static Lyapunov equation even if the system's plant matrix is not Hurwitz. Subsequently, an estimator is designed to learn the Gramian's trace exponentially fast in real-time. Finally, we show that the trace estimator can be used to place actuators online, and that the optimal set of actuators is found and scheduled permanently in finite time. The efficacy of the proposed framework is shown through simulations.
|
|
14:15-14:30, Paper MoA03.4 | Add to My Program |
Optimal Control Involving Sweeping Processes with End Point Constraints (I) |
|
de Pinho, Maria do Rosario | Universidade Do Porto, Fac. Engenharia |
Ferreira, Maria Margarida | Feup, Univ. Do Porto |
Smirnov, Georgi V. | Universidade Do Minho |
Keywords: Optimal control, Variational methods, Constrained control
Abstract: We generalize a Maximum Principle, previously obtained, to cover optimal control problems involving sweeping processes and end point constraints. The work is based on an ingenious smooth approximating family of problems.
|
|
14:30-14:45, Paper MoA03.5 | Add to My Program |
A One-Shot Convex Optimization Approach to Risk-Averse Q-Learning (I) |
|
Yuzhen, Han | Michigan State University |
Mazouchi, Majid | Michigan State University |
Modares, Hamidreza | Michigan State University |
Nageshrao, Subramanya P. | Ford |
Keywords: Learning, Optimal control, Stochastic optimal control
Abstract: This paper presents a model-free Q-learning algorithm for solving the risk-averse optimal control (RAOC) problem. The entropic risk measure is used in the RAOC to account for the variance of the objective function. A one-shot Q-based convex optimization problem is then formed for which the decision variables are the Q-function parameters and the constraints are formed by sampling from an exponential utility-based entropic Bellman inequality. Samples are constructed using only a batch of data collected from a variety of control policies in a fully off-policy manner, which turns a dataset into a Q-learning based risk-averse optimal policy engine. Convergence of the exact optimization problem, which is infinite- dimensional in decision variables and constraints, to the optimal risk-averse Q-function is shown. For the standard convex optimization problem for which function approximation for Q-value estimations as well as constraint sampling are leveraged, the performance of the approximated solutions is verified through a weighted-norm bound and the Lyapunov bound. A simulation example is provided to verify the effectiveness of the presented approach.
|
|
14:45-15:00, Paper MoA03.6 | Add to My Program |
Fixed-Time Seeking and Tracking of Time-Varying Extrema (I) |
|
Poveda, Jorge I. | University of Colorado at Boulder |
Krstic, Miroslav | University of California, San Diego |
Keywords: Adaptive control, Optimization algorithms, Adaptive systems
Abstract: Motivated by recent (semi-global practical) fixed-time convergence results in time-invariant model-free optimization problems, in this paper we introduce new tracking bounds and guidelines for the design of extremum seeking controllers in model-free optimization problems with dynamic cost functions. Using semi-global practical input-to-state stability characterizations, we show that the proposed non-smooth ES dynamics are able to significantly reduce the tracking error compared to the traditional smooth algorithms studied in the literature. Moreover, under a suitable tuning of the gains of the algorithm, the nominal average dynamics of the controller are able to achieve global fixed-time tracking for a general class of dynamic cost functions. For tuning parameters that do not completely eliminate the tracking error in the nominal average dynamics, but which preserve the continuity of the vector field, we show that “almost complete” error rejection is achieved whenever the gain of the algorithm exceeds a particular threshold. Numerical results are presented to illustrate the performance of the algorithms.
|
|
MoA04 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Identification I |
|
|
Chair: Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
Co-Chair: Solo, Victor | University of New South Wales |
|
13:30-13:45, Paper MoA04.1 | Add to My Program |
Non-Causal Regularized Least-Squares for Continuous-Time System Identification with Band-Limited Input Excitations |
|
González, Rodrigo A. | KTH Royal Institute of Technology |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
Keywords: Identification, Estimation, Linear systems
Abstract: In continuous-time system identification, the intersample behavior of the input signal is known to play a crucial role in the performance of estimation methods. One common input behavior assumption is that the spectrum of the input is band-limited. The sinc interpolation property of these input signals yields equivalent discrete-time representations that are non-causal. This observation, often overlooked in the literature, is exploited in this work to study non-parametric frequency response estimators of linear continuous-time systems. We study the properties of non-causal least-square estimators for continuous-time system identification, and propose a kernel-based non-causal regularized least-squares approach for estimating the band-limited equivalent impulse response. The proposed methods are tested via extensive numerical simulations.
|
|
13:45-14:00, Paper MoA04.2 | Add to My Program |
On the Error Rate for Classifying Point Processes |
|
Rong, Xinhui | University of New South Wales |
Solo, Victor | University of New South Wales |
Keywords: Identification, Estimation, Stochastic systems
Abstract: Point process data is increasingly occurring in a wide range of applications including social media. But basic problems such as classifying point processes have not been addressed. Here we study the misclassification error of a point process Bayes rule/likelihood ratio classification rule in a binary classification problem. We first develop the Bhattacharya bound for the error rate for the time-varying Poisson. We then derive a tight computable bound on the Bhattacharya bound for the renewal process. We then develop fast methods for computing these bounds. We study the accuracy of the bounds with some comparative simulations.
|
|
14:00-14:15, Paper MoA04.3 | Add to My Program |
Comparing Vector Networks Via Frequency Domain Persistent Homology |
|
Yue, Zuogong | University of New South Wales |
Cassidy, Ben | Columbia University |
Solo, Victor | University of New South Wales |
Keywords: Identification, Estimation, Stochastic systems
Abstract: Persistent homology is emerging as a powerful approach to comparing networks whose nodes carry signals; but requires a measure of distance between nodes. Almost all existing work applies to scalar networks i.e. where each node carries a scalar signal; and further, these signals are assumed to be white noises though they may be instantaneously cross correlated. We have previously developed frequency domain based distance measures to deal with scalar networks whose nodal signals are cross-autocorrelated. But in most applications networks are vector networks i.e. each node carries a vector of signals. Here we extend our scalar work to provide a frequency domain distance measure for vector networks. The new distance measure is illustrated with comparative simulations They show that persistent homology based on static or white noise vector distance measures fails catastrophically; but when based on dynamic vector distance measures, performs very well.
|
|
14:15-14:30, Paper MoA04.4 | Add to My Program |
Near-Optimal Recursive Identification for Markov Switched Systems |
|
Andrien, Alex Rudolf Petrus | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Identification, Filtering, Markov processes
Abstract: This paper tackles the problem of identifying the parameters of a class of stochastic switched systems, where the active subsystem is determined by a Markov chain. This class includes autoregressive models with exogenous inputs (ARX) for which the parameters switch according to a Markov chain and general Markov Jump Linear Systems (MJLSs) with full-state information. The transition probabilities of the Markov chain are assumed to be known, but the active subsystem is unknown. A recursive identification method for the joint maximum a posteriori probability estimate of these parameters and of the unknown mode is proposed relying on relaxed dynamic programming. The method is guaranteed to provide an estimate whose joint posteriori probability is within a constant factor of that of the optimal estimate while reducing the computational complexity. The method is illustrated through a numerical example.
|
|
14:30-14:45, Paper MoA04.5 | Add to My Program |
Design of Input for Data-Driven Simulation with Hankel and Page Matrices |
|
Iannelli, Andrea | ETH Zurich |
Yin, Mingzhou | ETH Zurich |
Smith, Roy S. | ETH Zurich |
Keywords: Identification, Linear systems, Behavioural systems
Abstract: The paper deals with the problem of designing informative input trajectories for data-driven simulation. First, the excitation requirements in the case of noise-free data are discussed and new weaker conditions, which assume the simulated input to be known in advance, are provided. Then, the case of noisy data trajectories is considered and an input design problem based on a recently proposed maximum likelihood estimator is formulated. A Bayesian interpretation is provided, and the implications of using Hankel and Page matrix representations are demonstrated. Numerical examples show the impact of the designed input on the predictive accuracy.
|
|
14:45-15:00, Paper MoA04.6 | Add to My Program |
Beyond Persistent Excitation: Online Experiment Design for Data-Driven Modeling and Control |
|
van Waarde, Henk J. | University of Cambridge |
Keywords: Identification, Linear systems
Abstract: This paper presents a new experiment design method for data-driven modeling and control. The idea is to select inputs online (using past input/output data), leading to desirable rank properties of data Hankel matrices. In comparison to the classical persistency of excitation condition, this online approach requires less data samples and is even shown to be completely sample efficient.
|
|
MoA05 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Advances in Stochastic Control with Partial Information I |
|
|
Chair: Yuksel, Serdar | Queen's University |
Co-Chair: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Organizer: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Organizer: Yuksel, Serdar | Queen's University |
|
13:30-13:45, Paper MoA05.1 | Add to My Program |
Continuity Properties of Value Functions in Information Structures for Stochastic Team Problems and Zero-Sum and General Games (I) |
|
Hogeboom-Burr, Ian | Queen's University |
Yuksel, Serdar | Queen's University |
Keywords: Game theory, Information theory and control, Stochastic systems
Abstract: We study continuity properties of value functions in information structures for stochastic team, zero-sum, and general game problems. We establish continuity properties of the value function under total variation, setwise, and weak convergence of information structures. Our analysis reveals that the value function for a bounded game is continuous under total variation convergence of information structures in both zero-sum games and team problems. However, continuity fails under setwise or weak convergence of information structures. The value function exhibits upper semicontinuity properties under weak and setwise convergence of information structures for team problems, and upper or lower semicontinuity properties hold for zero-sum games when such convergence is through a Blackwell-garbled sequence of information structures. Finally, a counterexample reveals that value functions for players may not be continuous even under total variation convergence of information structures in general non-zero-sum games.
|
|
13:45-14:00, Paper MoA05.2 | Add to My Program |
Kullback-Leibler-Quadratic for Distributed Control in a Stochastic Environment (I) |
|
CAMMARDELLA, NEIL | University of Florida |
Busic, Ana | Inria |
Meyn, Sean P. | Univ. of Florida |
Keywords: Stochastic optimal control, Mean field games, Smart grid
Abstract: This paper presents advances in Kullback-Leibler-Quadratic (KLQ) optimal control: a stochastic control framework for Markovian models. The motivation is distributed control of large networked systems. The objective function is composed of a control cost in the form of Kullback-Leibler divergence plus a quadratic cost on the sequence of marginal distributions. With this choice of objective function, the optimal probability distribution of a population of agents over a finite time horizon is shown to be an exponential tilting of the nominal probability distribution. The same is true for the controlled transition matrices that induce the optimal probability distribution. However, one limitation of the previous work is that randomness can only be introduced via the control policy; all uncontrolled processes must be modeled as deterministic to render them immutable under an exponential tilting. In this work, only the controlled dynamics are subject to tilting, allowing for more general probabilistic models. Numerical experiments are conducted in the context of power networks. The distributed control techniques described in this paper can transform a large collection of flexible loads into a `virtual battery' capable of delivering the same grid services as traditional batteries. Additionally, quality of service to the load owner is guaranteed, privacy is preserved, and computation and communication requirements are reduced, relative to alternative centralized control techniques.
|
|
14:00-14:15, Paper MoA05.3 | Add to My Program |
Critical Nodes in Graphon Mean Field Games (I) |
|
Foguen Tchuendom, Rinel | McGill University |
Caines, Peter E. | McGill University |
Huang, Minyi | Carleton University |
Keywords: Mean field games, Decentralized control, Networked control systems
Abstract: We propose to study what shall be termed critical nodes for a family of Linear Quadratic Gaussian Graphon Mean Field Games (LQG-GMFG). The critical nodes are defined to be those nodes at which the local mean field is stationary with respect to its index. We present examples of such nodes for the graphons that are the limits of (i) uniform attachment graph and (ii) the Erdos-Renyi graphs. We also investigate the properties of these critical nodes on the value functions and the equilibrium startegies for the LQG-GMFG.
|
|
14:15-14:30, Paper MoA05.4 | Add to My Program |
Linear Quadratic Mean Field Stackelberg Games: Master Equations and Time Consistent Feedback Strategies (I) |
|
Yang, Xuwei | Carleton University |
Huang, Minyi | Carleton University |
Keywords: Mean field games, Stochastic optimal control, Game theory
Abstract: We consider linear quadratic (LQ) Stackelberg games with a major player (leader) and N minor players (followers) and derive two master equations in a mean field limit model. We show the resulting decentralized strategies are time consistent by adapting a procedure introduced by Ekeland and Lazrak (2006).
|
|
14:30-14:45, Paper MoA05.5 | Add to My Program |
Private Information Compression in Dynamic Games among Teams |
|
Tang, Dengwang | University of Michigan |
Tavafoghi, Hamidreza | UC Berkeley |
Subramanian, Vijay G. | University of Michigan |
Nayyar, Ashutosh | University of Southern California |
Teneketzis, Demosthenis | Univ. of Michigan, Ann Arbor |
Keywords: Game theory, Decentralized control, Stochastic optimal control
Abstract: We investigate finite-horizon stochastic dynamic games among teams. Each team has its own dynamic system, whose evolution is affected by the actions of all players in all teams. Within each team, members share their local states with each other with a delay of d>0. Actions are observed by all agents along with noisy observations of the systems. Such games feature the difficulties of the increasing domain of strategies and interdependence of actions and information over time. In these games, we identify a subclass of Nash Equilibria where the agents use Sufficient Private Information Based (SPIB) strategies, i.e. agents make decisions based on compressed versions of their private information along with the common information. We establish the existence of such equilibria; the proof of existence is not based on standard techniques since SPIB strategies do not feature perfect recall. Finally, we investigate a special case of our model where each agent has their own dynamic system. We show that agents can compress their private information further in this case. Our results provide a foundational step in addressing the difficulties of dynamic games among teams.
|
|
14:45-15:00, Paper MoA05.6 | Add to My Program |
Vanishing Viscosity for Linear-Quadratic Mean-Field Control Problems |
|
Ciampa, Gennaro | BCAM - Basque Center for Applied Mathematics |
Rossi, Francesco | Università Degli Studi Di Padova |
Keywords: Stochastic optimal control, Mean field games, Distributed parameter systems
Abstract: We consider a mean-field control problem with linear dynamics and quadratic control. We prove the existence of an optimal control by applying the vanishing viscosity method: we add a (regularizing) heat diffusion with a small viscosity coefficient and let such coefficient go to zero. The main result is that, in this case, the limit optimal control is exactly the optimal control of the original problem.
|
|
MoA06 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Game Equilibrium Seeking and Learning I |
|
|
Chair: Grammatico, Sergio | Delft Univ. of Tech |
Co-Chair: Pavel, Lacra | University of Toronto |
Organizer: Franci, Barbara | Maastricht University |
Organizer: Grammatico, Sergio | Delft Univ. of Tech |
Organizer: Pavel, Lacra | University of Toronto |
Organizer: Shanbhag, Uday V. | Pennsylvania State University |
Organizer: Staudigl, Mathias | Maastricht University |
|
13:30-13:45, Paper MoA06.1 | Add to My Program |
Distributed Nash Equilibrium Seeking Resilient to Adversaries (I) |
|
Gadjov, Dian | University of Toronto |
Pavel, Lacra | University of Toronto |
Keywords: Game theory, Agents-based systems, Network analysis and control
Abstract: Most research in distributed Nash equilibrium seeking assumes that agents communicate truthfully. However, in general noncooperative games agents may have the incentive to send misinformation to neighbouring agents with the goal of minimizing their own costs. Furthermore, such settings can also be susceptible to communication failures and attacks from agents outside the game. In this paper, we design a NE seeking algorithm that is resilient against malicious agents and communication tampering/failures. The problem is challenging because adversarial agents may be indistinguishable from a truthful agent with a modified (and valid) cost function. The core issue is that agents lack any means of verifying if the information they receive is truthful, i.e. there is no ``ground truth". To address this problem, we make use of an observation graph in addition to a communication graph, as well as pruning of extreme messages. Under the assumption that the number of adversaries/malicious agents does not exceed the number of truthful ones, we show that our algorithm is resilient against adversarial agents and converges to the Nash equilibrium.
|
|
13:45-14:00, Paper MoA06.2 | Add to My Program |
A Relaxed-Inertial Forward-Backward-Forward Algorithm for Stochastic Generalized Nash Equilibrium Seeking (I) |
|
Cui, Shisheng | The Pennsylvania State University |
Franci, Barbara | Technische Universiteit Delft |
Grammatico, Sergio | Delft Univ. of Tech |
Shanbhag, Uday V. | Pennsylvania State University |
Staudigl, Mathias | Maastricht University |
Keywords: Game theory, Optimization, Stochastic systems
Abstract: We propose a new operator splitting algorithm for distributed Nash equilibrium seeking under stochastic uncertainty, featuring relaxation and inertial effects. The proposed algorithm is derived from a forward-backward-forward scheme for solving structured monotone inclusion problems with Lipschitz continuous and monotone pseudogradient operator. To the best of our knowledge, this is the first distributed generalized Nash equilibrium seeking algorithm featuring acceleration techniques in stochastic Nash equilibrium problems without assuming cocoercivity. Numerical examples illustrate the effect of inertia and relaxation on the performance of our proposed algorithm.
|
|
14:00-14:15, Paper MoA06.3 | Add to My Program |
Appointed-Time Distributed Nash Equilibrium Seeking for Networked Games (I) |
|
Zhou, Jialing | Nanjing University of Science and Technology |
Lv, Yuezu | Southeast University |
Ye, Maojiao | Nanjing University of Science and Technology |
Keywords: Game theory, Distributed control, Control over communications
Abstract: Distributed Nash Equilibrium (NE) seeking for networked games has been widely investigated in recent years. In particular, convergence speed and communication cost are two of the critical concerns for the design of the seeking algorithms. To achieve fast convergence while saving communication resources, this paper investigates the appointed-time distributed NE seeking problem for networked games under a discrete-time communication scenario. By utilizing the idea of motion planning, a new continuous-time distributed NE seeking algorithm, which ensures convergence to NE at the prescribed time, is proposed based on sampled-data information exchange. Compared with the existing related works, the established algorithm has advantages on both the communication cost and convergence speed (i.e., appointed-time convergence). Finally, the effectiveness of the proposed method is verified via numerical simulations.
|
|
14:15-14:30, Paper MoA06.4 | Add to My Program |
Adversarial Linear-Quadratic Mean-Field Games Over Multigraphs (I) |
|
Zaman, Muhammad Aneeq uz | UIUC |
Bhatt, Sujay | Baidu USA |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Mean field games, Game theory, Networked control systems
Abstract: In this paper, we propose a game between an exogenous adversary and a network of agents connected via a multigraph. The multigraph is composed of (1) a global graph structure, capturing the virtual interactions among the agents, and (2) a local graph structure, capturing physical/local interactions among the agents. The aim of each agent is to achieve consensus with the other agents in a decentralized manner by minimizing a local cost associated with its local graph and a global cost associated with the global graph. The exogenous adversary, on the other hand, aims to maximize the average cost incurred by all agents in the multigraph. We derive Nash equilibrium policies for the agents and the adversary in the Mean-Field Game setting, when the agent population in the global graph is arbitrarily large and the ``homogeneous mixing" hypothesis holds on local graphs. This equilibrium is shown to be unique and the equilibrium Markov policies for each agent depend on the local state of the agent, as well as the influences on the agent by the local and global mean fields.
|
|
14:30-14:45, Paper MoA06.5 | Add to My Program |
Distributed Generalized Nash Equilibrium Seeking of N-Coalition Games with Inequality Constraints (I) |
|
Sun, Chao | NTU |
Hu, Guoqiang | Nanyang Technological University, Singapore |
Keywords: Game theory, Distributed control, Agents-based systems
Abstract: In this paper, we study the generalized Nash equilibrium (GNE) seeking problem for an N-coalition game with inequality constraints. First, using full decision information, a finite-time average consensus-based approach is proposed. It is demonstrated that the algorithm converges to a GNE (specifically, a variational equilibrium) of the game. Then, using the finite-time consensus tracking technique, we propose a distributed algorithm that only requires neighboring action information for each agent. The solution by using partial decision information covers the distributed GNE seeking solutions of generalized non-cooperative games as a special case.
|
|
14:45-15:00, Paper MoA06.6 | Add to My Program |
Forward-Backward Algorithms for Stochastic Nash Equilibrium Seeking in Restricted Strongly and Strictly Monotone Games |
|
Franci, Barbara | Technische Universiteit Delft |
Grammatico, Sergio | Delft Univ. of Tech |
Keywords: Game theory, Stochastic systems, Variational methods
Abstract: We study stochastic Nash equilibrium problems with expected valued cost functions whose pseudogradient satisfies restricted monotonicity properties which hold only with respect to the solution. We propose a forward-backward algorithm and prove its convergence under restricted strong monotonicity, restricted strict monotonicity and restricted cocoercivity of the pseudogradient mapping. To approximate the expected value, we use either a finite number of samples and a vanishing step size or an increasing number of samples with a constant step. Numerical simulations show that our proposed algorithm might be faster than the available algorithms.
|
|
MoA07 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Optimization I |
|
|
Chair: Leok, Melvin | University of California, San Diego |
Co-Chair: Bastianello, Nicola | University of Padova |
|
13:30-13:45, Paper MoA07.1 | Add to My Program |
Tvopt: A Python Framework for Time-Varying Optimization |
|
Bastianello, Nicola | University of Padova |
Keywords: Numerical algorithms, Optimization algorithms
Abstract: This paper introduces tvopt, a Python framework for prototyping and benchmarking time-varying (or online) optimization algorithms. The paper first describes the theoretical approach that informed the development of tvopt. Then it discusses the different components of the framework and their use for modeling and solving time-varying optimization problems. In particular, tvopt provides functionalities for defining both centralized and distributed online problems, and a collection of built-in algorithms to solve them, for example gradient-based methods, ADMM and other splitting methods. Moreover, the framework implements prediction strategies to improve the accuracy of the online solvers. The paper then proposes some numerical results on a benchmark problem and discusses their implementation using tvopt. The code for tvopt is available at https://github.com/nicola-bastianello/tvopt.
|
|
13:45-14:00, Paper MoA07.2 | Add to My Program |
Variational Symplectic Accelerated Optimization on Lie Groups |
|
Lee, Taeyoung | George Washington University |
Tao, Molei | Georgia Institute of Technology |
Leok, Melvin | University of California, San Diego |
Keywords: Optimization, Algebraic/geometric methods, Machine learning
Abstract: There has been significant interest in generalizations of the Nesterov accelerated gradient descent algorithm due to its improved performance guarantee compared to the standard gradient descent algorithm, and its applicability to large scale optimization problems arising in deep learning. A particularly fruitful approach is based on numerical discretizations of differential equations that describe the continuous time limit of the Nesterov algorithm, and a generalization involving time-dependent Bregman Lagrangian and Hamiltonian dynamics that converges at an arbitrarily fast rate to the minimum. We develop a Lie group variational discretization based on an extended path space formulation of the Bregman Lagrangian on Lie groups, and analyze its computational properties with two examples in attitude determination and vision-based localization.
|
|
14:00-14:15, Paper MoA07.3 | Add to My Program |
Multi-Agent Trajectory Optimization against Plan-Deviation Attacks Using Co-Observations and Reachability Constraints |
|
Yang, Ziqi | Boston University |
Tron, Roberto | Boston University |
Keywords: Optimization, Cyber-Physical Security, Attack Detection
Abstract: In this paper, we focus on using path planning and inter-agent measurements to improve the security of multi-robot systems against possible takeovers from cyber-attackers. We build upon recent trajectory optimization approaches where introspective measurement capabilities of the robots are used in an observation schedule to detect deviations from the preordained routes. This paper proposes additional constraints that can be incorporated in the previous trajectory optimization algorithm based on Alternating Direction Method of Multipliers (ADMM). The new constraints provide guarantees that a compromised robot cannot reach a designed safety zone between observations despite adversarial movement by the attacker. We provide a simulation showcasing the new components of the formulation in a multi-agent map exploration task with several safety zones.
|
|
14:15-14:30, Paper MoA07.4 | Add to My Program |
Linearly Convergent Distributed Optimization Methods with Compressed Innovation |
|
Zhang, Jiaqi | Tsinghua University |
You, Keyou | Tsinghua University |
Keywords: Optimization, Distributed control, Network analysis and control
Abstract: Data compression is essential to reduce communication cost in distributed optimization over peer-to-peer networks. In this work, we propose a novel communication-efficient linearly convergent distributed (COLD) algorithm with compressed innovation---the difference between a model and its estimate. COLD supports a class of quantizers with delta-contraction property. For strongly convex distributed problems, we explicitly quantify in theory how the compression affects the linear convergence rate. To the best of our knowledge, we are the first to achieve linear convergence in distributed optimization allowing biased and delta-contracted compressors, which is a typical case in practice. Numerical experiments validate our theoretical results.
|
|
14:30-14:45, Paper MoA07.5 | Add to My Program |
Asynchronous Parallel Nonconvex Optimization under the Polyak-Lojasiewicz Condition |
|
Yazdani, Kasra | University of Florida |
Hale, Matthew | University of Florida |
Keywords: Optimization, Machine learning, Optimization algorithms
Abstract: Communication delays and synchronization are major bottlenecks for parallel computing, and tolerating asynchrony is therefore crucial for accelerating distributed computation. Motivated by modern interest in optimization problems that do not satisfy convexity assumptions, we present an asynchronous block coordinate descent algorithm for nonconvex optimization problems whose objective functions satisfy the Polyak-Lojasiewicz condition. This condition is a generalization of strong convexity to nonconvex problems and requires neither convexity nor uniqueness of minimizers. Under only assumptions of mild smoothness of objective functions and bounded delays, we prove that a linear convergence rate is obtained. Numerical experiments for logistic regression problems are presented to illustrate the impact of asynchrony upon convergence in this setting.
|
|
14:45-15:00, Paper MoA07.6 | Add to My Program |
On the Convergence of NEAR-DGD for Nonconvex Optimization with Second Order Guarantees |
|
Iakovidou, Charikleia | Northwestern University |
Wei, Ermin | Northwestern Univeristy |
Keywords: Optimization, Optimization algorithms, Agents-based systems
Abstract: We consider the setting where the nodes of an undirected, connected network collaborate to solve a shared objective modeled as the sum of smooth functions. We assume that each summand is privately known by a unique node. NEAR-DGD is a distributed first order method which permits adjusting the amount of communication between nodes relative to the amount of computation performed locally in order to balance convergence accuracy and total application cost. In this work, we generalize the convergence properties of a variant of NEAR-DGD from the strongly convex to the nonconvex case. Under mild assumptions, we show convergence to minimizers of a custom Lyapunov function. Moreover, we demonstrate that the gap between those minimizers and the second order stationary solutions of the original problem can become arbitrarily small depending on the choice of algorithm parameters. Finally, we accompany our theoretical analysis with a numerical experiment to evaluate the empirical performance of NEAR-DGD in the nonconvex setting.
|
|
MoA08 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Predictive Control for Nonlinear Systems I |
|
|
Chair: Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Co-Chair: Robu, Bogdan | Grenoble Alpes University, GIPSA-Lab |
|
13:30-13:45, Paper MoA08.1 | Add to My Program |
Limited Information Model Predictive Control for Pursuit-Evasion Games |
|
Sani, Mukhtar | Université Grenoble Alpes |
Robu, Bogdan | Grenoble Alpes University, GIPSA-Lab |
Hably, Ahmad | GIPSA-Lab |
Keywords: Predictive control for nonlinear systems, Intelligent systems, Nonholonomic systems
Abstract: This paper explores the use of model predictive control (MPC) in dealing with the pursuit-evasion game (PEG) problem where players have incomplete information on their opponents. This is different from most cases in the literature where each player knows all the information (states information and dynamics) on the opponent. The burden caused by such demand for the opponent’s full information induces the need for more sensors during physical implementation as well as high computation time. However, we found that only the current positions, i.e. x-y coordinate of the opponent, are indispensable. Thus, knowing the orientation and the dynamics of the opponent are insignificant to the performance of the game. We propose a new method to exploit a two-player PEG in the presence and absence of obstacles, where each player can only rely on the current position information of its opponent. Several simulation results show that the PEG problem can be handled and obstacles can be avoided using the proposed control protocol. We also show that our approach is robust to measurement noise and can perform better, in terms of the computation than the approach with full information.
|
|
13:45-14:00, Paper MoA08.2 | Add to My Program |
Active Cell Balancing by Model Predictive Control for Real Time Range Extension |
|
Chen, Jun | Oakland University |
Behal, Aman | Univ. of Central Florida |
Li, Chong | Columbia University |
Keywords: Automotive control, Optimal control, Predictive control for nonlinear systems
Abstract: This paper studies the active cell balancing problem by using model predictive control (MPC) for real time range extension. Specifically, three MPC formulations are proposed and compared: the first one being a tracking controller to force all cells to follow the same trajectory generated by a nominal cell model, the second one trying to maximize the lowest cell SOC/voltage and the last one minimizing the difference between the highest and lowest cell SOC/voltages. Both steady state and transient conditions are simulated to assess the effectiveness of the proposed controllers, and a range extension of 4% is found for dynamic driving cycle and 7% for steady state condition. Comparing to the literature, our approaches achieve similar range extension, without making the restrictive assumption that the final battery state-of-charge is known in advance, making our approaches more applicable. Real time implementability is demonstrated via throughput analysis.
|
|
14:00-14:15, Paper MoA08.3 | Add to My Program |
Robust Tube-Enhanced Multi-Stage NMPC with Stability Guarantees |
|
Subramanian, Sankaranarayanan | TU Dortmund |
Abdelsalam, Yehia | Technical University of Dortmund (TU Dortmund) |
Lucia, Sergio | TU Dortmund University |
Engell, Sebastian | TU Dortmund |
Keywords: Predictive control for nonlinear systems, Robust control, Stability of nonlinear systems
Abstract: We propose a robust Nonlinear Model Predictive Control (NMPC) scheme that provides an improved trade-off between optimality and complexity when compared to other available strategies. Two controllers are employed in the proposed framework: A multi-stage primary controller that optimizes a given objective in the presence of uncertainties with tightened constraints and a multi-stage ancillary controller that tracks the predicted tree of state and input trajectories of the primary controller. The primary controller optimizes the original objective by considering different realizations of the most significant uncertainties in the predictions. The ancillary controller provides robustness against other uncertainties by tracking the predicted tree of state and input trajectories of the primary controller. We establish sufficient conditions for closed-loop stability. The advantages of the scheme are demonstrated for a continuous stirred tank reactor (CSTR) example.
|
|
14:15-14:30, Paper MoA08.4 | Add to My Program |
Set-Valued Model Predictive Control |
|
Risso, Nathalie | University of Arizona |
Altin, Berk | University of California, Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Sprinkle, Jonathan | University of Arizona |
Keywords: Predictive control for nonlinear systems, Stability of nonlinear systems, Lyapunov methods
Abstract: Model predictive control (MPC) is a valuable tool to deal with systems that require optimal solutions and constraint satisfaction. In the case of systems with uncertainty, the formulation of predictive controllers requires models which are capable to capture system dynamics, constraints and also system uncertainty. In this work we present a formulation for a set-valued model predictive control (SVMPC) where uncertainty is represented in terms of sets. The approach presented here considers a model where the state is set valued and dynamics are defined by a set valued map. The cost function associated to the proposed MPC associates a real-valued cost to each set valued (or tube-based) trajectory. For this formulation, we study conditions that can yield the constrained optimal control problem associated to the set-valued MPC formulation feasible and stable, thus extending existing stability results from classic MPC to a set based approach. Examples illustrate the results along the paper.
|
|
14:30-14:45, Paper MoA08.5 | Add to My Program |
Backup Plan Constrained Model Predictive Control |
|
Kim, Hunmin | University of Illinois Urbana-Champaign |
YOON, HYUNGJIN | University of Nevada, Reno |
Wan, Wenbin | University of Illinois at Urbana–Champaign |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Sha, Lui | University of Illinois at Urbana Champaign |
Voulgaris, Petros G. | Univ of Nevada, Reno |
Keywords: Predictive control for nonlinear systems, Autonomous vehicles, Optimal control
Abstract: This article proposes a new safety concept: dynamically formulated backup plan safety. The backup plan safety is defined as the ability to complete one of the alternative missions formulated in real-time in the case of primary mission abortion. To incorporate this new safety concept in control problems, we formulate a feasibility maximization problem that adopts additional (virtual) input horizons toward the alternative missions on top of the input horizon toward the primary mission. Cost functions for the primary and alternative missions construct multiple objectives, and multi-horizon inputs evaluate them. To address the feasibility maximization problem, we develop a multi-horizon multi-objective model predictive path integral control (3M) algorithm. Model predictive path integral control (MPPI) is a sampling-based scheme that can help the proposed algorithm deal with nonlinear dynamic systems and achieve computational efficiency by parallel computation. Simulations of the aerial vehicle control problems demonstrate the new concept of backup plan safety and the performance of the proposed algorithm.
|
|
14:45-15:00, Paper MoA08.6 | Add to My Program |
Constrained Neural Networks for Approximate Nonlinear Model Predictive Control |
|
Adhau, Saket | Norwegian University of Science and Technology |
Naik, Vihangkumar Vinaykumar | IMT School for Advanced Studies Lucca |
Skogestad, Sigurd | Norwegian Univ. of Science and Technology (NTNU) |
Keywords: Predictive control for nonlinear systems, Machine learning, Optimal control
Abstract: Solving Non-Linear Model Predictive Control (NMPC) online is often challenging due to the computational complexities involved. This issue can be avoided by approximating the optimization problem using supervised learning methods which comes with a trade-off on the optimality and/or constraint satisfaction. In this paper, a novel supervised learning framework for approximating NMPC is proposed, where we explicitly impart constraint knowledge within the neural networks. This knowledge is inherited by augmenting the loss function of the neural networks during the training phase with insights from KKT conditions. Logarithmic barrier functions are utilized to augment the loss function including conditions of primal and dual feasibility. The proposed framework can be applied to other machine learning based parametric approximators. This approach is easy to implement and its efficacy is demonstrated on a benchmark NMPC problem for continuous stirred tank reactor (CSTR).
|
|
MoA09 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Discrete Event Systems I |
|
|
Chair: Cai, Kai | Osaka City University |
Co-Chair: Yin, Xiang | Shanghai Jiao Tong University |
|
13:30-13:45, Paper MoA09.1 | Add to My Program |
Compositional Verification of Finite Automata under Event Preemption |
|
Tang, Yiheng | Friedrich-Alexander-Universität Erlangen-Nürnberg |
Moor, Thomas | Friedrich-Alexander Universität Erlangen-Nürnberg |
Keywords: Automata, Manufacturing systems and automation
Abstract: Given a number of synchronised automata, compositional verification seeks to verify non-conflictingness without the explicit computation of an overall model. Technically, the approach alternates conflict-preserving abstractions with the composition of a small number of strategically chosen automata and the literature reports substantial computational benefits for examples of practical relevance. In this paper, we re-visit this approach in order to address the situation of preemptive events, i.e., events that are known beforehand to be scheduled at highest priority. Our study is motivated by high-level programming languages commonly used in industrial automation, where events associated with actuators preempt events associated with sensors.
|
|
13:45-14:00, Paper MoA09.2 | Add to My Program |
Distributed Sensing and Information Transmission of Discrete-Event Systems with Edge Sensors |
|
Liu, Yingying | Shanghai Jiao Tong University |
Yin, Xiang | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Automata, Supervisory control
Abstract: In this paper, we investigate the problem of distributed sensing and information transmission in partially observed discrete-event systems, where the sensing and information transmission are complicated by a set of edge sensors. Each edge sensor selectively transmits its observable events, according to an information transmission policies, to a central site for the purpose of decision making. In this paper, we consider a general class of decision making requirement at the central site called the distinguishability. Then we investigate both the verification and synthesis problems. For the verification problem, two different approaches, one based on the observer and the other based on the verifier, are proposed to check whether or not a given set of sensor transmission policies fulfills the distinguishability requirement at the central site. For the synthesis problem, we also develop an effective algorithm to design an observer-based optimal information transmission policy for each edge sensor such that they are verified to be distinguishable.
|
|
14:00-14:15, Paper MoA09.3 | Add to My Program |
Modeling and Analysis of Networked Discrete Event Systems with Multiple Control Channels |
|
Liu, Zhaocong | Shanghai Jiao Tong Univ |
Hou, Junyao | ShanghaiJiaoTong University |
Yin, Xiang | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Automata, Supervisory control
Abstract: In this paper, we propose a novel framework for modeling and analysis of networked discrete-event systems (DES). We assume that the plant is controlled by a feedback supervisor whose control decisions are subject to communication delays and losses. Furthermore, we consider a general setting where the supervisor sends control decisions to different actuators via different communication channels whose dynamics are independent.We provide a system theoretic approach by identifying the state-space of overall networked system and investigating the dynamic of the entire state-space. Our approach precisely specifies the roles of the supervisor, the communication channels and the actuators. Also, we compare the proposed networked DES model with the existing one and show that the proposed networked model captures physical situations of networked systems more precisely.
|
|
14:15-14:30, Paper MoA09.4 | Add to My Program |
A Game-Theoretical Approach for Optimal Supervisory Contro of Discrete Event Systems for Cyclic Tasks |
|
Lv, Peng | Shanghai Jiao Tong University |
Yin, Xiang | Shanghai Jiao Tong University |
Ji, Yiding | Boston University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Automata, Supervisory control
Abstract: In this paper, we investigate the problem of optimal supervisory control for cyclic tasks in the context of discrete-event systems (DES). We consider the completion of each single task as the visit of a marked state, and overall control objective is to complete tasks cyclically in the sense that marked states are visited infinitely often. Following the standard optimal supervisory control framework, two types of costs, disable cost and occurrence cost, are considered. However, instead of considering the standard accumulated total cost or the average cost per event, we propose a new measure for the control performance using the average cost per task. We show that such an optimality measure is more suitable for tasks that need to be completed cyclically. Our goal is to design a live and non-blocking supervisor such that the average cost per task in the worst-case is minimized. To solve the problem, we propose a game-theoretical approach by converting the optimal control problem as a two-player graph game. The constructed game is then solved in two stages: one focuses on the optimal execution within each single task cycle and the other focuses on the scheduling strategy among different tasks. Illustrative examples are provided to demonstrate the proposed algorithm.
|
|
14:30-14:45, Paper MoA09.5 | Add to My Program |
Enforcement of K-Step Opacity with Edit Functions |
|
Wintenberg, Andrew | The University of Michigan, Ann Arbor |
Blischke, Matthew | The University of Michigan, Ann Arbor |
Lafortune, Stephane | Univ. of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Discrete event systems, Computer/Network Security, Automata
Abstract: Opacity is an information flow property for dynamic systems describing plausible deniability, that is whether an eavesdropper can deduce that ``secret'' behavior has occurred. In particular, K-step opacity considers secret actions that have occurred within the last K-steps in the past. We consider the problem of K-step opacity enforcement over automata using obfuscation. We present a general framework for K-step opacity enforcement and transform the problem of enforcing K-step opacity to enforcing current-state opacity. We can then apply existing obfuscation synthesis methods for current-state opacity to K-step opacity. We demonstrate this approach by enforcing privacy in the context of a novel contact tracing model.
|
|
14:45-15:00, Paper MoA09.6 | Add to My Program |
N-Step Nonblocking Supervisory Control of Discrete-Event Systems |
|
Zhang, Renyuan | Northwestern Polytechnical University |
Wang, Zenghui | Northwestern Polytechnical University |
Cai, Kai | Osaka City University |
Keywords: Discrete event systems, Supervisory control, Automata
Abstract: In this paper, we propose a new automaton property of N-step nonblockingness for a given positive integer N. This property quantifies the standard nonblocking property by capturing the practical requirement that all tasks be completed within a bounded number of steps. Accordingly, we formulate a new N-step nonblocking supervisory control problem, and characterize its solvability in terms of a new concept of N-step language completability. It is proved that there exists a unique supremal N-step completable sublanguage of a given language, and we develop a generator-based algorithm to compute the supremal sublanguage. Finally, together with the supremal controllable sublanguage, we design an algorithm to compute a maximally permissive supervisory control solution to the new N-step nonblocking supervisory control problem.
|
|
MoA10 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Adaptive Control I |
|
|
Chair: Gharesifard, Bahman | University of California, Los Angeles |
Co-Chair: Zhou, Jing | University of Agder |
|
13:30-13:45, Paper MoA10.1 | Add to My Program |
A Note on Nussbaum-Type Control and Lie-Bracket Approximation |
|
Weber, Marc | RWTH Aachen University |
Ebenbauer, Christian | RWTH Aachen University |
Gharesifard, Bahman | University of California, Los Angeles |
Keywords: Adaptive control, Algebraic/geometric methods, Nonlinear systems
Abstract: In this paper, we propose an adaptive control law for completely unknown scalar linear systems based on Lie-bracket approximation methods. We investigate stability and convergence properties for the resulting Lie-bracket system, compare our proposal with existing Nussbaum-type solutions and demonstrate our results with an example. Even though we prove global stability properties of the Lie-bracket system, the stability properties of the proposed dynamics remain open, making the proposed control law an object of further studies. We elaborate the difficulties of establishing stability results by investigating connections to partial stability as well as studying the corresponding Chen-Fliess expansion.
|
|
13:45-14:00, Paper MoA10.2 | Add to My Program |
Continuous-Time Extremum Seeking for Scalar Systems Via a Time-Delay Approach to Averaging |
|
Zhu, Yang | Zhejiang University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Adaptive control, Delay systems, LMIs
Abstract: In this paper, we present a constructive approach to continuous-time extremum seeking (ES) by using a time delay approach to averaging. We consider gradient-based ES of static maps in the case of single-input, and we study two ES methods: the classical one and a more recent bounded ES method. By transforming the ES dynamics into a time delay system where the delay is the period of dither, we derive the practical stability conditions for the resulting time-delay system. The time-delay system stability guarantees the stability of the original ES plant. Under the assumption of some known bounds on the extremum value and the Hessian, the time delay approach provides a quantitative calculation on the lower bound of the frequency and on the resulting ultimate bound. We also give a bound on the neighborhood of the extremum point starting from which the solution is ultimately bounded. When the Hessian and the extremum value bounds are unknown, we provide, for the first time, the asymptotic ultimate bound in terms of the frequency in the case of bounded ES. Moreover, our explicit bound on the seeking error of ES control systems allows to select appropriate tuning parameters (such as dither frequency, magnitude, and control gain).
|
|
14:00-14:15, Paper MoA10.3 | Add to My Program |
Closed-Loop Deep Neural Network-Based FES Control for Human Limb Tracking |
|
Griffis, Emily | University of Florida |
Le, Duc M. | University of Florida |
Stubbs, Kimberly J. | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Adaptive control, Lyapunov methods, Neural networks
Abstract: Functional electrical stimulation (FES) can be used as rehabilitative treatment for lost motor neuron function in people with neurological disorders. This paper considers a leg extension machine coupled to a participant for FES-induced closed-loop lower-limb tracking of a desired trajectory. FES-induced control faces challenges as the muscle dynamics exhibit nonlinear behaviors and have unstructured uncertainty. A closed-loop data-driven deep neural network (DNN)-based adaptive control method for FES-induced lower-limb position trajectory tracking is developed. A Lyapunov-based stability analysis is used to develop a closed-loop state-feedback adaptation law for the outer-layer weights of the DNN, which is combined with a feedback controller to yield semi-global asymptotic tracking.
|
|
14:15-14:30, Paper MoA10.4 | Add to My Program |
Wasserstein Contraction Bounds on Closed Convex Domains with Applications to Stochastic Adaptive Control |
|
Lekang, Tyler | University of Minnesota, Twin Cities |
Lamperski, Andrew | University of Minnesota |
Keywords: Adaptive control, Lyapunov methods, Stochastic systems
Abstract: This paper is motivated by the problem of quantitatively bounding the convergence of adaptive control methods for stochastic systems to a stationary distribution. Such bounds are useful for analyzing statistics of trajectories and determining appropriate step sizes for simulations. To this end, we extend a methodology from (unconstrained) stochastic differential equations (SDEs) which provides contractions in a specially chosen Wasserstein distance. This theory focuses on unconstrained SDEs with fairly restrictive assumptions on the drift terms. Typical adaptive control schemes place constraints on the learned parameters and their update rules violate the drift conditions. To this end, we extend the contraction theory to the case of constrained systems represented by reflected stochastic differential equations and generalize the allowable drifts. We show how the general theory can be used to derive quantitative contraction bounds on a nonlinear stochastic adaptive regulation problem.
|
|
14:30-14:45, Paper MoA10.5 | Add to My Program |
Adaptive Backstepping Attitude Control of a Rigid Body with State Quantization |
|
Schlanbusch, Siri Marte | University of Agder |
Zhou, Jing | University of Agder |
Schlanbusch, Rune | Teknova |
Keywords: Adaptive control, Quantized systems, Nonlinear systems
Abstract: In this paper, the attitude tracking control problem of a rigid body is investigated where the states are quantized. An adaptive backstepping based control scheme is developed and a new approach to stability analysis is developed by constructing a new compensation scheme for the effects of the vector state quantization. It is shown that all closed-loop signals are ensured uniformly bounded and the tracking errors converge to a compact set containing the origin. Experiments on a 2 degrees-of-freedom helicopter system illustrate the proposed control scheme.
|
|
14:45-15:00, Paper MoA10.6 | Add to My Program |
ASPR-Based Adaptive Feedback Control with Adaptive PFC and FF Input Based on RBF NN for Unstable Systems |
|
Akaike, Kota | Kumamoto University |
Mizumoto, Ikuro | Kumamoto Univ |
Keywords: Adaptive control, Direct adaptive control, Robust adaptive control
Abstract: It is well recognized that Almost Strictly Positive Real~(ASPR)-based output feedback control has robustness with respect to disturbances and system's uncertainty. However, most practical systems do not have ASPR property. To solve this problem, the introduction of a parallel feedforward compensator~(PFC) bas been proposed in order to render the resulting augmented system ASPR and adaptive type PFC design methods have been also provided already. The adaptive method can design an appropriate PFC by only utilizing input and output online data of the system without the strict information of the system, and thus it is effective for the uncertain systems. However, most of the schemes were only for stable systems. In this paper, we proposed the adaptive PFC design method for unstable systems. Moreover, the adaptive feedforward~(FF) input based on Radial Basis Function~(RBF) Neural Network~(NN) is introduced in order to achieve the adequate output tracking. The stability of the obtained adaptive control system is also analyzed and the boundedness of all the signals in the control system will be shown.
|
|
MoA11 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Linear Systems I |
|
|
Chair: Maestre, J.M. | University of Seville |
Co-Chair: Johansson, Mikael | KTH - Royal Institute of Technology |
|
13:30-13:45, Paper MoA11.1 | Add to My Program |
A Linear Programming Approach to Computing Safe Sets for Software Rejuvenation |
|
Arauz, Teresa | University of Seville |
Maestre, J.M. | University of Seville |
Romagnoli, Raffaele | Carnegie Mellon University |
Sinopoli, Bruno | Washington University in St Louis |
Camacho, Eduardo F. | Univ. of Sevilla |
Keywords: Linear systems, Fault tolerant systems, Emerging control applications
Abstract: Software rejuvenation was born to fix operating system faults by periodically refreshing the run-time code and data. This mechanism has been extended to protect control systems from cyber-attacks. This work proposes a software rejuvenation design method in discrete-time where invariant sets for the safety and mission controllers are designed to schedule the timing of software refreshes. To compute a minimal robust positively invariant (min-RPI) set and the bounded time between software refreshes to ensure system safety, an LP based approach is proposed for stable and unstable systems. Finally, the designed approach is illustrated by the case study of a simulated lab-scale microgrid.
|
|
13:45-14:00, Paper MoA11.2 | Add to My Program |
Infinite-Horizon Risk-Constrained Linear Quadratic Regulator with Average Cost |
|
Zhao, Feiran | Tsinghua University |
You, Keyou | Tsinghua University |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Linear systems, Optimal control, Stochastic optimal control
Abstract: The behaviour of a stochastic dynamical system may be largely influenced by those low-probability, yet extreme events. To address it, this paper proposes an infinite-horizon risk-constrained Linear Quadratic Regulator (LQR) framework with average cost. Besides the standard LQR objective, the average one-stage predictive variance of the state penalty is constrained within a user-specified level. By leveraging the dual theory, its optimal solution is first shown to be stationary and affine in the state, i.e., u(lambda^*) = -K(lambda^*)x + l(lambda^*), where lambda^* is an optimal multiplier to address the risk constraint. Then, we establish the stability of the resulting closed-loop system. Furthermore, we propose a primal-dual method with sublinear convergence rate to find an optimal policy-multiplier pair (u(lambda^*), lambda^*). Finally, a numerical example is given to demonstrate the effectiveness of the proposed framework and the primal-dual method.
|
|
14:00-14:15, Paper MoA11.3 | Add to My Program |
Willems’ Fundamental Lemma Based on Second-Order Moments |
|
Ferizbegovic, Mina | KTH |
Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
Mattsson, Per | Uppsala University |
Schön, Thomas (Bo) | Uppsala University |
Keywords: Linear systems, Identification, Subspace methods
Abstract: In this paper, we propose variations of Willems’ fundamental lemma that utilize second-order moments such as correlation functions in the time domain and power spectra in the frequency domain. We believe that using a formulation with estimated correlation coefficients is suitable for data compression, and possibly can reduce noise. Also, the formulations in the frequency domain can enable modeling of a system in a frequency region of interest.
|
|
14:15-14:30, Paper MoA11.4 | Add to My Program |
H-2 Output Feedback Control of Differential-Algebraic Systems |
|
Sperila, Andrei | Faculty of Automatics and Computer Science, "Politehnica" Univer |
Oara, Cristian | Univ. Polytechnica Bucharest |
Ciubotaru, Bogdan D. | Faculty of Automatic Control and Computers, Polytechnic Universi |
Keywords: Optimal control, Differential-algebraic systems, Linear systems
Abstract: By employing the properties of centered realizations, we devise a modified version of the dual Riccati equation approach to optimal H-2 control for differential-algebraic systems that is guaranteed to be more computationally efficient and numerically accurate than other methods from literature. Moreover, we show that if the optimal controller has an improper transfer function matrix, then all controllers which ensure a finite H-2 norm will share this property.
|
|
14:30-14:45, Paper MoA11.5 | Add to My Program |
Contributions to Output Controllability for Linear Time Varying Systems |
|
Danhane, Baparou | University of Lorraine, CNRS, CRAN, F-54000 Nancy, France |
Loheac, Jerome | CNRS, Universite De Lorraine |
Jungers, Marc | CNRS - Université De Lorraine |
Keywords: Linear systems, Time-varying systems
Abstract: The purpose of this paper is to provide some contributions to one notion of Output controllability for Linear Time Varying (LTV) systems which is the Complete State to Output Controllability (CSOC), notion introduced in the 60s by P.E.~Sarachik and G.M.~Kranc. More precisely, we consider LTV systems with direct transmission of the input to the output and establish criteria to ensure the CSOC in finite time of these systems. We also give, under the assumption of CSOC in finite time, an explicit expression of a continuous control built by means of a Gramian matrix.
|
|
14:45-15:00, Paper MoA11.6 | Add to My Program |
Pole-Placement for Non-Overshooting Reference Tracking |
|
Taghavian, Hamed | KTH Royal Institute of Technology |
Drummond, Ross | University of Oxford |
Johansson, Mikael | KTH - Royal Institute of Technology |
Keywords: Linear systems
Abstract: We revisit the classical pole-placement controller design problem with the objective of ensuring non-overshooting and error-free reference tracking. To this end, we present a number of novel conditions for external positivity of discrete-time transfer functions which characterise this non-overshooting property. These conditions are both necessary and sufficient for first- and second-order systems, but only sufficient for higher-order systems. In addition, they have simple geometric interpretations in terms of allowed locations of the closed-loop poles. Based on these conditions, we propose a control design procedure that guarantees a stable closed-loop system with non-overshooting, error-free tracking of step changes in the reference.
|
|
MoA12 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Estimation and Control of Infinite Dimensional Systems I |
|
|
Chair: Burns, John A | Virginia Tech |
Co-Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Burns, John A | Virginia Tech |
|
13:30-13:45, Paper MoA12.1 | Add to My Program |
Accurate Approximate Regulation of Nonlinear Delay Differential Control Systems (I) |
|
Aulisa, Eugenio | Texas Tech University |
Burns, John A | Virginia Tech |
Gilliam, David S. | Texas Tech University |
Paruchuri, Sai Tej | Lehigh University |
Keywords: Computational methods, Delay systems, Nonlinear output feedback
Abstract: In this paper we present an approximate feedback controller design methodology for tracking/disturbance-rejection regulation problems governed by nonlinear delay differential control systems. The method considered here is a version of the practical regulation approach developed by the authors in a series of articles. It is important to note that this approach to regulation does not require the existence of an exo-system to define disturbances and signals to be tracked. Therefore, this control law enables tracking and disturbance rejection for general reference and disturbance signals. The idea is similar to the inclusion of a cascade controller design providing a sequence of increasingly more accurate and better preforming controllers. The underlying principle derives from well known geometric methods which rely on the existence of an attractive invariant manifold in the case when the reference and disturbance signals are outputs of an autonomous, linear, neutrally stable exo-system. However, we are able accomplish high performance tracking without this assumption. References are provided for the history of the methodology and proofs of the error estimates for general systems. We show, in our included example, that the tracking error can be significantly enhanced using one extra step in the sequence of controllers. In particular, at each step in the cascade controller the error from the previous step provides the reference signal for the next step. In this way, at each step the errors are reduced geometrically.
|
|
13:45-14:00, Paper MoA12.2 | Add to My Program |
Robust Control of PDEs with Disturbances Using Mobile Actuators Constrained Over Time-Varying Reachability Sets (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Iftime, Orest V. | University of Groningen |
Zhuk, Sergiy | IBM |
Keywords: Distributed parameter systems
Abstract: We design a practical mobile actuator guidance policy for linear parabolic equations in 2D: the guidance is chosen so that H-2-measure of uncertainty is minimized provided the system is subject to a distributed disturbance. We first present a guidance policy where the mobile actuator location to be selected will be fixed over a certain time interval of interest. Further we add extra complexity by taking into account the dynamics of the mobile actuator over the 2D domain of interest under reachability constraints. The proposed approach is illustrated through numerical studies.
|
|
14:00-14:15, Paper MoA12.3 | Add to My Program |
Sampled-Data Boundary Control of a Class of Reaction-Diffusion PDEs with Collocated Sensing and Actuation (I) |
|
Rathnayake, Bhathiya | Student (Rensselaer Polytechnic Institute, New York 12180, USA) |
Diagne, Mamadou | Rensselaer Polytechnic Institute |
Karafyllis, Iasson | National Technical University of Athens |
Keywords: Distributed parameter systems, Sampled-data control, Nonlinear output feedback
Abstract: This paper provides an observer-based sampled-data boundary control strategy for a class of reaction-diffusion PDEs with collocated sensing and Robin actuation. Infinite-dimensional backstepping design is used as the underlying control approach. For the first time, it is shown that the continuous-time output feedback boundary control applied in a sample-and-hold fashion ensures global closed-loop exponential stability, provided that the sampling period is sufficiently small. Further, robustness to perturbations of the sampling schedule is guaranteed. A simulation example demonstrates that the convergence can still be achieved with sampling periods significantly larger than the theoretical maximum of the sampling schedule diameter.
|
|
14:15-14:30, Paper MoA12.4 | Add to My Program |
Adaptive Control of Coupled Hyperbolic PDEs with Piecewise-Constant Inputs and Identification (I) |
|
Wang, Ji | Xiamen University |
Krstic, Miroslav | University of California, San Diego |
Keywords: Distributed parameter systems, Adaptive control, Identification
Abstract: We present an event-triggered state-feedback boundary control scheme with a, likewise, event-triggered batch least-squares parameter identification for a 2times 2 hyperbolic PDE-ODE system, where two coefficients of the in-domain couplings between two transport PDEs, and the system parameter of a scalar ODE are unknown. The triggering condition is designed based on evaluating both the actuation deviation caused by the difference between the plant states and their sampled values, and the growth of the plant norm. When either condition is met, the piecewise-constant control input and parameter estimates are updated simultaneously. In the closed-loop system, the following results are obtained: 1) the absence of a Zeno phenomenon; 2) finite-time exact identification of the unknown parameters from all but a measure zero set of initial conditions; 3) exponential regulation of the plant states to zero. In the numerical simulation, the design is verified in an application of axial vibration control of a mining cable elevator, where the damping coefficients of the cable and the cage are unknown.
|
|
14:30-14:45, Paper MoA12.5 | Add to My Program |
Stabilization by Switching of Semilinear Heat Equation with Spatially Scheduled Actuators (I) |
|
Kang, Wen | University of Science and Technology Beijing |
Fridman, Emilia | Tel-Aviv Univ |
Liu, Chuan-Xin | University of Science and Technology Beijing |
Keywords: Distributed parameter systems, Sampled-data control, Lyapunov methods
Abstract: This paper discusses sampled-data stabilization by switching for 1-D nonlinear reaction-diffusion equation with spatially scheduled actuators. We suggest that the interval [0,1] is divided into N subdomains. We assume that N sensors are placed in each subdomain and measure the average value of the state in the discrete time. We stabilize the system by switching sampled-data static output-feedback. This switching control law can be implemented either by using one moving actuator that can move to the active subdomain in the negligible time or by N actuators placed in each subdomain. In the latter case switching control may reduce the energy that the system spends. Constructive conditions are derived to ensure that the resulting closed-loop system is exponentially stable by means of the Lyapunov approach. A numerical example illustrates the efficiency of the method.
|
|
14:45-15:00, Paper MoA12.6 | Add to My Program |
Minimum Safety Factor Control in Tokamaks Via Optimal Allocation of Spatially Moving Electron Cyclotron Current Drive (I) |
|
Paruchuri, Sai Tej | Lehigh University |
Pajares, Andres | Lehigh University |
Schuster, Eugenio | Lehigh University |
Keywords: Distributed parameter systems, Feedback linearization, Control applications
Abstract: Tokamaks are torus-shaped devices designed to confine a plasma (ionized gas at around 100 million degrees where fusion reactions can take place) using helical magnetic fields. Such magnetic confinement enables light ions, such as isotopes of hydrogen, to stay confined long enough to undergo a fusion reaction. The pitch of the helical magnetic field in a tokamak is characterized by the safety factor q. The safety factor is closely related to the magnetohydrodynamic stability of the plasma. For instance, instabilities that can degrade or even terminate plasma confinement can occur at spatial locations with rational values of the safety factor q. Thus, actively increasing the minimum magnitude of the safety factor can reduce the occurrence of low-order (low rational q values) instabilities. Non-inductive sources of current like neutral beam injection (NBI) and electron cyclotron current drive (ECCD) are used to control the q-profile. ECCD generates electromagnetic waves to drive current and/or heat the plasma. Mirrors are used to control the spatial region of incidence of the generated electromagnetic waves. In this work, the ECCD mirror’s position is treated as a controllable input, and its effects are included in the response model used for control design. A controller based on feedback linearization is proposed to simultaneously allocate the NBI and ECCD powers and the ECCD position to track a target minimum safety factor. The effectiveness of the controller is assessed for a DIII-D tokamak scenario in nonlinear one-dimensional simulations using COTSIM (Control-Oriented Transport SIMulator).
|
|
MoA13 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Distributed Control I |
|
|
Chair: Smith, Roy S. | ETH Zurich |
Co-Chair: Gasparri, Andrea | Roma Tre University |
|
13:30-13:45, Paper MoA13.1 | Add to My Program |
A Distributed Framework for Linear Adaptive MPC |
|
Parsi, Anilkumar | ETH Zurich |
Aboudonia, Ahmed | ETH Zurich |
Iannelli, Andrea | ETH Zurich |
Lygeros, John | ETH Zurich |
Smith, Roy S. | ETH Zurich |
Keywords: Predictive control for linear systems, Distributed control, Adaptive control
Abstract: Adaptive model predictive control (MPC) robustly ensures safety while reducing uncertainty during operation. In this paper, a distributed version is proposed to deal with network systems featuring multiple agents and limited communication. To solve the problem in a distributed manner, structure is imposed on the control design ingredients without sacrificing performance. Decentralized and distributed adaptation schemes that allow for a reduction of the uncertainty online compatibly with the network topology are also proposed. The algorithm ensures robust constraint satisfaction, recursive feasibility and finite gain l2 stability, and yields lower closed-loop cost compared to robust distributed MPC in simulations.
|
|
13:45-14:00, Paper MoA13.2 | Add to My Program |
Data-Driven Output Synchronization of Heterogeneous Leader-Follower Multi-Agent Systems |
|
Jiao, Junjie | Technical University of Munich |
van Waarde, Henk J. | University of Cambridge |
Trentelman, Harry L. | Univ. of Groningen |
Camlibel, M. Kanat | University of Groningen |
Hirche, Sandra | Technische Universität München |
Keywords: Distributed control, Control of networks, Linear systems
Abstract: This paper deals with data-driven output synchronization for heterogeneous leader-follower linear multi-agent systems. Given a multi-agent system that consists of one autonomous leader and a number of heterogeneous followers with external disturbances, we provide necessary and sufficient data-based conditions for output synchronization. We also provide a design method for obtaining such output synchronizing protocols directly from data. The results are then extended to the special case that the followers are disturbance-free. Finally, a simulation example is provided to illustrate our results.
|
|
14:00-14:15, Paper MoA13.3 | Add to My Program |
A Time Transformation Approach to Finite-Time Distributed Control with Reduced Information Exchange |
|
Kurtoglu, Deniz | Izmir Democracy University |
Yucelen, Tansel | University of South Florida |
Keywords: Control of networks, Cooperative control
Abstract: This paper focuses on time-critical multiagent systems over the finite-time interval [0,T). Specifically, we propose a distributed control algorithm that guarantees agents to approach a time-varying command at time T with reduced information exchange. The key feature of the proposed algorithm allows T to be user-defined, where it neither depends on the initial conditions of agents nor unknown but bounded time rate of change of the command. Furthermore, an event- triggered approach is utilized to reduce agent-to-agent information exchange. Predicated on a time transformation approach, the proposed algorithm defined over the finite-time interval [0,T) is first transformed to an equivalent form over the infinite-time interval [0,infinity) through a strictly increasing and continuously differentiable function that guarantees solutions in both time intervals being equivalent. Over the infinite-time interval [0,infinity), we then use a quadratic energy function, the Bernoulli equation, and the comparison principle to establish stability and convergence properties of our algorithm.
|
|
14:15-14:30, Paper MoA13.4 | Add to My Program |
A Finite-Time Distributed Protocol for Link Prediction in Networked Multi-Agent Systems |
|
Lippi, Martina | Roma Tre University |
Santilli, Matteo | University of Roma Tre |
Oliva, Gabriele | University Campus Bio-Medico of Rome |
Gasparri, Andrea | Roma Tre University |
Keywords: Distributed control, Agents-based systems, Networked control systems
Abstract: In this paper we address the finite-time distributed link prediction problem in networked multi-agent systems which consists in estimating the likelihood of existence of neighboring links in the network, exploiting only local information. Notably, this setting extends the framework introduced by Pech et al., which considered a complete network topology and was based on a centralized architecture, to the case of sparse graphs and distributed computation. Briefly, first we derive an optimality condition for the problem and then we develop a distributed protocol which drives the agents to satisfy in finite-time the optimality condition by relying on a state-of-the-art distributed k-hop observer. Numerical simulations demonstrate the effectiveness of the proposed distributed protocol.
|
|
14:30-14:45, Paper MoA13.5 | Add to My Program |
Event-Triggered Distributed Stabilization of Interconnected Multiagent Systems with Abnormal Agent and Control Layers |
|
Rezaei, Vahid | University of Colorado at Denver |
Keywords: Distributed control, Cooperative control
Abstract: A graph theoretic framework recently has been proposed for the robust distributed stabilization of interconnected multiagent systems, while systematically capturing the architectural aspect of cyber-physical systems with separate agent or physical layer and control or cyber layer. We consider a scenario where the control layer faces a series of centralized denial of service attacks, and the agent layer is subject to the modeling uncertainties. We propose a step-by-step procedure to design a control layer that, in the presence of the aforementioned abnormalities, guarantees a level of robustness and resiliency for the final two-layer interconnected multiagent system. The incorporation of an event-triggered strategy further ensures an effective use of the limited energy and communication resources over the control layer. We theoretically prove the resilient, robust, and Zeno-free convergence of all state trajectories to the origin and, via a simulation study, discuss the feasibility of the proposed ideas.
|
|
14:45-15:00, Paper MoA13.6 | Add to My Program |
Collaborative Guidance of UAV-Transported Semi-Flexible Payloads in Environments with Obstacles |
|
Hegde, Aditya | Indian Institute of Science |
Ghose, Debasish | Indian Institute of Science |
Keywords: Distributed control, Cooperative control
Abstract: Collaborative load manipulation and transportation is an emerging application of multi-unmanned aerial vehicle (UAV) systems. We address a problem where a team of UAVs transport a semi-flexible payload in an environment with multiple obstacles. By considering a semi-flexible payload, we restrict its flexibility to allow for safe-shape manipulation, which helps to navigate between obstacles while avoiding inter-UAV and payload-obstacle collisions. Control barrier functions (CBFs) are used to construct the obstacle avoidance and shape constraints, which are applied along with actuator constraints to an optimization-based control problem. The analysis is supported with simulation results for a team of four UAVs manipulating a payload in the presence of obstacles and settling on a standoff circle about a target.
|
|
MoA14 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Sensor Networks |
|
|
Chair: Alamo, Teodoro | Universidad De Sevilla |
Co-Chair: PRIEL, AVIV | Technion Israeli Institute of Technology |
|
13:30-13:45, Paper MoA14.1 | Add to My Program |
Breadth-First Coupled Sensor Configuration and Path-Planning in Unknown Environments |
|
St. Laurent, Chase | Worcester Polytechnic Institute |
Cowlagi, Raghvendra V. | Worcester Polytechnic Institute |
Keywords: Sensor networks, Autonomous systems, Estimation
Abstract: We present a breadth-first sensor configuration strategy to find near-optimal placement and sensor field of view (FoV). The strategy couples the sensor configuration procedure directly with the decision making task of planning a path for an agent in an unknown static environment comprised of threats. This coupled sensor configuration and path-planning (CSCP) strategy iteratively uses Gaussian Process Regression to construct a threat field estimate and find a candidate optimal path with minimum threat exposure. The strategy utilizes a unique task-driven information gain (TDIG) metric, which yields the sensor configurations when maximized. Due to the non-convex and non-submodular nature of the problem, we present an approximation for the optimization of the TDIG metric. Finally, we discuss the performance of the breadth-first strategy in contrast to a standard and depth-first strategy as well as traditional information-maximization.
|
|
13:45-14:00, Paper MoA14.2 | Add to My Program |
An Improved Distributed Consensus Kalman Filter Design Approach |
|
PRIEL, AVIV | Technion Israeli Institute of Technology |
Zelazo, Daniel | Technion - Israel Institute of Technology |
Keywords: Sensor networks, Kalman filtering
Abstract: This paper proposes an improved design approach for distributed consensus Kalman filtering (DCKF). We provide an improved consensus gain factor compared to the sub-optimal design proposed in [1]. This factor is derived from an LMI appearing in the stability analysis of the DCKF and can be computed using semi-definite programming. We also propose a decentralized consensus gain that can be computed by each agent in the sensor network, and depends only on local properties of the network, i.e., the number of neighbors of each sensor. We show in simulation that this approach holds even for networks with time varying communication regime. Our results are compared to other existing solutions in the literature with a numerical example.
|
|
14:00-14:15, Paper MoA14.3 | Add to My Program |
A New L-Step Neighbourhood Distributed Moving Horizon Estimator |
|
Venturino, Antonello | Université Paris-Saclay (CentraleSupelec and ONERA) |
Bertrand, Sylvain | ONERA |
Stoica Maniu, Cristina | CentraleSupélec/L2S |
Alamo, Teodoro | Universidad De Sevilla |
Camacho, Eduardo F. | Univ. of Sevilla |
Keywords: Estimation, Sensor networks, Linear systems
Abstract: This paper focuses on Distributed State Estimation over a peer-to-peer sensor network composed by possible low-computational sensors. We propose a new l-step Neighbourhood Distributed Moving Horizon Estimation technique with fused arrival cost and pre-estimation, improving the accuracy of the estimation, while reducing the computation time compared to other approaches from the literature. Simultaneously, convergence of the estimation error is improved by means of spreading the information amongst neighbourhoods, which comes natural in the sliding window data present in the Moving Horizon Estimation paradigm.
|
|
14:15-14:30, Paper MoA14.4 | Add to My Program |
Optimization in Open Networks Via Dual Averaging |
|
Hsieh, Yu-Guan | Université Grenoble Alpes |
Iutzeler, Franck | Univ. Grenoble Alpes |
Malick, Jérome | CNRS-INRIA INRIA Rhône-Alpes, 655 Avenue De l'Europe, Montbonnot |
Mertikopoulos, Panayotis | French National Center for Scientific Research (CNRS) |
Keywords: Optimization, Optimization algorithms, Sensor networks
Abstract: In networks of autonomous agents (e.g., fleets of vehicles, scattered sensors), the problem of minimizing the sum of the agents’ local functions has received a lot of interest. We tackle here this distributed optimization problem in the case of open networks when agents can join and leave the network at any time. Leveraging recent online optimization techniques, we propose and analyze the convergence of a decentralized asynchronous optimization method for open networks.
|
|
14:30-14:45, Paper MoA14.5 | Add to My Program |
Dynamic Coverage Meets Regret: Unifying Two Control Performance Measures for Mobile Agents in Spatiotemporally Varying Environments |
|
Haydon, Benjamin | North Carolina State University |
Mishra, Kirti | The Ohio State University |
Keyantuo, Patrick | University of California, Berkeley |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Chow, Fotini Katopodes | University of California, Berkeley |
Moura, Scott | University of California, Berkeley |
Vermillion, Christopher | North Carolina State University |
Keywords: Stochastic optimal control, Sensor networks, Energy systems
Abstract: Numerous mobile robotic applications require agents to persistently explore and exploit spatiotemporally varying, partially observable environments. Ultimately, the mathematical notion of regret, which quite simply represents the instantaneous or time-averaged difference between the optimal reward and realized reward, serves as a meaningful measure of how well the agents have exploited the environment. However, while numerous theoretical regret bounds have been derived within the machine learning community, restrictions on the manner in which the environment evolves preclude their application to persistent missions. On the other hand, meaningful theoretical properties can be derived for the related concept of dynamic coverage, which serves as an exploration measurement but does not have an immediately intuitive connection with regret. In this paper, we demonstrate a clear correlation between an appropriately defined measure of dynamic coverage and regret, then go on to derive performance bounds on dynamic coverage as a function of the environmental parameters. We evaluate the correlation for several variants of an airborne wind energy system, for which the objective is to adjust the operating altitude in order to maximize power output in a spatiotemporally evolving wind field.
|
|
MoA15 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Event-Triggered and Self-Triggered Control I |
|
|
Chair: Heemels, W.P.M.H. | Eindhoven University of Technology |
Co-Chair: Nowzari, Cameron | George Mason University |
Organizer: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Hirche, Sandra | Technische Universität München |
Organizer: Johansson, Karl H. | Royal Institute of Technology |
Organizer: Nowzari, Cameron | George Mason University |
|
13:30-13:45, Paper MoA15.1 | Add to My Program |
Leader-Following Event-Triggered Practical Consensus with Disturbance Rejection of Multiple Uncertain Spacecraft Systems Over Switching Networks (I) |
|
Wang, Tianqi | The Chinese University of Hong Kong |
Huang, Jie | The Chinese University of Hong Kong |
Keywords: Flight control, Cooperative control, Adaptive control
Abstract: This paper studies the leader-following practical consensus with disturbance rejection problem of a group of rigid spacecraft systems with uncertain inertia matrices, which extends our previous work by further considering disturbance rejection. Like our previous results, we use the adaptive distributed observer technique to handle switching communication constraints. To handle the disturbance, we further adopt the internal model approach to convert our problem into a practical stabilization problem of an augmented system. Specific effort is made on designing an event-triggered control law with an event-triggered mechanism that avoids the Zeno phenomenon.
|
|
13:45-14:00, Paper MoA15.2 | Add to My Program |
Dynamic Self-Triggered Control for Nonlinear Systems Based on Hybrid Lyapunov Functions (I) |
|
Hertneck, Michael | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Control over communications, Sampled-data control, Networked control systems
Abstract: Self-triggered control (STC) is a well-established technique to reduce the amount of samples for sampled-data systems, and is hence particularly useful for Networked Control Systems. At each sampling instant, an STC mechanism determines not only an updated control input but also when the next sample should be taken. In this paper, a dynamic STC mechanism for nonlinear systems is proposed. The mechanism incorporates a dynamic variable for determining the next sampling instant. Such a dynamic variable for the trigger decision has been proven to be a powerful tool for increasing sampling intervals in the closely related concept of ETC, but was so far not exploited for STC. This gap is closed in this paper. For the proposed mechanism, the dynamic variable is chosen to be the filtered values of the Lyapunov function at past sampling instants. The next sampling instant is, based on the dynamic variable and on hybrid Lyapunov function techniques, chosen such that an average decrease of the Lyapunov function is ensured. The proposed mechanism is illustrated with a numerical example from the literature. For this example, the obtained sampling intervals are significantly larger than for existing static STC mechanisms.
|
|
14:00-14:15, Paper MoA15.3 | Add to My Program |
Event-Triggered Safety-Critical Control for Systems with Unknown Dynamics (I) |
|
Xiao, Wei | Boston University |
Belta, Calin | Boston University |
Cassandras, Christos G. | Boston University |
Keywords: Lyapunov methods, Constrained control, Uncertain systems
Abstract: This paper addresses the problem of safety-critical control for systems with unknown dynamics. It has been shown that stabilizing affine control systems to desired (sets of) states while optimizing quadratic costs subject to state and control constraints can be reduced to a sequence of quadratic programs (QPs) by using Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs). Our recently proposed High Order CBFs (HOCBFs) can accommodate constraints of arbitrary relative degree. One of the main challenges in this approach is obtaining accurate system dynamics, which is especially difficult for systems that require online model identification given limited computational resources and system data. In order to approximate the real unmodeled system dynamics, we define adaptive affine control dynamics which are updated based on the error states obtained by real-time sensor measurements. We define an HOCBF for a safety requirement on the unmodeled system based on the adaptive dynamics and error states, and reformulate the safety-critical control problem as the above mentioned QP. Then, we determine the events required to solve the QP in order to guarantee safety, and derive a condition that guarantees the satisfaction of the HOCBF constraint between events. We illustrate the effectiveness of the proposed framework on adaptive cruise control and compare it with the classical time-driven approach.
|
|
14:15-14:30, Paper MoA15.4 | Add to My Program |
Event-Triggered Observer Design for Linear Systems (I) |
|
Petri, Elena | CRAN, Université De Lorraine, CNRS |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Astolfi, Daniele | CNRS - LAGEPP Univ Lyon 1 |
Nesic, Dragan | University of Melbourne |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Hybrid systems, Observers for Linear systems, Networked control systems
Abstract: We present an event-triggered observer design for linear time-invariant systems, where the measured output is sent to the observer only when a triggering condition is satisfied. We proceed by emulation and we first construct a continuous-time Luenberger observer. We then propose a dynamic rule to trigger transmissions, which only depends on the plant output and an auxiliary scalar state variable. The overall system is modeled as a hybrid system, for which a jump corresponds to an output transmission. We show that the proposed event-triggered observer guarantees global practical asymptotic stability for the estimation error dynamics. Moreover, under mild boundedness conditions on the plant state and its input, we prove that there exists a uniform strictly positive minimum inter-event time between any two consecutive transmissions, guaranteeing that the system does not exhibit Zeno solutions. Finally, the proposed approach is applied to a numerical case study of a lithium-ion battery.
|
|
14:30-14:45, Paper MoA15.5 | Add to My Program |
Event-Triggered Control for Systems with State Delays Using a Positive Systems Approach (I) |
|
Mazenc, Frederic | Inria Saclay |
Malisoff, Michael | Louisiana State University |
Barbalata, Corina | Louisiana State University |
Jiang, Zhong-Ping | New York University |
Keywords: Delay systems
Abstract: We provide a new positive system approach to event-triggered feedback control of linear systems with state delays. We use an interval observer method and linear Lyapunov functions to prove global exponential stability of the closed loop systems. Our two examples illustrate the usefulness of our method for counteracting the effects of potentially destabilizing state delays.
|
|
14:45-15:00, Paper MoA15.6 | Add to My Program |
Event-Triggered State Estimation with Multiple Noisy Sensor Nodes (I) |
|
Scheres, Koen | Eindhoven University of Technology |
Chong, Michelle S. | Eindhoven University of Technology |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Estimation, Sensor networks, Hybrid systems
Abstract: General nonlinear continuous-time systems are considered for which its state is estimated via a packet-based communication network. We assume that the system has multiple sensor nodes, affected by measurement noise, which can transmit at discrete (non-equidistant) points in time. Moreover, each node can transmit asynchronously. For this setup, we develop a state estimation framework, where the transmission instances of the individual sensor nodes can be generated in either time-triggered or event-triggered fashions. In the latter case, we guarantee the absence of Zeno behavior by construction. It is shown that, under the provided design conditions, an input-to-state stability property is obtained for the estimation error with respect to the measurement noise and process disturbances and that the state is thus reconstructed asymptotically in the absence of noise.
|
|
MoA16 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Formal Verification and Synthesis I |
|
|
Chair: Jungers, Raphaël M. | University of Louvain |
Co-Chair: Lindemann, Lars | University of Pennsylvania |
|
13:30-13:45, Paper MoA16.1 | Add to My Program |
Leveraging Classification Metrics for Qualitative System-Level Analysis with Temporal Logic Specifications |
|
Badithela, Apurva | California Institute of Technology |
Wongpiromsarn, Tichakorn | Iowa State University |
Murray, Richard M. | California Inst. of Tech |
Keywords: Formal Verification/Synthesis, Autonomous systems, Machine learning
Abstract: In many autonomy applications, performance of perception algorithms is important for effective planning and control. In this paper, we introduce a framework for computing the probability of satisfaction of formal system specifications given a confusion matrix, a statistical average performance measure for multi-class classification. We define the probability of satisfaction of a linear temporal logic formula given a specific initial state of the agent and true state of the environment. Then, we present an algorithm to construct a Markov chain that represents the system behavior under the composition of the perception and control components such that the probability of the temporal logic formula computed over the Markov chain is consistent with the probability that the temporal logic formula is satisfied by our system. We illustrate this approach on a simple example of a car with pedestrian on the sidewalk environment, and compute the probability of satisfaction of safety requirements for varying parameters of the vehicle. We also illustrate how satisfaction probability changes with varied precision and recall derived from the confusion matrix. Based on our results, we identify several opportunities for future work in developing quantitative system-level analysis that incorporates perception models.
|
|
13:45-14:00, Paper MoA16.2 | Add to My Program |
Time-Robust Control for STL Specifications |
|
Rodionova, Alena | University of Pennsylvania |
Lindemann, Lars | University of Pennsylvania |
Morari, Manfred | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Formal Verification/Synthesis, Autonomous systems, Robust control
Abstract: We present a robust control framework for time-critical systems in which satisfying real-time constraints robustly is of utmost importance for the safety of the system. Signal Temporal Logic (STL) provides a formal means to express a large variety of real-time constraints over signals and is suited for planning and control purposes as it allows us to reason about the time robustness of such constraints. The time robustness of STL particularly quantifies the extent to which timing uncertainties can be tolerated without violating real-time specifications. In this paper, we first pose a control problem in which we aim to find an optimal input sequence to a control system that maximizes the time robustness of an STL constraint. We then propose a Mixed Integer Linear Program (MILP) encoding and provide correctness guarantees along with a complexity analysis of the encoding. We also show in two case studies that maximizing STL time robustness allows to account for timing uncertainties of the underlying control system.
|
|
14:00-14:15, Paper MoA16.3 | Add to My Program |
Zonotope-Based Controller Synthesis for LTL Specifications |
|
Ren, Wei | Univeristy of Louvain |
Calbert, Julien | UCLouvain |
Jungers, Raphaël M. | University of Louvain |
Keywords: Autonomous systems, Formal Verification/Synthesis, Hierarchical control
Abstract: This paper studies the controller synthesis problem for Linear Temporal Logic (LTL) specifications using (constrained) zonotope techniques. To begin with, we implement (constrained) zonotope techniques to partition the state space and further to verify whether the LTL specification can be satisfied. Once the LTL specification can be satisfied, the next step is to design a controller to guarantee the satisfaction of the LTL specification for dynamic systems. Based on the verification of the LTL specification, an abstraction-based control design approach is proposed in this paper: a novel abstraction construction is developed first, then finite local abstract controllers are designed to achieve the LTL specification, and finally the designed abstract controllers are combined and refined as the controller for the original system. The proposed control design strategy is illustrated via a numerical example from autonomous robots.
|
|
14:15-14:30, Paper MoA16.4 | Add to My Program |
Qualitative Planning in Imperfect Information Games with Active Sensing and Reactive Sensor Attacks: Cost of Unawareness |
|
Kulkarni, Abhishek | University of Florida at Gainesville |
Han, Shuo | University of Illinois at Chicago |
Leslie, Nandi | U.S. Army Research Laboratory |
Kamhoua, Charles | U.S. Army Research Laboratory |
Fu, Jie | University of Florida |
Keywords: Formal Verification/Synthesis, Discrete event systems, Cyber-Physical Security
Abstract: We consider the probabilistic planning problem where a robot (called Player 1, or P1) can jointly plan the control actions and sensor queries in a sensor network and an attacker (called player 2, or P2) can carry out attacks on the sensors. We model such an adversarial interaction using a formal model--a reachability game with partially controllable observation functions. The main contribution of this paper is to assess the cost of P1's unawareness: Suppose P1 misinterprets the sensor failures as probabilistic node failures due to unreliable network communication, and P2 is aware of P1's misinterpretation and P1's partial observability. Then, from which states can P2 carry out sensor attacks to ensure, with probability one, that P1 will not be able to complete her reachability task even though, due to misinterpretation, P1 believes that she can almost-surely achieve her task. We develop an algorithm to solve the almost-sure winning sensor-attack strategy given P1's observation-based strategy. Our attack analysis could be used for attack detection in wireless communication networks and the design of provably secured attack-aware sensor allocation in decision-theoretic models for cyber-physical systems.
|
|
14:30-14:45, Paper MoA16.5 | Add to My Program |
Alternating Simulation on Hierarchical Abstractions |
|
Calbert, Julien | UCLouvain |
Legat, Benoît | UCLouvain |
Egidio, Lucas N. | Université Catholique De Louvain |
Jungers, Raphaël M. | University of Louvain |
Keywords: Formal Verification/Synthesis, Hybrid systems, Optimal control
Abstract: Abstraction techniques provide formal guarantees for generic optimal control problems on nonlinear and hybrid systems. Computing an abstraction solving the problem over the whole state-space is computationally demanding in high dimensional spaces. We circumvent this curse of dimensionality by introducing a hierarchical abstraction approach for solving an optimal control problem for nonlinear systems with three nested partitions. These nested partitions allow the construction of auxiliary systems that characterize simulation relations, which are suitably exploited to provide upper and lower bounds for a branch and bound algorithm to yield an optimal solution for the control problem. An example illustrates the proposed method.
|
|
14:45-15:00, Paper MoA16.6 | Add to My Program |
Symbolic Abstractions from Data: A PAC Learning Approach |
|
Devonport, Alex | University of California, Berkeley |
Saoud, Adnane | CentraleSupelec |
Arcak, Murat | University of California, Berkeley |
Keywords: Formal Verification/Synthesis, Hybrid systems, Statistical learning
Abstract: Symbolic control techniques aim to satisfy complex logic specifications. A critical step in these techniques is the construction of a symbolic (discrete) abstraction, a finite-state system whose behaviour mimics that of a given continuous-state system. The methods used to compute symbolic abstractions, however, require knowledge of an accurate closed-form model. To generalize them to systems with unknown dynamics, we present a new data-driven approach that does not require closed-form dynamics, instead relying only the ability to evaluate successors of each state under given inputs. To provide guarantees for the learned abstraction, we use the Probably Approximately Correct (PAC) statistical framework. We first introduce a PAC-style behavioural relationship and an appropriate refinement procedure. We then show how the symbolic abstraction can be constructed to satisfy this new behavioural relationship. Moreover, we provide PAC bounds that dictate the number of data required to guarantee a prescribed level of accuracy and confidence. Finally, we present an illustrative example.
|
|
MoA17 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Control Applications I |
|
|
Chair: Patrascu, Monica | University Politehnica of Bucharest |
Co-Chair: Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
|
13:30-13:45, Paper MoA17.1 | Add to My Program |
Error Analysis of Rotating Wave Approximation in Control of Spins in Nuclear Magnetic Resonance Spectroscopy |
|
Sarkar, Sambeda | Indian Institute of Technology Bombay |
Paruchuri, Pradyumna | Indian Institute of Technology, Bombay |
khaneja, navin | Harvard University |
Keywords: Time-varying systems, Control applications, Quantum information and control
Abstract: Rotating wave approximation (RWA) is a well-known technique that has been used to obtain analytical solutions in complex quantum optical models. In this article, the problem of approximating the solution to a single input bilinear system by means of RWA is addressed. This is motivated by the calculation of the response of the nuclei in nuclear magnetic resonance (NMR) spectroscopy to an open-loop controller. The paper presents an analysis on the error resulting from RWA in controlling spin states of the nuclei. We prove the applicability of the rotating wave approximation in the control of spin states by providing a bound on the error of approximation. A simulation of the same has been presented to validate the claim.
|
|
13:45-14:00, Paper MoA17.2 | Add to My Program |
Accommodation of Pulsed Field Gradients with Cascade Field Regulation in Powered Magnets |
|
McPheron, Benjamin | Anderson University |
Schiano, Jeffrey L | Pennsylvania State Univ |
Litvak, Ilya M. | National High Magnetic Field Laboratory |
Brey, William | National High Magnetic Field Laboratory, Florida State Universit |
Keywords: Control applications, Sampled-data control
Abstract: High magnetic fields significantly improve the resolution and sensitivity of nuclear magnetic resonance (NMR) spectroscopy measurements, which presents exciting research opportunities in areas of chemistry, biology, and material science. Powered magnets can provide much higher magnetic fields than persistent mode superconducting magnets but suffer from temporal magnetic field fluctuations due to power supply ripple and variations in cooling water temperature and flow rate which make powered magnets non-viable for high resolution NMR experiments. Previous work has demonstrated that a multi-rate sampled data cascade control system may be used to improve the resolution of NMR experiments in powered magnets. Despite these advances in reducing temporal magnetic field fluctuations, the field regulation design does not accommodate the use of pulsed field gradients, which are necessary in many NMR experiments. This work presents a control topology which accommodates the use of pulsed field gradient signals with the field regulation system. This control approach is verified using NMR measurements.
|
|
14:00-14:15, Paper MoA17.3 | Add to My Program |
A Novel Fixed-Time Sliding Mode Control of Quadrotor with Experiments and Comparisons |
|
Yu, Li | National University of Defense Technology |
He, Guang | National University of Defense Technology |
Wang, Xiangke | National University of Defense Technology |
Shen, Lincheng | National University of Defense Technology |
Keywords: Autonomous vehicles, Control applications, Variable-structure/sliding-mode control
Abstract: In this paper, a practical fixed-time sliding mode controller is designed for quadrotors. A novel fixed-time stable system is derived, which has a faster convergence rate than existing methods. The proof of the fixed-time convergence is presented. Meanwhile, the faster convergence rate compared with the other two methods is detailed using simulations. Based on this derivation, a fixed-time sliding mode controller with bounded convergence time independent of initial conditions is developed. The comparative simulation and flight experiment results are presented, showing that the proposed control scheme is practical to an actual quadrotor and can achieve good control performance.
|
|
14:15-14:30, Paper MoA17.4 | Add to My Program |
Micro-Scale Particle Assembly Control with Particle-To-Particle Potential Interaction |
|
Matei, Ion | Palo Alto Research Center |
Plochowietz, Anne | Palo Alto Research Center |
de Kleer, Johan | Palo Alto Research Center |
Baras, John S. | University of Maryland |
Keywords: MEMs and Nano systems, Modeling, Optimal control
Abstract: We address the problem of simultaneous control of micro-objects (particles) immersed in dielectric fluid. An electric field, shaped by an array of thousands of electrodes is used to transport and position particles using dielectrophoretic forces. We use a lumped, 2D, capacitive based (nonlinear) model of motion for the particles behavior that include particle to particle interactions. The particle positions are tracked using a high speed camera and image processing algorithms. We use a model predictive control (MPC) approach to derive control inputs (i.e., electrode potentials) that shape the particles into a desired pattern. To scale the problem with the number of inputs, the control inputs are parameterized as smooth time and space dependent functions avoiding the need to consider each electrode potential as a separate control input. We use automatic differentiation to compute gradients of the particle potential energy, and of the MPC loss and constrain functions. We demonstrate our approach on a scenario where nine particles are transported to a set a final positions while a tenth one is kept stationary.
|
|
14:30-14:45, Paper MoA17.5 | Add to My Program |
Robust Position Control for High Slip Risk Tricycle Robots with Real-Coded Genetic Algorithms |
|
Patrascu, Monica | University of Bergen |
Gheorghe, Viorel Ionut | University Politehnica of Bucharest |
Keywords: Control applications, Robotics, Evolutionary computing
Abstract: Robust position control is the staple of navigation in mobile robotics. Tricycle robots are nonholonomic ground vehicles with three wheels. While actuated separately, traction and steering are processes coupled through nonlinearities generated by the kinematic model. During operation, environmental changes can cause the model parameters to vary through three unstable behaviors. This study presents a robust control system with modified P-D controllers tuned with real-coded genetic algorithms within a response matching scheme. Results show robustness for 2D trajectories under significant parameter variation, with respect to nonholonomic robot body limitations.
|
|
14:45-15:00, Paper MoA17.6 | Add to My Program |
Vehicle Rollover Avoidance by Parameter-Adaptive Reference Governor |
|
Berntorp, Karl | Mitsubishi Electric Research Labs |
Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Automotive control, Automotive systems
Abstract: This paper describes an approach to the vehicle rollover prevention problem that includes estimation of parameters affecting the roll dynamics and a controller accounting for uncertainties in such parameter estimation. We develop a parameter-adaptive reference governor (PARG) that modifies the driver steering input to enforce a rollover avoidance constraint, and state and input constraints. We design a recursive Bayesian estimator that produces confidence estimates of the parameters, including the center-of-gravity height. The confidence estimates inform a supervised learning algorithm, which constructs online constraint admissible sets that are leveraged by the PARG to ensure rollover prevention. Simulation results on a Fishhook maneuver show that the method robustly prvents rollover, and that the resulting parameter estimates are contained in the confidence sets produced by the Bayesian estimator.
|
|
MoA18 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Biological Systems and Biologically-Inspired Methods I |
|
|
Chair: Broucke, Mireille E. | Univ. of Toronto |
Co-Chair: Gouze, Jean-Luc | INRIA |
|
13:30-13:45, Paper MoA18.1 | Add to My Program |
Adaptive Internal Models in the Optokinetic System |
|
Erin, Battle | University of Toronto |
Broucke, Mireille E. | Univ. of Toronto |
Keywords: Biological systems, Output regulation, Adaptive control
Abstract: We present a control-theoretic model of the optokinetic system, an eye movement system for tracking a moving visual surround. The model adheres to the neural circuit in the brain and is based on the application of adaptive internal models to capture the contribution of the cerebellum. The model is validated through simulations, recovering the basic behaviors of the optokinetic system known from experimental studies.
|
|
13:45-14:00, Paper MoA18.2 | Add to My Program |
Neuron Growth Control by PDE Backstepping: Axon Length Regulation by Tubulin Flux Actuation in Soma |
|
Demir, Cenk | University of California, San Diego |
Koga, Shumon | University of California, San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Cellular dynamics, Distributed parameter systems, Stability of nonlinear systems
Abstract: In this work, stabilization of an axonal growth in a neuron associated with the dynamics of tubulin concentration is proposed by designing a boundary control. The dynamics are given by a parabolic Partial Differential Equation (PDE) of the tubulin concentration, with a spatial domain of the axon's length governed by an Ordinary Differential Equation (ODE) coupled with the tubulin concentration in the growth cone. We propose a novel backstepping method for the coupled PDE-ODE dynamics with a moving boundary, and design a control law for the tubulin concentration flux in the soma. Through employing the Lyapunov analysis to a nonlinear target system, we prove a local exponential stability of the closed-loop system under the proposed control law in the spatial H_1-norm.
|
|
14:00-14:15, Paper MoA18.3 | Add to My Program |
Modulation of Stochastic Gene Expression by Nuclear Export Processes |
|
Smith, Madeline | University of Delaware |
Soltani, Mohammad | University of Delaware |
Kulkarni, Rahul | University of Massachusetts Boston |
Singh, Abhyudai | University of Delaware |
Keywords: Biological systems, Genetic regulatory systems, Cellular dynamics
Abstract: Inside mammalian cells, single genes are known to be transcribed in stochastic bursts leading to the synthesis of nuclear RNAs that are subsequently exported to the cytoplasm to create mRNAs. We systematically characterize the role of export processes in shaping the extent of random fluctuations (i.e. noise) in the mRNA level of a given gene. Using the method of Partitioning of Poisson arrivals, we derive an exact analytical expression for the noise in mRNA level assuming that the nuclear retention time of each RNA is an independent and identically distributed random variable following an arbitrary distribution. These results confirm recent experimental/theoretical findings that decreasing the nuclear export rate buffers the noise in mRNA level, and counterintuitively, decreasing the noise in the nuclear retention time enhances the noise in the mRNA level. Next, we further generalize the model to consider a dynamic extrinsic disturbance that affects the nuclear-to-cytoplasm export. Our results show that noise in the mRNA level varies non-monotonically with the disturbance timescale. More specifically, high- and low-frequency external disturbances have little impact on the mRNA noise level, while noise is amplified at intermediate frequencies. In summary, our results systematically uncover how the coupling of bursty transcription with nuclear export can both attenuate or amplify noise in mRNA levels depending on the nuclear retention time distribution and the presence of extrinsic fluctuations.
|
|
14:15-14:30, Paper MoA18.4 | Add to My Program |
Blood Glucose Regulation in Patients with Type 1 Diabetes Mellitus: A Robust MRAC Approach |
|
Franco, Roberto | TecNM/I.T. La Laguna |
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Ferreira de Loza, Alejandra | IPN CITEDI |
Efimov, Denis | Inria |
Cassany, Louis | IMS University of Bordeaux |
Gucik-Derigny, David | Université Bordeaux I |
Cieslak, Jérôme | IMS Laboratory - University of Bordeaux |
Henry, David | Universite De Bordeaux |
Keywords: Biological systems, Metabolic systems, Adaptive control
Abstract: This paper deals with the problem of robust blood glucose regulation in critically ill patients affected by type 1 diabetes mellitus. The blood glucose measurement and insulin infusion are intravenous. The proposed algorithm regulates blood glucose and keeps it in the normoglycemia range, i.e., 70–180 mg/dl. To this aim, a control law is proposed based on a Nonlinear Model Reference Adaptive Control approach. The algorithm is composed of nonlinear adaptive gains that ensure convergence to zero of the regulation error. The approach is validated in the UVA/Padova metabolic simulator for ten in silico adult patients with unannounced meals. The results perform well and have minimal risk of hyperglycemic and hypoglycemic events.
|
|
14:30-14:45, Paper MoA18.5 | Add to My Program |
Hierarchical MPC Applied to Bacterial Resource Allocation and Metabolite Synthesis |
|
Yabo, Agustín Gabriel | INRIA |
Caillau, Jean-Baptiste | Université Côte d'Azur, CNRS, Inria, LJAD |
Gouze, Jean-Luc | INRIA |
Keywords: Biological systems, Predictive control for nonlinear systems, Optimal control
Abstract: Microorganisms have evolved submitted to a continuous optimisation process that has improved their capacity to proliferate in nature, developing highly optimized distribution mechanisms of their resources. Considering the microbial self-replication process as a resource allocation problem is a novel approach that has motivated numerous applications to the artificial production of metabolites of interest. Model-based optimal control studies are essential in understanding these naturally-evolved allocation strategies, but they are usually represented by open-loop control laws. In this context, we introduce a hierarchical shrinking-horizon non-linear MPC scheme that aims to maximise the production of a metabolite of interest. The control loop acts on an external signal that is able to disrupt the natural allocation process. The approach uses an optimal control-based input parametrisation that takes into account the structure of the open-loop natural allocation strategy of the cell, to emulate a closed-loop control law. We provide examples of the open-loop control strategies, and simulations of the hierarchical scheme.
|
|
14:45-15:00, Paper MoA18.6 | Add to My Program |
Optimal Trajectory Generation with State Inequality Constraints for a Bioreactor |
|
Kurth, Anna-Carina | Institute for System Dynamics, University of Stuttgart |
Arnold, Eckhard | University of Stuttgart |
Sawodny, Oliver | University of Stuttgart |
Keywords: Optimal control, Biological systems
Abstract: The dynamics of microorganisms in a bioreactor are described by population systems. The tracking of the biomass of a desired trajectory is ensured by means of a nonlinear control law, as long as the reference trajectory complies with certain naturally given inequality constraints. To generate such a reference trajectory, an optimal control problem based on a double integrator with nonlinear state inequality constraints is solved. The necessary optimality criteria are extracted and the differential equations are solved. Using these solutions and parameter assumptions, which are shown to be non-limiting, it is proven that at most one constraint at a time is active. In addition, it is shown that the intervals in which no constraint is active and those in which one constraint is active alternate.
|
|
MoA19 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Robotics I |
|
|
Chair: Jiang, Zhong-Ping | New York University |
Co-Chair: Andersson, Sean B. | Boston University |
|
13:30-13:45, Paper MoA19.1 | Add to My Program |
Analysis of an Extremum Seeking Controller under Bounded Disturbance |
|
Pinto, Samuel C. | Boston University |
Andersson, Sean B. | Boston University |
Keywords: Robotics, Autonomous robots, Uncertain systems
Abstract: One of the applications of Extremum Seeking (ES) is to localize the source of a scalar field by using a mobile agent that can measure this field at its current location. While the scientific literature has presented many approaches to this problem, a formal analysis of the behavior of ES controllers for source seeking in the presence of disturbances is still lacking. This paper aims to fill this gap by analyzing a specific version of an ES control algorithm in the presence of source movement and measurement disturbances. We define an approximate version of this controller that captures the main features but allows for a simplified analysis and formally characterize its convergence properties. Through simulations and physical experiments, we compare the theoretically-predicted regions of attraction of the simplified system with the behavior of the full system and show that the simplified version is a good predictor of the behavior of the initial ES controller.
|
|
13:45-14:00, Paper MoA19.2 | Add to My Program |
Swing up Control of a Soft Inverted Pendulum with Revolute Base |
|
Weerakoon, Lasitha | University of Maryland |
Chopra, Nikhil | University of Maryland, College Park |
Keywords: Robotics, Nonlinear systems
Abstract: In this paper, we introduce a novel soft robotic system, which is a soft inverted pendulum with a revolute joint at the base. This is an underactuated system as the revolute joint is not actuated. The soft body is hypothesized to be of constant curvature and it is actuated. Motivated by swing up controllers for classical underactuated systems, a switching based swing up and stabilization control of the proposed soft robot system is studied. We demonstrate that the swing up control guides the soft robot to the desired energy level, which is the upright position. Once the swing up is completed, the control is switched to an LQR controller to achieve the stabilization at the vertically upright pose. The switching occurs when the swing up control brings the robot inside the region of attraction for the LQR controller. The simulation results are depicted to illustrate the effectiveness of the proposed control approach for the soft inverted pendulum system.
|
|
14:00-14:15, Paper MoA19.3 | Add to My Program |
Asymptotic Trajectory Tracking of Autonomous Bicycles Via Backstepping and Optimal Control |
|
Cui, Leilei | New York University |
Wang, Shuai | Tencent |
Zhang, Zhengyou | Tencent |
Jiang, Zhong-Ping | New York University |
Keywords: Robotics, Lyapunov methods, Stability of nonlinear systems
Abstract: This paper studies the trajectory tracking problem for an autonomous bicycle that is a non-minimum phase, strongly nonlinear system. As compared with most existing methods dealing only with approximate trajectory tracking, this paper proposes a constructive design to achieve asymptotic trajectory tracking. More specifically, under the assumption that the desired trajectory is generated by a virtual bicycle, a novel asymptotic trajectory tracking controller design scheme is presented. Firstly, the nonlinear dynamics of the autonomous bicycle is established. Secondly, it is decomposed into two interconnected subsystems: a tracking subsystem and a balancing subsystem. For the tracking subsystem, the popular backstepping approach is applied to determine the propulsive force of the bicycle. For the balancing subsystem, optimal control is applied to determine the steering angular velocity of the handlebar in order to balance the bicycle and align the bicycle with the desired yaw angle. Thirdly, to tackle the strong coupling between the tracking and the balancing systems, the small-gain technique is applied for the first time to prove the asymptotic stability of the closed-loop bicycle system. Finally, the efficacy of the proposed exact trajectory tracking control methodology is validated by numerical simulations.
|
|
14:15-14:30, Paper MoA19.4 | Add to My Program |
Robust Disturbance Rejection for Robotic Bipedal Walking: System-Level-Synthesis with Step-To-Step Dynamics Approximation |
|
Xiong, Xiaobin | California Institute of Technology |
Chen, Yuxiao | California Institute of Technology |
Ames, Aaron | California Institute of Technology |
Keywords: Robotics, Robust control, Model/Controller reduction
Abstract: We present a stepping stabilization control that addresses external push disturbances on bipedal walking robots. The stepping control is synthesized based on the step-to-step (S2S) dynamics of the robot that is controlled to have an approximately constant center of mass (COM) height. We first learn a linear S2S dynamics with bounded model discrepancy from the undisturbed walking behaviors of the robot, where the walking step size is taken as the control input to the S2S dynamics. External pushes are then considered as disturbances to the learned S2S (L-S2S) dynamics. We then apply the system-level-synthesis (SLS) approach on the disturbed L-S2S dynamics to robustly stabilize the robot to the desired walking while satisfying the kinematic constraints of the robot. We successfully realize the proposed approach on the walking of the bipedal robot AMBER and Cassie subject to push disturbances, showing that the approach is general, effective, and computationally-efficient for robust disturbance rejection.
|
|
14:30-14:45, Paper MoA19.5 | Add to My Program |
Estimate-To-State Stability for Hybrid Human-Prosthesis Systems |
|
Gehlhar, Rachel | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Robotics, Stability of hybrid systems, Stability of nonlinear systems
Abstract: Control methods for lower-limb powered prostheses remain mostly model-independent and cannot always guarantee stability. Model-dependent prosthesis control methods yield a wider range of stability properties, but require knowledge of the interaction force between the human and prosthesis. Any error in force estimation compromise the formal guarantees. This paper addresses this uncertainty by formalizing the stability of the human-prosthesis system subject to force estimation error. A novel notion of estimate-to-state stability is introduced and provides a means to guarantee exponential convergence of the prosthesis to a set when the controller's model contains estimation error. Conditions are established to ensure input-to-state stability for the human's hybrid periodic orbits when subject to disturbances from the prosthesis control action deviating from its nominal control law. A class of estimate-to-state stable prosthesis controllers is proposed and implemented in simulation, demonstrating how the human-prosthesis system converges to a tube around the desired trajectory resulting in stable walking.
|
|
14:45-15:00, Paper MoA19.6 | Add to My Program |
Optimization-Free Ground Contact Force Constraint Satisfaction in Quadrupedal Locomotion |
|
Sihite, Eric | Northeastern University |
Dangol, Pravin | Northeastern University |
Ramezani, Alireza | Northeastern University |
Keywords: Robotics, Supervisory control, Constrained control
Abstract: We are seeking control design paradigms for legged systems that allow bypassing costly algorithms that depend on heavy on-board computers widely used in these systems and yet being able to match what they can do by using less expensive optimization-free frameworks. In this work, we present our preliminary results in modeling and control design of a quadrupedal robot called Husky Carbon, which under development at Northeastern University (NU) in Boston. In our approach, we utilized a supervisory controller and an Explicit Reference Governor (ERG) to enforce ground reaction force constraints. These constrained are usually enforced using costly optimizations. However, in this work, the ERG manipulates the state references applied to the supervisory controller to enforce the ground contact constraints through an update law based on Lyapunov stability arguments. As a result, the approach is much faster to compute than the widely used optimization-based methods.
|
|
MoB01 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Challenges in Data-Driven, Optimization-Based Control |
|
|
Chair: Langbort, Cedric | Univ of Illinois, Urbana-Champaign |
Co-Chair: Azizzadenesheli, Kamyar | Purdue University |
Organizer: Langbort, Cedric | Univ of Illinois, Urbana-Champaign |
Organizer: Azizzadenesheli, Kamyar | Purdue University |
|
15:10-15:25, Paper MoB01.1 | Add to My Program |
Thompson Sampling for Linear Quadratic Mean-Field Teams (I) |
|
Gagrani, Mukul | University of Southern California |
Sudhakara, Sagar | University of Southern California |
Mahajan, Aditya | McGill University |
Nayyar, Ashutosh | University of Southern California |
Ouyang, Yi | Preferred Networks |
Keywords: Learning, Mean field games, Statistical learning
Abstract: We consider optimal control of an unknown multi-agent linear quadratic (LQ) system where the dynamics and the cost are coupled across the agents through the mean-field (i.e., empirical mean) of the states and controls. Directly using single-agent LQ learning algorithms in such models results in regret which increases polynomially with the number of agents. We propose a new Thompson sampling based learning algorithm which exploits the structure of the system model and show that the expected Bayesian regret of our proposed algorithm for a system with agents of |M| different types at time horizon T is tilde{mathcal{O}} big( |M|^{1.5} sqrt{T} big) irrespective of the total number of agents, where the tilde{mathcal{O}} notation hides logarithmic factors in T. We present detailed numerical experiments to illustrate the salient features of the proposed algorithm.
|
|
15:25-15:40, Paper MoB01.2 | Add to My Program |
Anytime Proximity Moving Horizon Estimation: Stability and Regret for Nonlinear Systems (I) |
|
Gharbi, Meriem | University of Stuttgart |
Gharesifard, Bahman | University of California, Los Angeles |
Ebenbauer, Christian | RWTH Aachen University |
Keywords: Observers for nonlinear systems, Optimal control, Optimization algorithms
Abstract: In this paper, we reduce computational burden of moving horizon estimation (MHE) for discrete-time constrained nonlinear systems by providing an iteration scheme that computes a suboptimal state estimate after a limited number of gradient-based optimization algorithm iterations. The optimization algorithm is warm-started by an a priori estimate constructed based on a locally stable, model-based, and recursive state estimation strategy. Due to this implicit stabilizing regularization approach of the a priori estimate, we establish local exponential stability of the underlying estimation error by using Lyapunov arguments. Furthermore, by assuming convexity of the MHE problem, we characterize the performance of the iteration scheme relative to an estimator that knows for instance the optimal solutions in terms of regret upper bounds.
|
|
15:40-15:55, Paper MoB01.3 | Add to My Program |
On Direct vs Indirect Data-Driven Predictive Control |
|
Krishnan, Vishaal | University of California, Riverside |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Learning, Identification for control
Abstract: In this work, we compare the direct and indirect approaches to data-driven predictive control of stochastic linear time-invariant systems. The distinction between the two approaches lies in the fact that the indirect approach involves identifying a lower dimensional model from data which is then used in a certainty-equivalent control design, while the direct approach avoids this intermediate step altogether. Working within an optimization-based framework, we find that the suboptimality gap measuring the control performance w.r.t. the optimal model-based control design vanishes with the size of the dataset only with the direct approach, while the indirect approach incurs an asymptotic bias. On the other hand, the indirect approach, by relying on the identification of a lower dimensional model, has lower variance and outperforms the direct approach for smaller datasets. Ultimately, by revealing the existence of two non-asymptotic regimes for the performance of direct and indirect data-driven predictive control designs, our study suggests that neither approach is invariably superior and that the choice of design must, in practice, be informed by the available dataset.
|
|
15:55-16:10, Paper MoB01.4 | Add to My Program |
On the Stability of Nonlinear Receding Horizon Control: A Geometric Perspective (I) |
|
Westenbroek, Tyler | University of California, Berkeley |
Simchowitz, Max | UC Berkeley |
Jordan, Michael I. | UC Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Predictive control for nonlinear systems, Optimal control, Feedback linearization
Abstract: The widespread adoption of nonlinear Receding Horizon Control (RHC) strategies by industry has led to more than 30 years of intense research efforts to provide stability guarantees for these methods. However, current theoretical guarantees require that each (generally nonconvex) planning problem can be solved to (approximate) global optimality, which is an unrealistic requirement for the derivative-based local optimization methods generally used in practical implementations of RHC. This paper takes the first step towards understanding stability guarantees for nonlinear RHC when the inner planning problem is solved to first-order stationary points, but not necessarily global optima. Special attention is given to feedback linearizable systems, and a mixture of positive and negative results are provided. We establish that, under certain strong conditions, first-order solutions to RHC exponentially stabilize linearizable systems. Crucially, this guarantee requires that state costs applied to the planning problems are in a certain sense `compatible' with the global geometry of the system, and a simple counter-example demonstrates the necessity of this condition. These results highlight the need to rethink the role of global geometry in the context of optimization-based control.
|
|
16:10-16:25, Paper MoB01.5 | Add to My Program |
Assured Learning-Based Optimal Control Subject to Timed Temporal Logic Constraints (I) |
|
Fotiadis, Filippos | Georgia Institute of Technology |
Verginis, Christos | University of Texas at Austin |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Learning, Formal Verification/Synthesis, Optimal control
Abstract: We develop an algorithm for the optimal control of systems governed by unknown, nonlinear dynamics, to deliver tasks expressed as timed temporal logic constraints. The algorithm first computes a sequence of points in the operating environment, along with associated time stamps, so that the system completes its task if it follows the sequence. For the algorithm's second step, we develop a data-driven, on-the-fly control mechanism that learns how to transition from a point in the sequence to the next within a pre-specified time horizon. This algorithm accounts for the unknown dynamics, any unsafe zones in the environment and additional optimality criteria. We show that, after a finite period of data gathering, the resulting controller guarantees that the system indeed follows the sequence of points, leading to the satisfaction of the task.
|
|
16:25-16:40, Paper MoB01.6 | Add to My Program |
Model Learning Predictive Control in Nonlinear Dynamical Systems (I) |
|
Lale, Sahin | Caltech |
Azizzadenesheli, Kamyar | Purdue University |
Hassibi, Babak | Caltech |
Anandkumar, Animashree | California Institute of Technology |
Keywords: Iterative learning control, Learning, Machine learning
Abstract: We study the problem of online learning and control in partially observable nonlinear dynamical systems, where the model dynamics are unknown and the controlling agent has only access to the system outputs. We propose Model Learning Predictive Control (MLPC), an efficient online control framework that learns to control the unknown system and minimizes the overall control cost. MLPC employs Random Fourier Features (RFF) to represent the nonlinear system dynamics and learns the underlying system up to a confidence interval. Once a reliable estimate of the dynamics is obtained, MLPC deploys an MPC oracle with the estimated system dynamics for planning. MLPC occasionally updates the underlying model estimates and improves the accuracy and the effectiveness of the MPC policies. We derive a novel finite-time approximation error bound under RFF learning and provide stability guarantees for single trajectory online control. We show that MLPC attains tilde{O}(T^{2/3}) regret after T time steps in online control of stable partially observable nonlinear systems against the controller that uses the same MPC oracle with the true system dynamics. We empirically demonstrate the performance of MLPC on the inverted pendulum task and show the flexibility of the proposed general framework via deploying different planning strategies for the controller design to achieve low-cost control policies.
|
|
MoB02 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Machine Learning II |
|
|
Chair: Yamashita, Atsushi | The Univeristy of Tokyo |
Co-Chair: Pasqualetti, Fabio | University of California, Riverside |
|
15:10-15:25, Paper MoB02.1 | Add to My Program |
Robust Adversarial Classification Via Abstaining |
|
Al Makdah, Abed AlRahman | University of California Riverside |
Katewa, Vaibhav | Indian Institute of Science Bangalore |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Pattern recognition and classification, Machine learning, Learning
Abstract: In this work, we consider a binary classification problem and cast it into a binary hypothesis testing framework, where the observations can be perturbed by an adversary. To improve the adversarial robustness of a classifier, we include an abstain option, where the classifier abstains from making a decision when it has low confidence about the prediction. We propose metrics to quantify the nominal performance of a classifier with an abstain option and its robustness against adversarial perturbations. We show that there exist a tradeoff between the two metrics regardless of what method is used to choose the abstain region. Our results imply that the robustness of a classifier with an abstain option can only be improved at the expense of its nominal performance. Further, we provide necessary conditions to design the abstain region for a 1- dimensional binary classification problem. We validate our theoretical results on the MNIST dataset, where we numerically show that the tradeoff between performance and robustness also exist for the general multi-class classification problems.
|
|
15:25-15:40, Paper MoB02.2 | Add to My Program |
Learning Stochastic Optimal Policies Via Gradient Descent |
|
Massaroli, Stefano | The Univeristy of Tokyo |
Poli, Michael | KAIST |
Peluchetti, Stefano | Cogent Labs |
Park, Jinkyoo | Korea Advanced Institute of Science and Technology |
Yamashita, Atsushi | The Univeristy of Tokyo |
Asama, Hajime | The University of Tokyo |
Keywords: Machine learning, Optimization, Stochastic optimal control
Abstract: We systematically develop a learning-based treatment of stochastic optimal control (SOC), relying on direct optimization of parametric control policies. We propose a derivation of adjoint sensitivity results for stochastic differential equations through direct application of variational calculus. Then, given an objective function for a predetermined task specifying the desiderata for the controller, we optimize their parameters via iterative gradient descent methods. In doing so, we extend the range of applicability of classical SOC techniques, often requiring strict assumptions on the functional form of system and control. We verify the performance of the proposed approach on a continuous-time, finite horizon portfolio optimization with proportional transaction costs.
|
|
15:40-15:55, Paper MoB02.3 | Add to My Program |
Federated Learning with Incrementally Aggregated Gradients |
|
Mitra, Aritra | University of Pennsylvania |
Jaafar, Rayana | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Hassani, Hamed | University of Pennsylvania |
Keywords: Machine learning, Optimization algorithms, Large-scale systems
Abstract: We consider the standard federated learning (FL) framework where a set of clients coordinate with a central server to train a statistical model. In a single-machine centralized setting, it is well-known that for smooth and strongly convex finite-sum optimization problems, one can design algorithms that guarantee exact linear convergence to the global minimum without computing full (batch) gradients at every iteration. Despite its popularity, an analog of the above result does not exist in FL. Motivated by this gap, we consider a setting where the local loss function of each client can be expressed as a finite sum of smooth component functions. For this setting, we propose a novel computationally-efficient FL algorithm called FedTrack that rests on two key ideas: (i) using the most recently communicated versions of the clients' gradients in the local update rule, and (ii) incrementally aggregating gradients of the component functions of each client. While the first idea serves to overcome the effect of heterogeneity across the clients' local loss functions, the second helps to significantly reduce the overall number of gradient computations. For both strongly convex and non-convex local loss functions, we prove that the convergence guarantees of FedTrack match their centralized counterparts (up to constants). In particular, for the strongly convex setting, we show that FedTrack guarantees exact linear convergence to the global minimum deterministically.
|
|
15:55-16:10, Paper MoB02.4 | Add to My Program |
Robust Learning of Recurrent Neural Networks in Presence of Exogenous Noise |
|
Amini, Arash | Lehigh UNiversity |
Liu, Guangyi | Lehigh University |
Motee, Nader | Lehigh University |
Keywords: Machine learning, Robust control, Estimation
Abstract: Recurrent Neural networks (RNN) have shown promising potential for learning dynamic features of sequential data. However, artificial neural networks are known to exhibit poor robustness in presence of noisy input, where the sequential architecture of RNNs exacerbates the problem. In this paper, we will use ideas from control and estimation theories to propose a tractable robustness analysis for RNN models that are subject to noisy inputs. The variance of the output of the noisy system is adopted as a robustness measure to quantify the impact of noise on learning. It is shown that the robustness measure can be estimated efficiently using linearization techniques. Using these results, we proposed a learning method to enhance robustness of a RNN with respect to exogenous Gaussian noise with known statistics. Several theoretical upper bounds are also presented to show how the robustness measure depends on the trainable parameters. Our extensive simulations on benchmark problems reveal that our proposed methodology significantly improves robustness of recurrent neural networks in a systematic manner.
|
|
16:10-16:25, Paper MoB02.5 | Add to My Program |
Learning a Stability Filter for Uncertain Differentially Flat Systems Using Gaussian Processes |
|
Greeff, Melissa | University of Toronto |
Hall, Adam W. | University of Toronto |
Schoellig, Angela P | University of Toronto |
Keywords: Machine learning, Optimization, Feedback linearization
Abstract: Many physical system models exhibit a structural property known as differential flatness. Intuitively, differential flatness allows us to separate the system’s nonlinear dynamics into a linear dynamics component and a nonlinear term. In this work, we exploit this structure and propose using a nonparametric Gaussian Process (GP) to learn the unknown nonlinear term. We use this GP in an optimization problem to optimize for an input that is most likely to feedback linearize the system (i.e., cancel this nonlinear term). This optimization is subject to input constraints and a stability filter, described by an uncertain Control Lyapunov Function (CLF), which probabilistically guarantees exponential trajectory tracking when possible. Furthermore, for systems that are control-affine, we choose to express this structure in the selection of the kernel for the GP. By exploiting this selection, we show that the optimization problem is not only convex but can be efficiently solved as a second-order cone program. We compare our approach to related works in simulation and show that we can achieve similar performance at much lower computational cost.
|
|
16:25-16:40, Paper MoB02.6 | Add to My Program |
A Variational Inequality Approach to Bayesian Regression Games |
|
Guo, Wenshuo | University of California, Berkeley |
Jordan, Michael I. | UC Berkeley |
Lin, Tianyi | University of California, Berkeley |
Keywords: Machine learning, Game theory, Variational methods
Abstract: Bayesian regression games are a special class of two-player general-sum Bayesian games in which the learner is partially informed about the adversary's objective through a Bayesian prior. This formulation captures the uncertainty in regard to the adversary, and is useful in problems where the learner and adversary may have conflicting, but not necessarily perfectly antagonistic objectives. Although the Bayesian approach is a more general alternative to the standard minimax formulation, the applications of Bayesian regression games have been limited due to computational difficulties, and the existence and uniqueness of a Bayesian equilibrium are only known for quadratic cost functions. First, we prove the existence and uniqueness of a Bayesian equilibrium for a class of convex and smooth Bayesian games by regarding it as a solution of an infinite-dimensional variational inequality (VI) in Hilbert space. We consider two special cases in which the infinite-dimensional VI reduces to a high-dimensional VI or a nonconvex stochastic optimization, and provide two simple algorithms of solving them with strong convergence guarantees. Numerical results on real datasets demonstrate the promise of this approach.
|
|
MoB03 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Learning-Based Control and Sweeping Processes II |
|
|
Chair: Zeidan, Vera | Michigan State University |
Co-Chair: Cao, Tan | SUNY Korea |
Organizer: Cao, Tan | SUNY Korea |
Organizer: Mordukhovich, Boris | Wayne State Univ |
Organizer: Malisoff, Michael | Louisiana State University |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
|
15:10-15:25, Paper MoB03.1 | Add to My Program |
Inverse Reinforcement Learning for Multi-Player Apprentice Games in Continuous-Time Nonlinear Systems (I) |
|
Lian, Bosen | The University of Texas at Arlington |
Xue, Wenqian | Northeastern University |
Lewis, Frank L. | University of Texas at Arlington |
Chai, Tianyou | Northeastern University |
Davoudi, Ali | University of Texas-Arlington |
Keywords: Adaptive control, Optimal control, Nonlinear systems
Abstract: We extend the inverse reinforcement learning (inverse RL) algorithms to multi-player apprentice games described by nonlinear differential equations. In these games, both the expert and the learner have N-player control inputs. Inverse RL algorithms solve the games by learner reconstructing the unknown cost function of each expert player using the demonstration of expert's behavior (states and control inputs of each player), thereby mimicking the given behaviors. We first develop a model-based inverse RL algorithm with two learning stages: an optimal control learning stage and an inverse optimal control learning stage. Then, a model-free off-policy integral inverse RL algorithm is developed by using online expert's demonstrations and learner's behavior trajectories without knowing system dynamics of either expert or the learner. Finally, simulations verify the effectiveness of proposed algorithms.
|
|
15:25-15:40, Paper MoB03.2 | Add to My Program |
Adaptive Model Predictive Safety Certification for Learning-Based Control |
|
Didier, Alexandre | ETH Zurich |
Wabersich, Kim Peter | ETH Zurich |
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Predictive control for linear systems, Learning
Abstract: We propose an adaptive Model Predictive Safety Certification (MPSC) scheme for learning-based control of linear systems with bounded disturbances and uncertain parameters with known bounds. An MPSC is a modular framework, which can be used in combination with any learning-based controller to ensure state and input constraint satisfaction of a dynamical system by solving an online optimisation problem. By continuously connecting the current system state with a safe terminal set using a robust tube, safety can be ensured. Thereby, the main sources of conservative safety interventions are model uncertainties and short planning horizons. We develop an adaptive mechanism to improve the system model, which leverages set-membership estimation to guarantee recursively feasible and non-decreasing safety performance improvements. In order to accommodate short prediction horizons, iterative safe set enlargements using previously computed robust backup plans are proposed. Finally, we illustrate the increase of the safety performance through the parameter and safe set adaptation for numerical examples with up to 16 state dimensions.
|
|
15:40-15:55, Paper MoB03.3 | Add to My Program |
Applications of Controlled Sweeping Processes to Nonlinear Crowd Motion Models with Obstacles |
|
Cao, Tan | SUNY Korea |
Mordukhovich, Boris | Wayne State Univ |
Nguyen, Nguyen-Truc-Dao | Wayne State University |
NGUYEN, Thi-Dai-Trang | Wayne State University |
Keywords: Optimal control, Control applications, Variational methods
Abstract: This paper mainly focuses on solving the dynamic optimization of the planar controlled crowd motion models with obstacles which is an application of a class of optimal control problems governed by a general perturbed nonconvex sweeping process. This can be considered as a significant extension of the previous work regarding the controlled crowd motion models, where the obstacles have not been considered. The necessary optimality conditions for the problem under consideration are established and illustrated by a nontrivial example of practical importance.
|
|
15:55-16:10, Paper MoB03.4 | Add to My Program |
Optimization of Controlled Free-Time Sweeping Processes with Applications to Marine Surface Vehicle Modeling |
|
Cao, Tan | SUNY Korea |
T. Khalil, Nathalie | Universidade Do Porto |
Mordukhovich, Boris | Wayne State Univ |
Nguyen, Nguyen-Truc-Dao | Wayne State University |
NGUYEN, Thi-Dai-Trang | Wayne State University |
Lobo Pereira, Fernando | Porto University |
Keywords: Optimal control, Variational methods, Control applications
Abstract: The paper is devoted to a free-time optimal control problem for sweeping processes. We develop a constructive finite-difference approximation procedure that allows us to establish necessary optimality conditions for discrete optimal solutions and then show how these optimality conditions are applied to solving a controlled marine surface vehicle model.
|
|
16:10-16:25, Paper MoB03.5 | Add to My Program |
Real-Time Modular Deep Neural Network-Based Adaptive Control of Uncertain Nonlinear Systems |
|
Le, Duc M. | University of Florida |
Greene, Max L. | University of Florida |
Makumi, Wanjiku A. | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Neural networks, Adaptive control, Lyapunov methods
Abstract: A real-time deep neural network (DNN) adaptive control architecture is developed for uncertain control-affine nonlinear systems to track a time-varying desired trajectory. A Lyapunov-based analysis is used to develop adaptation laws for the output-layer weights and develop constraints for inner-layer weight adaptation laws. Unlike existing works in neural network and DNN-based control, the developed method establishes a framework to simultaneously update the weights of multiple layers for a DNN of arbitrary depth in real-time. The real-time controller and weight update laws enable the system to track a time-varying trajectory while compensating for unknown drift dynamics and parametric DNN uncertainties. A nonsmooth Lyapunov-based analysis is used to guarantee semiglobal asymptotic tracking. Comparative numerical simulation results are included to demonstrate the efficacy of the developed method.
|
|
16:25-16:40, Paper MoB03.6 | Add to My Program |
Numerical Solution for a Controlled Nonconvex Sweeping Process |
|
Nour, Chadi | Lebanese American University |
Zeidan, Vera | Michigan State University |
Keywords: Optimal control, Numerical algorithms, Optimization
Abstract: A numerical method and the theory leading to its success are developed in this paper to solve nonstandard optimal control problems involving sweeping processes, in which the sweeping set C is non-convex and coincides with the zerosublevel set of a smooth function having a Lipschitz gradient, and the fixed initial state is allowed to be any point of C. This numerical method was introduced in [7] for a special form of our problem in which the function whose zero-sublevel set defines C is restricted to be twice differentiable and convex, and the initial state is confined in the interior of their convex set C. The remarkable feature of this method is manifested in approximating the sweeping process by a sequence of standard control systems invoking an innovative exponential penalty term in lieu of the normal cone, whose presence in the sweeping process renders most standard methods inapplicable. For a general setting, we prove that the optimal solution of the approximating standard optimal control problem converges uniformly to an optimal solution of the original problem (see Remark 4.1). This numerical method is shown to be efficient through an example in which C is not convex and the initial state is on its boundary.
|
|
MoB04 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Identification II |
|
|
Chair: Van den Hof, Paul M.J. | Eindhoven University of Technology |
Co-Chair: Bianchi, Federico | Politecnico Di Milano |
|
15:10-15:25, Paper MoB04.1 | Add to My Program |
Learning Local Modules in Dynamic Networks without Prior Topology Information |
|
C. Rajagopal, Venkatakrishnan | Eindhoven University of Technology |
Ramaswamy, Karthik R. | Eindhoven University of Technology |
Van den Hof, Paul M.J. | Eindhoven University of Technology |
Keywords: Identification, Network analysis and control, Learning
Abstract: Recently different identification methods have been developed for identifying a single module in a dynamic network. In order to select an appropriate predictor model one typically needs prior knowledge on the topology (interconnection structure) of the dynamic network, as well as on the correlation structure of the process disturbances. In this paper we present a new approach that incorporates the estimation of this prior information into the identification, leading to a fully data-driven approach for estimating the dynamics of a local module. The developed algorithm uses non-causal Wiener filters and a series of convex optimizations with parallel computation capabilities to estimate the topology, which subsequently is used to build the appropriate input/output setting for a predictor model in the local direct method under correlated process noise. A regularized kernel-based method is then employed to estimate the dynamic of the target module. This leads to an identification algorithm with attractive statistical properties that is scalable to handle larger-scale networks too. Numerical simulations illustrate the potential of the developed algorithm.
|
|
15:25-15:40, Paper MoB04.2 | Add to My Program |
Parsimonious System Identification from Quantized Observations |
|
Sleem, Omar | Pennsylvania State University |
Lagoa, Constantino M. | Pennsylvania State Univ |
Keywords: Identification, Optimization
Abstract: Quantization plays an important role as an interface between analog and digital environments. Since quantization is a many to few mapping, it is a non-linear irreversible process. This made, in addition of the quantization noise signal dependency, the traditional methods of system identification no longer applicable. In this work, we propose a method for parsimonious system identification when only quantized measurements of the output are observable. More precisely, we develop an algorithm that aims at identifying a low order system that is compatible with a priori information on the system and the collected quantized output information. Moreover, the proposed approach can be used even if only fragmented information on the quantized output is available. The proposed algorithm relies on an ADMM approach to l_p quasi-norm optimization. Numerical results highlight the performance of the proposed approach when compared to the l_1 minimization in terms of the sparsity of the induced solution.
|
|
15:40-15:55, Paper MoB04.3 | Add to My Program |
Optimal Observations for Identification of a Single Transfer Function in Acyclic Networks |
|
Jahandari, Sina | University of Minnesota |
Materassi, Donatello | University of Minnesota |
Keywords: Identification, Stochastic systems, Optimization
Abstract: The paper presents a systematic approach for finding the optimal set of predictors for consistent identification of a single transfer function in an acyclic dynamic network. It is assumed that the topology of the network is known, the forcing inputs are not measured, and the observations have positive additive costs. For a class of networks where the target node is not involved in a feedback loop, sufficient and necessary conditions are derived to consistently identify a certain transfer function via a multi-input single-output prediction error method. This enables designing a systematic graphical approach based on the notion of d-separation to look for an optimal set of predictors that minimizes an appropriate additive cost function. It is shown that the required conditions for consistency and optimality are equivalent to the notion of separation in an undirected graph resulted from systematically manipulating the graphical representation of the network. Then, some well-known algorithms from computer science can be used to find the optimal set of predictors.
|
|
15:55-16:10, Paper MoB04.4 | Add to My Program |
A Switched Nonlinear System Identification Method with Switching Location Refinement |
|
Yu, Miao | Politecnico Di Milano |
Bianchi, Federico | Politecnico Di Milano |
Piroddi, Luigi | Politecnico Di Milano |
Keywords: Identification, Switched systems, Randomized algorithms
Abstract: The identification of switched nonlinear systems involves solving a combinatorial problem that simultaneously addresses the sample-mode assignment and nonlinear model structure selection tasks. The complexity of this problem is often prohibitive, since mode switchings can take place at arbitrary times. To reduce it to an affordable level, one can constrain the mode switchings to occur only at few specific instants. This approach is effective if combined with a refinement strategy, aiming at correcting the number and location of switchings. In this paper, one such strategy is discussed, which employs a local optimization process to correct the position of switchings, and is also capable of detecting and removing redundant modes. An iterative method, applying both an identification step and a refinement step at all iterations, is tested on several numerical benchmarks to illustrate the effectiveness of the refinement strategy. The method does not require prior assumptions on the number of modes.
|
|
16:10-16:25, Paper MoB04.5 | Add to My Program |
Regularized Identification of Fast Time-Varying Systems - Comparison of Two Regularization Strategies |
|
Gancza, Artur | Gdansk University of Technology, Faculty of Electronics Telecomm |
Niedzwiecki, Maciej | Gdansk University of Technology |
Keywords: Identification, Time-varying systems, Estimation
Abstract: The problem of identification of a time-varying FIR system is considered and solved using the local basis function approach. It is shown that the estimation (tracking) results can be improved by means of regularization. Two variants of regularization are proposed and compared: the classical L2 (ridge) regularization and a new, reweighted L2 one. It is shown that the new approach outperforms the classical one and is computationally attractive.
|
|
16:25-16:40, Paper MoB04.6 | Add to My Program |
Interval Predictor Models for Robust System Identification |
|
Crespo, Luis G | NASA |
Kenny, Sean | NASA |
Colbert, Brendon | Arizona State University |
Slagel, Tanner | NASA LaRC |
Keywords: Identification, Uncertain systems, Optimization
Abstract: This paper proposes a framework for the quantification of structured uncertainty in a plant model according to multivariable input-output data. The only restriction imposed upon such a model is for its outputs to depend continuously on the parameters. An Interval Predictor Model (IPM) prescribes the parameters of a computational model as a bounded, path-connected set thereby making each predicted output an interval-valued function of the inputs. The formulation proposed seeks the parameter set leading to the tightest enclosure of the data. This set, which is modeled as a semi-algebraic set having a tunable complexity level, enables the characterization of parameter dependencies commonly found in practice. This representation of the uncertain parameters makes the resulting plant model amenable to robustness analysis and robust control techniques based on polynomial optimization. Furthermore, scenario theory is used to evaluate the generalization properties of the identified model. This evaluation yields a formally-verifiable, distribution-free upper bound on the probability of future data falling outside the predicted output intervals.
|
|
MoB05 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Stochastic Optimal Control I |
|
|
Chair: Rawlings, James B. | University of California, Santa Barbara |
Co-Chair: Mitchell, Ian M. | University of British Columbia |
|
15:10-15:25, Paper MoB05.1 | Add to My Program |
Stochastic Exponential Stability of Nonlinear Stochastic Model Predictive Control |
|
McAllister, Robert D. | University of California, Santa Barbara |
Rawlings, James B. | University of California, Santa Barbara |
Keywords: Stochastic optimal control, Stability of nonlinear systems, Stochastic systems
Abstract: In this work, we define and establish a selection of significant and new properties for nonlinear stochastic model predictive control (SMPC). First, we ensure that the underlying stochastic properties of the closed-loop stochastic system are indeed well-defined. We then define robust exponential stability in expectation (RESiE) and establish that nonlinear SMPC, under suitable assumptions, renders the origin of the closed-loop system RESiE. We conclude with a numerical example to demonstrate the implications of this analysis.
|
|
15:25-15:40, Paper MoB05.2 | Add to My Program |
Safe Motion Planning against Multimodal Distributions Based on a Scenario Approach |
|
Ahn, Heejin | University of British Columbia |
Chen, Colin | The University of British Columbia |
Mitchell, Ian M. | University of British Columbia |
Kamgarpour, Maryam | University of British Columbia |
Keywords: Stochastic optimal control, Autonomous vehicles
Abstract: We present the design of a motion planning algorithm that ensures safety for an autonomous vehicle. In particular, we consider a multimodal distribution over uncertainties; for example, the uncertain predictions of future trajectories of surrounding vehicles reflect discrete decisions, such as turning or going straight at intersections. We develop a computationally efficient, scenario-based approach that solves the motion planning problem with high confidence given a quantifiable number of samples from the multimodal distribution. Our approach is based on two preprocessing steps, which 1) separate the samples into distinct clusters and 2) compute a bounding polytope for each cluster. Then, we rewrite the motion planning problem approximately as a mixed-integer problem using the polytopes. We demonstrate via simulation on the nuScenes dataset that our approach ensures safety with high probability in the presence of multimodal uncertainties, and is computationally more efficient and less conservative than a conventional scenario approach.
|
|
15:40-15:55, Paper MoB05.3 | Add to My Program |
On the Time Discretization of the Feynman-Kac Forward-Backward Stochastic Differential Equations for Value Function Approximation |
|
Hawkins, Kelsey | Georgia Institute of Technology |
Pakniyat, Ali | University of Alabama |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Computational methods, Nonlinear systems
Abstract: Novel numerical estimators are proposed for the forward-backward stochastic differential equations (FBSDE) appearing in the Feynman-Kac representation of the value function. In contrast to the current numerical method approaches based on discretization of the continuous-time FBSDE results, we propose a converse approach, by first obtaining a discrete-time approximation of the on-policy value function, and then developing a discrete-time result which resembles the continuous-time counterpart. This approach yields improved numerical estimators in the function approximation phase, and demonstrates enhanced error analysis for those value function estimators. Numerical results and error analysis are demonstrated on a scalar nonlinear stochastic optimal control problem, and they show improvements in the performance of the proposed estimators in comparison with the state-of-the-art methodologies.
|
|
15:55-16:10, Paper MoB05.4 | Add to My Program |
A Convex Optimization Approach to Chance-Constrained Linear Stochastic Drift Counteraction Optimal Control |
|
Tang, Sunbochen | University of Michigan |
Li, Nan | University of Michigan |
Kolmanovsky, Ilya V. | The University of Michigan |
Zidek, Robert A. E. | University of Michigan |
Keywords: Stochastic optimal control, Constrained control, Optimization algorithms
Abstract: In this paper, we propose a convex optimization approach to chance-constrained drift counteraction optimal control (DCOC) problems for linear systems with additive stochastic disturbances. Chance-constrained DCOC aims to compute an optimal control law to maximize the time duration before the probability of violating a prescribed set of constraints can no longer be maintained to be below a specified risk level. While conventional approaches to this problem involve solving a mixed-integer programming problem, we show that an optimal solution to the problem can also be found by solving a convex second-order cone programming problem without integer variables. We illustrate the application of chance-constrained DCOC to an automotive adaptive cruise control example.
|
|
16:10-16:25, Paper MoB05.5 | Add to My Program |
Stochastic Optimal Control Via Hilbert Space Embeddings of Distributions |
|
Thorpe, Adam | University of New Mexico |
Oishi, Meeko | University of New Mexico |
Keywords: Stochastic optimal control, Machine learning, Autonomous systems
Abstract: Kernel embeddings of distributions have recently gained significant attention in the machine learning community as a data-driven technique for representing probability distributions. Broadly, these techniques enable efficient computation of expectations by representing integral operators as elements in a reproducing kernel Hilbert space. We apply these techniques to the area of stochastic optimal control theory and present a method to compute approximately optimal policies for stochastic systems with arbitrary disturbances. Our approach reduces the optimization problem to a linear program, which can easily be solved via the Lagrangian dual, without resorting to gradient-based optimization algorithms. We focus on discrete-time dynamic programming, and demonstrate our proposed approach on a linear regulation problem, and on a nonlinear target tracking problem. This approach is broadly applicable to a wide variety of optimal control problems, and provides a means of working with stochastic systems in a data-driven setting.
|
|
16:25-16:40, Paper MoB05.6 | Add to My Program |
Forward-Backward Rapidly-Exploring Random Trees for Stochastic Optimal Control |
|
Hawkins, Kelsey | Georgia Institute of Technology |
Pakniyat, Ali | University of Alabama |
Theodorou, Evangelos A. | Georgia Institute of Technology |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Stochastic optimal control, Nonlinear systems, Learning
Abstract: We propose a numerical method for the computation of the forward-backward stochastic differential equations (FBSDE) appearing in the Feynman-Kac representation of the value function in stochastic optimal control problems. By the use of the Girsanov change of probability measures, it is demonstrated how a rapidly-exploring random tree (RRT) can be utilized for the forward integration pass, as long as the controlled drift term is appropriately compensated in the backward integration pass. A numerical approximation of the value function is proposed by solving a series of function approximation problems backwards in time along the edges of the constructed RRT. Moreover, a local entropy-weighted least squares Monte Carlo (LSMC) method is developed to concentrate function approximation accuracy in regions most likely to be visited by optimally controlled trajectories.
|
|
MoB06 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Game Equilibrium Seeking and Learning II |
|
|
Chair: Staudigl, Mathias | Maastricht University |
Co-Chair: Shanbhag, Uday V. | Pennsylvania State University |
Organizer: Franci, Barbara | Maastricht University |
Organizer: Grammatico, Sergio | Delft Univ. of Tech |
Organizer: Pavel, Lacra | University of Toronto |
Organizer: Shanbhag, Uday V. | Pennsylvania State University |
Organizer: Staudigl, Mathias | Maastricht University |
|
15:10-15:25, Paper MoB06.1 | Add to My Program |
Consistent Control of a Stackelberg Game with Infinitely Many Followers (I) |
|
Thünen, Anna | RWTH Aachen University |
Herty, Michael | RWTH Aachen University |
Keywords: Mean field games, Optimal control, Hierarchical control
Abstract: We present a Stackelberg game with a large number of followers where every player--leader and followers--has its own state and control. We derive the mean field limit of infinitely many followers and address aspects of consistent control. Finally, we propose a numerical method based on the derived model and present numerical results.
|
|
15:25-15:40, Paper MoB06.2 | Add to My Program |
No-Regret Distributed Learning in Two-Network Zero-Sum Games (I) |
|
Huang, Shijie | Academy of Mathematics and Systems Science, Chinese Academy of S |
Lei, Jinlong | Tongji University |
Hong, Yiguang | Chinese Academy of Sciences |
Shanbhag, Uday V. | Pennsylvania State University |
Keywords: Game theory, Learning, Distributed parameter systems
Abstract: We consider a distributed learning problem in a two-network zero-sum game with finite action sets, where the agents within each network is connected through time-varying directed graphs and the agents from distinct networks are connected by time-varying bipartite graphs. Each agent in a network has its own cost function and can receive information from its neighbors, while the networks have no global decision-making capability. We propose a distributed multiplicative weights algorithm to compute a mixed-strategy Nash equilibrium. We first establish a sublinear regret bound on the sequence of iterates for each agent. Additionally, we study the time-averaged sequence of iterates and prove its convergence to the set of mixed-strategy Nash equilibria with suitably selected diminishing step-sizes.
|
|
15:40-15:55, Paper MoB06.3 | Add to My Program |
Equilibrium Tracking and Convergence in Dynamic Games (I) |
|
Mertikopoulos, Panayotis | French National Center for Scientific Research (CNRS) |
Staudigl, Mathias | Maastricht University |
Keywords: Game theory, Learning, Optimization algorithms
Abstract: In this paper, we examine the equilibrium tracking and convergence properties of no-regret learning algorithms in continuous games that evolve over time. Specifically, we focus on learning via "mirror descent", a widely used class of no- regret learning schemes where players take small steps along their individual payoff gradients and then "mirror" the output back to their action sets. In this general context, we show that the induced sequence of play stays asymptotically close to the evolving equilibrium of the sequence of stage games (assuming they are strongly monotone), and converges to it if the game stabilizes to a strictly monotone limit. Our results apply to both gradient- and payoff-based feedback, i.e., the "bandit" case where players only observe the payoffs of their chosen actions.
|
|
15:55-16:10, Paper MoB06.4 | Add to My Program |
No-Regret Learning for Repeated Concave Games with Lossy Bandits (I) |
|
Liu, Wenting | Tongji University |
Lei, Jinlong | Tongji University |
Yi, Peng | Tongji University |
Keywords: Game theory, Optimization algorithms, Randomized algorithms
Abstract: This paper considers no-regret learning for repeated continuous-kernel games with lossy bandit information. At each round, each player chooses an action perturbed around its intended action, and gets the utility value at the corresponding action profile. However, due to various uncertainties or high inquiring costs, the bandit feedback may be lost at random. Therefore, we focus on studying the asynchronous learning strategy of the players to adaptively adjust next actions for minimizing the long-term regret loss compared with a best-fixed action in the hindsight. The paper provides a novel no-regret learning algorithm, called Reweighted Online Gradient Descent with bandit (ROGD-b). We first give the regret analysis for continuous concave games with differentiable and Lipschitz utilities. Furthermore, we show that the action profile converges to Nash equilibrium with probability 1 when the game is strictly monotone. Numerical experiments are given to illustrate the performance of the algorithm.
|
|
16:10-16:25, Paper MoB06.5 | Add to My Program |
Adaptive Interventions for Social Welfare Maximization in Network Games (I) |
|
Shakarami, Mehran | University of Groningen |
Cherukuri, Ashish | University of Groningen |
Monshizadeh, Nima | University of Groningen |
Keywords: Game theory, Control of networks, Adaptive control
Abstract: We consider the problem of steering the actions of noncooperative players in quadratic network games to the social optimum. To this end, a central regulator modifies the marginal returns of the players, while the players’ strategies are determined by continuous pseudo-gradient dynamics. Depending on the available information on the players parameters and network quantities, suitable static and dynamic intervention protocols are devised that maximize the social welfare at steady-state. We show that adaptive interventions can compensate for the lack of knowledge on network topology and coupling weights. Numerical examples are provided to demonstrate the effectiveness of the proposed interventions.
|
|
16:25-16:40, Paper MoB06.6 | Add to My Program |
Second-Order Mirror Descent: Exact Convergence Beyond Strictly Stable Equilibria in Concave Games |
|
Gao, Bolin | University of Toronto |
Pavel, Lacra | University of Toronto |
Keywords: Game theory, Learning, Agents-based systems
Abstract: We propose a second-order extension of the continuous-time game-theoretic mirror descent (MD) dynamics, which we refer to as MD2, and show that MD2 can converge to merely (not necessarily strictly) variationally stable Nash equilibria while enjoying no-regret. MD2 also overcomes the inexact convergence problem of discounted MD. We then derive the rate of convergence of an augmented MD2 whenever the equilibrium is strongly variationally stable. Selected simulations are provided to illustrate and affirm our results.
|
|
MoB07 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Optimization II |
|
|
Chair: Annaswamy, Anuradha M. | Massachusetts Inst. of Tech |
Co-Chair: Uribe, Cesar A. | Rice University |
|
15:10-15:25, Paper MoB07.1 | Add to My Program |
Structured Projection-Free Online Convex Optimization with Multi-Point Bandit Feedback |
|
Ding, Yuhao | University of California, Berkeley |
Lavaei, Javad | UC Berkeley |
Keywords: Optimization, Optimization algorithms, Machine learning
Abstract: We consider structured online convex optimization (OCO) with bandit feedback, where either the loss function is smooth or the constraint set is strongly convex. Projection-free methods are among the most popular and computationally efficient algorithms for solving this problem, mainly due to their ability to handle convex constraints appearing in machine learning for which computing projections is often impractical in high-dimensional settings. Despite the improved regret bound results for the full-information setting where the gradients of the functions are readily available, it remains unclear whether simple projection-free zero-order algorithms become more efficient for structured OCO problems in the case when multiple function values can be sampled at each time instance. In this paper, we develop some simple projection-free algorithms and prove that they indeed achieve the same improved regret bounds as the full-information case under various additional problem structures. This implies that leveraging the structural properties of the problem compensates for the lack of access to the gradients. Experiments on the online matrix completion reveal several attractive advantages of the proposed algorithms, including their simplicity, easy implementation, and effectiveness, as they outperform other competing algorithms.
|
|
15:25-15:40, Paper MoB07.2 | Add to My Program |
Parallel Alternating Direction Primal-Dual (PADPD) Algorithm for Centralized Optimization |
|
Alaviani, Seyyed Shaho | University of Georgia |
Kelkar, Atul | Clemson University |
Keywords: Optimization, Optimization algorithms, Machine learning
Abstract: In this paper, a centralized two-block separable convex optimization with equality constraint is considered. The first fully parallel primal-dual discrete-time algorithm called Parallel Alternating Direction Primal-Dual (PADPD) is proposed. In the algorithm, the primal variables are updated in an alternating fashion like Alternating Direction Method of Multipliers (ADMM). The algorithm can handle non-smoothness of objective functions with strong convergence. Unlike existing discrete-time algorithms such as Method of Multipliers (MM), ADMM, Bi- Alternating Direction Method of Multipliers (BiADMM), and Primal-Dual Fixed Point (PDFP) algorithms, all primal and dual variables in the proposed algorithm are updated independent of each other. The algorithm can be directly extended to any finite multi-block optimization without further assumptions while preserving its convergence. It is shown that the rate of convergence of the algorithm for Quadratic/Linear cost functions is exponential or linear under suitable assumptions. Finally, a numerical example is given to show that PADPD not only can compute more iterations (since it is fully parallel) for the same time-step but also can have faster convergence rate than that of ADMM.
|
|
15:40-15:55, Paper MoB07.3 | Add to My Program |
Homogeneous Formulation of Convex Quadratic Programs for Infeasibility Detection |
|
Raghunathan, Arvind | Mitsubishi Electric Research Labs |
Keywords: Optimization, Optimization algorithms, Predictive control for linear systems
Abstract: Convex Quadratic Programs (QPs) have come to play a central role in the computation of control action for constrained dynamical systems. In this paper, we present a novel Homogeneous QP (HQP) formulation which is obtained by embedding the original QP in a larger space. The key properties of the HQP are: (i) is always feasible, (ii) an optimal solution to QP can be readily obtained from a solution to HQP, and (iii) infeasibility of QP corresponds to a particular solution of HQP. An immediate consequence is that all the existing algorithms for QP are now also capable of robustly detecting infeasibility. In particular, we present an Infeasible Interior Point Method (IIPM) for the HQP and show polynomial iteration complexity when applied to HQP. A key distinction with prior IPM approaches is that we do not need to solve second-order cone programs. Numerical experiments on the formulation are provided using existing codes.
|
|
15:55-16:10, Paper MoB07.4 | Add to My Program |
On Robustness of the Normalized Random Block Coordinate Method for Non-Convex Optimization |
|
Turan, Berkay | University of California Santa Barbara |
Uribe, Cesar A. | Rice University |
Wai, Hoi-To | Chinese University of Hong Kong |
Alizadeh, Mahnoosh | University of California Santa Barbara |
Keywords: Optimization, Optimization algorithms
Abstract: Large-scale optimization problems are usually characterized not only by large amounts of data points but points living in a high-dimensional space. Block coordinate methods allow for efficient implementations where steps can be made (block) coordinate-wise. Many existing algorithms rely on trustworthy gradient information and may fail to converge when such information becomes corrupted by possibly adversarial agents. We study the setting where the partial gradient with respect to each coordinate block is arbitrarily corrupted with some probability. We analyze the robustness properties of the normalized random block coordinate method (NRBCM) for non-convex optimization problems. We prove that NRBCM finds an {cal O}(1/sqrt{T})-stationary point after T iterations if the corruption probabilities of partial gradients with respect to each block are below 1/2. With the additional assumption of gradient domination, faster rates are shown. Numerical evidence on a logistic classification problem supports our results.
|
|
16:10-16:25, Paper MoB07.5 | Add to My Program |
Inner Approximations of the Positive-Semidefinite Cone Via Grassmannian Packings |
|
Zheng, Tianqi | Johns Hopkins University |
Guthrie, James | Johns Hopkins University |
Mallada, Enrique | Johns Hopkins University |
Keywords: Optimization, Optimization algorithms
Abstract: We investigate the problem of finding inner ap-proximations of positive semidefinite (PSD) cones. We developa novel decomposition framework of the PSD cone by meansof conical combinations of smaller dimensional sub-cones. Weshow that many inner approximation techniques could besummarized within this framework, including the set of (scaled)diagonally dominant matrices, Factor-widthkmatrices, andChordal Sparse matrices. Furthermore, we provide a moreflexible family of inner approximations of the PSD cone, wherewe aim to arrange the sub-cones so that they are maximallyseparated from each other. In doing so, these approximationstend to occupy large fractions of the volume of the PSD cone.The proposed approach is connected to a classical packingproblem in Riemannian Geometry. Precisely, we show thatthe problem of finding maximally distant sub-cones in anambient PSD cone is equivalent to the problem of packingsub-spaces in a Grassmannian Manifold. We further leverageexisting computational method for constructing packings inGrassmannian manifolds to build tighter approximations ofthe PSD cone. Numerical experiments show how the proposedframework can balance between accuracy and computationalcomplexity, to efficiently solve positive-semidefinite programs.
|
|
16:25-16:40, Paper MoB07.6 | Add to My Program |
A Stable High-Order Tuner for General Convex Functions |
|
Moreu, Jose | Massachusetts Institute of Technology |
Annaswamy, Anuradha M. | Massachusetts Inst. of Tech |
Keywords: Optimization algorithms, Adaptive control, Lyapunov methods
Abstract: Iterative gradient-based algorithms have been increasingly applied for the training of a broad variety of machine learning models including large neural-nets. In particular, momentum-based methods, with accelerated learning guarantees, have received a lot of attention due to their provable guarantees of fast learning in certain classes of problems and multiple algorithms have been derived. However, properties for these methods hold only for constant regressors. When time-varying regressors occur, which is commonplace in dynamic systems, many of these momentum-based methods cannot guarantee stability. Recently, a new High-order Tuner (HT) was developed and shown to have 1) stability and asymptotic convergence for time-varying regressors and 2) non-asymptotic accelerated learning guarantees for constant regressors. These results were derived for a linear regression framework which leads to a quadratic loss function. In this paper, we extend and discuss the results of this same HT for general convex loss functions. Through the exploitation of convexity and smoothness definitions, we establish similar stability and asymptotic convergence guarantees. Finally, we provide numerical simulations supporting the satisfactory behavior of the HT algorithm as well as an accelerated learning property.
|
|
MoB08 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Predictive Control for Nonlinear Systems II |
|
|
Chair: Zhang, Fumin | Georgia Institute of Technology |
Co-Chair: Di Cairano, Stefano | Mitsubishi Electric Research Labs |
|
15:10-15:25, Paper MoB08.1 | Add to My Program |
Sequential Quadratic Programming Algorithm for Real-Time Mixed-Integer Nonlinear MPC |
|
Quirynen, Rien | Mitsubishi Electric Research Laboratories (MERL) |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Predictive control for nonlinear systems, Optimization algorithms, Hybrid systems
Abstract: Nonlinear model predictive control (NMPC) has grown mature and algorithmic techniques exist, e.g., based on sequential quadratic programming (SQP) methods, to handle relatively complex constrained control systems. In addition, model predictive control for hybrid dynamical systems, including both continuous and discrete decision variables, can be implemented efficiently based on state of the art mixed-integer quadratic programming (MIQP) algorithms. This paper proposes a novel mixed-integer SQP (MISQP) optimization algorithm as a heuristic search technique to find feasible, but possibly suboptimal, solutions for real-time implementations of mixed-integer NMPC (MINMPC). Two variants of the MISQP algorithm are described and motivated. Based on a preliminary software implementation, the real-time MISQP performance is illustrated for closed-loop MINMPC simulations on a nontrivial vehicle control case study, featuring worst-case computation times below 30 milliseconds.
|
|
15:25-15:40, Paper MoB08.2 | Add to My Program |
Iterative Model Predictive Control for Piecewise Systems |
|
Rosolia, Ugo | Caltech |
Ames, Aaron D. | California Institute of Technology |
Keywords: Predictive control for nonlinear systems, Optimization algorithms, Numerical algorithms
Abstract: In this paper, we present an iterative Model Predictive Control (MPC) design for piecewise nonlinear systems. We consider finite time control tasks where the goal of the controller is to steer the system from a starting configuration to a goal state while minimizing a cost function. First, we present an algorithm that leverages a feasible trajectory that completes the task to construct a control policy which guarantees that state and input constraints are recursively satisfied and that the closed-loop system reaches the goal state in finite time. Utilizing this construction, we present a policy iteration scheme that iteratively generates safe trajectories which have non-decreasing performance. Finally, we test the proposed strategy on a discretized Spring Loaded Inverted Pendulum (SLIP) model with massless legs. We show that our methodology is robust to changes in initial conditions and disturbances acting on the system. Furthermore, we demonstrate the effectiveness of our policy iteration algorithm in a minimum time control task.
|
|
15:40-15:55, Paper MoB08.3 | Add to My Program |
Bayesian Learning Model Predictive Control for Process-Aware Source Seeking |
|
Li, Yingke | Georgia Tech |
Liu, Tianyi | Georgia Institute of Technology |
Zhou, Enlu | Georgia Institute of Technology |
Zhang, Fumin | Georgia Institute of Technology |
Keywords: Predictive control for nonlinear systems, Optimization, Stochastic optimal control
Abstract: Classical source seeking algorithms aim to make the robot reach the source location eventually. This paper proposes a process-aware source seeking approach which finds an informative trajectory to reach the source location. A multi-objective optimization problem is formulated based on rewards for both the search process and the terminal condition. Due to the unknown source location, solutions are found through Bayesian learning model predictive control (BLMPC). The consistency of the Bayesian estimator, as well as the convergence of the proposed algorithm are proved. The performance of the algorithm is evaluated through simulation results. The process-aware source seeking algorithm demonstrates improvements over other classical source seeking algorithms.
|
|
15:55-16:10, Paper MoB08.4 | Add to My Program |
Gradient-Based Nonlinear Model Predictive Control for Systems with State-Dependent Mass Matrix |
|
Völz, Andreas | Friedrich-Alexander-University Erlangen-Nürnberg |
Graichen, Knut | University Erlangen-Nürnberg (FAU) |
Keywords: Predictive control for nonlinear systems, Robotics, Optimal control
Abstract: The dynamics of many systems are most naturally expressed in terms of a state-dependent mass matrix, the most prominent example being rigid body dynamics. However, model predictive control solvers often do not support this structure natively and instead require a reformulation of the system. This paper presents a concise and self-contained derivation of the optimality conditions for this system class and describes the implementation of an efficient solver based on a tailored gradient method. The approach is evaluated using the dynamical model of a robot arm with seven degrees of freedom, whereby it is shown that computation times below one millisecond can be achieved. Furthermore, by exploiting the structure of the optimality conditions with state-dependent mass matrix, the computation times can be reduced significantly.
|
|
16:10-16:25, Paper MoB08.5 | Add to My Program |
Model Predictive Interaction Control for Force Closure Grasping |
|
Gold, Tobias | University Erlangen-Nürnberg (FAU) |
Rohrmüller, Martin | University Erlangen-Nürnberg (FAU) |
Völz, Andreas | Friedrich-Alexander-University Erlangen-Nuremberg |
Graichen, Knut | University Erlangen-Nürnberg (FAU) |
Keywords: Robotics, Predictive control for nonlinear systems, Control applications
Abstract: The problem of grasping in robotics requires the realization of highly constrained motion and interaction. In this paper, a method is presented in which grasping-based constraints are explicitly taken into account on control level using model predictive control. The approach is an extension of the recently proposed model predictive interaction control (MPIC) for a single-arm robot. The basic idea is to consider the force behavior within the optimization problem, whereby a linear-elastic model is assumed in this paper. The method is evaluated in simulation as well as on an anthropomorphic hand.
|
|
16:25-16:40, Paper MoB08.6 | Add to My Program |
Force-And-Moment-Based Model Predictive Control for Achieving Highly Dynamic Locomotion on Bipedal Robots |
|
Li, Junheng | University of Southern California |
Nguyen, Quan | University of Southern California |
Keywords: Robotics, Predictive control for nonlinear systems, Optimal control
Abstract: In this paper, we propose a novel framework on force-and-moment-based Model Predictive Control (MPC) for dynamic legged robots. Specifically, we present a formulation of MPC designed for 10 degree-of-freedom (DoF) bipedal robots using simplified rigid body dynamics with input forces and moments. This MPC controller will calculate the optimal inputs applied to the robot, including 3-D forces and 2-D moments at each foot. These desired inputs will then be generated by mapping these forces and moments to motor torques of 5 actuators on each leg. We evaluate our proposed control design on physical simulation of a 10 degree-of-freedom (DoF) bipedal robot. The robot can achieve fast walking speed up to 1.6 m/s on rough terrain, with accurate velocity tracking. With the same control framework, our proposed approach can achieve a wide range of dynamic motions including walking, hopping, and running using the same set of control parameters.
|
|
MoB09 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Discrete Event Systems II |
|
|
Chair: Rudie, Karen | Queen's Univ |
Co-Chair: Pola, Giordano | University of L'Aquila |
|
15:10-15:25, Paper MoB09.1 | Add to My Program |
Event-Triggered Exponential Output Synchronization of Heterogeneous Multi-Agent Systems Over Directed Switching Networks |
|
Cheng, Yi | Nanjing University of Science and Technology |
Ugrinovskii, Valery | University of New South Wales |
Keywords: Discrete event systems, Distributed control, Cooperative control
Abstract: This paper considers event-triggered output synchronization of heterogeneous multi-agent systems over directed switching networks. We first propose an internal model for each follower using the event-triggered scheme to track the state of the leader using intermittent communications while avoiding Zeno behavior. The internal model is then embedded in the distributed controller to achieve output synchronization of the system of heterogeneous agents. We prove that the output synchronization errors converge exponentially to zero, and also give a bound on the convergence rate. A numerical example is presented to illustrate the theoretical results.
|
|
15:25-15:40, Paper MoB09.2 | Add to My Program |
Periodic Trajectories in P-Time Event Graphs and the Non-Positive Circuit Weight Problem |
|
Zorzenon, Davide | Technische Universität Berlin |
Komenda, Jan | Czech Academy of Sciences |
Raisch, Joerg | Technical University Berlin |
Keywords: Discrete event systems, Petri nets
Abstract: P-time event graphs (P-TEGs) are specific timed discrete-event systems, in which the timing of events is constrained by intervals. An important problem is to check, for all natural numbers d, the existence of consistent d-periodic trajectories for a given P-TEG. In graph theory, the Proportional-Inverse-Constant-Non-positive Circuit weight Problem (PIC-NCP) consists in finding all the values of a parameter such that a particular parametric weighted directed graph does not contain circuits with positive weight. In a related paper, we have proposed a strongly polynomial algorithm that solves the PIC-NCP in lower worst-case complexity compared to other algorithms reported in literature. In the present paper, we show that the first problem can be formulated as an instance of the second; consequently, we prove that the same algorithm can be used to find d-periodic trajectories in P-TEGs. Moreover, exploiting the connection between the PIC-NCP and max-plus algebra we prove that, given a P-TEG, the existence of a consistent 1-periodic trajectory of a certain period is a necessary and sufficient condition for the existence of a consistent d-periodic trajectory of the same period, for any value of d.
|
|
15:40-15:55, Paper MoB09.3 | Add to My Program |
Output Feedback Reachability of Controlled-Observable States for Nondeterministic Finite-State Systems |
|
Masciulli, Tommaso | University of L'Aquila |
Pola, Giordano | University of L'Aquila |
De Santis, Elena | University of L'Aquila |
Di Benedetto, Maria Domenica | University of L'Aquila |
Keywords: Discrete event systems
Abstract: In this letter control design of nondeterministic finite state systems with reachability specifications is addressed. The class of controllers we use is rather general and combines feedforward and output feedback schemes. The proposed controller allows not only the state of the system to reach the desired target set but also the identification of which state of the target set has been reached. Necessary and sufficient conditions are derived for the control problem to admit a solution and a controller is designed. The solution to the investigated problem has important implications in the context of controlled-observability, recovery control and symbolic control design of nonlinear and hybrid systems, as illustrated also through some examples.
|
|
15:55-16:10, Paper MoB09.4 | Add to My Program |
State Estimation of Discrete-Event Systems Subject to Intermittent and Permanent Loss of Observations (I) |
|
Tong, Yin | Southwest Jiaotong University |
Luo, Jiate | Southwest Jiaotong University |
Seatzu, Carla | Univ. of Cagliari |
Keywords: Discrete event systems, Automata, Estimation
Abstract: The state estimation problem in discrete-event systems (DESs) consists in determining the set of states consistent with the observations. Due to sensor malfunction, the occurrence of some events may become unobservable. Moreover, such loss of observations may be intermittent or permanent. In this paper, we study the state estimation problem of discrete-event systems subject to both intermittent and permanent loss of observations. We assume that both types of observation loss are not recoverable. For each event subject to a permanent loss of observation, we propose an automaton model to describe this kind of malfunction. We show that the automaton model describing the behavior of the system in the presence of observation loss can be obtained through the parallel composition of the above automata and an automaton that is obtained with appropriate changes to keep into account observation loss. Finally, the state estimation problem can be solved by constructing the observer of the new model.
|
|
16:10-16:25, Paper MoB09.5 | Add to My Program |
Supervisory Control for Stabilization under Multiple Local Average Payoff Constraints (I) |
|
Ji, Yiding | Boston University |
Yin, Xiang | Shanghai Jiao Tong University |
Xiao, Wei | Boston University |
Keywords: Discrete event systems, Automata, Supervisory control
Abstract: This work investigates stabilization of discrete event systems via supervisory control under a series of quantitative constraints. Every event in the system model is weighted by a vector which represents resource payoffs associated with the event. Multidimensional weight flows are generated when events occur successively. The supervisor aims to drive the generated strings to a reach set of target states to stabilize the system. Meanwhile, the supervisor is also responsible for regulating the weight flows so as to guarantee that at each dimension, the average weight every a certain number of events does not fall below a given threshold. Next, the formulated supervisory control problem is transformed to a two-player game between the supervisor and the environment on a specially defined game structure. Then we specify the objective of the game and synthesize game winning supervisors, which turn out to provably solve the proposed problem.
|
|
16:25-16:40, Paper MoB09.6 | Add to My Program |
A Visualization of Inference-Based Supervisory Control in Discrete-Event Systems |
|
Ean, Richard | Queen's University at Kingston |
Rudie, Karen | Queen's Univ |
Keywords: Discrete event systems, Supervisory control, Automata
Abstract: A visualization to aid in the construction of inference-/based decentralized supervisors is presented. In the inference-/based architecture, supervisors have different levels of ambiguity, which reflects to what degree a supervisor is confident in its control decision and to what degree a supervisor infers a control decision based on the supervisor’s knowledge of another supervisor’s control decision.
|
|
MoB10 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Adaptive Control II |
|
|
Chair: Zhang, Zhengqiang | Qufu Normal University |
Co-Chair: Miller, Daniel E. | University of Waterloo |
|
15:10-15:25, Paper MoB10.1 | Add to My Program |
Model Reference Adaptive Control with Linear-Like Closed-Loop Behavior |
|
Shahab, Mohamad T. | KAUST |
Miller, Daniel E. | University of Waterloo |
Keywords: Adaptive control, Robust adaptive control
Abstract: It is typically proven in adaptive control that asymptotic stabilization and tracking holds, and that at best a bounded-noise bounded-state property is proven. Recently, it has been shown in both the pole-placement control and the d-step ahead control settings that if, as part of the adaptive controller, a parameter estimator based on the original projection algorithm is used and the parameter estimates are restricted to a convex set, then the closed-loop system experiences linear-like behavior: exponential stability, a bounded gain on the noise in every p-norm, and a convolution bound on the exogenous inputs; this can be leveraged to provide tolerance to unmodelled dynamics and plant parameter time-variation. In this paper, we extend the approach to the more general Model Reference Adaptive Control (MRAC) problem and demonstrate that we achieve the same desirable linear-like closed-loop properties.
|
|
15:25-15:40, Paper MoB10.2 | Add to My Program |
Stochastic Deep Model Reference Adaptive Control |
|
Joshi, Girish | University of Illinois Urbana Champaign |
Chowdhary, Girish | University of Illinois at Urbana Champaign |
vanBloemenWaanders, Bart | Sandia National Laboratories |
Keywords: Adaptive control, Stochastic systems, Neural networks
Abstract: In this paper, we present a Stochastic Deep Neural Network-based Model Reference Adaptive Control (S-DMRAC). Building on our work ``Deep Model Reference Adaptive Control(DMRAC)", we extend the controller capability by using Bayesian deep neural networks (DNN) to representations for modeling non-linearities. A Bayesian approach to DNN learning helped avoid over-fitting the data and provide confidence intervals over the predictions. A Lyapunov-based method is used to adapt the output-layer weights of the DNN model in real-time, while a data-driven supervised learning algorithm is used to update the inner-layer weights, ensuring boundedness and guaranteed tracking performance with a learning-based real-time feedback controller. We also show that the controller's stochastic nature ensured "Induced Persistency of excitation", leading to convergence of the overall system signal.
|
|
15:40-15:55, Paper MoB10.3 | Add to My Program |
Identification-Based Adaptive Control for Systems with Time-Varying Parameters |
|
Chen, Kaiwen | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Adaptive control, Time-varying systems, Nonlinear systems
Abstract: This paper proposes an identification-based adaptive control scheme for nonlinear systems with time-varying parameters designed on the basis of the so-called congelation of variables method. First a scalar example to demonstrate the design methodology, which relies on re-arranging the identifier subsystems from a cascaded topology to a cyclic topology, is discussed. A small-gain-like control synthesis exploiting the cyclic topology is then presented to replace the classical control synthesis based on the swapping lemma, which exploits the cascaded topology. Then a state feedback design for a class of lower triangular nonlinear systems is presented: this combines the same design methodology with the backstepping techniques. Boundedness of all closed-loop signals and convergence of the system state are proved. Finally, simulation results showing that the proposed controller achieves superior performance than the classical design are presented.
|
|
15:55-16:10, Paper MoB10.4 | Add to My Program |
An Adaptive Disturbance Decoupling Perspective to Platooning |
|
Liu, Di | University of Groningen |
Besselink, Bart | University of Groningen |
Baldi, Simone | Southeast University |
Yu, Wenwu | Southeast University |
Trentelman, Harry L. | Univ. of Groningen |
Keywords: Adaptive control, Autonomous vehicles, Decentralized control
Abstract: Despite the progress in the field of longitudinal formations of automated vehicles, only recently an interpretation of longitudinal platooning has been given in the framework of disturbance decoupling, i.e. the problem of making a controlled output independent of a disturbance. The appealing feature of this interpretation is that the disturbance decoupling approach naturally yields a decentralized controller that guarantees stability and string stability. In this work, we further exploit the disturbance decoupling framework and we show that convergence to a stable, string stable and disturbance decoupled behavior can be achieved even in the presence of parametric uncertainty of the engine time constant. We refer to this framework as adaptive disturbance decoupling.
|
|
16:10-16:25, Paper MoB10.5 | Add to My Program |
Constructing Continuous Multi-Behavioral Planar Systems through Motivation Dynamics and Bifurcations |
|
Baxevani, Kleio | University of Delaware |
Tanner, Herbert G. | University of Delaware |
Keywords: Adaptive systems, Nonlinear systems, Autonomous systems
Abstract: This paper offers new analytical conditions on the system parameters of a particular class of planar dynamical systems which would allow them to undergo a Hopf bifurcation. These systems are constructed as a means of generating multiple behaviors from the same single continuous dynamical system model, without resorting to switching between distinct component continuous dynamics associated to each behavioral mode. This work builds on recent advances which introduced motivation dynamics as an efficient way to design multi-behavioral systems. The contribution of this paper is that it expands the scope of the motivational dynamics approach, and offers explicit analytic conditions on the system parameters to guarantee the existence of bifurcations, which can then be utilized to better engineer the structure and location of the resulting equilibria. Numerical simulations confirm the theoretical predictions for the onset of the Hopf bifurcations.
|
|
16:25-16:40, Paper MoB10.6 | Add to My Program |
Globally Stable Adaptive Neural Network Tracking Control for Uncertain Output-Feedback Systems with Prior Tracking Accuracy |
|
Zhang, Zhengqiang | Qufu Normal University |
Wang, Qiufeng | Qufu Normal University |
Keywords: Adaptive control, Neural networks, Nonlinear output feedback
Abstract: This paper is concerned with the problem of globally stable adaptive neural network tracking control for a class of output feedback systems with unknown functions. Unknown functions are approximated via online radial basis function neural network, continuously differentiable functions are introduced into Lyapunov functions to realize parameter estimation. Barbalat's lemma is used to prove that all closed-loop signals are globally uniformity ultimately bounded and tracking error can reach prior accuracy. A simulation example is given to verify the effectiveness of the control method.
|
|
MoB11 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Linear Systems II |
|
|
Chair: Jovanovic, Mihailo R. | University of Southern California |
Co-Chair: Leva, Alberto | Politecnico Di Milano |
|
15:10-15:25, Paper MoB11.1 | Add to My Program |
Measuring LTI System Resilience against Adversarial Disturbances Based on Efficient Generalized Eigenvalue Computations |
|
Börner, Johannes | Technical University Darmstadt |
Steinke, Florian | TU Darmstandt |
Keywords: Fault tolerant systems, Optimal control, Linear systems
Abstract: Resilient systems are able to recover quickly and easily from disturbed system states that might result from hazardous events or malicious attacks. In this paper a novel resilience metric for linear time invariant systems is proposed: the minimum control energy required to disturb the system is set into relation to the minimum control energy needed to recover. This definition extends known disturbance rejection metrics considering random effects to account for adversarial disturbances. The worst-case disturbance and the related resilience index can be computed efficiently via solving a generalized eigenvalue problem that depends on the controllability Gramians of the control and disturbance inputs. The novel metric allows improving system resilience by optimizing the restorative control structure or by hardening the system against specific attack options. The new approach is demonstrated for a coupled mechanical system.
|
|
15:25-15:40, Paper MoB11.2 | Add to My Program |
Automaton-Based Implicit Controlled Invariant Set Computation for Discrete-Time Linear Systems |
|
Liu, Zexiang | University of Michigan |
Anevlavis, Tzanis | University of California, Los Angeles |
Ozay, Necmiye | Univ. of Michigan |
Tabuada, Paulo | University of California at Los Angeles |
Keywords: Formal Verification/Synthesis, Linear systems
Abstract: In this paper, we derive closed-form expressions for implicit controlled invariant sets for discrete-time controllable linear systems with measurable disturbances. In particular, a disturbance-reactive (or disturbance feedback) controller in the form of a parametrized finite automaton is considered. We show that, for a class of automata, the robust positively invariant sets of the corresponding closed-loop systems can be expressed by a set of linear inequality constraints in the joint space of system states and controller parameters. This leads to an implicit representation of the invariant set in a lifted space. We further show how the same parameterization can be used to compute invariant sets when the disturbance is not available for measurement.
|
|
15:40-15:55, Paper MoB11.3 | Add to My Program |
On the Lack of Gradient Domination for Linear Quadratic Gaussian Problems with Incomplete State Information |
|
Mohammadi, Hesameddin | University of Southern California |
Soltanolkotabi, Mahdi | USC |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Learning, Linear systems, Optimal control
Abstract: Policy gradient algorithms in model-free reinforcement learning have been shown to achieve global exponential convergence for the Linear Quadratic Regulator problem despite the lack of convexity. However, extending such guarantees beyond the scope of standard LQR and full-state feedback has remained open. A key enabler for existing results on LQR is the so-called gradient dominance property of the underlying optimization problem that can be used as a surrogate for strong convexity. In this paper, we take a step further by studying the convergence of gradient descent for the Linear Quadratic Gaussian problem and demonstrate through examples that LQG does not satisfy the gradient dominance property. Our study shows the non-uniqueness of equilibrium points and thus disproves the global convergence of policy gradient methods for LQG.
|
|
15:55-16:10, Paper MoB11.4 | Add to My Program |
A Note on Realization Theory |
|
Keel, Lee | Tennessee State University |
Bhattacharyya, Shankar P. | Texas a & M Univ |
Keywords: Linear systems, Stability of linear systems, Modeling
Abstract: Realization theory as developed hitherto, deals with the following question: How can an arbitrary transfer function be constructed using standard building blocks. The answer is well known to control engineers, is useful, and consists of integrators, multipliers and summers, because the latter components are standard building blocks and can be mass produced, as integrated circuits.If the transfer function to be realized is improper, differentiators would be required as additional building blocks. Since differentiators amplify high frequency noise they are generally avoided and control theory avoids building improper transfer functions. In general integrators are also susceptible to low frequency noise and may be unsuitable in environments where low frequency noise is present. This suggests that it may be interesting to consider as a standard building block, a system which does not amplify low or high frequencies. It turns out that a first order filter does precisely that, namely the high and low frequency gains are constant, independent of frequency. In the following note, we ask the question: Can an arbitrary transfer function be constructed using first order filters, summers, and multipliers? We show that the answer is ``yes'' regardless of whether the transfer function is proper or improper, and show how such a realization may be constructed for arbitrary linear systems, proper or improper. Indeed we show that almost any first order filter can be used as a building block. We also show through simulations that the first order implementation of a differentiator has superior noise rejection compared to a pure differentiator.
|
|
16:10-16:25, Paper MoB11.5 | Add to My Program |
Existence Conditions for ODE Functional Observer Design of Descriptor Systems |
|
Jaiswal, Juhi | Indian Institute of Technology Patna |
Tomar, Nutan Kumar | Indian Institute of Technology Patna |
Keywords: Differential-algebraic systems, Observers for Linear systems
Abstract: This paper studies the design of functional observers for linear time-invariant descriptor systems which are not necessarily square and regular. A novel characterization for the existence of functional observers is presented. The observer is realized by a system of ordinary differential equations (ODEs). The existence of ODE observers is proved by means of a set of simple rank conditions on the system coefficient matrices. The proposed conditions are less restrictive than the existing ones. The derivation is purely algebraic, and the algorithm is illustrated by designing an observer-based controller.
|
|
16:25-16:40, Paper MoB11.6 | Add to My Program |
On the Criticality of the Model Parametrisation Method in Industrial Autotuning Controllers |
|
seva, Silvano | Politecnico Di Milano |
Cimino, Chiara | Politecnico Di Milano |
Leva, Alberto | Politecnico Di Milano |
Keywords: PID control, Linear systems, Adaptive control
Abstract: With reference to model-based (auto)tuning, we discuss the somehow overlooked role of the procedure used to parametrise the chosen process model structure. We evidence the detrimental effect of neglecting that procedure, particularly when evaluating/comparing tuning rules, thus when setting up a tuning procedure for a given application. We finally formulate a proposal for choosing the best model parametrisation procedure, in a given set, based on the information available in the tuning phase, and on the selected control quality indicator(s).
|
|
MoB12 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Estimation and Control of Infinite Dimensional Systems II |
|
|
Chair: Fridman, Emilia | Tel-Aviv Univ |
Co-Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Burns, John A | Virginia Tech |
|
15:10-15:25, Paper MoB12.1 | Add to My Program |
Observer Design in Infinite-Dimensional Setting Using Delayed Measurements (I) |
|
Orlov, Yury | CICESE |
Keywords: Distributed parameter systems, Delay systems, Observers for Linear systems
Abstract: An asymptotic observer for a linear system, evolving in a Hilbert space, is designed over linear state measurements with time-varying delays. The proposed predictor-based approach reduces the problem to the standard one with non-delayed information on the state, thereby being invariant to the dimensionality of the underlying system. Capabilities of the resulting observer design are illustrated for the linearized Kuramoto-Sivashinsky PDE with periodic boundary conditions and with delayed finite-dimensional measurements.
|
|
15:25-15:40, Paper MoB12.2 | Add to My Program |
Nonlinear Observer Design for a 1D Heat Conduction Process (I) |
|
Schaum, Alexander | Kiel University |
Koch, Stefan | Graz University of Technology |
Reichhartinger, Markus | Graz University of Technology |
Meurer, Thomas | Kiel University |
Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
Horn, Martin | Graz University of Technology |
Keywords: Distributed parameter systems, Observers for nonlinear systems, Lyapunov methods
Abstract: This paper deals with the observer design for a nonlinear 1D heat equation based on a single in-domain measurement motivated by rapid thermal silicon wafer pro- duction processes. A pointwise measurement injection observer is proposed that takes into account the basic nonlinear heat conduction and radiation mechanisms. Exponential convergence of the estimation error under mild and practically reasonable assumptions is formally proven using Lyapunov techniques. The proposed observer ensures a good trade-off between convergence speed and implementation effort. Its effectivenessis demonstrated using numerical simulations.
|
|
15:40-15:55, Paper MoB12.3 | Add to My Program |
Finite Dimensional Functional Observer Design for Parabolic Systems (I) |
|
Hu, Weiwei | University of Georgia |
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems, Observers for Linear systems
Abstract: This paper combines two control design aspects for a class of infinite dimensional systems, and each of the designs aims at significantly reducing the implementation complexity and computational load. A functional observer, and its extension of an unknown input functional observer, aims to reconstruct a functional of the infinite dimensional state. The resulting compensator only requires the solution to an operator Sylvester equation plus one differential equation for each dimension of the control signal, as opposed to an infinite dimensional filter evolution equation and an associated operator Riccati equation for the filter operator covariance. When the functional to be estimated coincides with the expression of a full state feedback control signal, then the functional observer becomes the minimum order compensator. When the parabolic system admits a decomposition whereby the system is decomposed into a lower finite dimensional subspace comprising the unstable eigenspectrum and an infinite stable subspace, then the functional observer-based compensator design becomes the minimum order compensator for the finite dimensional subsystem. This approach dramatically reduces the computation for solving the ARE needed for the full state controller and the associated Sylvester equation needed for the functional observer. Numerical results for a parabolic PDE in one and two spatial dimensions are included.
|
|
15:55-16:10, Paper MoB12.4 | Add to My Program |
Transient and Asymptotic Properties of Robust Adaptive Controllers in the Presence of Non-Coercive Lyapunov Functions (I) |
|
Paranjape, Aditya A | Imperial College London |
Natarajan, Vivek | Indian Institute of Technology Bombay |
Ghosh, Supratim | Tata Consultancy Services |
Keywords: Adaptive control, Distributed parameter systems, Lyapunov methods
Abstract: Adaptive control architectures often make use of Lyapunov functions to design adaptive laws. We are specifically interested in adaptive control methods, such as the well-known L1 adaptive architecture, which employ a parameter observer for this purpose. In such architectures, the observation error plays a critical role in determining analytical bounds on the tracking error as well as the system state. In this paper, we show how the non-existence of coercive Lyapunov operators can impact the analytical bounds, and with it the performance of such adaptive systems.
|
|
16:10-16:25, Paper MoB12.5 | Add to My Program |
Nash Equilibrium Seeking in Heterogeneous Noncooperative Games with Players Acting through Heat PDE Dynamics and Delays (I) |
|
Oliveira, Tiago Roux | State University of Rio De Janeiro |
Rodrigues, Victor Hugo Pereira | Federal University of Rio De Janeiro (UFRJ) |
Krstic, Miroslav | University of California, San Diego |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Adaptive control, Distributed parameter systems, Game theory
Abstract: We propose a non-model based strategy for locally stable convergence to Nash equilibria in a quadratic noncooperative (duopoly) game with player actions subject to heterogeneous PDE dynamics. In this duopoly scenario, where different players use different types of PDEs, one player compensates for a delay (transport PDE) and the other a heat (diffusion) PDE, each player having access only to his own payoff value. The proposed approach employs extremum seeking, with sinusoidal perturbation signals applied to estimate the Gradient (first derivative) and Hessian (second derivative) of unknown quadratic functions. In our previous works, we solved Nash equilibrium seeking problems with homogeneous games, where the PDE dynamics of distinct nature were not allowed. This is the first instance of noncooperative games being tackled in a model-free fashion in the presence of heat PDE dynamics AND delays. In order to compensate distinct PDE-modeled processes in the inputs of the two players, we employ boundary control with averaging-based estimates. We apply a small-gain analysis for the resulting Input-to-State Stable (ISS) coupled hyperbolic-parabolic PDE system as well as averaging theory in infinite dimensions, due to the infinite-dimensional state of the heat PDE and the delay, in order to obtain local convergence results to a small neighborhood of the Nash equilibrium. We quantify the size of these residual sets and illustrate the theoretical results numerically on an example combining hyperbolic and parabolic dynamics in a 2-player setting.
|
|
16:25-16:40, Paper MoB12.6 | Add to My Program |
Sub-Predictors and Classical Predictors for Finite-Dimensional Observer-Based Control of Parabolic PDEs |
|
Katz, Rami | Tel Aviv University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Distributed parameter systems, Delay systems, Lyapunov methods
Abstract: We study constant input delay compensation by using finite-dimensional observer-based controllers in the case of the 1D heat equation. We consider Neumann actuation with nonlocal measurement and employ modal decomposition with N+1 modes in the observer. We introduce a chain of M sub-predictors that leads to a closed-loop ODE system coupled with infinite-dimensional tail. Given an input delay r, we present LMI stability conditions for finding M and N and the resulting exponential decay rate and prove that the LMIs are always feasible for any r. We also consider a classical observer-based predictor and show that the corresponding LMI stability conditions are feasible for any r provided N is large enough. A numerical example demonstrates that the classical predictor leads to a lower-dimensional observer. However, it is hard for implementation due to the distributed input signal.
|
|
MoB13 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Distributed Control II |
|
|
Chair: Werner, Herbert | Hamburg University of Technology |
Co-Chair: Mårtensson, Jonas | KTH Royal Institute of Technology |
|
15:10-15:25, Paper MoB13.1 | Add to My Program |
Convergence Analysis of Nonconvex Distributed Stochastic Zeroth-Order Coordinate Method |
|
Zhang, Shengjun | University of North Texas |
Dong, Yunlong | Huazhong University of Science and Technology |
XIE, DONG | University of North Texas |
Yao, Lisha | University of North Texas |
Bailey, Colleen | University of North Texas |
Fu, Shengli | University of North Texas |
Keywords: Distributed control, Decentralized control, Optimization
Abstract: This paper investigates the stochastic distributed nonconvex optimization problem of minimizing a global cost function formed by the summation of n local cost functions. We solve such a problem by involving zeroth-order (ZO) information exchange. In this paper, we propose a ZO distributed primal-dual coordinate method (ZODIAC) to solve the stochastic optimization problem. Agents approximate their own local stochastic ZO oracle along with coordinates with an adaptive smoothing parameter. We show that the proposed algorithm achieves the convergence rate of mathcal{O}(sqrt{p}/sqrt{T}) for general nonconvex cost functions. We demonstrate the efficiency of proposed algorithms through a numerical example in comparison with the existing state-of-the-art centralized and distributed ZO algorithms.
|
|
15:25-15:40, Paper MoB13.2 | Add to My Program |
Exponentially Converging Distributed Gradient Descent with Intermittent Communication Via Hybrid Methods |
|
Hendrickson, Katherine | University of Florida |
Hustig-Schultz, Dawn | University of California, Santa Cruz |
Hale, Matthew | University of Florida |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Distributed control, Hybrid systems, Stability of hybrid systems
Abstract: We present a hybrid systems framework for multi-agent optimization in which agents execute computations in continuous time and communicate in discrete time. The optimization algorithm is a hybrid version of parallelized coordinate descent. Agents implement a sample-and-hold strategy in which gradients are computed at communication times and held constant during flows between communications. Completeness of maximal solutions under these hybrid dynamics is established. Under assumptions of smoothness and strong convexity, we show that this system exponentially converges to the minimizer of an objective function. Simulation results illustrate this convergence rate.
|
|
15:40-15:55, Paper MoB13.3 | Add to My Program |
Gradient Methods for Fixed Structure Controller Synthesis and System Identification of Decomposable Systems |
|
Heinke, Simon | Hamburg University of Technology |
Werner, Herbert | Hamburg University of Technology |
Keywords: Distributed control, Large-scale systems, Identification for control
Abstract: In this paper we are considering the problem of fixed structure controller synthesis and system identification for large scale systems. The systems considered are referred to as decomposable systems and can be seen as resulting from the interconnection of a large number of identical subsystems. Combining recent results from nonsmooth controller synthesis with the special structure of decomposable systems, we show that the computational complexity of the controller synthesis and system identification procedure scale linearly with the size of the system. Employing results from robust control, the computational complexity of the controller synthesis can be reduced further, however, at the cost of additional conservatism. Using numerical examples we demonstrate that the proposed method can reduce the conservatism compared to existing LMI methods for full order decomposable controllers.
|
|
15:55-16:10, Paper MoB13.4 | Add to My Program |
Event-Triggered Distributed Model Predictive Control for Platoon Coordination at Hubs in a Transport System |
|
Bai, Ting | KTH Royal Institute of Technology |
Johansson, Alexander | KTH |
Johansson, Karl H. | Royal Institute of Technology |
Mårtensson, Jonas | KTH Royal Institute of Technology |
Keywords: Distributed control, Large-scale systems, Transportation networks
Abstract: This paper considers the problem of hub-based platoon coordination for a large-scale transport system, where trucks have individual utility functions to optimize. An event-triggered distributed model predictive control method is proposed to solve the optimal scheduling of waiting times at hubs for individual trucks. In this distributed framework, trucks are allowed to decide their waiting times independently and only limited information is shared between trucks. Both the predicted reward gained from platooning and the predicted cost for waiting at hubs are included in each truck's utility function. The performance of the coordination method is demonstrated in a simulation with one hundred trucks over the Swedish road network.
|
|
16:10-16:25, Paper MoB13.5 | Add to My Program |
Time-Inverted Kuramoto Dynamics for κ-Clustered Circle Coverage |
|
Boldrer, Manuel | University of Trento |
Riz, Francesco | University of Trento |
Pasqualetti, Fabio | University of California, Riverside |
Palopoli, Luigi | University of Trento |
Fontanelli, Daniele | University of Trento |
Keywords: Distributed control, Networked control systems, Nonlinear systems
Abstract: In this paper we analyse the equilibrium configurations for the time-inverted Kuramoto Model with homogeneous agents and a fixed ring topology, where time-inverted means that the coupling between the different states is via a negative factor. This model exhibits a dual behaviour with respect to the classic Kuramoto Model with a positive coupling. In the paper, we show the existence of two possible stable equilibrium configurations: the splay state formation (1-clustered coverage) and the deployment in clusters (κ-clustered coverage). We provide sufficient conditions for the splay state formation and a stability analysis for the networked system. Moreover, we provide some initial results towards the controllability of the final equilibrium configurations. In particular, we lay the foundations to understand the conditions to switch between stable equilibria.
|
|
16:25-16:40, Paper MoB13.6 | Add to My Program |
A Hybrid Distributed Strategy for Robust Global Phase Synchronization of Second-Order Kuramoto Oscillators |
|
Bosso, Alessandro | University of Bologna |
Azzollini, Ilario Antonio | University of Bologna |
Baldi, Simone | Southeast University |
Zaccarian, Luca | LAAS-CNRS and University of Trento |
Keywords: Distributed control, Stability of hybrid systems
Abstract: This work proposes a distributed control strategy for the robust global leader-follower phase synchronization of Kuramoto oscillators with inertia. For a convenient design, the phase angles are represented as elements of the unit circle. In particular, we exploit a “half-angle” representation inspired by unit quaternions. The ensuing non-Euclidean state space poses some challenges for robust global stabilization, which can be conveniently overcome with dynamic hybrid feedback. For this reason, we propose a hybrid solution obtained by combining a distributed observer with local hysteresis-based tracking controllers. The overall closed-loop system is analyzed through reduction theorems and Lyapunov-based arguments.
|
|
MoB14 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Agents-Based Systems |
|
|
Chair: Piet-Lahanier, Helene | ONERA |
Co-Chair: Kaminer, Isaac | Naval Postgraduate School |
|
15:10-15:25, Paper MoB14.1 | Add to My Program |
Topology Inference for Networked Dynamical Systems: A Causality and Correlation Perspective |
|
Li, Yushan | Shanghai Jiao Tong University |
He, Jianping | Shanghai Jiao Tong University |
Keywords: Agents-based systems, Cooperative control, Networked control systems
Abstract: Networked dynamical systems (NDSs) have gained considerable attention in recent years, where the networked agents cooperate to accomplish the common task through the interaction topology. In this paper, we focus on the topology inference problem of NDSs. Different from traditional methods, we aim to infer the internal interaction topology from the perspective of node causality and correlation, covering both directed/undirected topology structures and the asymptotic/marginal stabilities of NDSs. Specifically, we propose a causality-based method that takes the noise characteristic into account and asymptotically approaches the real interaction topology structure, with only single round observations over the dynamical process. When the observation number is small, we further design a correlation-based modification to effectively alleviate the influence of noises. We demonstrate the close relation between the proposed method and the traditional Granger estimator and ordinary least square (OLS) estimator in terms of observation rounds and horizon.We further prove the equivalence conditions of the proposed method with the two estimators from the system stability and noise characteristic. The proposed causality-correlation combined method enjoys analogous asymptotic inference performance with Granger estimator of multiple observation rounds, and outperforms OLS estimator in single observation round, verified by extensive simulations.
|
|
15:25-15:40, Paper MoB14.2 | Add to My Program |
Localization of Partially Hidden Targets Using a Fleet of UAVs Via Robust Bounded-Error Estimation |
|
Ibenthal, Julius | ONERA |
Meyer, Luc | ONERA, Univ Paris Saclay |
Piet-Lahanier, Helene | ONERA |
Kieffer, Michel | CNRS - Univ Paris-Sud - CentraleSupelec |
Keywords: Agents-based systems, Information theory and control, Cooperative control
Abstract: This paper addresses the cooperative search of static ground targets by a group of Unmanned Aerial Vehicles (UAVs) over some region of interest. The search strategy dependents on the availability and accuracy of the information collected. When a target is detected, a probabilistic description of the measurement noise is usually considered, as well as probabilities of false alarm and non-detection, which may prove difficult to characterize a priori. An alternative modeling is introduced here. The ability to detect and identify a target depends deterministically on the point of view from which the target is observed. Introducing the notion of detectability sets for targets, we propose a robust distributed set-membership estimator to provide set estimates of target locations. The obtained set estimates are guaranteed to contain all target locations when the search is completed. The target search is formulated as a multi-agent cooperative control problem where the control inputs are obtained using a Model Predictive Control (MPC) approach minimizing a measure of the set estimates representing the detection performance. The proposed set estimator and cooperative control scheme are distributed, i.e., accounting only for information from neighbors within communication range. The effectiveness of the proposed algorithm is illustrated by simulation.
|
|
15:40-15:55, Paper MoB14.3 | Add to My Program |
An Extremum Seeking Algorithm for Monotone Nash Equilibrium Problems |
|
Suad, Krilašević | Delft University of Technology |
Grammatico, Sergio | Delft Univ. of Tech |
Keywords: Agents-based systems, Learning, Hybrid systems
Abstract: In this paper we consider the problem of finding a Nash equilibrium (NE) via zeroth-order feedback information in games with merely monotone pseudogradient mapping. Based on hybrid system theory, we propose a novel extremum seeking algorithm which converges to the set of Nash equilibria in a semi-global practical sense. Finally, we present two simulation examples. The first shows that the standard extremum seeking algorithm fails, while ours succeeds in reaching NE. In the second, we simulate an allocation problem with fixed demand.
|
|
15:55-16:10, Paper MoB14.4 | Add to My Program |
Multi-Agent Maximization of a Monotone Submodular Function Via Maximum Consensus |
|
Rezazadeh, Navid | University of California, Irvine |
Kia, Solmaz S. | University of California Irvine (UCI) |
Keywords: Agents-based systems, Optimization algorithms, Communication networks
Abstract: This paper studies distributed submodular optimization subject to partition matroid. We work in the value oracle model where the only access of the agents to the utility function is through a black box that returns the utility function value. The agents are communicating over a connected undirected graph and have access only to their own strategy set. As known in the literature, submodular maximization subject to matroid constraints is NP-hard. Hence, our objective is to propose a polynomial-time distributed algorithm to obtain a suboptimal solution with guarantees on the optimality bound. Our proposed algorithm is based on a distributed stochastic gradient ascent scheme built on the multilinear-extension of the submodular set function. We use a maximum consensus protocol to minimize the inconsistency of the shared information over the network caused by delay in the flow of information while solving for the fractional solution of the multilinear extension model. Furthermore, we propose a distributed framework of finding a set solution using the fractional solution. We show that our distributed algorithm results in a strategy set that when the team objective function is evaluated at worst case the objective function value is in 1-1/e-O(T) of the optimal solution in the value oracle model. An example demonstrates our results.
|
|
16:10-16:25, Paper MoB14.5 | Add to My Program |
Modeling and Control of Large-Scale Adversarial Swarm Engagements (I) |
|
Tsatsanifos, Theodoros | Naval Postgraduate School |
Clark, Abe | Naval Postgraduate School |
Walton, Claire | University of Texas at San Antonio |
Kaminer, Isaac | Naval Postgraduate School |
Gong, Qi | University of California, Santa Cruz |
Keywords: Agents-based systems, Optimal control, Simulation
Abstract: We theoretically and numerically study the problem of optimal control of large-scale autonomous systems under explicitly adversarial conditions, including probabilistic destruction of agents during the simulation. Large-scale autonomous systems often include an adversarial component, where different agents or groups of agents explicitly compete with one another. An important component of these systems that is not included in current theory or modeling frameworks is random destruction of agents in time. In this case, the modeling and optimal control framework should consider the attrition of agents as well as their position. We propose and test three numerical modeling schemes, where survival probabilities of all agents are smoothly and continuously decreased in time, based on the relative positions of all agents during the simulation. In particular, we apply these schemes to the case of agents defending a high-value unit from an attacking swarm. We show that these models can be successfully used to model this situation, provided that attrition and spatial dynamics are coupled. Our results have relevance to an entire class of adversarial autonomy situations, where the positions of agents and their survival probabilities are both important.
|
|
16:25-16:40, Paper MoB14.6 | Add to My Program |
Towards Resilience for Multi-Agent QD-Learning (I) |
|
Xie, Yijing | University of Texas at Arlington |
Mou, Shaoshuai | Purdue University |
Sundaram, Shreyas | Purdue University |
Keywords: Learning, Resilient Control Systems, Agents-based systems
Abstract: This paper considers the multi-agent reinforcement learning (MARL) problem for a networked (peer-to-peer) system in the presence of Byzantine agents. We build on an existing distributed Q-learning algorithm, and allow certain agents in the network to behave in an arbitrary and adversarial manner (as captured by the Byzantine attack model). Under the proposed algorithm, if the network topology is (2F+1)-robust and up to F Byzantine agents exist in the neighborhood of each regular agent, we establish the almost sure convergence of all regular agents’ value functions to the neighborhood of the optimal value function of all regular agents. For each state, if the optimal Q-values of all regular agents corresponding to different actions are sufficiently separated, our approach allows each regular agent to learn the optimal policy for all regular agents.
|
|
MoB15 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Event-Triggered and Self-Triggered Control II |
|
|
Chair: Hirche, Sandra | Technische Universität München |
Co-Chair: Zhao, Yun-Bo | University of Science and Technology of China |
Organizer: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Hirche, Sandra | Technische Universität München |
Organizer: Johansson, Karl H. | Royal Institute of Technology |
Organizer: Nowzari, Cameron | George Mason University |
|
15:10-15:25, Paper MoB15.1 | Add to My Program |
Event-Triggered ell_2-Optimal Formation Control for Agents Modeled As LPV Systems (I) |
|
saadabadi, Hamideh | TUHH |
Werner, Herbert | Hamburg University of Technology |
Keywords: Networked control systems, Distributed control, Nonholonomic systems
Abstract: This paper proposes a novel approach to event-triggered formation control for homogeneous, non-holonomic multi-agent systems with undirected interaction topology, where the non-holonomic vehicle dynamics are represented by polytopic linear-parameter-varying (LPV) models. The proposed event-triggered strategy is able to reduce the communication cost by transmitting information only when needed. To maintain a formation, each agent is equipped with an inner state-feedback loop that is time-triggered, while an outer position loop is closed by each agent individually through the communication network whenever a local trigger condition is satisfied. The control strategy can be implemented in a distributed manner; the trigger condition is based only on locally available information. The proposed method allows to simultaneously design a controller and a trigger level that guarantee stability and a bound on the overall ell_2 performance of the network. The synthesis problem is formulated as an LMI problem. Under the additional assumption that the agents are homogeneously scheduled, the synthesis problem can be decomposed to reduce its complexity to the size of a single agent, regardless of the number of agents, without degrading the performance. The effectiveness of the results is illustrated in a simulation scenario with non-holonomic agents modeled as dynamic unicycles.
|
|
15:25-15:40, Paper MoB15.2 | Add to My Program |
Temporal-Logic-Based Intermittent, Optimal, and Safe Continuous-Time Learning for Trajectory Tracking (I) |
|
Kanellopoulos, Aris | Georgia Institute of Technology |
Fotiadis, Filippos | Georgia Institute of Technology |
Sun, Chuangchuang | Massachusetts Institute of Technology |
Xu, Zhe | Arizona State University |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Topcu, Ufuk | The University of Texas at Austin |
Dixon, Warren E. | University of Florida |
Keywords: Machine learning, Autonomous systems, Sampled-data control
Abstract: In this paper, we develop safe reinforcement-learning-based controllers for systems tasked with accomplishing complex missions that can be expressed as linear temporal logic specifications, similar to those required by search-and-rescue missions. We decompose the original mission into a sequence of tracking sub-problems under safety constraints. We impose the safety conditions by utilizing barrier functions to map the constrained optimal tracking problem in the physical space to an unconstrained one in the transformed space. Furthermore, we develop policies that intermittently update the control signal to solve the tracking sub-problems with reduced burden in the communication and computation resources. Subsequently, an actor-critic algorithm is utilized to solve the underlying Hamilton-Jacobi-Bellman equations. Finally, we support our proposed framework with stability proofs and showcase its efficacy via simulation results.
|
|
15:40-15:55, Paper MoB15.3 | Add to My Program |
Resource-Aware Stochastic Self-Triggered Model Predictive Control |
|
Lian, Yingzhao | EPFL |
Jiang, Yuning | EPFL |
Stricker, Naomi | ETH Zurich |
Thiele, Lothar | ETH Zurich |
Jones, Colin N. | EPFL |
Keywords: Predictive control for linear systems, Stochastic optimal control
Abstract: This paper considers the control of uncertain systems operated under limited resource factors, such as battery life or hardware longevity. We consider here resource-aware self-triggered control techniques that schedule system operation non-uniformly in time in order to balance performance against resource consumption. When running in an uncertain environment, unknown disturbances may deteriorate system performance by acting adversarially against the planned event triggering schedule. In this work, we propose a resource-aware stochastic predictive control scheme to tackle this challenge, where a novel zero-order hold feedback control scheme is proposed to accommodate a time-inhomogeneous predictive control update.
|
|
15:55-16:10, Paper MoB15.4 | Add to My Program |
Value of Information in Networked Control Systems Subject to Delay (I) |
|
Wang, Siyi | Technical University of Munich |
Liu, Qingchen | Technical University of Munich |
Ugo Abara, Precious | Technical University of Munich |
Baras, John S. | University of Maryland |
Hirche, Sandra | Technische Universität München |
Keywords: Networked control systems, Communication networks, Information theory and control
Abstract: In this paper, we study the trade-off between the transmission cost and the control performance of a networked control system subject to network-induced delay. Within the linear–quadratic–Gaussian (LQG) framework, the joint design of control policy and networking strategy is decomposed into separate optimization problems. Based on the trade-off analysis, a delay-dependent Value-of-Information (VoI) metric which quantifies the value of transmitting a data packet is introduced. The VoI enables the decision-makers embedded in subsystems to design the triggering policy. The proposed scalable VoI inherits the task criticality of the existing VoI metric. Additionally, the sensitivity to the system parameters such as information freshness and network delays is directly derivable. The VoI-based scheduling policy is shown to outperform the periodical triggering policy and the Age-of-Information (AoI) based policy for network control systems under transmission delay. The effectiveness of the constructed VoI with arbitrary network delay is validated through numerical simulations.
|
|
16:10-16:25, Paper MoB15.5 | Add to My Program |
Self-Triggered Model Predictive Control for Perturbed Nonlinear Systems: An Iterative Implementation (I) |
|
Wang, Tao | University of Science and Technology of China |
Li, Pengfei | University of Science and Technology of China |
Kang, Yu | University of Science and Technology of China |
Zhao, Yun-Bo | University of Science and Technology of China |
Keywords: Networked control systems, Predictive control for nonlinear systems, Stability of nonlinear systems
Abstract: In this paper, a novel iterative self-triggered model predictive control strategy is proposed for continuous-time nonlinear systems with external disturbances. For this strategy, the triggering instants are determined by iteratively using the self-triggered mechanism. To be specific, the triggering mechanism, on the one hand, determines the next sampling instants of the sensor by a prespecified condition, and, on the other hand, decides whether or not to treat the current sampling instant as the triggering instant. Without continuous monitoring of the state, the sensing cost of the sensor can be alleviated. The utilization of the sampling states after the triggering instant leads to a larger triggering interval, and the computational load of the controller can thus be reduced. The effectiveness of the proposed strategy is validated by a numerical example.
|
|
16:25-16:40, Paper MoB15.6 | Add to My Program |
Abstracting the Sampling Behaviour of Stochastic Linear Periodic Event-Triggered Control Systems (I) |
|
Delimpaltadakis, Giannis | Delft University of Technology |
Laurenti, Luca | TU Delft |
Mazo Jr., Manuel | Delft University of Technology |
Keywords: Formal Verification/Synthesis, Networked control systems, Stochastic systems
Abstract: Recently, there have been efforts towards understanding the sampling behaviour of event-triggered control (ETC), for obtaining metrics on its sampling performance and predicting its sampling patterns. Finite-state abstractions, capturing the sampling behaviour of ETC systems, have proven promising in this respect. So far, such abstractions have been constructed for non-stochastic systems. Here, inspired by this framework, we abstract the sampling behaviour of stochastic narrow-sense linear periodic ETC (PETC) systems via Interval Markov Chains (IMCs). Particularly, we define functions over sequences of state-measurements and interevent times that can be expressed as discounted cumulative sums of rewards, and compute bounds on their expected values by constructing appropriate IMCs and equipping them with suitable rewards. Finally, we argue that our results are extendable to more general forms of functions, thus providing a generic framework to define and study various ETC sampling indicators.
|
|
MoB16 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Formal Verification and Synthesis II |
|
|
Chair: Zamani, Majid | University of Colorado Boulder |
Co-Chair: Mazo Jr., Manuel | Delft University of Technology |
|
15:10-15:25, Paper MoB16.1 | Add to My Program |
Synthesis of Interconnected Control Systems under Reachability Specifications |
|
Al Khatib, Mohammad | Technische Universität Chemnitz |
Streif, Stefan | Technische Universität Chemnitz |
Keywords: Formal Verification/Synthesis, Large-scale systems, Nonlinear systems
Abstract: In this work we synthesize constrained control laws in a compositional way to enforce a reachability specification for an interconnected control system. We first synthesize for each system a controller enforcing a local reachability specification using linear feasibility programs. Then within the framework of parametric assume-guarantee contracts, which encode the behavior of a control system in a parametric domain, we establish a small gain theorem guaranteeing the overall specification after interconnecting the systems together. The resulting controller could then be implemented in a decentralized, distributed, or mixed setup. We also show the effectiveness of our approach with a room temperature control example.
|
|
15:25-15:40, Paper MoB16.2 | Add to My Program |
Reach-Avoid Analysis for Delay Differential Equations |
|
Xue, Bai | Institute of Software, Chinese Academy of Sciences |
Bai, Yunjun | SKLCS, Institute of Software, Chinese Academy of Sciences, Univ |
Zhan, Naijun | Institute of Software, Chinese Academy of Sciences |
Liu, Wenyou | SKLCS, Institute of Software, Chinese Academy of Sciences, Unive |
Jiao, Li | State Key Laboratory of Computer Science, Institute of Software, |
Keywords: Formal Verification/Synthesis, Nonlinear systems, Computational methods
Abstract: Time-delay systems are ubiquitous in nature and occur in connection with various aspects of physical, chemical, biological and economic systems. In this paper we propose a semi-definite programming method to address reach-avoid problems for time-delay systems modeled by polynomial delay differential equations (DDEs). The reach-avoid problem of interest is to compute an inner-approximation (i.e., sub-set) of a reach-avoid set, which is the set of initial functions enabling the time-delay system to eventually enter a desirable target set while remaining inside a specified safe set till the target hit. In our approach we first derive an estimate of discrepancies between current states and delayed ones for the time-delay system, and then propose a semi-definite program for inner-approximating the reach-avoid set via incorporating the discrepancy estimate. The incorporation of discrepancy estimates facilitates the reduction of conservativeness in computing inner-approximations. Finally, three examples with comparisons are presented to demonstrate the performance of our approach.
|
|
15:40-15:55, Paper MoB16.3 | Add to My Program |
Self-Triggered Control for Near-Maximal Average Inter-Sample Time (I) |
|
de Albuquerque Gleizer, Gabriel | Delft University of Technology |
Madnani, Khushraj | Delft Univ |
Mazo Jr., Manuel | Delft University of Technology |
Keywords: Formal Verification/Synthesis, Sampled-data control, Networked control systems
Abstract: Self-triggered control (STC) is a sample-and-hold control method aimed at reducing communications in networked-control systems; however, existing STC mechanisms often maximize how late the next sample is, thus not optimizing sampling performance in the long-term. In this work, we devise a method to construct self-triggered policies that provide near-maximal average inter-sample time (AIST) while respecting given control performance constraints. To achieve this, we rely on finite-state abstractions of a reference event-triggered control, while also allowing earlier samples. These early triggers constitute controllable actions of the abstraction, for which an AIST-maximizing strategy can be obtained by solving a mean-payoff game. We provide optimality bounds, and how to further improve them through abstraction refinement techniques.
|
|
15:55-16:10, Paper MoB16.4 | Add to My Program |
Safety Verification of Dynamical Systems Via K-Inductive Barrier Certificates (I) |
|
Anand, Mahathi | Ludwig Maximilian University of Munich |
Murali, Vishnu | University of Colorado, Boulder |
Trivedi, Ashutosh | University of Colorado Boulder |
Zamani, Majid | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis
Abstract: Safety verification of dynamical systems via barrier certificates has recently gained considerable attention. A barrier certificate is typically a real-valued function over states of the system such that its value over the unsafe states is strictly greater than its value at the initial states. Moreover, the system dynamics must guarantee a decrease in the value of the barrier certificate in time with each transition. The existence of a barrier certificate thus ensures that the system trajectories never reach unsafe regions. Unfortunately, these conditions are often restrictive as they require barrier certificates to be non-increasing at every time step. Inspired by the success of k-induction in software verification, we propose two refinements of the notion of barrier certificates. In our first refinement of k-inductive barrier certificates, we relax the strict non-increment requirement to a net non-increment in k-steps with a potential(bounded) increment in each step. On the other hand, the second refinement of k-inductive barrier certificates relaxes the strict non-increment requirement at each step to a strict safety requirement under the assumption that the previous k-steps remained safe. We present two computational methods based on sum-of-squares (SOS) programming and SMT solvers to synthesize suitable k-inductive barrier certificates and demonstrate their effectiveness over a case study.
|
|
16:10-16:25, Paper MoB16.5 | Add to My Program |
Fast Synthesis for Symbolic Self-Triggered Control under Right-Recursive LTL Specifications (I) |
|
Pruekprasert, Sasinee | National Institute of Informatics, Tokyo |
Eberhart, Clovis | National Institute of Informatics |
Dubut, Jérémy | National Institute of Informatics |
Keywords: Formal Verification/Synthesis, Game theory, Nonlinear systems
Abstract: We extend previous work on symbolic self-triggered control for non-deterministic continuous-time nonlinear systems without stability assumptions to a larger class of specifications. Our goal is to synthesise a controller for two objectives: the first one is modelled as a right-recursive LTL formula, and the second one is to ensure that the average communication rate between the controller and the system stays below a given threshold. We translate the control problem to solving a mean-payoff parity game played on a discrete graph. Apart from extending the class of specifications, we propose a heuristic method to shorten the computation time. Finally, we illustrate our results on the example of a navigating nonholonomic robot with several specifications.
|
|
16:25-16:40, Paper MoB16.6 | Add to My Program |
STL Robustness Risk Over Discrete-Time Stochastic Processes (I) |
|
Lindemann, Lars | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Formal Verification/Synthesis, Autonomous systems, Stochastic systems
Abstract: We present a framework to interpret signal temporal logic (STL) formulas over discrete-time stochastic processes in terms of the induced risk. Each realization of a stochastic process either satisfies or violates an STL formula. In fact, we can assign a robustness value to each realization that indicates how robustly this realization satisfies an STL formula. We then define the risk of a stochastic process not satisfying an STL formula robustly, referred to as the STL robustness risk. In our definition, we permit general classes of risk measures such as, but not limited to, the conditional value-at-risk. While in general hard to compute, we propose an approximation of the STL robustness risk. This approximation has the desirable property of being an upper bound of the STL robustness risk when the chosen risk measure is monotone, a property satisfied by most risk measures. Motivated by the interest in data-driven approaches, we present a sampling-based method for estimating the approximate STL robustness risk from data for the value-at-risk. While we consider the value-at-risk, we highlight that such sampling-based methods are viable for other risk measures.
|
|
MoB17 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Control Applications II |
|
|
Chair: Karlsson, Niklas | Verizon Media |
Co-Chair: Munoz-Arias, Mauricio | University of Groningen |
|
15:10-15:25, Paper MoB17.1 | Add to My Program |
Voltage Regulation for a Self-Excited Induction Generator |
|
Esquivel-Sancho, Luis Miguel | Tecnológico De Costa Rica |
Pereira-Arroyo, Roberto | Costa Rica Institute of Technology |
Munoz-Arias, Mauricio | University of Groningen |
Keywords: Electrical machine control, Power generation, Nonlinear systems
Abstract: The self-excited induction generator presents important advantages for isolated generation systems ranging from low cost to simplicity of construction, operation, and maintenance. The complexity of the output voltage regulation problem for the generator is due to the nonlinear dynamics effects in presence of variable systems loads. The main contribution of the current work is first a generalization of the modeling approach to the self-exited squirrel-cage induction generator which is based on a stationary d-q frame of reference. The modeling approach is done via an energy-based strategy, more specifically, the port-Hamiltonian framework. Furthermore, we present here a novel control law based on a trajectory tracking strategy in order to attain asymptotic stability on a time-variant desired voltage at the output of the generator. The control action based on a so-called error system approach is focused on an exerted torque on the generator's shaft. The performance of our modeling and control approach is validated via numerical simulations.
|
|
15:25-15:40, Paper MoB17.2 | Add to My Program |
Charging Electric Vehicles with Valet: A Novel Business Model to Promote Transportation Electrification |
|
Lai, Zhijie | The Hong Kong University of Science and Technology |
Li, Sen | The Hong Kong University of Science and Technology |
Keywords: Emerging control applications, Intelligent systems, Smart cities/houses
Abstract: This paper considers on-demand valet charging for electric vehicles(EVs) to promote transportation electrification. We propose a novel business model where a platform recruits a fleet of couriers to provide valet charging services to EV owners at an affordable price. Couriers are dispatched to deliver the out-of-battery EVs to charging stations, plug in the cars, and drive them back to customers when the EVs are fully charged. To depict the proposed business model, a queuing network is formulated to capture the stochasticity of the matching dynamics, and an economic equilibrium model is proposed to characterize the interactions among couriers, EV owners, and the platform. Based on the model, we investigate the optimal pricing strategies of the platform and quantify the impacts of charging infrastructure planning on the market outcomes. We show that the interests of different stakeholders are not consistent: couriers enjoy a higher surplus under a lower charging station density, while the platform prefers a higher density as it leads to a higher markup. We identify the critical density of charging stations that achieves the highest EV penetration, and we show that it trades off the time traveling to and waiting at the charging stations. We also briefly analyze the regulatory challenges and suggest prospective directions for government interventions.
|
|
15:40-15:55, Paper MoB17.3 | Add to My Program |
Labor-Right Protecting Dispatch of Meal Delivery Platforms |
|
Weng, Wentao | Massachusetts Institute of Technology |
Yu, Yang | Tsinghua University |
Keywords: Queueing systems, Modeling, Control applications
Abstract: The boom in the meal delivery industry brings growing concern about the labor rights of riders. Current dispatch policies of meal-delivery platforms focus mainly on satisfying consumers or minimizing the number of riders for cost savings. There are few discussions on improving the working conditions of riders by algorithm design. The lack of concerns on labor rights in mechanism and dispatch design has resulted in a very large time waste for riders and their risky driving. In this research, we propose a queuing-model-based framework to discuss optimal dispatch policy with the goal of labor rights protection. We apply our framework to develop an algorithm minimizing the waiting time of food delivery riders with guaranteed user experience. Our framework also allows us to manifest the value of restaurants' data about their offline-order numbers on improving the benefits of riders.
|
|
15:55-16:10, Paper MoB17.4 | Add to My Program |
Data-Based Approach for Final Product Quality Inspection: Application to a Semiconductor Industry |
|
AL-KHARAZ, Mohammed | Laboratoire d'Informatique Et Systèmes (LIS) - Aix Marseille Uni |
Ananou, Bouchra | LSIS |
ouladsine, mustapha | Université D'aix Marseille III |
COMBAL, MICHEL | ST MICROELECTRONICS |
PINATON, Jacques | STMicroelectronics |
Keywords: Process Control, Manufacturing systems and automation
Abstract: The early information about the health state of the final product quality plays a vital role in the intact management of production. In semiconductor manufacturing, quality control of a too-small number of wafers is routinely carried on specific metrology stations, and the obtained quality measurements are generalized over the entire lot. The unavailability of sufficient product quality information results in a lack of that for a high proportion of products. The latter leads to some overlooked quality problems that might cause a malfunction in the final product. This malfunction is usually conducive to yield loss, resource consumption through its remaining production line steps and also needs a considerable amount of time to be source-identified. This paper proposes a final quality classification data-driven approach using machine learning techniques and alarm events data collected during the production operations. We use the k-mean clustering algorithm to group production lots into clusters based on their passages over equipment. Each cluster has its decision tree classification model elaborated after various information extraction techniques and manipulation applied to alarm event texts. The obtained results show a satisfactory performance demonstrated on a real-world dataset collected over the whole semiconductor fabrication facility.
|
|
16:10-16:25, Paper MoB17.5 | Add to My Program |
Scalable Multi-Objective Optimization in Programmatic Advertising Via Feedback Control |
|
Karlsson, Niklas | Yahoo |
Keywords: Control applications, Large-scale systems, Optimization algorithms
Abstract: The majority of online advertising is served through real-time bidding, and advertising campaigns are often defined as optimization problems. This paper deals with advertiser profit maximization subject to multiple advertiser performance constraints. We derive the optimal bidding mechanism for a large family of multi-constrained advertising problems and demonstrate how the solution can be implemented as three separate subsystems dealing with impression valuation, campaign control, and bid shading optimization. Feedback control plays a critical role to make this optimization scalable and adaptive. A proof of concept campaign control system is proposed and evaluated in simulations.
|
|
16:25-16:40, Paper MoB17.6 | Add to My Program |
State Estimation for Spark-Ignition Engines Using New Noise Adaptive Laws in Unscented Kalman Filter |
|
Singh, Vyoma | IIT Mandi |
Jain, Tushar | Indian Institute of Technology Mandi |
Keywords: Estimation, Kalman filtering, Nonlinear systems identification
Abstract: To ensure maximum efficiency, low emissions, and lower fuel consumption in the vehicles, advanced control schemes are required. Due to the engine operation, the sensors cannot be installed to measure all the variables that are needed for an effective control. While addressing this issue, a new adaptive Unscented Kalman filter (UKF) algorithm is proposed in this paper to estimate the intake manifold pressure, engine speed, and fuel flow rate. New adaptive laws are designed to update the process noise and measurement noise covariance matrices within the constrained augmented state-based UKF (CASUKF). Another contribution lies in the new combination of the novel adaptive laws, and CASUKF, unlike other variants of the UKF that either adapt the process noise and measurement noise covariance matrices on the standard UKF or implement CASUKF with constant values of the process noise and measurement noise matrices. Simulation results are provided for the nonlinear mean value spark-ignition engine model, and the effectiveness of the algorithm is also compared with other variants of the UKF.
|
|
MoB18 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Biological Systems and Biologically-Inspired Methods II |
|
|
Chair: Cao, Ming | University of Groningen |
Co-Chair: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
|
15:10-15:25, Paper MoB18.1 | Add to My Program |
Different Environment Feedback in Fast-Slow Eco-Evolutionary Dynamics and Resulting Limit Cycles |
|
Gong, Lulu | University of Groningen |
Cao, Ming | University of Groningen |
Keywords: Biological systems, Game theory, Stability of nonlinear systems
Abstract: The fast-slow dynamics of an eco-evolutionary system are studied, where we consider the feedback actions of environmental resources that are classified into those that are self-renewing and those externally supplied. We show although these two types of resources are drastically different, the resulting closed-loop systems bear close resemblances, which include the same equilibria and their stability conditions on the boundary of the phase space, and the similar appearances of equilibria in the interior. After closer examination of specific choices of parameter values, we disclose that the global dynamical behaviors of the two types of closed-loop systems can be fundamentally different in terms of limit cycles: the system with self-renewing resources undergoes a generalized Hopf bifurcation such that one stable limit cycle and one unstable limit cycle can coexist; the system with externally supplied resources can only have the stable limit cycle induced by a supercritical Hopf bifurcation.
|
|
15:25-15:40, Paper MoB18.2 | Add to My Program |
Controlling a CyberOctopus Soft Arm with Muscle-Like Actuation (I) |
|
Chang, Heng-Sheng | University of Illinois, Urbana-Champaign |
Halder, Udit | University of Illinois at Urbana Champaign |
Gribkova, Ekaterina | University of Illinois, Urbana-Champaign |
Tekinalp, Arman | University of Illinois at Urbana-Champaign |
Naughton, Noel | University of Illinois at Urbana Champaign |
Gazzola, Mattia | University of Illinois at Urbana-Champaign |
Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Keywords: Biological systems, Robotics, Nonlinear systems
Abstract: This paper presents an application of the energy shaping methodology to control a flexible, elastic Cosserat rod model of a single octopus arm. The novel contributions of this work are two-fold: (i) a control-oriented modeling of the anatomically realistic internal muscular architecture of an octopus arm; and (ii) the integration of these muscle models into the energy shaping control methodology. The control-oriented modeling takes inspiration in equal parts from theories of nonlinear elasticity and energy shaping control. By introducing a stored energy function for muscles, the difficulties associated with explicitly solving the matching conditions of the energy shaping methodology are avoided. The overall control design problem is posed as a bilevel optimization problem. Its solution is obtained through iterative algorithms. The methodology is numerically implemented and demonstrated in a full-scale dynamic simulation environment Elastica. Two bio-inspired numerical experiments involving the control of octopus arms are reported.
|
|
15:40-15:55, Paper MoB18.3 | Add to My Program |
Discrete-Time Output Regulation and Visuomotor Adaptation |
|
Hafez, Mohamed Ashraf | University of Toronto |
Mejia Uzeda, Erick | University of Toronto |
Broucke, Mireille E. | Univ. of Toronto |
Keywords: Output regulation, Biological systems, Adaptive control
Abstract: We consider a disturbance rejection problem for discrete-time LTI systems with a known plant and unknown exosystem, and we utilize adaptive internal models to solve this problem. The main application is short-term visuomotor adaptation, a subconscious brain process taking place over repetitive trials and elicited by a visual error closely following the execution of a movement. Our model is vetted by recovering results from visuomotor experiments involving removal or loss of measurements during adaptation.
|
|
15:55-16:10, Paper MoB18.4 | Add to My Program |
A Nonlinear Observability Analysis of Ambient Wind Estimation with Uncalibrated Sensors, Inspired by Insect Neural Encoding |
|
van Breugel, Floris | University of Nevada, Reno |
Keywords: Biological systems, Observers for nonlinear systems, Sensor fusion
Abstract: Estimating the direction of ambient fluid flow is key for many flying or swimming animals and robots, but can only be accomplished through indirect measurements and active control. Recent work with tethered flying insects indicates that their sensory representation of orientation, apparent wind, direction of movement, and control is represented by a 2-dimensional angular encoding in the central brain. This representation simplifies sensory integration by projecting the direction (but not scale) of measurements with different units onto a universal polar coordinate frame. To align these angular measurements with one another and the motor system does, however, require a calibration of angular gain and offset for each sensor. This calibration could change with time due to changes in the environment or physical structure. The circumstances under which small robots and animals with angular sensors and changing calibrations could self-calibrate and estimate the direction of ambient fluid flow while moving remains an open question. Here, a methodical nonlinear observability analysis is presented to address this. The analysis shows that it is mathematically feasible to continuously estimate flow direction and perform self-calibrations by adopting frequent changes in course (or active prevention thereof) and orientation, and requires fusion and temporal differentiation of three sensory measurements: apparent flow, orientation (or its derivative), and direction of motion (or its derivative). These conclusions are consistent with the zigzagging trajectories exhibited by many plume tracking organisms, suggesting that perhaps flow estimation is a secondary driver of their trajectory structure.
|
|
16:10-16:25, Paper MoB18.5 | Add to My Program |
In-Phase Oscillations from the Cooperation of Cellular and Network Positive Feedback in Synaptically-Coupled Oscillators |
|
Juarez-Alvarez, Omar | Universidad Nacional Autónoma De Mexico |
Franci, Alessio | Universidad Nacional Autónoma De Mexico (UNAM) |
Keywords: Stability of nonlinear systems, Reduced order modeling, Biologically-inspired methods
Abstract: We study the emergent dynamics of a network of synaptically coupled slow-fast oscillators. Synaptic coupling provides a network-level positive feedback mechanism that cooperates with cellular-level positive feedback to ignite in-phase network oscillations. Using analytical bifurcation analysis, we prove that the Perron eigenvector of the network adjacency matrix fully determines the oscillation pattern. Besides shifting the focus from the spectral properties of the network Laplacian matrix to the network adjacency matrix, we discuss other key differences between synaptic and diffusive coupling.
|
|
16:25-16:40, Paper MoB18.6 | Add to My Program |
A Neuromorphic Control Architecture Inspired by the Limbic System |
|
Rubio Scola, Ignacio | Conicet - National University of Rosario |
Garcia Carrillo, Luis Rodolfo | New Mexico State University |
Stewart, Terrence C. | National Research Council of Canada |
Sornborger, Andrew T. | Los Alamos National Laboratory |
Keywords: Control applications, Biologically-inspired methods, Control software
Abstract: We introduce a performance-guaranteed Limbic System-Inspired Control (LISIC) which is appropriate for implementation in neuromorphic hardware. The control strategy aims to stabilize the tracking error of a class of nonlinear systems with uncertain dynamics and external perturbations. The objective of the LISIC structure is to identify and compensate model differences between the theoretical assumptions considered and the actual conditions encountered in the real-time system to be controlled, using a minimum of energy in the computation. To validate our approach, we make use of a neuromorphic architecture composed by spiking neuronal networks, and using Nengo Brain Maker software we emulate a hardware implementation of our controller in Intel’s neuromorphic research processor codenamed Loihi. Numerical results are provided to demonstrate the tracking of the perturbed inverted pendulum with the neuromorphic control system.
|
|
MoB19 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Robotics II |
|
|
Chair: Thomas, Gray | University of Michigan |
Co-Chair: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
|
15:10-15:25, Paper MoB19.1 | Add to My Program |
An Energy Shaping Exoskeleton Controller for Human Strength Amplification (I) |
|
Thomas, Gray | University of Michigan |
Gregg, Robert D. | University of Michigan |
Keywords: Robotics, Nonlinear systems, Healthcare and medical systems
Abstract: In this work, we introduce a novel approach to assistive exoskeleton (or powered orthosis) control which avoids needing task and ground contact information. Our approach is based on directly designing the Hamiltonian dynamics of the target closed-loop behavior, shaping the energy of the human and the robot. Relative to previous energy shaping controllers for assistive exoskeletons, we introduce ground reaction force and torque information into the target behavior definition, reformulate the kinematics so as to avoid explicit matching conditions due to under-actuation, and avoid the need to switch between swing and stance energy shapes. Our controller introduces new states into the target Hamiltonian energy that represent a virtual second leg that is connected to the physical leg using virtual springs. The impulse the human imparts to the physical leg is amplified and applied to the virtual leg, but the ground reaction force acts only on the physical leg. A state transformation allows the proposed control to be available using only encoders, an IMU, and ground reaction force sensors. We prove that this controller is stable and passive when acted on by the ground reaction force and demonstrate the controller's strength amplifying behavior in a simulation. A linear analysis based on small signal assumptions allows us to explain the relationship between our tuning parameters and the frequency domain amplification bandwidth.
|
|
15:25-15:40, Paper MoB19.2 | Add to My Program |
An Input-Output Feedback Linearization Approach to the Motion Control of Flexible Joint Manipulators |
|
Montoya-Chairez, Jorge | Instituto Politécnico Nacional-CITEDI |
Moreno Valenzuela, Javier | Instituto Politécnico Nacional-CITEDI |
Keywords: Feedback linearization, Robotics, Stability of nonlinear systems
Abstract: In robot manipulators, the joint flexibility may be or not introduced intentionally. Thus, flexible joint robots (FJRs) are useful in aerospace applications and human rehabilitation, for example. Besides, FJRs appear in industrial manipulators. The feedback linearization control has been applied to many mechatronics systems, including FJRs. However, if spring damping between the links and rotors is present, the state feedback linearization design is no longer feasible. In order to overcome this situation, in this paper, an input-output feedback linearization approach is developed to achieve trajectory tracking control of FJRs. The study is complemented with simulation results, which validates the proposed theory. By assuming the presence of spring damping, a comparison between the known state feedback linearization technique and the proposed input-output feedback linearization is given, showing better results for the introduced approach.
|
|
15:40-15:55, Paper MoB19.3 | Add to My Program |
Semiglobal Asymptotic Stability of Nonlinear PD-Type Plus Gravity Compensation Controllers for Input-Saturated Robot Manipulators |
|
Jimenez-Quiroz, Marco | Instituto Politécnico Nacional - CITEDI |
Moyrón Durán, Jerónimo | Instituto Politécnico Nacional - CITEDI |
Moreno Valenzuela, Javier | Instituto Politécnico Nacional-CITEDI |
Keywords: Robotics, Autonomous systems, Stability of nonlinear systems
Abstract: In this paper, the stability of PD-type controllers plus gravity compensation for position regulation of input-saturated robot manipulators is discussed, where symmetrical hard saturation functions are employed to model the input constraints. Based on Lyapunov's theory, a change of variable and a proper representation of the control input, it is shown that the closed--loop equilibrium point for the studied class of PD-type controllers plus gravity compensation is asymptotically stable. Furthermore, an estimate of the region of attraction by means of a level set of the Lyapunov function is given, showing that this region can arbitrarily be enlarged by the controller gains, even in the presence of input saturation. Thus, semiglobal asymptotic stability is achieved. A case study using a crank-slider mechanism is included where simulation results illustrate the concepts developed.
|
|
15:55-16:10, Paper MoB19.4 | Add to My Program |
Regulation Control of a Suspended Cable-Driven Robot Via Energy Shaping |
|
Jafari Harandi, Mohmmad Reza | K.N Toosi University of Technology |
Molaei, Amir | Concordia University |
Taghirad, Hamid D. | K.N. Toosi U. of Tech |
Keywords: Robotics, Constrained control, Nonlinear systems
Abstract: In underactuated robots (URs), the motion of the unactuated configuration variable is coupled to that of the others by complex dynamics, which makes their control complicated. Additionally, if the UR is a cable-driven robot (CDR), as cables merely support tensile force, the positiveness of the cable’s tension should also be taken into account in controller design. In this paper, we investigate the regulation control of a suspended three degrees of freedom (DOF) CDR using interconnection and damping assignment passivity-based control (IDA-PBC) via potential energy shaping. This method requires analytical solutions to a set of partial differential equations (PDEs). The IDA-PBC approach for the control of the three-DOF CDR results in a complex PDE that cannot be analytically solved using the proposed methods in the literature. To this aim, we transform the governing PDE into a number of Pfaffian differential equations. Then, the resultant Pfaffian equations are solved analytically to obtain the homogeneous solutions of the PDE. Additionally, we have also considered the positiveness of the cables’ tension by suitably defining the parameters of the controller. The efficacy of the proposed controller is investigated through simulation, and its performance is compared with a sliding mode control approach.
|
|
16:10-16:25, Paper MoB19.5 | Add to My Program |
Suboptimal Control Design for Differential Wheeled Mobile Robots with Theta-D Technique |
|
Yao, Jie | Missouri University of Science and Technology |
Xin, Ming | University of Missouri |
Keywords: Robotics, Nonlinear systems, Optimal control
Abstract: In this paper, a suboptimal controller is designed by the Theta-D technique for a differential wheeled mobile robot with holonomic and nonholonomic constraints. The challenge of wheel actuators not being exerted into the coordinate of the robot base is addressed by a proper transformation, leading to controllability of the mobile robot system. This transformation lowers the complexity of control design since it yields reduced order state-space equations. The Theta-D algorithm provides an approximate closed-form suboptimal controller that is easy to implement onboard. It is compared favorably with the similar state-dependent Riccati equation technique in terms of computation efficiency and control effort. The simulation experiments verify that the proposed technique is an effective and efficient tool for designing the controller of the differential wheeled mobile robots.
|
|
16:25-16:40, Paper MoB19.6 | Add to My Program |
Hierarchical Control for Uncertain Discrete-Time Nonlinear Systems under Signal Temporal Logic Specifications |
|
Yu, Pian | School of Electrical Engineering and Computer Science, KTH Royal |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Robotics, Uncertain systems, Formal Verification/Synthesis
Abstract: This paper studies the hierarchical control of uncertain discrete-time nonlinear systems under input constraints. Firstly, the notion of robust approximate simulation relation is defined. We show that by properly designing a control interface, the robust approximate simulation relation can be constructed from a low-complexity, deterministic (abstract) system to the original system. Then, we apply the hierarchical control approach to the robust control synthesis under signal temporal logic specifications. The results show that this approach reduces the computational complexity of the control synthesis, and is in some cases applicable to a larger set of initial states. The effectiveness of the proposed approach is verified by a simulation example.
|
| |