| |
Last updated on November 17, 2022. This conference program is tentative and subject to change
Technical Program for Tuesday December 6, 2022
|
TuPL Plenary Session, Tulum Ballroom A-H |
Add to My Program |
Contraction Theory in Systems and Control |
|
|
Chair: Valcher, Maria Elena | Universita' Di Padova |
Co-Chair: Prieur, Christophe | CNRS |
|
08:30-09:30, Paper TuPL.1 | Add to My Program |
Contraction Theory in Systems and Control |
|
Bullo, Francesco | Univ of California at Santa Barbara |
Keywords: Machine learning, Cooperative control, Optimal control
Abstract: We survey a comprehensive theory on the application of the Banach contraction principle to control and dynamical systems. We start with some historical highlights. Next, we generalize the basic contraction property from discrete to continuous time, from Euclidean to non-Euclidean norms, from closed to open systems, and, finally, from single to interconnected systems. We then apply these theoretical tools to modern problems involving machine learning, multi-agent coordination, and optimal control. Finally, we conclude with strengths and weaknesses of the theory as well as open directions of research.
|
|
TuAT01 Regular Session, Tulum Ballroom A |
Add to My Program |
Switched Systems I |
|
|
Chair: Berger, Guillaume O. | CU Boulder |
Co-Chair: Jungers, Raphaël M. | University of Louvain |
|
10:00-10:20, Paper TuAT01.1 | Add to My Program |
State Estimation for Asynchronously Switched Sampled-Data Systems |
|
Shankar, Sharad | University of California, Santa Barbara |
Yang, Guosong | University of California, Santa Barbara |
Hespanha, Joao P. | Univ. of California, Santa Barbara |
Keywords: Switched systems, Hybrid systems, Kalman filtering
Abstract: Asynchronously switched sampled-data systems can help model power systems and vehicles that evolve in continuous-time with switching behavior and discrete time measurements. We address the problem of jointly estimating a switching signal, with uncertainty in the exact switching times, as well as the continuous states of the system. We prove stability of the standard Kalman filter under uncertainty in the switching time, with statistical bounds relating to the sampling period. We then propose a method for estimation of switching times as well as a method for efficient joint estimation of the state and switching signal inspired by the interacting multiple-model extended-Viterbi algorithm. We validate our algorithms in simulation for a power converter and maneuvering vehicle.
|
|
10:20-10:40, Paper TuAT01.2 | Add to My Program |
Using Data Informativity for Online Stabilization of Unknown Switched Linear Systems |
|
Eising, Jaap | University of California, San Diego |
Liu, Shenyu | Beijing Institute of Technology |
Martinez, Sonia | University of California at San Diego |
Cortes, Jorge | University of California, San Diego |
Keywords: Switched systems, Sampled-data control, Adaptive control
Abstract: This work studies data-driven switched controller design for discrete-time switched linear systems. Instead of having access to the full system dynamics, an initialization phase is performed, during which noiseless measurements of the state and the input are collected for each mode. Under certain conditions on these measurements, we develop a stabilizing switched controller for the switched system. To be precise, the controller switches between identifying the active mode of the system and applying a predetermined stabilizing feedback. We prove that if the system switches according to certain specifications, this controller stabilizes the closed-loop system. Simulations on a network example illustrate our approach.
|
|
10:40-11:00, Paper TuAT01.3 | Add to My Program |
Ensuring the Safety of Uncertified Linear State-Feedback Controllers Via Switching |
|
Lu, Yiwen | Tsinghua University |
Mo, Yilin | Tsinghua University |
Keywords: Switched systems, Linear systems, Stochastic optimal control
Abstract: Sustained research efforts have been devoted to learning optimal controllers for linear stochastic dynamical systems with unknown parameters, but due to the corruption of noise, learned controllers are usually uncertified in the sense that they may destabilize the system. To address this potential instability, we propose a "plug-and-play" modification to the uncertified controller which falls back to a known stabilizing controller when the norm of the state exceeds a certain threshold. We show that the switching strategy enhances the safety of the uncertified controller by making the linear-quadratic cost bounded even if the underlying linear feedback gain is destabilizing. We also prove the near-optimality of the proposed strategy by quantifying the maximum performance loss caused by switching as asymptotically negligible. Finally, we demonstrate the effectiveness of the proposed switching strategy using simulation on an industrial process example.
|
|
11:00-11:20, Paper TuAT01.4 | Add to My Program |
Online Adaptive Identification of Switched Affine Systems Using a Two-Tier Filter Architecture with Memory |
|
Patel, Pritesh | IIT Delhi |
Basu Roy, Sayan | Indraprastha Institute of Information Technology Delhi |
Bhasin, Shubhendu | Indian Institute of Technology Delhi |
Keywords: Switched systems, Identification, Adaptive control
Abstract: This work proposes an online adaptive identification method for multi-input multi-output (MIMO) switched affine systems with guaranteed parameter convergence. A family of online parameter estimators is used that is equipped with a dual-layer low pass filter architecture to facilitate parameter learning and identification of each subsystem. The filters capture information about the unknown parameters in the form of a prediction error which is used in the parameter estimation algorithm. A salient feature of the proposed method that distinguishes it from most previous results is the use of a memory bank that stores filter values and promotes parameter learning during both active and inactive phases of a subsystem. Specifically, the learnt experience from the previous active phase of a subsystem is retained in the memory and leveraged for parameter learning in its subsequent active and inactive phases. Further, a new notion of intermittent initial excitation (IIE) is introduced that extends the previously established initial excitation (IE) condition to the switched system framework. IIE is shown to be sufficient to ensure exponential convergence of the switched system parameters.
|
|
11:20-11:40, Paper TuAT01.5 | Add to My Program |
An LMI-Based Algorithm for Switching State-Feedback Control of Continuous-Time Switched Systems |
|
Souza, Andressa | University of Campinas |
Oliveira, Ricardo C. L. F. | University of Campinas - UNICAMP |
Peres, Pedro L. D. | University of Campinas |
Keywords: Switched systems, LMIs, Robust control
Abstract: This paper proposes an algorithm based on LMIs to simultaneously design a switching control law and state-feedback gains that robustly stabilize a continuous-time switched linear system. The method is based on the computation of a stable convex combination of a polytope of matrices. First, the problem is reformulated as a relaxed LMI condition where the scalar parameters appear affinely. Differently from the majority of existing approaches, that resort to the use of gridding, a stable convex combination is determined through an iterative procedure. The algorithm is then extended to provide at the same time a switching strategy and the gains that stabilize the closed-loop system, also including the maximization of a bound to the decay rate. Numerical examples illustrate the results.
|
|
11:40-12:00, Paper TuAT01.6 | Add to My Program |
Data-Driven Invariant Subspace Identification for Black-Box Switched Linear Systems |
|
Berger, Guillaume O. | CU Boulder |
Jungers, Raphaël M. | University of Louvain |
Wang, Zheming | Université Catholique De Louvain |
Keywords: Switched systems, Lyapunov methods, Subspace methods
Abstract: We present an algorithmic framework for the identification of candidate invariant subspaces for switched linear systems. Namely, the framework allows to compute an orthonormal basis in which the matrices of the system are close to block-triangular matrices, based on a finite set of observed one-step trajectories and with a priori confidence level. The link between the existence of an invariant subspace and a common block-triangularization of the system matrices is well known. Under some assumptions on the system, one can also infer the existence of an invariant subspace when the matrices are close to be block-triangular. Our approach relies on quadratic Lyapunov analysis and recent tools in scenario optimization. We present two applications of our results for problems of consensus and opinion dynamics; the first one allows to identify the disconnected components in a switching hidden network, while the second one identifies the stationary opinion vector of a switching gossip process with antagonistic interactions.
|
|
TuAT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Adaptive Control I |
|
|
Chair: Sofrony, Jorge Ivan | Universidad Nacional De Colombia |
Co-Chair: Bhasin, Shubhendu | Indian Institute of Technology Delhi |
|
10:00-10:20, Paper TuAT02.1 | Add to My Program |
An Adaptive Pilot Model with Reaction Time-Delay |
|
Habboush, Abdullah | Bilkent University |
Yildiz, Yildiray | Bilkent University |
Keywords: Adaptive control, Adaptive systems, Human-in-the-loop control
Abstract: Practical adaptive control implementations where human pilots coexist in the loop are still uncommon, despite their success in handling uncertain dynamical systems. This is owing to their special nonlinear characteristics which lead to unfavorable interactions between pilots and adaptive controllers. To pave the way for the implementation of adaptive controllers in piloted applications, we propose an adaptive human pilot model that takes into account the time delay in the pilot's response while operating on an adaptive control system. The model can be utilized in the evaluation of adaptive controllers through the simulation environment and guide in their design.
|
|
10:20-10:40, Paper TuAT02.2 | Add to My Program |
PI-Like Estimator-Based Adaptive Extremum Seeking Control Using Initial Excitation |
|
Garg, Tushar | IIIT Delhi |
Basu Roy, Sayan | Indraprastha Institute of Information Technology Delhi |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Adaptive control, Adaptive systems, Lyapunov methods
Abstract: In this paper, a novel proportional integral (PI)-like estimator-based adaptive extremum seeking control (AdESC) algorithm is proposed for online optimization, where parameter convergence is achieved under a relaxed mathematical condition called initial excitation (IE). The proposed AdESC algorithm utilizes a new set of low-pass filter dynamics, which omits the requirement of switching mechanism in past literature for rank-checking while still ensuring parameter convergence. A detailed Lyapunov analysis is carried out using singular-perturbation like principle to establish closed-loop stability of the AdESC algorithm.
|
|
10:40-11:00, Paper TuAT02.3 | Add to My Program |
Adaptive Visual Servoing Trajectory Tracking Control of Nonholonomic Mobile Robot with Position Estimation |
|
Huang, Shunping | Xiamen University |
Lan, Weiyao | Xiamen University |
Yu, Xiao | Xiamen University |
Keywords: Adaptive control, Autonomous robots, Visual servo control
Abstract: This paper investigates the trajectory tracking problem of nonholonomic mobile robot in the environment where localization cannot be obtained. An adaptive visual servoing velocity controller without direct position measurement is presented in the paper. The proposed dynamic controller is based on a position estimation algorithm using feedback from encoders, inertial measurement unit (IMU) and a visual system. It is shown that the controlled mobile robot is able to track the desired trajectory. Simulation example illustrates the effectiveness of the proposed approach.
|
|
11:00-11:20, Paper TuAT02.4 | Add to My Program |
Output-Feedback Design of Longitudinal Platooning with Adaptive Disturbance Decoupling |
|
Liu, Di | University of Groningen |
Besselink, Bart | University of Groningen |
Baldi, Simone | Southeast University |
Yu, Wenwu | Southeast University |
Trentelman, Harry L. | Univ. of Groningen |
Keywords: Adaptive control, Autonomous vehicles, Cooperative control
Abstract: In longitudinal platooning, ‘disturbance decoupling’ refers to the problem of making the inter-vehicle spacing independent of the disturbance input from the preceding vehicle, while ‘adaptive’ refers to handling vehicle parametric uncertainty such as engine time constant. This work constructs a platooning architecture to achieve disturbance decoupling and adaptive goals without absolute position and relative velocity feedback, required in the state of the art. The interest in targeting disturbance decoupling is to have string stability (attenuation of disturbances along the platoon) as a by-product, as well as a decentralized design that helps handling parametric uncertainty. Stability analysis is provided along with numerical simulations.
|
|
11:20-11:40, Paper TuAT02.5 | Add to My Program |
Model Reference Adaptive Anti-Windup Compensation |
|
Sofrony, Jorge Ivan | Universidad Nacional De Colombia |
Turner, Matthew C. | University of Southampton |
Richards, Christopher | University of Louisville |
Keywords: Adaptive control, Constrained control, Robust adaptive control
Abstract: This paper proposes an anti-windup mechanism for a model reference adaptive control (MRAC) scheme subject to actuator saturation constraints. The proposed compensator has the same architecture as well known non-adaptive schemes, which rely on the assumption that the system model is known fairly accurately. This is in contrast to the adaptive nature of the controller, which assumes that the system (or parts of it) is unknown. The approach proposed here uses of an “estimate” of the system matrices for the anti-windup com- pensator formulation and modifies the adaptation laws that update the controller gains. It will be observed that if the (unknown) ideal control gain is reached, a type of “model recovery anti-windup” formulation is obtained. In addition, it is shown that if the control signal eventually lies within the control constraints, then, under certain conditions, the system states will converge to those of the reference model as desired. The paper highlights the main challenges involved in the design of anti-windup compensators for model-reference adaptive control systems and demonstrates its success via a simulation of a flight control application.
|
|
11:40-12:00, Paper TuAT02.6 | Add to My Program |
State and Input Constrained Model Reference Adaptive Control |
|
Ghosh, Poulomee | Indian Institute of Technology Delhi |
Bhasin, Shubhendu | Indian Institute of Technology Delhi |
Keywords: Adaptive control, Constrained control, Linear systems
Abstract: Satisfaction of state and input constraints is one of the most critical requirements in control engineering applications. In classical model reference adaptive control (MRAC) formulation, although the states and the input remain bounded, the bound is neither user-defined nor known a-priori. In this paper, an MRAC is developed for multivariable linear time-invariant (LTI) plant with user-defined state and input constraints using a simple saturated control design coupled with a barrier Lyapunov function (BLF). Without any restrictive assumptions that may limit practical implementation, the proposed controller guarantees that both the plant state and the control input remain within a user-defined safe set for all time while simultaneously ensuring that the plant state trajectory tracks the reference model trajectory. The controller ensures that all the closed-loop signals remain bounded and the trajectory tracking error converges to zero asymptotically. Simulation results validate the efficacy of the proposed constrained MRAC in terms of better tracking performance and limited control effort compared to the standard MRAC algorithm.
|
|
TuAT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Robotics I |
|
|
Chair: Tron, Roberto | Boston University |
Co-Chair: Laha, Riddhiman | Technical University of Munich |
|
10:00-10:20, Paper TuAT03.1 | Add to My Program |
Optimal Linear Multiple Estimation for Landmark-Based Planning Via Control Synthesis |
|
Wang, Chenfei | Boston University |
Tron, Roberto | Boston University |
Keywords: Robotics, Robust control, Optimization
Abstract: A common way to implement navigation in mobile robots is through the use of landmarks. In this case, the main goal of the controller is to make progress toward a goal location (stability), while avoiding the boundary of the environment (safety). In our previous work, we proposed a method to synthesize global controllers for environments with a polyhedral decomposition; our solution uses a Quadratically Constrained Quadratic Program with Chance Constraints to take into account the uncertainty in landmark measurements. Building upon this work, we introduce the concept of virtual landmarks, which are defined as linear combinations of actual landmark measurements that minimize the uncertainty in the resulting control actions. Interestingly, our results show that the first minimum-variance landmark is independent of the feedback control matrix, thus decoupling the design of the landmark from the one of the controller; attempting to derive additional, statistically independent landmarks, however, requires solving non-convex problems that involve also the controller. Numerical experiments demonstrate that, by minimizing the variance of the inputs, the resulting controller will be less conservative, and the robot is able to complete navigation tasks more effectively (faster and with less jitter in the trajectory).
|
|
10:20-10:40, Paper TuAT03.2 | Add to My Program |
Noncooperative Herding with Control Barrier Functions: Theory and Experiments |
|
Grover, Jaskaran | CMU |
Mohanty, Nishant | Carnegie Mellon Univarsity |
Luo, Wenhao | University of North Carolina at Charlotte |
Liu, Changliu | Carnegie Mellon University |
Sycara, Katia | Carnegie Mellon University |
Keywords: Robotics, Optimization, Control applications
Abstract: In this paper, we consider the problem of protecting a high-value unit from inadvertent attack by a group of agents using defending robots. Specifically, we develop a control strategy for the defending agents that we call ``dog robots" to prevent a flock of ``sheep agents" from breaching a protected zone. We take recourse to control barrier functions to pose this problem and exploit the interaction dynamics between the sheep and dogs to find dogs' velocities that result in the sheep getting repelled from the zone. We solve a QP reactively that incorporates the defending constraints to compute the desired velocities for all dogs. Owing to this, our proposed framework is composable textit{i.e.} it allows for simultaneous inclusion of multiple protected zones in the constraints on dog robots' velocities. We provide a theoretical proof of feasibility of our strategy for the one dog/one sheep case. Additionally, we provide empirical results of two dogs herding upto ten sheep averaged over a hundred simulations and report high success rates. We also demonstrate this algorithm experimentally on non-holonomic robots. Videos of these results are available at url{https://tinyurl.com/4dj2kjwx}.
|
|
10:40-11:00, Paper TuAT03.3 | Add to My Program |
On Forward Kinematics of a 3SPR Parallel Manipulator |
|
Roudneshin, Masoud | Concordia University |
Ghaffari, Kamran | Touché Technologies |
Aghdam, Amir G. | Concordia University |
Keywords: Robotics, Optimization, Estimation
Abstract: This paper presents a new numerical method to solve the forward kinematics (FK) of a parallel manipulator with a three-limb spherical-prismatic-revolute (3SPR) structure. Unlike the existing numerical approaches that require the manipulator’s Jacobian matrix and its inverse at each iteration, the proposed algorithm requires much less complex computations to estimate the FK parameters. A cost function is introduced that measures the difference between actual FK values and its estimates. At each iteration, the problem is handled in two steps. First, the estimates of the platform orientation from the heave estimates are obtained. Then, the heave estimates are updated along the gradient direction of the proposed cost function. To evaluate the performance of the proposed algorithm, we compare results with those of a Jacobian-based (JB) approach for a 3SPR parallel manipulator.
|
|
11:00-11:20, Paper TuAT03.4 | Add to My Program |
Continuous Jumping for Legged Robots on Stepping Stones Via Trajectory Optimization and Model Predictive Control |
|
Nguyen, Chuong | University of Southern California |
Bao, Lingfan | University of Southern California |
Nguyen, Quan | University of Southern California |
Keywords: Robotics
Abstract: Performing highly agile dynamic motions, such as jumping or running on uneven stepping stones has remained a challenging problem in legged robot locomotion. This paper presents a framework that combines trajectory optimization and model predictive control to perform robust and consecutive jumping on stepping stones. In our approach, we first utilize trajectory optimization based on full-nonlinear dynamics of the robot to generate periodic jumping trajectories for various jumping distances. A jumping controller based on a model predictive control is then designed for realizing smooth jumping transitions, enabling the robot to achieve continuous jumps on stepping stones. Thanks to the incorporation of MPC as a real-time feedback controller, the proposed framework is also validated to be robust to uneven platforms with unknown height perturbations and model uncertainty on the robot dynamics.
|
|
11:20-11:40, Paper TuAT03.5 | Add to My Program |
An Adaptive Cooperative Manipulation Control Framework for Multi-Agent Disturbance Rejection |
|
Aladele, Victor | Georgia Institute of Technology |
de Cos, Carlos Rodriguez | KTH Royal Institute of Technology |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Hutchinson, Seth | Georgia Tech |
Keywords: Robotics, Cooperative control, Adaptive control
Abstract: The success of a cooperative manipulation process depends on the level of disturbance rejection between the cooperating agents. However, this attribute may be jeopardized due to unexpected behaviors, such as joint saturation or internal collisions. This leads to deterioration in the performance of the manipulation task. In this paper, we present an adaptive distributed control framework that directly mitigates these internal disturbances, both in the joint (and task) spaces. With our approach, we show that including the manipulator-load coupling in the definition of the task error yields improved performance and robustness. To validate this statement, we provide stability guarantees and simulation results for two implementation cases.
|
|
11:40-12:00, Paper TuAT03.6 | Add to My Program |
Adaptive Admittance Control for Cooperative Manipulation Using Dual Quaternion Representation and Logarithmic Mapping |
|
Nebbia Colomba, Rachele | University of Pisa |
Laha, Riddhiman | Technical University of Munich |
Figueredo, Luis Felipe Cruz | University of Brasília |
Haddadin, Sami | Technische Universität München |
Keywords: Robotics, Cooperative control, Adaptive control
Abstract: This paper addresses the problem of admittance control for bimanual and cooperative manipulators. First, a novel admittance controller is devised to be consistent with the geometry of the cooperative task space and exploits the dual quaternion logarithmic mapping of wrenches and corresponding elastic displacements. By describing the coupled motion exploiting the geometric features of dual quaternions, the desired compliant behaviour is enforced on the whole dual-arm system and not on the single manipulator separately. The solution is thereafter extended by means of an adaptive modulation of the impedance gains along the logarithmic space of the dual quaternion algebra for cooperative dual task-space variables. The overall scheme consists of a stiffness adapter, cooperative admittance controller, and wrench adapter complete with an inner motion controller along with the cooperative system. Stability proofs and conditions for the stiffness and damping adaptation of the cooperative variables is provided. Finally, we present a use case depicting a cooperative variable stiffness adaptation policy with respect to the relative elastic behaviour between arms to reduce internal stress whilst satisfying grasp feasibility under shear forces and varying load conditions. Our formulation is validated by thorough simulation studies on two Franka Emika Panda robots carrying a shared object under varying loading conditions. The experimental investigations reinforce our claim of robust performance when compared with fixed stiffness and standard width-adaptation solutions.
|
|
TuAT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Machine Learning I |
|
|
Chair: Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Co-Chair: Paulson, Joel | The Ohio State University |
|
10:00-10:20, Paper TuAT04.1 | Add to My Program |
Performance-Driven Controller Tuning Via Derivative-Free Reinforcement Learning |
|
Lei, Yuheng | Tsinghua University |
Chen, Jianyu | Tsinghua University |
Li, Shengbo Eben | Tsinghua University |
Zheng, Sifa | Tsinghua University |
Keywords: Machine learning, Autonomous vehicles, Identification for control
Abstract: Choosing an appropriate parameter set for the designed controller is critical for the final performance but usually requires a tedious and careful tuning process, which implies a strong need for automatic tuning methods. However, among existing methods, derivative-free ones suffer from poor scalability or low efficiency, while gradient-based ones are often unavailable due to possibly non-differentiable controller structure. To resolve the issues, we tackle the controller tuning problem using a novel derivative-free reinforcement learning (RL) framework, which performs timestep-wise perturbation in parameter space during experience collection and integrates derivative-free policy updates into the advanced actor-critic RL architecture to achieve high versatility and efficiency. To demonstrate the framework’s efficacy, we conduct numerical experiments on two concrete examples from autonomous driving, namely, adaptive cruise control with PID controller and trajectory tracking with MPC controller. Experimental results show that the proposed method outperforms popular baselines and highlight its strong potential for controller tuning.
|
|
10:20-10:40, Paper TuAT04.2 | Add to My Program |
Efficient Multi-Step Lookahead Bayesian Optimization with Local Search Constraints |
|
Paulson, Joel | The Ohio State University |
Sorourifar, Farshud | Ohio State University |
Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Keywords: Machine learning, Computational methods, Optimization algorithms
Abstract: Bayesian optimization (BO) is a data-efficient approach for optimizing expensive-to-evaluate black-box functions that suffer from noisy evaluations. Traditional BO algorithms ignore the relationship between consecutive input values, which is known to lead to significant "jumps" in the search space that cannot be implemented in practice, especially in online experimental systems. For example, in performance-driven control applications, large changes in the chosen setpoint parameters may trigger fail-safe mechanisms and/or lead to violation of critical safety constraints. In such applications, it is necessary to limit the allowable search space at each BO iteration, which can be done by incorporating local search constraints into the original problem setting. In this paper, we show how this novel BO setting can be cast as a Markov decision process (MDP) for which the optimal policy is characterized by an intractable dynamic programming (DP) problem. To overcome this challenge, we take advantage of approximate DP methods, particularly rollout with fast policy search, to derive an efficient multi-step lookahead BO policy. We also propose a novel base policy needed for the rollout algorithm, which explicitly incorporates the local search restrictions in an efficient and intuitive manner. Lastly, we empirically show that our proposed multi-step lookahead BO policy outperforms existing methods on a well-known benchmark problem.
|
|
10:40-11:00, Paper TuAT04.3 | Add to My Program |
Optimizing Closed-Loop Performance with Data from Similar Systems: A Bayesian Meta-Learning Approach |
|
Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Keywords: Machine learning, Computational methods, Simulation
Abstract: Bayesian optimization (BO) has demonstrated potential for optimizing control performance in data-limited settings, especially for systems with unknown dynamics or unmodeled performance objectives. The BO algorithm efficiently trades-off exploration and exploitation by leveraging uncertainty estimates using surrogate models. These surrogates are usually learned using data collected from the target dynamical system to be optimized. Intuitively, the convergence rate of BO is better for surrogate models that can accurately predict the target system performance. In classical BO, initial surrogate models are constructed using very limited data points, and therefore rarely yield accurate predictions of system performance. In this paper, we propose the use of meta-learning to generate an initial surrogate model based on data collected from performance optimization tasks performed on a variety of systems that are different to the target system. To this end, we employ deep kernel networks (DKNs) which are simple to train and which comprise encoded Gaussian process models that integrate seamlessly with classical BO. The effectiveness of our proposed DKN-BO approach for speeding up control system performance optimization is demonstrated using a well-studied nonlinear system with unknown dynamics and an unmodeled performance function.
|
|
11:00-11:20, Paper TuAT04.4 | Add to My Program |
Learning the Conditional Law: Signatures and Conditional GANs in Filtering and Prediction of Diffusion Processes |
|
Germ, Fabian | University of Edinburgh |
Sabate Vidales, Marc | The University of Edinburgh |
Keywords: Machine learning, Filtering, Estimation
Abstract: We consider the filtering and prediction problem for a diffusion process. The signal and observation are modeled by stochastic differential equations (SDEs) driven by correlated Wiener processes. In classical estimation theory, measure-valued stochastic partial differential equations (SPDEs) are derived for the filtering and prediction measures. These equations can be hard to solve numerically. We provide an approximation algorithm using conditional generative adversarial networks (GANs) in combination with signatures, an object from rough path theory. The signature of a sufficiently smooth path determines the path completely. As a result, in some cases, GANs based on signatures have been shown to efficiently approximate the law of a stochastic process. For our algorithm we extend this method to sample from the conditional law, given noisy, partial observation. Our generator is constructed using neural differential equations (NDEs), relying on their universal approximator property. We show well-posedness in providing a rigorous mathematical framework. Numerical results show the efficiency of our algorithm.
|
|
11:20-11:40, Paper TuAT04.5 | Add to My Program |
Risk-Averse Multi-Armed Bandits with Unobserved Confounders: A Case Study in Emotion Regulation in Mobile Health |
|
Shen, Yi | Duke University |
Dunn, Jessilyn | Duke University |
Zavlanos, Michael M. | Duke University |
Keywords: Machine learning, Healthcare and medical systems, Optimization
Abstract: In this paper, we consider a risk-averse multi-armed bandit (MAB) problem where the goal is to learn a policy that minimizes the risk of low expected return, as opposed to maximizing the expected return itself, which is the objective in the usual approach to risk-neutral MAB. Specifically, we formulate this problem as a transfer learning problem between an expert and a learner agent in the presence of contexts that are only observable by the expert but not by the learner. Thus, such contexts are unobserved confounders (UCs) from the learner's perspective. Given a dataset generated by the expert that excludes the UCs, the goal for the learner is to identify the true minimum-risk arm with fewer online learning steps, while avoiding possible biased decisions due to the presence of UCs in the expert's data. To achieve this, we first formulate a mixed-integer linear program that uses the expert data to obtain causal bounds on the Conditional Value at Risk (CVaR) of the true return for all possible UCs. We then transfer these causal bounds to the learner by formulating a causal bound constrained Upper Confidence Bound (UCB) algorithm to reduce the variance of online exploration and, as a result, identify the true minimum-risk arm faster, with fewer new samples. We provide a regret analysis of our proposed method and show that it can achieve zero or constant regret. Finally, we use an emotion regulation in mobile health example to show that our proposed method outperforms risk-averse MAB methods without causal bounds.
|
|
11:40-12:00, Paper TuAT04.6 | Add to My Program |
Learning Reduced Nonlinear State-Space Models: An Output-Error Based Canonical Approach |
|
Janny, Steeven | LAGEPP, Université Claude Bernard |
Possamai, Quentin | University of Lyon |
Bako, Laurent | Ecole Centrale De Lyon |
Nadri, Madiha | Universite Claude Bernard Lyon 1 |
Wolf, Christian | INSA-Lyon, LIRIS |
Keywords: Machine learning, Identification, Observers for nonlinear systems
Abstract: The identification of a nonlinear dynamic model is an open topic in control theory, especially from sparse input-output measurements. A fundamental challenge of this problem is that very few to zero prior knowledge is available on both the state and the nonlinear system model. To cope with this challenge, we investigate the effectiveness of deep learning in the modeling of dynamic systems with nonlinear behavior by advocating an approach which relies on three main ingredients: (i) we show that under some structural conditions on the to-be-identified model, the state can be expressed in function of a sequence of the past inputs and outputs; (ii) this relation which we call the state map can be modelled by resorting to the well-documented approximation power of deep neural networks; (iii) taking then advantage of existing learning schemes, a state-space model can be finally identified. After the formulation and analysis of the approach, we show its ability to identify three different nonlinear systems. The performances are evaluated in terms of open-loop prediction on test data generated in simulation as well as a real world data-set of unmanned aerial vehicle flight measurements.
|
|
TuAT05 Invited Session, Tulum Ballroom E |
Add to My Program |
Risk-Aware Learning, Verification, and Control |
|
|
Chair: Kalogerias, Dionysios | Yale University |
Co-Chair: Tsiamis, Anastasios | University of Pennsylvania |
Organizer: Lindemann, Lars | University of Pennsylvania |
Organizer: Tsiamis, Anastasios | University of Pennsylvania |
Organizer: Chapman, Margaret P | University of Toronto |
Organizer: Kalogerias, Dionysios | Yale University |
Organizer: Pappas, George J. | University of Pennsylvania |
|
10:00-10:20, Paper TuAT05.1 | Add to My Program |
Distributionally Robust Model Predictive Control with Total Variation Distance |
|
Dixit, Anushri | Caltech |
Ahmadi, Mohamadreza | California Institute of Technology |
Burdick, Joel W. | California Inst. of Tech |
Keywords: Stochastic optimal control, Predictive control for linear systems, Stochastic systems
Abstract: This paper studies the problem of distributionally robust model predictive control (MPC) using total variation distance ambiguity sets. For a discrete-time linear system with additive disturbances, we provide a conditional value-at-risk reformulation of the MPC optimization problem that is distributionally robust in the expected cost and chance constraints. The distributionally robust chance constraint is over-approximated as a simpler, tightened chance constraint that reduces the computational burden. Numerical experiments support our results on probabilistic guarantees and computational efficiency.
|
|
10:20-10:40, Paper TuAT05.2 | Add to My Program |
Safe Control for Nonlinear Systems with Stochastic Uncertainty Via Risk Control Barrier Functions |
|
Singletary, Andrew | California Institute of Technology |
Ahmadi, Mohamadreza | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Uncertain systems, Stochastic systems, Stability of nonlinear systems
Abstract: Guaranteeing safety for robotic and autonomous systems in real-world environments is a challenging task that requires the mitigation of stochastic uncertainties. Control barrier functions have, in recent years, been widely used for enforcing safety related set-theoretic properties, such as forward invariance and reachability, of nonlinear dynamical systems. In this paper, we extend this rich framework to nonlinear discrete-time systems subject to stochastic uncertainty and propose a framework for assuring risk-sensitive safety in terms of coherent risk measures. To this end, we introduce risk control barrier functions (RCBFs), which are compositions of barrier functions and dynamic, coherent risk measures. We show that the existence of such barrier functions implies invariance in a coherent risk sense. Furthermore, we formulate conditions based on finite-time RCBFs to guarantee finite-time reachability to a desired set in the coherent risk. Conditions for risk-sensitive safety and finite-time reachability of sets composed of Boolean compositions of multiple RCBF are also formulated. We show the efficacy of the proposed method through its application to a cart-pole system in a safety-critical scenario.
|
|
10:40-11:00, Paper TuAT05.3 | Add to My Program |
Learning Nonlinear Couplings in Swarm of Agents from a Single Sample Trajectory (I) |
|
Amini, Arash | Lehigh UNiversity |
Motee, Nader | Lehigh University |
Sun, Qiyu | University of Central Florida |
Keywords: Statistical learning, Learning, Networked control systems
Abstract: We prove that the governing dynamics of a class of stochastic networks can be learned through their coupling functions. It is shown that the dynamics of such networks can generate geometrically ergodic trajectories under some reasonable assumptions. We show that a general class of coupling functions can be learned using only one sample trajectory from the network. This approach is plausible in numerous applications when one prefers to run an experiment only once but for a more extended period rather than repeating the same experiment multiple times under different initial conditions. Building upon ideas from the concentration inequalities for geometrically ergodic Markov chains, we formulate several results about the convergence of the empirical estimator to the actual coupling function. Extensive simulation results support our theoretical findings.
|
|
11:00-11:20, Paper TuAT05.4 | Add to My Program |
Robust Control Barrier Functions for Nonlinear Control Systems with Uncertainty: A Duality-Based Approach (I) |
|
Cohen, Max | Boston University |
Belta, Calin | Boston University |
Tron, Roberto | Boston University |
Keywords: Lyapunov methods, Nonlinear systems, Uncertain systems
Abstract: This paper studies the design of controllers that guarantee stability and safety of nonlinear control affine systems with parametric uncertainty in both the drift and control vector fields. To this end, we introduce novel classes of robust control barrier functions (RCBF) and robust control Lyapunov functions (RCLF) that facilitate the synthesis of safety-critical controllers in the presence of parametric uncertainty using quadratic programming. Since the initial bounds on the system uncertainty may be highly conservative, we present a data-driven approach to reducing such bounds using input-output data collected online. In particular, we leverage an integral set-membership identification algorithm that iteratively shrinks the set of possible system parameters online and guarantees stability and safety during learning. The efficacy of the developed approach is illustrated via numerical examples.
|
|
11:20-11:40, Paper TuAT05.5 | Add to My Program |
Risk-Aware UAV-UGV Rendezvous with Chance-Constrained Markov Decision Process (I) |
|
Shi, Guangyao | University of Maryland |
Karapetyan, Nare | University of Maryland |
Asghar, Ahmad Bilal | University of Maryland |
Reddinger, Jean-Paul | DEVCOM Army Research Laboratory |
Dotterweich, James | DEVCOM Army Research Laboratory |
Humann, James | DEVCOM Army Research Laboratory |
Tokekar, Pratap | Virginia Tech |
Keywords: Autonomous robots, Cooperative control, Stochastic optimal control
Abstract: We study a chance-constrained variant of the cooperative aerial-ground vehicle routing problem, in which an Unmanned Aerial Vehicle (UAV) with limited battery capacity and an Unmanned Ground Vehicle (UGV) that can also act as a mobile recharging station need to jointly accomplish a mission such as monitoring a set of points. Due to the limited battery capacity of the UAV, two vehicles sometimes have to deviate from their task to rendezvous and recharge the UAV@. Unlike prior work that has focused on the deterministic case, we address the challenge of stochastic energy consumption of the UAV@. We are interested in finding the optimal policy that decides when and where to rendezvous such that the expected travel time of the UAV is minimized and the probability of running out of charge is less than a user-defined tolerance. We formulate this problem as a Chance Constrained Markov Decision Process (CCMDP). To the best knowledge of the authors, this is the first CMDP-based formulation for the UAV-UGV routing problems under power consumption uncertainty. We adopt a Linear Programming (LP) based approach to solve the problem optimally. We demonstrate the effectiveness of our formulation in the context of an Intelligence Surveillance and Reconnaissance (ISR) mission.
|
|
11:40-12:00, Paper TuAT05.6 | Add to My Program |
State-Output Risk-Constrained Quadratic Control of Partially Observed Linear Systems (I) |
|
Koumpis, Nikolaos | Yale University |
Tsiamis, Anastasios | University of Pennsylvania |
Kalogerias, Dionysios | Yale University |
Keywords: Stochastic optimal control, Stochastic systems, Optimization
Abstract: We propose a methodology for performing risk-averse quadratic regulation of partially observed Linear Time-Invariant (LTI) systems, disturbed by process and output noise. To compensate against the induced variability due to both types of noises, state regulation is subject to two risk constraints. The latter renders the resulting controller to be cautious of stochastic disturbances, by restricting the statistical variability, namely, a simplified version of the cumulative expected predictive variance, of both the state and the output. It turns out that our proposed formulation results in an optimal risk-averse policy that preserves favorable characteristics of the classical Linear Quadratic (LQ) control. In particular, the optimal policy has an affine structure with respect to the minimum mean square error (mmse) estimates. The linear component of the policy regulates the state more strictly in riskier directions, where the process and output noise covariance, cross-covariance, and the corresponding penalties are simultaneously large. This is achieved by ``inflating" the state penalty in a systematic way. The additional affine terms force the state against pure and cross third-order statistics of the process and output disturbances. Another favorable characteristic of our optimal policy is that it can be pre-computed off-line, thus, avoiding limitations of prior work. Stability analysis shows that the derived controller is always internally stable regardless of parameter tuning. The functionality of the proposed risk-averse policy is illustrated through a working example via extensive numerical simulations.
|
|
TuAT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Kernel-Based Identification |
|
|
Chair: Alamo, Teodoro | Universidad De Sevilla |
Co-Chair: Smith, Roy S. | ETH Zurich |
|
10:00-10:20, Paper TuAT06.1 | Add to My Program |
Dealing with Collinearity in Large-Scale Linear System Identification Using Bayesian Regularization |
|
Cao, Wenqi | Shanghai Jiao Tong University |
Pillonetto, Gianluigi | University of Padova |
Keywords: Identification, Statistical learning, Large-scale systems
Abstract: We consider the identification of large-scale linear and stable dynamic systems whose outputs may be the result of many correlated inputs. Hence, severe ill-conditioning may affect the estimation problem. This is a scenario often arising when modeling complex physical systems given by the interconnection of many sub-units where feedback and algebraic loops can be encountered. We develop a strategy based on Bayesian regularization where any impulse response is modeled as the realization of a zero-mean Gaussian process. The stable spline covariance is used to include information on smooth exponential decay of the impulse responses. We then design a new Markov chain Monte Carlo scheme that deals with collinearity and is able to efficiently reconstruct the posterior of the impulse responses. It is based on a variation of Gibbs sampling which updates possibly overlapping blocks of the parameter space on the basis of the level of collinearity affecting the different inputs. Numerical experiments are included to test the goodness of the approach where hundreds of impulse responses form the system and inputs correlation may be very high.
|
|
10:20-10:40, Paper TuAT06.2 | Add to My Program |
Bayesian Inference of Total Least-Squares with Known Precision |
|
Friml, Dominik | Brno University of Technology |
Vaclavek, Pavel | Brno University of Technology |
Keywords: Identification, Identification for control, Stochastic systems
Abstract: This paper provides a Bayesian analysis of the total least-squares problem with independent Gaussian noise of known variance. It introduces a derivation of the likelihood density function, conjugate prior probability-density function, and the posterior probability-density function. All in the shape of the Bingham distribution, introducing an unrecognized connection between orthogonal least-squares methods and directional analysis. The resulting Bayesian inference expands on available methods with statistical results. A recursive statistical identification algorithm of errors-in-variables models is laid-out. An application of the introduced inference is presented using a simulation example, emulating part of the identification process of linear permanent magnet synchronous motor drive parameters. The paper represents a crucial step towards enabling Bayesian statistical methods for problems with errors in variables.
|
|
10:40-11:00, Paper TuAT06.3 | Add to My Program |
Kernel Regularization for Unstable Systems |
|
Fujimoto, Yusuke | The University of Kitakyushu |
Keywords: Identification, Closed-loop identification
Abstract: Recent applications have proven the effectiveness of kernel-based regularization approach in system identification. Most of the existing kernel-based regularization methods design the regularization term based on the exponential convergence of impulse response, thus enforcing stability of the system. Therefore, it is difficult to identify unstable systems with existing methods. This paper employs the coefficients of Laurent series expansion defined on the unit circle as a model structure, and proposes a way to use kernels for stable systems in identifying the coefficients. Since the coefficients show exponential convergence even when the system is unstable, the proposed method is applicable for unstable systems. The effectiveness of the proposed method is demonstrated by a numerical example.
|
|
11:00-11:20, Paper TuAT06.4 | Add to My Program |
Kernel Based State-Space Kriging: Application to Predictive Control |
|
Carnerero, A. Daniel | University of Seville |
Ramirez, Daniel R. | Univ. of Sevilla |
Alamo, Teodoro | Universidad De Sevilla |
Keywords: Nonlinear systems identification, Kalman filtering, Predictive control for nonlinear systems
Abstract: In this paper, we extend the State-Space Kriging (SSK) modelling technique presented in a previous work by the authors so that non autonomous systems can be considered. The SSK method computes predictions as linear combinations of past outputs. In order to model nonlinearity, we develop here a new version of the SSK where kernel functions are used to model nonlinear dynamics instead of resorting on considerations about local data, which in practice is akin to a linearization. Also, a Kalman filter is used to improve the obtained predictions at each time step in the case of noisy measurements. A constrained Nonlinear Model Predictive Control (NMPC) scheme is built upon the black-box input-output system obtained by means of the SSK prediction method. Finally, a numerical example of a continuously stirred tank reactor (CSTR) is provided in order to assess the performance of the proposed controller.
|
|
11:20-11:40, Paper TuAT06.5 | Add to My Program |
Kernel-Based Identification of Local Limit Cycle Dynamics with Linear Periodically Parameter-Varying Models |
|
Ozan, Defne Ege | Imperial College London |
Yin, Mingzhou | ETH Zurich |
Iannelli, Andrea | ETH Zurich |
Smith, Roy S. | ETH Zurich |
Keywords: Nonlinear systems identification, Linear parameter-varying systems, Modeling
Abstract: Limit cycle oscillations are phenomena arising in nonlinear dynamical systems and characterized by periodic, locally-stable, and self-sustained state trajectories. Systems controlled in a closed loop along a periodic trajectory can also be modelled as systems experiencing limit cycle behavior. The goal of this work is to identify from data, the local dynamics around the limit cycle using linear periodically parameter-varying models. Using a coordinate transformation onto transversal surfaces, the dynamics are decomposed into two parts: one along the limit cycle, and one on the transversal surfaces. Then, the model is identified from trajectory data using kernel-based methods with a periodic kernel design. The kernel-based model is extended to also account for variations in system parameters associated with different operating conditions. The performance of the proposed identification method is demonstrated on a benchmark nonlinear system and on a simplified airborne wind energy model. The method provides accurate model parameter estimation, compared to the analytical linearization, and good prediction capability.
|
|
11:40-12:00, Paper TuAT06.6 | Add to My Program |
Optimally Regularized Local Basis Function Approach to Identification of Time-Varying Systems |
|
Niedzwiecki, Maciej | Gdansk University of Technology |
Gancza, Artur | Gdansk University of Technology, Faculty of Electronics Telecomm |
Keywords: Estimation, Identification, Optimization
Abstract: Accurate identification of stochastic systems with fast-varying parameters is a challenging task which cannot be accomplished using model-free estimation methods, such as weighted least squares, which assume only that system coefficients can be regarded as locally constant. The current state of the art solutions are based on the assumption that system parameters can be locally approximated by a linear combination of appropriately chosen basis functions. The paper shows that when the internal correlation structure of estimated parameters is known, the tracking performance of the local basis function estimation algorithms can be further improved by means of regularization. The optimal form of the regularization matrix is derived analytically and it is shown that the best settings of the regularized algorithm can be determined in the computationally efficient way using cross-validation.
|
|
TuAT07 Regular Session, Tulum Ballroom G |
Add to My Program |
Estimation |
|
|
Chair: Bazanella, Alexandre S. | Univ. Federal Do Rio Grande Do Sul |
Co-Chair: Reger, Johann | TU Ilmenau |
|
10:00-10:20, Paper TuAT07.1 | Add to My Program |
High-Gain Observer Design for Nonlinear Systems with Delayed Output Measurements Using Time-Varying Gains |
|
Adil, Ania | University of Tizi-Ouzou |
N'Doye, Ibrahima | King Abdullah University of Science and Technology (KAUST) |
Laleg-Kirati, Taous-Meriem | King Abdullah University of Science and Technology (KAUST) |
Keywords: Estimation, Delay systems
Abstract: This paper proposes a high-gain observer design for nonlinear systems with delayed output measurements using time-varying gains. The proposed observer is endowed with an exponential stability guarantee and relies on the generalization of the Halanay-type inequalities. We establish that the estimated state and the adapted gain are exponentially bounded and prevent the oscillatory response of the estimates. The time-varying gain feature limits the constant high-gain values of the standard high-gain observer design to the minimum gain required to achieve stability. Furthermore, we derive an explicit relation between the maximum bound of the delay and the maximum gain parameter by using a Lyapunov-Krasovskii functional jointly with the time-varying Halanay inequality. Finally, a comparison with the standard high-gain observer is provided through numerical simulations to demonstrate the superiority of the proposed high-gain observer in rejecting the noise and reducing the peaking phenomena.
|
|
10:20-10:40, Paper TuAT07.2 | Add to My Program |
A Bayesian Approach to Event-Triggered Remote State Estimation with Intermittent Measurements Over a Gaussian Channel |
|
Deng, Di | University of Science and Technology of China |
Xiong, Junlin | University of Science and Technology of China |
Keywords: Estimation, Filtering, Networked control systems
Abstract: This paper investigates the stochastic event-triggered remote state estimation problem with intermittent measurements over a Gaussian communication channel. Because of the channel noises, the remote estimator does not know whether the measurements are received and transmitted by the sensor. Assuming the prior distribution of the estimation process is Gaussian, an approximate minimum mean squared error estimator with adaptive weights is derived by the Bayesian inference method. The proposed estimate convexly combines the estimates for three arrival and transmission cases. Then the proposed estimator is shown to be conventional forms under three extreme situations. Numerical results demonstrate that our estimator has comparable performance to the Kalman filtering with intermittent measurements without the knowledge of the arrival variables and the triggering decisions.
|
|
10:40-11:00, Paper TuAT07.3 | Add to My Program |
GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios |
|
Chiu, Chih-Yuan | University of California, Berkeley |
Fridovich-Keil, David | The University of Texas at Austin |
Keywords: Estimation, Game theory, Autonomous robots
Abstract: Robots operating in multi-player settings must simultaneously model the environment and the behavior of human or robotic agents who share that environment. This modeling is often approached using Simultaneous Localization and Mapping (SLAM); however, SLAM algorithms usually neglect multi-player interactions. In contrast, the motion planning literature often uses dynamic game theory to explicitly model noncooperative interactions of multiple agents in a known environment with perfect localization. Here, we present GTP-SLAM, a novel, iterative best response-based SLAM algorithm that accurately performs state localization and map reconstruction, while using emph{game theoretic priors} to capture the inherent non-cooperative interactions among multiple agents in an uncharted scene. By formulating the underlying SLAM problem as a potential game, we inherit a strong convergence guarantee. Empirical results indicate that, when deployed in a realistic traffic simulation, our approach performs localization and mapping more accurately than a standard bundle adjustment algorithm across a wide range of noise levels.
|
|
11:00-11:20, Paper TuAT07.4 | Add to My Program |
Accelerating Extremum Seeking Convergence by Richardson Extrapolation Methods |
|
Metsch, Jan-Henrik | University of Freiburg |
Neuhauser, Jonathan | Karlsruhe Institute of Technology |
Jouffroy, Jerome | University of Southern Denmark |
Laleg-Kirati, Taous-Meriem | King Abdullah University of Science and Technology (KAUST) |
Reger, Johann | TU Ilmenau |
Keywords: Estimation, Optimization algorithms, Variational methods
Abstract: In this paper, we propose the concept of accelerated convergence that has originally been developed to speed up the convergence of numerical methods for extremum seeking (ES) loops. We demonstrate how the dynamics of ES loops may be analyzed to extract structural information about the generated output of the loop. This information is then used to distil the limit of the loop without having to wait for the system to converge to it.
|
|
11:20-11:40, Paper TuAT07.5 | Add to My Program |
Toward Scalable Risk Analysis for Stochastic Systems Using Extreme Value Theory |
|
Arsenault, Evan | University of Toronto |
Wang, Yuheng | University of Toronto |
Chapman, Margaret P | University of Toronto |
Keywords: Stochastic systems, Estimation
Abstract: We aim to analyze the behaviour of a finite-time stochastic system, whose model is not available, in the context of more rare and harmful outcomes. Standard estimators are not effective in making predictions about such outcomes due to their rarity. Instead, we use Extreme Value Theory (EVT), the theory of the long-term behaviour of normalized maxima of random variables. We quantify risk using the upper-semideviation rho(Y) := E(max{Y - mu,0}) of an integrable random variable Y with mean mu := E(Y). rho(Y) is the risk-aware part of the common mean-upper-semideviation functional, mu + lambda rho(Y) with lambda in [0,1]. To assess more rare and harmful outcomes, we propose an EVT-based estimator for rho(Y) in a given fraction of the worst cases. We show that our estimator enjoys a closed-form representation in terms of the popular conditional value-at-risk functional. In experiments, we illustrate the extrapolation power of our estimator using a small number of i.i.d. samples (<50). Our approach is useful for estimating the risk of finite-time systems when models are inaccessible and data collection is expensive. The numerical complexity does not grow with the size of the state space.
|
|
11:40-12:00, Paper TuAT07.6 | Add to My Program |
On-Line Estimation of Stability and Passivity Metrics |
|
Welikala, Shirantha | University of Notre Dame |
Lin, Hai | University of Notre Dame |
Antsaklis, Panos J. | University of Notre Dame |
Keywords: Estimation, Identification for control, Optimization
Abstract: We consider the problem of on-line evaluation of critical characteristic parameters such as the L_2-gain, input feedforward passivity index and output feedback passivity index of non-linear systems using their input-output data. Typically, having an accurate measure of such emph{system indices} enables the application of systematic control design techniques. Moreover, if such system indices can efficiently be evaluated on-line, they can be exploited to design intelligent control solutions. However, the existing off-line estimation methods of such system indices are computationally inefficient and require a large amount of input-output data. On the other hand, the existing on-line estimation methods take an averaging-based approach, which may be sub-optimal, computationally inefficient and susceptible to estimate saturation. To overcome these challenges in the on-line estimation of system indices, we establish and exploit several interesting theoretical results on a particular class of fractional function optimization problems. Finally, we provide several numerical examples to highlight our contributions.
|
|
TuAT08 Regular Session, Tulum Ballroom H |
Add to My Program |
Alternating Direction Method of Multipliers |
|
|
Chair: Engelmann, Alexander | TU Dortmund University |
Co-Chair: Anderson, James | Columbia University |
|
10:00-10:20, Paper TuAT08.1 | Add to My Program |
Decentralized Non-Convex Optimization Via Bi-Level SQP and ADMM |
|
Stomberg, Gösta | TU Dortmund University |
Engelmann, Alexander | TU Dortmund University |
Faulwasser, Timm | TU Dortmund University |
Keywords: Optimization algorithms, Distributed control
Abstract: Decentralized non-convex optimization is important in many problems of practical relevance. Existing decentralized methods, however, typically either lack convergence guarantees for general non-convex problems, or they suffer from a high subproblem complexity. We present a novel bi-level SQP method, where the inner quadratic problems are solved via ADMM. A decentralized stopping criterion from inexact Newton methods allows the early termination of ADMM as an inner algorithm to improve computational efficiency. The method has local convergence guarantees for non-convex problems. Moreover, it only solves sequences of Quadratic Programs, whereas many existing algorithms solve sequences of Nonlinear Programs. The method shows competitive numerical performance for an optimal power flow problem.
|
|
10:20-10:40, Paper TuAT08.2 | Add to My Program |
Distributed Solution of Mixed-Integer Programs by ADMM with Closed Duality Gap |
|
Liu, Zonglin | University of Kassel |
Stursberg, Olaf | University of Kassel |
Keywords: Optimization algorithms, Machine learning, Distributed control
Abstract: This paper introduces a new method to efficiently solve distributed mixed-integer programs (MIP) as arising in problems of distributed machine learning or distributed control. The method is based on the alternating direction method of multipliers (ADMM), and it determines an exact penalty weight in order to close the duality gap of the MIP. The weight, which can be computed with low effort, allows one to formulate and solve a two-stage ADMM procedure for determining an optimum (or a sub-optimum) of the MIP. Numeric examples confirm the efficiency of the proposed method.
|
|
10:40-11:00, Paper TuAT08.3 | Add to My Program |
FedADMM: A Federated Primal-Dual Algorithm Allowing Partial Participation |
|
Wang, Han | Columbia University |
Marella, Siddartha | Columbia University |
Anderson, James | Columbia University |
Keywords: Optimization algorithms, Machine learning, Optimization
Abstract: Federated learning is a framework for distributed optimization that places emphasis on communication efficiency. In particular, it follows a client-server broadcast model and is particularly appealing because of its ability to accommodate heterogeneity in client compute and storage resources, non-i.i.d. data assumptions, and data privacy. Our contribution is to offer a new federated learning algorithm, FedADMM, for solving non-convex composite optimization problems with non-smooth regularizers. We prove converges of FedADMM for the case when not all clients are able to participate in a given communication round under a very general sampling model.
|
|
11:00-11:20, Paper TuAT08.4 | Add to My Program |
A Sensitivity Assisted Alternating Directions Method of Multipliers for Distributed Optimization |
|
Krishnamoorthy, Dinesh | TU Eindhoven |
Kungurtsev, Vyacheslav | Czech Technical University in Prague, |
Keywords: Optimization algorithms, Distributed control, Large-scale systems
Abstract: Alternating Directions Method of Multipliers (ADMM) is a form of decomposition-coordination method that typically requires several iterations/communication rounds between the subproblems and the master problem to converge. Repeatedly solving the subproblems over several iterations add to the total computation time. Noting that the subproblems solved from one iteration to the next differs only by a few variables, this paper proposes a novel sensitivity-assisted ADMM framework for nonlinear programming (NLP) problems, where the subproblems are cheaply approximated using the parametric sensitivities. By exploiting the parametric sensitivities, the computation of the subproblems can be reduced to a single linear solve instead of solving the full NLP problem, thereby reducing the overall computation cost. Different algorithmic variations are discussed and demonstrated using two numerical examples.
|
|
11:20-11:40, Paper TuAT08.5 | Add to My Program |
Distributed and Constrained H2 Control Design Via System Level Synthesis and Dual Consensus ADMM |
|
Grontas, Panagiotis D | Swiss Federal Institute of Technology (ETH) Zürich |
Fisher, Michael W | University of Waterloo |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Optimization algorithms, Distributed control
Abstract: Design of optimal distributed linear feedback controllers to achieve a desired aggregate behavior, while simultaneously satisfying state and input constraints, is a challenging but important problem in many applications. System level synthesis is a recent technique which has been used to reparametrize the optimal control problem as a convex program. Prior work on system level synthesis with state and input constraints has included closed-loop finite impulse response and locality constraints or, in the case where these constraints were lifted using a simple pole approximation, only a centralized design was considered. However, closed-loop finite impulse response and locality constraints cannot be satisfied in many applications. Furthermore, the centralized design using the simple pole approximation lacks robustness to communication failures and disturbances, has high computational cost and does not preserve data privacy of local controllers. The main contribution of this work is to develop a distributed solution to system level synthesis with the simple pole approximation in order to incorporate state and input constraints without closed-loop finite impulse response or locality constraints, and in a distributed implementation. To achieve this, it is first shown that the dual of this problem is a distributed consensus problem. Then, an algorithm is developed based on the alternating direction method of multipliers to solve the dual while recovering a primal solution, and a convergence certificate is provided. Finally, the method's performance is demonstrated on a test case of control design for distributed energy resources that collectively provide stability services to the power grid.
|
|
11:40-12:00, Paper TuAT08.6 | Add to My Program |
DTAC-ADMM: Delay-Tolerant Augmented Consensus ADMM-Based Algorithm for Distributed Resource Allocation |
|
Doostmohammadian, Mohammadreza | Aalto University |
Jiang, Wei | Aalto University, Finland |
Charalambous, Themistoklis | University of Cyprus |
Keywords: Optimization algorithms, Networked control systems, Energy systems
Abstract: Latency is inherent in almost all real-world networked applications. In this paper, we propose a distributed resource allocation strategy over multi-agent networks with delayed communications. The state of each agent (or node) represents its share of assigned resources out of a fixed amount (equal to the overall demand). Every node locally updates its state towards optimizing a global allocation cost function via received information of its neighboring nodes even when the data exchange over the network is heterogeneously delayed at different links. The update is based on the alternating direction method of multipliers (ADMM) formulation subject to both sum-preserving coupling-constraint and local box-constraints. The solution is derivative-free and holds for general (not necessarily differentiable) convex cost models. We use the notion of augmented consensus over undirected networks to model delayed information-exchange for convergence analysis. We simulate our delay-tolerant algorithm for optimal energy reservation-production scheduling.
|
|
TuAT09 Regular Session, Maya Ballroom I |
Add to My Program |
Consensus of Multi-Agent Systems |
|
|
Chair: Cristofaro, Andrea | Sapienza University of Rome |
Co-Chair: Aldana-López, Rodrigo | Universidad De Zaragoza |
|
10:00-10:20, Paper TuAT09.1 | Add to My Program |
Multiconsensus Control of Homogeneous LTI Hybrid Systems under Time-Driven Jumps |
|
Cristofaro, Andrea | Sapienza University of Rome |
Mattioni, Mattia | La Sapienza Università Di Roma |
Keywords: Linear systems, Agents-based systems, Decentralized control
Abstract: In this paper, we consider a network of homogeneous LTI hybrid dynamics under time-driven aperiodic jumps and exchanging information over a fixed communication graph. Based on the notion of almost equitable partitions, we explicitly characterize the clusters induced by the network over the nodes and, consequently, the corresponding multi-consensus trajectories. Then, we design a decentralized control ensuring convergence of all agents to the corresponding multi-consensus trajectory. Simulations over an academic example illustrate the results.
|
|
10:20-10:40, Paper TuAT09.2 | Add to My Program |
Strict Lyapunov Functions for Dynamic Consensus in Linear Systems Interconnected Over Directed Graphs |
|
Dutta, Maitreyee | IIT Bombay |
Panteley, Elena | CNRS |
Loria, Antonio | CNRS |
Sukumar, Srikant | IIT Bombay |
Keywords: Linear systems, Autonomous systems
Abstract: We study dynamic consensus for general networked (homogeneous) linear autonomous systems, that is, it is only assumed that they are stabilizable. Dynamic consensus pertains to a general form of consensus in which, as a result of the systems' interactions, they exhibit a rich collective dynamic behavior. This generalizes the classical consensus paradigm in which case all systems stabilize to a common equilibrium point. Our main statements apply to systems interconnected over generic directed connected graphs and, most significantly, the proofs are constructive. Indeed, even though our controllers are reminiscent of others previously used in the literature, to the best of our knowledge, we provide for the first time in the literature strict Lyapunov functions for fully distributed consensus over generic directed graphs.
|
|
10:40-11:00, Paper TuAT09.3 | Add to My Program |
Consensus Error Performance of Linear Multi-Agent Systems |
|
Ding, Yanling | City University of Hong Kong |
Peng, Hui | Guangdong University of Technology, School of Automation |
Chen, Jie | City University of Hong Kong |
Keywords: Linear systems, Delay systems, Agents-based systems
Abstract: In this paper, we investigate the consensus problem of linear time-invariant systems with respect to stochastic noises and deterministic disturbances. We consider first-order dynamic agents interconnected by a directed communication graph with time delay, and we seek to determine the best achievable performance that measures the error in achieving consensus under the influence of noises and disturbances. Our results consist of analytical expressions of the minimal error achievable, measured under both mathcal{H}_2 and mathcal{H}_infty criteria. The expressions show explicitly how agent and network dynamics may interact to cast fundamentally dispersion effects on the consensus among the agents.
|
|
11:00-11:20, Paper TuAT09.4 | Add to My Program |
Dynamic Consensus with Prescribed Convergence Time for Multi-Leader Formation Tracking |
|
Aldana-López, Rodrigo | Universidad De Zaragoza |
Gomez-Gutierrez, David | Intel Labs |
Aragues, Rosario | Universidad De Zaragoza |
Sagues, Carlos | Universidad De Zaragoza |
Keywords: Decentralized control, Distributed control, Robotics
Abstract: This work addresses the problem of distributed formation tracking for a group of follower holonomic mobile robots around a reference signal. The reference signal is comprised of the geometric center of the positions of multiple leaders. This work's main contribution is a novel Modulated Distributed Virtual Observer (MDVO) for the reference signal. Moreover, the proposed MDVO is based on an exact dynamic consensus algorithm with a prescribed convergence time. In addition, we provide simulation examples showcasing two different application scenarios for the proposal.
|
|
11:20-11:40, Paper TuAT09.5 | Add to My Program |
A Novel Protocol with Pure Relative Output Information for Consensus of Linear Multi-Agent Systems |
|
Ma, YuWen | Department of Automation, Shanghai Jiao Tong University |
Li, Xianwei | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Keywords: Cooperative control, Distributed control, Networked control systems
Abstract: This paper deals with the output-feedback consensus problem for homogeneous linear multi-input multi-output multi-agent systems on directed graphs. A novel output-feedback protocol is proposed. A main merit of the protocol is that it only requires relative output measurement of neighboring agents, but does not need to exchange protocol state information through communication. Necessary and sufficient conditions are proposed for the existence of the protocol, which are shown to be feasible if the graph has a spanning tree and some easy-to-verify assumptions are imposed on agent dynamics. Consensus under the designed protocol is proved and consensus states are explicitly derived. Different from most of the existing related results, the proposed protocol still meets the separation principle-like property and thus the protocol gains are easy to compute. A numerical example is finally provided to demonstrate the effectiveness of the proposed protocol.
|
|
11:40-12:00, Paper TuAT09.6 | Add to My Program |
Matrix-Scaled Consensus |
|
Trinh, Hoang Minh | Hanoi University of Science and Technology (HUST) |
Vu, Van Dung | Viettel Group |
Tran, Quoc Van | KAIST; Hanoi Univ. of Sci & Tech (HUST) |
Ahn, Hyo-Sung | Gwangju Institute of Science and Technology (GIST) |
Keywords: Agents-based systems, Networked control systems, Cooperative control
Abstract: This paper proposes matrix-scaled consensus algorithm, which generalizes the scaled consensus algorithm in [1]. In (scalar) scaled consensus algorithms, the agents' states do not converge to a common value, but to different points along a straight line in the state space, which depends on the scaling factors and the initial states of the agents. In the matrix-scaled consensus algorithm, a positive/negative definite matrix weight is assigned to each agent. Each agent updates its state based on the product of the sum of relative matrix scaled states and the sign of the matrix weight. Under the proposed algorithm, each agent asymptotically converges to a final point differing with a common consensus point by the inverse of its own scaling matrix. Thus, the final states of the agents are not restricted to a straight line but can be extended to almost the whole state-space. Convergence analysis of matrix-scaled consensus for single and double-integrator agents are studied in detail. Simulation results are given to support the analysis.
|
|
TuAT10 Regular Session, Maya Ballroom II |
Add to My Program |
Stochastic Systems I |
|
|
Chair: Lanzetti, Nicolas | ETH Zürich |
Co-Chair: Gharesifard, Bahman | University of California, Los Angeles |
|
10:00-10:20, Paper TuAT10.1 | Add to My Program |
Computable Convergence Rate Bound for Ratio Consensus Algorithms |
|
Gerencsér, Balázs | Alfréd Rényi Institute of Mathematics |
Keywords: Stochastic systems, Agents-based systems, Randomized algorithms
Abstract: The objective of the paper is to establish a computable upper bound for the almost sure convergence rate for a class of ratio consensus algorithms defined via column-stochastic matrices. Our result extends the works of Iutzeler et al. from 2013 on similar bounds that have been obtained in a more restrictive setup with limited conclusions. The present paper complements the results by Gerencsér and Gerencsér from 2022, identifying the exact almost sure convergence rate of a wide class of ratio consensus algorithms in terms of a spectral gap, which is, however, not computable in general. The upper bound provided in the paper will be compared to the actual rate of almost sure convergence experimentally on a range of modulated random geographic graphs with random local interactions.
|
|
10:20-10:40, Paper TuAT10.2 | Add to My Program |
Identifiability and Estimation of Partially-Observed Influence Models |
|
Zhao, Lu | University of Texas at Arlington |
Wan, Yan | University of Texas at Arlington |
Keywords: Stochastic systems, Estimation, Identification
Abstract: The influence model (IM) is a discrete-time stochastic model that captures the spatiotemporal dynamics of networked Markov chains. Partially-observed IM (POIM) is an IM in which the statuses for some sites are unobserved. Identifiability and estimation of POIMs from incomplete state information are critical for POIM applications. In this paper, we develop a new estimation algorithm for both homogeneous and heterogeneous POIMs. The method, called EM-JMPE, integrates expectation maximization (EM) and joint-margin probability estimation (JMPE) to achieve reduced computation. In addition, we study the identifiability of POIMs by exploring the reduced-size joint-margin matrix, based on which necessary conditions for both homogeneous and heterogeneous POIMs are provided. The simulation studies verify the developed results.
|
|
10:40-11:00, Paper TuAT10.3 | Add to My Program |
Modeling of Political Systems Using Wasserstein Gradient Flows |
|
Lanzetti, Nicolas | ETH Zürich |
Hajar, Joudi | ETH Zürich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Stochastic systems, Game theory, Variational methods
Abstract: The study of complex political phenomena such as parties' polarization calls for mathematical models of political systems. In this paper, we aim at modeling the time evolution of a political system whereby various parties selfishly interact to maximize their political success (e.g., number of votes). More specifically, we identify the ideology of a party as a probability distribution over a one-dimensional real-valued ideology space, and we formulate a gradient flow in the probability space (also called a Wasserstein gradient flow) to study its temporal evolution. We characterize the equilibria of the arising dynamic system, and establish local convergence under mild assumptions. We calibrate and validate our model with real-world time-series data of the time evolution of the ideologies of the Republican and Democratic parties in the US Congress. Our framework allows to rigorously reason about various political effects such as parties' polarization and homogeneity. Among others, our mechanistic model can explain why political parties become more polarized and less inclusive with time (their distributions get "tighter"), until all candidates in a party converge asymptotically to the same ideological position.
|
|
11:00-11:20, Paper TuAT10.4 | Add to My Program |
Privacy-Preserving POMDP Planning Via Belief Manipulation |
|
Zheng, Wei | University of Notre Dame |
Jung, Taeho | University of Notre Dame |
Lin, Hai | University of Notre Dame |
Keywords: Stochastic systems, Markov processes, Constrained control
Abstract: The privacy issue has become one of the most critical concerns in cyber-physical systems (CPSs) as CPSs are vulnerable to information leakage. In particular, a passive intruder could infer the secret information of the system through observations, and the system may be critically compromised or damaged once the intruder has high confidence on certain secret states. In this paper, we investigate the planning problem of a stochastic system in the presence of a passive eavesdropping intruder. In this system, the planner is modeled as a Markov decision process (MDP) who can access the state information and control the system transition. The intruder, who has a partial observation of the system state, is modeled as a hidden Markov model. The goal of the intruder is to infer the secrets of the system in terms of whether the current system is in some sensitive states, and the goal of the defender is to maximize the reward while preventing the intruder from inferring the secret. Distinct from existing work that embedded privacy as a part of the reward or utility function, we quantify privacy as a constraint for the planning. The problem is formulated as a constrained partially observable MDP (POMDP) planning problem and a belief state partition approach is proposed to solve the privacy-preserving planning problem via value iteration. Our observation is that the defender could prevent the intruder from inferring sensitive information via belief manipulation. However, the introduction of the privacy concern may sacrifice the system performance or even cause the problem to be infeasible. A necessary and sufficient condition is given to check the feasibility of the planning problem and examples are shown to illustrate our proposed algorithm.
|
|
11:20-11:40, Paper TuAT10.5 | Add to My Program |
Consensus Using a Network of Finite Memory Polya Urns |
|
Singh, Somya | Queen's University, Kingston |
Alajaji, Fady | Queen's University |
Gharesifard, Bahman | University of California, Los Angeles |
Keywords: Stochastic systems, Markov processes, Delay systems
Abstract: We introduce a finite memory interacting Polya urn process over a connected network which models consensus dynamics for interacting individuals. More specifically, each urn (individual) in the network is initially equipped with some red and black balls, with the fraction corresponding to the individual's opinion (or belief) on a certain color. At each time instant and for each urn, a ball is drawn from a ``super-urn", which consists of all balls present in that urn and its neighboring urns; then reinforcing balls of the color just drawn are added to the urn for a limited period of M future time instants, where M denotes the memory parameter. Additionally, and important for our objective, as of time t=M+1, we remove the balls which were present in the urns initially. By examining the structure of the resulting underlying reducible Markov process, we show that individuals eventually reach consensus in the sense that they all achieve identical probabilities of drawing a red ball. Moreover, when the network has homogeneous reinforcement parameters, we construct a class of linear dynamical systems with time delay whose trajectory gives the probability of drawing a red ball for each node i at a time instant t. We examine the asymptotic behavior of such a network and exactly determine its consensus value. Our simulation confirms our theoretical findings by demonstrating the asymptotic behavior of draw variables of the network in some case studies.
|
|
11:40-12:00, Paper TuAT10.6 | Add to My Program |
Saturated Total-Population Dependent Branching Process and Viral Markets |
|
Agarwal, Khushboo | IIT Bombay, India |
Veeraruna, Kavitha | IIT Bombay, India |
Keywords: Stochastic systems, Markov processes, Modeling
Abstract: Interesting posts are continually forwarded by the users of the online social network (OSN). Such propagation leads to re-forwarding of the post to some of the previous recipients, which increases as the post reaches a large number of users. Consequently, the effective forwards (after deleting the re-forwards) reduce, eventually leading to the saturation of the total number of copies. We model this process as a new variant of the branching process, the `saturated total-population-dependent branching process', and analyse it using the stochastic approximation technique. Notably, we obtain deterministic trajectories which approximate the total and unread copies of the post `asymptotically and almost surely' over any finite time window; this trajectory depends only on four parameters related to the network characteristics. Further, we provide expressions for the peak unread copies, maximum outreach and the life span of the post. We observe known exponential growth but with time-varying rates. We also validate our theory through detailed simulations on the SNAP Twitter dataset.
|
|
TuAT11 Regular Session, Maya Ballroom III |
Add to My Program |
Distributed Parameter Systems I |
|
|
Chair: Morgansen, Kristi A. | University of Washington |
Co-Chair: Mora, Luis | University of Waterloo |
|
10:00-10:20, Paper TuAT11.1 | Add to My Program |
Sensor Placement on a Cantilever Beam Using Observability Gramians |
|
Brace, Natalie | University of Washington |
Andrews, Nicholas | University of Washington |
Upsal, Jeremy | University of Washington |
Morgansen, Kristi A. | University of Washington |
Keywords: Distributed parameter systems, Flexible structures, Kalman filtering
Abstract: Working from an observability characterization based on output energy sensitivity to changes in initial conditions, we derive both analytical and empirical observability Gramian tools for a class of continuum material systems. Using these results, optimal sensor placement is calculated for an Euler-Bernoulli cantilever beam for the following cases: analytical observability for the continuum system and analytical observability for a finite number of modes. Error covariance of an Unscented Kalman Filter is determined for both cases and compared to randomly placed sensors to demonstrate effectiveness of the techniques.
|
|
10:20-10:40, Paper TuAT11.2 | Add to My Program |
Nonlinear Perturbation of a Class of Conservative Linear System |
|
Singh, Shantanu | Tel Aviv University |
Weiss, George | Tel Aviv University |
Keywords: Distributed parameter systems, Nonlinear systems
Abstract: In this article we show the existence and uniqueness of classical and generalized solutions of a class of nonlinear infinite dimensional systems. Such systems are obtained by modifying the second order differential equation that is part of the description of conservative linear systems "out of thin air" introduced by M. Tucsnak and G. Weiss in 2003. The modified system contains a new nonlinear damping term, that is maximal monotone and possibly set-valued and hence state trajectories obey a differential inclusion. We show that this new class of nonlinear infinite dimensional systems is incrementally scattering passive (hence well-posed). The proof is based on the Crandall-Pazy theorem which shows that the Lax-Phillips type nonlinear semigroup (that represents the entire system) is a contraction.
|
|
10:40-11:00, Paper TuAT11.3 | Add to My Program |
Input-Output Stability of a Reaction Diffusion Equation with In-Domain Disturbances |
|
Shreim, Suha | Université Grenoble Alpes, Grenoble INP, Gipsa-Lab |
Ferrante, Francesco | Universita Degli Studi Di Perugia |
Prieur, Christophe | CNRS |
Keywords: Distributed parameter systems, Output regulation, Lyapunov methods
Abstract: The input-output stability (IOS) of a reaction-diffusion equation by means of a finite-dimensional linear time-invariant control system is studied. The reaction-diffusion plant admits a finite number of unstable poles and is open-loop unstable. The infinite-dimensional plant is put in feedback with a dynamic controller to achieve output stability via a Dirichlet boundary measurement and regulated output. The control design problem consists of deriving sufficient conditions in the form of matrix inequalities which allows us to show that the order of the finite-dimensional controller can be selected large enough to achieve IOS even when the control design is not optimal.
|
|
11:00-11:20, Paper TuAT11.4 | Add to My Program |
Exponential Decay Rate Bound of One-Dimensional Distributed Port-Hamiltonian Systems with Boundary Dissipation |
|
Mora, Luis | University of Waterloo |
Morris, Kirsten | University of Waterloo |
Keywords: Stability of linear systems, Distributed parameter systems, Linear systems
Abstract: Distributed port-Hamiltonian systems with boundary damping and possible internal dissipation are considered. The multiplier method is used to show exponential decay Me^{-alpha t} with an expression for M and alpha in terms of the system parameters. The exponential stability of port-Hamiltonian systems has been studied in the literature, but previous results did not provide an explicit bound on the decay rate. This result is illustrated by the boundary stabilization of a Timoshenko beam.
|
|
11:20-11:40, Paper TuAT11.5 | Add to My Program |
Event-Based Boundary Control of the Stefan Problem: A Dynamic Triggering Approach |
|
Rathnayake, Bhathiya | Student (University of California, San Diego) |
Diagne, Mamadou | University of California San Diego |
Keywords: Distributed parameter systems, Sampled-data control, Lyapunov methods
Abstract: The Stefan problem models a liquid-solid phase change phenomena as time evolution of a temperature profile in a liquid-solid material and its moving interface. This paper develops an event-triggered boundary control strategy for the one-phase Stefan problem. The proposed method consists of a full-state feedback backstepping boundary control law developed to drive the liquid-solid interface position to a desired setpoint and a dynamic event triggering mechanism which determines the time instants at which the control input requires to be updated. The existence of a minimal-dwell time between two consecutive events is proved, which eliminates the occurrence of the so-called textit{Zeno phenomenon} in the closed-loop system. The control input is updated at event times and applied in a textit{Zero-Order-Hold (ZOH)} fashion. The well-posedness of the closed-loop system along with certain model validity conditions is proved. Furthermore, using the Lyapunov approach, it is shown that the proposed control approach globally exponentially converges the temperature profile to the melting temperature of the material in L_2-norm and the moving interface to the desired setpoint. Finally, a simulation example is provided to validate the theoretical developments.
|
|
11:40-12:00, Paper TuAT11.6 | Add to My Program |
Stabilisation of Unstable Distributed Port-Hamiltonian Systems in Scattering Form |
|
Macchelli, Alessandro | University of Bologna - Italy |
Le Gorrec, Yann | Ensmm, Femto-St / As2m |
Ramirez, Hector | Universidad Federico Santa Maria |
Keywords: Distributed parameter systems, Stability of linear systems, Lyapunov methods
Abstract: In this paper, we consider the exponential stabilisation of a distributed parameter port-Hamiltonian system interconnected with an unstable finite-dimensional linear system at its free end and control input at the opposite one. The infinite-dimensional system can also have in-domain anti- damping. The control design passes through the definition of a finite-dimensional linear system that “embeds” the response of the distributed parameter model, and that can be stabilised by acting on the available control input. The conditions that link the exponential stability of the latter system with the exponential stability of the original one are obtained thanks to a Lyapunov analysis. Simulations are presented to show the pros and cons of the proposed synthesis methodology.
|
|
TuAT12 Invited Session, Maya Ballroom IV |
Add to My Program |
Event-Triggered Control |
|
|
Chair: Johansson, Karl H. | Royal Institute of Technology |
Co-Chair: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Nowzari, Cameron | George Mason University |
Organizer: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
Organizer: Hirche, Sandra | Technische Universität München |
|
10:00-10:20, Paper TuAT12.1 | Add to My Program |
Event-Triggered Consensus for Second-Order Systems: A Hybrid Systems Perspective (I) |
|
Zegers, Federico | Johns Hopkins University Applied Physics Laboratory |
Guralnik, Dan | University of Florida |
Edwards, Sage | University of Florida |
Lee, Chia-Ling | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Hybrid systems, Networked control systems, Distributed control
Abstract: This paper explores the consensus problem for a homogeneous multi-agent system composed of agents with second-order dynamics. To promote scalability and efficient resource utilization, the agents communicate intermittently and asynchronously using an event-triggered control strategy. The approximate consensus problem is recast as a set stabilization problem, and sufficient conditions for stability are obtained through a Lyapunov-based analysis. Unlike previous results that only provide a stability analysis, we formulate a well-posed hybrid system that admits non-Zeno complete maximal solutions, renders the approximate consensus attractor robust to vanishing perturbations, employs triggers that have a positive minimum inter-event time, and achieves approximate consensus. Moreover, if the drift matrix is invertible and only has eigenvalues with non-positive real parts, then the average of the agent states is bounded and solutions of the hybrid system are bounded. In addition, the average consensus problem may be handled as a special case of the proposed formulation. Specifically, it is shown that, if the initial velocity of all agents is zero, then the MAS converges to the average of the initial conditions. Simulation results are provided to validate the development.
|
|
10:20-10:40, Paper TuAT12.2 | Add to My Program |
An Energy Function-Based and Norm-Free Event-Triggering Approach to Schedule Control Data Transmissions in State Feedback Control (I) |
|
Kurtoglu, Deniz | University of South Florida |
Yucelen, Tansel | University of South Florida |
Muse, Jonathan | Wright Patterson Air Force Base |
Keywords: Networked control systems, Lyapunov methods
Abstract: In this paper, we propose a new event-triggering approach to schedule control data transmissions in state feedback control of linear time-invariant dynamical systems. Specifically, an energy function-based and norm-free event-triggering condition is presented, where the embedded processor broadcasts a sampled data of its control signal value through a zero-order-hold operator to the dynamical system when the left side of the event-triggering condition equals to its right side. Here, the energy function-based feature implies that the right side of this event-triggering condition involves an energy function as well as its time-derivative for making the selection of its right side user-adjustable. Moreover, the norm-free feature implies that the left side of this event-triggering condition does not depend on signal norms to yield better control data transmission reduction. We also present illustrative numerical examples in order to demonstrate the efficacy of the proposed event-triggering approach in scheduling state feedback control data transmissions.
|
|
10:40-11:00, Paper TuAT12.3 | Add to My Program |
Analysis of Time versus Event-Triggered Consensus for a Single-Integrator Multi-Agent System (I) |
|
Meister, David | University of Stuttgart |
Aurzada, Frank | Technical University of Darmstadt |
Lifshits, Mikhail A. | St. Petersburg State University |
Allgöwer, Frank | University of Stuttgart |
Keywords: Agents-based systems, Networked control systems, Control of networks
Abstract: It is well known that the employed triggering scheme has great impact on the control performance when control loops operate under scarce communication resources. Various practical and simulative works have demonstrated the potential of event-triggered control to reduce communication while providing a similar performance level when compared to time-triggered control. For non-cooperative networked control systems, analytical performance comparisons of time- and event-triggered control support this finding under certain assumptions. While being well-studied in the non-cooperative setting, it remains unclear if and how the performance relationship of the triggering schemes is altered in a multi-agent system setup. To close this gap, in this paper, we consider a homogeneous single-integrator multi-agent consensus problem for which we compare the performance of time- and event-triggered control schemes analytically. Under the assumption of equal average triggering rates, we use the long-term average of the quadratic deviation from consensus as a performance measure to contrast the triggering schemes. Contrary to the non-cooperative setting, we prove that event-triggered control performs worse than time-triggered control beyond a certain number of agents in this setup. In addition, we derive the asymptotic order of the performance measure as a function of the number of agents under both triggering schemes.
|
|
11:00-11:20, Paper TuAT12.4 | Add to My Program |
Event-Triggered Prediction-Based Delay Compensation Approach |
|
Mazenc, Frederic | Inria Saclay |
Malisoff, Michael | Louisiana State University |
Barbalata, Corina | Louisiana State University |
Keywords: Stability of linear systems, Robust control
Abstract: We provide a new event-triggered delay compensation approach for linear systems with arbitrarily long constant input delays. Our prediction map is expressible as a solution of a discrete time system. Our method ensures input-to-state stability. We also provide an analog under measurement delays, where the prediction map is expressible as a solution of a continuous-discrete system. Significant novel features are our combined use of matrices of absolute values and our prediction based event triggers, instead of Euclidean norms, and the fact that the predictor dynamics always has the same dimension as that of the original system. Our marine robotic example illustrates an advantage of using our new methods.
|
|
11:20-11:40, Paper TuAT12.5 | Add to My Program |
Stability and Safety through Event-Triggered Intermittent Control with Application to Spacecraft Orbit Stabilization |
|
Ong, Pio | California Institute of Technology |
Bahati, Gilbert | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Discrete event systems, Stability of nonlinear systems, Stability of hybrid systems
Abstract: In systems where the ability to actuate is a scarce resource, e.g., spacecrafts, it is desirable to only apply a given controller in an intermittent manner---with periods where the controller is on and periods where it is off. Motivated by the event-triggered control paradigm, where state-dependent triggers are utilized in a sample-and-hold context, we generalize this concept to include state triggers where the controller is off thereby creating a framework for intermittent control. Our approach utilizes certificates---either Lyapunov or barrier functions---to design intermittent trigger laws that guarantee stability or safety; the controller is turned on for the period for which is beneficial with regard to the certificate, and turned off until a performance threshold is reached. The main result of this paper is that the intermittent controller scheme guarantees (set) stability when Lyapunov functions are utilized, and safety (forward set invariance) in the setting of barrier functions. As a result, our trigger designs can leverage the intermittent nature of the actuator, and at the same time, achieve the task of stabilization or safety. We further demonstrate the application and benefits of intermittent control in the context of the spacecraft orbit stabilization problem.
|
|
11:40-12:00, Paper TuAT12.6 | Add to My Program |
Event-Triggered Saturating Control for Practical Synchronization of Lur’e Systems |
|
Lisbôa, Cristyan | UFRGS |
Flores, Jeferson Vieira | UFRGS |
Moreira, Luciano Gonçalves | IFSUL |
Gomes da Silva Jr, Joao Manoel | Universidade Federal Do Rio Grande Do Sul (UFRGS) |
Keywords: Networked control systems, Constrained control, Nonlinear systems
Abstract: This paper addresses the practical master-slave synchronization of nonlinear discrete-time Lur’e systems subject to saturation using event-triggered control. Considering an emulation approach, the aim is to design an event generator that features a relaxed triggering criterion to reduce the number of control updates. Constructive conditions for generic slope-restricted nonlinearities are derived to ensure the regional practical stabilization of the synchronization error, i.e., to guarantee that it converges to a specified vicinity of the origin. In order to tune the event generator parameters, an optimization problem is proposed aiming at reducing the number of events when compared to triggering functions without relaxations. An illustrative example shows the advantages of our approach.
|
|
TuAT13 Regular Session, Maya Ballroom V |
Add to My Program |
Predictive Control for Linear Systems I |
|
|
Chair: Lazar, Mircea | Eindhoven University of Technology |
Co-Chair: Schulze Darup, Moritz | TU Dortmund University |
|
10:00-10:20, Paper TuAT13.1 | Add to My Program |
Recursive Data-Driven Predictive Control with Persistence of Excitation Conditions |
|
Verheijen, Peter | Eindhoven University of Technology |
Gonçalves da Silva, Gustavo R. | Eindhoven University of Technology |
Lazar, Mircea | Eindhoven University of Technology |
Keywords: Predictive control for linear systems, Closed-loop identification, Adaptive control
Abstract: In this paper we develop a recursive linear predictive control algorithm with integral action and plug-and-play capabilities. Typically, adaptive model predictive control requires a recursive estimation step for updating the prediction model and then builds prediction matrices on-line. In contrast to this approach, we develop a least-squares algorithm for recursively estimating the prediction matrices directly. We then exploit an analytic relation between standard and integral prediction matrices to recursively estimate the latter. Furthermore, to assess the convergence of the closed-loop estimation, we discuss various methods that generate a persistently exciting input. The efficiency of the recursive integral predictive controller is demonstrated on a motion control application.
|
|
10:20-10:40, Paper TuAT13.2 | Add to My Program |
On Data Reutilization for Historian Based Predictive Control |
|
Maestre, Jose Maria (Pepe) | University of Seville |
Masero, Eva | University of Seville |
Salvador, José R. | Universidad Loyola Andalucía |
Ramirez, Daniel R. | Univ. of Sevilla |
Zhu, Quanyan | New York University |
Keywords: Predictive control for linear systems, Emerging control applications, Machine learning
Abstract: This paper presents a robust finite-horizon control scheme based on data that produces feasible control sequences. The scheme makes use of a database that includes information from prior experiences of the same and others controllers handling similar systems. By the convex combination of feasible histories plus an auxiliary control law that deals with uncertainties, this scheme can be used as a robust historian-based predictive controller. Further application could include a cooperative learning-based strategy in which multiple controllers share their previous executions to gain collective benefits in terms of performance. The validity of the proposed controller is tested in a simulated case study.
|
|
10:40-11:00, Paper TuAT13.3 | Add to My Program |
Linear-Quadratic Gaussian Control with Time-Varying Disturbance Forecast |
|
Cheng, Jiangnan | Cornell University |
Tang, Kevin | Cornell University |
Keywords: Predictive control for linear systems, Linear systems
Abstract: Linear-quadratic-Gaussian (LQG) control is a classical optimal control problem where the disturbance in the system dynamics is traditionally treated as random noise. Motivated by the possibility of forecasting future disturbance in some relevant works for linear-quadratic regulator (LQR) systems where the disturbance distribution is arbitrary, we introduce a time-varying disturbance forecast model in the LQG problem. Our model characterizes the Gaussianity of the disturbances and thus enables us to give theoretical results including optimal average cost even though the forecast error can be unbounded. Numerical examples are provided to illustrate the theoretical results.
|
|
11:00-11:20, Paper TuAT13.4 | Add to My Program |
Fast Stochastic MPC Implementation Via Policy Learning |
|
Mammarella, Martina | CNR-IEIIT |
Altamimi, Abdulelah | Pennsylvania State University |
Chamanbaz, Mohammadreza | The University of Sydney |
Dabbene, Fabrizio | CNR-IEIIT |
Lagoa, Constantino M. | Pennsylvania State Univ |
Keywords: Predictive control for linear systems, Neural networks, Randomized algorithms
Abstract: Stochastic Model Predictive Control (MPC) gained popularity thanks to its capability of overcoming the conservativeness of robust approaches, at the expense of a higher computational demand. This represents a critical issue especially for sampling-based methods. In this work we propose a policy learning MPC approach, which aims at reducing the cost of solving stochastic optimization problems. The presented scheme relies upon the use of neural networks for identifying a mapping between the current state of the system and the probabilistic constraints. This allows to reduce the sample complexity to be less than or equal to the dimension of the decision variable, significantly scaling down the computational burden of stochastic MPC approaches, while preserving the same probabilistic guarantees. The efficacy of the proposed policy-learning MPC is proved by means of a numerical example.
|
|
11:20-11:40, Paper TuAT13.5 | Add to My Program |
State Space Models vs. Multi-Step Predictors in Predictive Control: Are State Space Models Complicating Safe Data-Driven Designs? |
|
Köhler, Johannes | ETH Zurich |
Wabersich, Kim Peter | ETH Zurich |
Berberich, Julian | University of Stuttgart |
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Predictive control for linear systems, Identification for control
Abstract: This paper contrasts recursive state space models and direct multi-step predictors for linear predictive control. We provide a tutorial exposition for both model structures to solve the following problems: 1. stochastic optimal control; 2. system identification; 3. stochastic optimal control based on the estimated model. Throughout the paper, we provide detailed discussions of the benefits and limitations of these two model parametrizations for predictive control and highlight the relation to existing works. Additionally, we derive a novel (partially tight) constraint tightening for stochastic predictive control with parametric uncertainty in the multi-step predictor.
|
|
11:40-12:00, Paper TuAT13.6 | Add to My Program |
On Explicit Data-Driven (M)PC |
|
Klädtke, Manuel | TU Dortmund University |
Teichrib, Dieter | TU Dortmund University |
Schlüter, Nils | TU Dortmund University |
Schulze Darup, Moritz | TU Dortmund University |
Keywords: Predictive control for linear systems, Optimal control, Machine learning
Abstract: We show that the explicit realization of data-driven predictive control (DPC) for linear deterministic systems is more tractable than previously thought. To this end, we compare the optimal control problems (OCP) corresponding to deterministic DPC and classical model predictive control (MPC), specify its close relation, and systematically eliminate ambiguity inherent in DPC. As a central result, we find that the explicit solutions to these types of DPC and MPC are of exactly the same complexity. We illustrate our results with two numerical examples highlighting features of our approach.
|
|
TuAT14 Regular Session, Maya Ballroom VI |
Add to My Program |
Control Applications I |
|
|
Chair: Karlsson, Niklas | Amazon |
Co-Chair: Gutiérrez-Oribio, Diego | École Centrale De Nantes |
|
10:00-10:20, Paper TuAT14.1 | Add to My Program |
Feedback Control-Based Multiobjective Optimization in Programmatic Advertising Involving a Cost Per Bid Constraint |
|
Karlsson, Niklas | Amazon |
Keywords: Emerging control applications, Information technology systems, Optimization
Abstract: Online advertising is mostly served through real-time bidding, and advertising campaigns are often defined as optimization problems. Historically, these problems consider only the advertiser return on investment subject to constraints specified by the advertiser. The operational cost of the optimization platform is typically not accounted for explicitly. This paper addresses this limitation of previous work by including a cost per bid constraint to an otherwise advertiser-centric optimization problem. Maintaining the constraint prevents the operational cost of the platform to exceed what the platform provider charges the advertiser for the service. The optimal bidding mechanism is derived and it is shown how the solution can be implemented as four separate subsystems. Feedback control plays an important role to make this optimization scalable and adaptive. A proof of concept campaign control system is proposed and evaluated in simulations.
|
|
10:20-10:40, Paper TuAT14.2 | Add to My Program |
Global Monotonic Radio-Frequency Impedance Matching Via Control Lyapunov Function under Safety Constraints |
|
Rodríguez, Carlos | CICESE |
Viola, Jairo | University of California, Merced |
Alvarez, Joaquin | CICESE |
Chen, YangQuan | University of California, Merced |
Keywords: Emerging control applications, Lyapunov methods, Nonlinear output feedback
Abstract: One of the critical technologies in plasma etching in semiconductor wafers is radio frequency impedance matching control. An impedance matching network is essential to ensure the maximum power transfer in any system, and its application is ubiquitous. Many researches have been proposed control strategies to get a robust performances, reduce convergence speed, and more accurate. However, these proposal present a non-monotonic decreasing behavior of the reflected power, thus affecting the overall matching control performance. In this paper, we propose a controller technique, based on a Control Lyapunov Function (CLF), which ensures global asymptotic stability of the system. This takes place under a safe space of capacitor values defined by a control barrier function (CBF). Numerical simulations on a benchmark setup show the novelty of the proposed control scheme. Concerning others in the literature, it presents lower convergence time and robustness under several non-ideal conditions. Also, our study includes analysis of other indexes like the mean and maximum value of the reflection coefficient integral and reflected power.
|
|
10:40-11:00, Paper TuAT14.3 | Add to My Program |
Distributed Finite Time K-Means Clustering with Quantized Communucation and Transmission Stopping |
|
Rikos, Apostolos I. | KTH Royal Institute of Technology |
Oliva, Gabriele | University Campus Bio-Medico of Rome |
Hadjicostis, Christoforos N. | University of Cyprus |
Johansson, Karl H. | Royal Institute of Technology |
Keywords: Emerging control applications, Quantized systems, Large-scale systems
Abstract: In this paper, we present a distributed version of the k-means algorithm for multi-agent systems with directed communication links. The goal of k-means is to partition the network's agents in mutually exclusive sets (groups) such that agents in the same set have (and possibly share) similar information and are able to calculate a representative value for their group. Our distributed algorithm allows each node to transmit quantized values in an event-driven fashion, and exhibits distributed stopping capabilities. Transmitting quantized values leads to more efficient usage of the available bandwidth and reduces the communication bottleneck, whereas distributed stopping preserves available resources. We characterize the properties of the proposed distributed algorithm and show that its execution (on any static and strongly connected digraph) will partition all agents in mutually exclusive clusters in finite time. We conclude with examples that illustrate the operation, performance, and potential advantages of the proposed algorithm.
|
|
11:00-11:20, Paper TuAT14.4 | Add to My Program |
Robust Stabilization of Furuta's Pendulum Based on Continuous High Order Sliding Mode Controllers |
|
Ortega Perez, Jose Antonio | Facultad De Ingenieria, UNAM |
Gutiérrez-Oribio, Diego | École Centrale De Nantes |
Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
Moreno Pérez, Jaime Alberto | UNAM |
Keywords: Variable-structure/sliding-mode control
Abstract: In this manuscript a robust stabilization controller for the Furuta's pendulum in presence of parametric uncertainties is proposed. We used a normal form transformation and partial feedback linearisation to tackle the problem with a cascade system approach, then a continuous sliding mode algorithm stabilize the cascade system. A closed-loop stability analysis is presented to proof asymptotic stability of the Furuta's pendulum equilibrium point. Also, experiments over a real Furuta's pendulum are presented to corroborate the theoretical results shown.
|
|
11:20-11:40, Paper TuAT14.5 | Add to My Program |
The Value of Pooling in Last-Mile Delivery |
|
Shetty, Akhil | University of California, Berkeley |
Qin, Junjie | Purdue University |
Poolla, Kameshwar | Univ. of California at Berkeley |
Varaiya, Pravin | Univ. of California at Berkeley |
Keywords: Transportation networks, Emerging control applications
Abstract: Last-mile delivery services have become ubiquitous in the recent past. Delivery services for food (eg., DoorDash, Grubhub, Uber Eats) and groceries (eg., Instacart, Cornershop) earned a combined revenue of 25B in 2020, and are expected to exceed 72B in revenues by 2025. The COVID-19 pandemic accelerated the growth of such services by making their value proposition even more attractive. The lower risk of contact coupled with the convenience of ordering from the comfort of their homes led to widespread customer adoption. Even so, most last-mile delivery services are not profitable. The high cost of delivery is cited as the major cause of losses. Thus, analyzing the factors influencing delivery costs is crucial for understanding the long-term viability of these services. The pooling of orders is a critical source of efficiency in last-mile delivery. We propose a queuing-based spatial model for the delivery process to analyze the value created by pooling. We demonstrate how the trade-off between delivery times and the cost of delivery, mediated by the extent of pooling, dictates which services will be economically viable. Our simulation study of a typical grocery delivery service in Los Angeles, California suggests that delivery times of less than 1 hour are unprofitable for most regions in the US. We find that driver wages account for 90% of the delivery cost. We also discuss the potential impact of technological innovations such as automated delivery and labor regulations on the profitability of last-mile delivery services.
|
|
11:40-12:00, Paper TuAT14.6 | Add to My Program |
Model Predictive Control for Price-Based Demand-Responsive Building Control by Leveraging Active Latent Heat Storage |
|
Yang, Shiyu | Cornell University |
Gao, H. Oliver | Cornell University |
You, Fengqi | Cornell University |
Keywords: Building and facility automation, Control applications, Process Control
Abstract: Active latent heat storage (ALHS) involving phase-change materials constitutes a promising energy-efficient solution for building energy management (BEM). Current BEM systems based on conventional reactive control lack the level of control delicacy required to exploit the full potential of ALHS for BEM. This study proposes a smart model predictive control (MPC) approach for BEM to minimize energy costs while maintaining indoor climate by fully applying ALHS. An MPC framework considering ALHS dynamics and dynamic electricity prices is proposed. A case study entailing a set of simulations is designed based on a single-family house with a space heating system integrated with ALHS. The proposed MPC approach, compared to conventional reactive control, enables more than 70% of reductions in electricity costs. Further analysis reveals that coupling ALHS with MPC is critical to exploiting the ALHS potential for BEM: while conventional reactive control of an ALHS-equipped building increases the electricity cost, an MPC-enabled building could reduce the electricity cost by 45.1% due to ALHS adoption.
|
|
TuAT15 Regular Session, Maya Ballroom VII |
Add to My Program |
Optimal Control and Nonlinear Systems |
|
|
Chair: Tan, Xiaobo | Michigan State University |
Co-Chair: Bouffanais, Roland | University of Ottawa |
|
10:00-10:20, Paper TuAT15.1 | Add to My Program |
Finite-Time Event-Triggered Control for a Class of Nonlinear Systems |
|
Tripathy, Niladri Sekhar | Indian Institute of Technology Jodhpur |
Chamanbaz, Mohammadreza | The University of Sydney |
Bouffanais, Roland | University of Ottawa |
Keywords: Nonlinear systems, Optimal control, Stability of nonlinear systems
Abstract: This paper considers an approximate solution of the event-triggered Hamilton-Jacobi-Bellman (ET-HJB) equation to derive a finite-time suboptimal event-triggered control law for a class of nonlinear systems. To reduce the communication and computation overhead, the control law is computed and actuated after violating a predefined state-dependent event triggering condition. To obtain the controller gain, the ET-HJB equation is approximated as a state-dependent differential Riccati equation (SDRE). After converting the ET-HJB into an SDRE, a frozen time concept is used to eliminate the issues related to state dependency in the system and input matrices between two consecutive events. This helps reframe the SDRE into a simple differential Riccati equation (DRE), where the state-dependent system and input matrices remain fixed until the next event occurs. Using the solution of a differential Lyapunov equation, the solution of the DRE is computed forward in time. The designed event-triggered control law is readily amenable to an online implementation, and also it ensures the input-to-state stability of closed-loop systems. Simulation results are reported to prove the efficacy of the proposed control approach.
|
|
10:20-10:40, Paper TuAT15.2 | Add to My Program |
Some Remarks on the Issue of Normality in State-Constrained Optimal Control Problems |
|
Karamzin, Dmitry | Federal Research Center "Computer Science and Control" of the Ru |
Lobo Pereira, Fernando | Porto University |
Keywords: Optimal control, Nonlinear systems, Constrained control
Abstract: A conventional optimal control problem subject to state constraints is investigated. The controllability matrix is presented and the normality condition is formulated as the condition of full rank for this matrix. As an application of the obtained result, the normality of solution to calculus of variation problem with state constraints and fixed endpoints is justified. The approach is employed which also ensures the property of controllability of the constrained control system in question.
|
|
10:40-11:00, Paper TuAT15.3 | Add to My Program |
Fast Nonlinear Model Predictive Control Using Barrier Formulations and Squashing with a Generalized Gauss-Newton Hessian |
|
Baumgärtner, Katrin | University of Freiburg |
Wang, Yizhen | University Freiburg |
Zanelli, Andrea | ETH Zurich |
Diehl, Moritz | University of Freiburg |
Keywords: Optimal control, Predictive control for nonlinear systems, Optimization algorithms
Abstract: We propose an approximate algorithm for Nonlinear Model Predictive Control (NMPC) which is based on a reformulation of the inequality constrained optimal control problem using barrier terms and squashing functions. Within an SQP framework, the particular structure of the reformulated problem can be leveraged by using a Generalized Gauss-Newton Hessian approximation. Moreover, the quadratic subproblems can be efficiently solved using a single backward and forward sweep of the Ricatti recursion. We show locally linear convergence of the proposed algorithm, as well as locally quadratic convergence in the case of linear system dynamics. The computational speed-up, which can be achieved with the proposed method, is illustrate in a simulation study.
|
|
11:00-11:20, Paper TuAT15.4 | Add to My Program |
Optimal Control of Active Drifter Systems |
|
Gaskell, Eric | Michigan State University |
Tan, Xiaobo | Michigan State University |
Keywords: Optimal control, Nonlinear systems, Robotics
Abstract: Drifters are energy-efficient sampling platforms for waterways and other water bodies with pronounced flows. The motion of passive drifters is determined by the underlying flow and thus limited. To overcome this limitation and enhance maneuverability, we consider an active drifter, which has a variable control surface for modulating the hydrodynamic drag force, and a thruster for propulsion or braking. In order to maintain the active drifter as an energy-efficient platform, the use of the thruster must be sparing and carefully considered. In this paper we present an optimal control problem for a one-degree-of-freedom active drifter system, where the cost function aims to balance the objectives of shortest time and minimal thruster use. Despite the complex, realistic nonlinear dynamics of the active drifter, an analytical solution to the optimal control is found by exploiting Pontryagin's Minimum Principle. In particular, for a given desired final state, each candidate optimal control solution is propagated backward in time, to compute the points in the state space where the optimal control switches; such points are parameterized by the co-state variables, linked implicitly to the system's initial conditions. The proposed approach naturally results in a final-state-dependent partition of the state space, where each region corresponds to a given optimal control value. A feedback control law is readily derived from the aforementioned map. The efficacy of the proposed approach is supported with a numerical example, where the trade-off between time-optimality and fuel-efficiency is illustrated.
|
|
11:20-11:40, Paper TuAT15.5 | Add to My Program |
Cooperative Tuning of Multi-Agent Optimal Control Systems |
|
Lu, Zehui | Purdue University |
Jin, Wanxin | Purdue University |
Mou, Shaoshuai | Purdue University |
Anderson, Brian D.O. | Australian National University |
Keywords: Optimal control, Optimization, Cooperative control
Abstract: This paper investigates the problem of cooperative tuning of multi-agent optimal control systems, where a network of agents (i.e. multiple coupled optimal control systems) adjusts parameters in their dynamics, objective functions, or controllers in a coordinated way to minimize the sum of their loss functions. Different from classical techniques for tuning parameters in a controller, we allow tunable parameters appearing in both the system dynamics and the objective functions of each agent. A framework is developed to allow all agents to reach a consensus on the tunable parameter, which minimizes team loss. The key idea of the proposed algorithm rests on the integration of consensus-based distributed optimization for a multi-agent system and a gradient generator capturing the optimal performance as a function of the parameter in the feedback loop tuning the parameter for each agent. Both theoretical results and simulations for a synchronous multi-agent rendezvous problem are provided to validate the proposed method for cooperative tuning of multi-agent optimal control.
|
|
11:40-12:00, Paper TuAT15.6 | Add to My Program |
Infinite-Dimensional Sums-Of-Squares for Optimal Control |
|
Berthier, Eloïse | Inria - Ecole Normale Supérieure |
Carpentier, Justin | Inria |
Rudi, Alessandro | INRIA |
Bach, Francis | INRIA - Ecole Normale Supérieure |
Keywords: Numerical algorithms, Optimal control, Sampled-data control
Abstract: In this paper, we introduce an approximation method to solve an optimal control problem via the Lagrange dual of its weak formulation, which applies to problems with an unknown, non-necessarily polynomial, dynamics accessed through samples, akin to model-free reinforcement learning. It is based on a sum-of-squares representation of the Hamiltonian, and extends a previous method from polynomial optimization to the generic case of smooth problems. Such a representation is infinite-dimensional and relies on a particular space of functions – a reproducing kernel Hilbert space – chosen to fit the structure of the control problem. After subsampling, it leads to a practical method that amounts to solving a semi-definite program. We illustrate our approach numerically on a low-dimensional control problem.
|
|
TuAT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Observers for Nonlinear Systems |
|
|
Chair: Astolfi, Daniele | Cnrs - Lagepp |
Co-Chair: Reger, Johann | TU Ilmenau |
|
10:00-10:20, Paper TuAT16.1 | Add to My Program |
Towards Improving the Estimation Performance of a Given Nonlinear Observer: A Multi-Observer Approach |
|
Petri, Elena | CRAN, Université De Lorraine, CNRS |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Astolfi, Daniele | CNRS - LAGEPP Univ Lyon 1 |
Nesic, Dragan | University of Melbourne |
Andrieu, Vincent | Université De Lyon |
Keywords: Observers for nonlinear systems, Hybrid systems, Nonlinear systems
Abstract: Various methods are nowadays available to design observers for broad classes of systems. Nevertheless, the question of the tuning of the observer to achieve satisfactory estimation performance remains largely open. This paper presents a general supervisory design framework for online tuning of the observer gains with the aim of achieving various trade-offs between robustness and speed of convergence. We assume that a robust nominal observer has been designed for a general nonlinear system and the goal is to improve its performance. We present for this purpose a novel hybrid multi-observer, which consists of the nominal one and a bank of additional observer-like systems, that are collectively referred to as modes and that differ from the nominal observer only in their output injection gains. We then evaluate on-line the estimation cost of each mode of the multi-observer and, based on these costs, we select one of them at each time instant. Two different strategies are proposed. In the first one, initial conditions of the modes are reset each time the algorithm switches between different modes. In the second one, the initial conditions are not reset. We prove a convergence property for the hybrid estimation scheme and we illustrate the efficiency of the approach in improving the performance of a given nominal high-gain observer on a numerical example.
|
|
10:20-10:40, Paper TuAT16.2 | Add to My Program |
A Hybrid Sensorless Observer for the Robust Global Asymptotic Flux Reconstruction of Permanent Magnet Synchronous Machines |
|
Bosso, Alessandro | University of Bologna |
Tilli, Andrea | University of Bologna |
Conficoni, Christian | Alma Mater Studiorum, University of Bologna |
Keywords: Observers for nonlinear systems, Hybrid systems, Stability of hybrid systems
Abstract: We propose a hybrid sensorless observer for permanent magnet synchronous machines with global asymptotic stability guarantees. Exploiting the constraint of the rotor flux on a circle of unknown radius, we design an integrator system with periodic jumps triggered by a clock to generate a linear regression containing the flux estimation error. Then, a normalized projected gradient descent identifier provides the observer estimates. For the closed-loop system, it is shown that there is a robustly globally asymptotically stable compact attractor, which, additionally, ensures zero estimation error if appropriate Persistency of Excitation (PE) conditions are satisfied. In this respect, sufficient conditions ensuring PE are provided for the angular speed and the clock period.
|
|
10:40-11:00, Paper TuAT16.3 | Add to My Program |
Online Estimation of Hilbert-Schmidt Operators and Application to Kernel Reconstruction of Neural Fields |
|
Brivadis, Lucas | Université Paris-Saclay, CNRS, CentraleSupélec |
Chaillet, Antoine | CentraleSupélec |
Auriol, Jean | CNRS |
Keywords: Observers for nonlinear systems, Distributed parameter systems, Estimation
Abstract: An adaptive observer is designed for online estimation of Hilbert-Schmidt operators from online measurement of part of the state for some class of nonlinear infinite-dimensional dynamical systems. Convergence is ensured under detectability and persistency of excitation assumptions. The class of systems considered is motivated by an application to kernel reconstruction of neural fields, commonly used to model spatiotemporal activity of neuronal populations. Numerical simulations confirm the relevance of the approach.
|
|
11:00-11:20, Paper TuAT16.4 | Add to My Program |
L1-Robust Interval Observer Design for Uncertain Nonlinear Dynamical Systems |
|
Pati, Tarun | Arizona State University |
Khajenejad, Mohammad | University of California, San Diego |
Daddala, Sai Praveen | Arizona State University |
Yong, Sze Zheng | Northeastern University |
Keywords: Observers for nonlinear systems, Uncertain systems
Abstract: This paper presents a novel interval observer design for uncertain locally Lipschitz continuous-time (CT) and discrete-time (DT) systems with noisy nonlinear observations that is input-to-state stable (ISS) and minimizes the L1-gain of the observer error system with respect to the uncertainties. Using mixed-monotone decompositions, the proposed observer is correct and positive by construction without the need for additional constraints/assumptions. This, in turn, allows us to directly leverage techniques for positive systems to design an ISS and L1-robust interval observer via (mixed-integer) linear programs instead of semi-definite programs with linear matrix inequalities. Further, our observer design offers additional degrees of freedom that may serve as a surrogate for coordinate transformations. Finally, we demonstrate the effectiveness of our proposed observer on some CT and DT systems.
|
|
11:20-11:40, Paper TuAT16.5 | Add to My Program |
Observer Design for Non-Globally Lipschitz Nonlinear Systems Using Hilbert Projection Theorem |
|
Zemouche, Ali | CRAN UMR CNRS 7039 & Inria: EPI-DISCO |
Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Observers for nonlinear systems, LMIs, Estimation
Abstract: This paper deals with observer design for a class of Lipschitz nonlinear systems. Specifically, we propose a mathematically rigorous technique to handle systems having non-globally Lipschitz properties on the whole set mathbb{R}^{n}. The unique assumption made on the nonlinearity is for it to be Lipschitz on a compact convex set Omega subset mathbb{R}^{n}, in which lives the system state. The idea consists in extending the nonlinear function to become globally Lipschitz on the whole space mathbb{R}^{n}. Such an extension is performed by using the famous Hilbert projection theorem, which generalizes some existing results in the literature. The projection is then involved in the observer structure to overcome the non-satisfaction of the global property by the original nonlinear function. More importantly, to overcome the conservatism related to the boundedness of the system states, an extension to systems having only some bounded states is proposed under different but less conservative assumptions. It is shown that all the previous observer design methods in the literature that rely on a global Lipschitz property can be applied straightforwardly without changing their synthesis conditions.
|
|
11:40-12:00, Paper TuAT16.6 | Add to My Program |
Non-Asymptotic Observer Design for Nonlinear Systems Based on Linearization |
|
Korder, Kristina | Technische Universität Ilmenau |
Noack, Matti | TU Ilmenau |
Reger, Johann | TU Ilmenau |
Keywords: Observers for nonlinear systems, Observers for Linear systems
Abstract: This work establishes a new approach for a linearization based non-asymptotic state observer design. To receive a linear input-output relation of the nonlinear system for the state observer and to avoid the necessity of time-derivatives of the measurement and actuator signals, a combination of linearization with the modulating function technique is applied. The estimates are obtained non-asymptotically using a sliding time window of finite length. This procedure allows a continuous-time and recursive update of the state estimates and extends the possible applications of the modulating function technique to a wider range of nonlinear systems. The class of unitary modulating functions is introduced to ensure the solvability of the estimation problem when subject to time-varying coefficients generated from the linearized state trajectory. Two examples illustrate the real-time capability of the algorithm.
|
|
TuAT17 Invited Session, Acapulco |
Add to My Program |
Brain Dynamics and Control |
|
|
Chair: Nozari, Erfan | University of California, Riverside |
Co-Chair: Pequito, Sergio | Rensselaer Polytechnic Institute |
Organizer: Nozari, Erfan | University of California, Riverside |
Organizer: Pequito, Sergio | Uppsala University |
Organizer: Pasqualetti, Fabio | University of California, Riverside |
|
10:00-10:20, Paper TuAT17.1 | Add to My Program |
Contraction Analysis of Hopfield Neural Networks with Hebbian Learning (I) |
|
Centorrino, Veronica | Scuola Superiore Meridionale, University of Naples Federico II |
Bullo, Francesco | Univ of California at Santa Barbara |
Russo, Giovanni | University of Salerno |
Keywords: Stability of nonlinear systems, Machine learning, Network analysis and control
Abstract: Motivated by advances in neuroscience and machine learning, this paper is concerned with the modeling and analysis of Hopfield neural networks with dynamic recurrent connections undergoing Hebbian learning. To capture the synaptic sparsity of neural circuits, we propose a low dimensional formulation for the model and then characterize its key dynamical properties. First, we give a biologically-inspired forward invariance result. Then, we give sufficient conditions for the non-Euclidean contractivity of the model. Our contraction analysis leads to stability and robustness of time-varying trajectories -- for networks with both excitatory and inhibitory synapses governed by both Hebbian and anti-Hebbian rules. Our proposed contractivity test is based upon biologically meaningful quantities, e.g., neural and synaptic decay rate, maximum out-degree, and the maximum synaptic strength. Finally, we show that the model satisfies Dale's principle. The effectiveness of our results is illustrated via a numerical example.
|
|
10:20-10:40, Paper TuAT17.2 | Add to My Program |
Distributed Online Estimation of Biophysical Neural Networks (I) |
|
B. Burghi, Thiago | University of Cambridge |
O'Leary, Timothy | University of Cambridge |
Sepulchre, Rodolphe | University of Cambridge |
Keywords: Neural networks, Estimation, Distributed parameter systems
Abstract: In this work, we propose a distributed adaptive observer for a class of nonlinear networked systems inspired by biophysical neural network models. Neural systems learn by adjusting intrinsic and synaptic weights in a distributed fashion, with neuronal membrane voltages carrying information from neighbouring neurons in the network. We show that this learning principle can be used to design an adaptive observer based on a decentralized learning rule that greatly reduces the number of observer states required for exponential convergence of parameter estimates. This novel design is relevant for biological, biomedical and neuromorphic applications.
|
|
10:40-11:00, Paper TuAT17.3 | Add to My Program |
Frequency-Dependent Modulation of Stochasticity in Postsynaptic Neuron Firing Times (I) |
|
Vahdat, Zahra | University of Delaware |
Singh, Abhyudai | University of Delaware |
Keywords: Stochastic systems, Hybrid systems, Biological systems
Abstract: Action potential (AP)-triggered neurotransmitter release forms the key basis of inter-neuronal communication. We present a stochastic hybrid system model that captures the release of neurotransmitter-filled vesicles from a presynaptic neuron. More specifically, vesicles arrive as a Poisson process to attach to a given number of docking sites, and each docked vesicle has a certain probability of release when an AP is generated in the presynaptic neuron. The released neurotransmitters enhance the membrane potential of the postsynaptic neuron, and this increase is coupled to the continuous exponential decay of the membrane potential. The buildup of potential to a critical threshold level results in an AP firing in the postsynaptic neuron, with the potential subsequently resetting back to its resting level. Our model analysis develops formulas that quantify the fluctuations in the number of released vesicles and mechanistically connects them to fluctuations in both the postsynaptic membrane potential and the AP firing times. Increasing the frequency of APs in the presynaptic neuron leads to saturation effects on the postsynaptic side, resulting in a limiting frequency range of neurotransmission. Interestingly, AP firing in the postsynaptic neuron becomes more precise with increasing AP frequency in the presynaptic neuron. We also investigate how noise in the AP timing varies with different parameters, such as the probability of releases, the number of docking sites, the voltage threshold for AP firing, and the timescale of voltage decay. In summary, our results systematically explain how stochastic mechanisms in neurotransmission enhance or impinge the precision of AP fringing times.
|
|
11:00-11:20, Paper TuAT17.4 | Add to My Program |
On the Linearizing Effect of Spatial Averaging in Large-Scale Populations of Homogeneous Nonlinear Systems (I) |
|
Ahmed, Sabbir | University of California, Riverside |
Nozari, Erfan | University of California, Riverside |
Keywords: Linear systems, Nonlinear systems, Stochastic systems
Abstract: Understanding the dynamics resulting from large-scale populations of systems poses one of the greatest challenges ahead of modern science. While it is often expected that the emerging dynamics from such populations compound in complexity, we here show that, on the contrary, the aggregation of complex individual dynamics can in fact lead to simpler behavior overall. In particular, mounting empirical evidence from neuroscience and beyond has pointed out the linearity of macroscopic dynamics that result from the interaction of large populations of microscopic subsystems, despite the highly nonlinear dynamics possessed by the individual subsystems. Rigorous analyses and theoretical grounds for such observations, however, have remained lacking. In this paper, we develop a general theoretical framework showing that the average dynamics of a broad family of populations of nonlinear stochastic subsystems converge to linear time-varying (LTV) dynamics transiently and to linear time-invariant (LTI) dynamics in steady state. Simulations are provided to illustrate this effect in populations of static (feedforward) nonlinear maps as well as a wide range of nonlinear systems exhibiting bistable, limit cycle, and chaotic dynamics.
|
|
11:20-11:40, Paper TuAT17.5 | Add to My Program |
Control Strategies for Neural Populations with Rectified Activation Function (I) |
|
Menara, Tommaso | University of California, San Diego |
Cortes, Jorge | University of California, San Diego |
Keywords: Nonlinear systems, Biological systems, Network analysis and control
Abstract: In the human brain, highly recurrent cortical circuitry supports information processing, coordinates learning episodes, and regulates healthy and diseased states. A key outstanding challenge of neural engineering is to ultimately control the collective dynamics of distinct neural populations, so as to promote the emergence or recovery of desired activity patterns. In this paper, we investigate the control of a general rate model of neural activity with rectified activation function. We first show that any target state in the open positive orthant can be reached in finite time. Furthermore, we present an array of results to perform (feedback and feedforward) efficient control in prototypical classes of networks with distinct connection types and in the case of sparse control inputs. Due to the relevance of rate models in both biological and artificial neural networks, our results lay the groundwork to enhance the dynamical behavior of in vivo and synthetic neural circuitry.
|
|
11:40-12:00, Paper TuAT17.6 | Add to My Program |
Vibrational Control of Cluster Synchronization: Connections with Deep Brain Stimulation (I) |
|
Qin, Yuzhen | University of California, Riverside |
Bassett, Danielle | University of Pennsylvania |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Biological systems, Networked control systems, Nonlinear systems
Abstract: Cluster synchronization underlies various functions in the brain. Abnormal patterns of cluster synchronization are often associated with neurological disorders. Deep brain stimulation (DBS) is a neurosurgical technique used to treat several brain diseases, which has been observed to regulate neuronal synchrony patterns. Despite its widespread use, the mechanisms of DBS remain largely unknown. In this paper, we hypothesize that DBS plays a role similar to vibrational control since they both highly rely on high-frequency excitation to function. Under the framework of Kuramoto-oscillator networks, we study how vibrations introduced to network connections can stabilize cluster synchronization. We derive some sufficient conditions and also provide an effective approach to design vibrational control. Also, a numerical example is presented to demonstrate our theoretical findings.
|
|
TuBT01 Regular Session, Tulum Ballroom A |
Add to My Program |
Switched Systems II |
|
|
Chair: Shim, Hyungbo | Seoul National University |
Co-Chair: De Iuliis, Vittorio | Università Degli Studi Dell'Aquila |
|
13:30-13:50, Paper TuBT01.1 | Add to My Program |
Robust Global Asymptotic Stabilization of Linear Cascaded Systems with Hysteretic Interconnection |
|
Bosso, Alessandro | University of Bologna |
Zaccarian, Luca | LAAS-CNRS and University of Trento |
Tilli, Andrea | University of Bologna |
Barbieri, Matteo | University of Bologna |
Keywords: Switched systems, Stability of hybrid systems, Lyapunov methods
Abstract: We address the problem of setpoint regulation for cascaded minimum-phase linear systems interconnected through a scalar hysteresis, modeled as a Prandtl-Ishlinskii operator. Employing well-posed constrained differential inclusions to represent the hysteretic dynamics, we formulate the control problem in terms of stabilization of a compact set of equilibria depending on the hysteresis states. For our design, we firstly consider a proportional-integral controller for linear systems with hysteretic input, and provide model-free sufficient conditions based on high-gain arguments for closed-loop stability. Then, the controller is dynamically extended to obtain an inversion-free stabilizer of the overall cascade. For the presented schemes, we prove robust global asymptotic stability of a compact set that ensures setpoint regulation, regardless of the hysteresis states.
|
|
13:50-14:10, Paper TuBT01.2 | Add to My Program |
First-Moment Stability of Markov Jump Linear Systems with Homogeneous and Inhomogeneous Transition Probabilities |
|
De Iuliis, Vittorio | Università Degli Studi Dell'Aquila |
Manes, Costanzo | Universita' Dell'Aquila |
D'Innocenzo, Alessandro | University of L'Aquila |
Keywords: Switched systems, Markov processes, Stability of hybrid systems
Abstract: This work provides 1-moment stability conditions for discrete-time Markov Jump Linear Systems under time-homogeneous and time-inhomogeneous transition probabilities. For the latter, we further address the polytopic switching case. The analysis is carried out leveraging the comparison principle and the theory of positive systems, without nevertheless limiting the overall analysis to the latter. We provide sufficient stability conditions that only involve non-negative matrices and can be checked via linear programming.
|
|
14:10-14:30, Paper TuBT01.3 | Add to My Program |
Stability of Linear Systems with Slow and Fast Time Variation and Switching |
|
Liberzon, Daniel | Univ of Illinois, Urbana-Champaign |
Shim, Hyungbo | Seoul National University |
Keywords: Switched systems, Time-varying systems, Stability of linear systems
Abstract: We establish exponential stability for a class of linear systems with slow and fast time variation and switching. We use the averaging method to approximate the original system by the average system which only exhibits slow time variation and switching. We then apply a stability criterion recently developed for such systems to prove stability of the average system and, consequently, of the original system.
|
|
14:30-14:50, Paper TuBT01.4 | Add to My Program |
Asymptotic Stability of Continuous-Time Switched Affine Systems with Unknown Equilibrium Points |
|
Deaecto, Grace S. | FEM/UNICAMP |
Geromel, Jose C. | UNICAMP |
Brito, João | Brazilian Synchrotron Light Laboratory - LNLS |
Keywords: Switched systems, Uncertain systems, LMIs
Abstract: This paper deals with switched affine systems control ensuring asymptotic stability of an unknown equilibrium point. More specifically, our study is focused on the class of systems with the affine term depending linearly on an uncertain vector, which can be used to model input and load variations of dc-dc power converters. The main goal is to design a switching function that governs the state trajectories towards an unknown equilibrium point containing in one of its components the desired value specified by the designer. The other components are adapted in real time based on the uncertainty estimation. The design conditions are solved by a two-steps procedure expressed in terms of linear matrix inequalities, and they do not require any stability property of the individual subsystems. The control design of a dc-dc flyback converter is used for illustration and comparison together with an academic example composed of two third order unstable subsystems.
|
|
14:50-15:10, Paper TuBT01.5 | Add to My Program |
Parallelized Algorithm for Persistent Feasibility in Linear Systems with Multiple, External Switching Signals |
|
Hall, Richard | Duke University |
Bridgeman, Leila | Duke University |
Keywords: Switched systems, Large-scale systems, Constrained control
Abstract: Ensuring feasibility in externally switched systems usually requires identifying time-varying, control invariant (CI) sets that ensure state and input constraints can be respected under any possible switching signal. As with traditional, time-invariant CI sets, these time-varying constraints can be very difficult to compute for higher dimensional systems. Furthermore, if multiple switching signals are present, the number of switching sequences that must be considered grows exponentially. Previous works would struggle to account for this growth. Inspired by distributed systems, this work examines a class of high dimensional systems with multiple switching signals. The switching signals are constrained using directed graphs that are significantly more flexible than dwell time based methods. An iterative algorithm is developed that computes the time-varying CI sets for this class of systems. Critically, this algorithm is parallelized over the number of agents, preventing the exponential growth in computation time as agents are added to the system. The scalability of the results is demonstrated on a randomized numerical example.
|
|
15:10-15:30, Paper TuBT01.6 | Add to My Program |
Active Control Strategy for Disturbed Switched Systems under Asynchronous DoS Attacks |
|
Zhao, Rui | Tianjin University |
Zuo, Zhiqiang | Tianjin University |
Wang, Yijing | Tianjin University |
Zhang, Wentao | Nanyang Technological University |
Keywords: Switched systems, Control over communications
Abstract: This paper studies the security issue of switched systems suffering from disturbance and asynchronous denial of service (DoS) attacks. To this end, an active control strategy is presented to deal with the asynchronous DoS attacks. The switching signal is designed to guarantee the uniformly ultimate boundedness when the mismatched behavior between subsystem mode and controller mode is involved. Moreover, an algorithm calculating the converging bound is suggested. Finally, the effectiveness of the results is verified by a numerical example.
|
|
TuBT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Adaptive Control II |
|
|
Chair: Mahajan, Aditya | McGill University |
Co-Chair: Fidan, Baris | University of Waterloo |
|
13:30-13:50, Paper TuBT02.1 | Add to My Program |
Modular Backstepping Design with Improved Parameteric Convergence for Nonlinear Plants with Input Constraints |
|
Gerasimov, Dmitry | ITMO University |
Pashenko, Artem | ITMO University |
Suzdalev, Oleg Dimitri | ITMO University |
Nikiforov, Vladimir O. | ITMO University |
Keywords: Adaptive control, Direct adaptive control, Constrained control
Abstract: The paper addresses the problem of adaptive backstepping control of nonlinear parametrically uncertain plants with constrained input and unknown control gain. The main focus of the paper consists in design of a modular controller with improved parameters tuning achieved by complete compensation of the tuning rate in the closed-loop system and applying a Kreisselmeier-type adaptation algorithm generating high-order time derivatives (HOTD) of the adjustable parameters. It is shown that the transients of the adjustable parameters can be accelerated by increasing the adaptation gain without significant increase of the initial swing of the tracking error.
|
|
13:50-14:10, Paper TuBT02.2 | Add to My Program |
Decentralized Adaptive Control for Interconnected Cyber-Physical Systems under Coordinated Attacks |
|
Chen, Kaiwen | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Parisini, Thomas | Imperial College & Univ. of Trieste |
Keywords: Adaptive control, Decentralized control, Cyber-Physical Security
Abstract: This paper introduces a decentralized adaptive control scheme for networks of cyber-physical systems under sensor and coordinated actuator attacks. First, a simplified model, in which each node system is isolated, is studied. A partially adaptive controller with a non-adaptive linear component and an adaptive nonlinear component is proposed: this guarantees bounded system trajectories and asymptotic regulation of the state. Then, the whole network system under coordinated actuator attacks is studied and a decentralized fully adaptive controller is proposed. By jointly placing the fully adaptive controller and the partially adaptive controller, according to the so-called feedback vertex set of the underlying graph, boundedness and convergence properties of the simplified model can be guaranteed for the entire network. Finally, a simulation example for a multi-stage water distribution system is presented. The results show that the proposed controller guarantees robustness of the closed-loop network system against sensor and coordinated actuator attacks and it is capable of restoring the transient performance to that of the attack-free case.
|
|
14:10-14:30, Paper TuBT02.3 | Add to My Program |
Lyapunov Analysis of Least Squares Based Direct Adaptive Control |
|
Zengin, Nursefa | University of Waterloo |
Fidan, Baris | University of Waterloo |
Khoshnevisan, Ladan | University of Waterloo |
Keywords: Adaptive control, Direct adaptive control, Automotive control
Abstract: Adaptive control strategies usually are designed based on gradient methods for the sake of simplicity in Lyapunov analysis. However, least squares (LS)-based parameter identifiers, with proper selection of design parameters, exhibit better transient performance than the gradient-based ones, from the aspects of convergence speed and robustness to measurement noise. On the other hand, most of the LS-based adaptive control procedures are designed via the indirect adaptive control approaches, due to the difficulty in integrating an LS-based adaptive law within the direct approaches starting with a certain Lyapunov-like cost function to be driven to (a neighborhood of) zero. In this paper, a formal constructive analysis framework is proposed to integrate the recursive LS-based parameter identification with direct adaptive control. Application of the proposed procedure in adaptive cruise control design is studied through Matlab/Simulink and CarSim simulations, validating the analytical results.
|
|
14:30-14:50, Paper TuBT02.4 | Add to My Program |
A Time-Delay Approach for Extremum Seeking of Scalar Dynamical Systems |
|
Zhu, Yang | Zhejiang University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Adaptive control, Delay systems, LMIs
Abstract: Recently a new time-delay approach was introduced for extremum seeking (ES), but the results were confined to static maps. In the present paper, the time-delay approach is extended for the 1st time to ES of dynamic maps in the case of scalar plants. We not only provide a precise perturbed system of ES without any approximation, but also suggest a direct Lyapunov-Krasovskii method for the transformed time-delay plant to find efficient stability conditions for the closed-loop ES system in the form of linear matrix inequalities (LMIs). Differently from the approaches based on classical averaging and Lie brackets, we provide the quantitative bounds on the frequency and the resulting extremum seeking error, under assumption that the extremum point, the extremum value and the Hessian are uncertain from some known intervals.
|
|
14:50-15:10, Paper TuBT02.5 | Add to My Program |
Thompson-Sampling Based Reinforcement Learning for Networked Control of Unknown Linear Systems |
|
Sayedana, Borna | McGill University |
Afshari, Mohammad | McGill University |
Caines, Peter E. | McGill University |
Mahajan, Aditya | McGill University |
Keywords: Adaptive control, Networked control systems, Machine learning
Abstract: In recent years, there has been considerable interest in reinforcement learning for linear quadratic Gaussian (LQG) systems. In this paper, we consider a generalization of such systems where the controller and the plant are connected over an unreliable packet drop channel. Packet drops cause the system dynamics to switch between controlled and uncontrolled modes. This switching phenomena introduces new challenges in designing learning algorithms. We identify a sufficient condition under which the regret of Thompson sampling-based reinforcement learning algorithm with dynamic episodes (TSDE) at horizon T is bounded by tilde{mathcal{O}}(sqrt{T}), where the tilde{mathcal{O}}(cdot) notation hides logarithmic factors in T. These are the first results to generalize regret bounds of LQG systems to packet-drop networked control models.
|
|
15:10-15:30, Paper TuBT02.6 | Add to My Program |
Adaptive Output Feedback Fault-Tolerant Tracking Control for a Class of Nonlinear Systems with Sensor Failures and Fusion Mechanism |
|
Sun, Chen | Beihang University |
Lin, Yan | Shandong University of Science and Technolog |
Li, Lin | Beihang University |
Keywords: Adaptive control, Fault tolerant systems, Nonlinear output feedback
Abstract: In this paper, an adaptive output feedback fault-tolerant tracking scheme is proposed for a class of nonlinear systems with system uncertainties, external disturbances and sensor failures, in which redundant sensors are utilized to measure the system output. While a sensor fusion mechanism is designed to fuse sensor outputs and possible external diagnostic data, a coupled controller is designed based on sensor fused output, a new high-gain update law and a state observer. It is shown that the proposed method can ensure the closed-loop system stability and improve the tracking performance.
|
|
TuBT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Robotics II |
|
|
Chair: Nuño, Emmanuel | University of Guadalajara |
Co-Chair: Sartoretti, Guillaume | Carnegie Mellon University |
|
13:30-13:50, Paper TuBT03.1 | Add to My Program |
A Nonlinear Observer for a Flexible Robot Arm and Its Use in Fault and Collision Detection |
|
Gaz, Claudio Roberto | Kingston University London |
Cristofaro, Andrea | Sapienza University of Rome |
Palumbo, Pasquale | University of Milano-Bicocca |
De Luca, Alessandro | Sapienza Università Di Roma |
Keywords: Robotics, Observers for nonlinear systems, Fault detection
Abstract: A nonlinear observer is presented for the estimation of the joint velocities and of the deformation modes and their rates for a two-link flexible robot arm, using only motor encoders and a tip position sensor. The state observer can be used then for trajectory tracking control. By monitoring the mismatch between measured and estimated outputs, we can also detect the occurrence of permanent or intermittent system anomalies such as actuator faults or link collisions.
|
|
13:50-14:10, Paper TuBT03.2 | Add to My Program |
Passivity-Based Motion and Force Tracking Control for Constrained Elastic Joint Robots |
|
Meng, Xuming | German Aerospace Center (DLR) |
Keppler, Manuel | German Aerospace Center (DLR) |
Ott, Christian | TU Wien |
Keywords: Robotics, Stability of nonlinear systems, Lyapunov methods
Abstract: In the past, several motion and force controls were successfully implemented on rigid-joint robots with constraints. With the invention of mechanically compliant robots, the focus on designing controllers for elastic joint robots with constraints is increasing, especially involving the complexity of the joint elasticity in control. Aiming to bridge the gap between the control schemes of rigid- and elastic-joint robots, this letter presents a controller consisting of a PD+ task-space tracking and integral force control, while the intrinsic inertial and elastic properties of the system are fully preserved. We provide a passivity analysis and prove uniform asymptotic stability of the equilibrium. Simulations on a planar two-armed benchmark system with constraints validate the proposed control law.
|
|
14:10-14:30, Paper TuBT03.3 | Add to My Program |
A Globally Convergent Adaptive Velocity Observer for Nonholonomic Mobile Robots Affected by Unknown Disturbances |
|
Romero, Jose Guadalupe | Instituto Tecnológico Autónomo De México |
Navarro-Alarcon, David | The Hong Kong Polytechnic University |
Nuño, Emmanuel | University of Guadalajara |
Que, Haoyi | Zhejiang University |
Keywords: Observers for nonlinear systems, Estimation, Stability of nonlinear systems
Abstract: In this paper, we present a novel adaptive observer for nonholonomic differential-drive robots to simultaneously estimate the system's angular and linear velocities, along with its external matched disturbances. The proposed method is based on the immersion and invariance technique and makes use of a dynamic scaling factor. The stability and convergence proof of the velocity and disturbance errors are performed using a strict Lyapunov function. We present a detailed simulation study to validate the performance of our approach.
|
|
14:30-14:50, Paper TuBT03.4 | Add to My Program |
Keyframe-Based CPG for Stable Gait Design and Online Transitions in Legged Robots |
|
Shaw, Scott | Northeastern University |
Sartoretti, Guillaume | National University of Singapore |
Keywords: Robotics, Biologically-inspired methods
Abstract: While animals easily and seamlessly transition between gaits to overcome challenging environments, current methods to design stable gait transitions often rely on computationally expensive optimization. In this work, we introduce a central pattern generator (CPG) model that allows intuitive gait definition and online, real-time gait transitions while ensuring stability and forward progression. Specifically, we propose to rely on keyframes -- discrete key leg configurations that can be sequenced into a gait -- to define arbitrary legged gaits. We introduce a new task-space CPG, which relies on the well-known Kuramoto model and a new feedforward term to ensure synchronized convergence to these keyframes. We then show how this framework can naturally be extended to allow arbitrary gait transitions by developing two stabilization techniques. First, we reason about the robot's predicted stability to disable specific oscillator updates during transition, while minimizing the resulting effect on forward locomotion. Second, we control the robot's body position within the grounded legs to ensure current and predicted stability based on inexpensive forward prediction of the CPG model. We validate our approach by presenting simulation and experimental results on a hexapod robot following and transitioning among hexapedal and quadrupedal gaits in a number of indoor and outdoor locomotion and mobile manipulation scenarios.
|
|
14:50-15:10, Paper TuBT03.5 | Add to My Program |
Efficient Path Planning and Tracking for Multi-Modal Legged-Aerial Locomotion Using Integrated Probabilistic Road Maps (PRM) and Reference Governors (RG) |
|
Sihite, Eric | Northeastern University |
Mottis, Benjamin | Ecole Polytechnique Fédérale De Lausanne |
Ghanem, Paul | University of Maryland College Park |
Ramezani, Alireza | Northeastern University |
Gharib, Morteza | Caltech |
Keywords: Robotics, Simulation, Mechatronics
Abstract: There have been several successful implementations of bio-inspired legged robots that can trot, walk, and hop robustly even in the presence of significant unplanned disturbances. Despite all of these accomplishments, practical control and high-level decision-making algorithms in multi-modal legged systems are overlooked. In nature, animals such as birds impressively showcase multiple modes of mobility including legged and aerial locomotion. They are capable of performing robust locomotion over large walls, tight spaces, and can recover from unpredictable situations such as sudden gusts or slippery surfaces. Inspired by these animals' versatility and ability to combine legged and aerial mobility to negotiate their environment, our main goal is to design and control legged robots that integrate two completely different forms of locomotion, ground and aerial mobility, in a single platform. Our robot, the Husky Carbon, is being developed to integrate aerial and legged locomotion and to transform between legged and aerial mobility. This work utilizes a Reference Governor (RG) based on low-level control of Husky's dynamical model to maintain the efficiency of legged locomotion, uses Probabilistic Road Maps (PRM) and 3D A* algorithms to generate an optimal path based on the energetic cost of transport for legged and aerial mobility.
|
|
15:10-15:30, Paper TuBT03.6 | Add to My Program |
Passivity-Based Task Space Control of Hybrid Rigid-Soft (HyRiSo) Robots with Parametric Uncertainty |
|
Weerakoon, Lasitha | University of Maryland |
Chopra, Nikhil | University of Maryland, College Park |
Keywords: Robotics
Abstract: This paper introduces a novel robotic system, coined as a hybrid rigid-soft (HyRiSo) robot composed of rigid links and soft links serially attached. HyRiSo combines the benefits of the dexterity of a soft manipulator with the support capability of a classical stiff arm. Due to the heterogeneous modes of actuation for the revolute joints and to enable the bending of the soft links, it is challenging to design an integrated controller for this class of robots. We demonstrate in this paper that the well-known passivity-based adaptive and robust controllers can be utilized to address this challenge. Specifically, we use these controllers for task space tracking in the presence of uncertain mass, stiffness, damping and actuation. We demonstrate the effectiveness of the proposed HyRiSo robot by showing task space tracking in the presence of complex obstacles and joint limits. We provide numerical examples using a 2-rigid-2-soft HyRiSo robot and compare the performance of the proposed task space controllers illustrating the efficacy of these frameworks.
|
|
TuBT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Machine Learning II |
|
|
Chair: Paulson, Joel | The Ohio State University |
Co-Chair: Chopra, Nikhil | University of Maryland, College Park |
|
13:30-13:50, Paper TuBT04.1 | Add to My Program |
Value Function Estimation and Uncertainty Propagation in Reinforcement Learning: A Koopman Operator Approach |
|
Zanini, Francesco | Università Di Padova |
Chiuso, Alessandro | Univ. Di Padova |
Keywords: Learning, Stochastic optimal control, Nonlinear systems
Abstract: In Reinforcement Learning, model-free and model-based methods are known to have both strengths and weaknesses. The most recent literature on these topics focuses on trying to combine them in a way that preserves their advantages: sample-efficiency and good asymptotic behaviour respectively. This paper presents a new approach for directly estimating value-functions from data, which however relies on a particular model of the environment, therefore lying in between the two perspectives. The estimation problem indeed is targeting only the rewards of the transitions. This is achieved through the Koopman operator framework, which allows to describe a generic dynamical system in terms of its action on functions rather than on states, lifting the evolution in a Reproducing Kernel Hilbert Space. By leveraging a Bayesian perspective the latter approach enables also the propagation of estimation error and the definition of confidence intervals.
|
|
13:50-14:10, Paper TuBT04.2 | Add to My Program |
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation |
|
Chen, Zaiwei | California Institute of Technology |
Khodadadian, Sajad | Georgia Institute of Technology |
Maguluri, Siva Theja | Georgia Institute of Technology |
Keywords: Machine learning, Stochastic optimal control, Iterative learning control
Abstract: In this paper, we develop a novel variant of natural actor-critic algorithm using off-policy sampling and linear function approximation, and establish a sample complexity of mathcal{O}(epsilon^{-3}), outperforming all the previously known convergence bounds of such algorithms. In order to overcome the divergence due to deadly triad in off-policy policy evaluation under function approximation, we develop a critic that employs n-step TD-learning algorithm with a properly chosen n. We present finite-sample convergence bounds on this critic, which are of independent interest. Furthermore, we develop a variant of natural policy gradient under function approximation, with an improved convergence rate of mathcal{O}(1/T) after T iterations. Combining the finite sample bounds of the actor and the critic, we obtain an overall mathcal{O}(epsilon^{-3}) sample complexity. Our results were derived solely based on the assumption that the behavior policy sufficiently explores the state-action space, which is a much lighter assumption compared to the related literature.
|
|
14:10-14:30, Paper TuBT04.3 | Add to My Program |
Diffeomorphically Learning Stable Koopman Operators |
|
Bevanda, Petar | Technical University of Munich |
Beier, Max Leon | Technical University of Munich |
Kerz, Sebastian | Technical University Munich |
Lederer, Armin | Technical University of Munich |
Sosnowski, Stefan | Technical University of Munich |
Hirche, Sandra | Technische Universität München |
Keywords: Machine learning, Nonlinear systems identification, Intelligent systems
Abstract: System representations inspired by the infinite-dimensional Koopman operator (generator) are increasingly considered for predictive modeling. Due to the operator's linearity, a range of nonlinear systems admit linear predictor representations - allowing for simplified prediction, analysis and control. However, finding meaningful finite-dimensional representations for prediction is difficult as it involves determining features that are both Koopman-invariant (evolve linearly under the dynamics) as well as relevant (spanning the original state) - a generally unsupervised problem. In this work, we present Koopmanizing Flows - a novel continuous-time framework for supervised learning of linear predictors for a class of nonlinear dynamics. In our model construction a latent diffeomorphically related linear system unfolds into a linear predictor through the composition with a monomial basis. The lifting, its linear dynamics and state reconstruction are learned simultaneously, while an unconstrained parameterization of Hurwitz matrices ensures asymptotic stability regardless of the operator approximation accuracy. The superior efficacy of Koopmanizing Flows is demonstrated in comparison to a state-of-the-art method on the well-known LASA handwriting benchmark.
|
|
14:30-14:50, Paper TuBT04.4 | Add to My Program |
Analysis and Synthesis of Adaptive Gradient Algorithms in Machine Learning: The Case of AdaBound and MAdamSSM |
|
Chakrabarti, Kushal | University of Maryland |
Chopra, Nikhil | University of Maryland, College Park |
Keywords: Machine learning, Neural networks, Optimization algorithms
Abstract: Adaptive gradient algorithms have become the prevalent tool in training complex neural networks; recent examples include AdaBound and MAdam. For a better understanding of such existing optimization algorithms and design ideas for new algorithms, well-known tools from classical control appear to be promising. This area of research is built upon modeling optimization algorithms as closed-loop dynamical systems. Consequently, this paper exploits a control-theoretic methodology in analyzing AdaBound, a recent adaptive gradient algorithm, and proposing a novel optimizer for machine learning. Specifically, inspired by the recently developed state-space perspective in the G-AdaGrad and the Adam algorithm, we present a simple convergence analysis of the AdaBound algorithm for non-convex optimization problems. Next, we propose a new variant of the MAdam algorithm upon applying the concept of transfer functions. Our experimental results demonstrate the efficiency of the proposed algorithm in training CNN models for image classification problems. The findings in this paper suggest further exploration of the existing tools from control theory in complex machine learning problems.
|
|
14:50-15:10, Paper TuBT04.5 | Add to My Program |
Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation |
|
Winnicki, Anna | University of Illinois at Urbana Champaign |
Srikant, R | Univ of Illinois, Urbana-Champaign |
Keywords: Machine learning, Stochastic optimal control
Abstract: We provide performance guarantees for a variant of simulation-based policy iteration for controlling Markov decision processes that involves the use of stochastic approximation algorithms along with state-of-the-art techniques that are useful for very large MDPs, including lookahead, function approximation, and gradient descent. Specifically, we analyze two algorithms; the first algorithm involves a least squares approach where a new set of weights associated with feature vectors is obtained via least squares minimization at each iteration and the second algorithm is a two-time-scale algorithm taking several steps of gradient descent towards the least squares solution before obtaining the next iterate using a stochastic approximation algorithm.
|
|
15:10-15:30, Paper TuBT04.6 | Add to My Program |
Delay-Aware Decentralized Q-Learning for Wind Farm Control |
|
Bizon Monroc, Claire | INRIA |
Bouba, Eva | IFP Energies Nouvelles |
Busic, Ana | Inria |
Dubuc, Donatien | IFP Energies Nouvelles |
Zhu, Jiamin | IFPEN |
Keywords: Machine learning, Smart grid
Abstract: Wind farms are subject to the so-called "wake effect", where upstream turbines facing the wind create sub-optimal wind conditions for turbines located downstream. One strategy to address this issue is to use yaw actuators to misalign the wind turbines with regard to the incoming wind direction, thus deflecting wakes away from downstream turbines. Tractable models for yaw optimization are however subject to inaccuracies, ignore wake dynamics and lack adaptability. This incentivizes the use of model-free methods. In this paper, we propose a delay-aware decentralized Q-learning algorithm for yaw control on wind farms. We introduce a strategy to handle delayed cost collection, and show that our method significantly increases power production in simulations with realistic wake dynamics. We validate our results for two farm layouts on mid-fidelity wind farm simulator FAST.Farm.
|
|
TuBT05 Invited Session, Tulum Ballroom E |
Add to My Program |
Control and Estimation of Traffic Flow Systems |
|
|
Chair: Yu, Huan | The Hong Kong University of Science and Technology |
Co-Chair: Zhang, Liguo | Beijing University of Technology |
Organizer: Yu, Huan | The Hong Kong University of Science and Technology |
Organizer: Zhang, Liguo | Beijing University of Technology |
Organizer: Zheng, Yang | University of California San Diego |
|
13:30-13:50, Paper TuBT05.1 | Add to My Program |
Artificial Traffic Fluids Emerging from the Design of Cruise Controllers (I) |
|
Karafyllis, Iasson | National Technical University of Athens |
Theodosis, Dionysios | Technical University of Crete |
Papageorgiou, Markos | Technical Univ. of Crete |
Keywords: Lyapunov methods, Nonlinear systems
Abstract: In this paper, we present two Control Lyapunov Function based families of cruise controllers for the two-dimensional movement of autonomous vehicles on lane-free roads using the bicycle kinematic model. The control Lyapunov functions are based on measures of the energy of the system with the kinetic energy expressed in ways similar to Newtonian or relativistic mechanics. The derived feedback laws (cruise controllers) are decentralized, as each vehicle determines its control input based on its own speed and on the relative speeds and distances from adjacent vehicles and from the boundary of the road. Moreover, the corresponding macroscopic models are derived, obtaining fluid-like models that consist of a conservation equation and a momentum equation with pressure and viscous terms. Finally, we show that, by selecting appropriately the parameters of the feedback laws, we get free hand to create an artificial fluid that approximates the emerging traffic flow.
|
|
13:50-14:10, Paper TuBT05.2 | Add to My Program |
Robustness of String Stability to Delay Mismatch and Safety of CTH Predictor-Feedback CACC (I) |
|
Bekiaris-Liberis, Nikolaos | Technical University of Crete |
Keywords: Autonomous vehicles, Traffic control, Delay systems
Abstract: For the predictor-feedback Cooperative Adaptive Cruise Control (CACC) design for homogenous platoons, recently developed by Davis, we establish robustness of string stability to delay mismatch and positivity of spacing and speed states. Each individual vehicle’s dynamics are described by a second-order linear system with delayed desired acceleration, under acceleration information transmitted to the ego vehicle from a single, preceding vehicle. The nominal design (in the delay-free case) is a constant time-headway (CTH) policy and no restriction on the delay size, in relation with the desired time headway, is imposed. The proofs rely on combination of an input-output approach (on the frequency domain) and on deriving estimates on explicit, closed-loop solutions; under specific, sufficient conditions that are derived on initial conditions and parameters of the baseline, CTH controller.
|
|
14:10-14:30, Paper TuBT05.3 | Add to My Program |
Coupled Macroscopic Modelling of Electric Vehicle Traffic and Energy Flows for Electromobility Control (I) |
|
Cicic, Mladen | CNRS, GIPSA-Lab |
Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
Keywords: Traffic control, Modeling, Constrained control
Abstract: The simultaneous proliferation of electric vehicles and intermittent renewable energy sources promises to expedite decarbonization of two sectors with highest emissions. However, both these developments threaten to endanger power system stability, which may hinder their widespread adoption. To tackle these challenges, there is a need for joint modelling of electric vehicle traffic flows, together with their battery dynamics. We propose a macroscopic electromobility model, augmenting the well-known LWR model, describing the traffic dynamics, with an inhomogeneous advection equation, describing the evolution of vehicles' State of Charge (SoC). The Riemann problem for the joint model is solved for the case of triangular fundamental diagram, and the solutions are used to formulate a Godunov-like scheme for model discretization. Additionally, we propose an advection-based charging station model, discretize it, and link it with the rest of the traffic and SoC model. We demonstrate the capabilities and use of the full coupled model by proposing a pedagogic example where a simple control law regulates the average SoC of all vehicles on a ring road by controlling the traffic flow entering the charging station.
|
|
14:30-14:50, Paper TuBT05.4 | Add to My Program |
Integrated Traffic Simulation-Prediction System Using Neural Networks with Application to the Los Angeles International Airport Road Network (I) |
|
Zhang, Yihang | University of Southern California |
Papadopoulos, Aristotelis-Angelos | University of Southern California |
Chen, Pengfei | University of Southern California |
Alasiri, Faisal | University of Southern California |
Yuan, Tianchen | University of Southern California |
Zhou, Jin | University of Southern California |
Ioannou, Petros A. | Univ. of Southern California |
Keywords: Traffic control, Simulation, Machine learning
Abstract: Traffic simulators can capture the complex dynamics of road networks thus are widely used to understand the behavior of the traffic system and predict the effects of transportation technologies, events and new policy changes. To build a traffic simulation model that can accurately capture the real traffic conditions, the Origin-Destination (OD) matrix is needed to be fed into the simulator as an input. In general, it is difficult to acquire information about the real OD matrix of a network. In this paper, we propose a neural network model for estimating the OD matrix that excites the microscopic simulation model for complex traffic networks to reproduce real world traffic flows using only the flow rate information of a small subset of links in the traffic network, which is relatively easy to collect. An optimization method is used to generate the training dataset of the neural network. Combining the OD matrix estimation model, the training dataset generation method, and the microscopic traffic simulator, an integrated traffic simulation-prediction system is developed to predict the behavior of the transportation system. We test the proposed system on the road network of the central terminal area of the Los Angeles International Airport and demonstrate that the integrated traffic simulation-prediction system can reproduce the real world traffic flow with low relative root mean square errors especially for high volume links and can be used to simulate the behavior of real world traffic scenarios.
|
|
14:50-15:10, Paper TuBT05.5 | Add to My Program |
Saturated Boundary Feedback Control of LWR Traffic Flow Models with Lane-Changing |
|
Zhao, Hanxu | Beijing University of Technology |
Zhan, Jingyuan | Beijing University of Technology |
Zhang, Liguo | Beijing University of Technology |
Keywords: Distributed parameter systems, Constrained control, Traffic control
Abstract: This paper investigates the boundary feedback control for the two-lane traffic flow with lane-changing interactions in the presence of actuator saturations. The macroscopic traffic dynamics are described by the Lighthill-Whitham-Richards (LWR) model for both two lanes. Two variable speed limits (VSLs) are applied at boundaries for controlling the vehicle velocity so as to stabilize the traffic density of each lane. A saturated boundary feedback controller is proposed to drive the traffic densities of both lanes to the steady states. In contrast to the fruitful results in saturated control of ordinary differential equation (ODE) systems, there are few related studies for hyperbolic partial differential equation (PDE) systems. We define the exponential stability for the hyperbolic PDEs under saturated control. Then sufficient conditions for ensuring exponential stability of the two-lane traffic flow system are developed in terms of matrix inequalities under both the linear and nonlinear cases, by employing the Lyapunov function method along with a sector condition in L2-norm and H2-norm, respectively. Finally, the effectiveness of the proposed conditions is validated by numerical simulations.
|
|
15:10-15:30, Paper TuBT05.6 | Add to My Program |
Data-Driven Optimal Control of Traffic Signals for Urban Road Networks |
|
Liu, Tong | New York University |
Wang, Hong | Oak Ridge National Laboratory |
Jiang, Zhong-Ping | New York University |
Keywords: Traffic control, Learning, Optimal control
Abstract: This paper studies the issue of data-driven optimal control design for traffic signals of oversaturated urban road networks. The signal control system based on the store and forward model is generally uncontrollable for which the controllable decomposition is needed. Instead of identifying the unknown parameters like saturation rates and turning ratios, a finite number of measured trajectories can be used to parametrize the system and help directly construct a transformation matrix for Kalman controllable decomposition through the fundamental lemma of J. C. Willems. On top of that, an infinite-horizon linear quadratic regulator (LQR) problem is formulated considering the constraints of green times for traffic signals. The problem can be solved through a two-phase data-driven learning process, where one solves an infinite-horizon unconstrained LQR problem and the other solves a finite-horizon constrained LQR problem. The simulation result shows the theoretical analysis is effective and the proposed data-driven controller can yield desired performance for reducing traffic congestion.
|
|
TuBT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Identification of Linear Systems |
|
|
Chair: Fosson, Sophie | Politecnico Di Torino |
Co-Chair: Van den Hof, Paul M.J. | Eindhoven University of Technology |
|
13:30-13:50, Paper TuBT06.1 | Add to My Program |
Infinite-Dimensional Sparse Learning in Linear System Identification |
|
Yin, Mingzhou | ETH Zurich |
Akan, Mehmet Tolga | Eindhoven University of Technology |
Iannelli, Andrea | ETH Zurich |
Smith, Roy S. | ETH Zurich |
Keywords: Identification, Statistical learning, Optimization algorithms
Abstract: Regularized methods have been widely applied to system identification problems without known model structures. This paper proposes an infinite-dimensional sparse learning algorithm based on atomic norm regularization. Atomic norm regularization decomposes the transfer function into first-order atomic models and solves a group lasso problem that selects a sparse set of poles and identifies the corresponding coefficients. The difficulty in solving the problem lies in the fact that there are an infinite number of possible atomic models. This work proposes a greedy algorithm that generates new candidate atomic models maximizing the violation of the optimality condition of the existing problem. This algorithm is able to solve the infinite-dimensional group lasso problem with high precision. The algorithm is further extended to reduce the bias and reject false positives in pole location estimation by iteratively reweighted adaptive group lasso and complementary pairs stability selection respectively. Numerical results demonstrate that the proposed algorithm performs better than benchmark parameterized and regularized methods in terms of both impulse response fitting and pole location estimation.
|
|
13:50-14:10, Paper TuBT06.2 | Add to My Program |
Fundamental Limit on SISO System Identification |
|
Li, Jiayun | Tsinghua University |
Sun, Shuai | Tsinghua University |
Mo, Yilin | Tsinghua University |
Keywords: Identification, Linear systems
Abstract: This paper is concerned with the fundamental limit on the identification of discrete-time SISO (Single Input Single Output) system, where the diagonal canonical form of the system is inferred from a finite number of input/output sample trajectories. Through the analysis of the Fisher information matrix used in Cram ́er-Rao bound, we show that the sample complexity of the identification problem using any unbiased estimator explodes superpolynomially with respect to system dimension in the average sense, assuming that the eigenvalues of the system matrix are uniformly distributed. Furthermore, we extend our result to the widely applied Ho-Kalman algo- rithm and prove that the algorithm is ill-conditioned for high dimensional SISO systems as well. Numerical results further demonstrate the conclusion of this paper.
|
|
14:10-14:30, Paper TuBT06.3 | Add to My Program |
Batch-Least Squares System Identification Algorithm for 2D Repetitive Processes |
|
Seltzer, Dustin | Pennsylvania State University |
Schiano, Jeffrey L | Pennsylvania State Univ |
Keywords: Identification, Manufacturing systems and automation, Linear systems
Abstract: The batch-least squares (BLS) approach to parameter identification is a well-known technique utilized for system identification. In this paper, we adapt the BLS approach to a 2D repetitive process model. Our approach first derives a regression model for the 2D repetitive process, which is then used for implementing the BLS algorithm. Our adapted BLS algorithm then uses the repetitive nature of the system to identify both the parameters associated with the system repetition and inputs applied. We evaluated the adapted BLS algorithm on an additive manufacturing process.
|
|
14:30-14:50, Paper TuBT06.4 | Add to My Program |
Set-Membership Identification of Continuous-Time Systems through Model Transformation |
|
Cerone, Vito | Politecnico Di Torino |
Fosson, Sophie | Politecnico Di Torino |
Pirrera, Simone | Politecnico Di Torino |
Regruto, Diego | Politecnico Di Torino |
Keywords: Identification, Optimization
Abstract: In this paper, we consider the identification of continuous-time linear-time-invariant systems from sampled input-output data corrupted by unknown but bounded noise. We formulate the problem in the framework of errors-in- variables set-membership identification. First, we derive a mathematical description of the feasible parameter set in terms of polynomial constraints. Such a description accounts for a- priori information on the model structure and the noise bounds, and for the effects of the discretization error. Then, we select the parameter estimate from the feasible parameter set by computing the solution to a suitable polynomial optimization problem. We show the effectiveness of the proposed approach through a numerical example.
|
|
14:50-15:10, Paper TuBT06.5 | Add to My Program |
Local Identification in Diffusively Coupled Linear Networks |
|
Kivits, E.M.M. (Lizan) | Eindhoven University of Technology |
Van den Hof, Paul M.J. | Eindhoven University of Technology |
Keywords: Identification, Linear systems, Network analysis and control
Abstract: Physical dynamic networks most commonly consist of interconnections of physical components that can be described by diffusive couplings. Diffusive couplings imply symmetric cause-effect relationships in the interconnections and therefore diffusively coupled networks can be represented by undirected graphs. This paper shows how local dynamics of (undirected) diffusively coupled networks can be identified on he bases of local signals only. Sensors and actuators are allocated to guarantee consistent identification. An algorithm is developed for identifying the local dynamics.
|
|
15:10-15:30, Paper TuBT06.6 | Add to My Program |
Generalized DCM Models for Pre-Filtering Compensation |
|
Gindullina, Elvina | University of Padova |
Zorzi, Mattia | University of Padova |
Bertoldo, Alessandra | University of Padova |
Chiuso, Alessandro | Univ. Di Padova |
Keywords: Identification, Modeling, Biological systems
Abstract: Estimation of brain effective connectivity (EC) faces several issues, among which are computational complexity and estimation accuracy. Here, we focus on increasing estimation accuracy, in particular addressing the bias introduced by the pre-processing pipeline employed on resting-state (rs) fMRI signals, which includes temporal band-pass prefiltering. In this paper, we propose an alternative model (so-called ``Filtered rs-DCM'') that adapts a stochastic linear rs-DCM model to the filtered input. Filtered rs-DCM is built by augmenting the neuronal rs-DCM with the filter model used in the pre-processing step. The parameter estimation is performed with EM-algorithm with a sparsity inducing prior. The simulation experiments run on band-pass filtered synthetic datasets demonstrate that this approach outperforms classical DCM models in terms of EC estimation accuracy.
|
|
TuBT07 Regular Session, Tulum Ballroom G |
Add to My Program |
Fault Tolerant Systems I |
|
|
Chair: Clark, Andrew | Washington University in St. Louis |
Co-Chair: Murguia, Carlos | Eindhoven University of Technology |
|
13:30-13:50, Paper TuBT07.1 | Add to My Program |
Fault Detection on a Class of Robotic Manipulators Using Time-Variant Transmissibilities |
|
Khalil, Abdelrahman | Memorial University of Newfoundland |
Aljanaideh, Khaled | The MathWorks |
Al Janaideh, Mohammad | Memorial University of Newfoundland |
Keywords: Mechatronics
Abstract: This paper investigates detecting faults in robotic manipulators with bounded nonlinearities and time-variant parameters using transmissibility operators. Transmissibility operators are mathematical relations between a set of system responses to another set of responses within the same system. Both parameter variation and system nonlinearities are considered to be unknown. Transmissibility operators are shown in the literature to be independent of the system inputs. The bounded nonlinearities are considered as independent excitations on the system, which renders transmissibilities independent of these nonlinearities. To overcome the unknown variant parameters, we propose identifying transmissibilities using recursive least-squares in the structure of noncausal FIR models. While parameter variation can be treated as system nonlinearities, the recursive least squares algorithm is used to optimize what time-variant dynamics to include in the transmissibility operator and what dynamics to push to the system nonlinearities over time. The identified transmissibilities are then used for the purpose of fault detection in an experimental robotic arm with variant picked mass. The experimental results show the proposed approach to be used effectively to detect faults in robotic manipulators.
|
|
13:50-14:10, Paper TuBT07.2 | Add to My Program |
Efficient Fault Detection for Discrete-Time PWA Systems |
|
Liu, Xinyang | Harbin Institute of Technology |
Liu, Zonglin | University of Kassel |
Wang, Zhenhua | Harbin Institute of Technology |
Stursberg, Olaf | University of Kassel |
Keywords: Fault detection, Hybrid systems, Estimation
Abstract: This paper considers a class of fault detection problems for discrete-time piecewise-affine (PWA) systems. Inspired by the state-of-the-art of set-membership methods to estimate the state of PWA systems, a novel detection method based on zonotopes and over-approximation of sets is proposed to overcome high computational complexity caused by unknown discrete states. The efficiency and high accuracy of detection of the new method is confirmed through both theoretical analysis and numerical examples.
|
|
14:10-14:30, Paper TuBT07.3 | Add to My Program |
Integrated Design of Input and Observer Gain for Active Fault Diagnosis Based on Hybrid Stochastic-Deterministic Approach |
|
Tan, Junbo | Tsingahu University |
He, Jiabao | Tsinghua University |
Zhang, Shengli | Shenzhen University |
Wang, Xueqian | Tsinghua University |
Liang, Bin | Tsinghua University |
Keywords: Fault diagnosis, Fault detection, Optimization algorithms
Abstract: This paper proposes a hybrid stochastic-deterministic approach to realize the observer-based active fault diagnosis (AFD) for linear time-varying systems. A novel conception called hybrid zonotope-Gaussian dispersity for random variables of systems is introduced to establish an online optimization strategy for AFD. The optimal gain of observer and the optimal input are simultaneously designed at each step by solving a non-convex quadratic fractional programming problem, which is proved to be equivalent to a mixed integer quadratic programming problem. The proposed approach avoids the strict set-separation constraint conditions of traditional off-line AFD technology by making full use of the online measured outputs such that the diagnosisability of systems has potential to be further improved. At the end, a quadrotor model is used to verify the effectiveness of our proposed method.
|
|
14:30-14:50, Paper TuBT07.4 | Add to My Program |
Joint State, Disturbance and Fault Estimation for Weakly Output Redundant Discrete-Time Linear Systems |
|
Yang, Guitao | Imperial College London |
Barboni, Angelo | Imperial College London |
Rezaee, Hamed | Imperial College London |
Serrani, Andrea | The Ohio State University |
Parisini, Thomas | Imperial College & Univ. of Trieste |
Keywords: Fault detection, Algebraic/geometric methods, Linear systems
Abstract: The problem considered in this paper is dual to the control problem for over-actuated systems found in the literature. We show that, due to a certain notion of weak output redundancy, there always exists an unobservability subspace containing the input subspace, which ensures that the original system can be partitioned into two subsystems, one of which is not affected by actuator faults. We use this fact to estimate the disturbance and the fault in a cascaded fashion: we first design a discrete-time filter on a properly designed residual signal, that can reconstruct the disturbance. The estimated disturbance can then be used to perform fault detection and estimation in a cascaded fashion. A numerical example verifies the efficacy of the proposed strategy.
|
|
14:50-15:10, Paper TuBT07.5 | Add to My Program |
A Compositional Approach to Safety-Critical Resilient Control for Systems with Coupled Dynamics |
|
Maruf, Abdullah Al | University of Washington |
Niu, Luyao | University of Washington |
Clark, Andrew | Washington University in St. Louis |
Mertoguno, Sukarno | Georgia Institute of Technology |
Poovendran, Radha | University of Washington |
Keywords: Fault tolerant systems, Formal Verification/Synthesis, Resilient Control Systems
Abstract: Complex, interconnected Cyber-physical Systems (CPS) are increasingly common in applications including smart grids and transportation. Ensuring safety of interconnected systems whose dynamics are coupled is challenging because the effects of faults and attacks in one sub-system can propagate to other sub-systems and lead to safety violations. In this paper, we study the problem of safety-critical control for CPS with coupled dynamics when some sub-systems are subject to failure or attack. We first propose resilient-safety indices (RSIs) for the faulty or compromised sub-systems that bound the worst-case impacts of faulty or compromised sub-systems on a set of specified safety constraints. By incorporating the RSIs, we provide a sufficient condition for the synthesis of control policies in each failure- and attack- free sub-systems. The synthesized control policies compensate for the impacts of the faulty or compromised sub-systems to guarantee safety. We formulate sum-of-square optimization programs to compute the RSIs and the safety-ensuring control policies. We present a case study that applies our proposed approach on the temperature regulation of three coupled rooms. The case study demonstrates that control policies obtained using our algorithm guarantee system's safety constraints.
|
|
15:10-15:30, Paper TuBT07.6 | Add to My Program |
Ultra Local Nonlinear Unknown Input Observers for Robust Fault Reconstruction |
|
Ghanipoor, Farhad | Eindhoven University of Technology |
Murguia, Carlos | Eindhoven University of Technology |
Mohajerin Esfahani, Peyman | TU Delft |
Van De Wouw, Nathan | Eindhoven University of Technology |
Keywords: Fault diagnosis, Nonlinear systems, Optimization
Abstract: In this paper, we present a methodology for actuator and sensor fault estimation in nonlinear systems. The method consists of augmenting the system dynamics with an approximated ultra-local model (a finite chain of integrators) for the fault vector and constructing a Nonlinear Unknown Input Observer (NUIO) for the augmented dynamics. Then, fault reconstruction is reformulated as a robust state estimation problem in the augmented state (true state plus fault-related state). We provide sufficient conditions that guarantee the existence of the observer and stability of the estimation error dynamics (asymptotic stability of the origin in the absence of faults and ISS guarantees in the faulty case). Then, we cast the synthesis of observer gains as a semidefinite program where we minimize the L2-gain from the model mismatch induced by the approximated fault model to the fault estimation error. Finally, simulations are given to illustrate the performance of the proposed methodology.
|
|
TuBT08 Regular Session, Tulum Ballroom H |
Add to My Program |
Barrier Functions in Constrained Control |
|
|
Chair: Khorrami, Farshad | NYU Tandon School of Engineering |
Co-Chair: Panagou, Dimitra | University of Michigan, Ann Arbor |
|
13:30-13:50, Paper TuBT08.1 | Add to My Program |
Predictive Control Barrier Functions for Online Safety Critical Control |
|
Breeden, Joseph | University of Michigan, Ann Arbor |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Predictive control for nonlinear systems, Autonomous systems
Abstract: This paper presents a methodology for constructing Control Barrier Functions (CBFs) that proactively consider the future safety of a system along a nominal trajectory, and effect corrective action before the trajectory leaves a designated safe set. Specifically, this paper presents a systematic approach for propagating a nominal trajectory on a receding horizon, and then encoding the future safety of this trajectory into a CBF. If the propagated trajectory is unsafe, then a controller satisfying the CBF condition will modify the nominal trajectory before the trajectory becomes unsafe. Compared to existing CBF techniques, this strategy is proactive rather than reactive and thus potentially results in smaller modifications to the nominal trajectory. The proposed strategy is shown to be provably safe, and then is demonstrated in simulated scenarios where it would otherwise be difficult to construct a traditional CBF. In simulation, the predictive CBF results in less modification to the nominal trajectory and smaller control inputs than a traditional CBF, and faster computations than a nonlinear model predictive control approach.
|
|
13:50-14:10, Paper TuBT08.2 | Add to My Program |
Differentiable Predictive Control with Safety Guarantees: A Control Barrier Function Approach |
|
Shaw Cortez, Wenceslao | Pacific Northwest National Laboratory |
Drgona, Jan | Pacific Northwest National Laboratory |
Tuor, Aaron | Pacific Northwest National Laboratory |
Halappanavar, Mahantesh | Pacific Northwest National Laboratory |
Vrabie, Draguna | Pacific Northwest National Laboratory |
Keywords: Constrained control, Learning, Sampled-data control
Abstract: We develop a novel form of differentiable predictive control (DPC) with safety and robustness guarantees based on control barrier functions. DPC is an unsupervised learning-based method for obtaining approximate solutions to explicit model predictive control (MPC) problems. In DPC, the predictive control policy parametrized by a neural network is optimized offline via direct policy gradients obtained by automatic differentiation of the MPC problem. The proposed approach exploits a new form of sampled-data barrier function to enforce offline and online safety requirements in DPC settings while only interrupting the neural network-based controller near the boundary of the safe set. The effectiveness of the proposed approach is demonstrated in simulation.
|
|
14:10-14:30, Paper TuBT08.3 | Add to My Program |
Compatibility Checking of Multiple Control Barrier Functions for Input Constrained Systems |
|
Tan, Xiao | KTH Royal Institute of Technology |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Constrained control, Nonlinear systems, Computational methods
Abstract: State and input constraints are ubiquitous in control system design. One recently developed tool to deal with these constraints is control barrier functions (CBF) which transform state constraints into conditions in the input space. CBF-based controller design thus incorporates both the CBF conditions and input constraints in a quadratic program. However, the CBF-based controller is well-defined only if the CBF conditions are compatible. In the case of perturbed systems, robust compatibility is of relevance. In this work, we propose an algorithmic solution to verify or falsify the (robust) compatibility of given CBFs a priori. Leveraging the Lipschitz properties of the CBF conditions, a grid sampling and refinement method with theoretical analysis and guarantees is proposed.
|
|
14:30-14:50, Paper TuBT08.4 | Add to My Program |
Learning a Better Control Barrier Function |
|
Dai, Bolun | New York University |
Krishnamurthy, Prashanth | NYU Tandon School of Engineering |
Khorrami, Farshad | NYU Tandon School of Engineering |
Keywords: Constrained control, Learning, Optimal control
Abstract: Control barrier functions (CBFs) are widely used in safety-critical controllers. However, constructing a valid CBF is challenging, especially under nonlinear or non-convex constraints and for high relative degree systems. Meanwhile, finding a conservative CBF that only recovers a portion of the true safe set is usually possible. In this work, starting from a ``conservative" handcrafted CBF (HCBF), we develop a method to find a CBF that recovers a reasonably larger portion of the safe set. Since the learned CBF controller is not guaranteed to be safe during training iterations, we use a model predictive controller (MPC) to ensure safety during training. Using the collected trajectory data containing safe and unsafe interactions, we train a neural network to estimate the difference between the HCBF and a CBF that recovers a closer solution to the true safe set. With our proposed approach, we can generate safe controllers that are less conservative and computationally more efficient. We validate our approach on two systems: a second-order integrator and a ball-on-beam.
|
|
14:50-15:10, Paper TuBT08.5 | Add to My Program |
High-Order Control Barrier Function for Constraining Position in Motorized Rehabilitative Cycling |
|
Sweatland, Hannah | University of Florida |
Isaly, Axton | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Constrained control, Nonlinear systems, Healthcare and medical systems
Abstract: Control barrier functions (CBFs) have commonly been used to encode the safety requirements of a dynamical system and to constrain the control input to guarantee forward invariance of a safe set. High-order control barrier functions (HOCBFs) are a method of ensuring the safety of a system of high relative degree. The method developed in this paper can be applied to a variety of nonlinear systems with more general dynamics than previous works, and is demonstrated on a motorized rehabilitative cycle of relative degree two. A motor controller is designed to constrain the crank position to a time-varying user-defined safe range. Because of the uncertain and nonlinear dynamics of the system, robust control methods are borrowed from Lyapunov theory to develop worst-case controllers that render the intersection of a series of sets forward invariant. The controller is designed so that it provides minimal assistance within the safe range, maximizing the efforts of the rider and facilitating more effective therapy.
|
|
15:10-15:30, Paper TuBT08.6 | Add to My Program |
Safe and Robust Observer-Controller Synthesis Using Control Barrier Functions |
|
Agrawal, Devansh Ramgopal | University of Michigan |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Observers for nonlinear systems, Robust control
Abstract: This paper addresses the synthesis of safety-critical controllers using estimate feedback. We propose an observer-controller interconnection to ensure that the nonlinear system remains safe despite bounded disturbances on the system dynamics and measurements that correspond to partial state information. The co-design of observers and controllers is critical, since even in undisturbed cases, observers and controllers designed independently may not render the system safe. We propose two approaches to synthesize observer-controller interconnections. The first approach utilizes Input-to-State Stable observers, and the second uses Bounded Error observers. Using these stability and boundedness properties of the observation error, we construct novel Control Barrier Functions that impose inequality constraints on the control inputs which, when satisfied, certifies safety. We propose quadratic program-based controllers to satisfy these constraints, and prove Lipschitz continuity of the derived controllers. Simulations and experiments on a quadrotor demonstrate the efficacy of the proposed methods.
|
|
TuBT09 Regular Session, Maya Ballroom I |
Add to My Program |
Network Analysis and Control I |
|
|
Chair: Yamamoto, Kaoru | Kyushu University |
Co-Chair: Park, Shinkyu | KAUST |
|
13:30-13:50, Paper TuBT09.1 | Add to My Program |
On Separation of Distributed Estimation and Control for LTI Systems |
|
Savas, Anthony | Princeton University |
Park, Shinkyu | KAUST |
Poor, H. Vincent | Princeton Univ |
Leonard, Naomi Ehrich | Princeton University |
Keywords: Network analysis and control, Distributed control, Cooperative control
Abstract: The separation principle in a centralized estimation and control problem gives us the flexibility to design a feedback controller independent of the state estimator. However, the same principle does not hold when the estimation and control are distributed over a network of agents. In this case, the estimator may need to be redesigned when the controller is revised, which can be computationally expensive. We investigate a weaker notion of the separation principle in the distributed estimation and control of linear time-invariant (LTI) systems. As a main contribution, applying the small-gain theorem, we characterize the notion using matrix inequalities and compute a set of feedback controllers that agents in the network can adopt without redesigning the estimator. We also analyze how the frequency of information exchange between neighboring agents affects the characterization. We illustrate our analytical results through simulations of a multi-vehicle system problem.
|
|
13:50-14:10, Paper TuBT09.2 | Add to My Program |
Plug-And-Play Network Reconfiguration Algorithms to Maintain Regularity and Low Network Reconfiguration Needs |
|
Stuedli, Sonja | The University of Newcastle |
Yan, Yamin | The Hong Kong University of Science and Technology |
Seron, Maria M. | The University of Newcastle |
Middleton, Richard | The University of Newcastle |
Keywords: Networked control systems, Communication networks, Control of networks
Abstract: In this paper we propose algorithms to connect and disconnect nodes to/from an existing graph, which are able to maintain the regularity of a graph while requiring minimal reconfiguration of the network. Further, simulations suggest that the algorithms maintain a minimum bound on the algebraic connectivity of the graph. These properties of the algorithms suit their use in plug-and-play networks.
|
|
14:10-14:30, Paper TuBT09.3 | Add to My Program |
Algebraic Connectivity of Layered Path Graphs under Node Deletion |
|
Yoshise, Ryusei | Kyushu University |
Yamamoto, Kaoru | Kyushu University |
Keywords: Cooperative control, Distributed control, Networked control systems
Abstract: This paper studies the relation between node deletion and algebraic connectivity for graphs with a hierarchical structure represented by layers. To capture this structure, the concepts of layered path graph and its (sub)graph cone are introduced. The problem is motivated by a mobile robot formation control guided by a leader. In particular, we consider a scenario in which robots may leave the network resulting in the removal of the nodes and the associated edges. We show that the existence of at least one neighbor in the upper layer is crucial for the algebraic connectivity not to deteriorate by node deletion.
|
|
14:30-14:50, Paper TuBT09.4 | Add to My Program |
Inferring Topology of Networked Dynamical Systems by Active Excitations |
|
Li, Yushan | Shanghai Jiao Tong University |
He, Jianping | Shanghai Jiao Tong University |
Chen, Cailian | Shanghai Jiao Tong University |
Guan, Xin-Ping | Shanghai Jiao Tong University |
Keywords: Agents-based systems, Network analysis and control, Autonomous systems
Abstract: Topology inference for networked dynamical systems (NDSs) has received considerable attention in recent years. The majority of pioneering works have dealt with inferring the topology from abundant observations of NDSs, so as to approximate the real one asymptotically. Leveraging the characteristic that NDSs will react to various disturbances and the disturbance’s influence will consistently spread, this paper focuses on inferring the topology by a few active excitations. The key challenge is to distinguish different influences of system noises and excitations from the exhibited state deviations, where the influences will decay with time and the exciatation cannot be arbitrarily large. To practice, we propose a one-shot excitation based inference method to infer h-hop neighbors of a node. The excitation conditions for accurate one-hop neighbor inference are first derived with probability guarantees. Then, we extend the results to h-hop neighbor inference and multiple excitations cases, providing the explicit relationships between the inference accuracy and excitation magnitude. Specifically, the excitation based inference method is not only suitable for scenarios where abundant observations are unavailable, but also can be leveraged as auxiliary means to improve the accuracy of existing methods. Simulations are conducted to verify the analytical results.
|
|
14:50-15:10, Paper TuBT09.5 | Add to My Program |
Three Time Scales Modeling of the Undirected Clustered Network |
|
Adhikari, Bikash | University of Lorraine |
Panteley, Elena | CNRS |
Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Keywords: Agents-based systems, Decentralized control, Linear systems
Abstract: Dynamics evolving on different time scales appear naturally in networks of clusters. In this paper, we emphasize that the synchronization of interconnected agents partitioned in clusters involves three time-scales. Using a coordinate transformation, we reformulate the overall closed-loop clustered network dynamics in new coordinates describing the mean-field dynamics, the intra-cluster and the inter-cluster synchronization error dynamics. Then, under realistic assumptions, we show that the network dynamics can be represented in a two-parameter standard singular perturbation form in the new coordinate. The mean-field dynamics, which is the network’s long-term behavior, evolve on the slowest time-scale. The intra-cluster error dynamics, which characterizes the synchronization inside clusters, evolve on the fastest time scale. Finally, the inter-cluster error dynamics, which characterizes the synchronization between clusters, is fast with respect to the mean-field one and slow with respect to the intra-cluster one. Using the two-parameter singular perturbation form, we establish approximations of the variables using standards techniques based on Tychonoff’s theorem. Numerical simulations validate our theoretical results.
|
|
15:10-15:30, Paper TuBT09.6 | Add to My Program |
On Necessary and Sufficient Conditions for Identifiability and Identification of Switching Dynamical Networks |
|
Sun, Weiyang | Zhejiang University |
Xu, Jinming | Zhejiang University |
Chen, Jiming | Zhejiang University |
Keywords: Networked control systems, Network analysis and control
Abstract: We consider the identifiability and identification problem of a dynamical network modeled by a switched linear dynamical system governed by a sequence of switchings. We aim at exactly reconstructing the network sequence along with the switching signal from the state trajectories of all nodes. The identification of a switching dynamical network is more challenging than that of a time-invariant dynamical network in that the data is generated by a number of different subsystems at different periods. By observability conditions, we first establish a necessary and sufficient condition for the identifiability of switching dynamical networks. Building on this, a sufficient identification condition is developed leveraging coupled Lyapunov-transpose matrix equations, which allows us to design a new algorithm that can exactly recover the whole network sequence, even if the switching signal is completely unknown a prior. Finally, a numerical example is provided to verify the effectiveness of the theoretical results.
|
|
TuBT10 Regular Session, Maya Ballroom II |
Add to My Program |
Stochastic Systems II |
|
|
Chair: Satheeskumar Varma, Vineeth | CNRS |
Co-Chair: Materassi, Donatello | University of Minnesota |
|
13:30-13:50, Paper TuBT10.1 | Add to My Program |
Stability Analysis of Socially Inspired Adaptive Voter Model |
|
Kravitzch, Emmanuel | Avignon Université, Computer Sciences (LIA UAPV) |
Hayel, Yezekael | University of Avignon |
Satheeskumar Varma, Vineeth | CNRS |
Berthet, Antoine O. | L2S Centrale-Supélec |
Keywords: Stochastic systems, Agents-based systems, Network analysis and control
Abstract: In this work, we study an instance of continuous time voter model over directed graphs on social networks with a specific refinement: the agents can break or create new links in the graph. The edges of the graph thus coevolve with the agents’ spin. Specifically, the agents may break their links with neighbours of different spin, and create links with the neighbours of their neighbours (2-hop neighbours), provided they have same spin. We characterize the absorbing configurations and present a particular case that corresponds to a single agent facing two antagonistic ideologies. By asymptotic analysis, we observe two regimes depending on the parameters: in one regime, hesitation disappears rapidly, while when the link creation rate is high enough, slow extinction or metastability occurs. We compute the threshold value and illustrate these results with numerical simulations.
|
|
13:50-14:10, Paper TuBT10.2 | Add to My Program |
Score-And-Search Methods for the Recovery of Structure in Networks of Dynamic Systems |
|
Biparva, Darya | University of Minnesota |
Materassi, Donatello | University of Minnesota |
Keywords: Stochastic systems, Identification, Estimation
Abstract: A relevant problem in many areas of science is to determine the structure of a network by observing its nodes. A desirable property of any network reconstruction techniques is consistency, namely the convergence of the reconstructed network to the actual structure when the time horizon of the observations goes to infinity. Unfortunately, when feedthrough components are present in the network, multiple structures could give rise to the same observations. Hence, in these situations, the best theoretical result that can be achieved is the determination of all the possible structures compatible with what is being observed. There are some results offering such theoretical guarantees, but these methods rely on a large number of statistical tests making their sample complexity relatively large. This article proposes the adoption of reconstruction techniques where, given the observed data, each structure is evaluated according to a score representing the likelihood that such structure is the actual one. Such a technique is proven to have the same consistency properties of state-of-the-art methods based on statistical tests, while numerical experiments show it to have a lower sample complexity.
|
|
14:10-14:30, Paper TuBT10.3 | Add to My Program |
Recursively Feasible Model Predictive Control Using Latent Force Models Applied to Disturbed Quadcopters (I) |
|
Gruner, Jonas | Universität Zu Lübeck |
Schmid, Niklas | ETH Zürich |
Männel, Georg | Fraunhofer IMTE |
Graßhoff, Jan | Fraunhofer Research Institution for Individualized and Cell-Base |
Abbas, Hossam | University of Lübeck |
Rostalski, Philipp | University of Luebeck |
Keywords: Predictive control for linear systems, Constrained control, Stochastic optimal control
Abstract: In this work, recursively feasible model predictive control (MPC) is considered for systems under additive disturbances. Combining a nonparametric Gaussian Process (GP) prior for modeling the additive disturbance with the model of the undisturbed system results in a model structure referred to as latent force model (LFM). Using spectral factorization, the whole LFM can be represented by an equivalent/approximate stochastic state-space model used as the predictor in the MPC formulation. Chance constraints are incorporated by constraint tightening using so-called probabilistic reachable sets of the LFM state and recursive feasibility is guaranteed by optimizing the initial value of the MPC predicted trajectory. The LFM formulation allows leveraging the disturbance information to all components of the MPC, which can significantly enhance its performance. The proposed LFM-based MPC approach is demonstrated on a simulated quadcopter under additive disturbances. The performance of the closed-loop controller using the LFM-state-space reformulation is compared to standard MPC.
|
|
14:30-14:50, Paper TuBT10.4 | Add to My Program |
Almost Sure Stability of Stochastic Switched Systems: Graph Lifts-Based Approach |
|
Della Rossa, Matteo | UCLouvain |
Jungers, Raphaël M. | University of Louvain |
Keywords: Stochastic systems, Switched systems, Automata
Abstract: In this paper, we develop tools to establish almost sure stability of stochastic switched systems whose switching signal is constrained by an automaton. After having provided the necessary generalizations of existing results in the setting of stochastic graphs, we provide a characterization of almost sure stability in terms of multiple Lyapunov functions. We introduce the concept of lifts, providing formal expansions of stochastic graphs, together with the guarantee of conserving the underlying probability framework. We show how these techniques, firstly introduced in the deterministic setting, provide hierarchical methods in order to compute tight upper bounds for the almost sure decay rate. The theoretical developments are finally illustrated via a numerical example.
|
|
14:50-15:10, Paper TuBT10.5 | Add to My Program |
Performance Analysis of Least Squares of Continuous-Time Model Based on Sampling Data |
|
Zhu, Xinghua | Chinese Academy of Sciences |
Gan, Die | Chinese Academy of Science |
Liu, Zhixin | Academy of Mathematics and Systems Science, ChineseAcademyof Scie |
Keywords: Stochastic systems, Identification, Lyapunov methods
Abstract: In this letter, we consider the parameter estimation problem of continuous-time linear stochastic regression models described by stochastic differential equations using the sampling data. An online least squares (LS) algorithm is proposed by minimizing the accumulative prediction error at discrete sampling time instants. By employing both the stochastic Lyapunov function and martingale estimate methods, we establish the convergence analysis of the proposed algorithm under conditions of the excitation of the sampling data and the sampling time interval. We also provide the upper bound of the accumulative regret for the adaptive predictor. A simulation example is given to verify our theoretical results.
|
|
15:10-15:30, Paper TuBT10.6 | Add to My Program |
A Hybrid Observer for Practical Observability of Linear Stochastic Systems |
|
Gong, Zilong | Imperial College London |
Scarciotti, Giordano | Imperial College London |
Keywords: Stochastic systems, Observers for Linear systems, Stability of hybrid systems
Abstract: This paper focuses on the problem of observability and observer design for linear stochastic systems. To introduce our idea, we first construct an idealistic observer. This idealistic observer is not causal as it requires perfect knowledge of the Brownian motion. However, after introducing an a posteriori method to reconstruct the variations of the Brownian motion in discrete time, we propose a realistic hybrid observer which approximates the idealistic observer. The performance of this hybrid observer can be made arbitrarily close to that of the idealistic observer. Numerical simulations illustrate the results of the paper.
|
|
TuBT11 Regular Session, Maya Ballroom III |
Add to My Program |
Distributed Parameter Systems II |
|
|
Chair: Marx, Swann | LS2N |
Co-Chair: Mora, Luis | University of Waterloo |
|
13:30-13:50, Paper TuBT11.1 | Add to My Program |
Operator-Valued Kernels and Control of Infinite Dimensional Dynamic Systems |
|
Aubin-Frankowski, Pierre-Cyril | INRIA |
Bensoussan, Alain | UTD University of Texas at Dallas |
Keywords: Optimal control, Time-varying systems, Machine learning
Abstract: The Linear Quadratic Regulator (LQR), which is arguably the most classical problem in control theory, was recently related to kernel methods in cite{aubin2020hard_control} for finite dimensional systems. We show the result extends to infinite dimensional systems, i.e. control of linear partial differential equations. The quadratic objective paired with the linear dynamics encode the relevant kernel, defining a Hilbert space of controlled trajectories, for which we obtain a concise formula based on the solution of the differential Riccati equation. This paves the way to applying representer theorems from kernel methods to solve infinite dimensional optimal control problems.
|
|
13:50-14:10, Paper TuBT11.2 | Add to My Program |
Sampled-Data Finite-Dimensional Observer-Based Boundary Control of 1D Stochastic Parabolic PDEs |
|
Wang, Pengfei | Tel Aviv University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Distributed parameter systems, Stochastic systems, LMIs
Abstract: Recently, finite-dimensional controllers were introduced for 1D stochastic parabolic PDEs via the modal decomposition method. In the present paper, we suggest a sampled-data implementation of a finite-dimensional observer-based boundary controller for 1D stochastic parabolic PDEs under discrete-time non-local measurement, where both the considered system and the measurement are subject to nonlinear multiplicative noise. We provide mean-square L2 exponential stability analysis of the full-order closed-loop system, where we employ It^{o}'s formula. We consider sampled-data control that employs zero-order hold device and use the time-delay approach to the sampled-data system. We construct Lyapunov functional V, and further combine it with Halanay's inequality with respect to expected value of V to compensate sampling in the infinite-dimensional tail. We provide linear matrix inequalities (LMIs) for finding the observer dimension N and upper bounds on sampling intervals and noise intensities that preserve the exponential stability. We prove that the LMIs are always feasible for large enough N and small enough sampling intervals and noise intensities. A numerical example demonstrates the efficiency of our method.
|
|
14:10-14:30, Paper TuBT11.3 | Add to My Program |
Traffic Flow Control at Signalized Intersections Using Signal Spatio-Temporal Logic |
|
Patil, Sagar | National Institute of Informatics |
Hashimoto, Kazumune | Osaka University |
Kishida, Masako | National Institute of Informatics |
Keywords: Traffic control, Formal Verification/Synthesis, Predictive control for nonlinear systems
Abstract: This work investigates traffic signal optimization to improve mobility and safety of traffic flow at signalized intersections. The existing pre-timed and actuated control techniques are incapable of optimally using traffic data in achieving congestion-free traffic flow. To overcome this limitation, we propose a model predictive control (MPC) with the signal spatio-temporal logic (SSTL) from formal methods to utilize traffic data. For this, we construct a cell transmission traffic model and describe traffic mobility and safety specifications using SSTL. Then to optimize traffic signal timings in this setting, we formulate an optimization problem with an objective to minimize traffic congestion subject to constraints given as SSTL specifications. A numerical example is provided to illustrate the traffic flow control using MPC with SSTL.
|
|
14:30-14:50, Paper TuBT11.4 | Add to My Program |
A Sensory Feedback Control Law for Octopus Arm Movements |
|
Wang, Tixian | University of Illinois at Urbana-Champaign |
Halder, Udit | University of Illinois at Urbana Champaign |
Gribkova, Ekaterina | University of Illinois, Urbana-Champaign |
Gillette, Rhanor | University of Illinois, Urbana-Champaign |
Gazzola, Mattia | University of Illinois at Urbana-Champaign |
Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Keywords: Biological systems, Nonlinear systems, Nonlinear output feedback
Abstract: The main contribution of this paper is a novel sensory feedback control law for an octopus arm. The control law is inspired by, and helps integrate, several observations made by biologists. The proposed control law is distinct from prior work which has mainly focused on open-loop control strategies. Several analytical results are described including characterization of the equilibrium and its stability analysis. Numerical simulations demonstrate life-like motion of the soft octopus arm, qualitatively matching behavioral experiments. Quantitative comparison with bend propagation experiments helps provide the first explanation of such canonical motion using a sensory feedback control law. Several remarks are included that help draw parallels with natural pursuit strategies such as motion camouflage or classical pursuit.
|
|
14:50-15:10, Paper TuBT11.5 | Add to My Program |
Learning Linear Feedback Controllers for Suppressing the Vortex-Shedding Flow past a Cylinder |
|
Jussiau, William | ONERA - the French Aerospace Lab |
Leclercq, Colin | ONERA |
Demourant, Fabrice | Onera |
Apkarian, Pierre | ONERA - the French Aerospace Lab |
Keywords: Fluid flow systems, Optimization, Robust control
Abstract: In this paper, we propose a method to learn linear controllers that completely suppress the vortex-shedding regime past a cylinder at Reynolds number 100, using a single actuator and sensor pair in a feedback configuration. The method is based on the Youla parametrization for guaranteeing closed-loop stability in a neighbourhood of the fixed point. Among the set of controllers stabilizing the fixed point, a derivative-free optimization algorithm searches for a controller suppressing the natural limit cycle oscillation. We show existence of simple single input, single output LTI controllers driving the full vortex shedding regime to the natural unstable base flow for infinite dimension, highly nonlinear Navier-Stokes dynamics.
|
|
15:10-15:30, Paper TuBT11.6 | Add to My Program |
Super-Twisting Sliding Mode Control for the Stabilization of a Linear Hyperbolic System |
|
Balogoun, Ismaïla | Laboratoire Des Sciences Du Numériques De Nantes |
Marx, Swann | LS2N |
Liard, Thibault | LS2N, École Centrale De Nantes |
Plestan, Franck | Ecole Centrale De Nantes-LS2N |
Keywords: Distributed parameter systems
Abstract: This paper deals with the stabilization of a linear hyperbolic system subject to a boundary disturbance. Our feedback design relies on a super-twisting control algorithm, which leads to a feedback that is continuous with respect to the state, in contrast with the classical sliding mode design. Our first result is the existence of solutions of the closed-loop system. Moreover, the global asymptotic stability, that is our second result, is proved together with the guarantee that the disturbance is rejected
|
|
TuBT12 Tutorial Session, Maya Ballroom IV |
Add to My Program |
Recent Advances in Data-Driven Control: Concepts, Theory, and Applications |
|
|
Chair: Bazanella, Alexandre S. | Univ. Federal Do Rio Grande Do Sul |
Co-Chair: Allgöwer, Frank | University of Stuttgart |
|
13:30-13:50, Paper TuBT12.1 | Add to My Program |
An Introduction to Data-Driven Control, from Kernels to Behaviors (I) |
|
Bazanella, Alexandre S. | Univ. Federal Do Rio Grande Do Sul |
Campestrini, Luciola | University of Rio Grande Do Sul |
Eckhard, Diego | Universidade Federal Do Rio Grande Do Sul |
Keywords: Identification for control, Statistical learning, Adaptive control
Abstract: Classical control theory deals with input-output descriptions of plants and controllers, a.k.a. kernel representations. A consistent Data-Driven control theory has been developed for linear systems using these representations. This theory is centered around the model reference design paradigm and is founded upon the classical tools of system identification - i.e. prediction error and correlation based identification. Data-driven design of state feedback controllers and of implicit control laws, on the other hand, has found a perfect fit in the behavioral description of plants and controllers. Subspace identification and predictive control became the foundations of design and analysis methodologies arising in this context. In this paper we will summarize the fundamentals of the `classical' data-driven control theory, describe some of the applications that it has encountered and point to the challenges still ahead in this approach. Next we'll motivate the behavioral approach to Data-Driven control, and highlight an important result that has been proven both for kernels and for behaviors: the conceptual conditions under which `direct' data-driven design outperforms `indirect' design - i.e. data-driven identification plus model-based design.
|
|
13:50-14:10, Paper TuBT12.2 | Add to My Program |
A Tutorial on the Informativity Framework for Data-Driven Control (I) |
|
van Waarde, Henk J. | University of Groningen |
Eising, Jaap | University of Groningen |
Camlibel, M. Kanat | University of Groningen |
Trentelman, Harry L. | Univ. of Groningen |
Keywords: Identification for control, Robust control, Behavioural systems
Abstract: The purpose of this paper is to provide a tutorial on the so-called informativity framework for direct data-driven control. This framework views data-driven analysis and design through the lens of robust control, and aims at assessing system properties and determining controllers for sets of systems unfalsified by the data. We will first introduce the informativity approach at an abstract level. Thereafter, we will study several case studies where we highlight the strength of the approach in the context of stabilizability analysis and stabilizing feedback design in different setups involving exact and noisy data, and for both input-state and input-output measurements. Finally, we provide an account of other applications of the data informativity framework.
|
|
14:10-14:30, Paper TuBT12.3 | Add to My Program |
On the Role of Regularization in Direct Data-Driven LQR Control (I) |
|
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Tesi, Pietro | University of Florence |
De Persis, Claudio | University of Groningen |
Keywords: Optimal control, Identification for control, Behavioural systems
Abstract: The linear quadratic regulator (LQR) problem is a cornerstone of control theory and a widely studied benchmark problem. When a system model is not available, the conventional approach to LQR design is indirect, i.e., based on a model identified from data. Recently a suite of direct data- driven LQR design approaches has surfaced by-passing explicit system identification (SysID) and based on ideas from subspace methods and behavioral systems theory. In either approach, the data underlying the design can be taken at face value (certainty- equivalence) or the design is robustified to account for noise. An emerging topic in direct data-driven LQR design is to regularize the optimal control objective to account for implicit SysID (in a least-square or low-rank sense) or to promote robust stability. These regularized formulations are flexible, computationally attractive, and theoretically certifiable; they can interpolate between direct vs. indirect and certainty-equivalent vs. robust approaches; and they can be blended resulting in remarkable empirical performance. This manuscript reviews and compares different approaches to regularized direct data-driven LQR.
|
|
14:30-14:50, Paper TuBT12.4 | Add to My Program |
Offset-Free Data-Driven Predictive Control (I) |
|
Lazar, Mircea | Eindhoven University of Technology |
Verheijen, Peter | Eindhoven University of Technology |
Keywords: Predictive control for linear systems, Robust control, Control software
Abstract: This paper presents a tutorial overview of model predictive control (MPC), subspace predictive control (SPC) and data-enabled predictive control (DeePC), with emphasis on offset-free design. Based on recent results on offset-free SPC design and on the relation between SPC and DeePC, we show that when incremental inputs are used in the identification step (SPC) or in the Hankel matrix (DeePC), the resulting data-driven predictive controllers are offset-free. Moreover, they are equivalent with offset-free MPC in the deterministic case. For noisy data the equivalence does not hold and a regularized version of the DeePC algorithm is necessary. We show that regularized DeePC can be formulated as a Tikhonov regularized least squares problem. This enables systematic tuning of the regularization parameter. We compare the performance of the offset-free data-driven predictive controllers in an application example from high-precision mechatronics.
|
|
14:50-15:10, Paper TuBT12.5 | Add to My Program |
Stability in Data-Driven MPC: An Inherent Robustness Perspective (I) |
|
Berberich, Julian | University of Stuttgart |
Köhler, Johannes | ETH Zurich |
Muller, Matthias A. | Leibniz University Hannover |
Allgöwer, Frank | University of Stuttgart |
Keywords: Predictive control for linear systems, Uncertain systems, Optimal control
Abstract: Data-driven model predictive control (DD-MPC) based on Willems' Fundamental Lemma has received much attention in recent years, allowing to control systems directly based on an implicit data-dependent system description. The literature contains many successful practical applications as well as theoretical results on closed-loop stability and robustness. In this paper, we provide a tutorial introduction to DD-MPC for unknown linear time-invariant (LTI) systems with focus on (robust) closed-loop stability. We first address the scenario of noise-free data, for which we present a DD-MPC scheme with terminal equality constraints and derive closed-loop properties. In case of noisy data, we introduce a simple yet powerful approach to analyze robust stability of DD-MPC by combining continuity of DD-MPC w.r.t. noise with inherent robustness of model-based MPC, i.e., robustness of nominal MPC w.r.t. small disturbances. Moreover, we discuss how the presented proof technique allows to show closed-loop stability of a variety of DD-MPC schemes with noisy data, as long as the corresponding model-based MPC is inherently robust.
|
|
15:10-15:30, Paper TuBT12.6 | Add to My Program |
Lessons Learned from Data-Driven Building Control Experiments: Contrasting Gaussian Process-Based MPC, Bilevel DeePC, and Deep Reinforcement Learning (I) |
|
Di Natale, Loris | Empa / EPFL |
Lian, Yingzhao | EPFL |
Maddalena, Emilio | École Polytechnique Fédérale De Lausanne |
Shi, Jicheng | École Polytechnique Fédérale De Lausanne |
Jones, Colin N. | EPFL |
Keywords: Building and facility automation, Learning, Smart cities/houses
Abstract: This manuscript offers the perspective of experimentalists on a number of modern data-driven techniques: model predictive control relying on Gaussian processes, adaptive data-driven control based on behavioral theory, and deep reinforcement learning. These techniques are compared in terms of data requirements, ease of use, computational burden, and robustness in the context of real-world applications. Our remarks and observations stem from a number of experimental investigations carried out in the field of building control in diverse environments, from lecture halls and apartment spaces to a hospital surgery center. The final goal is to support others in identifying what technique is best suited to tackle their own problems.
|
|
TuBT13 Regular Session, Maya Ballroom V |
Add to My Program |
Predictive Control for Linear Systems II |
|
|
Chair: Maestre, Jose Maria (Pepe) | University of Seville |
Co-Chair: Jiang, Yuning | EPFL |
|
13:30-13:50, Paper TuBT13.1 | Add to My Program |
Parallel MPC for Linear Systems with State and Input Constraints |
|
Shi, Jiahe | ShanghaiTech University |
Jiang, Yuning | EPFL |
Oravec, Juraj | Slovak University of Technology in Bratislava |
Houska, Boris | ShanghaiTech University |
Keywords: Predictive control for linear systems, Distributed control
Abstract: This paper proposes a parallelizable algorithm for linear-quadratic model predictive control (MPC) problems with state and input constraints. The algorithm itself is based on a parallel MPC scheme that has originally been designed for systems with input constraints. In this context, one contribution of this paper is the construction of time-varying yet separable constraint margins ensuring recursive feasibility and asymptotic stability of sub-optimal parallel MPC in a general setting, which also includes state constraints. Moreover, it is shown how to tradeoff online run-time guarantees versus the conservatism that is introduced by the tightened state constraints. The corresponding performance of the proposed method as well as the cost of the recursive feasibility guarantees is analyzed in the context of controlling a large-scale mechatronic system. This is illustrated by numerical experiments for a large-scale control system with more than 100 states and 60 control inputs leading to run-times in the millisecond range.
|
|
13:50-14:10, Paper TuBT13.2 | Add to My Program |
Tree-Based Model Predictive Control Strategy for Software Rejuvenation |
|
Arauz, Teresa | University of Seville |
Maestre, Jose Maria (Pepe) | University of Seville |
Quevedo, Daniel E. | Queensland University of Technology |
Camacho, Eduardo F. | Univ. of Sevilla |
Keywords: Predictive control for linear systems, Cyber-Physical Security, Resilient Control Systems
Abstract: Software rejuvenation is a cyberdefense mechanism that periodically resets the mission control software of a system to limit the impact of cyberattacks, activating in case it is necessary a safety controller to guarantee that the system returns to a target safe set. Unlike other works that consider feedback gains as mission controllers, here we propose a tree-based model predictive control scheme to explicitly account for the software refresh events and the possibility of cyberattacks. The benefits of the proposed method are illustrated using a simulated microgrid as a case study.
|
|
14:10-14:30, Paper TuBT13.3 | Add to My Program |
Reconfigurable Plug-And-Play Distributed Model Predictive Control for Reference Tracking |
|
Aboudonia, Ahmed | ETH Zurich |
Martinelli, Andrea | ETH Zurich |
Hoischen, Nicolas | ETH Zurich |
Lygeros, John | ETH Zurich |
Keywords: Predictive control for linear systems, Distributed control
Abstract: A plug-and-play model predictive control (PnP MPC) scheme is proposed for varying-topology networks to track piecewise constant references. The proposed scheme allows subsystems to occasionally join and leave the network while preserving asymptotic stability and recursive feasibility and comprises two main phases. In the redesign phase, passivity-based control is used to ensure that asymptotic stability of the network is preserved. In the transition phase, reconfigurable terminal ingredients are used to ensure that the distributed MPC problem is initially feasible after the PnP operation. The efficacy of the proposed scheme is evaluated by applying it to a network of mass-spring-damper systems and comparing it to a benchmark scheme. It is found that the novel redesign phase results in faster PnP operations, whereas the novel transition phase increases flexibility by accepting more requests.
|
|
14:30-14:50, Paper TuBT13.4 | Add to My Program |
Finite-Horizon Minimal Realizations for Model Predictive Control of Large-Scale Systems |
|
Meijer, Tomas Jesse | Eindhoven University of Technology |
Nouwens, S.A.N. | Eindhoven University of Technology |
Dolk, Victor Sebastiaan | Eindhoven University of Technology |
de Jager, Bram | Technische Universiteit Eindhoven |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Predictive control for linear systems, Model/Controller reduction, Large-scale systems
Abstract: In model predictive control (MPC) for large-scale applications, the computational limitations for on-line optimization often lead to the use of (relatively) short prediction horizons. In this paper, we show that as a result, the controller optimizes over only a fraction of the dynamics of the large-scale system. Based on this observation, which we will formalize, we propose a method to construct reduced-order models of minimal order, by exploiting the system-theoretic concept of finite-horizon observability, that exactly match the response of the large-scale system within a finite horizon. These so-called finite-horizon minimal realizations are used to implement equivalent MPC schemes with reduced computational effort (or the same computational effort but with a larger prediction horizon) without sacrificing accuracy/performance (as the equivalent optimization problem has the same optimizers as the original MPC problem). By computing finite-horizon minimal realizations, we can determine the dynamics as “seen” by the MPC, which can provide useful design insights, in particular, when tuning the prediction horizon. We demonstrate the strengths of our results in a numerical case study.
|
|
14:50-15:10, Paper TuBT13.5 | Add to My Program |
Safe and Efficient Model Predictive Control Using Neural Networks: An Interior Point Approach |
|
Tabas, Daniel | University of Washington |
Zhang, Baosen | University of Washington |
Keywords: Predictive control for linear systems, Neural networks, Constrained control
Abstract: Model predictive control (MPC) provides a useful means for controlling systems with constraints, but suffers from the computational burden of repeatedly solving an optimization problem in real time. Offline (explicit) solutions for MPC attempt to alleviate real time computational challenges using either multiparametric programming or machine learning. The multiparametric approaches are typically applied to linear or quadratic MPC problems, while learning-based approaches can be more flexible and are less memory-intensive. Existing learning-based approaches offer significant speedups, but the challenge becomes ensuring constraint satisfaction while maintaining good performance. In this paper, we provide a neural network parameterization of MPC policies that explicitly encodes the constraints of the problem. By exploring the interior of the MPC feasible set in an unsupervised learning paradigm, the neural network finds better policies faster than projection-based methods and exhibits substantially shorter solve times. We use the proposed policy to solve a robust MPC problem, and demonstrate the performance and computational gains on a standard test system.
|
|
15:10-15:30, Paper TuBT13.6 | Add to My Program |
Robust Risk-Aware Model Predictive Control of Linear Systems with Bounded Disturbances |
|
Gao, Yulong | The Royal Institute of Technology (KTH) |
Liu, Changxin | KTH Royal Institute of Technology |
Johansson, Karl H. | Royal Institute of Technology |
Keywords: Predictive control for linear systems, Robust control, Optimization
Abstract: We propose a robust risk-aware model predictive control (MPC) algorithm for linear discrete-time systems with bounded disturbances. The MPC problem is formulated to maximize the size of the predicted disturbance sets (which are subsets of the maximal disturbance sets) at each time step, under state and control constraints and a reachability specification. It is shown that the proposed scheme has the following properties: i) its feasible state set (i.e., the set of states starting from which the MPC problem is feasible) is as large as that of the MPC problem for the corresponding undisturbed system; ii) it maintains recursive feasibility if the conventional robust MPC problem is feasible. The proposed controller enlarges the feasible state set, at the expense of the risk of possible constraint violation that is quantified by the optimal solution of the problem. When the sets are represented as zonotopes, we further provide a computationally tractable reformulation and design an online implementation algorithm with adaptive prediction horizon. We illustrate the effectiveness of the proposed methods using a simulated example.
|
|
TuBT14 Regular Session, Maya Ballroom VI |
Add to My Program |
Control Applications II |
|
|
Chair: Tanaka, Takashi | University of Texas at Austin |
Co-Chair: Massaro, Matteo | Università Degli Studi Di Padova |
|
13:30-13:50, Paper TuBT14.1 | Add to My Program |
A Gain-Scheduled Robust H_{infty} Control for a Mixed Traffic System Travelling at Different Desired Speeds in the Presence of Delay |
|
Mousavi, Shima Sadat | ETH Zurich |
Bahrami, Somayeh | Razi University |
Kouvelas, Anastasios | ETH Zurich |
Keywords: Traffic control, Robust control
Abstract: In this paper, a mixed platoon on an open-road is studied, that includes one connected and automated vehicle (CAV) and multiple human-driven vehicles (HDVs). We represent the car-following behavior of HDVs by the optimal velocity model (OVM), while considering the driver reaction time as a delay in their dynamics. Also, we assume that the changes of traffic conditions result in a time-varying desired velocity. Considering a varying desired speed, reaction time delay, and parametric uncertainties in the HDV dynamics, a gain-scheduled robust H_{infty} control strategy is developed, that can stabilize the traffic flow in the presence of undesired perturbations. We indicate the efficiency of the proposed static output-feedback controller through simulation results.
|
|
13:50-14:10, Paper TuBT14.2 | Add to My Program |
On Differential Privacy and Traffic State Estimation Problem for Connected Vehicles |
|
Vishnoi, Suyash | The University of Texas at Austin |
Taha, Ahmad | Vanderbilt University |
Nugroho, Sebastian Adi | University of Michigan - Ann Arbor |
Claudel, Christian G. | UT Austin |
Keywords: Transportation networks, Traffic control, Estimation
Abstract: This letter focuses on the problem of traffic state estimation for highway networks with junctions in the form of on- and off-ramps while maintaining differential privacy of traffic data. Two types of sensors are considered, fixed sensors such as inductive loop detectors and connected vehicles which provide traffic density and speed data. The celebrated nonlinear second-order Aw-Rascle-Zhang (ARZ) model is utilized to model the traffic dynamics. The model is formulated as a nonlinear state-space difference equation. Sensitivity relations are derived for the given data which are then used to formulate a differentially private mechanism which adds a Gaussian noise to the data to make it differentially private. A Moving Horizon Estimation (MHE) approach is implemented for traffic state estimation using a linearized ARZ model. MHE is compared with Kalman Filter variants namely Extended Kalman Filter, Ensemble Kalman Filter and Unscented Kalman Filter. Several research and engineering questions are formulated and analysis is performed to find corresponding answers.
|
|
14:10-14:30, Paper TuBT14.3 | Add to My Program |
The Optimal Trajectory of Road Vehicles on Straights |
|
Massaro, Matteo | Università Degli Studi Di Padova |
Lovato, Stefano | Università Degli Studi Di Padova |
Keywords: Optimal control, Automotive control, Optimization
Abstract: The conventional wisdom is that the minimum-time trajectory of road vehicles on straights is `straight'. Actually, this is not always the case, at least for two-wheeled vehicles. Indeed, on four-wheeled vehicles the maximum accelerations are mainly limited by tyre-road friction and engine power. On the contrary, on two-wheeled vehicles the maximum accelerations are also limited by wheelie (front tyre lift) and stoppie (rear tyre lift). As a consequence, the maximum performance is not always obtained by going `straight' on straights. The issue is investigated within a nonlinear optimal control framework, where the objective is to minimize the manoeuvre time of the vehicle. A recently introduced free-trajectory method based on g-g diagrams is employed, and extended with refinement on the vehicle and road models. A direct solution method is employed, i.e. the optimal control problem (OCP) is transcribed into a nonlinear programming problem (NLP).
|
|
14:30-14:50, Paper TuBT14.4 | Add to My Program |
Safe Hierarchical Navigation in Crowded Dynamic Uncertain Environments |
|
Chen, Hongyi | Georgia Institute of Technology |
Feng, Shiyu | Georgia Institute of Technology |
Zhao, Ye | Georgia Tech |
Liu, Changliu | Carnegie Mellon University |
Vela, Patricio A. | Georgia Institute of Technology |
Keywords: Hierarchical control, Optimization, Robotics
Abstract: This paper describes a hierarchical solution consisting of a multi-phase planner and a low-level safe controller to jointly solve the safe navigation problem in crowded, dynamic, and uncertain environments. The planner employs dynamic gap analysis and trajectory optimization to achieve collision avoidance with respect to the predicted trajectories of dynamic agents within the sensing and planning horizon and with robustness to agent uncertainty. To address uncertainty over the planning horizon and real-time safety, a fast reactive safe set algorithm (SSA) is adopted, which monitors and modifies the unsafe control during trajectory tracking. Compared to other existing methods, our approach offers theoretical guarantees of safety and achieves collision-free navigation with higher probability in uncertain environments, as demonstrated in scenarios with 20 and 50 dynamic agents.
|
|
14:50-15:10, Paper TuBT14.5 | Add to My Program |
Port-Hamiltonian Modeling of Hydraulics in 4th Generation District Heating Networks |
|
Strehle, Felix | Karlsruhe Institute of Technology (KIT) |
Machado Martínez, Juan Eduardo | University of Groningen |
Cucuzzella, Michele | University of Pavia |
Malan, Albertus Johannes | Karlsruhe Institute of Technology |
Scherpen, Jacquelien M.A. | University of Groningen |
Hohmann, Soeren | KIT |
Keywords: Modeling, Networked control systems, Energy systems
Abstract: In this paper, we use elements of graph theory and port-Hamiltonian systems to develop a modular dynamic model describing the hydraulic behavior of 4th generation district heating networks. In contrast with earlier generation networks with a single or few heat sources and pumps, newer installations will prominently feature distributed heat generation units, bringing about a number of challenges for the control and stable operation of these systems, e.g., flow reversals and interactions among pumps controllers, which may lead to severe oscillations. We focus thus on flexible system setups with an arbitrary number of distributed heat sources and end-users interconnected through a meshed, multi-layer distribution network of pipes. Moreover, differently from related works on the topic, we incorporate dynamic models for the pumps in the system and explicitly account for the presence of pressure holding units. By inferring suitable (power-preserving) interconnection ports, we provide a number of claims about the passivity properties of the overall, interconnected system, which can prove to be highly beneficial in the design of decentralized control schemes and stability analyses.
|
|
15:10-15:30, Paper TuBT14.6 | Add to My Program |
A Lower-Bound for Variable-Length Source Coding in Linear-Quadratic-Gaussian Control with Shared Randomness |
|
Cuvelier, Travis | University of Texas at Austin |
Tanaka, Takashi | University of Texas at Austin |
Heath Jr., Robert W. | North Carolina State University |
Keywords: Information theory and control, Control over communications, Distributed control
Abstract: In this letter, we consider a Linear Quadratic Gaussian (LQG) control system where feedback occurs over a noiseless binary channel and derive lower bounds on the minimum communication cost (quantified via the channel bitrate) required to attain a given control performance. We assume that at every time step an encoder can convey a packet containing a variable number of bits over the channel to a decoder at the controller. Our system model provides for the possibility that the encoder and decoder have shared randomness, as is the case in systems using dithered quantizers. We define two extremal prefix-free requirements that may be imposed on the message packets; such constraints are useful in that they allow the decoder, and potentially other agents to uniquely identify the end of a transmission in an online fashion. We then derive a lower bound on the rate of prefix-free coding in terms of directed information; in particular we show that a previously known bound still holds in the case with shared randomness. We generalize the bound for when prefix constraints are relaxed, and conclude with a rate-distortion formulation.
|
|
TuBT15 Invited Session, Maya Ballroom VII |
Add to My Program |
Numerical Methods for Optimal Control |
|
|
Chair: Vila, Eduardo M G | Imperial College London |
Co-Chair: Kerrigan, Eric C. | Imperial College London |
Organizer: Vila, Eduardo M G | Imperial College London |
Organizer: McInerney, Ian | The University of Manchester |
Organizer: Kerrigan, Eric C. | Imperial College London |
|
13:30-13:50, Paper TuBT15.1 | Add to My Program |
A Feasible Sequential Linear Programming Algorithm with Application to Time-Optimal Path Planning Problems (I) |
|
Kiessling, David | KU Leuven |
Zanelli, Andrea | ETH Zurich |
Nurkanovic, Armin | University of Freiburg |
Gillis, Joris | Katholieke Universiteit Leuven |
Diehl, Moritz | University of Freiburg |
Zeilinger, Melanie N. | ETH Zurich |
Pipeleers, Goele | Katholieke Universiteit Leuven |
Swevers, Jan | K. U. Leuven |
Keywords: Optimization algorithms, Optimization, Optimal control
Abstract: In this paper, we propose a Feasible Sequential Linear Programming (FSLP) algorithm applied to timeoptimal control problems (TOCP) obtained through direct multiple shooting discretization. This method is motivated by TOCP with nonlinear constraints which arise in motion planning of mechatronic systems. The algorithm applies a trust-region globalization strategy ensuring global convergence. For fully determined problems our algorithm provides locally quadratic convergence. Moreover, the algorithm keeps all iterates feasible enabling early termination at suboptimal, feasible solutions. This additional feasibility is achieved by an efficient iterative strategy using evaluations of constraints, i.e., zeroorder information. Convergence of the feasibility iterations can be enforced by reduction of the trust-region radius. These feasibility iterations maintain feasibility for general Nonlinear Programs (NLP). Therefore, the algorithm is applicable to general NLP. We demonstrate our algorithm’s efficiency and the feasibility updates for a TOCP of an overhead crane motion planning simulation case.
|
|
13:50-14:10, Paper TuBT15.2 | Add to My Program |
A Non-Interior-Point Method for the Optimal Control Problem with Equilibrium Constraints (I) |
|
Lin, Kangyu | Kyoto University |
Ohtsuka, Toshiyuki | Kyoto Univ |
Keywords: Numerical algorithms, Optimal control, Constrained control
Abstract: In this study, we propose a non-interior-point method to solve the optimal control problem with equilibrium constraints (OCPEC). OCPEC is an intractable problem owing to its inherently unstable constraint system, resulting in the numerical difficulties such as a feasible region without strict interior. Compared to interior-point methods, the proposed method does not require all iterates to remain in the feasible region, and therefore has more freedom to select the stepsize to mitigate the numerical difficulties. This stepsize freedom is achieved by introducing a smoothing-equation reformulation for the complementary conditions of inequality constraints in the Karush--Kuhn--Tucker conditions. We solve the discretized OCPEC in the framework of the continuation method that solves a sequence of nonlinear programming problems. A sparsity exploitation technique is introduced to efficiently solve the linear system, and a line search strategy with a dedicated merit function is introduced to guarantee the convergence. We demonstrate the effectiveness of the proposed algorithm using several numerical examples and make a comparison with an interior-point method solver.
|
|
14:10-14:30, Paper TuBT15.3 | Add to My Program |
Solving Optimal Control Problems with Non-Smooth Solutions Using an Integrated Residual Method and Flexible Mesh (I) |
|
Nita, Lucian | Imperial College London |
Kerrigan, Eric C. | Imperial College London |
Vila, Eduardo M G | Imperial College London |
Nie, Yuanbo | University of Sheffield |
Keywords: Optimal control, Numerical algorithms, Computational methods
Abstract: Solutions to optimal control problems can be discontinuous, even if all the functionals defining the problem are smooth. This can cause difficulties when numerically computing solutions to these problems. While conventional numerical methods assume state and input trajectories are continuous and differentiable or smooth, our method is able to capture discontinuities in the solution by introducing time-mesh nodes as decision variables. This allows one to obtain a higher accuracy solution for the same number of mesh nodes compared to a fixed time-mesh approach. Furthermore, we propose to first solve a sequence of suitably-defined least-squares problems to ensure that the error in the dynamic equation is below a given tolerance. The cost functional is then minimized subject to an inequality constraint on the dynamic equation residual. We demonstrate our implementation on an optimal control problem that has a chattering solution. Solving such a problem is difficult, since the solution involves infinitely many switches of decreasing duration. This simulation shows how the flexible mesh is able to capture discontinuities present in the solution and achieve superlinear convergence as the number of mesh intervals is increased.
|
|
14:30-14:50, Paper TuBT15.4 | Add to My Program |
NOSNOC: A Software Package for Numerical Optimal Control of Nonsmooth Systems |
|
Nurkanovic, Armin | University of Freiburg |
Diehl, Moritz | University of Freiburg |
Keywords: Numerical algorithms, Optimal control, Switched systems
Abstract: This letter introduces the NOnSmooth Numerical Optimal Control (NOSNOC) open-source software package. It is a modular MATLAB tool based on CasADi and IPOPT for numerically solving Optimal Control Problems (OCP) with piecewise smooth systems (PSS). The tool supports: 1) automatic reformulation of systems with state jumps into PSS (via the time-freezing reformulation [Nurkanovic et al., 2021]) and of PSS into computationally more convenient forms, 2) automatic discretization of the OCP via, e.g., the recently introduced Finite Elements with Switch Detection [Nurkanovic et al., 2022] which enables high accuracy optimal control and simulation of PSS, 3) solution methods for the resulting discrete-time OCP. The nonsmooth discrete-time OCP are solved with techniques of continuous optimization in a homotopy procedure, without the use of integer variables. This enables the treatment of a broad class of nonsmooth systems in a unified way. Two tutorial examples are given. A benchmark shows that NOSNOC provides both faster and more accurate solutions than conventional approaches, including mixed-integer formulations.
|
|
14:50-15:10, Paper TuBT15.5 | Add to My Program |
Efficient Numerical Optimal Control for Highly Oscillatory Systems |
|
Harzer, Jakob | Albert Ludwig University of Freiburg |
De Schutter, Jochem | ALU Freiburg |
Diehl, Moritz | University of Freiburg |
Keywords: Modeling, Numerical algorithms, Model/Controller reduction
Abstract: We present an efficient transcription method for highly oscillatory optimal control problems. For these problems, the optimal state trajectory consists of fast oscillations that change slowly over the time horizon. Out of a large number of oscillations, we only simulate a subset to approximate the slow change by constructing a semi-explicit differential-algebraic equation that can be integrated with integration steps much larger than one period. For the solution of optimal control problems with direct methods, we provide a way to parametrize and regularize the controls. Finally, we utilize the method to find a fuel-optimal orbit transfer of a low-thrust satellite. Using the novel method, we reduce the size of the resulting nonlinear program by more than one order of magnitude.
|
|
15:10-15:30, Paper TuBT15.6 | Add to My Program |
Continuous Optimization for Control of Hybrid Systems with Hysteresis Via Time-Freezing |
|
Nurkanovic, Armin | University of Freiburg |
Diehl, Moritz | University of Freiburg |
Keywords: Hybrid systems, Optimal control, Switched systems
Abstract: This letter regards numerical optimal control of a class of hybrid systems with hysteresis using techniques solely from nonlinear optimization, without any integer variables. Hysteresis is a rate independent memory effect, which often results in severe nonsmoothness in the dynamics. These systems are not simply Piecewise Smooth Systems (PSS); they are a more complicated form of hybrid systems. We introduce a time-freezing reformulation, which transforms these systems into a PSS. From the theoretical side, this reformulation opens the door to study systems with hysteresis via the rich tools developed for Filippov systems. From the practical side, it enables the use of the recently developed Finite Elements with Switch Detection (Nurkanovic et al., 2022), which makes high accuracy numerical optimal control of hybrid systems with hysteresis possible. We provide a time optimal control problem example and compare our approach to mixed integer formulations from the literature.
|
|
TuBT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Observers for Nonlinear Systems and Applications |
|
|
Chair: Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
Co-Chair: Fontanelli, Daniele | University of Trento |
|
13:30-13:50, Paper TuBT16.1 | Add to My Program |
KKL Observer Design for Sensorless Induction Motors |
|
Bernard, Pauline | MINES ParisTech, Université PSL |
Devos, Thomas | Schneider Electric |
Jebai, Al Kassem | Schneider Electric |
Martin, Philippe | MINES ParisTech, PSL Research University |
Praly, Laurent | MINES ParisTech |
Keywords: Observers for nonlinear systems, Observers for Linear systems, Electrical machine control
Abstract: We propose a novel observer for speed and torque estimation in induction motors, using only electrical measurements and assuming the parameters known. The design is based on a novel representation of the motor model taking the form of a cascade of a flux subsystem and a velocity subsystem with known injection terms. After giving sufficient conditions for the uniform strong observability of those subsystems, we exploit the KKL approach to design a globally asymptotically stable observer without relying on any time-scale separation. We provide a method to optimize the observer parameters and illustrate its performance on simulation in a realistic scenario.
|
|
13:50-14:10, Paper TuBT16.2 | Add to My Program |
Equivariant Filter Design for Discrete-Time Systems |
|
Ge, Yixiao | Australian National University |
van Goor, Pieter | Australian National University |
Mahony, Robert | Australian National University, |
Keywords: Observers for nonlinear systems, Filtering, Algebraic/geometric methods
Abstract: The kinematics of many nonlinear control systems, especially in the robotics field, admit a transitive Lie-group symmetry, which is useful in high performance observer design. The recently proposed equivariant filter (EqF) exploits equivariance to generate high performance filters for a wide range of real-world systems. However, existing work on the equivariant filter, and equivariance of control systems in general, is based on a continuous-time formulation. In this paper, we first present the equivariant structure of a discrete-time system. We then use this to propose a discrete-time version of the equivariant filter. A novelty of the approach is that the geometry of the symmetry group naturally appears as parallel transport in the reset step of the filter. Preliminary results for linear second order kinematics with separate bearing and range measurements indicate that the discrete EqF significantly outperforms both a discretized version of the continuous EqF and a classical discrete EKF.
|
|
14:10-14:30, Paper TuBT16.3 | Add to My Program |
Bi-Homogeneous Observers for Uncertain 2-DOF Mechanical Systems |
|
Texis-Loaiza, Oscar | Universidad Nacional Autónoma De México |
Meléndez-Pérez, René | Universidad Nacional Autónoma De México |
Moreno, Jaime A. | Universidad Nacional Autonoma De Mexico-UNAM |
Fridman, Leonid | Universidad Nacional Autonoma De Mexico |
Keywords: Observers for nonlinear systems
Abstract: A global, bi-homogeneous, observer for a class of 2-DOF mechanical systems with Coriolis and centrifugal forces, dry and viscous frictions and non-vanishing bounded uncertainties/perturbations is proposed, ensuring predefined upper bound of convergence time.
|
|
14:30-14:50, Paper TuBT16.4 | Add to My Program |
Globally Exponentially Convergent Observer for the Rigid Body System on SE(3) |
|
Shanbhag, Soham | Korea Advanced Institute of Science and Technology |
Chang, Dong Eui | Korea Advanced Institute of Science and Technology |
Keywords: Observers for nonlinear systems, Time-varying systems
Abstract: We propose globally exponentially convergent observers for the rigid body translation and rotation system evolving on SE(3) with bounded angular velocity whose bound is not known a priori. We design the observers in the ambient Euclidean space and show exponential rate of convergence of the observers to the state of the system. We perform simulations to compare the proposed observers with observers available in the literature. We also show simulation of our observer with data collected using an Intel realsense T265 sensor.
|
|
14:50-15:10, Paper TuBT16.5 | Add to My Program |
On Local/global Constructibility for Mobile Robots Using Bounded Range Measurements |
|
Riz, Francesco | University of Trento |
Palopoli, Luigi | University of Trento |
Fontanelli, Daniele | University of Trento |
Keywords: Observers for nonlinear systems, Nonholonomic systems, Kalman filtering
Abstract: We propose a state-constructibility analysis for a nonholonomic vehicle moving across an environment instrumented with fixed-frame range sensors (anchors). We consider a sparse deployment in which anchors are in a small number, and have finite and non-overlapping sensing ranges. Under the most extreme conditions (i.e., the robot being in sight of two anchors at different times), we provide a sufficient condition on the manoeuvres that the robot is required to execute within the range of each of the anchors in order to achieve global constructibility. When the robot travels along straight lines, these conditions are not met, but we can still have “local” constructibility to a degree quantifiable through the smallest eigenvalue of the Constructibility Gramian (CG). Our second contribution is to show how this metric changes according to the geometric parameters of the linear trajectory with respect to the position of the anchors and their sensing range.
|
|
15:10-15:30, Paper TuBT16.6 | Add to My Program |
Disturbance Observer and Depth Enhanced Visual-Inertial Navigation System for Multi-Rotor MAVs: An Observability Analysis |
|
Gomaa, Mahmoud A. K. | Memorial University of Newfoundland |
De Silva, Oscar | Memorial University of Newfoundland |
Jayasiri, Awantha | National Research Council |
Mann, George K. I. | Memorial University of Newfoundland |
Keywords: Observers for nonlinear systems, Sensor fusion, Kalman filtering
Abstract: This paper proposes a new filtering-based depth enhanced visual internal navigation system (DE-VINS) with external disturbance observation. This filter resolves the drifting and degraded performance of drag force model VINS filters at hovering conditions and during the existence of external disturbances. A theoretical nonlinear observability analysis is performed to verify the filter design. The performance of the proposed DE-VINS is investigated through two sets of numerical simulations using a Matlab simulator and compared against the state-of-the-art drag force VINS filters. The results show improved performance of the DE-VINS in terms of estimation accuracy and consistency at zero-velocity flight (hovering) during the existence of external disturbances while estimating the magnitude and direction of the disturbance force.
|
|
TuBT17 Invited Session, Acapulco |
Add to My Program |
Biological Controllers |
|
|
Chair: Filo, Maurice | Swiss Federal Institute of Technology in Zurich |
Co-Chair: Khammash, Mustafa H. | ETH Zurich |
Organizer: Filo, Maurice | ETH Zurich |
Organizer: Khammash, Mustafa H. | ETH Zurich |
|
13:30-13:50, Paper TuBT17.1 | Add to My Program |
On the Computation of the Minimum Set of Reactions for Optimal Growth in Constraint-Based Models |
|
Oarga, Alexandru | University of Zaragoza |
Julvez, Jorge | University of Zaragoza |
Keywords: Systems biology, Petri nets, Optimization
Abstract: Technical advances in sequencing have allowed the reconstruction of metabolic models of multiple microorganisms, which have proven useful in advancing metabolic engineering and drug discovery. Optimization methods have provided a way to accurately predict flux phenotypes of various unicellular organisms and their response to gene knockouts. Despite the broad application of these methods, the role that different biochemical reactions have in providing robustness and flexibility has not been studied extensively. In this work, a method is presented to identify those sets of reactions that are essential for growth and those that are redundant and therefore account for the robustness of metabolism. The problem of computing a minimum set of reactions that can produce optimum growth is formally stated. It is proven that such a problem is NP-complete and a technique to reduce the search space of the problem is proposed. The presented approach is experimentally applied in various genome-scale models. The contribution of this work is to provide insight into the roles that different reactions play in the production of growth and to propose methods that can be directly applied in model curation and analysis.
|
|
13:50-14:10, Paper TuBT17.2 | Add to My Program |
Multicellular PI Control for Gene Regulation in Microbial Consortia |
|
Martinelli, Vittoria | Università Degli Studi Di Napoli Federico II |
Salzano, Davide | University of Naples Federico II |
Fiore, Davide | University of Naples Federico II |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Genetic regulatory systems, Biomolecular systems, PID control
Abstract: We describe two multicellular implementations of the classical P and PI feedback controllers for the regulation of gene expression in a target cell population. Specifically, we propose to distribute the proportional and integral actions over two different cellular populations in a microbial consortium comprising a third target population whose output needs to be regulated. By engineering communication among the different cellular populations via appropriate orthogonal quorum sensing molecules, we are able to close the feedback loop across the consortium. We derive analytical conditions on the biological parameters guaranteeing the regulation of the output of the target population and we validate the robustness and modularity of proposed control schemes via in silico experiments in BSim, a realistic agent-based simulator of bacterial populations.
|
|
14:10-14:30, Paper TuBT17.3 | Add to My Program |
On the Design of a PID Bio-Controller with Set Point Weighting and Filtered Derivative Action |
|
Alexis, Emmanouil | Oxford University |
Cardelli, Luca | University of Oxford |
Papachristodoulou, Antonis | University of Oxford |
Keywords: PID control, Biomolecular systems, Genetic regulatory systems
Abstract: Effective and robust regulation of biomolecular processes is crucial for designing reliable synthetic bio-devices functioning in uncertain and constantly changing biological environments. Proportional-Integral-Derivative (PID) controllers are undeniably the most common way of implementing feedback control in modern technological applications. Here, we introduce a highly tunable PID bio-controller with set point weighting and filtered derivative action presented as a chemical reaction network with mass action kinetics. To demonstrate its effectiveness, we apply our PID scheme on a simple biological process of two mutually activated species, one of which is assumed to be the output of interest. To highlight its performance advantages we compare it to PI regulation using numerical simulations in both the deterministic and stochastic setting.
|
|
14:30-14:50, Paper TuBT17.4 | Add to My Program |
Exploiting the Nonlinear Structure of the Antithetic Integral Controller to Enhance Dynamic Performance (I) |
|
Filo, Maurice | ETH Zurich |
Kumar, Sant | ETH Zurich |
Anastassov, Stanislav | ETH |
Khammash, Mustafa H. | ETH Zurich |
Keywords: Biomolecular systems, Genetic regulatory systems, Biological systems
Abstract: The design of biomolecular feedback controllers has been identified as an important goal across a broad range of biological applications spanning synthetic biology, cell therapy, metabolic engineering, etc. This originates from the need to regulate various cellular processes in a robust and timely fashion. Recently, antithetic integral controllers found their way into synthetic biology due to the Robust Perfect Adaptation (RPA) property they endow --- the biological analogue of robust steady-state tracking. The antithetic integral motif hinges on a sequestration reaction between two molecules that annihilates their function. Here, we demonstrate that the complex resulting from the nonlinear sequestration reaction can be leveraged as an inhibitor to enhance the dynamic performance while maintaining the RPA property. We establish that this additional inhibition by the sequestration complex gives rise to a filtered Proportional-Integral (PI) controller thus offering more flexibility in shaping the dynamic response and reducing cell-to-cell variability. Furthermore, we explore the effect of various biological inhibitory mechanisms on the overall performance. The various analyses in the paper are carried out using analytical tools and are supported by numerical simulations. Finally, an experimental validation is performed using the cyberloop --- a hybrid platform where the controller is implemented in silico to control a genetic circuit in vivo.
|
|
14:50-15:10, Paper TuBT17.5 | Add to My Program |
Emergent Interactions Due to Resource Competition in CRISPR-Mediated Genetic Activation Circuits (I) |
|
Manoj, Krishna | Massachusetts Institute of Technology |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Genetic regulatory systems, Biomolecular systems, Systems biology
Abstract: CRISPR-mediated gene regulation has gained considerable attention due to its scalability, allowing to create increasingly large genetic circuits. Unintended interactions due to competition for the dCas9 resource among different small guide RNAs have been characterized extensively for CRISPR-mediated repression (CRISPRi). Such an analysis is to a large extent missing for CRISPR-mediated activation (CRISPRa). In this paper, we model CRISPRa considering two required shared resources (dCas9 and an activator protein), and identify the interaction graphs that emerge through resource competition. The presence of two shared resources among multiple scaffold RNAs (scRNA) is responsible for two main phenomena. First, we mathematically prove the existence of a ``self-sequestration" effect, wherein an scRNA represses its own target gene instead of activating it, thereby negating the CRISPRa function. Second, we demonstrate that unwanted repression of non-target genes is substantially stronger when compared to a scenario with a single resource. These results indicate that new control approaches to concurrently regulate multiple resources will be useful for mitigating the undesirable effects of resource competition in CRISPRa.
|
|
15:10-15:30, Paper TuBT17.6 | Add to My Program |
A Contraction Theory-Based Framework for the Design of Robustness to Global Perturbations in Biomolecular Circuits |
|
Patel, Abhilash | Indian Institute of Technology Kanpur |
Sen, Shaunak | Indian Institute of Technology Delhi |
Kar, Indra Narayan | Indian Institute of Technology Delhi |
Keywords: Biomolecular systems, Biological systems, Genetic regulatory systems
Abstract: It is challenging to design robustness against global perturbations that affect the plant as well as the controller. A particular instance of practical importance is the design of biomolecular circuits that are robust to global variables such as temperature. To address this, we developed a framework based on contraction theory and used mathematical models as case studies. We mathematically modelled the global effect on parameters as an additive state-dependent uncertainty. We quantified the difference between the perturbed and the nominal system trajectories for the global perturbations. For specific biomolecular circuit designs, we noted that as the contraction rate increased, the envelope between the nominal and the perturbed trajectory shrunk, resulting in robust behaviour. These results should help in the rational design of robustness in systems with global perturbations such as in biomolecular circuits.
|
|
TuCT01 Regular Session, Tulum Ballroom A |
Add to My Program |
Switched and Hybrid Systems |
|
|
Chair: Zamani, Majid | University of Colorado Boulder |
Co-Chair: Ruderman, Michael | University of Agder |
|
16:00-16:20, Paper TuCT01.1 | Add to My Program |
On Stochastic ISS of Time-Varying Switched Systems with Generic Levy Switching Signals |
|
Athni Hiremath, Sandesh | University of Kaiserslautern |
Ahmed, Saeed | Jan C. Willems Center for Systems and Control, Faculty of Scienc |
Keywords: Stability of hybrid systems, Switched systems, Stochastic systems
Abstract: Switched systems in which switching among subsystems occurs at random time instants find numerous applications in engineering. Stability analysis of such systems, however, is quite challenging. This paper investigates the stochastic input-to-state stability of this class of switched systems. The random switching instants are modeled by a non-decreasing, positive, and real-valued Levy process, which, at every time instant, selects the active subsystem from a family of deterministic systems. No assumption on the stability of subsystems is presumed; they can be stable or unstable. Stochastic properties of the switching signal are coupled with a family of Lyapunov-like functions to obtain a sufficient condition for stochastic input-to-state stability.
|
|
16:20-16:40, Paper TuCT01.2 | Add to My Program |
Orientation Control of the Bouncing Ball |
|
Clark, William | Cornell University |
Kassabova, Dora | Cornell University |
Keywords: Nonholonomic systems, Hybrid systems, Simulation
Abstract: Control of a hybrid dynamical system can manifest in one of two main ways: either through the continuous or the discrete dynamics. An example of controls influencing the continuous dynamics is legged locomotion, where the joints are actuated but the location and nature of the impacts are uncontrolled. In contrast, an example of discrete control would be in tennis; the player can only influence the trajectory of the ball through striking it. This work examines the latter case with two key emphases. The first is that controls manifest through changing the location of the guard (as opposed to changing only the reset). The second is that the location of the guard is described by "external variables" while the goal is to control "internal variables." As a simple test of this theory, orientation control of a bouncing ball is explored; the ball is only controlled during impacts which are exclusively position-dependent.
|
|
16:40-17:00, Paper TuCT01.3 | Add to My Program |
Dynamics of Inertial Pair Coupled Via Frictional Interface |
|
Ruderman, Michael | University of Agder |
Zagvozdkin, Andrei | University of Texas at Dallas |
Rachinskii, Dmitrii | University of Texas at Dallas |
Keywords: Modeling, Nonlinear systems, Hybrid systems
Abstract: Understanding the dynamics of two inertial bodies coupled via a friction interface is essential for a wide range of systems and motion control applications. Coupling terms within the dynamics of an inertial pair connected via a passive frictional contact are non-trivial and have long remained understudied in system communities. This problem is particularly challenging from a point of view of modeling the interaction forces and motion state variables. This paper deals with a generalized motion problem in systems with a free (of additional constraints) friction interface, assuming the classical Coulomb friction with discontinuity at the velocity zero crossing. We formulate the dynamics of motion as the closed-form ordinary differential equations containing the sign operator for mapping both, the Coulomb friction and the switching conditions, and discuss the validity of the model in the generalized force and motion coordinates. The system has one active degree of freedom (the driving body) and one passive degree of freedom (the driven body). We demonstrate the global convergence of trajectories for a free system with no external excitation forces. Then, an illustrative case study is presented for a harmonic oscillator with a frictionally coupled second mass that is not grounded or connected to a fixed frame. This simplified example illustrates a realization and main features of the proposed (general) modeling framework. Some future development and related challenges are discussed at the end of the paper.
|
|
17:00-17:20, Paper TuCT01.4 | Add to My Program |
Modeling of Integrating and Non-Minimum Phase Dynamics Using Limit Cycles |
|
Pandey, Saurabh | Indian Institute of Technology Guwahati, Assam |
Majhi, Somanath | Indian Institute of Technology Guwahati |
Keywords: Modeling, Identification, Linear systems
Abstract: Using the frequency domain approach, a relay based modeling of industrial processes in terms of dead time models with non-minimum phase and integrating dynamics is addressed in this paper. An asymmetrical relay is fedback to bring sustained oscillatory responses from the unknown process dynamics broadly known as limit cycle. Considering the relay settings and the limit cycle information, an explicit set of mathematical expressions for identification of non-minimum phase integrating first order plus dead time, integrating first order plus dead time and pure integrating plus dead time processes is deduced which does not need an appropriate guess in the solution of nonlinear equations to yield process model parameters. During the identification test, effect of measurement noise over the process output information is minimized by using a Fourier series based curve fitting algorithm. Well-known examples from literature are simulated to illustrate the advantage of proposed relay based identification algorithm. Finally, the comparison between identified and true process models is carried out through integral of absolute error criterion and frequency response plots.
|
|
17:20-17:40, Paper TuCT01.5 | Add to My Program |
Stability of Scheduling Policies for Processing Networks |
|
Seidman, Thomas I. | Univ. of Maryland, Baltimore County |
Holloway, Lawrence E. | Univ. of Kentucky |
Seidman, Gregory | Pie Insurance |
Keywords: Switched systems, Manufacturing systems and automation, Modeling
Abstract: This paper considers the stability of a network of processors executing a set of tasks on orders that arrive either externally or from other processors in the network. These orders wait at each task until a control policy assigns the processor to execute the orders for that task. We consider a class of control policies, work-conserving control policies, that consider look-ahead of workload from existing orders. These policies are shown, in general, to ensure that work accumulation in the system is bounded and thus the system is stable. We then consider the class of apparently-work-conserving controllers, which operate on task-level information instead of global information. By adding supplemental information dynamics, it is shown that these apparently-work-conserving controllers can be made equivalent to work-conserving and thus ensure stability.
|
|
17:40-18:00, Paper TuCT01.6 | Add to My Program |
A Scenario Approach for Synthesizing K-Inductive Barrier Certificates |
|
Murali, Vishnu | University of Colorado, Boulder |
Trivedi, Ashutosh | University of Colorado Boulder |
Zamani, Majid | University of Colorado Boulder |
Keywords: Randomized algorithms, Hybrid systems, Optimization
Abstract: The notion of k-inductive barrier certificates generalize the idea of k-induction to verification of discrete-time continuous-state dynamical systems by requiring restrictions over k-grams (sequence of k-states in evolution) of the system transitions. The promise of k-inductive barrier certificates is in the simplicity of the form of barrier certificates (lower-degree of polynomials) at the cost of more complex non-convex constraints involving logical implications. Recent breakthroughs in convex robust programming via the scenario approach deliver a sampling-based randomized algorithm for the computation of barrier certificates. In the absence of system dynamics (a.k.a. black box models), extending scenario approach to k-inductive barrier certificates faces challenges due to the resulting lack of convexity. This letter overcomes non-convexity challenges by providing a sound approach for data-driven computation of k-inductive barrier certificates. We present computational methods to solve the resulting scenario programs for k-inductive barrier certificates, provide out-of-sample performance guarantees, and experimentally demonstrate the effectiveness of the proposed results.
|
|
TuCT02 Regular Session, Tulum Ballroom B |
Add to My Program |
Adaptive Control III |
|
|
Chair: Annaswamy, Anuradha M. | Massachusetts Inst. of Tech |
Co-Chair: Nielsen, Christopher | University of Waterloo |
|
16:00-16:20, Paper TuCT02.1 | Add to My Program |
Accelerated Performance and Accelerated Learning with Discrete-Time High-Order Tuners |
|
Cui, Yingnan | Massachusetts Institute of Technology |
Annaswamy, Anuradha M. | Massachusetts Inst. of Tech |
Keywords: Adaptive control, Identification, Iterative learning control
Abstract: We consider two high-order tuners that have been shown to have accelerated performance, one based on Polyak’s heavy ball method and another based on Nesterov’s acceleration method. We show that parameter estimates are bounded and converge to the true values exponentially fast when the regressors are persistently exciting. Simulation results corroborate the accelerated performance and accelerated learning properties of these high-order tuners in comparison to algorithms based on normalized gradient descent.
|
|
16:20-16:40, Paper TuCT02.2 | Add to My Program |
Adaptive Control of SE(3) Hamiltonian Dynamics with Learned Disturbance Features |
|
Duong, Thai | University of California, San Diego |
Atanasov, Nikolay | University of California, San Diego |
Keywords: Adaptive control, Identification for control, Machine learning
Abstract: Adaptive control is a critical component of reliable robot autonomy in rapidly changing operational conditions. Adaptive control designs benefit from a disturbance model, which is often unavailable in practice. This motivates the use of machine learning techniques to learn disturbance features from training data offline, which can subsequently be employed to compensate the disturbances online. This paper develops geometric adaptive control with a learned disturbance model for rigid-body systems, such as ground, aerial, and underwater vehicles, that satisfy Hamilton's equations of motion over the SE(3) manifold. Our design consists of an offline disturbance model identification stage, using a Hamiltonian-based neural ordinary differential equation (ODE) network trained from state-control trajectory data, and an online adaptive control stage, estimating and compensating the disturbances based on geometric tracking errors. We demonstrate our adaptive geometric controller in trajectory tracking simulations of fully-actuated pendulum and under-actuated quadrotor systems.
|
|
16:40-17:00, Paper TuCT02.3 | Add to My Program |
Multiple Model Reference Adaptive Tracking Control of Multivariable Systems with Blending |
|
Lovi, Alex | University of Waterloo |
Fidan, Baris | University of Waterloo |
Nielsen, Christopher | University of Waterloo |
Keywords: Adaptive control, Indirect adaptive control, Identification for control
Abstract: This paper develops a multiple fixed model blending-based adaptive online parameter identification scheme and a multiple model reference adaptive control (MMRAC) for multiple input, multiple output (MIMO) linear time-invariant (LTI) systems with polytopic parameter uncertainty. The parameter identification scheme produces bounded estimates that asymptotically converge to the unknown system's matrices. The proposed MMRAC is developed by combining the proposed adaptive parameter identification scheme with a state-space model reference control approach. This MMRAC scheme is proven to asymptotically track signals generated by a LTI reference model. A set of simulation test results are presented to illustrate the stability and effectiveness of the proposed MMRAC scheme.
|
|
17:00-17:20, Paper TuCT02.4 | Add to My Program |
Regret Minimization for Linear Quadratic Adaptive Controllers Using Fisher Feedback Exploration |
|
Colin, Kévin | KTH Royal Institute of Technology |
Ferizbegovic, Mina | KTH |
Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
Keywords: Adaptive control, Information theory and control, Linear systems
Abstract: In this paper, we study the trade-off between exploration and exploitation for linear quadratic adaptive control. This trade-off can be expressed as a function of the exploration and exploitation costs, called cumulative regret. It has been shown over the years that the optimal asymptotic rate of the cumulative regret is in many instances O(√T). In particular, this rate can be obtained by adding a white noise external excitation, with a variance decaying as O(1/√T). As the amount of excitation is pre-determined, such approaches can be viewed as open loop control of the external excitation. In this contribution, we approach the problem of designing the external excitation from a feedback perspective leveraging the well known benefits of feedback control for decreasing sensitivity to external disturbances and system-model mismatch, as compared to open loop strategies. We base the feedback on the Fisher information matrix which is a measure of the accuracy of the model. Specifically, the amplitude of the exploration signal is seen as the control input while the minimum eigenvalue of the Fisher matrix is the variable to be controlled. We call such exploration strategies Fisher Feedback Exploration (F2E). We propose one explicit F2E design, called Inverse Fisher Feedback Exploration (IF2E), and argue that this design guarantees the optimal asymptotic rate for the cumulative regret. We provide theoretical support for IF2E and in a numerical example we illustrate benefits of IF2E and compare it with the open loop approach as well as a method based on Thompson sampling.
|
|
17:20-17:40, Paper TuCT02.5 | Add to My Program |
Worst-Case Performance of Greedy Policies in Bandits with Imperfect Context Observations |
|
Park, Hongju | University of Georgia |
Shirani Faradonbeh, Mohamad Kazem | University of Georgia |
Keywords: Adaptive control, Iterative learning control, Statistical learning
Abstract: Contextual bandits are canonical models for sequential decision-making under uncertainty in environments with time-varying components. In this setting, the expected reward of each bandit arm consists of the inner product of an unknown parameter with the context vector of that arm. The classical bandit settings heavily rely on assuming that the contexts are fully observed, while the study of the richer model of imperfectly observed contextual bandits is immature. This work considers Greedy reinforcement learning policies that take action as if the current estimates of the parameter and of the unobserved contexts coincide with the corresponding true values. We establish that the non-asymptotic worst-case regret grows poly-logarithmically with the time horizon and the failure probability, while it scales linearly with the number of arms. Numerical analysis showcasing the above efficiency of Greedy policies is also provided.
|
|
17:40-18:00, Paper TuCT02.6 | Add to My Program |
Synthesis of Minimax Adaptive Controller for a Finite Set of Linear Systems |
|
Cederberg, Daniel | Linköping University |
Hansson, Anders | Linkoping University |
Rantzer, Anders | Lund University |
Keywords: Adaptive control, Learning, Optimization
Abstract: The design of an adaptive controller with bounded L_2-gain from disturbances to errors for linear time-invariant systems with uncertain parameters restricted to a finite set is investigated. The design of the controller requires finding matrices satisfying non-convex matrix inequalities. We propose an approach for finding these matrices based on repeatedly linearizing the terms that cause the non-convexity of the inequalities. Empirical evidence suggests that the approach leads to adaptive controllers with significantly smaller upper bound on the L_2-gain.
|
|
TuCT03 Regular Session, Tulum Ballroom C |
Add to My Program |
Robotics III |
|
|
Chair: Fagiolini, Adriano | University of Palermo |
Co-Chair: Pb, Sujit | IISER Bhopal |
|
16:00-16:20, Paper TuCT03.1 | Add to My Program |
Observability Analysis and Reduced-Order Observer Design for a Super-Coiled Polymer-Driven Robotic Eye |
|
Rajendran, Sunil Kumar | George Mason University |
Wei, Qi | George Mason University |
Yao, Ningshi | George Mason University |
Zhang, Feitian | Peking University |
Keywords: Robotics, Observers for Linear systems, Control applications
Abstract: With the aid of a robotic eye platform, several ocular motor disorders such as strabismus can be studied by ophthalmologists and biomedical researchers to better understand the biomechanisms of the human eye. Our previous work modeled a 2-DOF robotic eye driven by Super-Coiled Polymer (SCP) artificial muscles and presented a Deep Deterministic Policy Gradient (DDPG) learning-based controller. The control policy requires access to the full system states that include the orientation of the robotic eye and temperature changes of the SCPs. While the angular orientations of the robotic eye can be determined using embedded sensors or image-processing of the visual feed, it is quite laborious and expensive to measure the temperatures of the slender SCP muscles without affecting robot dynamics. To address this problem, this paper designs a linear reduced-order state observer based on the linearization of the nonlinear dynamics of the robotic eye. The linearized model is analytically shown to be fully observable along any trajectory within the operation range. To quantify the local observability of the dynamical system, local unobservability indices and local estimation condition numbers are determined along trajectories of the system states. The performance of the designed observer is tested through simulation in both open-loop and closed-loop foveation control of the robotic eye.
|
|
16:20-16:40, Paper TuCT03.2 | Add to My Program |
Estimation of Time-Varying Parameters Defining Contact of a Planar Manipulator with a Surface |
|
Rosales, Antonio | VTT Technical Research Centre of Finland |
Freidovich, Leonid | Umeå University |
Keywords: Robotics, Observers for Linear systems, Estimation
Abstract: The knowledge of parameters, defining interaction of a robotic manipulator with environment, is crucial when robots execute contact-tasks involving tracking of trajectories while desired forces are applied on the environment. For contact tasks on planar surfaces, the inclination and stiffness of the surface are key parameters since the first one defines the direction of the desired force and trajectory, which are typically defined relative to a frame attached to the environment, and the second one is required to compute the control signal. There exist methods for estimation of inclination and stiffness, whenever they are constant. The estimation of time-varying stiffness and inclination is less studied. In this paper, we propose a method to estimate on-line the stiffness and inclination of the planar surface, when they are varying during the task execution. The method is based on adaptive observers that ensure asymptotic or finite-time convergence of the estimates to the real values of the parameters.
|
|
16:40-17:00, Paper TuCT03.3 | Add to My Program |
Dynamic Image-Based Visual Servoing for Quadrotor to Track a Planar Target with Unknown Motion |
|
Kumar, Yogesh | IIIT Delhi |
Basu Roy, Sayan | Indraprastha Institute of Information Technology Delhi |
Pb, Sujit | IISER Bhopal |
Keywords: Robotics, Stability of nonlinear systems, Observers for nonlinear systems
Abstract: This paper presents an image-based visual servoing (IBVS) technique for a quadrotor to track a moving planar target having unknown linear and angular velocities. Image moment-based features are derived after transforming the optical measurements to a virtual image plane parallel to the planar target. The kinematic controller is designed incorporating the linear and angular target velocity estimator blocks. The theoretical analysis demonstrates that the errors converge exponentially to an ultimate bound for time-varying target motion and exponentially to zero if the target motion is purely translational with constant velocity. Unlike past literature, the proposed IBVS scheme can handle unknown arbitrary translational and angular (along the z-axis) motions of the target in a unified and model-free manner. Simulation results using circular and Liassajous target motion are provided to validate the efficacy of the proposed scheme.
|
|
17:00-17:20, Paper TuCT03.4 | Add to My Program |
On the Stability of the Soft Pendulum with Affine Curvature: Open-Loop, Collocated Closed-Loop, and Switching Control |
|
Trumic, Maja | University of Belgrade |
Della Santina, Cosimo | TU Delft |
Jovanovic, Kosta | University of Belgrade |
Fagiolini, Adriano | University of Palermo |
Keywords: Robotics, Stability of nonlinear systems, Emerging control applications
Abstract: This letter investigates the stability properties of the soft inverted pendulum with affine curvature - a template model for nonlinear control of underactuated soft robots. We look into how changes in physical parameters affect stability and equilibrium. We give conditions under which zero dynamics corresponding to a collocated choice of the output is (locally or globally) stable or unstable. We leverage these results to design a switching controller that stabilizes a class of nonlinear equilibria of the pendulum, and it can drive the system from one of these equilibria to another.
|
|
17:20-17:40, Paper TuCT03.5 | Add to My Program |
RISE-Based Trajectory Tracking Control of an Aerial Manipulator under Uncertainty |
|
Lee, Dongjae | Seoul National University |
Byun, Jeonghyun | Seoul National University |
Kim, H. Jin | Seoul National University |
Keywords: Robotics, Robust control, Control applications
Abstract: This study presents a robust integral of the sign of the error (RISE)-based controller for an aerial manipulator consisting of a multi-rotor and a robotic arm which guarantees tracking error convergence to zero in the presence of uncertainties. To rigorously address underactuatedness issue, the system dynamics is decomposed into the two subsystems for which a robust controller is derived. As an intermediate result, if there exists no uncertainty, we show that the nominal closed-loop system with the proposed nominal controller is asymptotically stable without assuming that the attitude error term in the underactuated part is zero by cascaded system analysis tool. Then, a robust controller combining a nominal controller and a RISE controller is proposed and applied to both subsystems. Tracking error convergence is strictly proved through Lyapunov-based stability analysis. The performance of the controller is demonstrated in simulation with comparative studies where the proposed controller outperforms the other compared controllers in error convergence.
|
|
17:40-18:00, Paper TuCT03.6 | Add to My Program |
A Passivity Preserving H-Infinity Synthesis Technique for Robot Control |
|
Larby, Daniel Edward | University of Cambridge |
Forni, Fulvio | University of Cambridge |
Keywords: Robotics, Nonlinear systems, LMIs
Abstract: Most impedance control schemes in robotics implement a desired passive impedance, allowing for stable interaction between the controlled robot and the environment. However, there is little guidance on the selection of the desired impedance. In general, finding the best stiffness and damping parameters is a challenging task. This paper contributes to this problem by connecting impedance control to robust control, with the goal of shaping the robot performances via feedback. We provide a method based on linear matrix inequalities with sparsity constraints to derive impedance controllers that satisfy a H infinity performance criterion. Our controller guarantees passivity of the controlled robot and local performances near key poses.
|
|
TuCT04 Regular Session, Tulum Ballroom D |
Add to My Program |
Learning |
|
|
Chair: Beckers, Thomas | University of Pennsylvania |
Co-Chair: Mavridis, Christos | University of Maryland, College Park |
|
16:00-16:20, Paper TuCT04.1 | Add to My Program |
Data-Driven H-Infinity Control for Unknown Linear Time-Invariant Systems with Bounded Disturbances |
|
Hu, Kaijian | The University of Hong Kong, and Shenzhen Institute of Research |
Liu, Tao | The University of Hong Kong |
Keywords: Learning, Robust control, Linear systems
Abstract: Motivated by learning controllers directly from data, this paper studies the problem of designing a data-driven controller for unknown linear time-invariant (LTI) systems with bounded disturbances. An H_infty controller is designed using a group of input/state/output data without any model information. The obtained results are further applied to the ARX systems using only input/output data. The main advantage of the proposed controller is that it can balance the trade-off between the robustness and control performance of the closed-loop system by choosing the number of data sequences and the length of each sequence. The effectiveness of the proposed method is shown using simulation results of an outdoor antenna servo system.
|
|
16:20-16:40, Paper TuCT04.2 | Add to My Program |
Data-Based Control Design for Linear Discrete-Time Systems with Robust Stability Guarantees |
|
D'Amico, William | Politecnico Di Milano |
Farina, Marcello | Politecnico Di Milano |
Keywords: Learning, Robust control, LMIs
Abstract: This paper proposes a method based on virtual reference feedback tuning with robust closed-loop stability guarantees in a linear single-input and single-output setting. The proposed method is not a fully direct data-driven approach since an uncertainty set for the system is obtained through a set membership identification. Based on the uncertainty set, robust stability conditions are enforced as linear matrix inequality constraints within an optimization problem whose cost function relies on virtual reference feedback tuning. The effectiveness of the algorithm is demonstrated in a simulation example.
|
|
16:40-17:00, Paper TuCT04.3 | Add to My Program |
Meta-Learning Online Control for Linear Dynamical Systems |
|
Muthirayan, Deepan | University of California at Irvine |
Kalathil, Dileep | Texas A&M University (TAMU) |
Khargonekar, Pramod | Univ. of California, Irvine |
Keywords: Learning, Robust control, Uncertain systems
Abstract: In this work, we consider the problem of finding a meta-learning online control algorithm that can learn across the tasks when faced with a sequence of N (similar) control tasks. Each task involves controlling a linear dynamical system for T time steps. The cost function and system noise at each time step are adversarial and unknown to the algorithm before taking the control action. The goal of a meta-learning algorithm is to sequentially prescribe the individual online control policies for each new task by exploiting the information from previous tasks and the property of task similarity. We propose a meta-learning online control algorithm for this setting and characterize its performance using the metric of textit{meta-regret}, which is the average cumulative regret of the tasks. We show that, when the number of tasks are sufficiently large, the meta-regret of our proposed approach is smaller by a factor D/D^{*} compared to an independent-learning online control algorithm which does not perform learning across the task, where D is a problem constant and D^{*} is a scalar that decreases with increase in task similarity. Thus, when the sequence of tasks are similar, the regret of the proposed meta-learning online control is significantly lower than that of the naive approaches without meta-learning. We also present numerical results to demonstrate the superior performance achieved by our meta-learning algorithm.
|
|
17:00-17:20, Paper TuCT04.4 | Add to My Program |
Learning Optimal Team-Decisions |
|
Kjellqvist, Olle | Lund University |
Gattami, Ather | Bitynamics Research |
Keywords: Learning, Decentralized control
Abstract: In this paper, we study linear quadratic team decision problems, where a team of agents minimizes a convex quadratic cost function over T time steps subject to possibly distinct linear measurements of the state of nature. We assume that the state of nature is a Gaussian random variable and that the agents do not know the cost function nor the linear functions mapping the state of nature to their measurements. We present a gradient-descent based algorithm with an expected regret of O(log(T)) for full information gradient feedback and O(sqrt(T)) for bandit feedback. In the case of bandit feedback, the expected regret has an additional multiplicative term O(d) where d reflects the number of learned parameters.
|
|
17:20-17:40, Paper TuCT04.5 | Add to My Program |
Gaussian Process Port-Hamiltonian Systems: Bayesian Learning with Physics Prior |
|
Beckers, Thomas | University of Pennsylvania |
Seidman, Jacob H. | University of Pennsylvania |
Perdikaris, Paris | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Learning, Grey-box modeling, Mechatronics
Abstract: Data-driven approaches achieve remarkable results for the modeling of complex dynamics based on collected data. However, these models often neglect basic physical principles which determine the behavior of any real-world system. This omission is unfavorable in two ways: The models are not as data-efficient as they could be by incorporating physical prior knowledge, and the model itself might not be physically correct. We propose Gaussian Process Port-Hamiltonian systems (GP-PHS) as a physics-informed Bayesian learning approach with uncertainty quantification. The Bayesian nature of GP-PHS uses collected data to form a distribution over all possible Hamiltonians instead of a single point estimate. Due to the underlying physics model, a GP-PHS generates passive systems with respect to designated inputs and outputs. Further, the proposed approach preserves the compositional nature of Port-Hamiltonian systems.
|
|
17:40-18:00, Paper TuCT04.6 | Add to My Program |
Sparse Gaussian Process Regression Using Progressively Growing Learning Representations |
|
Mavridis, Christos | University of Maryland, College Park |
Kontoudis, George | University of Maryland |
Baras, John S. | University of Maryland |
Keywords: Learning, Machine learning, Pattern recognition and classification
Abstract: We present a new sparse Gaussian process regression model whose covariance function is parameterized by the locations of a progressively growing set of pseudo-inputs generated by an online deterministic annealing optimization algorithm. A series of entropy-regularized optimization problems is solved sequentially, introducing a bifurcation phenomenon, according to which, pseudo-inputs are gradually generated. This results in an active learning approach, which, in contrast to most existing works, can modify already selected pseudo-inputs and is trained using a recursive gradient-free stochastic approximation algorithm. Finally, the proposed algorithm is able to incorporate prior knowledge in the form of a probability density, according to which new observations are sampled. Experimental results showcase the efficacy and potential advantages of the proposed methodology.
|
|
TuCT05 Regular Session, Tulum Ballroom E |
Add to My Program |
Iterative Learning Control |
|
|
Chair: Iannelli, Andrea | ETH Zurich |
Co-Chair: Del Vecchio, Carmen | Università Del Sannio |
|
16:00-16:20, Paper TuCT05.1 | Add to My Program |
Stochastic Multi-Armed Bandits with Non-Stationary Rewards Generated by a Linear Dynamical System |
|
Gornet, Jonathan | Washington University in Saint Louis |
Hosseinzadeh, Mehdi | Washington University in St. Louis |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Iterative learning control, Finance, Kalman filtering
Abstract: The stochastic multi-armed bandit has provided a framework for studying decision-making in unknown environments. We propose a variant of the stochastic multi-armed bandit where the rewards are dynamic and structured, combining the frameworks developed in non-stationary and structured stochastic multi-armed bandits. This paper introduces a case when the rewards given by a chosen action are sampled from a stochastic dynamical system. The proposed strategy for this bandit variant is to learn a model of the dynamical system while choosing the optimal action based on the learned model. Motivated by mathematical finance areas such as Intertemporal Capital Asset Pricing Model proposed by Merton and Stochastic Portfolio Theory proposed by Fernholz that both model asset returns with stochastic differential equations, this strategy is applied to quantitative finance as a high-frequency trading strategy, where the goal is to maximize returns within a time period.
|
|
16:20-16:40, Paper TuCT05.2 | Add to My Program |
A Model-Based Reinforcement Learning Approach for Robust PID Tuning |
|
Jesawada, Hozefa | Electrical Engineering Department |
Yerudkar, Amol | University of Sannio |
Del Vecchio, Carmen | Università Del Sannio |
Singh, Navdeep | Veermata Jijabai Technological Institute (VJTI) |
Keywords: Iterative learning control, Machine learning, Information theory and control
Abstract: Proportional-Integral-Derivative (PID) controller is widely used across various industrial process control applications because of its straightforward implementation. However, it can be challenging to fine-tune the PID parameters in practice to achieve robust performance. The paper proposes a model-based reinforcement learning (RL) framework to tune PID controllers leveraging the probabilistic inference for learning control (PILCO) method. In particular, an optimal policy given by PILCO is transformed into a set of robust PID tuning parameters for underactuated mechanical systems. The robustness of the devised controller is verified with simulation studies for a benchmark cart-pole system under server disturbances and system parameter uncertainties.
|
|
16:40-17:00, Paper TuCT05.3 | Add to My Program |
Riemannian Constrained Policy Optimization Via Geometric Stability Certificates |
|
Talebi, Shahriar | University of Washington |
Mesbahi, Mehran | University of Washington |
Keywords: Iterative learning control, Linear systems, Optimization algorithms
Abstract: In this paper, we consider policy optimization over the Riemannian submanifolds of stabilizing controllers arising from constrained Linear Quadratic Regulators (LQR), including output feedback and structured synthesis. In this direction, we provide a Riemannian Newton-type algorithm that enjoys local convergence guarantees and exploits the inherent geometry of the problem. Instead of relying on the exponential mapping or a global retraction, the proposed algorithm revolves around the developed stability certificate and the constraint structure, fully utilizing the intrinsic geometry of the synthesis problem. We then show case the utility of the proposed algorithm through numerical examples.
|
|
17:00-17:20, Paper TuCT05.4 | Add to My Program |
Regret Analysis of Online Gradient Descent-Based Iterative Learning Control with Model Mismatch |
|
Balta, Efe C. | ETH Zurich |
Iannelli, Andrea | ETH Zurich |
Smith, Roy S. | ETH Zurich |
Lygeros, John | ETH Zurich |
Keywords: Iterative learning control, Learning, Optimization algorithms
Abstract: In Iterative Learning Control (ILC), a sequence of feedforward control actions is generated at each iteration on the basis of partial model knowledge and past measurements with the goal of steering the system toward a desired reference trajectory. This is framed here as an online learning task, where the decision-maker takes sequential decisions by solving a sequence of optimization problems having only partial knowledge of the cost functions. Having established this connection, the performance of an online gradient descent-based scheme using inexact gradient information is analyzed in the setting of static and dynamic regret, standard measures in online learning. Fundamental limitations of the scheme and its integration with adaptation mechanisms are further investigated, followed by numerical simulations on a benchmark ILC problem.
|
|
17:20-17:40, Paper TuCT05.5 | Add to My Program |
Cross-Coupled Iterative Learning Control for Complex Systems: A Monotonically Convergent and Computationally Efficient Approach |
|
Aarnoudse, Leontine | TU Eindhoven |
Kon, Johan | Eindhoven University of Technology |
Classens, Koen | Eindhoven University of Technology |
van Meer, Max | Eindhoven University of Technology |
Poot, Maurice | Eindhoven University of Technology |
Tacx, Paul | Eindhoven University of Technology |
Strijbosch, Nard | Eindhoven University of Technology |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Iterative learning control, Mechatronics
Abstract: Cross-coupled iterative learning control (ILC) can achieve high performance for manufacturing applications in which tracking a contour is essential for the quality of a product. The aim of this paper is to develop a framework for norm-optimal cross-coupled ILC that enables the use of exact contour errors that are calculated offline, and iteration- and time-varying weights. Conditions for the monotonic convergence of this iteration-varying ILC algorithm are developed. In addition, a resource-efficient implementation is proposed in which the ILC update law is reframed as a linear quadratic tracking problem, reducing the computational load significantly. The approach is illustrated on a simulation example.
|
|
17:40-18:00, Paper TuCT05.6 | Add to My Program |
Design of ILC Laws with Conditions for Stabilizing Linear 2D Discrete Roesser Models |
|
Maniarski, Robert | University of Zielona Góra |
Paszke, Wojciech | University of Zielona Gora |
Tao, Hongfeng | Jiangnan University |
Rogers, Eric | University of Southampton |
Keywords: Iterative learning control, Stability of linear systems, LMIs
Abstract: This paper is concerned with designing of iterative learning control (ILC) schemes via transforming it into an equivalent problem of designing stabilizing control laws for linear Roesser model for two-dimensional (2D) systems. Then, based on a non-conservative version of stability and stabilization conditions for linear 2D systems, suitable ILC law are designed using the linear matrix inequality (LMI) approach. In addition, some control performance indexes are improved by considering the proper pole locations of so-called intertrial transfer function and hence the speed of trial-to-trial error convergence is increased. Also, an additional LMI constraint is proposed that allows us to minimize the gain of designed controllers to facilitate their practical implementation. Furthermore, we address in this work the problem of ILC control law design for strictly proper system dynamics by means of anticipative action based on previous trial data is considered. Finally, a numerical example is given to demonstrate the effectiveness of the proposed approach and some advantages are emphasized when compared to the existing alternatives.
|
|
TuCT06 Regular Session, Tulum Ballroom F |
Add to My Program |
Neural Networks for Identification |
|
|
Chair: Lazar, Mircea | Eindhoven University of Technology |
Co-Chair: Wang, Ruigang | The University of Sydney |
|
16:00-16:20, Paper TuCT06.1 | Add to My Program |
Physics-Guided Neural Networks for Feedforward Control: From Consistent Identification to Feedforward Controller Design |
|
Bolderman, Max | Eindhoven University of Technology |
Lazar, Mircea | Eindhoven University of Technology |
Butler, Hans | ASML |
Keywords: Nonlinear systems identification, Neural networks, Mechatronics
Abstract: Model-based feedforward control improves tracking performance of motion systems if the model describing the inverse dynamics is of sufficient accuracy. Model sets, such as neural networks (NNs) and physics-guided neural networks (PGNNs) are typically used as flexible parametrizations that enable accurate identification of the inverse system dynamics. Currently, these (PG)NNs are used to identify the inverse dynamics directly. However, direct identification of the inverse dynamics is sensitive to noise that is present in the training data, and thereby results in biased parameter estimates which limit the achievable tracking performance. In order to push performance further, it is therefore crucial to account for noise when performing the identification. To address this problem, this paper proposes a forward system identification using (PG)NNs from noisy data. Afterwards, two methods are proposed for inverting PGNNs to design a feedforward controller. The developed methodology is validated on a real-life industrial linear motor, where it showed significant improvements in tracking performance with respect to the direct inverse identification.
|
|
16:20-16:40, Paper TuCT06.2 | Add to My Program |
Kalman-Bucy-Informed Neural Network for System Identification |
|
Nagel, Tobias | Fraunhofer Institute for Manufacturing Engineering and Automatio |
Huber, Marco | University of Stuttgart |
Keywords: Nonlinear systems identification, Identification, Machine learning
Abstract: Identifying parameters in a system of nonlinear, ordinary differential equations is vital for designing a robust controller. However, if the system is stochastic in its nature or if only noisy measurements are available, standard optimization algorithms for system identification usually fail. We present a new approach that combines the recent advances in physics-informed neural networks and the well-known achievements of Kalman filters in order to find parameters in a continuous-time system with noisy measurements. In doing so, our approach allows estimating the parameters together with the mean value and covariance matrix of the system's state vector. We show that the method works for complex systems by identifying the parameters of a double pendulum.
|
|
16:40-17:00, Paper TuCT06.3 | Add to My Program |
Rectified Linear Unit Based Local Linear Model Tree for Nonlinear System Identification Incorporating Prior Knowledge |
|
Glass, Leon | Robert Bosch GmbH |
Hilali, Wael | Robert Bosch GmbH |
Nelles, Oliver | University of Siegen |
Keywords: Grey-box modeling, Fuzzy systems, Neural networks
Abstract: We combine the local model network algorithm Rectified Linear Unit based Local Linear Model Tree (ReLUMoT) with recently developed tools for differentiable solution of (neural) ordinary differential equations (ODEs). The result is a model for system identification, which allows for straightforward inclusion of prior knowledge about the underlying physical process. By applying the model to a well-known Wiener-Hammerstein system identification benchmark data set, we show how the inclusion of prior knowledge can improve model quality, interpretability, memory footprint and system analysis of the trained model. Our models compare favorably to existing local model networks on this benchmark, both with and without the inclusion of prior knowledge.
|
|
17:00-17:20, Paper TuCT06.4 | Add to My Program |
NARX Identification Using Derivative-Based Regularized Neural Networks |
|
Peeters, Lars | Eindhoven University of Technology |
Beintema, Gerben Izaak | Eindhoven University of Technology |
Forgione, Marco | USI-SUPSI |
Schoukens, Maarten | Eindhoven University of Technology |
Keywords: Nonlinear systems identification, Identification
Abstract: This work presents a novel regularization method for the identification of Nonlinear Autoregressive eXogenous (NARX) models. The regularization method promotes the exponential decay of the influence of past input samples on the current model output. This is done by penalizing the sensitivity of the NARX model simulated output with respect to the past inputs. This promotes the stability of the estimated models and improves the obtained model quality. The effectiveness of the approach is demonstrated through a simulation example, where a neural network NARX model is identified with this novel method. Moreover, it is shown that the proposed regularization approach improves the model accuracy in terms of simulation error performance compared to that of other regularization methods and model classes.
|
|
17:20-17:40, Paper TuCT06.5 | Add to My Program |
Safety Verification of Neural Network Control Systems Using Guaranteed Neural Network Model Reduction |
|
Xiang, Weiming | Augusta University |
Shao, Zhongzhu | Southwest Jiaotong University |
Keywords: Model/Controller reduction, Neural networks, Model Validation
Abstract: This paper aims to enhance the computational efficiency of safety verification of neural network control systems by developing a guaranteed neural network model reduction method. First, a concept of model reduction precision is proposed to describe the guaranteed distance between the outputs of a neural network and its reduced-size version. A reachability-based algorithm is proposed to accurately compute the model reduction precision. Then, by substituting a reduced-size neural network controller into the closed-loop system, an algorithm to compute the reachable set of the original system is developed, which is able to support much more computationally efficient safety verification processes. Finally, the developed methods are applied to a case study of the Adaptive Cruise Control system with a neural network controller, which is shown to significantly reduce the computational time of safety verification and thus validate the effectiveness of the method.
|
|
17:40-18:00, Paper TuCT06.6 | Add to My Program |
Learning Over All Stabilizing Nonlinear Controllers for a Partially-Observed Linear System |
|
Wang, Ruigang | The University of Sydney |
Barbara, Nicholas H. | The University of Sydney |
Revay, Max | University of Sydney |
Manchester, Ian R. | University of Sydney |
Keywords: Machine learning, Stability of nonlinear systems, Neural networks
Abstract: This paper proposes a nonlinear policy architecture for control of partially-observed linear dynamical systems providing built-in closed-loop stability guarantees. The policy is based on a nonlinear version of the Youla parameterization, and augments a known stabilizing linear controller with a nonlinear operator from a recently developed class of dynamic neural network models called the recurrent equilibrium network (REN). {We prove that RENs are universal approximators of contracting and Lipschitz nonlinear systems, and subsequently show that the the proposed Youla-REN architecture is a universal approximator of stabilizing nonlinear controllers}. The REN architecture simplifies learning since unconstrained optimization can be applied, and we consider both a model-based case where exact gradients are available and reinforcement learning using random search with zeroth-order oracles. In simulation examples our method converges faster to better controllers and is more scalable than existing methods, while guaranteeing stability during learning transients.
|
|
TuCT07 Regular Session, Tulum Ballroom G |
Add to My Program |
Fault Tolerant Systems II |
|
|
Chair: Maggio, Martina | Lund University |
Co-Chair: Cristofaro, Andrea | Sapienza University of Rome |
|
16:00-16:20, Paper TuCT07.1 | Add to My Program |
An Analytical Framework for Control Synthesis of Cyber-Physical Systems with Safety Guarantee |
|
Niu, Luyao | University of Washington |
Maruf, Abdullah Al | University of Washington |
Clark, Andrew | Washington University in St. Louis |
Mertoguno, Sukarno | Georgia Institute of Technology |
Poovendran, Radha | University of Washington |
Keywords: Fault tolerant systems, Formal Verification/Synthesis, Resilient Control Systems
Abstract: Cyber-physical systems (CPS) are required to operate safely under fault and malicious attacks. The simplex architecture and the recently proposed cyber resilient architectures, e.g., Byzantine fault tolerant++ (BFT++), provide safety for CPS under faults and malicious cyber attacks, respectively. However, these existing architectures make use of different timing parameters and implementations to provide safety, and are seemingly unrelated. In this paper, we propose an analytical framework to represent the simplex, BFT++ and other practical cyber resilient architectures (CRAs). We construct a hybrid system that models CPS adopting any of these architectures. We derive sufficient conditions via our proposed framework under which a control policy is guaranteed to be safe. We present an algorithm to synthesize the control policy. We validate the proposed framework using a case study on lateral control of a Boeing 747, and demonstrate that our proposed approach ensures safety of the system.
|
|
16:20-16:40, Paper TuCT07.2 | Add to My Program |
Stability of Linear Systems under Extended Weakly-Hard Constraints |
|
Vreman, Nils | Lund University |
Pazzaglia, Paolo | Saarland University |
Magron, Victor | LAAS, CNRS |
Wang, Jie | Academy of Mathematics and Systems Science, CAS |
Maggio, Martina | Lund University |
Keywords: Fault tolerant systems, Embedded systems, Constrained control
Abstract: Control systems can show robustness to many events, like disturbances and model inaccuracies. It is natural to speculate that they are also robust to sporadic deadline misses when implemented as digital tasks on an embedded platform. This paper proposes a comprehensive stability analysis for control systems subject to deadline misses, leveraging a new formulation to describe the patterns experienced by the control task under different handling strategies. Such analysis brings the assessment of control systems robustness to computational problems one step closer to the controller implementation.
|
|
16:40-17:00, Paper TuCT07.3 | Add to My Program |
Experimenting with Networked Control Software Subject to Faults |
|
Josephrexon, Brindha Jeniefer | Saarland University |
Maggio, Martina | Lund University |
Keywords: Fault tolerant systems, Fault diagnosis, Control applications
Abstract: Faults and errors are common in the execution of digital controllers on top of embedded hardware. Researchers from the embedded system domain devised models to understand and bound the occurrence of these faults. Using these models, control researchers have demonstrated robustness properties of control systems, and of their corresponding digital implementations. In this paper, we build a framework to experiment with the injection of faults in a networked control system that regulates the behaviour of a Furuta pendulum. We use the software framework to experiment on computational problems that cause the control signals not to be available on time, and network faults that cause dropped packets during the transmission of sensor data and actuator commands.
|
|
17:00-17:20, Paper TuCT07.4 | Add to My Program |
Online Monitoring of Dynamic Systems for Signal Temporal Logic Specifications with Model Information |
|
Yu, Xinyi | Shanghai Jiao Tong University |
Dong, Weijie | Shanghai Jiao Tong University |
Yin, Xiang | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Formal Verification/Synthesis, Fault detection
Abstract: Online monitoring aims to evaluate or to predict, at runtime, whether or not the behaviors of a system satisfy some desired specification. It plays a key role in safety-critical cyber-physical systems. In this work, we propose a new model-based approach for online monitoring for specifications described by Signal Temporal Logic (STL) formulae. Specifically, we assume that the observed state traces are generated by an underlying dynamic system whose model is known. The main idea is to consider the dynamic of the system when evaluating the satisfaction of the STL formulae. To this end, effective approaches for the computation of feasible sets for STL formulae are provided. We show that, by explicitly utilizing the model information of the dynamic system, the proposed online monitoring algorithm can falsify or certify of the specification in advance compared with existing algorithms, where no model information is used. We also demonstrate the proposed monitoring algorithm by case studies.
|
|
17:20-17:40, Paper TuCT07.5 | Add to My Program |
Optimal Fault Detection Observer Design Using Excluding Degree |
|
Xu, Feng | Tsinghua University |
Wan, Yiming | Huazhong University of Science and Technology |
Wang, Ye | The University of Melbourne |
Keywords: Fault detection, Linear systems, Uncertain systems
Abstract: This paper proposes a novel optimal gain design method for robust fault detection using set-valued observers. First, a new fault detection performance specification is proposed for set-valued observers based on a new notion named the excluding degree of the origin from a zonotope. Second, a fractional programming problem is formulated to describe fault detection optimality of set-valued observers. Third, optimal fault detection gains are obtained by solving the fractional programming problem. Finally, numerical examples are used to illustrate the effectiveness of the proposed method.
|
|
17:40-18:00, Paper TuCT07.6 | Add to My Program |
Abstraction-Free Control Synthesis to Satisfy Temporal Logic Constraints under Sensor Faults and Attacks |
|
Niu, Luyao | University of Washington |
Li, Zhouchi | Worcester Polytechnic Institute |
Clark, Andrew | Washington University in St. Louis |
Keywords: Formal Verification/Synthesis, Stochastic systems, Fault tolerant systems
Abstract: We study the problem of synthesizing a controller to satisfy a complex task in the presence of sensor faults and attacks. We model the task using Gaussian distribution temporal logic (GDTL), and propose a solution approach that does not rely on computing any finite abstraction to model the system. We decompose the GDTL specification into a sequence of reach-avoid sub-tasks. We develop a class of fault-tolerant finite time convergence control barrier functions (CBFs) to guarantee that a dynamical system reaches a set within finite time almost surely in the presence of malicious attacks. We use the fault-tolerant finite time convergence CBFs to guarantee the satisfaction of 'reach' property. We ensure 'avoid' part in each sub-task using fault-tolerant zeroing CBFs. These fault-tolerant CBFs formulate a set of linear constraints on the control input for each sub-task. We prove that if the error incurred by system state estimation is bounded by a certain threshold, then our synthesized controller fulfills each reach-avoid sub-task almost surely for any possible sensor fault and attack, and thus the GDTL specification is satisfied with probability one. We demonstrate our proposed approach using a numerical study on the coordination of two wheeled mobile robots.
|
|
TuCT08 Regular Session, Tulum Ballroom H |
Add to My Program |
Learning and Optimization |
|
|
Chair: Notarstefano, Giuseppe | University of Bologna |
Co-Chair: Kiumarsi, Bahare | Michigan State University |
|
16:00-16:20, Paper TuCT08.1 | Add to My Program |
A Learning-Based Distributed Algorithm for Personalized Aggregative Optimization |
|
Carnevale, Guido | University of Bologna |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Optimization algorithms, Optimization, Distributed control
Abstract: This paper addresses distributed aggregative optimization, i.e., a recently emerged framework in which agents in a network want to minimize the sum of local objective functions depending both on a local decision variable and on an aggregated version of all the variables (e.g., the mean). We consider a "personalized'' set-up in which each local function is given by the sum of a known term and an unknown one capturing user's dissatisfaction. We propose a novel data-driven distributed algorithm taking advantage of users' noisy feedbacks to learn the parameters of the unknown function concurrently with the optimization steps. We prove an upper bound for the dynamic regret related to (i) the initial conditions, (ii) the temporal variations of the objective functions, and (iii) the learning errors. Moreover, by considering the average dynamic regret, we prove that both initial conditions and learning errors do not affect the asymptotic performance of the algorithm. Finally, a numerical simulation in the context of opinion dynamics validates our findings.
|
|
16:20-16:40, Paper TuCT08.2 | Add to My Program |
Designing Safety Certificates for H-Infinity Control of Unknown Linear Systems |
|
Tooranjipour, Pouria | Michigan State Univeristy |
Kiumarsi, Bahare | Michigan State University |
Keywords: Optimization, Uncertain systems, Machine learning
Abstract: This paper synthesizes safety barrier certificates for H-infinity control of unknown linear systems. Inspired by the idea of expanding the domain of attraction (DoA) using control barrier functions (CBFs), we construct a region, called safe optimal DoA, in which both optimality and safety are simultaneously respected. The effect of unknown disturbances is attenuated by unifying the robustness property of CBFs and the H∞ control methods. To construct the control barrier certificates, a feasible optimization problem is developed using the relaxed game algebraic Riccati equation. A sum-of-squares (SOS)-based safe policy iteration algorithm is then proposed to solve the optimization problem iteratively. The resulting optimal policy minimizes a predefined cost function over an obtained maximum barrier-certified region for which there is no conflict between safety and optimality. Furthermore, to remedy the requirement of having the system dynamics, an online data-driven approach is presented using off-policy reinforcement learning. Finally, to demonstrate the efficacy of the proposed controller, a numerical example is given.
|
|
16:40-17:00, Paper TuCT08.3 | Add to My Program |
A Homotopic Approach to Policy Gradients for Linear Quadratic Regulators with Nonlinear Controls |
|
Chen, Craig | Duke University |
Agazzi, Andrea | Duke University |
Keywords: Optimization algorithms, Machine learning, Stability of nonlinear systems
Abstract: We study the convergence of deterministic policy gradient algorithms in continuous state and action space for the prototypical Linear Quadratic Regulator (LQR) problem when the search space is not limited to the family of linear policies. We first provide a counterexample showing that extending the policy class to piecewise linear functions results in local minima of the policy gradient algorithm. To solve this problem, we develop a new approach that involves sequentially increasing a discount factor between iterations of the original policy gradient algorithm. We finally prove that this homotopic variant of policy gradient methods converges to the global optimum of the undiscounted Linear Quadratic Regulator problem for a large class of Lipschitz, non-linear policies.
|
|
17:00-17:20, Paper TuCT08.4 | Add to My Program |
Stochastic Learning Rate with Memory: Optimization in the Stochastic Approximation and Online Learning Settings |
|
Mamalis, Theodoros | University of Illinois at Urbana-Champaign |
Stipanovic, Dusan M. | Univ of Illinois, Urbana-Champaign |
Voulgaris, Petros G. | Univ of Nevada, Reno |
Keywords: Optimization algorithms, Machine learning, Statistical learning
Abstract: In this work, multiplicative stochasticity is applied to the learning rate of stochastic optimization algorithms, giving rise to stochastic learning-rate schemes. In-expectation theoretical convergence results of Stochastic Gradient Descent equipped with this novel stochastic learning rate scheme under the stochastic setting, as well as convergence results under the online optimization settings are provided. Empirical results consider the case of an adaptively uniformly distributed multiplicative stochasticity and include not only Stochastic Gradient Descent, but also other popular algorithms equipped with a stochastic learning rate. They demonstrate noticeable optimization performance gains, with respect to their deterministic-learning-rate versions.
|
|
17:20-17:40, Paper TuCT08.5 | Add to My Program |
Pick Your Neighbor: Local Gauss-Southwell Rule for Fast Asynchronous Decentralized Optimization |
|
Costantini, Marina | EURECOM |
Liakopoulos, Nikolaos | Amazon |
Mertikopoulos, Panayotis | French National Center for Scientific Research (CNRS) |
Spyropoulos, Thrasyvoulos | Eurecom |
Keywords: Optimization algorithms, Decentralized control, Networked control systems
Abstract: In decentralized optimization environments, each agent i in a network of n nodes has its own private function f_i, and nodes communicate with their neighbors to cooperatively minimize the aggregate objective sum_{i=1}^n f_i. In this setting, synchronizing the nodes' updates incurs significant communication overhead and computational costs, so much of the recent literature has focused on the analysis and design of asynchronous optimization algorithms, where agents activate and communicate at arbitrary times without needing a global synchronization enforcer. However, most works assume that when a node activates, it selects the neighbor to contact based on a fixed probability (e.g., uniformly at random), a choice that ignores the optimization landscape at the moment of activation. Instead, in this work we introduce an optimization-aware selection rule that chooses the neighbor providing the highest dual cost improvement (a quantity related to a dualization of the problem based on consensus). This scheme is related to the coordinate descent (CD) method with the Gauss-Southwell (GS) rule for coordinate updates; in our setting however, only a subset of coordinates is accessible at each iteration (because each node can communicate only with its neighbors), so the existing literature on GS methods does not apply. To overcome this difficulty, we develop a new analytical framework for smooth and strongly convex f_i that covers the class of set-wise CD algorithms - a class that directly applies to decentralized scenarios, but is not limited to them - and we show that the proposed set-wise GS rule achieves a speedup factor of up to the maximum degree in the network (which is in the order of Theta(n) for highly connected graphs). The speedup predicted by our analysis is validated in numerical experiments with synthetic data.
|
|
17:40-18:00, Paper TuCT08.6 | Add to My Program |
Stability, Linear Convergence, and Robustness of the Wang-Elia Algorithm for Distributed Consensus Optimization |
|
Bin, Michelangelo | Imperial College London |
Notarnicola, Ivano | University of Bologna |
Parisini, Thomas | Imperial College & Univ. of Trieste |
Keywords: Optimization algorithms, Network analysis and control, Decentralized control
Abstract: We revisit an algorithm for distributed consensus optimization proposed in 2010 by J. Wang and N. Elia. By means of a Lyapunov-based analysis, we prove input-to-state stability of the algorithm relative to a closed invariant set composed of optimal equilibria and with respect to perturbations affecting the algorithm's dynamics. In the absence of perturbations, this result implies linear convergence of the local estimates and Lyapunov stability of the optimal steady state. Moreover, we unveil fundamental connections with the well-known Gradient Tracking and with distributed integral control. Overall, our results suggest that a control theoretic approach can have a considerable impact on (distributed) optimization, especially when robustness is considered.
|
|
TuCT09 Regular Session, Maya Ballroom I |
Add to My Program |
Network Analysis and Control II |
|
|
Chair: Giordano, Giulia | University of Trento |
Co-Chair: Polushin, Ilia G. | Western University |
|
16:00-16:20, Paper TuCT09.1 | Add to My Program |
Excitation and Measurement Patterns for the Identifiability of Directed Acyclic Graphs |
|
Mapurunga, Eduardo | Universidade Federal Do Rio Grande Do Sul |
Gevers, Michel | Univ. Catholique De Louvain |
Bazanella, Alexandre S. | Univ. Federal Do Rio Grande Do Sul |
Keywords: Network analysis and control, Closed-loop identification, Large-scale systems
Abstract: This paper deals with the design of Excitation and Measurement Patterns (EMP) for the identification of a class of dynamical networks whose topology has the structure of a Directed Acyclic Graph (DAG). In addition to the by now well known condition that the identifiabiltiy of any dynamical network requires that the sources be excited, the sinks be measured, and all other nodes be either excited or measured, we show that for DAGs two other types of nodes have special excitation and measurement requirements. Armed with this result, we propose a systematic procedure for the design of EMPs that guarantee identifiability of a network with DAG topology.
|
|
16:20-16:40, Paper TuCT09.2 | Add to My Program |
Fair and Sparse Solutions in Network-Decentralised Flow Control |
|
Blanchini, Franco | Univ. Degli Studi Di Udine |
Devia, Carlos Andres | Delft University of Technology (TU Delft) |
Giordano, Giulia | University of Trento |
Pesenti, Raffaele | University of Venice - Ca' Foscari |
Rosset, Francesca | University of Udine |
Keywords: Network analysis and control, Decentralized control
Abstract: We proposed network-decentralised control strategies, in which each actuator can exclusively rely on local information, without knowing the network topology and the external input, ensuring that the flow asymptotically converges to the optimal one with respect to the p-norm. For 1 < p < infinity, the flow converges to a unique constant optimal u_p^*. We show that the state converges to the optimal Lagrange multiplier of the optimisation problem. Then, we consider networks where the flows are affected by unknown spontaneous dynamics and the buffers need to be driven exactly to a desired set-point. We propose a network-decentralised proportional-integral controller that achieves this goal along with asymptotic flow optimality; now it is the integral variable that converges to the optimal Lagrange multiplier. The extreme cases p equal to 1 and p equal to infinity are of some interest since the former encourages sparsity of the solution while the latter promotes fairness. Unfortunately, for p equal to 1 or p equal to infinity these strategies become discontinuous and lead to chattering of the flow, hence no optimality is achieved. We then show how to approximately achieve the goal as the limits for p tending to 1 or p tending to infinity.
|
|
16:40-17:00, Paper TuCT09.3 | Add to My Program |
Fixed-Point Centrality for Networks |
|
Gao, Shuang | UC Berkeley, McGill University |
Keywords: Network analysis and control, Large-scale systems, Mean field games
Abstract: This paper proposes a family of network centralities called fixed-point centralities. This centrality family is defined via the fixed point of permutation equivalent mappings related to the underlying network. Such a centrality notion is immediately extended to define fixed-point centralities for infinite graphs characterized by graphons. Variation bounds of such centralities with respect to the variations of the underlying graphs and graphons under mild assumptions are established. Fixed-point centralities connect with a variety of different models on networks including graph neural networks, static and dynamic games on networks, and Markov decision processes.
|
|
17:00-17:20, Paper TuCT09.4 | Add to My Program |
Consensus of Network of Unstable Homogeneous Linear Systems |
|
Ong, Chong-Jin | National University of Singapore |
Canyakmaz, Ilayda | National University of Singapore |
Keywords: Agents-based systems, Networked control systems, Network analysis and control
Abstract: This work addresses the synchronization/consensus problem for identical multi-agent system (MAS). The dynamics of the agent is a general linear system that can be unstable. This work uses a gain matrix in a dynamic compensator setting and shows that under mild conditions, synchronization/consensus is achieved when the gain is sufficiently large. The proposed controller structure can be seen as a special case of existing MAS structures but offers consensus conditions that are simpler than the existing results, especially in the case of switching networks. It can be applied to the following three communication settings: fixed network, switching among undirected and connected networks and switching among directed and connected networks. An example is provided to illustrate the results.
|
|
17:20-17:40, Paper TuCT09.5 | Add to My Program |
Modular Scattering-Based Design of Dissipative Networks |
|
Polushin, Ilia G. | Western University |
Keywords: Nonlinear systems, Network analysis and control, Large-scale systems
Abstract: Results towards modular design of complex dissipative networks are presented. Graph separation condition for proper interconnection of dissipative networks are given, and methods for design of scattering transformations that guarantees the fulfilment of the graph separation condition are presented. To ensure full generality of the results, behavioral framework for modeling of dissipative networks is utilized. Properly interconnected dissipative networks can be used as modular blocks that can be further interconnected with other dissipative networks thus allowing for iterative design of complex networks of dissipative systems.
|
|
17:40-18:00, Paper TuCT09.6 | Add to My Program |
Dec-AltProjGDmin: Fully-Decentralized Alternating Projected Gradient Descent for Low-Rank Column-Wise Compressive Sensing |
|
Moothedath, Shana | Iowa State University |
Vaswani, Namrata | Iowa State University |
Keywords: Network analysis and control, Learning, Decentralized control
Abstract: This work develops a fully-decentralized alternating projected gradient descent algorithm, called Dec- AltProjGD, for solving the following low-rank (LR) matrix recovery problem: recover an LR matrix from independent columnwise linear projections (LR column-wise Compressive Sensing). We prove its correctness under simple assumptions and argue that Dec-AltProjGD is both faster and more communication-efficient than various other potential solution approaches, in addition to also having one of the best sample complexity guarantees. To our best knowledge, this work is the first attempt to develop a provably correct fully-decentralized algorithm for any problem involving the use of an alternating projected GD algorithm and one in which the constraint set to be projected to is a non-convex set.
|
|
TuCT10 Regular Session, Maya Ballroom II |
Add to My Program |
Stochastic Systems III |
|
|
Chair: Lavaei, Abolfazl | Newcastle University |
Co-Chair: Villanueva, Mario E. | ShanghaiTech University |
|
16:00-16:20, Paper TuCT10.1 | Add to My Program |
Path Integral Methods with Stochastic Control Barrier Functions |
|
Tao, Chuyuan | University of Illinois Urbana-Champaign |
Yoon, Hyungjin | University of Nevada, Reno |
Kim, Hunmin | University of Illinois Urbana-Champaign |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Voulgaris, Petros G. | Univ of Nevada, Reno |
Keywords: Stochastic optimal control, Uncertain systems
Abstract: Safe control designs for robotic systems remain challenging because of the difficulties of explicitly solving optimal control with nonlinear dynamics perturbed by stochastic noise. However, recent technological advances in computing devices enable online optimization or sampling-based methods to solve control problems. For example, Control Barrier Functions (CBFs) have been proposed to numerically solve convex optimization problems that ensure the control input to stay in the safe set. Model Predictive Path Integral (MPPI) control uses forward sampling of stochastic differential equations to solve optimal control problems online. Both control algorithms are widely used for nonlinear systems because they avoid calculating the derivatives of the nonlinear dynamic functions. In this paper, we use Stochastic Control Barrier Functions (SCBFs) constraints to limit sample regions in the sampling-based algorithm, ensuring safety in a probabilistic sense and improving sample efficiency with a stochastic differential equation. We also show that our algorithm needs fewer samples than the original MPPI algorithm does by providing a sampling complexity analysis.
|
|
16:20-16:40, Paper TuCT10.2 | Add to My Program |
Multi-Symmetric Lyapunov Equations |
|
Gao, Xvting | ShanghaiTech University |
Villanueva, Mario E. | IMT School for Advanced Studies Lucca |
Houska, Boris | ShanghaiTech University |
Keywords: Stochastic systems, Stochastic optimal control, Lyapunov methods
Abstract: This paper studies multi-symmetric Lyapunov equations and their application to stochastic control. It is shown how Lyapunov recursions can be used to efficiently compute the tensor-valued cumulants of the transient- and limit distributions of stochastic linear systems. This analysis is based on the assumption that the process noise distribution admits a moment expansion but, apart from this, all derivations and numerical algorithms are kept entirely general—without introducing any further assumptions on the distribution. Moreover, it is shown both theoretically and numerically how to exploit these recursions to construct accurate approximations of a rather general class of stochastic optimal control problems for linear discrete-time systems with conditional-value-at-risk constraints.
|
|
16:40-17:00, Paper TuCT10.3 | Add to My Program |
Dissipativity, Inverse Optimal Control, and Stability Margins for Nonlinear Discrete-Time Stochastic Feedback Regulators |
|
Haddad, Wassim M. | Georgia Inst. of Tech |
Lanchares, Manuel | Georgia Institute of Technology |
Keywords: Stochastic systems, Nonlinear systems, Robust control
Abstract: In this paper, we derive stability margins for optimal and inverse optimal stochastic feedback regulators. Specifically, gain, sector, and disk margin guarantees are obtained for discrete-time nonlinear stochastic dynamical systems controlled by nonlinear optimal and inverse optimal controllers that minimize a nonlinear-nonquadratic performance criterion. Furthermore, using the newly developed notion of stochastic dissipativity we derive a return difference inequality to provide connections between stochastic dissipativity and optimality of nonlinear controllers for discrete-time stochastic dynamical systems.
|
|
17:00-17:20, Paper TuCT10.4 | Add to My Program |
On Estimate of Settling-Time Distributions of Finite-Time Stable Stochastic Systems |
|
Hoshino, Kenta | Kyoto University |
Keywords: Stochastic systems, Stability of nonlinear systems, Lyapunov methods
Abstract: This study characterizes the settling time of stochastic systems with finite-time stability. Finite-time stability guarantees finite-time convergence of system states to an equilibrium. One of the primary features of finite-time stability is settling time, i.e., the time required for the convergence of system states. In stochastic systems, settling time is a random variable and must be characterized stochastically. In this study, the probability distribution of random settling time is estimated. This complements the estimation of settling time in deterministic systems. A lower bound to the probability of settling time is obtained by combining a comparison theorem for stochastic differential equations (SDEs) with a partial differential equation (PDE) that describes the distribution of stopping times. Besides theoretical characterization, a numerical example is also presented to validate the results.
|
|
17:20-17:40, Paper TuCT10.5 | Add to My Program |
Stochastic Safety in Space Conjunctions |
|
Gomez, Aitor | Aalborg University |
Wisniewski, Rafal | Aalborg University |
Keywords: Stochastic systems, Aerospace, Optimization
Abstract: The stochastic reach-avoid problem termed p-safety is further examined in the context of space debris and short-term orbital encounters. We define the collision probability problem, and reformulate it as a strong p-safety problem,which offers a computable solution. Enabling computation comes at the cost of a more restrictive formulation which requires several relaxation schemes. To this end, Bernstein forms are employed as polynomial approximation of the nonlinear dynamics, and sum-of-squares as bases to attain certificates of positivity. Finally, a stochastic version of the unperturbed planetary equations is used to model the dynamics.
|
|
17:40-18:00, Paper TuCT10.6 | Add to My Program |
Learning Based Stochastic Data-Driven Predictive Control |
|
Athni Hiremath, Sandesh | University of Kaiserslautern |
Mishra, Vikas Kumar | Technische Universitat Kaiserlautern |
Bajcinca, Naim | University of Kaiserslautern |
Keywords: Stochastic systems, Behavioural systems, Neural networks
Abstract: We consider a stochastic linear system with ad- ditive Gaussian noise and formulate a stochastic variant of Willems et al. fundamental lemma. Based on this, we formu- late a stochastic optimal control problem wherein the system behavior is specified using the developed stochastic fundamental lemma. We call this the stochastic data-driven optimal control problem, which we then show to be equivalent to a statistical regression problem. Following this we construct a parame- terized nonlinear estimator and use it to develop a learning algorithm to solve a stochastic data-driven predictive control problem. The proposed algorithm further enables us to consider different generalizations of the problem such as varying initial and Hankel matrix data obtained from stochastic linear and nonlinear system. Based on numerical simulations, we observe that the condition of persistency of excitation of inputs is not necessary for learning. This motivates us to formulate a lemma which indicates that the order of persistency of excitation required by inputs in the fundamental lemma is not strictly necessary.
|
|
TuCT11 Regular Session, Maya Ballroom III |
Add to My Program |
Robust Control I |
|
|
Chair: Bridgeman, Leila | Duke University |
Co-Chair: Rantzer, Anders | Lund University |
|
16:00-16:20, Paper TuCT11.1 | Add to My Program |
On Decentralized H-Infinity Optimal Positive Systems |
|
Vladu, Emil | Lund University |
Rantzer, Anders | Lund University |
Keywords: Robust control, Compartmental and Positive systems
Abstract: This paper gives a closed-form expression for an H-infinity optimal controller with diagonal gain matrix. This phenomenon occurs for certain network systems with acyclic graphs, and potential applications include irrigation networks. Moreover, the above is identified as a special case of a particular controller structure which is shown to be H-infinity optimal if the controller and the resulting closed-loop system are positive. A subsequent result then establishes a connection with open- loop positive systems, and its usefulness in applications is demonstrated on a large set of compartmental systems.
|
|
16:20-16:40, Paper TuCT11.2 | Add to My Program |
Robust FOPID Stabilization for Smith Predictor Structures |
|
Ghorbani, Majid | Tallinn University of Technology, Department of Computer Systems |
Tepljakov, Aleksei | Tallinn University of Technology |
Petlenkov, Eduard | Tallinn University of Technology |
Keywords: Robust control, Delay systems, Uncertain systems
Abstract: In this letter, an effective graphical tuning method of Fractional-Order Proportional Integral Derivative (FOPID) controllers is proposed for the Smith Predictor (SP) control structure. At first, necessary and sufficient conditions are achieved for the robust stability of the SP structure based on the zero exclusion principle. Then, by benefiting from the D-decomposition technique and the value set concept, the problem of robustly stabilizing the SP scheme is solved using FOPID controllers. Moreover, an auxiliary function is provided to enhance the performance of the SP structure by the sensitivity function. Simulation results successfully confirm the validity and effectiveness of the proposed method, which is provided in illustrative examples.
|
|
16:40-17:00, Paper TuCT11.3 | Add to My Program |
Distributed Robust Control for Systems with Structured Uncertainties |
|
Li, Jing Shuang | California Institute of Technology |
Doyle, John C. | Caltech |
Keywords: Robust control, Distributed control, Optimal control
Abstract: We present D-Phi iteration: an algorithm for distributed, localized, and scalable robust control of systems with structured uncertainties. This algorithm combines the System Level Synthesis (SLS) parametrization for distributed control with stability criteria from L1, L-Infinity, and nu robust control. We show in simulation that this algorithm achieves good nominal performance while greatly increasing the robust stability margin compared to the LQR controller. To the best of our knowledge, this is the first distributed and localized algorithm for structured robust control; furthermore, algorithm complexity depends only on the size of local neighborhoods and is independent of global system size. We additionally characterize the suitability of different robustness criteria for distributed and localized computation.
|
|
17:00-17:20, Paper TuCT11.4 | Add to My Program |
Dissipative Imitation Learning for Robust Dynamic Output Feedback |
|
Strong, Amy | Duke University |
LoCicero, Ethan | Duke University |
Bridgeman, Leila | Duke University |
Keywords: Robust control, Learning, LMIs
Abstract: Robust imitation learning seeks to mimic expert controller behavior while ensuring stability, but current methods require accurate plant models. Here, robust imitation learning is addressed for stabilizing poorly modeled plants with linear dynamic output feedback. Open-loop input-output properties are used to characterize an uncertain plant, and the feedback matrix of the dynamic controller is learned while enforcing stability through the controller's open-loop QSR-dissipativity properties. The imitation learning method is applied to two systems with parametric uncertainty.
|
|
17:20-17:40, Paper TuCT11.5 | Add to My Program |
Robust Differential Dynamic Programming |
|
Gramlich, Dennis | RWTH Aachen |
Scherer, Carsten W. | University of Stuttgart |
Ebenbauer, Christian | RWTH Aachen University |
Keywords: Robust control, LMIs, Nonlinear systems
Abstract: Differential Dynamic Programming is an optimal control technique often used for trajectory generation. Many variations of this algorithm have been developed in the literature, including algorithms for stochastic dynamics or state and input constraints. In this contribution, we develop a robust version of Differential Dynamic Programming that uses generalized plants and multiplier relaxations for uncertainties. To this end, we study a version of the Bellman principle and use convex relaxations to account for uncertainties in the dynamic program. The resulting algorithm can be seen as a robust trajectory generation tool for nonlinear systems.
|
|
17:40-18:00, Paper TuCT11.6 | Add to My Program |
On Modal Observers for Beyond Rigid Body H_infty Control in High-Precision Mechatronics |
|
Broens, Yorick | Eindhoven University of Technology |
Butler, Hans | ASML |
Tóth, Roland | Eindhoven University of Technology |
Keywords: Robust control, Mechatronics
Abstract: The ever increasing need for performance results in increasingly rigorous demands on throughput and positioning accuracy of high-precision motion systems, which often suffer from position dependent effects that originate from relative actuation and sensing of the moving-body. Due to the highly stiff mechanical design, such systems are typically controlled using rigid body control design approaches. Nonetheless, the presence of position dependent flexible dynamics severely limits attainable position tracking performance. This paper presents two extensions of the conventional rigid body control framework towards active control of position dependent flexible dynamics. Additionally, a novel control design approach is presented, which allows for shaping of the full closed-loop system by means of structured H_infty co-design. The effectiveness of the approach is validated through simulation using a high-fidelity model of a state-of-the-art moving-magnet planar actuator.
|
|
TuCT12 Invited Session, Maya Ballroom IV |
Add to My Program |
Event-Triggered Control for Multi-Vehicle, Multi-Robot, and Multi-Agent
Systems |
|
|
Chair: Nowzari, Cameron | George Mason University |
Co-Chair: Solowjow, Friedrich | RWTH Aachen University |
Organizer: Nowzari, Cameron | George Mason University |
Organizer: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
Organizer: Hirche, Sandra | Technische Universität München |
|
16:00-16:20, Paper TuCT12.1 | Add to My Program |
Quantized Sampled-Data Attitude Control of Ground Vehicles: An Event-Based Approach |
|
Borri, Alessandro | CNR-IASI |
Di Ferdinando, Mario | University of L'Aquila |
Bianchi, Domenico | University of L'Aquila |
Pepe, Pierdomenico | University of L' Aquila |
Di Gennaro, Stefano | University of L'Aquila |
Keywords: Automotive control, Sampled-data control, Quantized systems
Abstract: Attitude control systems for ground vehicles have been an important topic in automotive research for decades, and have been extensively studied by resorting to classical continuous-time nonlinear design. Although this approach can incorporate saturation constraints and actuator dynamics in the design, the computed control laws are often approximated and applied within digital environments in absence of formal performance guarantees. In this work, we present a quantized sampled-data approach to the vehicle attitude control problem. Starting from classical nonlinear design achieving tracking of prescribed trajectories in continuous time (emulation approach), we derive conditions for preserving the practical stability of the error dynamics by means of quantized sampled-data event-based controllers. Simulations performed in an non-ideal setting confirm the potential of the approach.
|
|
16:20-16:40, Paper TuCT12.2 | Add to My Program |
Distributed Platoon Control of Nonlinear Vehicles with Event-Triggered Extended State Observers: Closed-Loop Stability (I) |
|
Liu, Anquan | Shanghai University |
Li, Tao | East China Normal University |
Keywords: Cooperative control, Estimation, Autonomous vehicles
Abstract: We study the platoon control based on nonlinear vehicle models. A third-order vehicle model with parameter uncertainties and external disturbances is considered and the constant spacing policy is adopted. The control law of each follower vehicle only depends on the information obtained by on-board sensors, including its own velocity, acceleration, the velocity of the preceding vehicle and the inter-vehicle distance. Firstly, an event-triggered extended state observer (ESO) is designed to estimate the unmodeled dynamics in the vehicle model. Then based on the estimate, a distributed control law is presented by using the dynamic surface control method. We show that the control parameters can be properly designed to make the observation errors of the ESOs bounded and ensure that the inter-vehicle distance errors enter into a small neighborhood of zero. We prove that the Zeno behavior is avoided under the designed event-triggered mechanism. The effectiveness of the proposed control law is demonstrated by numerical simulations.
|
|
16:40-17:00, Paper TuCT12.3 | Add to My Program |
Event-Triggered L2-Optimal Formation Control with State-Estimation for Agents Modeled As LPV Systems (I) |
|
Gebhardt, Gerald | Hamburg University of Technology |
Saadabadi, Hamideh | TUHH |
Werner, Herbert | Hamburg University of Technology, Institute of Control Systems |
Keywords: Control of networks, Estimation, Linear parameter-varying systems
Abstract: This paper proposes a distributed scheme with different estimators for the event-triggered formation control of polytopic homogeneously scheduled linear parameter-varying (LPV) multi-agent systems (MAS). Each agent consists of a time-triggered inner feedback loop and a larger event-triggered outer feedback loop to track a formation reference signal and reject input and output noise. If a local event-trigger condition is violated, the event-triggered outer feedback loop is closed through the communication network. The event-trigger condition is only based on locally available information. To design the controller, a synthesis problem is formulated as a linear matrix inequality of the size of a single agent under the assumption, that local estimators trigger intercommunication events with neighboring agents if the event-trigger condition is violated. The design procedure guarantees stability and bounded l2-performance. Furthermore, the estimators are interchangeable for a given controller. We compare in simulation zero-order hold, open-loop estimation, and closed-loop estimation strategies. Simulation trials are carried out with non-holonomic dynamic unicycles modeled as polytopic LPV systems.
|
|
17:00-17:20, Paper TuCT12.4 | Add to My Program |
An Event-Triggered Distributed Observer for Leader-Following Consensus of Multiple Rigid Body Systems Over Switching Networks (I) |
|
Wang, Tianqi | The Chinese University of Hong Kong |
Huang, Jie | The Chinese University of Hong Kong |
Keywords: Autonomous systems, Cooperative control, Flight control
Abstract: The distributed observer is effective for solving the leader-following attitude consensus problem of multiple rigid-body systems. In this paper, we first develop an event-triggered distributed observer for a leader rigid body system over jointly connected switching networks. It is shown that, under two standard assumptions, the state of the event-triggered distributed observer will exponentially converge to the leader's state and the inter-event time sequence admits a positive lower bound. Then, we synthesize an event-triggered distributed observer based control law to solve the leader following attitude consensus problem of multiple rigid body systems. The effectiveness of our design will be illustrated with a numerical example.
|
|
17:20-17:40, Paper TuCT12.5 | Add to My Program |
Continuous-Time and Event-Triggered Online Optimization for Linear Multi-Agent Systems (I) |
|
Yu, Yang | Tongji University |
Li, Xiuxian | Tongji University |
Li, Li | Tongji University, P.R.C |
Xie, Lihua | Nanyang Tech. Univ |
Keywords: Cooperative control, Distributed control, Optimization
Abstract: This paper studies the decentralized online convex optimization problem for heterogeneous linear multi-agent systems. Agents have access to their time-varying local cost functions related to their own outputs, and there are also time-varying coupling inequality constraints among them. The target of each agent is to minimize the global cost function by selecting appropriate local actions only through communication between neighbors. We design a distributed controller based on the saddle-point method which achieves constant regret bound and sublinear fit bound. Moreover, to diminish the communication overhead, another distributed controller is developed with an event-triggered communication scheme and it is shown that the above bounds are still achieved in the case of discrete communications with no Zeno behavior. A numerical example is given to verify the proposed algorithms.
|
|
17:40-18:00, Paper TuCT12.6 | Add to My Program |
Towards Remote Fault Detection by Analyzing Communication Priorities (I) |
|
Gräfe, Alexander | RWTH Aachen University |
Baumann, Dominik | Uppsala University |
Trimpe, Sebastian | RWTH Aachen University |
Keywords: Fault detection, Networked control systems, Distributed control
Abstract: The ability to detect faults is an important safety feature for event-based multi-agent systems. In most existing algorithms, each agent tries to detect faults by checking its own behavior. But what if one agent becomes unable to recognize misbehavior, for example due to failure in its onboard fault detection? To improve resilience and avoid propagation of individual errors to the multi-agent system, agents should check each other remotely for malfunction or misbehavior. In this paper, we build upon a recently proposed predictive triggering architecture that involves communication priorities shared throughout the network to manage limited bandwidth. We propose a fault detection method that uses these priorities to detect errors in other agents. The resulting algorithms is not only able to detect faults, but can also run on a low-power microcontroller in real-time, as we demonstrate in hardware experiments.
|
|
TuCT13 Regular Session, Maya Ballroom V |
Add to My Program |
Predictive Control for Linear Systems III |
|
|
Chair: Borrelli, Francesco | Unversity of California at Berkeley |
Co-Chair: Pangborn, Herschel | Pennsylvania State University |
|
16:00-16:20, Paper TuCT13.1 | Add to My Program |
Improved Active Set Dynamic Programming for Solving Linear-Quadratic Optimal Control Problems |
|
Mitze, Ruth | Ruhr-Universität Bochum |
Monnigmann, Martin | Ruhr-Universität Bochum |
Keywords: Predictive control for linear systems, Optimal control, Computational methods
Abstract: The solution of a constrained linear-quadratic optimal control problem is a continuous piecewise-affine function. This solution can be determined by constructing all active sets that define the affine pieces. It has recently been proposed to combine the construction of all active sets with dynamic programming (DP). In this type of approach, DP is not applied to find the optimal input signal sequence that solves the optimal control problem, but DP is used to build up the set of all active sets for increasing horizon length. The existing approach for constructing the set of active sets with DP outperforms classical active set enumeration methods for long horizons, but they are less efficient for short horizons. We show that a simple bound on the candidate active set cardinality, which also is used in active set enumeration, renders the DP approach competitive also for short horizons.
|
|
16:20-16:40, Paper TuCT13.2 | Add to My Program |
Computationally Efficient Robust MPC Using Optimized Constraint Tightening |
|
Parsi, Anilkumar | ETH Zurich |
Anagnostaras, Panagiotis | ETH Zurich |
Iannelli, Andrea | ETH Zurich |
Smith, Roy S. | ETH Zurich |
Keywords: Predictive control for linear systems, Optimization, Uncertain systems
Abstract: A robust model predictive control (MPC) method is presented for linear, time-invariant systems affected by bounded additive disturbances. The main contribution is the offline design of a disturbance-affine feedback gain whereby the resulting constraint tightening is minimized. This is achieved by formulating the constraint tightening problem as a convex optimization problem with the feedback term as a variable. The resulting MPC controller has the computational complexity of nominal MPC, and guarantees recursive feasibility, stability and constraint satisfaction. The advantages of the proposed approach compared to existing robust MPC methods are demonstrated using numerical examples.
|
|
16:40-17:00, Paper TuCT13.3 | Add to My Program |
Noncausal Lifting Linearization for Nonlinear Dynamic Systems under Model Predictive Control |
|
Park, Seho | Pennsylvania State University |
Pangborn, Herschel | Pennsylvania State University |
Keywords: Predictive control for linear systems, Predictive control for nonlinear systems, Identification for control
Abstract: This paper presents a lifting linearization method for applying linear Model Predictive Control (MPC) to nonlinear dynamic systems. While existing lifting linearization methods provide accurate linear approximations when the nonlinearity is a function of the state only, they require additional assumptions or result in bilinear lifted representations when the nonlinearity is also a function of the control input. The proposed method approximates control-affine and control-nonaffine nonlinear dynamics with noncausal linear dynamics to achieve improved model accuracy. This noncausality in the lifted linear dynamics is then addressed within an MPC framework. Numerical examples illustrate that the proposed approach closely matches the performance of nonlinear MPC at a fraction of the computational cost, outpacing the performance of existing linearization methods.
|
|
17:00-17:20, Paper TuCT13.4 | Add to My Program |
Robust Output Feedback MPC with Reduced Conservatism under Ellipsoidal Uncertainty |
|
Ji, Tianchen | University of Illinois at Urbana-Champaign |
Geng, Junyi | Carnegie Mellon University |
Driggs-Campbell, Katherine | University of Illinois at Urbana-Champaign |
Keywords: Predictive control for linear systems, Robust control
Abstract: Robust design of autonomous systems under uncertainty is an important yet challenging problem. This work proposes a robust controller that consists of a state estimator and a tube based predictive control law. The class of linear systems under ellipsoidal uncertainty is considered. In contrast to existing approaches based on polytopic sets, the constraint tightening is directly computed from the ellipsoidal sets of disturbances without over-approximation, thus leading to less conservative bounds. Conditions to guarantee robust constraint satisfaction and robust stability are presented. Further, by avoiding the usage of Minkowski sum in set computation, the proposed approach can also scale up to high-dimensional systems. The results are illustrated by examples.
|
|
17:20-17:40, Paper TuCT13.5 | Add to My Program |
Recursively Feasible Stochastic Predictive Control Using an Interpolating Initial State Constraint |
|
Köhler, Johannes | ETH Zurich |
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Predictive control for linear systems, Stochastic optimal control, Constrained control
Abstract: We present a stochastic model predictive control (SMPC) framework for linear systems subject to possibly unbounded disturbances. State of the art SMPC approaches with closed-loop chance constraint satisfaction recursively initialize the nominal state based on the previously predicted nominal state or possibly the measured state under some case distinction. We improve these initialization strategies by allowing for a continuous optimization over the nominal initial state in an interpolation of these two extremes. The resulting SMPC scheme can be implemented as one standard quadratic program and is more flexible compared to state-of-the-art initialization strategies. As the main technical contribution, we show that the proposed SMPC framework also ensures closed-loop satisfaction of chance constraints and suitable performance bounds.
|
|
17:40-18:00, Paper TuCT13.6 | Add to My Program |
Safe Stochastic Model Predictive Control |
|
Brüdigam, Tim | Technical University of Munich |
Jacumet, Robert | Technical University of Munich |
Wollherr, Dirk | Technische Universität München |
Leibold, Marion | TU Muenchen |
Keywords: Predictive control for linear systems, Uncertain systems, Robust control
Abstract: Combining efficient and safe control for safety-critical systems is challenging. Robust methods may be overly conservative, whereas probabilistic controllers require a trade-off between efficiency and safety. In this work, we propose a safety algorithm that is compatible with any stochastic Model Predictive Control method for linear systems with additive uncertainty and polytopic constraints. This safety algorithm uses the control inputs of a stochastic Model Predictive Control as long as a safe backup planner can ensure safety with respect to satisfying hard constraints subject to bounded uncertainty. Besides ensuring safe behavior, the proposed stochastic Model Predictive Control algorithm guarantees recursive feasibility and input-to-state stability of the system origin. The benefits of the safe stochastic Model Predictive Control algorithm are demonstrated in a numerical simulation, highlighting the advantages compared to purely robust or stochastic predictive controllers.
|
|
TuCT14 Regular Session, Maya Ballroom VI |
Add to My Program |
Control Applications III |
|
|
Chair: Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Co-Chair: Allgöwer, Frank | University of Stuttgart |
|
16:00-16:20, Paper TuCT14.1 | Add to My Program |
Power-Optimized Processor Slowdown Control |
|
Mann, Ariana | Stanford University |
Bambos, Nicholas | Stanford University |
Keywords: Queueing systems, Stochastic optimal control, Electrical machine control
Abstract: For user-serving computational workloads in data centers (like ML/AI inference tasks with e-commerce applications), managing both the latency experienced by the user and the energy consumption of such workloads is essential. The slowdown quality-of-service metric is increasingly used in real-world systems, as it captures the perceived user delay proportional to the expected job service time. In this work, we introduce a framework to control job slowdown in a data center with multi-class workloads, by adjusting the processor rate. Increasing the processor rate decreases slowdown at the cost of additional power; the goal is to select the processor rate to optimize this tradeoff. We present optimal rate control characterizations for both 1) when the sizes of all jobs in the queue are observed and 2) when only the head-of-line job size and the total queue length are observed. We compare the controls analytically and numerically, and demonstrate how this framework can be employed to achieve desired slowdown targets with minimal power.
|
|
16:20-16:40, Paper TuCT14.2 | Add to My Program |
Deep Neural Network Based Model Predictive Control for Standoff Tracking by a Quadrotor UAV |
|
Dong, Fei | Beihang University |
Li, Xingchen | Tsinghua University |
You, Keyou | Tsinghua University |
Song, Shiji | Tsinghua University |
Keywords: Flight control, Predictive control for nonlinear systems, Neural networks
Abstract: The standoff tracking requires an unmanned aerial vehicle (UAV) to loiter in a circular orbit above a target of interest. To achieve it, we propose a deep neural network (DNN) based model predictive control (MPC) for a quadrotor UAV by taking into account the full UAV model and input constraints. Moreover, we propose a new Lyapunov guidance vector (LGV) with tunable convergence rates to plan a reference trajectory for the MPC. The computation latency on the field-programmable gate array (FPGA) at 200MHz is significantly reduced to a constant of 0.12ms. The hardware-in-the-loop (HIL) experiments verify the effectiveness and robustness of our method.
|
|
16:40-17:00, Paper TuCT14.3 | Add to My Program |
Data-Driven Predictive Disturbance Observer for Quasi Continuum Manipulators |
|
Müller, Daniel | University of Stuttgart |
Feilhauer, Justinus | University of Stuttgart |
Wickert, Jennifer | University of Stuttgart |
Berberich, Julian | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Sawodny, Oliver | University of Stuttgart |
Keywords: Control applications, Robotics, Predictive control for nonlinear systems
Abstract: Deriving accurate and time efficient models for soft robots has proven to be a difficult task. This is mainly due to the behaviour of the soft materials and the interaction with other parts of the robot, resulting in complicated models with many states that require complex sensor concepts. Soft robots such as the Bionic Soft Arm (BSA) are subject to uncertainties. The resulting mismatch between plant and model can be viewed as a disturbance. Disturbance observers significantly improve the control performance of the model-based controllers. However, they assume an underlying model of the disturbance that is usually unknown. To address this problem, we propose a novel Data-Driven Predictive Disturbance Observer (DPDO), which predicts the future disturbance based on past measurements that are updated online and, thereby, enables the model-based controller to achieve zero control error. The approach is first tested in simulation, examining the influence of noise, time delays and typical model uncertainties for the BSA. Afterwards, the algorithm is applied to the real system, demonstrating its practicability.
|
|
17:00-17:20, Paper TuCT14.4 | Add to My Program |
Data-Driven Control of Planar Snake Robot Locomotion |
|
Scarpa, Maria Luisa | Imperial College London |
Nortmann, Benita Alessandra Lucia | Imperial College London |
Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Mylvaganam, Thulasi | Imperial College London |
Keywords: Control applications, Robotics, Time-varying systems
Abstract: A direct data-driven strategy for snake-robot locomotion control is proposed in this paper. The approach leads to a time-varying state feedback controller with robustness guarantees. Instead of relying on exact model knowledge - which is often not available in practice - the proposed control strategy requires only input-state data collected during offline experiments. The efficacy of the proposed strategy is demonstrated via simulations. Notably, by using data to compensate for inaccurate models, the proposed control strategy can lead to significant improvements in closed-loop performance compared to existing (model-based) control strategies, while also eliminating the need for manual tuning of control parameters.
|
|
17:20-17:40, Paper TuCT14.5 | Add to My Program |
Application of a PI-Controller to a 25 MW Floating Wind Turbine |
|
dos Santos, Carlos Renan | Institute for Energy Technology |
Abdelmoteleb, Serag-Eldin | Norwegian University of Science and Technology |
Escalera Mendoza, Alejandra | The University at Texas at Dallas |
Bachynski-Polic, Erin | Norwegian University of Science and Technology |
Griffith, D. Todd | University of Texas at Dallas |
Oggiano, Luca | Institute for Energy Technology |
Keywords: PID control, Nonlinear systems, Power generation
Abstract: Proportional-integral controllers are extensively applied to the pitch control of wind turbines. Despite its simplicity, this control strategy achieves good performance in onshore applications. However, the application of proportional-integral controllers to floating wind turbines faces some challenges, such as negative feedback due to the platform motion. In this sense, the present work proposes a parametric study to assess the influence of tuning parameters of the pitch controller on the performance of a 25 MW floating wind turbine. Effects of including floating feedback in the control strategy are also investigated. Finally, optimum parameters for a proportional-integral pitch controller are defined for the 25 MW wind turbine.
|
|
17:40-18:00, Paper TuCT14.6 | Add to My Program |
Approximate Solutions to the Optimal Flow Problem of Multi-Area Integrated Electrical and Gas Systems |
|
Ananduta, Wicak | TU Delft |
Grammatico, Sergio | Delft Univ. of Tech |
Keywords: Control applications, Smart grid, Optimization algorithms
Abstract: We formulate the optimal flow problem in a multi-area integrated electrical and gas system as a mixed-integer optimization problem by approximating the non-linear gas flows with piece-wise affine functions, thus resulting in a set of mixed-integer linear constraints. For its solution, we propose a novel algorithm that consists in one stage for solving a convexified problem and a second stage for recovering a mixed-integer solution. The latter exploits the gas flow model and requires solving a linear program. We provide an optimality certificate for the computed solution and show the advantages of our algorithm with respect to the state-of-the-art method via numerical simulations.
|
|
TuCT15 Regular Session, Maya Ballroom VII |
Add to My Program |
Geometric Control |
|
|
Chair: Iori, Tomoyuki | Osaka University |
Co-Chair: Clark, William | Cornell University |
|
16:00-16:20, Paper TuCT15.1 | Add to My Program |
Testing Generic Strong Accessibility of Nonlinear Control Systems Via Polynomial (Quadratic) Immersion |
|
Carravetta, Francesco | IASI-CNR |
Sarafrazi, Mohammad Amin | University of Tehran |
Bartosiewicz, Zbigniew | Bialystok University of Technology |
Kotta, Ülle | Tallinn University of Technology |
Keywords: Algebraic/geometric methods, Nonlinear systems
Abstract: Following our previous result on generic strong accessibility (GSA) for nonlinear systems, where we showed that GSA can be checked through a rank test on a 'nonlinear controllability matrix', we show here that the same test can be made further more efficient through a preliminary polynomial system immersion, which is in fact always {it quadratic} and for which we give here a geometric interpretation, that allows to take advantage of polynomial algebra algorithms.
|
|
16:20-16:40, Paper TuCT15.2 | Add to My Program |
Algebraic Approach to Global Finite-Time Stabilization of Multi-Input Polynomial Systems |
|
Takayama, Yoshinari | Kyoto University |
Hoshino, Kenta | Kyoto University |
Ohtsuka, Toshiyuki | Kyoto Univ |
Keywords: Algebraic/geometric methods, Stability of nonlinear systems, Lyapunov methods
Abstract: In this study, we address the problem of finite-time stabilization for multi-input polynomial systems. Specifically, we propose a new constructive method for designing feedback laws that make the closed-loop trajectory converge to the origin in a prescribed time semiglobally in the initial states. Here, we develop this method based on an algebraic approach using parametric syzygy systems in polynomial rings. As a result, the problem can be solved using algebraic operations on polynomials. Moreover, the desired state feedback control laws can be obtained in parameterized form. We also clarify the sufficient condition of finite-time stability with which our method can design the desired feedback laws.
|
|
16:40-17:00, Paper TuCT15.3 | Add to My Program |
Hybrid Geometric Controllers for Fully-Actuated Left-Invariant Systems on Matrix Lie Groups |
|
Akhtar, Adeel | University of California at Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Algebraic/geometric methods, Hybrid systems, Robotics
Abstract: This paper proposes a hybrid geometric control scheme for a system defined on a matrix Lie group in the form of a left-invariant vector field. Our solution to the point stabilization problem is coordinate free (or geometric). Specifically, we propose a hybrid geometric controller that uses a controller from a local class of geometric controllers and an open-loop geometric controller. Our method guarantees that the given point in the manifold is robustly globally asymptotically stable for the closed-loop system when each controller from the local geometric class is combined with the geometric open-loop controller using a hybrid systems framework.
|
|
17:00-17:20, Paper TuCT15.4 | Add to My Program |
Stabilization of Nonholonomic Pendulum Skate by Controlled Lagrangians |
|
Silva Garcia, Jorge | University of Texas at Dallas |
Ohsawa, Tomoki | University of Texas at Dallas |
Keywords: Algebraic/geometric methods, Stability of nonlinear systems, Nonholonomic systems
Abstract: We consider the problem of stabilizing what we call a pendulum skate, a simple model of a figure skater developed by Gzenda and Putkaradze. By exploiting the symmetry of the system as well as taking care of the part of the symmetry broken by the gravity, the equations of motion are given as nonholonomic Euler-Poincaré equation with advected parameters. Our main interest is the stability of the sliding and spinning equilibria of the system. We show that the former is unstable and the latter is stable only under certain conditions. We use the method of Controlled Lagrangians to find a control to stabilize the sliding equilibrium.
|
|
17:20-17:40, Paper TuCT15.5 | Add to My Program |
Lie Algebraic Cost Function Design for Control on Lie Groups |
|
Teng, Sangli | University of Michigan |
Clark, William | Cornell University |
Bloch, Anthony M. | Univ. of Michigan |
Vasudevan, Ramanarayan | University of Michigan |
Ghaffari, Maani | University of Michigan |
Keywords: Algebraic/geometric methods, Optimal control, Robotics
Abstract: This paper presents a control framework on Lie groups by designing the control objective in its Lie algebra. Control on Lie groups is challenging due to its nonlinear nature and difficulties in system parameterization. Existing methods to design the control objective on a Lie group and then derive the gradients for controller design are non-trivial and can result in slow convergence in tracking control. We show that with a proper left-invariant metric, setting the gradients of the cost function as the tracking error in the Lie algebra leads to a quadratic Lyapunov function that enables globally exponential convergence. In the PD control case, we show that our controller can maintain an exponential convergence rate even when the initial error is approaching pi in SO(3). We also show the merit of this proposed framework in trajectory optimization. The proposed cost function enables the iterative Linear Quadratic Regulator (iLQR) to converge much faster than the Differential Dynamic Programming (DDP) with a well-adopted cost function when the initial trajectory is poorly initialized on SO(3).
|
|
17:40-18:00, Paper TuCT15.6 | Add to My Program |
On First Integrals of Hamiltonian System with Holonomic Hamiltonian |
|
Iori, Tomoyuki | Osaka University |
Keywords: Algebraic/geometric methods, Optimal control
Abstract: In this study, the solution of the Hamilton-Jacobi equation (HJE) with holonomic Hamiltonian is investigated in terms of the first integrals of the corresponding Hamiltonian system. Holonomic functions are related to a specific type of partial differential equations called Pfaffian systems, whose solution space can be regarded as a finite-dimensional real vector space. In the finite-dimensional solution space, the existence of first integrals that define a solution of the HJE is characterized by a finite number of algebraic equations for finite-dimensional vectors, which can be easily solved and verified. The derived characterization was illustrated through a numerical example.
|
|
TuCT16 Regular Session, Maya Ballroom VIII |
Add to My Program |
Observers for Linear and Nonlinear Systems |
|
|
Chair: Millan, Pablo | Universidad Loyola Andalucía |
Co-Chair: Iovine, Alessio | CNRS |
|
16:00-16:20, Paper TuCT16.1 | Add to My Program |
Distributed Hybrid Observer with Prescribed Convergence Rate for a Linear Plant Using Multi-Hop Decomposition |
|
Bertollo, Riccardo | Università Di Trento |
Millan, Pablo | Universidad Loyola Andalucía |
Orihuela, Luis | Universidad Loyola Andalucía |
Seuret, Alexandre | University of Sevilla |
Zaccarian, Luca | LAAS-CNRS and University of Trento |
Keywords: Observers for Linear systems, Sensor networks, Hybrid systems
Abstract: We propose a distributed hybrid observer for a sensor network where the plant and local observers run in continuous time and the information exchange among the sensing nodes is sampled-data. Process disturbances, measurement noise and communication noise are considered, and we prove that under some necessary detectability assumptions the observer gains can be tuned to guarantee exponential Input-to-State Stability with a prescribed convergence rate. Simulations illustrate the performance of the proposed observer.
|
|
16:20-16:40, Paper TuCT16.2 | Add to My Program |
Free Energy Principle for the Noise Smoothness Estimation of Linear Systems with Colored Noise |
|
Anil Meera, Ajith | TU Delft |
Wisse, Martijn | Tu Delft |
Keywords: Observers for Linear systems, Filtering, Robotics
Abstract: The free energy principle (FEP) from neuroscience provides a framework called active inference for the joint estimation and control of state space systems, subjected to colored noise. However, the active inference community has been challenged with the critical task of manually tuning the noise smoothness parameter. To solve this problem, we introduce a novel online noise smoothness estimator based on the idea of free energy principle. We mathematically show that our estimator can converge to the free energy optimum during smoothness estimation. Using this formulation, we introduce a joint state and noise smoothness observer design called DEMs. Through rigorous simulations, we show that DEMs outperforms state-of-the-art state observers with least state estimation error. Finally, we provide a proof of concept for DEMs by applying it on a real life robotics problem - state estimation of a quadrotor hovering in wind, demonstrating its practical use.
|
|
16:40-17:00, Paper TuCT16.3 | Add to My Program |
An Exact Robust Hyperexponential Differentiator |
|
Efimov, Denis | Inria |
Polyakov, Andrey | Inria, Univ. Lille |
Zimenko, Konstantin | ITMO University |
Wang, Jian | Hangzhou Dianzi University |
Keywords: Observers for Linear systems, Time-varying systems
Abstract: A simple differentiator is proposed, which is modeled by a second order time-varying linear differential equation. It is shown that for any signal of interest, whose second derivative is an essentially bounded function of time, the differentiation error converges to zero with a hyperexponential rate (faster than any exponential). An implicit discretization scheme of the differentiator is given, which preserves all main properties of the continuous-time counterpart. In addition, the differentiation error is robustly stable with respect to the measurement noise with a linear gain. The efficiency of the suggested differentiator is illustrated through comparison in numeric experiments with popular alternatives.
|
|
17:00-17:20, Paper TuCT16.4 | Add to My Program |
Robust Observer Synthesis for Bilinear Parameter Varying System |
|
Etienne, Lucien | Institut Mine Télécom Lille Douai |
Langueh, Kokou Anani Agbessi | IMT Lille-Douai |
Karkaba, Hassan | IMT Nord Europe |
Iovine, Alessio | CNRS |
Keywords: Observers for nonlinear systems, Linear parameter-varying systems, LMIs
Abstract: In the present paper, sufficient conditions for the synthesis of robust Unknown Input Observers (UIOs) are proposed for a class of nonlinear systems, both in continuous and discrete time. The considered class is general enough to contain bilinear systems as well as Linear Parameter-Varying (LPV) systems with no parameter variation on the output matrix. The proposed conditions are numerically tractable, and are expressed in terms of Linear Matrix Inequalities (LMIs) or Linear Matrix Equalities (LMEs). Furthermore, the gain synthesis problem is shown to be formulated as a convex optimisation one, directly enabling the minimization of the influence of noisy measurements and model uncertainty. Simulations on energy systems are provided to illustrate the proposed methodologies.
|
|
17:20-17:40, Paper TuCT16.5 | Add to My Program |
mathcal{H}_{infty}-Optimal Interval Observer Synthesis for Uncertain Nonlinear Dynamical Systems Via Mixed-Monotone Decompositions |
|
Khajenejad, Mohammad | University of California, San Diego |
Yong, Sze Zheng | Northeastern University |
Keywords: Observers for nonlinear systems, Uncertain systems, Estimation
Abstract: This paper introduces a novel mathcal{H}_{infty}-optimal interval observer synthesis for bounded-error/uncertain locally Lipschitz nonlinear continuous-time (CT) and discrete-time (DT) systems with noisy nonlinear observations. Specifically, using mixed-monotone decompositions, the proposed observer is correct by construction, i.e., the interval estimates readily frame the true states without additional constraints or procedures. In addition, we provide sufficient conditions for input-to-state (ISS) stability of the proposed observer and for minimizing the mathcal{H}_{infty} gain of the framer error system in the form of semi-definite programs (SDPs) with Linear Matrix Inequalities (LMIs) constraints. Finally, we compare the performance of the proposed mathcal{H}_{infty}-optimal interval observers with some benchmark CT and DT interval observers.
|
|
17:40-18:00, Paper TuCT16.6 | Add to My Program |
An Interval Observer for Continuous-Time Persidskii Systems |
|
Efimov, Denis | Inria |
Polyakov, Andrey | Inria, Univ. Lille |
Ping, Xubin | Xidian University |
Keywords: Nonlinear systems, Observers for nonlinear systems, Biological systems
Abstract: The paper deals with design of interval observers for a class of generalized Persidskii systems. The conditions of stability for the suggested nonlinear interval observer are formulated using linear matrix inequalities. The nonnegativity of this class of models is investigated for nonlinearities satisfying the incremental passivity conditions. The efficiency of the proposed observer is demonstrated on a Lotka-Volterra model.
|
|
TuCT17 Invited Session, Acapulco |
Add to My Program |
Analysis and Design Methods for Biomolecular Networks |
|
|
Chair: Gupta, Ankit | ETH Zürich |
Co-Chair: Khammash, Mustafa H. | ETH Zurich |
Organizer: Gupta, Ankit | ETH Zürich |
Organizer: Khammash, Mustafa H. | ETH Zurich |
|
16:00-16:20, Paper TuCT17.1 | Add to My Program |
Identifying Competition Phenotypes in Synthetic Biochemical Circuits |
|
Ali Al-Radhawi, Muhammad | Northeastern University |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Sontag, Eduardo | Northeastern University |
Keywords: Biomolecular systems, Identification, Systems biology
Abstract: Synthetic gene circuits require cellular resources, which are often limited. This leads to competition for resources by different genes, which alter a synthetic genetic circuit's behavior. However, the manner in which competition impacts behavior depends on the identity of the ``bottleneck'' resource which might be difficult to discern from input-output data. In this paper, we aim at classifying the mathematical structures of resource competition in biochemical circuits. We find that some competition structures can be distinguished by their response to different competitors or resource levels. Specifically, we show that some response curves are always linear, convex, or concave. Furthermore, high levels of certain resources protect the behavior from low competition, while others do not. We also show that competition phenotypes respond differently to various interventions. Such differences can be used to eliminate candidate competition mechanisms when constructing models based on given data. On the other hand, we show that different networks can display mathematically equivalent competition phenotypes.
|
|
16:20-16:40, Paper TuCT17.2 | Add to My Program |
Continuous and Sampled-Data H_infty Control of Linear Stochastic Reaction Networks (I) |
|
Briat, Corentin | ETH Zürich |
Khammash, Mustafa H. | ETH Zurich |
Keywords: Robust control, Markov processes, Biomolecular systems
Abstract: The effective control of cellular processes in living cells requires that models of such processes take into account the discrete and stochastic nature of chemical reaction networks that underlie the dynamics inside the cell. Stochastic biochemical reaction networks have emerged as a powerful paradigm for realistic mathematical models of cellular phenomena. While classical control algorithm like PI and PID controllers have been recently proposed as controllers in living cells, methods for designing more advanced optimal controllers remain elusive. Here we formulate the H_infty control problem for linear stochastic reaction networks and provide an approach for its solution using Dynamic Programming—one that gives rise to a non-standard Riccati differential equation. We next address the H_infty sampled-data control problem for stochastic networks, and present a solution based on Hybrid Dynamic Programming that requires the solution of coupled Riccati differential and difference equations.
|
|
16:40-17:00, Paper TuCT17.3 | Add to My Program |
Sequestration-Based Feedback Control of Blood Platelet Levels (I) |
|
Dey, Supravat | Department of Electrical and Computer Engineering, University O |
Vargas-Garcia, Cesar A. | Fundación Universitaria Konrad Lorenz |
Singh, Abhyudai | University of Delaware |
Keywords: Cellular dynamics, Systems biology, Biomolecular systems
Abstract: Nonlinear feedback controllers are ubiquitous features of biological systems at different scales. A key motif arising in these systems is sequestration-based feedback. As a physiological example of this type of feedback architecture, platelets (specialized cells involved in blood clotting) differentiate from stem cells, and this process is activated by a protein called Thrombopoietin (TPO). Platelets actively sequester and degrade TPO, creating negative feedback whereby any depletion of platelets increases the levels of freely available TPO that upregulates platelet production. We show similar examples of sequestration-based feedback in intracellular biomolecular circuits involved in heat-shock response and microRNA regulation. Our systematic analysis of this feedback motif reveals that platelets-induced degradation of TPO is critical in enhancing system robustness to external disturbances. In contrast, reversible sequestration of TPO without degradation results in poor robustness to disturbances. We develop exact analytical results quantifying the limits to which the sensitivity to disturbances can be attenuated by sequestration-based feedback. In summary, our systematic analysis highlights design principles for enhancing the robustness of sequestration-based feedback mechanisms to external disturbances with applications to both physiological and cellular systems.
|
|
17:00-17:20, Paper TuCT17.4 | Add to My Program |
Identifiability of Linear Noise Approximation Models of Chemical Reaction Networks from Stationary Distributions (I) |
|
Grunberg, Theodore | Massachusetts Institute of Technology |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Biomolecular systems, Systems biology, Identification
Abstract: Biomolecular systems can often be modeled by chemical reaction networks with unknown parameters. In many cases, the available data is constituted of samples from the stationary distribution, wherein each sample is given by a cell in a population. In this work, we develop a framework to assess identifiability of parameters in such a situation. Working with the Linear Noise Approximation (LNA) we give an algebraic formulation of identifiability and use it to certify identifiability with Hilbert's Nullstellensatz. We include applications to particular biomolecular systems, focusing on the identifiability of a sequestration-based motif and of a feedback arrangement based on it.
|
|
17:20-17:40, Paper TuCT17.5 | Add to My Program |
Padé SSA: A Frequency Domain Method for Estimating the Dynamics of Stochastic Reaction Networks (I) |
|
Gupta, Ankit | ETH Zürich |
Khammash, Mustafa H. | ETH Zurich |
Keywords: Biological systems, Stochastic systems, Numerical algorithms
Abstract: Dynamic analysis and control of living cells relies on mathematical representations of cellular processes that are themselves modelled as biomolecular reaction networks. Stochastic models for biomolecular reaction networks are commonly employed for analysing intracellular networks having constituent species with low-copy numbers. In such models, the main object of interest is the probability distribution of the state vector of molecular counts which evolves according to a set of ordinary differential equations (ODEs) called the Chemical Master Equation (CME). Typically this set is very large or even infinite, making the CME practically unsolvable in most cases. Hence the outputs based on the CME solution, like the statistical moments of various state components, are generally estimated with Monte Carlo (MC) procedures by simulating the underlying Markov chain with Gillespie's Stochastic Simulation Algorithm (SSA). However to obtain statistical reliability of the MC estimators, often a large number of simulated trajectories are required, which imposes an exorbitant computational burden. The aim of this paper is to present a frequency domain method for mitigating this burden by exploiting a small number of simulated trajectories to robustly estimate certain intrinsic eigenvalues of the stochastic dynamics. This method enables reliable estimation of time-varying outputs of interest from a small number of sampled trajectories and this estimation can be carried out for several initial states without requiring additional simulations. We demonstrate our method with a couple of examples.
|
|
17:40-18:00, Paper TuCT17.6 | Add to My Program |
Graphical Construction of Stability Certificates for Biomolecular Interaction Networks |
|
Ali Al-Radhawi, Muhammad | Northeastern University |
Keywords: Biomolecular systems, Lyapunov methods, Uncertain systems
Abstract: We study robust stability of Biological Interaction Networks (BINs) by constructing stability certificates in the form of Robust Lyapunov Functions (RLFs) using graphical methods. Previous works have mainly constructed RLFs by utilizing linear programs or iterative algorithms. Such algorithms become tedious or computationally infeasible for large networks. In addition, they do not identify motifs or graph modifications that maintain stability. In this work, we provide several graphical criteria for constructing stability certificates. We characterize a set of stability-preserving graph modifications which include, in particular, the enzymatic catalysis motif. Hence, stability of a class of arbitrarily large networks can be examined by simple visual inspection. We present applications of this technique to Post-Translational Modification (PTM) cycles, Ribosome Flow Model (RFM), and T-cell kinetic proofreading.
|
| |