|
WeA01 |
Auditorium |
Forty Plus Years of Model Reduction and Still Learning |
Tutorial Session |
Chair: Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
Co-Chair: Sandberg, Henrik | KTH Royal Institute of Technology |
Organizer: Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
Organizer: Sandberg, Henrik | KTH Royal Institute of Technology |
Organizer: Scherpen, Jacquelien M.A. | University of Groningen |
Organizer: Astolfi, Alessandro | Imperial College & Univ. of Rome |
|
10:00-10:20, Paper WeA01.1 | |
>Forty Plus Years of Model Reduction and Still Learning (I) |
|
Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
Sandberg, Henrik | KTH Royal Institute of Technology |
Scherpen, Jacquelien M.A. | University of Groningen |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Bhattacharjee, Debraj | Imperial College London |
Kawano, Yu | Hiroshima University |
Moreschini, Alessio | Imperial College London |
Keywords: Reduced order modeling, Linear systems, Nonlinear systems
Abstract: The approximation of complex dynamical systems models by reduced order models has been considered an important research problem for over four decades, not only in the field of control, but also in economics, image processing, circuit analysis, statistical mechanics, aircraft structures, and more recently in hybrid energy systems, to name just a small sample of fields. In this paper, we provide an overview of the development of balanced truncation and interpolation approaches for reducing linear and non-linear dynamical systems models for the purpose of control analysis and design.
|
|
10:20-10:40, Paper WeA01.2 | |
Balanced Truncation of Linear Systems: Fundamentals and Extensions (I) |
|
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Reduced order modeling, Linear systems, Nonlinear systems
Abstract: Balanced truncation model reduction provides a lower-order approximation of a given linear differential or difference equation model, by neglecting states that have relatively low effect on the overall model transfer function, specifically states that are both weakly controllable and weakly observable. Such states are determined by an analysis of the system Hankel singular values. In this talk we will overview the balanced truncation model reduction process for linear time-invariant systems, and note extensions to time-varying, multi-dimensional and interconnected systems. Starting with a given LTI state space model for a system, we will derive the similarity transformation that results in an equivalent balanced realization; this transformation is based on solutions to the system controllability and observability Lyapunov equations. Using the balanced state space system realization, we will show that truncating the states that are both weakly controllable and observable, namely those associated with small system Hankel singular values, results in an a priori guaranteed error bound given by twice the sum of these truncated Hankel singular values. Extensions and generalizations to time-varying and interconnected systems will be outlined in the context of the classic LTI approach, and some examples will be discussed to conclude the presentation.
|
|
10:40-11:20, Paper WeA01.3 | |
Data-Based Model Reduction for Non-Linear Systems Based on Differential Balancing (I) |
|
Scherpen, Jacquelien M.A. | University of Groningen |
Kawano, Yu | Hiroshima University |
Keywords: Reduced order modeling, Linear systems, Nonlinear systems
Abstract: In this talk we will present an extension of the standard balancing theory for nonlinear systems which is based on an analysis around equilibrium points, to the contraction framework. This extension offers computational advantages. We provide definitions for controllability and observability functions and their differential versions which can be used for simultaneous diagonalization procedures, providing a measure for importance of the states. Generalised balancing methods based on these developments provide a computationally interesting approach. In addition, we propose a data-based model reduction method based on differential balancing for nonlinear systems whose input vector fields are constants by utilizing their variational systems. The difference between controllability and reachability for the variational system is exploited for computational reasons. For a fixed state trajectory, it is possible to compute the values of the differential Gramians by using impulse and initial state responses of the variational system. Therefore, differential balanced truncation is doable along state trajectories without solving nonlinear partial differential equations. Examples illustrate the methods.
|
|
11:20-12:00, Paper WeA01.4 | |
Stability and Signal Generator Agnostic Moment Matching (I) |
|
Bhattacharjee, Debraj | Imperial College London |
Moreschini, Alessio | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Reduced order modeling, Linear systems, Nonlinear systems
Abstract: Moment matching is a model reduction technique that con- structs reduced-order models retaining moments of a given high-order system. These moments are associated with the steady-state output response (if any) of the nonlinear system, which is interconnected in an open-loop fashion with a signal generator. In the first part of this presentation, we introduce a data-driven procedure for computing reduced-order models directly from input-output data generated by an agnostic signal generator. This approach avoids the need to explicitly compute the moments of the system to be reduced. Instead, the moments are directly identified from the output of the high-order system, and the resulting reduced-order models asymptotically match the moments of the high-order system. However, there is still a caveat with the “standard” open-loop configuration: the high-order system must be internally stable for moment matching to be achieved. In the second part of this presentation, we revisit the notion of moment matching by introducing the concept of closed-loop interpolation. This notion relies upon the construction of a particular signal generator, which is output feedback interconnected with the plant to be interpolated. Unlike the open-loop configuration, in a closed-loop interpolation scheme, the internal stability condition of the high-order system can be relaxed. The existence of a family of models that parameterize all systems achieving moment matching in a closed-loop fashion will be finally presented.
|
|
WeA02 |
Amber 1 |
Learning, Optimization, and Game Theory I |
Invited Session |
Chair: Doan, Thinh T. | University of Texas at Austin |
Co-Chair: Sayin, Muhammed Omer | Bilkent University |
Organizer: Doan, Thinh T. | University of Texas at Austin |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Sayin, Muhammed Omer | Bilkent University |
|
10:00-10:20, Paper WeA02.1 | |
>A Multi-Player Potential Game Approach for Sensor Network Localization with Noisy Measurements (I) |
|
Xu, Gehui | Imperial College London |
Chen, Guanpu | KTH Royal Institute of Technology |
Fidan, Baris | University of Waterloo |
Hong, Yiguang | Tongji University |
Qi, Hongsheng | AMSS, Chinese Academy of Sciences |
Parisini, Thomas | Imperial C., Aalborg U. & Univ. of Trieste |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Sensor networks, Game theory
Abstract: Sensor network localization (SNL) is a challenging problem due to its inherent non-convexity and the effects of noise in inter-node ranging measurements and anchor node position. We formulate a non-convex SNL problem as a multi-player non-convex potential game and investigate the existence and uniqueness of a Nash equilibrium (NE) in both the ideal setting without measurement noise and the practical setting with measurement noise. We first show that the NE exists and is unique in the noiseless case, and corresponds to the precise network localization. Then, we study the SNL for the case with errors affecting the anchor node position and the inter-node distance measurements. Specifically, we establish that in case these errors are sufficiently small, the NE exists and is unique. It is shown that the NE is an approximate solution to the SNL problem, and that the position errors can be quantified accordingly. Based on these findings, we apply the results to case studies involving only inter-node distance measurement errors and only anchor position information inaccuracies.
|
|
10:20-10:40, Paper WeA02.2 | |
>Online Mechanism Design for Differentially Private Data Acquisition (I) |
|
Anjarlekar, Ameya | University of Illinois at Urbana Champaign |
Etesami, Rasoul | University of Illinois at Urbana-Champaign |
Srikant, R | Univ of Illinois, Urbana-Champaign |
Keywords: Optimization, Statistical learning, Game theory
Abstract: We address a problem involving a buyer seeking to train a logistic regression model by acquiring data from privacy-sensitive sellers. Along with compensating the sellers for their data, the buyer provides differential privacy guarantees to them where the payments depend on the privacy guarantees. In addition, each seller has a different privacy sensitivity associated with their data, which is the cost per unit of loss of privacy. The buyer transacts sequentially with the sellers, wherein the seller will disclose their privacy sensitivity, and the buyer immediately provides a payment and guarantees differential privacy. After receiving the payment, the seller provides their data to the buyer. The buyer's goal is to optimize a weighted combination of test loss and payments, i.e., achieve a tradeoff between getting a good ML model and limiting its payments. Additionally, the buyer must design the payments and differential privacy guarantees in an online fashion. Further, the online problem is history-dependent, which adds to the challenge. Consequently, we design a payment mechanism that ensures incentive compatibility and individual rationality and is asymptotically optimal. Additionally, we also provide experimental results to validate our findings.
|
|
10:40-11:00, Paper WeA02.3 | |
>Input-Output Data-Driven Sensor Selection (I) |
|
Fotiadis, Filippos | The University of Texas at Austin |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Data driven control, Uncertain systems, Optimization
Abstract: In this paper, we design a learning-based sensor selection procedure for an unknown cyber-physical system. In particular, a set of sensors that maximize a metric of observability of the system is chosen, but without using knowledge of the system's dynamics. The metric of observability is related to the notion of the mathcal{H}_2 norm, which quantifies the strength of the sensor signals generated under a given control input excitation. It is shown that the evaluation of this metric boils down to solving a set of model-based Lyapunov equations which, however, is a task that cannot be carried out directly since the system is unknown. Nevertheless, we tackle this by expressing the metric solely with respect to input-output data, and we use the new expression to choose the best sensors for the system in a model-free manner, and in polynomial time. Simulations are performed to demonstrate the efficiency of the proposed approach.
|
|
11:00-11:20, Paper WeA02.4 | |
>Adaptive Optimal Output-Feedback Control of Discrete-Time Systems Based on Hybrid Iteration (I) |
|
Liu, Sitong | Northeastern University |
Gao, Weinan | Northeastern University |
Jiang, Zhong-Ping | New York University |
Keywords: Reinforcement learning, Optimal control, Adaptive control
Abstract: In this paper, a novel adaptive dynamic programming (ADP) algorithm, named output-feedback hybrid iteration (HI), is proposed to address the adaptive optimal control problem of discrete-time linear systems. The proposed output-feedback HI strategy learns the optimal control policy through two phases. First, a novel data-driven value-iteration (VI) scheme is employed to learn an admissible output-feedback control policy using the input/output data without relying on the knowledge of system matrices. Then, with the obtained admissible control policy, the optimal output feedback-control policy is approximated with an accelerated convergence rate through output-feedback policy iteration (PI). Online input/output data are utilized to reconstruct the full state of the system and integrated into the new output-feedback HI algorithm. Simulation results are presented and demonstrate the efficacy and practicality of the proposed output-feedback HI approach in comparison with traditional PI and VI techniques.
|
|
11:20-11:40, Paper WeA02.5 | |
>Learning-Based Quantum Control for Optimal Pure State Manipulation |
|
Chen, Anthony Siming | University of Manchester |
Herrmann, Guido | University of Manchester |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Vijayan, Jayadev | University of Manchester |
Keywords: Optimal control, Quantum information and control, Adaptive control
Abstract: In this paper, we propose an adaptive critic learning approach for two classes of optimal pure state transition problems for closed quantum systems: i) when the target state is an eigenstate, and ii) when the target state is a superposition pure state. First, we describe a finite-dimensional quantum system based on the Schrodinger equation with the action of control fields. Then, we consider the target state to be i) an eigenstate of the internal Hamiltonian and ii) an arbitrary pure state via a unitary transformation. Meanwhile, the quantum state manipulation is formulated as an optimal control problem for solving the complex partial differential Hamilton-Jacobi-Bellman (HJB) equation, of which the control solution is found using continuous-time Q-learning of an adaptive critic. Finally, numerical simulation for a spin-1/2 particle system demonstrates the effectiveness of the proposed approach.
|
|
11:40-12:00, Paper WeA02.6 | |
>Control-Oriented Identification Via Stochastic Optimization |
|
Anderson, Sean | University of California Santa Barbara |
Hespanha, Joao P. | Univ. of California, Santa Barbara |
Keywords: Identification for control, Data driven control, Optimization
Abstract: Data-driven control benefits from rich datasets, but constructing such datasets becomes challenging when gathering data is limited. We consider an offline experiment design approach to gathering data where we design a control input to collect data that will most improve the performance of a feedback controller. We consider a setting in which the dynamics are modeled parametrically and formulate a control-oriented identification procedure by way of a stochastic optimization problem that explicitly optimizes the post-experiment closed-loop control performance. We propose solving this problem via stochastic gradient descent by first constructing a gradient estimator of our stochastic objective. We then focus on a particular setting with linear dynamics and quadratic objective, which benefits from a numerically tractable gradient estimator. We show our formulation numerically outperforms an A- and L-optimal experiment design approach, illustrate the effects of scaling the system state and input dimensions, and compare against a recent robust dual control approach.
|
|
WeA03 |
Amber 2 |
Analysis and Control of Network Dynamics in the Social and Life Sciences |
Invited Session |
Chair: Ye, Mengbin | Centre for Optimisation and Decision Science, Curtin University |
Co-Chair: Leonard, Naomi Ehrich | Princeton University |
Organizer: Ye, Mengbin | Curtin University |
Organizer: Zino, Lorenzo | Politecnico Di Torino |
Organizer: Cao, Ming | University of Groningen |
Organizer: Leonard, Naomi Ehrich | Princeton University |
|
10:00-10:20, Paper WeA03.1 | |
>Evolution of Cooperation among Unequals with Game-Environment Feedback (I) |
|
Yu, Xiaotong | Shanghai University |
Liang, Haili | Shanghai University |
Rong, Zhihai | Donghua University |
Ren, Xiaoqiang | Shanghai University |
Wang, Xiaofan | Shanghai University |
Cao, Ming | University of Groningen |
Keywords: Game theory, Markov processes, Stochastic systems
Abstract: In social activities, conflicts arising from the clash between individual self-interest and group interests frequently result in social dilemmas. The theory of direct reciprocity suggests that repeated interactions can alleviate this dilemma, but it often assumes homogeneity among individuals. However, the widespread heterogeneity in the real world often diminishes cooperation. Fortunately, the dynamics of bidirectional feedback between game mechanisms and environmental conditions can exert a positive influence on the evolutionary progression of cooperative behavior. This paper investigates how heterogeneous individuals affect the evolution of cooperation with environmental feedback. Despite extensive literature showing that endowment inequality among individuals tends to diminish cooperation, our findings suggest that the implementation of appropriate environmental feedback mechanisms can facilitate the development of cooperation even in contexts characterized by higher levels of inequality. Furthermore, our results demonstrate that the implementation of suitably tailored environmental feedback mechanisms can significantly augment the propensity of homogeneous individuals for cooperative behavior. These results provide insights for decision-makers in formulating strategies related to fairness and the sharing of public goods.
|
|
10:20-10:40, Paper WeA03.2 | |
>Control Strategies for Recommendation Systems in Social Networks |
|
Sprenger, Ben | ETH Zurich |
De Pasquale, Giulia | ETH Zurich |
Soloperto, Raffaele | ETH Zurich |
Lygeros, John | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Network analysis and control, Linear systems, Networked control systems
Abstract: A closed-loop control model to analyze the impact of recommendation systems on opinion dynamics within social networks is introduced. The core contribution is the development and formalization of model-free and model-based approaches to recommendation system design, integrating the dynamics of social interactions within networks via an extension of the Friedkin-Johnsen (FJ) model. Comparative analysis and numerical simulations demonstrate the effectiveness of the proposed control strategies in maximizing user engagement and their potential for influencing opinion formation processes.
|
|
10:40-11:00, Paper WeA03.3 | |
>Sparse Dynamic Network Reconstruction through L1-Regularization of a Lyapunov Equation (I) |
|
Belaustegui, Ian Xul | Princeton University |
Ordorica Arango, Marcela | Princeton University |
Rossi-Pool, Román | Universidad Nacional Autónoma De México |
Leonard, Naomi Ehrich | Princeton University |
Franci, Alessio | University of Liege |
Keywords: Network analysis and control, Optimization, Stochastic systems
Abstract: An important problem in many areas of science is that of recovering interaction networks from high-dimensional time-series of many interacting dynamical processes. A common approach is to use the elements of the correlation matrix or its inverse as proxies of the interaction strengths, but the reconstructed networks are necessarily undirected. Transfer entropy methods have been proposed to reconstruct directed networks, but the reconstructed network lacks information about interaction strengths. We propose a network reconstruction method that inherits the best of the two approaches by reconstructing a directed weighted network from noisy data under the assumption that the network is sparse and the dynamics are governed by a linear (or weakly-nonlinear) stochastic dynamical system. The two steps of our method are i) constructing an (infinite) family of candidate networks by solving the covariance matrix Lyapunov equation for the state matrix and ii) using L_1-regularization to select a sparse solution. We further show how to use prior information on the (non)existence of a few directed edges to dramatically improve the quality of the reconstruction.
|
|
11:00-11:20, Paper WeA03.4 | |
>On Controlling a Coevolutionary Model of Actions and Opinions (I) |
|
Raineri, Roberta | Politecnico Di Torino |
Como, Giacomo | Politecnico Di Torino |
Fagnani, Fabio | Politecnico Di Torino |
Ye, Mengbin | Curtin University |
Zino, Lorenzo | Politecnico Di Torino |
Keywords: Network analysis and control
Abstract: We deal with a control problem for a complex social network in which each agent has an action and an opinion, evolving according to a coevolutionary model. In particular, we consider a scenario in which a committed minority ---a set of stubborn nodes--- aims to steer a population, initially at a consensus, to a different consensus state. Our study focuses on determining the conditions under which such a goal is reached, and, ultimately how to optimally define a minimal committed minority. First, we derive a general monotone convergence result for the controlled coevolutionary model, under mild and general assumptions on the agents' revision sequence. Then, we build on our theoretical result to establish a methodological approach to investigate the research problem.
|
|
11:20-11:40, Paper WeA03.5 | |
>Dynamic Curing and Network Design in SIS Epidemic Processes (I) |
|
Yi, Yuhao | Sichuan University |
Shan, Liren | Toyota Technological Institute at Chicago |
Pare, Philip E. | Purdue University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Network analysis and control, Optimization, Markov processes
Abstract: This paper studies efficient algorithms for dynamic curing policies and the corresponding network design problems to guarantee the fast extinction of epidemic spread in a susceptible-infected-susceptible (SIS) model. We consider a Markov process-based SIS epidemic model. We provide a computationally efficient curing algorithm based on the curing policy proposed by Drakopoulos, Ozdaglar, and Tsitsiklis (2014). We provide approximation guarantees on the curing budget of the proposed dynamic curing algorithm. When the total infection rate is high, the original curing policy includes a waiting period in which no measure is taken to mitigate the spread until the rate slows down. To avoid the waiting period, we study network design problems to reduce the total infection rate by deleting edges or reducing the weight of edges. Then the curing processes become continuous since the total infection rate is restricted by network design. To this end, we provide algorithms with provable guarantees for the considered network design problems. In summary, the proposed curing and network design algorithms together provide an effective and computationally efficient approach that mitigates SIS epidemic spread in networks.
|
|
11:40-12:00, Paper WeA03.6 | |
>On Final Opinions of the Friedkin-Johnsen Model Over Random Graphs with Partially Stubborn Community (I) |
|
Wang, Lingfei | KTH Royal Institute of Technology |
Xing, Yu | KTH Royal Institute of Technology |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Network analysis and control, Behavioural systems, Agents-based systems
Abstract: This paper studies the formation of final opinions for the Friedkin-Johnsen (FJ) model with a community of partially stubborn agents. The underlying network of the FJ model is symmetric and generated from a random graph model, in which each link is added independently from a Bernoulli distribution. It is shown that the final opinions of the FJ model will concentrate around those of an FJ model over the expected graph as the network size grows, on the condition that the stubborn agents are well connected to other agents. Probability bounds are proposed for the distance between these two final opinion vectors, respectively for the cases where there exist non-stubborn agents or not. Numerical experiments are provided to illustrate the theoretical findings. The simulation shows that, in presence of non-stubborn agents, the link probability between the stubborn and the non-stubborn communities affect the distance between the two final opinion vectors significantly. Additionally, if all agents are stubborn, the opinion distance decreases with the agent stubbornness.
|
|
WeA04 |
Amber 3 |
Encrypted Control and Optimization |
Invited Session |
Chair: Schulze Darup, Moritz | TU Dortmund University |
Co-Chair: Kim, Junsoo | Seoul National University of Science and Technology |
Organizer: Schulze Darup, Moritz | TU Dortmund University |
Organizer: Alexandru, Andreea B. | Duality Technologies |
Organizer: Kim, Junsoo | Seoul National University of Science and Technology |
|
10:00-10:20, Paper WeA04.1 | |
>A Self-Triggered Control Watermarking Scheme for Detecting Replay Attacks (I) |
|
Wolleswinkel, Bart | Delft University of Technology |
Ferrari, Riccardo M.G. | Delft University of Technology |
Mazo Jr., Manuel | Delft University of Technology |
Keywords: Networked control systems, Attack Detection, Cyber-Physical Security
Abstract: We propose a novel watermarking scheme by modifying a self-triggered control (STC) policy, aimed at detecting replay attacks for linear time-invariant (LTI) systems. We show that by employing non-deterministic early triggering of the STC mechanism, replay attacks can be detected by a modified χ2 detector which takes into account the aperiodic nature of the inter-sample times. Specifically, we consider the case where a periodic reference signal is tracked, which makes these systems vulnerable to replay attacks. The proposed approach is modular and can be retrofitted to legacy systems. An approach for designing an online optimal early triggering mechanism is provided. This is validated through an illustrative numerical example in which we compare our method to scenarios employing both additive and multiplicative watermarking.
|
|
10:20-10:40, Paper WeA04.2 | |
>Approximated Explicit NMPC Via Reinforcement Learning for Homomorphically Encrypted Process Control (I) |
|
Dzurková, Diana | Slovak University of Technology in Bratislava |
Valábek, Patrik | Slovak University of Technology in Bratislava |
Mészáros, Olivér | Slovak University of Technology in Bratislava |
Kalúz, Martin | Slovak University of Technology in Bratislava |
Klauco, Martin | Slovak University of Technology in Bratislava |
Keywords: Control Systems Privacy, Optimal control, Reinforcement learning
Abstract: This research proposes a novel approach to generating explicit, nearly-optimal control policies in the form of neural networks with a structure that allows further mathematical operations within homomorphic encryption frameworks. The novelty of this paper also lies in presenting a reinforcement learning pathway to train the explicit control law without the necessity of prior model knowledge. A Deep Deterministic Policy Gradient algorithm is used to train the neural network, with the objective function adopted from nonlinear model predictive control. This paper presents a generalized methodology to train the control policy and evaluate it in a homomorphic encryption setup. Particular results are presented based on a software-in-the-loop simulation setup, where specifics like communication delays and computational overheads are considered.
|
|
10:40-11:00, Paper WeA04.3 | |
>Encrypted System Identification As-A-Service Via Reliable Encrypted Matrix Inversion (I) |
|
Adamek, Janis | TU Dortmund |
Binfet, Philipp | TU Dortmund University |
Schlüter, Nils | TU Dortmund University |
Schulze Darup, Moritz | TU Dortmund University |
Keywords: Control Systems Privacy, Cyber-Physical Security, Identification for control
Abstract: Encrypted computation opens up promising avenues across a plethora of application domains, including machine learning, health-care, finance, and control. Arithmetic homomorphic encryption, in particular, is a natural fit for cloud-based computational services. However, computations are essentially limited to polynomial circuits, while comparisons, transcendental functions, and iterative algorithms are notoriously hard to realize. Against this background, the paper presents an encrypted system identification service enabled by a reliable encrypted solution to least squares problems. More precisely, we devise an iterative algorithm for matrix inversion and present reliable initializations as well as certificates for the achieved accuracy without compromising the privacy of provided I/O-data. The effectiveness of the approach is illustrated with three popular identification tasks.
|
|
11:00-11:20, Paper WeA04.4 | |
>Bootstrapping Guarantees: Stability and Performance Analysis for Dynamic Encrypted Control |
|
Schlor, Sebastian | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Networked control systems, Communication networks
Abstract: Encrypted dynamic controllers that operate for an unlimited time have been a challenging research subject. The fundamental difficulty is the accumulation of errors and scaling factors in the internal state during operation. Bootstrapping, a technique commonly employed in fully homomorphic cryptosystems, can be used to avoid overflows in the controller state but can potentially introduce significant numerical errors. This paper analyzes dynamic encrypted control with explicit consideration of bootstrapping. By recognizing the bootstrapping errors occurring in the controller’s state as an uncertainty in the robust control framework, we can provide stability and performance guarantees for the whole encrypted control system. Further, the conservatism of the stability and performance test is reduced by using a lifted version of the control system.
|
|
11:20-11:40, Paper WeA04.5 | |
>Authentication of Multi-Agent System with Verifiable Computation and Distributed Aggregation (I) |
|
Lee, Seungbeom | Seoul National University |
Kim, Dongwoo | Seoul National University |
Chung, Heewon | Desilo |
Kim, Junsoo | Seoul National University of Science and Technology |
Shim, Hyungbo | Seoul National University |
Keywords: Cyber-Physical Security, Attack Detection, Computer/Network Security
Abstract: We propose an authentication scheme for a multi-agent system over integers, based on verifiable computation primitives. The naive approach, employing Freivalds' algorithm in centralized way, faces several challenges. Specifically, unreliability of the network introduces the risk of information tampering by other agents. To this end, it requires locally updating and merging the proofs of the individual states in a distributed manner. Our proposed method addresses these issues with assuming presence of a leader agent who is responsible for validating the correctness of all the states of the agents. This can be achieved by a distributed protocol that aggregates proofs from the individual agents, relying on the well-known knowledge-of-exponent assumption. By using this distributed protocol, computational complexity and communication cost are reduced compared to centralized verification. Furthermore, we incorporate a clustering technique to minimize storage requirements.
|
|
11:40-12:00, Paper WeA04.6 | |
>Privacy-Preserving Average Consensus Algorithm with Beaver Triple |
|
Wang, Peng | Shanghai Jiao Tong University |
Lu, Yang | Lancaster University |
Lian, Jianming | Oak Ridge National Laboratory |
Pan, Lulu | Shanghai Jiao Tong University |
Shao, Haibin | Shanghai Jiao Tong University |
Li, Ning | Shanghai Jiao Tong University |
Keywords: Agents-based systems, Control Systems Privacy
Abstract: A privacy-preserving average consensus algorithm is designed based on the Beaver triple technique against passive adversaries. The Beaver triple technique is integrated into a restructure of the discrete-time average consensus algorithm to preserve the privacy of initial values of agents in a multi-agent system. The performance of the algorithm is theoretically analyzed.
|
|
WeA05 |
Amber 4 |
Optimization and Control in Energy System Management |
Invited Session |
Chair: Jiang, Yuning | EPFL |
Co-Chair: Heer, Philipp | Empa |
Organizer: Jiang, Yuning | EPFL |
Organizer: Dai, Xinliang | Karlsruhe Institute of Technology |
Organizer: Guo, Yi | Swiss Federial Laboratories for Materials Science and Technology |
Organizer: Heer, Philipp | Empa |
Organizer: Hagenmeyer, Veit | Karlsruhe Institute of Technology (KIT) |
Organizer: Jones, Colin N. | EPFL |
|
10:00-10:20, Paper WeA05.1 | |
>Mitigating Short-Sightedness of MPC for District Heating Networks Using Dual Dynamic Programming (I) |
|
Sibeijn, Max | Delft University of Technology |
Khosravi, Mohammad | Delft University of Technology |
Keviczky, Tamas | Delft University of Technology |
Keywords: Predictive control for nonlinear systems, Energy systems, Optimization algorithms
Abstract: In this paper, we use dual dynamic programming to address the myopic nature of MPC for scheduling of district heating networks by designing value functions that can approximate the effects of time-varying elements on the objective function beyond the initial prediction horizon. To this end, we formulate the control problem as a two-level MPC. More precisely, in the first-level, we consider a short-horizon nonlinear MPC equipped with a terminal cost approximating the value function. Subsequently, a long-horizon linear MPC is solved in the second-level to establish a lower bound on the terminal cost function from the first-level, thereby improving the value function approximation. Specifically, we consider scheduling of thermal and hydraulic components within district heating networks. Our numerical example demonstrates that our method can anticipate demand variations beyond the prediction horizon while maintaining computational efficiency.
|
|
10:20-10:40, Paper WeA05.2 | |
>Reverse Kron Reduction of Three-Phase Radial Network (I) |
|
Low, Steven | California Institute of Technology |
Keywords: Power systems, Identification
Abstract: We consider the problem of identifying the admittance matrix of a three-phase radial network from voltage and current measurements at a subset of nodes. These measurements are used to estimate a virtual network represented by the Kron reduction (Schur complement) of the full admittance matrix. We focus on recovering exactly the full admittance matrix from its Kron reduction, i.e., computing the inverse of Schur complement. The key idea is to decompose Kron reduction into a sequence of iterations that maintains an invariance structure, and exploit this structure to reverse each step of the iterative Kron reduction.
|
|
10:40-11:00, Paper WeA05.3 | |
>Optimal Control of Grid-Interfacing Inverters with Current Magnitude Limits (I) |
|
Joswig-Jones, Trager | University of Washington |
Zhang, Baosen | University of Washington |
Keywords: Power systems, Constrained control, Optimal control
Abstract: Grid-interfacing inverters act as the interface between renewable resources and the electric grid, and have the potential to offer fast and programmable responses compared to synchronous generators. With this flexibility there has been significant research efforts into determining the best way to control these inverters. An important nonlinear constraint in inverter control is a limit on the magnitude of the current, stemming from the need to protect semiconductor devices. Existing approaches either simply saturate a controller that is designed for unconstrained systems, or assume small perturbations and linearize a saturated system. These approaches can lead to stability issues or limit the control actions to be too conservative. In this paper, we directly focus on a nonlinear system that explicitly accounts for the saturation of the current magnitude. We use a Lyapunov stability approach to determine a stability condition for the system, guaranteeing that a class of controllers would be stabilizing if they satisfy a simple semidefinite programming condition. With this condition we fit a linear-feedback controller by sampling the output of (offline) model predictive control problems. This learned controller has improved performances with existing designs.
|
|
11:00-11:20, Paper WeA05.4 | |
>Real-Time Pricing Mechanism for V2G Using Distributed Bilevel Optimization (I) |
|
You, Jiawei | ShanghaiTech University |
Dai, Xinliang | Karlsruhe Institute of Technology |
Jiang, Yuning | EPFL |
Yin, Haoyu | Washington University in St. Louis |
Shi, Yuanming | ShanghaiTech University |
Jones, Colin N. | EPFL |
Keywords: Energy systems, Optimization algorithms
Abstract: This paper explores the impact of the burgeoning electric vehicle (EV) presence on distribution grid operations, highlighting the challenges they present to conventional pricing strategies due to their dual role as power consumers and suppliers, coupled with their energy storage capabilities. We propose an advanced real-time pricing model for the electricity market, employing a novel distributed bilevel optimization framework. This framework distinguishes between the distribution system operator (DSO) at the upper level and the EVs at the lower level, each aiming to optimize profit margins. The optimization includes power flow constraints at the upper level to ensure efficient operation within safe system limits, while model predictive control (MPC) is used to optimize lower-level EV responses. Additionally, we provide a rigorous convergence analysis of the proposed bilevel optimization method. Detailed convergence studies and simulation results demonstrate the effectiveness and superiority of the proposed algorithm.
|
|
11:20-11:40, Paper WeA05.5 | |
>Balancing Fairness and Efficiency in Energy Resource Allocations |
|
Li, Jiayi | University of Washington, Seattle |
Motoki, Matthew | University of Washington |
Zhang, Baosen | University of Washington |
Keywords: Energy systems, Optimization, Intelligent systems
Abstract: Bringing fairness to energy resource allocation remains a challenge, due to the complexity of system structures and economic interdependencies among users and system operators' decision-making. The rise of distributed energy resources has introduced more diverse heterogeneous user groups, surpassing the capabilities of traditional efficiency-oriented allocation schemes. Without explicitly bringing fairness to user-system interaction, this disparity often leads to disproportionate payments for certain user groups due to their utility formats or group sizes. Our paper addresses this challenge by formalizing the problem of fair energy resource allocation and introducing the framework for aggregators. This framework enables optimal fairness-efficiency trade-offs by selecting appropriate objectives in a principled way. By jointly optimizing over the total resources to allocate and individual allocations, our approach reveals optimized allocation schemes that lie on the Pareto front, balancing fairness and efficiency in resource allocation strategies.
|
|
11:40-12:00, Paper WeA05.6 | |
>A Dual Bisection Approach to Economic Dispatch of Generators with Prohibited Operating Zones |
|
Manieri, Lucrezia | Politecnico Di Milano |
Falsone, Alessandro | Politecnico Di Milano |
Prandini, Maria | Politecnico Di Milano |
Keywords: Energy systems, Optimization algorithms, Large-scale systems
Abstract: We address economic dispatch of power generators with prohibited operating zones. The problem can be formulated as an optimization program with a quadratic cost, non-convex local operating constraints, and a scalar quadratic coupling constraint accounting for load demand and power losses. A duality-based resolution approach integrating a bisection iterative scheme is proposed to reduce computational complexity while guaranteeing finite time feasibility of the primal iterates and a cost improvement throughout iterations. Extensive simulations show that the approach outperforms state-of-the-art competitors and consistently computes feasible primal solutions with a close-to-zero optimality gap at a low computational cost.
|
|
WeA06 |
Amber 5 |
Agent-Based Systems I |
Regular Session |
Chair: Selmic, Rastko | Concordia University |
Co-Chair: Astolfi, Daniele | Cnrs - Lagepp |
|
10:00-10:20, Paper WeA06.1 | |
>Zonotope-Based Cyberattack Detection for Leader-Following Multi-Agent Systems |
|
Rahimifard, Mahshid | Concordia University |
Selmic, Rastko | Concordia University |
Keywords: Agents-based systems, Attack Detection
Abstract: This paper proposes a novel simultaneous leader-following consensus control and cyberattack detection method based on zonotopes for linear multi-agent systems with unknown but bounded noises. A free-weighting matrix for each agent is computed by solving an optimization problem to reduce the conservativeness of the state estimation, thus reducing the computational complexity. The radius of the intersection zonotope is guaranteed to be limited by a bound that we found in the paper. We provide rigorous conditions that guarantee the solution of the leader-following consensus protocol. In the proposed method, the knowledge of the type of cyberattack is not required. Simulation results demonstrate the proposed method's effectiveness in detecting attacks and achieving leader-following consensus control in the presence of a false data injection attack on the sensor data.
|
|
10:20-10:40, Paper WeA06.2 | |
>The Geometry of Cyclical Social Trends |
|
Chazelle, Bernard | Princeton |
Karntikoon, Kritkorn | Princeton University |
Nogler, Jakob | ETH Zurich |
Keywords: Agents-based systems, Communication networks, Time-varying systems
Abstract: We investigate the emergence of periodic behavior in opinion dynamics and its underlying geometry. For this, we use a bounded-confidence model with contrarian agents in a convolution social network. This means that agents adapt their opinions by interacting with their neighbors in a time-varying social network. Being contrarian, the agents are kept from reaching consensus. This is the key feature that allows the emergence of cyclical trends. We show that the systems either converge to nonconsensual equilibrium or are attracted to periodic or quasi-periodic orbits. We bound the dimension of the attractors and the period of cyclical trends. We exhibit instances where each orbit is dense and uniformly distributed within its attractor. We also investigate the case of randomly changing social networks.
|
|
10:40-11:00, Paper WeA06.3 | |
>Adaptively Distributed Nash Equilibrium Seeking for Nonlinear Heterogeneous Networked Systems |
|
Feng, Zhi | Beihang University |
Dong, Xiwang | Beihang University |
Hu, Guoqiang | Nanyang Technological University, Singapore |
Lu, Jinhu | Beihang University |
Keywords: Agents-based systems, Cooperative control, Adaptive control
Abstract: This paper studies an adaptively distributed Nash Equilibrium (NE) seeking problem in noncooperative games for nonlinear heterogeneous multi-agent systems (MASs) subject to uncertain system dynamics and communications. In contrast to existing related works that consider players with low-order dynamics, the goal is to make the outputs of nonlinear MASs converge towards the NE in a distributed and adaptive manner. By leveraging the consensus-based designs and pseudo-gradient strategies, novel adaptively distributed NE seeking algorithms that are independent of known information on the topology's algebraic connectivity and the pseudo-gradient's Lipschitz and monotone constants, are developed to seek the NE of games with a global asymptotic convergence. It is proved that the developed algorithms are capable of steering the outputs of agents towards the NE asymptotically. The examples and simulation results are presented to verify the proposed designs' effectiveness.
|
|
11:00-11:20, Paper WeA06.4 | |
>An Emergent Synchronization Property in a Set of Non-Competing Multi-Agent Systems |
|
D'Alfonso, Luigi | Università Della Calabria |
Fedele, Giuseppe | University of Calabria |
Keywords: Agents-based systems, Cooperative control, Distributed control
Abstract: This research investigates an emergent synchronization property within a set of non-competing multi-agent systems. The primary objective is to develop behavioral mathematical models that enable multiple groups of agents to adhere to predetermined paths while concurrently achieving consensus among corresponding members within each group. In this pursuit, the inherent coordination challenge is addressed while acknowledging absence of communication channels between the different agent groups but requiring a centralized and omniscient authority that assigns suitable and prescribed reference curves to track for each swarm. As a further assumption, the described emergent synchronization property is achieved by exploiting a complete communication graph, with an all-to-all communication topology in each multi-agent system. The proposed solution exploits a diffeomorphic mapping to translate the task into two independent and simpler problems: the first one relates to reaching the curve of interest and the second to moving on it while maintaining the synchronization condition between couple of agents.
|
|
11:20-11:40, Paper WeA06.5 | |
>Fixed-Time Consensus Tracking of Nonlinear Multi-Agent Systems under DoS Attacks |
|
Wang, Xinhe | Southeast University |
Wen, Guanghui | Southeast University |
Zheng, Wei Xing | Western Sydney University |
Keywords: Agents-based systems, Cooperative control, Resilient Control Systems
Abstract: This paper studies the fixed-time consensus tracking problem of nonlinear multi-agent systems, where communication links are subjected to denial-of-service (DoS) attacks. The DoS attacks make the communication networks switch arbitrarily and therefore the network connectivity may be broken, which poses a serious challenge to the consensus tracking problem within the fixed-time framework. To deal with this issue, a switching-based fixed-time control strategy is proposed. Critical conditions for fixed-time consensus tracking of nonlinear multi-agent systems under DoS attacks are developed. In particular, the adverse impact of DoS attacks on the settling-time can be clearly reflected by the fixed-time strategy derived in this paper. Finally, numerical examples are provided to validate the theoretical analysis.
|
|
11:40-12:00, Paper WeA06.6 | |
>Global Consensus for Heterogeneous Saturated Multi-Agent Systems Via Sampled-Data Control |
|
Qian, Juan | Nanjing University of Posts and Telecommunications |
Wang, Xiaoling | Nanjing University of Posts and Telecommunications |
Astolfi, Daniele | Cnrs - Lagepp |
Su, Housheng | Huazhong University of Science and Technology |
Jiang, Guo-Ping | Nanjing University of Posts and Telecommunications |
Keywords: Agents-based systems, Cooperative control, Sampled-data control
Abstract: This paper addresses the global consensus problem for multi-input multi-output saturated systems within a sampled-data framework, aiming to advance global consensus, manage heterogeneous actuator saturation across components, and preserve distributed characteristics by using sampled-data feedback. We propose a distributed control algorithm that incorporates a redesigned saturation function, represented as decentralized dynamic saturation levels. These levels for each agent’s dimensions are autonomously updated through an adaptive strategy, which mitigates heterogeneous saturation by carefully selecting constant and time-varying saturation parameters in it. Lyapunov analysis proves that global consensus can be achieved under the proposed control law, provided the sampling periods of all agents remain below a calculated threshold. An example is given to demonstrate the effectiveness of this approach.
|
|
WeA07 |
Amber 6 |
Game Theory VI |
Regular Session |
Chair: Fabiani, Filippo | IMT School for Advanced Studies Lucca |
Co-Chair: Parasnis, Rohit | Massachusetts Institute of Technology |
|
10:00-10:20, Paper WeA07.1 | |
>Should Social Media Platforms Differentiate Their Content Moderation Policies for Minority Communities in the Presence of Competition? |
|
Sasaki, So | The University of Illinois at Urbana-Champaign |
Langbort, Cedric | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Control of networks, Emerging control applications
Abstract: Social networks are intricate ecosystems comprising diverse communities with distinct laws, customs, and preferences. Social media platforms sometimes set content moderation policies tailored to individual communities but other times impose a uniform policy across multiple communities, leaving smaller communities dissatisfied. In this article, we propose a game-theoretical model that considers platform competition for users with diverse preferences within a complex network. Users allocate their time to platforms, while platforms optimize their moderation policies to maximize user engagement. Our result provides the methodology of optimizing the moderation policies. The aggregated user preference and the user engagement are determined by the community imbalance indicator, which encapsulates the network structure and user affiliations. We conducted simulations for a social network with over 60,000 users across seven country-based communities to identify the aggregated optimal moderation policy and assess the impact of differentiating the policy for minority countries. This research contributes to a deeper understanding of the relationship between moderation policies and platform competition.
|
|
10:20-10:40, Paper WeA07.2 | |
>Generalized Nash Equilibrium Problems under Partial-Decision Information with Biased Agents |
|
Franci, Barbara | Maastricht University |
Fabiani, Filippo | IMT School for Advanced Studies Lucca |
Zino, Lorenzo | Politecnico Di Torino |
Keywords: Game theory, Network analysis and control
Abstract: We consider generalized Nash equilibrium problems under a partial-decision information regime, in which each agent typically reconstructs the opponents’ strategies through a linear averaging dynamics. In contrast, we consider a state-dependent, nonlinear susceptibility term within the communication mechanism, thereby modelling possible biases on the part of agents in processing information. By including such a term in a relaxed forward-backward iteration scheme, we design a distributed algorithm possessing convergence guarantees to a generalized Nash equilibrium (GNE). Simulation results illustrate how the susceptibility term affects the GNE computation.
|
|
10:40-11:00, Paper WeA07.3 | |
>A Stackelberg Game Model of Flocking |
|
Wang, Chenlan | University of Michigan, Ann Arbor |
Moharrami, Mehrdad | University of Iowa |
Liu, Mingyan | University of Michigan |
Keywords: Game theory, Modeling
Abstract: We study a Stackelberg game to examine how two agents determine to cooperate while competing with each other. Each selects an arrival time to a destination, the earlier one fetching a higher reward. There is, however, an inherent penalty in arriving too early as well as a risk in traveling alone. This gives rise to the possibility of the agents cooperating by traveling together while competing for the reward. In our prior work [1] we studied this problem as a sequential game among a set of N competing agents in continuous time, and defined the formation of a group traveling together as arriving at exactly the same time. In the present study, we relax this definition to allow arrival times within a small window, and study a 2-agent game in both continuous and discrete time, referred to as the flock formation game. We derive and examine the properties of the subgame perfect equilibrium (SPE) of this game.
|
|
11:00-11:20, Paper WeA07.4 | |
>Optimal Interventions in Coupled-Activity Network Games: Application to Sustainable Forestry |
|
Parasnis, Rohit | Massachusetts Institute of Technology |
Amin, Saurabh | Massachusetts Institute of Technology |
Keywords: Game theory, Network analysis and control
Abstract: We consider the problem of promoting sustainability in production forests wherein a given number of strategic entities are authorized to own or manage concession regions. These entities harvest agricultural commodities and sell them in a market. We study optimal price-shaping in a coupled- activity network game model in which the concession owners (agents) engage in two activities: (a) the sustainable activity of producing a commodity that does not interfere with protected forest resources, and (b) the unsustainable activity of infringing into protected regions to expand their agricultural footprint. We characterize the policy that maximally suppresses the aggregate unsustainable activity under budget constraints. Our analysis provides novel insights on the agents’ influence on each other due to intra-activity and cross-activity network effects. We also identify a measure of node centrality that resembles the Bonacich-Katz centrality and helps us determine pricing incentives that minimize the aggregate unsustainable activity over the set of all feasible policies.
|
|
11:20-11:40, Paper WeA07.5 | |
>Constrained Multi-Cluster Game: Distributed Nash Equilibrium Seeking Over Directed Graphs |
|
Nguyen, Duong | Arizona State University |
Bianchi, Mattia | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Nguyen, Duong | Arizona State University |
Nedich, Angelia | Arizona State University |
Keywords: Game theory, Distributed control, Networked control systems
Abstract: Motivated by the complex dynamics of cooperative and competitive interactions within networked agent systems, multi-cluster games provide a framework for modeling the interconnected goals of self-interested clusters of agents. For this setup, the existing literature does not provide comprehensive gradient-based solutions that simultaneously address constraint sets and directed communication networks-both of which are essential for many practical applications-due to the complexities posed by the projection and the imbalanced mixing matrix. To address this gap, this paper proposes a distributed Nash equilibrium seeking algorithm that integrates consensus-based methods and gradient-tracking techniques, utilizing row-stochastic weight matrices for inter-cluster communication and column-stochastic weight matrices for intra-cluster communication. To handle constraints, we introduce an averaging procedure to control errors introduced by projection methods within the gradient-tracking procedure. We establish the linear convergence of the proposed algorithm, focusing on the contraction property of the optimality gap. The efficacy of the algorithm is demonstrated through its application to microgrid energy management.
|
|
11:40-12:00, Paper WeA07.6 | |
>Communication-Efficient and Differentially-Private Distributed Nash Equilibrium Seeking with Linear Convergence |
|
Chen, Xiaomeng | Hong Kong University of Science and Technology |
Huo, Wei | HKUST |
Ding, Kemi | Southern University of Science and Technology |
Dey, Subhrakanti | Uppsala University |
Shi, Ling | Hong Kong University of Science and Technology |
Keywords: Game theory, Networked control systems
Abstract: The distributed computation of a Nash equilibrium (NE) for non-cooperative games is gaining increased attention in recent years. Due to the nature of distributed systems, privacy and communication efficiency are two critical concerns. Traditional approaches often address these critical concerns in isolation. This work introduces a unified framework, named CDP-NES, designed to improve communication efficiency in the privacy-preserving NE seeking algorithm for distributed non-cooperative games over directed graphs. Leveraging both general compression operators and the noise adding mechanism, CDP-NES perturbs local states with Laplace noise and applies difference compression prior to their exchange among neighbors. We prove that CDP-NES not only achieves linear convergence in games with restricted monotone mappings but also guarantees -differential privacy, addressing privacy and communication efficiency concerns simultaneously. Finally, numerical simulations are provided to illustrate the effectiveness of the proposed algorithm.
|
|
WeA08 |
Amber 7 |
Optimization I |
Regular Session |
Chair: Nedich, Angelia | Arizona State University |
Co-Chair: Notarstefano, Giuseppe | University of Bologna |
|
10:00-10:20, Paper WeA08.1 | |
>Distributionally Robust Resource Allocation with Trust-Aided Parametric Information Fusion |
|
Guo, Yanru | University of Michigan at Ann Arbor |
Zhou, Bo | University of Michigan |
Jiang, Ruiwei | University of Michigan |
Yang, Xi Jessie | University of Michigan |
Shen, Siqian | University of Michigan |
Keywords: Optimization, Robust control, Uncertain systems
Abstract: Reference information plays an essential role for making decisions under uncertainty, yet may vary across multiple data sources. In this paper, we study resource allocation in stochastic dynamic environments, where we perform information fusion based on trust of different data sources, to design an ambiguity set for attaining distributionally robust resource allocation solutions. We dynamically update the trust parameter to simulate the decision maker's trust change based on losses caused by mis-specified reference information. We show an equivalent tractable linear programming reformulation of the distributionally robust optimization model and demonstrate the performance in a wildfire suppression application, where we use drone and satellite data to estimate the needs of resources in different regions. We demonstrate how our methods can improve trust and decision accuracy. The computational time grows linearly in the number of data sources and problem sizes.
|
|
10:20-10:40, Paper WeA08.2 | |
>Zeroth-Order Katyusha: An Accelerated Derivative-Free Method for Composite Convex Optimization |
|
Zhang, Silan | Peking University |
Tang, Yujie | Peking University |
Keywords: Optimization, Optimization algorithms
Abstract: We investigate accelerated zeroth-order algorithms for smooth composite convex optimization problems. While for unconstrained optimization, existing methods that merge 2-point zeroth-order gradient estimators with first-order frameworks usually lead to satisfactory performance, for constrained/composite problems, there is still a gap in the complexity bound that is related to the non-vanishing variance of the 2-point gradient estimator near an optimal point. To bridge this gap, we propose the Zeroth-Order Loopless Katyusha (ZO-L-Katyusha) algorithm, leveraging the variance reduction as well as acceleration techniques from the first-order loopless Katyusha algorithm. We show that ZO-L-Katyusha is able to achieve accelerated linear convergence for compositve smooth and strongly convex problems, and has the same oracle complexity as the unconstrained case. Moreover, the number of function queries to construct a zeroth-order gradient estimator in ZO-L-Katyusha can be made to be O(1) on average. These results suggest that ZO-L-Katyusha provides a promising approach towards bridging the gap in the complexity bound for zeroth-order composite optimization.
|
|
10:40-11:00, Paper WeA08.3 | |
>Sample-Based Trust Region Dynamics in Contextual Global Optimization |
|
Sabug, Lorenzo Jr. | Politecnico Di Milano |
Fagiano, Lorenzo | Politecnico Di Milano |
Ruiz, Fredy | Politecnico Di Milano |
Keywords: Optimization, Optimization algorithms, Adaptive systems
Abstract: The problem of contextual black-box optimization is treated, in which a generally non-convex scalar objective depends not only on the decision variables, but also on uncontrollable, observable context variables. Assuming Lipschitz continuity of the objective function with respect to its arguments, the proposed approach builds a Set Membership model from observed samples. According to the observed context, a submodel that relates the objective to the decision variables is isolated, and used by a zeroth-order technique to pick the appropriate decision variable for sampling. A novel trust region dynamic is introduced, associating and changing its size with samples instead of iterations. Such a technique makes the resulting contextual optimization algorithm more flexible with respect to the context behavior, whether it is changing smoothly, at random, or a combination of both. Benchmark tests and a case study demonstrate the efficacy of the proposed method in considering context information for global optimization.
|
|
11:00-11:20, Paper WeA08.4 | |
>Nonconvex Big-Data Optimization Via System Theory: A Block-Wise Incremental Gradient Method |
|
Carnevale, Guido | University of Bologna |
Notarnicola, Ivano | University of Bologna |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Optimization, Optimization algorithms, Large-scale systems
Abstract: In this paper, we propose and analyze Block-wise Incremental Gradient with Averaging (BIG-A), i.e., a novel optimization algorithm tailored for large-scale and big-data nonconvex optimization problems with a composite cost function. At each iteration, the algorithm uses a block of a single function gradient to properly update auxiliary variables providing a proxy of a descent direction. We interpret BIG-A as a dynamical system arising from the interconnection between a fast, time-varying scheme and a slow, time-invariant one. This interpretation allows us to prove the convergence properties by using system theory results relying on the LaSalle-Yoshizawa invariance principle and singular perturbations. The solution estimate sequence generated by BIG-A is shown to converge toward the set of stationary points of the problem, which is not assumed to be convex nor satisfying the Polyak-Lojasiewicz condition. If strong convexity is also assumed, linear convergence toward the unique optimal solution is established. Finally, numerical simulations confirm the theoretical findings.
|
|
11:20-11:40, Paper WeA08.5 | |
>Asymptotic Behavior of the Nonautonomous Arrow–Hurwicz Differential System |
|
Niederlaender, Simon | Siemens AG |
Keywords: Optimization, Optimization algorithms, Lyapunov methods
Abstract: In a real Hilbert space setting, we investigate the asymptotic behavior of the solutions of the nonautonomous Arrow–Hurwicz differential system. We show that its solutions weakly converge in average towards a saddle point of some limiting closed convex-concave bifunction provided that the associated gap function vanishes sufficiently fast. If, in addition, the limiting saddle function verifies a strict convexity-concavity condition, we find that the solutions of the nonautonomous Arrow–Hurwicz differential system not only converge in an ergodic sense, but in fact admit a weak limit. Numerical experiments illustrate our theoretical findings.
|
|
11:40-12:00, Paper WeA08.6 | |
>Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-Max Optimization |
|
Zheng, Tianqi | Johns Hopkins University |
Loizou, Nicolas | Johns Hopkins University |
You, Pengcheng | Peking University |
Mallada, Enrique | Johns Hopkins University |
Keywords: Optimization, Optimization algorithms, Lyapunov methods
Abstract: Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings. To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized saddle function that does not strictly introduce additional convexity/concavity. We theoretically show the linear convergence of DGDA in the bilinear and strongly convex-strongly concave settings and assess its performance by comparing DGDA with other methods such as GDA, Extra-Gradient (EG), and Optimistic GDA. Our findings demonstrate that DGDA surpasses these methods, achieving superior convergence rates. We support our claims with two numerical examples that showcase DGDA's effectiveness in solving saddle point problems.
|
|
WeA09 |
Amber 8 |
Predictive Control for Nonlinear Systems I |
Regular Session |
Chair: Faulwasser, Timm | TU Hamburg |
Co-Chair: Mesbah, Ali | University of California, Berkeley |
|
10:00-10:20, Paper WeA09.1 | |
>Safe and Stable Closed-Loop Learning for Neural-Network-Supported Model Predictive Control |
|
Hirt, Sebastian | TU Darmstadt |
Pfefferkorn, Maik | Technical University of Darmstadt |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for nonlinear systems, Machine learning, Constrained control
Abstract: Safe learning of control policies remains challenging, both in optimal control and reinforcement learning. In this article, we consider safe learning of parametrized predictive controllers that operate with incomplete information about the underlying process. To this end, we employ Bayesian optimization for learning the best parameters from closed-loop data. Our method focuses on the system's overall long-term performance in closed-loop while keeping it safe and stable. Specifically, we parametrize the stage cost function of an MPC using a feedforward neural network. This allows for a high degree of flexibility, enabling the system to achieve a better closed-loop performance with respect to a superordinate measure. However, this flexibility also necessitates safety measures, especially with respect to closed-loop stability. To this end, we explicitly incorporated stability information in the Bayesian-optimization-based learning procedure, thereby achieving rigorous probabilistic safety guarantees. The proposed approach is illustrated using a numeric example.
|
|
10:20-10:40, Paper WeA09.2 | |
>Exploiting Manifold Turnpikes in Model Predictive Path-Following without Terminal Constraints |
|
Itani, Mohammad | TU Hamburg |
Faulwasser, Timm | Hamburg University of Technology |
Keywords: Predictive control for nonlinear systems, Mechatronics
Abstract: Model predictive path-following control (MPFC) considers geometric reference paths in output spaces without pre-assigned timing information. It combines trajectory generation and tracking into one receding-horizon optimal control problem. In this paper, we discuss MPFC without terminal constraints from a geometric point of view. Specifically, we consider implicitly parameterized paths and the recently introduced notion of manifold turnpikes to propose sufficient conditions for practical convergence of the system output towards a neighborhood of the reference path. We draw upon a simulation example to demonstrate the efficacy of the proposed scheme.
|
|
10:40-11:00, Paper WeA09.3 | |
>Imitation Learning of MPC with Neural Networks: Error Guarantees and Sparsification |
|
Alsmeier, Hendrik | TU Darmstadt |
Savchenko, Anton | Technical University of Darmstadt |
Theiner, Lukas | TU Darmstadt |
Mesbah, Ali | University of California, Berkeley |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for nonlinear systems, Machine learning, Constrained control
Abstract: This paper presents a framework for bounding the approximation error in imitation model predictive controllers utilizing neural networks. Leveraging the Lipschitz properties of these neural networks, we derive a bound that guides dataset design to ensure the approximation error remains at chosen limits. We discuss how this method can be used to design a stable neural network controller with performance guarantees employing existing robust model predictive control approaches for data generation. Additionally, we introduce a training adjustment, which is based on the sensitivities of the optimization problem and reduces dataset density requirements based on the derived bounds. We verify that the proposed augmentation results in improvements to the network's predictive capabilities and a reduction of the Lipschitz constant. Moreover, on a simulated inverted pendulum problem, we show that the approach results in a closer match of the closed-loop behavior between the imitation and the original model predictive controller.
|
|
11:00-11:20, Paper WeA09.4 | |
>Bilevel Optimization for Real-Time Control with Application to Locomotion Gait Generation |
|
Olkin, Zachary | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Predictive control for nonlinear systems, Robotics, Optimization algorithms
Abstract: Model Predictive Control (MPC) is a common tool for the control of nonlinear, real-world systems, such as legged robots. However, solving MPC quickly enough to enable its use in real-time is often challenging. One common solution is given by real-time iterations, which does not solve the MPC problem to convergence, but rather close enough to give an approximate solution. In this paper, we extend this idea to a bilevel control framework where a "high-level" optimization program modifies a controller parameter of a "low-level" MPC problem which generates the control inputs and desired state trajectory. We propose an algorithm to iterate on this bilevel program in real-time and provide conditions for its convergence and improvements in stability. We then demonstrate the efficacy of this algorithm by applying it to a quadrupedal robot where the high-level problem optimizes a contact schedule in real-time. We show through simulation that the algorithm can yield improvements in disturbance rejection and optimality, while creating qualitatively new gaits.
|
|
11:20-11:40, Paper WeA09.5 | |
>A Two-Stage Variable-Horizon Economic Model Predictive Control without Terminal Constraint |
|
Xiong, Weiliang | Zhejiang University of Technology, University of Wollongong |
Xia, Xiangjun | University of Wollongong |
Du, Haiping | University of Wollongong |
He, Defeng | Zhejiang University of Technology |
Keywords: Predictive control for nonlinear systems, Stability of nonlinear systems, Nonlinear systems
Abstract: This paper proposes a variable-horizon economic model predictive control scheme without terminal constraint for nonlinear systems. By observing the relationship between the optimal prediction trajectory and the terminal set, the horizon can be adjusted simply through two user-designed sequences. The approach does not require any dissipativity or local controllability assumptions with respect to (w.r.t) the closed loop system under economic optimization. The feasibility is rigorously proven, and the closed-loop system admits asymptotic stability under nondecreasing horizon. Finally, a non-dissipative Continuous Stirred Tank Reactor (CSTR) experiment verified the effectiveness and merits.
|
|
11:40-12:00, Paper WeA09.6 | |
>Stochastic Model Predictive Control with Probabilistic Control Barrier Functions and Smooth Sample-Based Approximation |
|
Wang, Ye | The University of Melbourne |
Shen, Xun | Osaka University |
Qian, Hongyu | Harbin Institute of Technology |
Keywords: Predictive control for nonlinear systems, Stochastic systems
Abstract: Addressing chance constraints in stochastic model predictive control (MPC) poses a significant challenge, especially in investigating recursive feasibility in the closed-loop system. We propose a novel stochastic MPC for nonlinear systems subject to stochastic uncertainties. We incorporate probabilistic control barrier functions (PCBFs) in the proposed stochastic MPC formulation. A notable merit of employing PCBFs in stochastic MPC is the alleviation from dependence on parameterization with a feedback control law. Furthermore, we present a smooth sample-based approximation approach to the stochastic MPC, which enhances the tractability of the proposed stochastic MPC formulation. Finally, we provide a numerical example to exhibit the efficacy of the proposed stochastic MPC.
|
|
WeA10 |
Brown 1 |
Biological Systems: Modelling, Analysis and Algorithms |
Invited Session |
Chair: Katz, Rami | University of Trento |
Co-Chair: Palumbo, Pasquale | University of Milano-Bicocca |
Organizer: Borri, Alessandro | CNR-IASI |
Organizer: Katz, Rami | University of Trento |
Organizer: Giordano, Giulia | University of Trento |
Organizer: Palumbo, Pasquale | University of Milano-Bicocca |
Organizer: Singh, Abhyudai | University of Delaware |
|
10:00-10:20, Paper WeA10.1 | |
>Modeling the Development of Drug Resistance During Chemotherapy Treatment (I) |
|
Pompa, Marcello | CNR-IASI |
Drexler, Dániel András | Obuda University |
Panunzi, Simona | Consiglio Nazionale Delle Ricerche |
Gombos, Balázs | Research Center for Natural Sciences |
Füredi, András | Research Center for Natural Sciences |
Szakács, Gergely | Medical University of Vienna |
Kovács, Levente | Obuda University |
De Gaetano, Andrea | CNR |
Keywords: Systems biology, Biomedical, Healthcare and medical systems
Abstract: The development of drug resistance is a major obstacle in cancer treatment, leading to frequent tumor recurrence. Upon being first diagnosed, many tumors tend to respond well to chemotherapy, only to later display resistance to previously effective drugs. Understanding the mechanisms behind this emergent drug resistance is crucial for designing more effective treatment protocols. In this study, we present a novel mathematical model that investigates the competition between heterogeneous tumor cell populations with varying metabolic strategies and drug resistance profiles. Rather than hypothesizing metabolic plasticity (cells switching metabolic pathways), our model hypothesizes the coexistence of two (or more) cell populations, of which one has greater growth potential and greater drug sensitivity than the other. Considering this metabolic heterogeneity, together with potentially flexible, environment-induced genetic variations, would help us better characterize variable tumor response to treatment. The model parameters are here estimated using data from mice tumors treated with pegylated liposomal doxorubicin. Simulations using the model reveal that an initial period of drug sensitivity, during which the tumor shrinks, can mask the expansion of a small subpopulation of metabolically less efficient but drug-resistant cells. Over time, the resistant subpopulation outcompetes the originally expanding, metabolically more efficient subpopulation, leading to the observed resurgence of the tumor in a drug-resistant form.
|
|
10:20-10:40, Paper WeA10.2 | |
>Exponential Growth Conditions for a Coarse-Grain Model of Ribosome Synthesis (I) |
|
Palumbo, Pasquale | University of Milano-Bicocca |
Papa, Federico | IASI-CNR |
Busti, Stefano | University of Milano-Bicocca, Department of Biotechnology and Bi |
Vanoni, Marco | Università Di Milano Bicocca |
Keywords: Systems biology, Biological systems, Modeling
Abstract: Ribosomes are fundamental cellular structures, playing a crucial role in the synthesis of proteins, the building blocks of life. They are composed of a large and a small subunit, each consisting of a combination of ribosomal RNA (rRNA) and numerous associated proteins. These subunits work collaboratively to translate messenger RNA (mRNA) into proteins through a process known as translation. Understanding the synthesis of ribosomal components is crucial for elucidating their function and role in cellular processes, including the setting of the cell growth rate. Recently, a coarse-grain mathematical model has been presented, accounting for the two distinct sub-units, their common precursor and the overall amount of proteins and ribosomes. The feedback of the ribosome-over-protein ratio on the ribosome synthesis was as well addressed. This note carries out the qualitative analysis on such a model, providing a condition ensuring the exponential growth of the cell as an emergent property of the network of the molecular players under investigation. The proposed analysis overcomes the drawbacks of a previous work, which provided a set of restrictive sufficient conditions for exponential growth; indeed, here a single necessary and sufficient condition is given, which is also easy to check from the knowledge of the model parameters. Besides, a sensitivity analysis is proposed, detailing which parameters the emergent growth rate and the ribosome-over-protein ratio are more sensitive to.
|
|
10:40-11:00, Paper WeA10.3 | |
>Analysis of Feedback Schemes in Proliferating Stem Cells (I) |
|
Borri, Alessandro | CNR-IASI |
Palumbo, Pasquale | University of Milano-Bicocca |
Singh, Abhyudai | University of Delaware |
Keywords: Systems biology, Stochastic systems, Biological systems
Abstract: Stem cells have emerged as a pivotal player in biomedical research, offering unprecedented opportunities for regenerative medicine, disease modeling, and drug discovery. These undifferentiated cells possess the unique ability to self-renew and differentiate into various specialized cell types, making them invaluable in tissue repair and regeneration. This note analyzes a stochastic model aiming at unraveling the control machinery providing the proper ratio of stem-versus-differentiated cells, in agreement with a couple of feedback schemes regulating cell proliferation. What emerges from the closed-loop schemes analysis is that in order to have an asymptotically stable, nontrivial accumulation of differentiated cells a negative feedback on cell proliferation is required. Numerical simulations in the stochastic setting validate the approximate theoretical results achieved to relate how random fluctuations propagate with respect to variations of model parameters.
|
|
11:00-11:20, Paper WeA10.4 | |
>A Necessary Condition for Non-Monotonic Dose Response, with an Application to a Kinetic Proofreading Model (I) |
|
Yu, Polly Y. | Harvard University |
Sontag, Eduardo | Northeastern University |
Keywords: Systems biology, Biological systems
Abstract: Steady state nonmonotonic (“biphasic”) dose responses are often observed in experimental biology, which raises the control-theoretic question of identifying which possible mechanisms might underlie such behaviors. It is well known that the presence of an incoherent feedforward loop (IFFL) in a network may give rise to a nonmonotonic response. It has been conjectured that this condition is also necessary, i.e. that a nonmonotonic response implies the existence of an IFFL. In this paper, we show that this conjecture is false, and in the process prove a weaker version: that either an IFFL must exist or both a positive feedback loop and a negative feedback loop must exist. Towards this aim, we give necessary and sufficient conditions for when minors of a symbolic matrix have mixed signs. Finally, we study in full generality when a model of immune T-cell activation could exhibit a steady state nonmonotonic dose response.
|
|
11:20-11:40, Paper WeA10.5 | |
>Spiking Systems in Population-Infection Dynamics |
|
Blanchini, Franco | Univ. Degli Studi Di Udine |
Giordano, Giulia | University of Trento |
Keywords: Systems biology, Biological systems
Abstract: Motivated by a class of models in population dynamics, we introduce the concept of spiking dynamical systems. A spiking system admits an asymptotically stable equilibrium but, under proper perturbations on the initial conditions in a compact region including the equilibrium, its output exhibits a spike of arbitrarily large magnitude before the state returns within the region. We consider a model that describes a well-documented phenomenon in caterpillar-virus dynamics: a sudden increase of the caterpillar population occurs, due to a temporary reduction of the viral population, and is then followed by a sudden decrease. We prove that the caterpillar-virus system is spiking according to our proposed mathematical definition: the model can yield arbitrarily large population densities for caterpillars, and then the original conditions are suddenly restored. When the model also takes into account environmental constraints that keep the caterpillar population bounded, the spike cannot be arbitrarily large, but the population density can get arbitrarily close to the maximal one that can be achieved in the absence of virus.
|
|
11:40-12:00, Paper WeA10.6 | |
>Analysis of the Identifying Regulation with Adversarial Surrogates Algorithm |
|
Teichner, Ron | Technion - Israel Institue of Technology |
Meir, Ron | Technion |
Margaliot, Michael | Tel Aviv University |
Keywords: Machine learning, Identification
Abstract: Given a time-series z[k], k=1...N of noisy measured outputs along a single trajectory of a dynamical system, the Identifying Regulation with Adversarial Surrogates (IRAS) algorithm aims to find a non-trivial first integral of the system, that is, a scalar function g such that g(zi)≈g(zj) , for all i, j. IRAS has been suggested recently and was used successfully in several learning tasks in models from biology and physics. Here, we give the first rigorous analysis of this algorithm in a specific setting. We assume that the observations admit a linear first integral and that they are contaminated by Gaussian noise. We show that in this case the IRAS iterations are closely related to the self-consistent-field (SCF) iterations for solving a generalized Rayleigh quotient minimization problem. Using this approach, we derive several sufficient conditions guaranteeing local convergence of IRAS to the linear first integral.
|
|
WeA11 |
Brown 2 |
Data Driven Control VII |
Regular Session |
Chair: Pasqualetti, Fabio | University of California, Riverside |
Co-Chair: Li, Zhongkui | Peking University |
|
10:00-10:20, Paper WeA11.1 | |
>Frequency-Based Design Method for Model-Free Controllers (I) |
|
Moreno, Marcos | Centre for Automation and Robotics (CSIC-UPM) |
Artuñedo, Antonio | Centre for Automation and Robotics (CSIC-UPM) |
Villagra, Jorge | CSIC |
Keywords: Data driven control, Computer-aided control design, Automotive control
Abstract: Model-Free Control (MFC) has been applied to a wide variety of systems in which it has shown its performance and robustness against plant changes. MFC offers ``model-free operation", but the controller design requires some information from the nominal plant. This paper introduces a new design method for model-free controllers that uses minimal data about the system and retrieves a set of stable controller configurations. This method is specifically developed for first-order model-free controllers, but can be extended to second-order controllers, and it relies in a frequency analysis of the controller and the plant. The main feature of the design method is decoupling the design of the main control parameter alpha from the rest, providing specific values for it. The efficacy of the proposed method will be showcased with some relevant application examples.
|
|
10:20-10:40, Paper WeA11.2 | |
>From Noisy Data to Consensus Control: A Localized Design Approach |
|
Chang, Zeze | Peking University |
Li, Zhongkui | Peking University |
Keywords: Data driven control, Cooperative control, LMIs
Abstract: This paper considers a localized data-driven consensus problem for leader-follower multi-agent systems characterized by unknown linear agent dynamics, where each agent computes its local control gain using only its locally collected noise-corrupted data. Both discrete-time and continuous-time data-driven protocols are presented, which can achieve leader-follower consensus by handling the challenge of the heterogeneity in control gains caused by local data sampling. The design of these data-driven consensus protocols involves low-dimensional linear matrix inequalities. Simulation examples are provided to demonstrate the effectiveness of the proposed methods
|
|
10:40-11:00, Paper WeA11.3 | |
>A Computationally Efficient Reformulation for Data-Enabled Predictive Control |
|
Faye-Bedrin, Alexandre | IETR, CentraleSupélec |
Aranovskiy, Stanislav | CentraleSupelec - IETR // Rennes |
Chauchat, Paul | Aix-Marseille Université |
Bourdais, Romain | CentraleSupelec - IETR |
Keywords: Data driven control, Direct adaptive control, Behavioural systems
Abstract: This work investigates the computational efficiency of Data-EnablEd Predictive Control (DeePC) reformulations. Based on Willems' fundamental lemma, this control method uses Hankel matrices to represent system dynamics. The size---in particular the number of columns---of these Hankel matrices can incur a significant computational complexity, which has seen several attempts at being reduced. We propose a reformulation of DeePC aiming for lower complexity and show online recursive updates of the data matrix. The method's effectiveness is illustrated by results obtained in simulation.
|
|
11:00-11:20, Paper WeA11.4 | |
>Data Informativity for Distributed Positive Stabilization |
|
Iwata, Takumi | Hiroshima University |
Azuma, Shun-ichi | Kyoto University |
Ariizumi, Ryo | Tokyo University of Agriculture and Technology |
Asai, Toru | Chubu University |
Keywords: Data driven control, Distributed control, LMIs
Abstract: This paper investigates data informativity for distributed positive stabilization. Under the situation that the system model is unavailable but the measurement data are available, we address the problem of finding a distributed controller such that the closed-loop system by state feedback is positive and stable. We clarify that a necessary and sufficient condition for the problem to be solvable is characterized by linear matrix inequalities (LMIs), and derive an LMI-based solution to the problem. To demonstrate our results, a numerical example is provided. Moreover, an application to the vehicle formation is given.
|
|
11:20-11:40, Paper WeA11.5 | |
>Data-Driven Expressions for the Control of Network Systems with Asynchronous Experiments |
|
Cianchi, Silvia | TU Delft, VITO |
Celi, Federico | University of California, Riverside |
Tesi, Pietro | Università Degli Studi Di Firenze |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Data driven control, Distributed control, Optimal control
Abstract: This paper proposes a direct data-driven approach to address decentralized control problems in network systems, i.e., systems formed by the interconnection of multiple sub-systems, or agents. Differently from previous work, in this paper we assume that coordination among agents is limited in the data collection phase. Specifically, while we allow for multiple experiments to be performed on the network, these can be asynchronous (meaning that we do not require that all agents take part to each experiment). We focus this study on an open-loop optimal control problem, and propose a strategy to reconstruct the missing experimental data, i.e., data from the agents not participating to a given experiment. Importantly, our data-reconstruction strategy does not compromise the performance or numerical reliability of the approach, as we give conditions under which the missing data can be exactly reconstructed. We complement our findings with numerical simulations, showcasing the effectiveness of our approach in decentralized control scenarios.
|
|
11:40-12:00, Paper WeA11.6 | |
>Approximation of Koopman Operator Using Spiking Neural Networks |
|
George, Arun M | TCS Research |
Banerjee, Dighanchal | TCS Research |
Bera, Titas | Tata Consultancy Services |
Dey, Sounak | TCS Research |
Keywords: Data driven control, Biologically-inspired methods, Neural networks
Abstract: To analyse, control, and predict the behaviour of the states of a non-linear dynamical system using measurement functions in Hilbert space are a widely used method known as Koopman operator theory. Traditional approaches leverage matrix-based methods or artificial neural networks (specifically encoder-decoder architectures) to approximate Koopman operator. However, such methods necessitate significant power and computational resources, hence may not be suitable for applications that require real-time on-board processing, such as sensor fusion in robotics/autonomous vehicles or adaptive control for drone stabilization. The recent development of brain-inspired spiking neural networks and neuromorphic computing platforms could provide an effective solution, as these offer extremely low-energy computation and real-time responses. In this paper, we introduce the implementation of a Spiking Neural Network (SNN), that can efficiently approximate Koopman operator. Our model, when tested over four systems, demonstrated significant computational savings - up to 4× fewer addition operations and 43× fewer multiplication operations, while using only 20% of the input data compared to its ANN counterpart.
|
|
WeA12 |
Brown 3 |
Iterative Learning Control |
Regular Session |
Chair: Meng, Deyuan | Beihang University (BUAA) |
Co-Chair: Chu, Bing | University of Southampton |
|
10:00-10:20, Paper WeA12.1 | |
>Data-Driven Safety-Critical Control with High-Order Iterative Control Barrier Function |
|
Dong, Zi-Yuan | Zhejiang University of Technology |
Yu, Xinyi | Zhejiang University of Technology |
Ou, Linlin | Zhejiang University of Technology |
Zhang, Yongqi | Zhejiang University of Technology |
Keywords: Data driven control, Adaptive control, Iterative learning control
Abstract: The stability and safety of uncertain nonlinear systems have always been an essential and hard task in the automation field. For the forward invariance of the safety set of the system under unmodeled dynamics, a new concept of data-driven high-order iterative control barrier function (DHI-CBF) is proposed in this paper. To overcome the structural complexity and dependence on dynamic models of traditional robot controllers, an iterative learning mechanism is introduced and a model-free iterative predictive controller (MFIPC) is designed. The controller can be progressively optimized by predicting future information so that it can cope with dynamic changes and uncertainties in the environment. On this basis, quadratic programming (QP) that unifies the DHI-CBF with the MFIPC is established, which is able to prioritize the safety of the system in case of conflict between the desired output and the security boundary. Finally, the application for the Franka-Panda robot demonstrates the superiority of the presented algorithm.
|
|
10:20-10:40, Paper WeA12.2 | |
>Data-Driven Norm-Optimal Control of Discrete Repetitive Processes |
|
Moore, Kevin L. | Colorado School of Mines |
Chu, Bing | University of Southampton |
Rapisarda, Paolo | Univ. of Southampton |
Keywords: Iterative learning control, Data driven control, Behavioural systems
Abstract: We present a data-driven, model-free approach to norm-optimal control of discrete repetitive processes, exploiting Willems' fundamental lemma. The algorithm is described in detail, and a rigorous analysis of its convergence properties is performed. A numerical example is provided to demonstrate the effectiveness of the proposed design methodology.
|
|
10:40-11:00, Paper WeA12.3 | |
>Trackability Compensation for Iterative Learning Control: A Data-Based Approach |
|
Wang, Chenchao | Beihang University |
Meng, Deyuan | Beihang University (BUAA) |
Wu, Yuxin | Beihang University (BUAA) |
Keywords: Iterative learning control, Data driven control, Optimal control
Abstract: This paper aims at proposing a data-based trackability compensation strategy for iterative learning control systems to enhance their tracking performances when confronted with untrackable references. By designing and leveraging offline input-output test principles, an alternative data-based representation is constructed, based on which a data-based trackability criterion is developed. In scenarios where the reference outputs are untrackable, by interconnecting the original system with an auxiliary system, the trackability set of the interconnected system is modified. Consequently, the originally untrackable references become trackable for the interconnected system, and the perfect tracking preformances of iterative learning control can be guaranteed.
|
|
11:00-11:20, Paper WeA12.4 | |
>A Trust-Region Method for Data-Driven Iterative Learning Control of Nonlinear Systems |
|
Wang, Jia | KU Leuven |
Hemelhof, Leander | Katholieke Universiteit Leuven |
Markovsky, Ivan | International Centre for Numerical Methods in Engineering and Ca |
Patrinos, Panagiotis | KU Leuven |
Keywords: Iterative learning control, Data driven control, Optimization
Abstract: This letter employs a derivative-free trust-region method to solve the norm-optimal iterative learning control problem for nonlinear systems with unknown dynamics. The iteration process is composed by two kinds of trials: main and additional trials. The tracking error is reduced in each main trial, and the additional trials explore the nonlinear dynamics around the main trial input. Then the trust-region subproblem is constructed based on the additional trial data, and solved to generate the next main trial input. The convergence of the tracking error is proved under mild assumptions. Our method is illustrated in simulations.
|
|
11:20-11:40, Paper WeA12.5 | |
>Data-Driven Multi-Modal Learning Model Predictive Control |
|
Kopp, Fionna | University of California, Berkeley |
Borrelli, Francesco | Unversity of California at Berkeley |
Keywords: Iterative learning control, Data driven control, Predictive control for nonlinear systems
Abstract: We present a Learning Model Predictive Controller (LMPC) for systems with multi-modal dynamics performing iterative control tasks. Our goal is to use historical data from previous task iterations to design a data-driven control policy for the multi-modal system when the current mode is unknown. We first propose a novel method for using data to learn local affine time-varying (ATV) models of the multi-modal system dynamics. Then we present how to construct data-driven LMPC terminal components from multi-modal historical data, and how to design the resulting LMPC policy. The effectiveness of our method is demonstrated in a simulation example of an autonomous vehicle driving on a friction-varying track.
|
|
11:40-12:00, Paper WeA12.6 | |
>Predictive Norm Optimal Iterative Learning Control for High-Performance Formation Control Problem |
|
Zhang, Yueqing | University of Southampton |
Chen, Bin | University of Sheffield |
Keywords: Iterative learning control, Optimization, Networked control systems
Abstract: This paper develops a predictive optimisation-based iterative learning control (ILC) strategy for the high-performance formation control problem in networked dynamical systems working repetitively. It avoids the need for exact model information in traditional methods and achieves high performance via a predictive framework incorporating a unique performance index that integrates both immediate and future performance. The proposed framework guarantees geometric convergence of the formation error norm to zero and is capable of handling both heterogeneous and non-minimum phase systems. A distributed implementation of the framework is developed using the Alternating Direction Method of Multipliers to guarantee the framework's scalability for large-scale networks. Rigorous convergence analysis and numerical examples are provided to confirm its effectiveness.
|
|
WeA13 |
Suite 1 |
Macroscopic Traffic Modelling and Control |
Invited Session |
Chair: Cicic, Mladen | University of California, Berkeley |
Co-Chair: Nick Zinat Matin, Hossein | University of California, Berkeley |
Organizer: Cicic, Mladen | University of California, Berkeley |
Organizer: Nick Zinat Matin, Hossein | University of California, Berkeley |
Organizer: Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Organizer: Delle Monache, Maria Laura | University of California, Berkeley |
|
10:00-10:20, Paper WeA13.1 | |
>Infrastructure-Dependent Ramp-Metering Control for METANET-S (I) |
|
Kamalifar, Ayda | University of Pavia |
Cenedese, Carlo | ETH Zurich |
Cucuzzella, Michele | University of Groningen |
Ferrara, Antonella | University of Pavia |
Keywords: Traffic control, Transportation networks, Control applications
Abstract: In this paper, we propose a novel infrastructure-dependent ramp-metering control for the recently proposed METANET with service station (METANET-s) model, i.e., a second-order macroscopic traffic model that, compared to the classical METANET, incorporates the dynamics of service stations on highways. We study the effect of a ramp-metering control scheme on a highway stretch with a service station and show that it is capable of actively regulate internal traffic demand attempting to exit the service station via its on-ramp, on top of contributing to decrease the traffic congestion on the mainstream. In fact, the proposed control scheme effectively prevents the backlog of vehicles attempting to merge back onto the mainstream. This dynamic control mechanism is further endowed by a route guidance control strategy increasing the share of vehicles stopping at the service station during mainstream congestion periods, e.g. via incentives. The combined effect of our control schemes allows to take full advantage of the presence of service stations, reducing the overall traffic congestion. Simulation results demonstrate the effectiveness of the proposed control strategies.
|
|
10:20-10:40, Paper WeA13.2 | |
>Generic Multi-Class Cell Transmission Model for Traffic Control (I) |
|
Cicic, Mladen | University of California, Berkeley |
Siri, Enrico | Inria Sophia, Université Côte D'Azur |
Delle Monache, Maria Laura | University of California, Berkeley |
Keywords: Traffic control, Modeling, Simulation
Abstract: Recent years have witnessed renewed interest in multi-class traffic models, inspired in no small part by the impending arrival of Connected and Autonomous Vehicles, whose behaviour is likely to differ from that of Human-Driven Vehicles. Although numerous multi-class traffic models have been proposed, consistent overarching theory is lacking. In this paper, we propose a generic first-order multi-class traffic modelling framework, intended to be sufficiently versatile to represent most of the traffic phenomena relevant to freeway control applications. Based on this framework, we are able to instantiate different specific multi-class models by appropriate choice of a few design functions. We restrict the design space by introducing a set of assumptions that these functions should follow, helping guide the modelling process. Finally, we study a simple control example where a single class of vehicles is controlled in order to dissipate congestion on a highway, and test the control law using several multi-class model variants. The simulation results show that proposed control is able to dissipate the congestion and harmonize the traffic without relying on knowing the exact underlying traffic dynamics.
|
|
10:40-11:00, Paper WeA13.3 | |
>Stop-And-Go Traffic Wave Attenuation: A Shared Control Approach (I) |
|
Salvato, Erica | University of Trieste |
Elia, Lorenzo | University of Trieste |
Fenu, Gianfranco | Univ. of Trieste |
Parisini, Thomas | Imperial C., Aalborg U. & Univ. of Trieste |
Keywords: Traffic control, Emerging control applications, Human-in-the-loop control
Abstract: This paper presents an innovative approach to address the challenge of stop-and-go wave mitigation in congested vehicular traffic scenarios. The proposed solution involves equipping human-driven vehicles with a controller that can effectively assist the driver by merging human input with the underlying automation input through arbitration. Specifically, our approach integrates a convex combination of human and automation inputs within the controller through a continuous and derivable sharing function. This integration allows for the fusion of human decision-making capabilities with automation's perception of the environment. We provide extensive simulation results to demonstrate the effectiveness of the proposed approach. In addition, theoretical guarantees are established for both the stability of individual vehicles and the string stability.
|
|
11:00-11:20, Paper WeA13.4 | |
>Dissipation of Stop-And-Go Waves in Traffic Flows Using Controlled Vehicles: A Macroscopic Approach |
|
Goatin, Paola | Inria |
Keywords: Stability of nonlinear systems, Traffic control, Autonomous vehicles
Abstract: We study the boundary stabilization of Generic Second Order Macroscopic traffic models in Lagrangian coordinates. These consist in 2x2 nonlinear hyperbolic systems of balance equations with a relaxation-type source term. We provide the existence of weak solutions of the Initial Boundary Value problem for generic relaxation terms. In particular, we do not require the "sub-characteristic" stability condition to hold, so that equilibria are unstable and perturbations may lead to the formation of large oscillations, modeling the appearance and persistence of stop-and-go waves. Moreover, since the largest eigenvalue of the system is null, the boundaries are characteristic, and the available results on boundary controllability do not apply. Therefore, we perform a detailed analysis of the Wave Front Tracking approximate solutions to show that weak solutions can be steered to the corresponding equilibrium state by prescribing the equilibrium speed at the right boundary. This corresponds to controlling the speed of one vehicle to stabilize the upstream traffic flow. The result is illustrated through a numerical example.
|
|
11:20-11:40, Paper WeA13.5 | |
>Constrained CAV Control for Mixed Vehicular Platoons Via Gain Parameterizations and Padé Approximations (I) |
|
Bahavarnia, MirSaleh | Vanderbilt University |
Ji, Junyi | Vanderbilt University |
Taha, Ahmad | Vanderbilt University |
Work, Daniel B. | Vanderbilt University |
Keywords: Autonomous vehicles, Traffic control, Optimal control
Abstract: The key objective of the connected and automated vehicle (CAV) platoon control problem is to regulate CAVs' position while ensuring stability and accounting for vehicle dynamics. The unconstrained version of this problem has thoroughly been investigated in the literature. We elaborate on the constrained version of this problem to theoretically mitigate the two shortcomings of the unconstrained counterpart: (i) the synthesis of unrealistic high-gain control parameters due to the lack of a systematic way to incorporate the lower and upper bounds on the control parameters, and (ii) the performance sensitivity to the communication delay due to inaccurate Taylor series approximation. The former is mitigated via a systematic parameterization of the control gains based on the Hurwitz stability criterion. The latter is mitigated by taking advantage of the well-known Padé approximation. The usefulness of the proposed theoretical results is assessed by performing numerous numerical simulations. Furthermore, a thorough comparative analysis is empirically conducted between the constrained and unconstrained versions of the CAV platoon control problem with application to the mixed vehicular platoon. Modern transportation systems will benefit from the proposed CAV controls by effectively attenuating the stop-and-go disturbance---a single cycle of deceleration followed by acceleration---amplification throughout the mixed vehicular platoon as it will potentially reduce collisions.
|
|
11:40-12:00, Paper WeA13.6 | |
>Performance-Barrier Periodic Event-Triggered PDE Control of Traffic Flow (I) |
|
Zhang, Peihan | University of California San Diego |
Rathnayake, Bhathiya | Student (University of California San Diego) |
Diagne, Mamadou | University of California San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Traffic control, Sampled-data control, Hybrid systems
Abstract: This paper enhances the recently introduced performance-barrier-based continuous-time event-triggered control (P-CETC) by introducing a periodically evaluated triggering function, which leads to the proposed performance-barrier-based periodic event-triggered control (P-PETC). This approach aims to stabilize stop-and-go oscillations in traffic flow through the activation of variable speed limits (VSL) at the downstream boundary of a freeway segment, described by the linearized Aw-Rascle-Zhang (ARZ) traffic model—a 2times 2 coupled hyperbolic system. P-PETC preserves the flexibility of P-CETC, which allows the Lyapunov function to increase within a performance barrier. This results in fewer control updates than regular continuous-time event-triggered control (R-CETC), where a strict decrease in the Lyapunov function is mandated. Furthermore, P-PETC guarantees exponential convergence to zero in the spatial L^2 norm and ensures a Zeno-free behavior. Through comparative simulations, P-PETC is shown to match P-CETC in traffic metrics such as driver comfort, total travel time, and fuel consumption. Notably, P-PETC significantly reduces discomfort by nearly 50% compared to the open-loop scenario and increases the average dwell time by up to 8 times relative to R-CETC.
|
|
WeA14 |
Suite 2 |
Estimation III |
Regular Session |
Chair: Horn, Martin | Graz University of Technology |
Co-Chair: Mu, Biqiang | Chinese Academy of Sciences |
|
10:00-10:20, Paper WeA14.1 | |
>Regret Analysis with Almost Sure Convergence for OBF-ARX Filter |
|
Li, Jiayun | Tsinghua University |
Lu, Yiwen | Tsinghua University |
Mo, Yilin | Tsinghua University |
Keywords: Identification, Kalman filtering, Estimation
Abstract: This paper considers the output prediction problem for an unknown Linear Time-Invariant (LTI) system. In particular, we focus our attention on the OBF-ARX filter, whose transfer function is a linear combination of Orthogonal Basis Functions (OBFs), with the coefficients determined by solving a least-squares regression. We prove that the OBF-ARX filter is an accurate approximation of the Kalman Filter (KF) by quantifying its online performance. Specifically, we analyze the average regret between the OBF-ARX filter and the KF, proving that the average regret over N time steps converges to the asymptotic bias at the speed of O(N^{-0.5+epsilon}) almost surely for all epsilon>0. Then, we establish an upper bound on the asymptotic bias, demonstrating that it decreases exponentially with the number of OBF bases, and the decreasing rate tau(boldsymbol{lambda}, boldsymbol{mu}) explicitly depends on the poles of both the KF and the OBF. Numerical results on diffusion processes validate the derived bounds.
|
|
10:20-10:40, Paper WeA14.2 | |
>Asymptotic Properties of Generalized Maximum Likelihood Hyper-Parameter Estimator for Regularized System Identification |
|
Zhang, Meng | The Chinese University of Hong Kong, Shenzhen |
Chen, Tianshi | The Chinese University of Hong Kong, Shenzhen, China |
Mu, Biqiang | Chinese Academy of Sciences |
Keywords: Identification, Linear systems
Abstract: Regularized system identification is one of the major advances in the field of system identification in the last decade. One key issue is the hyper-parameter estimation, for which the generalized maximum likelihood (GML) estimator is a popular one closely related to the empirical Bayes (EB) method. Considering the rich theoretical results on the EB estimator, the asymptotic properties of the GML estimator has not been studied before and is critical for understanding its efficacy when the sample size is large. In this paper, we investigate the asymptotic properties of the GML estimator and show that the GML estimator is asymptotically equivalent to the EB estimator. Furthermore, Monte-Carlo simulations verify their asymptotic equivalence and also indicates the GML estimator outperform the EB estimator for small sample sizes.
|
|
10:40-11:00, Paper WeA14.3 | |
>Finite-Sample System Identification with Residual-Permuted Sums |
|
Szentpéteri, Szabolcs | SZTAKI |
Csáji, Balázs Cs. | HUN-REN SZTAKI |
Keywords: Identification, Randomized algorithms, Linear systems
Abstract: This letter studies a distribution-free, finite-sample data perturbation (DP) method, the Residual-Permuted Sums (RPS), which is an alternative of the Sign-Perturbed Sums (SPS) algorithm, to construct confidence regions. While SPS assumes independent (but potentially time-varying) noise terms which are symmetric about zero, RPS gets rid of the symmetricity assumption, but assumes i.i.d. noises. The main idea is that RPS permutes the residuals instead of perturbing their signs. This letter introduces RPS in a flexible way, which allows various design-choices. RPS has exact finite sample coverage probabilities and we provide the first proof that these permutation-based confidence regions are uniformly strongly consistent under general assumptions. This means that the RPS regions almost surely shrink around the true parameters as the sample size increases. The ellipsoidal outer-approximation (EOA) of SPS is also extended to RPS, and the effectiveness of RPS is validated by numerical experiments, as well.
|
|
11:00-11:20, Paper WeA14.4 | |
>Identifiability of the Linear Threshold Model |
|
Lekamalage, Anuththara Sarathchandra | Brock University |
Ramazi, Pouria | Brock University |
Keywords: Identification, Game theory
Abstract: In the linear threshold model, each individual has a time-invariant threshold and an initial action A or B. At each time step one or more individuals become active to revise their action depending on their own threshold and the population proportion of each action. The resulting decision-making dynamics can be predicted and controlled, provided that the thresholds of individuals are known. In practice, however, the thresholds are unknown and often only the evolution of the total number of individuals who have chosen one action is known. The question then is whether the thresholds are identifiable given this quantity over time. We find necessary and sufficient conditions for threshold identifiability of the linear threshold model under synchronous and asynchronous decision-making. The results open the door for reliable estimation of the thresholds, and in turn, prediction and control of the decision-making dynamics using real data.
|
|
11:20-11:40, Paper WeA14.5 | |
>An Analytical Framework for Utilizing Chirp Signals in System Identification |
|
Zeiringer, Thomas | Graz University of Technology |
Hölzl, Stefan Lambert | Graz University of Technology |
Horn, Martin | Graz University of Technology |
Keywords: Identification for control, Identification, Linear systems
Abstract: A method for synthesizing a chirp signal given its desired spectrum is proposed, extending the application of established experiment design methods to chirp signals. Further, aspects concerning plant-friendly system identification are discussed and variations of the regular chirp signal are therefore introduced. An analytical expression for the system's response to a chirp signal is derived and used to discuss the application of chirp signals in detail. Finally, an experiment is presented to compare chirp and noise signals for system identification.
|
|
11:40-12:00, Paper WeA14.6 | |
>Kernel-Based Regularized Continuous-Time System Identification from Sampled Data |
|
Fang, Xiaozhu | The Chinese University of Hong Kong, Shenzhen |
Mu, Biqiang | Chinese Academy of Sciences |
Chen, Tianshi | The Chinese University of Hong Kong, Shenzhen, China |
Keywords: Identification
Abstract: The identification of continuous-time (CT) systems from discrete-time (DT) input and output signals, i.e., the sampled data, has received considerable attention for half a century. The state-of-the-art methods are parametric methods and thus subject to the typical issues of parametric methods. In the last decade, a major advance in system identification is the so-called kernel-based regularization method (KRM), which is free of the issues of parametric methods. It is interesting to test the potential of KRM on CT system identification. However, very few results have been reported before, mainly because the estimators have no closed forms for general CT input signals, except for some very special cases. In this paper, we show for KRM that, the estimators have closed forms when the DT input signal has the typical intersample behavior: zero-order hold or band-limited, and thus paves the way for the application of KRM for CT system identification. Numerical Monte Carlo simulations show that the proposed method is more robust compared to the state-of-the-art methods, and also more accurate when the sample size is small.
|
|
WeA15 |
Suite 3 |
Power Systems I |
Regular Session |
Chair: Hidalgo-Gonzalez, Patricia | University of California, San Diego |
Co-Chair: Villagra, Matias | Columbia University |
|
10:00-10:20, Paper WeA15.1 | |
>Frequency Dynamics with Inverters: Proof of Stabilizability and Existence of Nash Equilibrium |
|
Serna-Torre, Paul | University of California San Diego |
Hidalgo-Gonzalez, Patricia | University of California, San Diego |
Keywords: Power systems, Game theory, Linear systems
Abstract: We model frequency dynamics for power systems with inverters and provide analytical results that enable the development of a novel control scheme for frequency regulation using non-cooperative linear quadratic differential games (NLQGs). First, we prove for the first time that the model for frequency dynamics consisting of n synchronous generators (SGs), r sixth-order-model grid following inverters (GFLIs), and a Laplacian network matrix of a grid with n nodes is stabilizable. We leverage this analytical result and propose a compensator design to ensure the existence of a Nash equilibrium solution in a NLQG for frequency regulation that is mindful of networked inverter dynamics. In this NLQG, we reformulate the frequency dynamics model of an electrical grid considering that n SGs and r GFLIs jointly participate in frequency regulation. All agents are selfish such that each of them seeks to minimize its individual linear quadratic cost during the frequency regulation service. Simulations in a 12-bus network show that the proposed control scheme enables 30%-83% less overshoot and 56%-69% faster settling times than conventional frequency regulation. Furthermore, the proposed control scheme is still able to steer frequency back to nominal value despite contingencies, e.g.,disconnection of a transmission line or a SG.
|
|
10:20-10:40, Paper WeA15.2 | |
>Game-Theoretic Learning for Power System Dynamic Ancillary Service Provisions |
|
Xie, Haiwei | Delft University of Technology |
Cremer, Jochen | Delft University of Technology |
Keywords: Power systems, Game theory, Optimization
Abstract: This paper studies the problem of coordinating aggregators in the power system to provide fast frequency response as dynamic ancillary services. We approach the problem from the perspective of suboptimal H∞ control, and propose an efficient and tractable formulation. We further develop a distributed solution method for the investigated problem, which enables aggregator agents to learn their optimal provisions in an adaptive way. More precisely, we reformulate the original problem into a state-based potential game, where the agents interact with each other towards our designed Nash equilibrium. The proposed game-theoretic learning approach decouples the coupling Linear Matrix Inequality constraint, guarantees the convergence to the equilibrium which is close enough to the original optimum. The learning process is also robust to the changes in communication graphs. We demonstrate the efficacy of our proposed approach with a case study on a 3-aggregator system.
|
|
10:40-11:00, Paper WeA15.3 | |
>Mean-Field Stackelberg Game for Mitigating the Strategic Bidding of Energy Consumers in Congested Distribution Networks |
|
Silani, Amirreza | Delft University of Technology |
Tindemans, Simon H. | TU Delft |
Keywords: Power systems, Game theory
Abstract: The sudden proliferation of Electric Vehicles (EVs), batteries and photovoltaic cells in power networks can lead to congested distribution networks. A substitute for upgrading network capacity is a redispatch market that enables the Distribution System Operators (DSOs) to mitigate congested networks by requesting the energy consumers to modify their consumption schedules. However, energy consumers are able to strategically modify their day-ahead market bids in anticipation of the redispatch market outcomes. This behaviour, which is known as increase-decrease gaming, can exacerbate congestion and give arbitrage opportunities to the energy consumers for gaining windfall profits from the DSO. In this paper, we propose an algorithm based on mean-field Stackelberg game to mitigate the increase-decrease game for large populations of energy consumers. In this game, the energy consumers (followers) maximize their individual welfare on the day-ahead market with anticipation of the redispatch market outcomes while the leader maximizes the social welfare of all agents and minimizes the costs of DSO on the redispatch market. We show the convergence of this algorithm to the mean-field leader-follower varepsilon_N-Nash equilibrium.
|
|
11:00-11:20, Paper WeA15.4 | |
>Learning Uncertainty Set for Adaptive Robust DC-OPF |
|
Gu, Nan | Tsinghua University |
Yuan, Enming | Tsinghua University |
Wu, Chenye | The Chinese University of Hong Kong, Shenzhen |
Keywords: Power systems, Optimization, Machine learning
Abstract: In robust optimization for power system operations, striking a balance between solution robustness and performance is crucial. Unlike conventional interval-based uncertainty sets, which treat random variables as independent entities, our approach introduces a compact, coupled representation of these variables. We establish theoretical benchmarks to assess the benefits of employing this coupled uncertainty set in the context of the DC optimal power flow problem. Moreover, we have devised a pioneering data-driven algorithm capable of autonomously learning the shape of the parametric uncertainty set. This algorithm concurrently optimizes performance and furnishes solutions with statistical guarantees in terms of generalization capabilities. The effectiveness of this algorithm is validated through case studies on both a synthetic dataset and a real-world problem.
|
|
11:20-11:40, Paper WeA15.5 | |
>GPU-Accelerated Sequential Quadratic Programming Algorithm for Solving ACOPF |
|
Li, Bowen | University of Michigan |
Kim, Kibaek | Argonne National Laboratory |
Keywords: Power systems, Optimization algorithms, Large-scale systems
Abstract: Sequential quadratic programming (SQP) is a powerful and widely-used method in solving nonlinear optimization problems, with advantages of warm-starting solutions, high solution accuracy, and quadratic convergence due to the use of second-order information, including the Hessian matrix. In this study, we have developed a scalable SQP algorithm for solving the alternating current optimal power flow problem (ACOPF), leveraging the parallel computing capabilities of graphics processing units (GPUs). Our methodology incorporates the alternating direction method of multipliers (ADMM) to initialize and decompose the quadratic programming subproblems within each SQP iteration into independent small subproblems for each electric grid component. We have implemented the proposed SQP algorithm using our portable and efficient Julia package, ExaAdmm.jl, which solves the ADMM subproblems in parallel on all major GPU architectures. For numerical experiments, we compared three solution approaches: (i) the SQP algorithm with a GPU-based ADMM subproblem solver, (ii) a CPU-based ADMM solver, and (iii) the QP solver Ipopt (the state-of-the-art interior point solver). We observed that for larger instances, our GPU-based SQP solver effectively leverages the many-core GPU architecture, dramatically reducing the solution time.
|
|
11:40-12:00, Paper WeA15.6 | |
>Accurate and Warm-Startable Linear Cutting-Plane Relaxations for ACOPF |
|
Bienstock, Daniel | Columbia University |
Villagra, Matias | Columbia University |
Keywords: Power systems, Optimization algorithms, Nonlinear systems
Abstract: We present a linear cutting-plane relaxation approach that rapidly proves tight lower bounds for the Alternating Current Optimal Power Flow Problem (ACOPF). Our method leverages outer-envelope linear cuts for well-known second-order cone relaxations for ACOPF along with modern cut management techniques. These techniques prove effective on a broad family of ACOPF instances, including the largest ones publicly available, quickly and robustly yielding sharp bounds. Our primary focus concerns the (frequent) case where an ACOPF instance is considered following a small or moderate change in problem data, e.g., load changes and generator or branch shut-offs. We provide significant computational evidence that the cuts computed on the prior instance provide an effective warm-start for our algorithm.
|
|
WeA16 |
Suite 4 |
Constrained Control I |
Regular Session |
Chair: Kim, H. Jin | Seoul National University |
Co-Chair: Jagtap, Pushpak | Indian Institute of Science |
|
10:00-10:20, Paper WeA16.1 | |
>Estimation of Constraint Admissible Invariant Set with Neural Lyapunov Function |
|
Kim, Dabin | Seoul National University |
Kim, H. Jin | Seoul National University |
Keywords: Constrained control, Nonlinear systems, Data driven control
Abstract: Constraint admissible positively invariant (CAPI) sets play a pivotal role in ensuring safety in control and planning applications, such as the recursive feasibility guarantee of explicit reference governor and model predictive control. However, existing methods for finding CAPI sets for nonlinear systems are often limited to single equilibria or specific system dynamics. This limitation underscores the necessity for a method to construct a CAPI set for general reference tracking control and a broader range of systems. In this work, we leverage recent advancements in learning-based methods to derive Lyapunov functions, particularly focusing on those with piecewise-affine activation functions. Previous attempts to find an invariant set with the piecewise-affine neural Lyapunov function have focused on the estimation of the region of attraction with mixed integer programs. We propose a methodology to determine the maximal CAPI set for any reference with the neural Lyapunov function by transforming the problem into multiple linear programs. Additionally, to enhance applicability in real-time control scenarios, we introduce a learning-based approach to train the estimator, which infers the CAPI set from a given reference. The proposed approach is validated with multiple simulations to show that it can generate a valid CAPI set with the given neural Lyapunov functions for any reference. We also employ the proposed CAPI set estimation method in the explicit reference governor and demonstrate its effectiveness for constrained control.
|
|
10:20-10:40, Paper WeA16.2 | |
>Safe Tracking Control of Nonlinear Systems Based on Optimal Control for Low-Fidelity Models |
|
Castroviejo-Fernandez, Miguel | University of Michigan |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Constrained control, Supervisory control, Predictive control for nonlinear systems
Abstract: This letter considers the control of discrete-time nonlinear systems subject to output constraints. A control strategy is proposed which combines an input sequence generator, based on solving an optimal control problem for a lower-fidelity linear model, with a prediction-based supervisory scheme that adjusts the reference command to ensure constraints satisfaction for the original nonlinear system. Under suitable assumptions, recursive feasibility, finite-time convergence of the applied reference command to a desired strictly steady state admissible reference command and asymptotic convergence of the state to the associated steady state are shown. Simulation results illustrate the proposed methodology for a highly nonlinear spacecraft control maneuver.
|
|
10:40-11:00, Paper WeA16.3 | |
>Efficient and Safe Learning-Based Control of Piecewise Affine Systems Using Optimization-Free Safety Filters |
|
He, Kanghui | Delft University of Technology |
Shi, Shengling | Delft University of Technology |
van den Boom, Ton J. J. | Delft Univ. of Tech |
De Schutter, Bart | Delft University of Technology |
Keywords: Constrained control, Learning, Hybrid systems
Abstract: Control of piecewise affine (PWA) systems under complex constraints faces challenges in guaranteeing both safety and online computational efficiency. Learning-based methods can rapidly generate control signals with good performance, but rarely provide safety guarantees. A safety filter is a modular method to improve safety for any controller. When applied to PWA systems, a traditional safety filter usually need to solve a mixed-integer convex program, which reduces the computational benefit of learning-based controllers. We propose a novel optimization-free safety filter designed to handle state constraints that involve a combination of polyhedra and ellipsoids. The proposed safety filter only utilizes algebraic and min-max operations to determine safe control inputs. This offers a notable advantage compared with traditional safety filters by allowing for significantly more efficient computation of control signals. The proposed safety filter can be integrated into various function approximators, such as neural networks, enabling safe learning throughout the learning process. Simulation results on a bicycle model with PWA approximation validate the proposed method regarding constraint satisfaction, CPU time, and the preservation of sub-optimality.
|
|
11:00-11:20, Paper WeA16.4 | |
>Prescribed-Time Reach-Avoid-Stay Specifications for Unknown Systems: A Spatiotemporal Tubes Approach |
|
Das, Ratnangshu | Indian Institute of Science, Bangalore |
Jagtap, Pushpak | Indian Institute of Science |
Keywords: Constrained control, Lyapunov methods
Abstract: The paper considers controller synthesis problems for control-affine nonlinear systems with unknown dynamics, aiming to fulfil reach-avoid-stay specifications in a prescribed time. The research's primary aim is to devise a closed-form, approximation-free control strategy that ensures the system's trajectory reaches a target set, avoids an unsafe set, and complies with state constraints. To address this challenge, the paper introduces a spatiotemporal tube framework that encapsulates both reaching and avoiding requirements via reachability tubes and a circumvent function, respectively. Following this, the paper presents the control strategy and validates its effectiveness through a robot navigation case study.
|
|
11:20-11:40, Paper WeA16.5 | |
>Robust Safety-Critical Control for Systems with Sporadic Measurements and Dwell Time Constraints |
|
Breeden, Joseph | University of Michigan, Ann Arbor |
Zaccarian, Luca | LAAS-CNRS |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Aerospace, Hybrid systems
Abstract: This paper presents extensions of control barrier function (CBF) theory to systems with disturbances wherein a controller only receives measurements infrequently and operates open-loop between measurements, while still satisfying state constraints. The paper considers both impulsive and continuous actuators, and models the actuators, measurements, disturbances, and timing constraints as a hybrid dynamical system. We then design an open-loop observer that bounds the worst-case uncertainty between measurements. We develop definitions of CBFs for both actuation cases, and corresponding conditions on the control input to guarantee satisfaction of the state constraints. We apply these conditions to simulations of a satellite rendezvous in an elliptical orbit and autonomous orbit stationkeeping.
|
|
11:40-12:00, Paper WeA16.6 | |
>Rollover Prevention for Mobile Robots with Control Barrier Functions: Differentiator-Based Adaptation and Projection-To-State Safety |
|
Das, Ersin | Caltech |
Ames, Aaron D. | California Institute of Technology |
Burdick, Joel W. | California Inst. of Tech |
Keywords: Constrained control, Control applications, Robotics
Abstract: This paper develops rollover prevention guarantees for mobile robots using control barrier function (CBF) theory, and demonstrates these formal results experimentally. To this end, we consider a safety measure based on the zero moment point to provide conditions on the control input through the lens of CBFs. However, these conditions depend on time-varying and noisy parameters. To address this, we present a differentiator-based safety-critical controller that estimates these parameters and pairs Input-to-State Stable (ISS) differentiator dynamics with CBFs to achieve rigorous guarantees of safety. Additionally, to ensure safety in the presence of disturbance, we utilize a time-varying extension of Projection-to-State Safety (PSSf). The effectiveness of the proposed method is demonstrated through experiments on a tracked robot with a rollover potential on steep slopes.
|
|
WeA17 |
Suite 6 |
Autonomous Robots I |
Regular Session |
Chair: Naldi, Roberto | Università Di Bologna |
Co-Chair: Diehl, Moritz | University of Freiburg |
|
10:00-10:20, Paper WeA17.1 | |
>Adaptive Robot Detumbling of a Non-Rigid Satellite |
|
Gao, Longsen | University of New Mexico |
Danielson, Claus | University of New Mexico |
Fierro, Rafael | University of New Mexico |
Keywords: Autonomous robots, Adaptive systems, Aerospace
Abstract: The challenge of satellite stabilization, particularly those with uncertain flexible dynamics, has become a pressing concern in control and robotics. These uncertainties, especially the dynamics of a third-party client satellite, significantly complicate the stabilization task. This paper introduces a novel adaptive detumbling method to handle non-rigid satellites with unknown motion dynamics (translation and rotation). The distinctive feature of our approach is that we model the nonrigid tumbling satellite as a two-link serial chain with unknown stiffness and damping in contrast to previous detumbling research works which consider the satellite a rigid body. We develop a novel adaptive robotics approach to detumble the satellite by using two space tugs as servicers despite the uncertain dynamics in the post-capture case. Notably, the stiffness properties and other physical parameters, including the mass and inertia of the two links, remain unknown to the servicer. Our proposed method addresses the challenges in detumbling tasks and paves the way for advanced manipulation of non-rigid satellites with uncertain dynamics.
|
|
10:20-10:40, Paper WeA17.2 | |
>Graph-Based Deep Reinforcement Learning Approach for Alliance Formation Game Based Robot Swarm Task Assignment |
|
Lv, Yang | TongJi University |
Lei, Jinlong | Tongji University |
Yi, Peng | Tongji University |
Keywords: Autonomous robots, Autonomous systems, Agents-based systems
Abstract: This paper explores the robot swarm task allocation problem based on alliance formation game theory, which treats each robot as an autonomous agent capable of forming strategic alliances for task completion, with a focus on optimizing overall system revenue and mutual benefits. To resolve the problem, we introduce a unique graph-based Deep Reinforcement Learning (DRL) framework named AFGNet_DDQN. Firstly, we construct an Allocation Feature Graph (AFG) that intricately maps the complex interactive relationships and allocation features among robots and tasks, and develop the AFGNet architecture to efficiently extract features from the graph nodes. Then thorugh reconstructing a Markov Decision Process (MDP) within this graph, we implement an advanced version of Double Deep Q-Networks (DDQN) algorithm, adapted for our graph-based framework. This setup allows for the effective learning and optimization of task allocation strategies through localized interactions among the robots. Finally, our empirical results demonstrate the superiority of our framework over traditional methods, especially in terms of scalability and robustness.
|
|
10:40-11:00, Paper WeA17.3 | |
>SCvxPyGen : Autocoding SCvx Algorithm |
|
Berrah, Danil | ENSTA Paris, Institut Polytechnique De Paris |
Chapoutot, Alexandre | ENSTA Paris |
Garoche, Pierre Loic | ENAC |
Keywords: Autonomous robots, Control software, Autonomous vehicles
Abstract: In this paper, we address the embedded code generation for an optimal control algorithm, SCVX, which is particularly suitable for solving trajectory planning problems with collision avoidance constraints. Producing code compatible with embedded systems constraints will support the use of the SCVX algorithm in a real-time configuration. Existing uses of SCVX on drones or embedded platforms are currently handcrafted code. On the other hand, recent toolboxes such as SCPToolbox provide a simpler access to these trajectory planning algorithms, based on the resolution of a sequence of convex sub-problems. We define here a framework, in Python, enabling the automatic code generation for SCVX, in C, based on CVXPYgen and the ECOS solver. The framework is able to address problems involving non-convex constraints such as obstacle avoidance. This is a first step towards a more streamlined process to auto-code trajectory planning algorithms and convex optimization solvers.
|
|
11:00-11:20, Paper WeA17.4 | |
>Target Search and Tracking in an Unknown Environment by a Fleet of UAVs Using a Set-Membership Approach |
|
Zagar, Maxime | Université Paris-Saclay |
Meyer, Luc | ONERA, Univ Paris Saclay |
Kieffer, Michel | CNRS - Univ Paris-Sud - CentraleSupelec |
Piet-Lahanier, Helene | ONERA |
Keywords: Autonomous robots, Cooperative control, Vision-based control
Abstract: This paper addresses the problem of search and tracking an unknown number of mobile ground targets within a Region of Interest (RoI) using a fleet of cooperating Unmanned Aerial Vehicles (UAVs). Each UAV embeds a Computer Vision System providing images with labeled pixels, depth maps, and bounding boxes around identified targets. This information is used by a set-membership estimator to characterize sets guaranteed to contain the locations of already identified targets and a set containing the locations of all targets remaining to detect. A map of the unknown environment is constructed during search to favor exploration of areas previously occluded by obstacles.
|
|
11:20-11:40, Paper WeA17.5 | |
>Bearing-Based Target Localisation in Search and Rescue Scenarios |
|
Michieletto, Giulia | University of Padova |
Mimmo, Nicola | University of Bologna |
Naldi, Roberto | Università Di Bologna |
Cenedese, Angelo | University of Padova |
Keywords: Autonomous robots, Extremum seeking, Control applications
Abstract: This paper deals with the target localisation problem in search and rescue scenarios in which the technology is based on electromagnetic transceivers. The noise floor and the shape of the electromagnetic radiation pattern make this problem challenging. Indeed, on the one hand, the signal-to-noise ratio reduces with the inverse of the distance from the electromagnetic source thus impacting estimation-based techniques applicability. On the other hand, non-isotropic radiation patterns reduce the efficacy of gradient-based policies. In this work, we manage a fleet of autonomous agents, equipped with electromagnetic sensors, by combining gradient-based and estimation-based techniques to speed up the transmitter localisation. Simulations specialized in the so-called ARTVA technology used in search and rescue in avalanche scenarios confirm that our scheme outperforms current solutions.
|
|
11:40-12:00, Paper WeA17.6 | |
>Real-Time-Feasible Collision-Free Motion Planning for Ellipsoidal Objects |
|
Gao, Yunfan | Robert Bosch GmbH |
Messerer, Florian | University of Freiburg |
Van Duijkeren, Niels | Robert Bosch GmbH |
Houska, Boris | ShanghaiTech University |
Diehl, Moritz | University of Freiburg |
Keywords: Autonomous robots, Optimal control, Constrained control
Abstract: Online planning of collision-free trajectories is a fundamental task for robotics and self-driving car applications. This paper revisits collision avoidance between ellipsoidal objects using differentiable constraints. Two ellipsoids do not overlap if and only if the endpoint of the vector between the center points of the ellipsoids does not lie in the interior of the Minkowski sum of the ellipsoids. This condition is formulated using a parametric over-approximation of the Minkowski sum, which can be made tight in any given direction. The resulting collision avoidance constraint is included in an optimal control problem (OCP) and evaluated in comparison to the separating-hyperplane approach. Not only do we observe that the Minkowski-sum formulation is computationally more efficient in our experiments, but also that using pre-determined over-approximation parameters based on warm-start trajectories leads to a very limited increase in suboptimality. This gives rise to a novel real-time scheme for collision-free motion planning with model predictive control (MPC). Both the real-time feasibility and the effectiveness of the constraint formulation are demonstrated in challenging real-world experiments.
|
|
WeA18 |
Suite 7 |
Stability of Nonlinear Systems I |
Regular Session |
Chair: Valmorbida, Giorgio | L2S, CentraleSupelec |
Co-Chair: Oliveira, Paulo | Instituto Superior Técnico |
|
10:00-10:20, Paper WeA18.1 | |
>Plant-Order Saturated Output-Feedback Regional Controller Synthesis with Sign-Indefinite Quadratic Forms |
|
Pantano Calderón, Santiago | LAAS-CNRS |
Tarbouriech, Sophie | LAAS-CNRS |
Zaccarian, Luca | LAAS-CNRS |
Keywords: Stability of nonlinear systems, LMIs, Lyapunov methods
Abstract: This note provides means to design a dynamic plant-order output-feedback controller for linear plants subject to saturating input with measurable output. Based on Linear Matrix Inequalities (LMIs), together with appropriate transformations and sector conditions, the proposed solution exploits sign-indefinite quadratic forms to define a locally positive definite piece-wise Lyapunov function providing non-ellipsoidal estimates of the closed-loop basin of attraction. With guaranteed local exponential stability, methods to ensure a prescribed local exponential convergence rate and to maximize the estimates of the region of attraction are also given.
|
|
10:20-10:40, Paper WeA18.2 | |
>A Lyapunov-Based Small-Gain Theorem for Finite Time Input-To-State Stablity of Discrete Time Infinite Networks |
|
Mo, Yuanqiu | Southeast University |
Yu, Wenwu | Southeast University |
Dasgupta, Soura | Univ. of Iowa |
Keywords: Stability of nonlinear systems, Lyapunov methods, Large-scale systems
Abstract: This paper considers the finite time input-to-state stability (FTISS) with respect to a closed set for discrete time infinite networks composed of a potentially infinite finite-dimensional subsystems. Towards this end, FTISS Lyapunov functions are first provided for infinite networks, via leveraging the existing tools for discrete time finite networks. Further, a small gain condition is postulated so that FTISS Lyapunov functions for the overall system can be constructed from the FTISS Lyapunov-like functions for each subsystem. The established small gain result is scale-free as it can be applied to any truncation of the original infinite network while maintaining quantitative stability properties.
|
|
10:40-11:00, Paper WeA18.3 | |
>Further Stability Analysis of Delayed Neural Networks Based on Novel Activation Function Dependent Lyapunov-Krasovskii Functional |
|
Lee, Jun Hui | POSTECH |
Na, Hyeon-Woo | POSTECH |
Lee, Hae Seong | POSTECH |
Park, PooGyeon | POSTECH (Pohang Univ. of Sci. & Tech.) |
Keywords: Stability of nonlinear systems, Lyapunov methods, LMIs
Abstract: This paper addresses the stability analysis problem of Delayed Neural Networks (DNNs) based on the Lyapunov Krasovskii functional (LKF) approach. The introduction of the novel activation function-dependent double integral LKFs (AFDLKFs) incorporates additional constraint information on activation function into the stability analysis process. This information, not considered in the conventional single integral LKFs, plays a crucial role in deriving less conservative stability criteria. Additionally, leveraging the bound lemma for line integral, the upper bound of the single integral terms arising from the derivative of AFDLKFs are approximated in the form of linear matrix inequalities. Furthermore, relaxation of the positive definite condition of LKF is achieved based on integral inequalities based on free matrices. The combination of these two techniques leads to improved stability criteria. The advantages and effectiveness of the proposed stability criteria are demonstrated through a numerical example.
|
|
11:00-11:20, Paper WeA18.4 | |
>Exponential Stability of Continuous-Time Piecewise Affine Systems |
|
Cabral, Leonardo | Universidade Federal Do Rio Grande Do Sul |
Valmorbida, Giorgio | L2S, CentraleSupelec |
Gomes da Silva Jr, Joao Manoel | Universidade Federal Do Rio Grande Do Sul |
Keywords: Stability of nonlinear systems, Lyapunov methods, LMIs
Abstract: This work addresses the problem of global exponential stability analysis of the origin of continuous-time Continuous Piecewise Affine (CPWA) systems. The stability analysis in this paper considers Piecewise Quadratic (PWQ) Lyapunov Functions (LF) and a ramp-based implicit representation of PWA systems. Sufficient convex stability conditions are obtained in the form of a Semidefinite Programming (SDP) problem. Two major benefits arise from the proposed results: i) the need for equality constraints to ensure the continuity of the LF across the boundaries of the sets of the partition is withdrawn; ii) there is no need to consider separate SDP conditions for each set of the partition, which simplifies the application of the conditions. The effectiveness of the proposed method is illustrated in numerical examples.
|
|
11:20-11:40, Paper WeA18.5 | |
>Characterizing Nonlinear Systems with Mixed Input-Output Properties through Dissipation Inequalities |
|
van den Eijnden, Sebastiaan | Eindhoven University of Technology |
Chaffey, Thomas | University of Cambridge |
Keywords: Stability of nonlinear systems, Nonlinear systems, LMIs
Abstract: Systems that show different characteristics, such as finite-gain and passivity, depending on the nature of the inputs, are said to possess mixed input-output properties. In this paper, we provide a constructive method for characterizing mixed input-output properties of nonlinear systems using a dissipativity framework. Our results take inspiration from the generalized Kalman-Yakubovich-Popov lemma, and show that a system is mixed if it is dissipative with respect to highly specialized supply rates. The mixed input-output characterization is used for assessing stability of feedback interconnections in which the feedback components violate conditions of classical results such as the small-gain and passivity theorem. We showcase applicability of our results through various examples.
|
|
11:40-12:00, Paper WeA18.6 | |
>Global Exponential Stabilization and Global mathcal{L}_p Performance of a Saturated Double Integrator |
|
Martins, Luis | Instituto Superior Técnico |
Cardeira, Carlos | IDMEC/Instituto Superior Tecnico |
Oliveira, Paulo | Instituto Superior Técnico |
Keywords: Stability of nonlinear systems, Constrained control, Robust control
Abstract: This paper presents novel results on the simultaneous global internal and external stabilization problem for the double integrator controlled by a saturated static state linear feedback. The methodology capitalizes on the distinctive characteristics of the smooth strictly increasing saturation function considered to strengthen and extend the stability properties reported in the existing literature for this canonical system. Concretely, as the main contributions, this work demonstrates that the resulting closed-loop system, in the absence of disturbances, is globally exponentially stable and, in the face of non-input additive disturbances, is globally mathcal{L}_p stable for any given p in [1, infty) and yields a bounded state trajectory. The simulation experiments showcase these new findings.
|
|
WeA19 |
Suite 8 |
Stochastic Systems |
Regular Session |
Chair: Tsiotras, Panagiotis | Georgia Institute of Technology |
Co-Chair: Delvenne, Jean-Charles | Universite Catholique De Louvain |
|
10:00-10:20, Paper WeA19.1 | |
>Safe Exit Controllers Synthesis for Continuous-Time Stochastic Systems |
|
Xue, Bai | Institute of Software, Chinese Academy of Sciences |
Keywords: Stochastic systems
Abstract: This paper tackles the problem of generating safe exit controllers for continuous-time systems described by stochastic differential equations (SDEs). The primary aim is to develop controllers that maximize the lower bounds of the exit probability that the system escapes from a safe but uncomfortable set within a specified time frame and guide it towards a comfortable set. The paper considers two distinct cases: one in which the boundary of the safe set is a subset of the boundary of the uncomfortable set, and the other where the boundaries of the two sets do not intersect. To begin, we present a sufficient condition for establishing lower bounds on the exit probability in the first case. This condition serves as a guideline for constructing an online linear programming problem. The linear programming problem is designed to implicitly synthesize an optimal exit controller that maximizes the lower bounds of the exit probability. The method employed in the first case is then extended to the second one. Finally, we demonstrate the effectiveness of the proposed approaches on one example.
|
|
10:20-10:40, Paper WeA19.2 | |
>Bounding Stochastic Safety: Leveraging Freedman's Inequality with Discrete-Time Control Barrier Functions |
|
Cosner, Ryan | California Institute of Technology |
Culbertson, Preston | Caltech |
Ames, Aaron D. | California Institute of Technology |
Keywords: Stochastic systems, Lyapunov methods
Abstract: When deployed in the real world, safe control methods must be robust to unstructured uncertainties such as modeling error and external disturbances. Typical robust safety methods achieve their guarantees by always assuming that the worst-case disturbance will occur. In contrast, this letter utilizes Freedman’s inequality in the context of discrete-time control barrier functions (DTCBFs) and c-martingales to provide stronger (less conservative) safety guarantees for stochastic systems. Our approach accounts for the underlying disturbance distribution instead of relying exclusively on its worst-case bound and does not require the barrier function to be upper-bounded, which makes the resulting safety probability bounds more useful for intuitive safety constraints such as signed distance. We compare our results with existing safety guarantees, such as input-to-state safety (ISSf) and martingale results that rely on Ville’s inequality. When the assumptions for all methods hold, we provide a range of parameters for which our guarantee is stronger. Finally, we present simulation examples, including a bipedal walking robot, that demonstrate the utility and tightness of our safety guarantee.
|
|
10:40-11:00, Paper WeA19.3 | |
>Leakage Rate As a Measure of Continuous-Time Stochastic Set Invariance |
|
Jang, Inkyu | Seoul National University |
Yoon, Minhyuk | Seoul National University |
Kim, H. Jin | Seoul National University |
Keywords: Stochastic systems, Markov processes, Uncertain systems
Abstract: Probabilistic set invariance is a concept that measures at which probability the given set is invariant under a stochastic process. While this concept successfully measures the level of invariance of a given set, its major limitations include excessive conservativeness and lack of consideration of the prior distribution of the state. In this paper, we introduce a modified notion of probabilistic set invariance that takes into account the starting distribution of the stochastic process. The probability of never escaping the given set, i.e., the survival rate, is defined as a function of the initial distribution and the length of the time window, and its infinitesimal-time behavior is analyzed. For Itˆo diffusion processes, we find that the decaying rate of survival rate, which we call the leakage rate, can be analytically evaluated through a surface integral formula on the boundary of the set given the probability distribution of state and the update rule. We validate the formula through a numerical example, in which the simulation result well matches the analytical prediction.
|
|
11:00-11:20, Paper WeA19.4 | |
>Predicting State Transitions in Autonomous Nonlinear Bistable Systems with Hidden Stochasticity |
|
Van Brandt, Léopold | UCLouvain |
Delvenne, Jean-Charles | Universite Catholique De Louvain |
Keywords: Stochastic systems, Stability of nonlinear systems, Autonomous systems
Abstract: Bistable autonomous systems can be found inmany areas of science. When the intrinsic noise intensity is large, these systems exhibits stochastic transitions from onemetastable steady state to another. In electronic bistable memories, these transitions are failures, usually simulated in a Monte-Carlo fashion at a high CPU-time price. Existing closed-form formulas, relying on near-stable-steady-state approximations of the nonlinear system dynamics to estimate the mean transition time, have turned out inaccurate. Our contribution is twofold. From a unidimensional stochastic model of overdamped autonomous systems, we propose an extended Eyring-Kramers analytical formula accounting for both nonlinear drift and state-dependent white noise variance, rigorously derived from Itô stochastic calculus. We also adapt it to practical system engineering situations where the intrinsic noise sources are hidden and can only be inferred from the fluctuations of observables measured in steady states. First numerical trials on an industrial electronic case study suggest that our approximate prediction formula achieve remarkable accuracy, outperforming previous non-Monte-Carlo approaches.
|
|
11:20-11:40, Paper WeA19.5 | |
>Forward Reachability for Discrete-Time Nonlinear Stochastic Systems Via Mixed-Monotonicity and Stochastic Order |
|
Sivaramakrishnan, Vignesh | University of New Mexico |
Devonport, Rosalyn Alex | University of New Mexico |
Arcak, Murat | University of California, Berkeley |
Oishi, Meeko | University of New Mexico |
Keywords: Stochastic systems, Computational methods
Abstract: We present a method to overapproximate forward stochastic reach sets of discrete-time, stochastic nonlinear systems with interval geometry. This is made possible by extending the theory of mixed-monotone systems to incorporate stochastic orders, and a concentration inequality result that lower-bounds the probability the state resides within an interval through a monotone mapping. Then, we present an algorithm to compute the overapproximations of forward reachable set and the probability the state resides within it. We present our approach on two aerospace examples to show its efficacy.
|
|
11:40-12:00, Paper WeA19.6 | |
>Safety Barrier Certificates for Stochastic Control Systems with Wireless Communication Networks |
|
Akbarzadeh, Omid | Newcastle University |
Soudjani, Sadegh | Newcastle University |
Lavaei, Abolfazl | Newcastle University |
Keywords: Stochastic systems, Formal Verification/Synthesis
Abstract: This work is concerned with a formal approach for safety controller synthesis of stochastic control systems with both process and measurement noises while considering emph{wireless communication networks} between sensors, controllers, and actuators. The proposed scheme is based on emph{control barrier certificates (CBC)}, which allows us to provide safety certificates for wirelessly-connected stochastic control systems. Despite the existing literature on designing control barrier certificates, there has been no consideration of wireless communication networks to capture potential packet losses, which is absolutely crucial in real-world applications. In our proposed setting, the key objective is to construct a CBC together with a safety controller while providing a lower bound on the satisfaction probability of the safety property over a finite time horizon. We propose a systematic approach in the form of sum-of-squares optimization and matrix inequalities for the synthesis of CBC and its associated safety controller. We demonstrate the efficacy of our approach on a permanent magnet synchronous motor under a wireless communication network.
|
|
WeA20 |
Suite 9 |
Formal Verification |
Regular Session |
Chair: Schmuck, Anne-Kathrin | MPI-SWS |
Co-Chair: Nejati, Amy | University of Newcastle |
|
10:00-10:20, Paper WeA20.1 | |
>Barrier Certificates for Weighted Automata-Based Specifications |
|
Murali, Vishnu | University of Colorado, Boulder |
Trivedi, Ashutosh | University of Colorado Boulder |
Zamani, Majid | University of Colorado Boulder |
Keywords: Formal Verification/Synthesis, Automata, Hybrid systems
Abstract: Barrier certificates, functional analogs of inductive invariants, play a fundamental role in the verification of safety for dynamical systems. The success of these certificates in safety verification has led to the investigation of their use to verify more general qualitative objectives such as those characterized by omega-regular automata. Here the certificates are used to establish a proof to ensure a set of accepting states are visited only finitely often. While omega-automata provide a reliable framework for specifying qualitative objectives such as safety, reachability, and patrolling, they are unable to capture notions of how well a system satisfies a desired property. Weighted-automata, weighted extensions to omega-automata, provide an expressive framework to describe quantitative objectives. We thus consider the problem of using barrier certificate-based approaches to verify dynamical systems against properties specified by weighted automata. Here one seeks to prove that all traces of a system have corresponding runs on the weighted automata with an aggregated cost that is greater than a fixed (a priori) threshold. We provide certificates to verify systems against weighted automata based on the choice of aggregation function. Our certificates rely on proving properties of safety, or (repeated) reachability in appropriate augmented systems. Finally, we demonstrate our approach on a case study.
|
|
10:20-10:40, Paper WeA20.2 | |
>Interval Signal Temporal Logic for Robust Optimal Control |
|
Baird, Luke | Georgia Institute of Technology |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Formal Verification/Synthesis, Computational methods, Optimal control
Abstract: We propose a robust optimal control strategy for linear systems subject to bounded disturbances constrained to satisfy a Signal Temporal Logic (STL) formula with uncertain predicates. We encode such constraints using Interval STL (I-STL), an extension of STL to interval signals and predicates that accommodates efficient numerical implementations for verification and synthesis using interval arithmetic methods. Given an I-STL constraint, a quadratic cost function, and a bounded hyper-rectangular disturbance set, we construct a second robust optimal control problem using an embedding system with double the state dimension and the same cost function such that a solution to this second problem is feasible for the original problem. Moreover, owing to the numerical efficiencies of I-STL and the embedding, the computational complexity of this problem is, at worst, approximately equivalent to solving a non-robust optimal STL synthesis problem with double the state dimension, and we solve this problem as a mixed-integer quadratic program. We present a case study of a miniature blimp modeled as a 12-dimensional linear system subject to disturbances and tasked with a mission specified in I-STL with multiple nested temporal operators.
|
|
10:40-11:00, Paper WeA20.3 | |
>Guaranteed Completion of Complex Tasks Via Temporal Logic Trees and Hamilton-Jacobi Reachability |
|
Jiang, Frank J. | Royal Institute of Technology |
Munhoz Arfvidsson, Kaj | KTH Royal Institute of Technology |
He, Chong | Simon Fraser University |
Chen, Mo | Simon Fraser University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Formal Verification/Synthesis, Autonomous systems, Robotics
Abstract: In this paper, we present an approach for guaranteeing the completion of complex tasks with cyber-physical systems (CPS). Specifically, we leverage temporal logic trees constructed using Hamilton-Jacobi reachability analysis to (1) check for the existence of control policies that complete a specified task and (2) develop a computationally-efficient approach to synthesize the full set of control inputs the CPS can implement in real-time to ensure the task is completed. We show that, by checking the approximation directions of each state set in the temporal logic tree, we can check if the temporal logic tree suffers from the "leaking corner issue," where the intersection of reachable sets yields an incorrect approximation. By ensuring a temporal logic tree has no leaking corners, we know the temporal logic tree correctly verifies the existence of control policies that satisfy the specified task. After confirming the existence of control policies, we show that we can leverage the value functions obtained through Hamilton-Jacobi reachability analysis to efficiently compute the set of control inputs the CPS can implement throughout the deployment time horizon to guarantee the completion of the specified task. Finally, we use a newly released Python toolbox to evaluate the presented approach on a simulated driving task.
|
|
11:00-11:20, Paper WeA20.4 | |
>Decentralized Control of Multi-Agent Systems under Acyclic Spatio-Temporal Task Dependencies |
|
Marchesini, Gregorio | KTH Royal Institute of Technology |
Liu, Siyuan | KTH Royal Institute of Technology |
Lindemann, Lars | University of Southern California |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Formal Verification/Synthesis, Distributed control, Agents-based systems
Abstract: We introduce a novel distributed sampled-data control method tailored for heterogeneous multi-agent systems under a global spatio-temporal task with acyclic dependencies. Specifically, we consider the global task as a conjunction of independent and collaborative tasks, defined over the absolute and relative states of agent pairs. Task dependencies in this form are then represented by a task graph, which we assume to be acyclic. From the given task graph, we provide an algorithmic approach to define a distributed sampled-data controller prioritizing the fulfilment of collaborative tasks as the primary objective, while fulfilling independent tasks unless they conflict with collaborative ones. Moreover, communication maintenance among collaborating agents is seamlessly enforced within the proposed control framework. A numerical simulation is provided to showcase the potential of our control framework.
|
|
11:20-11:40, Paper WeA20.5 | |
>How Deduction Systems Can Help You to Verify Stability Properties |
|
Gleirscher, Mario | University of Bremen |
Massoud, Rehab | EPFL Lausanne |
Hutter, Dieter | Deutsches Forschungszentrum Für Künstliche Intelligenz |
Lüth, Christoph | Deutsches Forschungszentrum Für Künstliche Intelligenz (DFKI) |
Keywords: Formal Verification/Synthesis, Stability of nonlinear systems, Hybrid systems
Abstract: Mathematical proofs are a cornerstone of control theory, and it is crucial to get them right. Deduction systems can help with this by mechanically checking the proofs. However, the structure and level of detail of mechanized proofs can differ vastly from manual proofs by mathematicians and engineers, hampering understanding and adoption of these systems. This paper helps bridging the gap between manual and machine-checked proofs by presenting a mechanized stability proof using Lyapunov's theory in a human-readable way and wrapping it into a workflow. The proof structure is analyzed and potential benefits of mechanization are discussed, such as generalizability, reusability, and increased trust in correctness.
|
|
11:40-12:00, Paper WeA20.6 | |
>Reactive Synthesis of Stochastic Control Systems: A Mode-Triggered Safety Barrier Approach |
|
Nejati, Amy | Newcastle University |
Schmuck, Anne-Kathrin | MPI-SWS |
Keywords: Formal Verification/Synthesis, Stochastic systems
Abstract: This paper develops a formal framework to systematically synthesize a controller for continuous-time nonlinear stochastic control systems which react to specification-mode changes, initiated by either the external environment or the system itself. Considering the particular challenging setting where mode switches affect the specification rather than the dynamics, our proposed scheme adopts a synthesis approach based on control barrier certificates to synthesize controllers that ensure compliance with mode-triggered safety specifications. Our method leverages the computational capabilities derived from state-space control techniques and combines them with the reactivity of logic control. We provide a robotic case study to illustrate the effectiveness of our proposed approach.
|
|
WeLuSP |
Amber 1 |
Mitsubishi Electric Research Labs: Fundamental Research with Real-World
Impact |
Special Session |
Chair: Di Cairano, Stefano | Mitsubishi Electric Research Labs |
|
12:00-13:30, Paper WeLuSP.1 | |
Mitsubishi Electric Research Labs: Fundamental Research with Real-World Impact (I) |
|
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Berntorp, Karl | Walmart Advanced Systems Robotics |
P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Weiss, Avishai | Mitsubishi Electric Research Labs |
Keywords:
Abstract: Mitsubishi Electric Research Laboratories (MERL) is a leading research organization that conducts fundamental research for industrially motivated problems. MERL is a subsidiary of Mitsubishi Electric Corporation, a global manufacturer of a wide range of products including robots, automotive, HVAC, factory automation, electrical systems, and space systems. MERL researchers collaborate with corporate laboratories and academic partners from around the world to develop novel solutions to challenging problems. In this talk we present an overview of control-related research activities at MERL. We discuss fundamental research including model predictive control and control of constrained systems, estimation and motion planning for autonomous systems, real-time optimization and integration of learning and control. Then, we describe how these fundamental research areas have impacted real world applications and products such as automated vehicles, drones, spacecraft, robots and navigation systems. Students interested in internship and staff positions, and faculty interested in exchange of ideas and collaborations including potential sabbatical stays are encouraged to attend. *The lunch sessions are scheduled to start at 12:10 and conclude at 13:10. This timing is intended to allow for a smooth transition between the morning and afternoon sessions, giving participants sufficient time to navigate between consecutive events.*
|
|
WeB01 |
Auditorium |
Bringing Quantum Systems under Control |
Tutorial Session |
Chair: Berberich, Julian | University of Stuttgart |
Co-Chair: Kosut, Robert L. | SC Solutions, Inc |
Organizer: Berberich, Julian | University of Stuttgart |
Organizer: Kosut, Robert L. | SC Solutions, Inc |
Organizer: Schulte-Herbrueggen, Thomas | Technichal University Munich (TUM) |
|
13:30-14:10, Paper WeB01.1 | |
>Bringing Quantum Systems under Control: A Tutorial Invitation to Quantum Computing and Its Relation to Bilinear Control Systems (I) |
|
Berberich, Julian | University of Stuttgart |
Kosut, Robert L. | SC Solutions, Inc |
Schulte-Herbrueggen, Thomas | Technichal University Munich (TUM) |
Keywords: Quantum information and control, Robust control, Algebraic/geometric methods
Abstract: Quantum computing comes with the potential to push computational boundaries in various domains including, e.g., cryptography, simulation, optimization, and machine learning. Exploiting the principles of quantum mechanics, new algorithms can be developed with capabilities that are unprecedented by classical computers. However, the experimental realization of quantum devices is an active field of research with enormous open challenges, including robustness against noise and scalabilty. While systems and control theory plays a crucial role in tackling these challenges, the principles of quantum physics lead to a (perceived) high entry barrier for entering the field of quantum computing. This tutorial paper aims at lowering the barrier by introducing basic concepts required to understand and solve research problems in quantum systems. First, we introduce fundamentals of quantum algorithms, ranging from basic ingredients such as qubits and quantum logic gates to prominent examples and more advanced concepts, e.g., variational quantum algorithms. Next, we formalize some engineering questions for building quantum devices in the real world, which requires the careful manipulation of microscopic quantities obeying quantum effects. To this end for N-level systems, we introduce basic concepts of (bilinear) quantum systems and control theory including controllability, observability, and optimal control in a unified frame. Finally, we address the problem of noise in real-world quantum systems via robust quantum control, which relies on a set-membership uncertainty description frequently employed in control. A key goal of this tutorial paper is to demystify engineering aspects of quantum computing by emphasizing that its mathematical description mainly involves linear algebra (for quantum algorithms) and the handling of bilinear control systems (for quantum systems and control theory) but does not require too much detailed knowledge of quantum physics.
|
|
14:10-14:50, Paper WeB01.2 | |
A Unified Approach to Quantum Systems Theory and Control Engineering (I) |
|
Schulte-Herbrueggen, Thomas | Technichal University Munich (TUM) |
Keywords: Quantum information and control, Optimization, Robust control
Abstract: In emerging quantum technologies, quantum optimal control is often key to unlock the full potential of experimental set-ups. Typical engineering questions concerning controllability, reachability, and observability will be addressed for the broad class of finite-dimensional quantum control systems. Most often they take the form of bilinear control systems well known from the classical realm. For quantum engineering, our Lie frame of quantum systems theory provides full symmetry assessment of controllability, reachability and accessibility. In view of quantum sensing, we apply the same tools to observability and tomographiability. We see which symmetries to break to get a better handle both on the preparation and the detection of states. Principles are put into practice by interfacing to numerical quantum optimal control. Worked examples elucidate simple symmetry guidelines for quantum technologies 2.0.
|
|
14:50-15:30, Paper WeB01.3 | |
Robust Quantum Control (I) |
|
Kosut, Robert L. | SC Solutions, Inc |
Keywords: Quantum information and control, Optimization, Robust control
Abstract: For a quantum computer that is ideally composed of a temporal sequence of unitary logic gates, robustness means that at a specified time, despite imperfections or perturbations, each actual gate operation is as close as possible to its ideal. To do this we will need a particular knowledge about the system. Borrowing ideas from the control community, we focus on an uncertainty model class which is suitable for robust quantum control design: set membership Hamiltonian uncertainty. In this section we expand upon the principles supporting this emerging approach to robust quantum control. Using the method of averaging we find a limit on robust performance, i.e., a fidelity error bound versus a bound on physical uncertainty.
|
|
WeB02 |
Amber 1 |
Learning, Optimization, and Game Theory II |
Invited Session |
Chair: Doan, Thinh T. | University of Texas at Austin |
Co-Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Doan, Thinh T. | University of Texas at Austin |
Organizer: Sayin, Muhammed Omer | Bilkent University |
|
13:30-13:50, Paper WeB02.1 | |
>Nash Equilibrium Seeking for Noncooperative Duopoly Games Via Event-Triggered Control (I) |
|
Rodrigues, Victor Hugo Pereira | Federal University of Rio De Janeiro (UFRJ) |
Oliveira, Tiago Roux | State University of Rio De Janeiro |
Krstic, Miroslav | University of California, San Diego |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Extremum seeking, Game theory, Optimization
Abstract: This paper proposes a novel approach for locally stable convergence to Nash equilibrium in duopoly noncooperative games based on a distributed event-triggered control scheme. The proposed approach employs extremum seeking, with sinusoidal perturbation signals applied to estimate the Gradient (first derivative) of unknown quadratic payoff functions. This is the first instance of noncooperative games being tackled in a model-free fashion integrated with the event-triggered methodology. Each player evaluates independently the deviation between the corresponding current state variable and its last broadcasted value to update the player action, while they preserve control performance under limited bandwidth of the actuation paths and still guarantee stability for the closed-loop dynamics. In particular, the stability analysis is carried out using time-scaling technique, Lyapunov's direct method and averaging theory for discontinuous systems. We quantify the size of the ultimate small residual sets around the Nash equilibrium and illustrate the theoretical results numerically on an example.
|
|
13:50-14:10, Paper WeB02.2 | |
>Filterless Fixed-Time Extremum Seeking for Scalar Quadratic Maps (I) |
|
Tang, Michael | University of California, San Diego |
Krstic, Miroslav | University of California, San Diego |
Poveda, Jorge I. | University of California, San Diego |
Keywords: Optimization, Adaptive control, Extremum seeking
Abstract: In this paper, we study a novel fixed-time extremum-seeking algorithm that eliminates the need for filters to obtain an appropriate estimation of the gradient of a static map for optimization problems where the cost function is available only via measurements or evaluations. Previous research leveraged these filters to facilitate the application of averaging theory in analyzing the stability properties of the system. Specifically, they were employed to separate, using multi-time scale techniques, the non-smooth terms of the algorithms from the rapidly fluctuating oscillatory terms associated with periodic dithers. This separation was achieved through a singular perturbation argument, where the filter acted as boundary layer system with a sufficiently fast transient. However, since in many practical applications such transient cannot be made arbitrarily fast, and since classic extremum-seeking algorithms are also known to be stable even in the absence of filters, it is natural to ask whether the fixed-time extremum-seeking dynamics can also be simplified by removing the filters while achieving semi-global practical fixed-time convergence properties. This paper addresses this question for scalar quadratic cost functions, providing positive and negative answers depending on the structure of the cost. Additionally, we demonstrate that removing the filters results in average dynamics distinct from the conventional fixed-time gradient flow dynamics found in existing literature. Furthermore, we provide numerical examples to illustrate our findings.
|
|
14:10-14:30, Paper WeB02.3 | |
>Generalizing Better Response Paths and Weakly Acyclic Games (I) |
|
Yongacoglu, Bora | University of Toronto |
Arslan, Gurdal | University of Hawaii at Manoa |
Pavel, Lacra | University of Toronto |
Yuksel, Serdar | Queen's University |
Keywords: Game theory
Abstract: Weakly acyclic games generalize potential games and are fundamental to the study of game theoretic control. In this paper, we present a generalization of weakly acyclic games, and we observe its importance in multi-agent learning when agents employ experimental strategy updates in periods where they fail to best respond. While weak acyclicity is defined in terms of path connectivity properties of a game’s better response graph, our generalization is defined using a generalized better response graph. We provide sufficient conditions for this notion of generalized weak acyclicity in both two-player games and n-player games. To demonstrate that our generalization is not trivial, we provide examples of games admitting a pure Nash equilibrium that are not generalized weakly acyclic. The generalization presented in this work is closely related to the recent theory of satisficing paths, and the counterexamples presented here constitute the first negative results in that theory.
|
|
14:30-14:50, Paper WeB02.4 | |
>Embedding Learning-Based Optimal Controllers with Assured Safety (I) |
|
Fotiadis, Filippos | Georgia Institute of Technology |
Rovithakis, George A. | Aristotle University of Thessaloniki |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Neural networks, Robust control, Optimal control
Abstract: We consider an off-policy reinforcement learning algorithm that gathers input and state data from a nonlinear system, and uses them to approximate the infinite-horizon optimal control for that system. However, as this algorithm relies on neural networks, its convergence depends on restrictive assumptions regarding the underlying neural network structure. Moreover, the derived approximate optimal controller that it yields is stabilizing only within a compact set Omega of the state space, leading to significant issues of robustness and safety for real-world implementations. Motivated by this, to increase the robustness and safety guarantees of controllers obtained by off-policy reinforcement learning procedures, we combine them with a novel textit{safety net}. The safety net is minimally interfering, leaving the approximate optimal controller unaltered within the compact set Omega in which it is valid and stabilizing. On the other hand, the safety net interferes with the approximate optimal controller whenever the set Omega is violated, so as to guarantee the boundedness and integrity of the closed loop. Since the proposed net is model-agnostic yet learning-free, it provides, for the first time, hard guarantees of safety, established by rigorous theoretical analysis and subsequently verified in simulations.
|
|
14:50-15:10, Paper WeB02.5 | |
>Distributed Target Tracking under Partial Feedback Using Lyapunov-Based Deep Neural Networks (I) |
|
Nino, Cristian F. | University of Florida |
Patil, Omkar Sudhir | University of Florida |
Edwards, Sage | University of Florida |
Bell, Zachary I. | Air Force |
Dixon, Warren E. | University of Florida |
Keywords: Networked control systems, Stability of nonlinear systems, Adaptive control
Abstract: The target tracking problem is addressed for multi-agent systems where the target state information is only partially available to the agents via a heterogeneous measurement model. A necessary and sufficient condition, termed trackability, is provided, to indicate the feasibility for tracking a target with partial measurements. A Lyapunov-based deep neural network (Lb-DNN) adaptive controller is developed to achieve target tracking, under the trackability condition, by adaptively compensating for the uncertainty stemming from the unknown target dynamics. A Lyapunov-based stability analysis is provided to guarantee exponential target state estimation and tracking within a neighborhood of the target state.
|
|
15:10-15:30, Paper WeB02.6 | |
>Incentive Design in Noncooperative Dynamical Systems for Social Welfare Maximization Using Limited Payoff Information |
|
Yan, Yuyue | Osaka University |
Ye, Maojiao | Nanjing University of Science and Technology |
Hashimoto, Kazumune | Osaka University |
Wu, Yuhu | Dalian University of Technology |
Kong, He | Southern University of Science and Technology |
Keywords: Game theory, Agents-based systems, Finance
Abstract: In this paper, an incentive mechanism is developed to maximize the weighted social welfare for pseudo-gradient- based noncooperative dynamical systems. In the proposed approach, the system manager manipulates agents’ decision making by collecting taxes and giving subsidies to the agents un- der a sustainable budget constraint using the partial knowledge related to the payoff structure from the agents except a secret agent. To avoid direct acquisition of the payoff information of all the agents, our idea is to shift the stationary points of those partial agents and hence indirectly move the Nash equilibrium of the entire system. Sufficient conditions are derived under which the Nash equilibrium is shifted towards the target state achieving Pareto efficiency under the incentive mechanism without knowing any payoff information of the secret agent. Some convergent conditions to ensure agents’ state converge to the target state under a sustainable budget are provided for the given initial state. We present a numerical example to illustrate the efficacy of our results.
|
|
WeB03 |
Amber 2 |
Complex Networks - Analysis and Control |
Invited Session |
Chair: Caiza, Jose | Purdue University |
Co-Chair: Ye, Mengbin | Curtin University |
Organizer: Gracy, Sebin | South Dakota School of Mines and Technology |
Organizer: Caiza, Jose | Purdue University |
Organizer: Pare, Philip E. | Purdue University |
|
13:30-13:50, Paper WeB03.1 | |
>Multi-Layer Default Risk Contagion in Inter-Banking Networks (I) |
|
Bi, Xiaoqi | University of Illinois, Urbana-Champaign |
Koppel, Alec | JP Morgan Chase |
Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
Keywords: Modeling, Networked control systems, Identification
Abstract: Default risk spreading processes in inter-banking networks are commonly viewed as contagion processes, with inter-bank loans as a direct spreading channel and overlapping investment portfolios as an indirect channel. In this paper, we propose a multi-layer network default risk contagion model to incorporate additional panic contagions in the networks of depositors as a novel augmentation of previous models, allowing for the direct characterization of the “bank run” phenomenon, where many depositors simultaneously issue withdrawal requests. Our model is calibrated with post-COVID pandemic data, accounting for macroeconomic factors such as fluctuating interest rates and asset bubbles. Using system identification methods, we analyze relationships between federal interest rates and market price, and formulate an optimal control problem to mitigate default risk via liquidity ratio requirements in stress tests. Long-term simulation results are presented to reveal threshold structures under varying contagion parameters.
|
|
13:50-14:10, Paper WeB03.2 | |
>Rate of Convergence to the Disease Free Equilibrium for Multi-Population SIS Networks in the Critical Case |
|
Ye, Mengbin | Curtin University |
Anderson, Brian D.O. | Australian National University |
Keywords: Compartmental and Positive systems, Stability of nonlinear systems, Network analysis and control
Abstract: A networked version of the classical deterministic SIS epidemic model is studied. Existing results establish that convergence to an equilibrium occurs exponentially fast, except in the critical case, when the basic reproduction number is equal to 1. This paper uses nonlinear systems and center manifold theory to establish that with such a reproduction number, convergence occurs at a rate of 1/t. Numerical simulations help to illustrate the results.
|
|
14:10-14:30, Paper WeB03.3 | |
>Global and Distributed Reproduction Numbers of a Multilayer SIR Model with an Infrastructure Network (I) |
|
Caiza, Jose | Purdue University |
Qin, Junjie | Purdue University |
Pare, Philip E. | Purdue University |
Keywords: Network analysis and control, Emerging control applications, Networked control systems
Abstract: In this paper, we propose an SIR spread model in a population network coupled with an infrastructure network that has a pathogen spreading in it. We develop a threshold condition to characterize the monotonicity and peak time of a weighted average of the infection states in terms of the global (network-wide) effective reproduction number. We further define the distributed reproduction numbers (DRNs) of each node in the multilayer network which are used to provide local threshold conditions for the dynamical behavior of each entity. Furthermore, we leverage the DRNs to predict the global behavior based on the node-level assumptions. We use both analytical and simulation results to illustrate that the DRNs allow a more accurate analysis of the networked spreading process than the global effective reproduction number.
|
|
14:30-14:50, Paper WeB03.4 | |
>Multidimensional Signed Friedkin-Johnsen Model (I) |
|
Razaq, Muhammad Ahsan | Linkoping University |
Altafini, Claudio | Linkoping University |
Keywords: Network analysis and control, Networked control systems
Abstract: The multidimensional signed Friedkin-Johnsen (SFJ) model introduced in this paper describes opinion dynamics on a signed network in which the agents hold opinions on multiple interconnected topics and are allowed to be stubborn. In the paper, we establish sufficient conditions for the stability of the multidimensional SFJ model, and analyze convergence to consensus in concatenated instances of this model.
|
|
14:50-15:10, Paper WeB03.5 | |
>A Study of Three Influencer Archetypes for the Control of Opinion Spread in Time-Varying Social Networks (I) |
|
DeBuse, Michael | Brigham Young University, Provo |
Warnick, Sean | Brigham Young University |
Keywords: Agents-based systems, Time-varying systems, Control of networks
Abstract: In this work we consider the impact of information spread in time-varying social networks, where agents request to follow other agents with aligned opinions while dropping ties to neighbors whose posts are too dissimilar to their own views. Opinion control and rhetorical influence has a very long history, employing various methods including education, persuasion, propaganda, marketing, and manipulation through mis-, dis-, and mal-information. The automation of opinion controllers, however, has only recently become easily deployable at a wide scale, with the advent of large language models (LLMs) and generative AI that can translate the quantified commands from opinion controllers into actual content with the appropriate nuance. Automated agents in social networks can be deployed for various purposes, such as breaking up echo chambers, bridging valuable new connections between agents, or shaping the opinions of a target population--and all of these raise important ethical concerns that deserve serious attention and thoughtful discussion and debate. This paper attempts to contribute to this discussion by considering three archetypal influencing styles observed by human drivers in these settings, comparing and contrasting the impact of these different control methods on the opinions of agents in the network. We will demonstrate the efficacy of current generative AI for generating nuanced content consistent with the command signal from automatic opinion controllers like these, and we will report on frameworks for approaching the relevant ethical considerations.
|
|
15:10-15:30, Paper WeB03.6 | |
>Incentive Designs for Learning Agents to Stabilize Coupled Exogenous Systems (I) |
|
Certorio, Jair | University of Maryland |
Martins, Nuno C. | University of Maryland |
La, Richard J. | University of Maryland, College Park |
Arcak, Murat | University of California, Berkeley |
Keywords: Lyapunov methods, Game theory, Stability of nonlinear systems
Abstract: We consider a large population of learning agents noncooperatively selecting strategies from a common set, influencing the dynamics of an exogenous system (ES) we seek to stabilize at a desired equilibrium. Our approach is to design a dynamic payoff mechanism capable of shaping the population's strategy profile, thus affecting the ES's state, by offering incentives for specific strategies within budget limits. Employing system-theoretic passivity concepts, we establish conditions under which a payoff mechanism can be systematically constructed to ensure the global asymptotic stabilization of the ES's equilibrium. In comparison to previous approaches originally studied in the context of the so-called epidemic population games, the method proposed here allows for more realistic epidemic models and other types of ES, such as predator-prey dynamics. Stabilization is established with the support of a Lyapunov function, which provides useful bounds on the transients.
|
|
WeB04 |
Amber 3 |
Securing Cyber-Physical Resilience: Learning, Controls, and Design Insights |
Invited Session |
Chair: Nandanoori, Sai Pushpak | Pacific Northwest National Laboratory |
Co-Chair: Maruf, Abdullah Al | California State University |
Organizer: Kundu, Soumya | Pacific Northwest National Laboratory |
Organizer: Roy, Sandip | Washington State University |
Organizer: Nandanoori, Sai Pushpak | Pacific Northwest National Laboratory |
Organizer: Maruf, Abdullah Al | California State University |
|
13:30-13:50, Paper WeB04.1 | |
>How Much Reserve Fuel: Quantifying the Maximal Energy Cost of System Disturbances (I) |
|
Padmanabhan, Ram | University of Illinois Urbana-Champaign |
Bakker, Craig | Pacific Northwest National Laboratory |
Dinkar, Siddharth Abhijit | University of Illinois Urbana-Champaign |
Ornik, Melkior | University of Illinois Urbana-Champaign |
Keywords: Optimal control, Robust control, Resilient Control Systems
Abstract: Motivated by the design question of additional fuel needed to complete a task in an uncertain environment, this paper introduces metrics to quantify the maximal additional energy used by a control system in the presence of bounded disturbances when compared to a nominal, disturbance-free system. In particular, we consider the task of finite-time stabilization for a linear time-invariant system. We first derive the nominal energy required to achieve this task in a disturbance-free system, and then the worst-case energy over all feasible disturbances. The latter leads to an optimal control problem with a least-squares solution, and then an infinite-dimensional optimization problem where we derive an upper bound on the solution. The comparison of these energies is accomplished using additive and multiplicative metrics, and we derive analytical bounds on these metrics. Simulation examples on an ADMIRE fighter jet model demonstrate the practicability of these metrics, and their variation with the task hardness, a combination of the distance of the initial condition from the origin and the task completion time.
|
|
13:50-14:10, Paper WeB04.2 | |
>Resilient Platooning Control of Connected Automated Vehicles in the Presence of Cyber-Attacks (I) |
|
Gusrialdi, Azwirman | Tampere University |
Iqbal, Muhammad | Aalto University |
Qu, Zhihua | Univ. of Central Florida |
Keywords: Resilient Control Systems, Cooperative control, Attack Detection
Abstract: This paper considers the problem of platooning control of connected automated vehicles under cyber-attacks. Specifically, the attacker aims to prevent the follower vehicle from maintaining a pre-defined safe distance from its immediate predecessor by manipulating the measurement of the on-board radar and inserting bounded injections into the communication links and actuators of the follower vehicles. A novel distributed resilient control is proposed which does not require any assumptions on the number of attacks. It is shown that by appropriately designing the information being exchanged between the vehicles, the resilient control ensures that the follower vehicles converge to the leader vehicle’s velocity and constant distance between the vehicles in presence of any number of attacks. Furthermore, the proposed resilient control also enables attack detection and identification in a real-time and distributed manner. A Numerical example demonstrates the effectiveness of the proposed strategy.
|
|
14:10-14:30, Paper WeB04.3 | |
>ANOCA: AC Network-Aware Optimal Curtailment Approach for Dynamic Hosting Capacity (I) |
|
Badmus, Emmanuel | University of Vermont |
Amritanshu, Pandey | University of Vermont |
Keywords: Power systems, Distributed control, Optimization
Abstract: With exponential growth in distributed energy resources (DERs) coupled with at-capacity distribution grid infrastructure, prosumers cannot always export all extra power to the grid without violating technical limits. Consequently, a slew of dynamic hosting capacity (DHC) algorithms have emerged for optimal utilization of grid infrastructure while maximizing export from DERs. Most of these DHC algorithms utilize the concept of operating envelopes (OE), where the utility gives prosumers technical power export limits, and they are free to export power within these limits. Recent studies have shown that OE-based frameworks have drawbacks, as most develop power export limits based on convex or linear grid models. As OEs must capture extreme operating conditions, both convex and linear models can violate technical limits in practice because they approximate grid physics. However, AC models are unsuitable because they may not be feasible within the whole region of OE. We propose a new two-stage optimization framework for DHC built on three-phase AC models to address the current gaps. In this approach, the prosumers first run a receding horizon multi-period optimization to identify optimal export power setpoints to communicate with the utility. The utility then performs an infeasibility-based optimization to either accept the prosumer's request or dispatch an optimal curtail signal such that overall system technical constraints are not violated. To explore various curtailment strategies, we develop an L-one, L-two, and L-infinity norm-based dispatch algorithm with an exact three-phase AC model. We test our framework on a 1420 three-phase node meshed distribution network and show that the proposed algorithm optimally curtails DERs while guaranteeing the AC feasibility of the network.
|
|
14:30-14:50, Paper WeB04.4 | |
>Grid-Forming Storage Networks: Analytical Characterization of Damping and Design Insights (I) |
|
Chatterjee, Kaustav | Pacific Northwest National Laboratory |
Hossain, Ramij Raja | Pacific Northwest National Laboratory |
Nandanoori, Sai Pushpak | Pacific Northwest National Laboratory |
Kundu, Soumya | Pacific Northwest National Laboratory |
Sinha, Subhrajit | Pacific Northwest National Laboratory |
Baldwin, Diane | Pacific Northwest National Laboratory |
Melton, Ronald | Battelle, Pacific Northwest Division |
Keywords: Power systems, Energy systems, Smart grid
Abstract: The paper presents a theoretical study on small-signal stability and damping in bulk power systems with multiple grid-forming inverter-based storage resources. A detailed analysis is presented, characterizing the impacts of inverter droop gains and storage size on the slower eigenvalues, particularly those concerning inter-area oscillation modes. From these parametric sensitivity studies, a set of necessary conditions are derived that the design of droop gain must satisfy to enhance damping performance. The analytical findings are structured into propositions highlighting potential design considerations for improving system stability, which are illustrated via numerical studies on the IEEE 68-bus system with a grid-forming storage network.
|
|
14:50-15:10, Paper WeB04.5 | |
>Coherency-Aware Learning Control of Inverter-Dominated Grids: A Distributed Risk-Constrained Approach |
|
Kwon, Kyung-bin | Pacific Northwest National Laboratory |
Hossain, Ramij Raja | Pacific Northwest National Laboratory |
Mukherjee, Sayak | Pacific Northwest National Laboratory |
Chatterjee, Kaustav | Pacific Northwest National Laboratory |
Kundu, Soumya | Pacific Northwest National Laboratory |
Nekkalapu, Sameer | Pacific Northwest National Laboratory |
Elizondo, Marcelo Anibal | Pacific Northwest National Laboratory |
Keywords: Distributed control, Power systems, Robust control
Abstract: This paper investigates the importance of integrating the coherency knowledge for designing controllers to dampen sustained oscillations in wide-area power networks with significant penetration of inverter-interfaced resources. Coherency is a fundamental property of power systems, where time-scale separation in frequency dynamics leads to clustered behavior among generators of different groups. Large-scale penetration of inverter-driven low inertia resources replacing conventional synchronous generators (SGs) can lead to perturbation in the coherent partitioning; hence, integrating such information is of utmost importance for oscillation control designs. We present the coherency-aware design of a distributed output feedback-based reinforcement learning method that additionally incorporates risk constraints to capture the uncertainties related to net-load fluctuations. The use of domain-aware coherency information has produced improved training and oscillation performance than the coherency-agnostic control design, hence proving to be effective in controller design. Finally, we validated the proposed method with numerical experiments on the benchmark IEEE 68-bus test system.
|
|
15:10-15:30, Paper WeB04.6 | |
>Small-Signal Dynamics of Lossy Inverter-Based Microgrids for Generalized Droop Controls (I) |
|
Maruf, Abdullah Al | California State University |
Dubey, Anamika | Washington State University |
Roy, Sandip | Washington State University |
Keywords: Network analysis and control, Energy systems, Distributed control
Abstract: A network-level small-signal model is developed for lossy microgrids, which considers coupled angle and voltage dynamics of inverter-based microgrids and uses a more general framework of droop controls in the inverter. It is shown that when resistance to inductance ratios of the lines in the microgrid are consistent and differences of voltage angles across the lines are sufficiently small at the operating point, the generalized droop controls can be designed to enforce decoupling between angle dynamics and voltage dynamics. Next, structural results for the asymptotic stability of small-signal angle and voltage dynamics are given for the case when generalized droop control achieves decoupling. Simulated transient responses of a modified IEEE 9-bus system are presented to validate the theoretical findings which show the effectiveness of generalized droop controls in independently shaping the settling times of the angle and voltage responses of the lossy microgrid system.
|
|
WeB05 |
Amber 4 |
Submodularity in Control, Robotics, and Machine Learning: Applications,
Challenges, and Opportunities |
Invited Session |
Chair: Tzoumas, Vasileios | University of Michigan, Ann Arbor |
Co-Chair: Kia, Solmaz S. | University of California Irvine (UCI) |
Organizer: Tzoumas, Vasileios | University of Michigan, Ann Arbor |
Organizer: Kia, Solmaz S. | University of California Irvine (UCI) |
Organizer: Bushnell, Linda | University of Washington |
|
13:30-13:50, Paper WeB05.1 | |
>On Bounds for Greedy Schemes in String Optimization Based on Greedy Curvatures (I) |
|
Li, Bowen | Colorado State University |
Van Over, Brandon | Colorado State University |
Chong, Edwin K. P. | Colorado State University |
Pezeshki, Ali | Colorado State University |
Keywords: Optimization, Discrete event systems, Optimal control
Abstract: We consider the celebrated bound introduced by Conforti and Cornuejols (1984) for greedy schemes in submodular optimization. The bound assumes a submodular function defined on a collection of sets forming a matroid and is based on greedy curvature. We show that the bound holds for a very general class of string problems that includes maximizing submodular functions over set matroids as a special case. We also derive a bound that is computable in the sense that they depend only on quantities along the greedy trajectory. We prove that our bound is superior to the greedy curvature bound of Conforti and Cornuejols. In addition, our bound holds under a condition that is weaker than submodularity.
|
|
13:50-14:10, Paper WeB05.2 | |
>Bridging the Gap between Central and Local Decision-Making: The Efficacy of Collaborative Equilibria in Altruistic Congestion Games (I) |
|
Ferguson, Bryce L. | University of California, Santa Barbara |
Paccagnan, Dario | Imperial College London |
Pradelski, Bary S. R. | CNRS -- Centre National De La Recherche Scientifique, France |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Agents-based systems, Cooperative control
Abstract: Congestion games are popular models often used to study the system-level inefficiencies caused by selfish agents, typically measured by the price of anarchy. One may expect that aligning the agents' preferences with the system-level objective--altruistic behavior--would improve efficiency, but recent works have shown that altruism can lead to more significant inefficiency than selfishness in congestion games. In this work, we study to what extent the localness of decision-making causes inefficiency by considering collaborative decision-making paradigms that exist between centralized and distributed in altruistic congestion games. In altruistic congestion games with convex latency functions, the system cost is a super-modular function over the player's joint actions, and the Nash equilibria of the game are local optima in the neighborhood of unilateral deviations. When agents can collaborate, we can exploit the common-interest structure to consider equilibria with stronger local optimality guarantees in the system objective, e.g., if groups of k agents can collaboratively minimize the system cost, the system equilibria are the local optima over k-lateral deviations. Our main contributions are in constructing tractable linear programs that provide bounds on the price of anarchy of collaborative equilibria in altruistic congestion games. Our findings bridge the gap between the known efficiency guarantees of centralized and distributed decision-making paradigms while also providing insights into the benefit of inter-agent collaboration in multi-agent systems.
|
|
14:10-14:30, Paper WeB05.3 | |
>Performance-Guaranteed Solutions for Multi-Agent Optimal Coverage Problems Using Submodularity, Curvature, and Greedy Algorithms (I) |
|
Welikala, Shirantha | Stevens Institute of Technology |
Cassandras, Christos G. | Boston University |
Keywords: Sensor networks, Optimization, Agents-based systems
Abstract: We consider a class of multi-agent optimal coverage problems in which the goal is to determine the optimal placement of a group of agents in a given mission space so that they maximize a coverage objective that represents a blend of individual and collaborative event detection capabilities. This class of problems is extremely challenging due to the non-convex nature of the mission space and of the coverage objective. With this motivation, greedy algorithms are often used as means of getting feasible coverage solutions efficiently. Even though such greedy solutions are suboptimal, the submodularity (diminishing returns) property of the coverage objective can be exploited to provide performance bound guarantees. Moreover, we show that improved performance bound guarantees (beyond the standard (1-1/e) performance bound) can be established using various curvature measures of the coverage problem. In particular, we provide a brief review of all existing popular applicable curvature measures, including a recent curvature measure that we proposed, and discuss their effectiveness and computational complexity, in the context of optimal coverage problems. We also propose novel computationally efficient techniques to estimate some curvature measures. Finally, we provide several numerical results to support our findings and propose several potential future research directions.
|
|
14:30-14:50, Paper WeB05.4 | |
>Optimality Gap of Decentralized Submodular Maximization under Probabilistic Communication (I) |
|
Vendrell Gallart, Joan | University of California Irvine |
Kia, Solmaz S. | University of California Irvine (UCI) |
Keywords: Optimization, Optimization algorithms, Agents-based systems
Abstract: This paper considers the problem of decentralized submodular maximization subject to partition matroid constraint using a sequential greedy algorithm with probabilistic inter-agent message-passing. We propose a communication-aware framework where the probability of successful communication between connected devices is considered. Our analysis introduces the notion of the probabilistic optimality gap, highlighting its potential influence on determining the message-passing sequence based on the agent's broadcast reliability and strategic decisions regarding agents that can broadcast their messages multiple times in a resource-limited environment. This work not only contributes theoretical insights but also has practical implications for designing and analyzing decentralized systems in uncertain communication environments. A numerical example demonstrates the impact of our results.
|
|
14:50-15:10, Paper WeB05.5 | |
>Performance-Aware Self-Configurable Multi-Agent Networks: A Distributed Submodular Approach for Simultaneous Coordination and Network Design (I) |
|
Xu, Zirui | University of Michigan |
Tzoumas, Vasileios | University of Michigan, Ann Arbor |
Keywords: Optimization, Distributed control, Control of networks
Abstract: We introduce the first, to our knowledge, rigorous approach that enables multi-agent networks to self-configure their communication topology to balance the trade-off between scalability and optimality during multi-agent planning. We are motivated by the future of ubiquitous collaborative autonomy where numerous distributed agents will be coordinating via agent-to-agent communication to execute complex tasks such as traffic monitoring, event detection, and environmental exploration. But the explosion of information in such large-scale networks currently curtails their deployment due to impractical decision times induced by the computational and communication requirements of the existing near-optimal coordination algorithms. To overcome this challenge, we present the AlterNAting COordination and Network-Design Algorithm (Anaconda), a scalable algorithm that also enjoys near-optimality guarantees. Subject to the agents’ bandwidth constraints, Anaconda enables the agents to optimize their local communication neighborhoods such that the action-coordination approximation performance of the network is maximized. Compared to the state of the art, Anaconda is an anytime self-configurable algorithm that quantifies its suboptimality guarantee for any type of network, from fully disconnected to fully centralized, and that, for sparse networks, is one order faster in terms of decision speed. To develop the algorithm, we quantify the suboptimality cost due to decentralization, i.e., due to communication-minimal distributed coordination. We also employ tools inspired by the literature on multi-armed bandits and submodular maximization subject to cardinality constraints. We demonstrate Anaconda in simulated scenarios of area monitoring and compare it with a state-of-the-art algorithm.
|
|
15:10-15:30, Paper WeB05.6 | |
>Multi-Robot Planning for Filming Groups of Moving Actors Leveraging Submodularity and Pixel Density (I) |
|
Hughes, Skyler | New Mexico Institute of Mining and Technology |
Martin, Rebecca | Carnegie Mellon University |
Corah, Micah | Colorado School of Mines |
Scherer, Sebastian | Carnegie Mellon University |
Keywords: Robotics, Optimization algorithms, Aerospace
Abstract: Observing and filming a group of moving actors with a team of aerial robots is a challenging problem that combines elements of multi-robot coordination, coverage, and view planning. A single camera may observe multiple actors at once, and a robot team may observe individual actors from multiple views. As actors move about, groups may split, merge, and reform, and robots filming these actors should be able to adapt smoothly to such changes in actor formations. Rather than adopt an approach based on explicit formations or assignments, we propose an approach based on optimizing views directly. We model actors as moving polyhedra and compute approximate pixel densities for each face and camera view. Then, we propose an objective that exhibits diminishing returns as pixel densities increase from repeated observation. This gives rise to a multi-robot perception planning problem that we solve via a combination of value iteration and greedy submodular maximization. We evaluate our approach on challenging scenarios modeled after various social behaviors and featuring different numbers of robots and actors and observe that robot assignments and formations arise implicitly given the movements of groups of actors. Simulation results demonstrate that our approach consistently outperforms baselines, and in addition to performing well with the planner's approximation of pixel densities our approach also performs comparably for evaluation based on rendered views. Overall, the multi-round variant of the sequential planner we propose meets (within 1%) or exceeds formation and assignment baselines in all scenarios.
|
|
WeB06 |
Amber 5 |
Agent-Based Systems II |
Regular Session |
Chair: Garin, Federica | Inria |
Co-Chair: Bonnet, Benoît | CentraleSupelec |
|
13:30-13:50, Paper WeB06.1 | |
>Exponential Consensus Formation in Time-Varying Multiagent Systems Via Compactification Methods |
|
Bonnet-Weill, Benoît | CentraleSupelec |
Sigalotti, Mario | INRIA Paris |
Keywords: Agents-based systems, Cooperative control, Time-varying systems
Abstract: In this article, we establish exponential contraction results for the diameter and variance of general first-order multiagent systems. Our approach is based on compactification techniques, and works under rather mild assumptions. Namely, we posit that either the scrambling coefficient, or the algebraic connectivity of the averaged interaction graphs of the system over all time windows of a given length are uniformly positive.
|
|
13:50-14:10, Paper WeB06.2 | |
>Exploiting Over-The-Air Consensus for Collision Avoidance and Formation Control in Multi-Agent Systems |
|
Epp, Michael | Technische Universität Berlin |
Molinari, Fabio | TU Berlin |
Raisch, Joerg | Technical University Berlin |
Keywords: Agents-based systems, Decentralized control, Autonomous systems
Abstract: This paper introduces a distributed control method for multi-agent robotic systems employing Over the Air Consensus (OtA-Consensus). Designed for agents with decoupled single-integrator dynamics, this approach aims at efficient formation achievement and collision avoidance. As a distinctive feature, it leverages OtA's ability to exploit interference in wireless channels, a property traditionally considered a drawback, thus enhancing communication efficiency among robots. An analytical proof of asymptotic convergence is established for systems with time-varying communication topologies represented by sequences of strongly connected directed graphs. Comparative evaluations demonstrate significant efficiency improvements over current state-of-the-art methods, especially in scenarios with a large number of agents.
|
|
14:10-14:30, Paper WeB06.3 | |
>Image-Based Visual Relative Information for Distributed Rigid Formation Control in 3D Space |
|
Rosa, Muhammad Ridho | University of Groningen |
Berkel, Aymar | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Keywords: Agents-based systems, Distributed control, Cooperative control
Abstract: Image-based visual relative information (IBVR) for distributed rigid formation control of agents moving in 3D space is proposed for single-integrator agents. The IBVR approach is based solely on the local information of the neighbors' visible area, of neighbors' local coordinates in the image plane, and of the camera parameters in order to achieve and maintain rigid formation distributedly. Each agent is represented as a spherical agent and modeled as a single integrator. We introduce a rigid formation framework based on the image-based visual information and subsequently use it to design the gradient-based distributed formation control in 3D space.
|
|
14:30-14:50, Paper WeB06.4 | |
>Opinion Dynamics on Signed Graphs and Graphons: Beyond the Piece-Wise Constant Case |
|
Prisant, Raoul | Gipsa-Lab, Université Grenoble Alpes |
Garin, Federica | Inria |
Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
Keywords: Agents-based systems, Large-scale systems, Network analysis and control
Abstract: In this paper we make use of graphon theory to study opinion dynamics on large undirected networks. The opinion dynamics models that we take into consideration allow for negative interactions between the individuals, i.e. competing entities whose opinions can grow apart. We consider both the repelling model and the opposing model that are studied in the literature. We define the repelling and the opposing dynamics on graphons and we show that their initial value problem’s solutions exist and are unique. We then show that the graphon dynamics well approximate the dynamics on large graphs that converge to a graphon. This result applies to large random graphs that are sampled according to a graphon. All these facts are illustrated in an extended numerical example.
|
|
14:50-15:10, Paper WeB06.5 | |
>A Probabilistic Topology Inference Method for Networked Dynamical System Via Single Excitation |
|
Jiao, Qing | Shanghai Jiao Tong University |
Li, Yushan | Shanghai Jiao Tong University |
He, Jianping | Shanghai Jiao Tong University |
Keywords: Agents-based systems, Network analysis and control, Networked control systems
Abstract: A growing number of works have investigated inferring the topology of networked dynamical systems from observations, such as to better understand the system behaviour. Despite the tremendous advances, most of them require the observations to be abundant. This paper focuses on inferring the topology by injecting single excitation on a node and collecting several steps of noisy observations. The problem is challenging because the noises cannot be depressed in several observations and are mixed with the injected excitation, making it hard to directly reveal the topology. To practice, we develop a probabilistic method based on the hypothesis test framework. First, we infer the neighbors that are within h-hop of the excited node and derive the accuracy guarantees. Then, we extend the method to infer the exact h-hop neighbors. A computable lower bound for the accuracy probability is established to provide confidence support in the inference procedures. Furthermore, we give the conditions of excitation input to ensure a desired inference probability, which provides guidance for the input design. Numerical simulations are conducted to verify the effectiveness of the proposed method.
|
|
15:10-15:30, Paper WeB06.6 | |
>Low Complexity Convergence Rate Bounds for the Synchronous Gossip Subclass of Push-Sum Algorithms |
|
Gerencsér, Balázs | HUN-REN Alfréd Rényi Institute of Mathematics |
Kornyik, Miklos | HUN-REN Alfréd Rényi Institute of Mathematics |
Keywords: Agents-based systems, Numerical algorithms, Network analysis and control
Abstract: We develop easily accessible quantities for bounding the almost sure exponential convergence rate of push-sum algorithms. We analyze the scenario of i.i.d. synchronous gossip, every agent communicating independently towards at most a single target at every step. Multiple bounding expressions are developed depending on the generality of the setup, all functions of the network's spectrum. Numerical experiments demonstrate the quality of the bounds obtained together with the computational speedup for acquiring them.
|
|
WeB07 |
Amber 6 |
Cooperative Control I |
Regular Session |
Chair: Reiffers-Masson, Alexandre | IMT Atlantique |
Co-Chair: Kechagias, Andreas | Aristotle University of Thessaloniki |
|
13:30-13:50, Paper WeB07.1 | |
>Cooperative Protection Control for UCAV Swarms in Hostile Environments |
|
De Charentenay, Stanislas | IMT Atlantique, LAB-STICC Laboratory Brest, France |
Reiffers-Masson, Alexandre | IMT Atlantique |
Coppin, Gilles | IMT Atlantique / Lab-STICC |
Keywords: Cooperative control, Adaptive control, Markov processes
Abstract: Swarms of Unmanned Combat Aerial Vehicles (UCAVs) are efficient in various tasks. However, they evolve in hostile environments with risks of destruction during their flight. To mitigate this risk, it is known that cooperative behaviour can be used to enhance the protection within the swarm. The goal of this paper is to design efficient algorithms to guide the overall swarm to a given target while minimizing the risk of destruction of the member of the swarm. First, a new model, based on a controlled Markov chain, is derived to capture this cooperative swarm effect on the destruction threat of each member of the swarm. Then, an algorithm combining path planning to guide the overall swarm and local individual control to optimize the formation is suggested to help a swarm to reach a target before the destruction of all UCAVs. We evaluate our approach using numerical experiments.
|
|
13:50-14:10, Paper WeB07.2 | |
>Prescribed Performance Output Synchronization for Heterogeneous Uncertain Nonlinear Multi-Agent Systems under Directed Switching Graphs |
|
Kechagias, Andreas | Aristotle University of Thessaloniki |
Boumpoulis, Thomas | Aristutle University of Thessaloniki |
Rovithakis, George A. | Aristotle University of Thessaloniki |
Keywords: Cooperative control, Agents-based systems, Uncertain systems
Abstract: In this work, we consider the problem of designing a distributed, output synchronization protocol, for heterogeneous, high-order, uncertain, multi-agent systems in Brunovsky canonical form, operated in a leader-follower scenario. The underlying communication graph is directed and switching. The proposed controller, which is of low-complexity, enforces prescribed performance bounds on the convergence time and the output synchronization accuracy. The leader dynamics are unknown. Each following agent requires relative output information from its neighbors, as well as, measuring its own state. The theoretical findings are highlighted via simulation studies.
|
|
14:10-14:30, Paper WeB07.3 | |
>Cluster Consensus Problem on Time-Varying Higher-Order Interaction Networks |
|
Wei, Haoyu | Shanghai Jiao Tong University |
Pan, Lulu | Shanghai Jiao Tong University |
Shao, Haibin | Shanghai Jiao Tong University |
Li, Dewei | Shanghai Jiao Tong University |
Yu, Wenbin | Shanghai Jiao Tong University |
Keywords: Cooperative control, Agents-based systems, Network analysis and control
Abstract: This paper examines the cluster consensus problem of higher-order interaction networks on switching networks. The matrix-valued inter-agent coupling mechanism is employed to capture the higher-order interactions amongst neighboring agents. Specifically, for a higher-order interaction network with both positive definite and positive semi-definite coupling matrices, we examine the scenario in which the nullspace of positive semi-definite coupling matrices is constructed by a set of orthogonal basis vectors. From the perspective of the lower-order component graph –the projection of network structure onto the direction of each orthogonal basis– results on the explicit characterization of cluster consensus of higher-order interaction networks are provided. Moreover, for time-varying higher-order interaction networks, necessary and sufficient conditions are provided to ensure that a time-varying network shares the same steady-state values as the integral time-invariant network. Simulation results are provided to demonstrate the theory.
|
|
14:30-14:50, Paper WeB07.4 | |
>Scalable, Pairwise Collaborations in Heterogeneous Multi-Robot Teams |
|
Nguyen, Alexander A. | University of California, Irvine |
Guerrero-Bonilla, Luis | Instituto Tecnologico Y De Estudios Superiores De Monterrey |
Jabbari, Faryar | Univ. of California at Irvine |
Egerstedt, Magnus | University of California, Irvine |
Keywords: Cooperative control, Automata, Agents-based systems
Abstract: This paper introduces a finite state machine (FSM) for encoding collaborative interactions among robots. The resulting novel architecture is particularly designed with heterogeneous multi-robot teams in mind, where pairwise collaborative arrangements can result in new capabilities for the participants. To ensure scalability, the proposed FSM's complexity does not depend on the overall team size for individuals' decisions. Additionally, we explore various selection strategies to facilitate the pairing of robots and demonstrate the framework's efficacy on a team of mobile robots with tasks requiring collaboration for their successful completion.
|
|
14:50-15:10, Paper WeB07.5 | |
>A Trust-Based Human-Vehicle Co-Driving System with Model Free Adaptive Dynamic Programming Controller |
|
Shi, Yingkui | Shanghai Jiao Tong University |
Ge, Sicheng | Shanghai Jiao Tong University |
Zhao, Jing | University of Macau |
Hu, Chuan | Shanghai Jiao Tong University |
Zhang, Xi | Shanghai Jiao Tong University |
Keywords: Cooperative control, Autonomous vehicles, Optimal control
Abstract: Trust is one of the crucial factors influencing the performance and safety of human-vehicle co-driving system. The evolution mechanism of human-vehicle trust is studied in this work, and a trust-based steering control model (SCM) is designed to allow the autonomous driving system to adjust its control behavior based on real-time trust level, on the purpose of improve the efficiency and trust. The contributions made in this paper are as follows: 1) a novel quantitative model of human-vehicle dynamic trust is established for the first time in shared steering control by considering the deviation of human-vehicle driving expectations; 2) a trust-based steering controller using model free adaptive dynamic programming (MFADP) is designed which can solve the optimal control policy according to the value of trust without the dependencies on parameters of dynamic model of the controlled system. The rationality of proposed trust model and trust-based steering control method are validated by high-fidelity Carsim-Simulink simulations.
|
|
15:10-15:30, Paper WeB07.6 | |
>Multi-Aerial Pursuit of an Intruder Drone Using a Behavioral Approach Based on Energy |
|
Cariño Escobar, Jossué | ONERA |
Castillo, Pedro | Univ De Technologie De Compiegne |
Vidolov, Boris | Universite De Technologie De Compiegne |
Lozano, Rogelio | University De Technologie |
Keywords: Cooperative control, Autonomous vehicles, Robust control
Abstract: The pursuit-evasion is a model describing many practical problems and applications in the context of Unmanned Aerial Vehicles (UAVs). In this paper, we propose a solution based on a multi-agent scheme with robustness that is able to capture agile intruders. Our solution uses a control scheme based on energy dissipation and robustness to avoid numerical approximations, unlike solutions that solve the Hamilton-Jacobi-Isaacs (HJI) in a numerical way. In addition, the proposed controller ensures asymptotic stability using the Lyapunov theory, with smooth state changes and guarantees the capture of the target. Experimental results demonstrate the performance of the proposed solution even in presence of an agile intruder.
|
|
WeB08 |
Amber 7 |
Optimization II |
Regular Session |
Chair: Castillo Jimenez, Andres Catalino | Purdue University |
Co-Chair: Shin, Hyo-Sang | Cranfield University |
|
13:30-13:50, Paper WeB08.1 | |
>From Convexity to Strong Convexity and Beyond: Bridging the Gap in Convergence Rates |
|
Romero, Orlando | University of Pennsylvania |
Benosman, Mouhacine | MERL |
Pappas, George J. | University of Pennsylvania |
Keywords: Optimization, Optimization algorithms, Machine learning
Abstract: In this paper, we re-examine the role of convexity and smoothness on gradient-based unconstrained optimization. While existing literature establishes the fundamental limits for gradient-based optimization algorithms for the class mathcal{F}_L of mbox{L-smooth} convex functions and the subclass mathcal{F}_{mu,L} of mbox{L-smooth} and mu-strongly convex functions, there is a notable gap in the stark transition from their respective sublinear and linear/exponential convergence rates that persists even as muto 0. This gap is notable since the classical rate of mathcal{O}(1/k) for gradient descent in mathcal{F}_L is often overly conservative compared to what is observed in practice for convex functions that are not strongly convex. In this work, we partially close the aforementioned gap by leveraging the notion of emph{uniform} smoothness and convexity, and their respective emph{moduli}, to quantify and more comprehensively characterize the smoothness and convexity of a given function. We show how, through a simple rescaling of gradient descent informed by the modulus of smoothness, we can recover the classic rates as edge cases and establish novel rates for a wide variety of functions. Further, we examine how uniform convexity can be replaced with the Kurdyka-{L}ojasiewicz inequality, with the so-called ``desingularizing function'' replacing the role of the modulus of convexity in the novel rates. This characterization yields novel geometric insights on the relationship between the optimization landscape and the attainable convergence rates.
|
|
13:50-14:10, Paper WeB08.2 | |
>A General Framework for Approximate and Delayed Gradient Descent for Decomposable Cost Functions |
|
Zheng, Xinran | University of California San Diego |
Javidi, Tara | University of California, San Diego |
Touri, Behrouz | University of California San Diego |
Keywords: Optimization, Machine learning, Optimization algorithms
Abstract: We propose and analyze a generalized framework for distributed, delayed, and approximate stochastic gradient descent. Our framework considers n local agents who utilize their local data and computation to collectively assist a central server tasked with optimizing a global cost function composed of local cost functions accessible to the local agents. This framework is very general, subsuming a great variety of algorithms in federated learning and distributed optimization. In particular, this framework allows each local agent to approximate and share a stochastic (possibly biased) and delayed estimate of its local function gradient. Focusing on strongly convex functions with sufficient degree of smoothness, we characterize the mean square error in terms of the varying step-size, the approximation error (bias), and the delay in computing the gradients. This characterization, together with a careful design of step size process, establishes an optimal convergence rate that aligns with centralized stochastic gradient descent (SGD).
|
|
14:10-14:30, Paper WeB08.3 | |
>Equitable Client Selection in Federated Learning Via Truncated Submodular Maximization |
|
Castillo Jimenez, Andres Catalino | Purdue University |
Kaya, Ege Can | Purdue University |
Ye, Lintao | Huazhong University of Science and Technology |
Hashemi, Abolfazl | Purdue University |
Keywords: Optimization, Machine learning
Abstract: In a typical Federated Learning paradigm, a random subset of clients is selected at every round for training. This randomly chosen subset often does not perform well when evaluated in terms of fairness as the final model’s performance often varies greatly between clients. This lack of a balanced and fair performance can be detrimental in sensitive applications, such as disease diagnosis in healthcare settings. Such issues may be exacerbated by emerging performance-centric client sampling procedures. This paper proposes a new equitable client selection method, SUBTRUNC, that addresses the shortcomings of random selection via a modification of the well-known facilitylocation problem through submodular function maximization. This new approach entailing submodular functions incorporates using the information of loss values of each client to ensure a more balanced and thus more fair performance of the final model. Additionally, strong theoretical guarantees on the convergence of the resulting FL algorithm are established under mild assumptions. The algorithm’s performance is evaluated on heterogeneous scenarios with a clear improvement in fairness when being observed under the scope of a client dissimilarity metric.
|
|
14:30-14:50, Paper WeB08.4 | |
>A Passivity-Based Method for Accelerated Convex Optimisation |
|
Cho, Namhoon | Cranfield University |
Shin, Hyo-Sang | Cranfield University |
Keywords: Optimization, Adaptive control, Machine learning
Abstract: This study presents a constructive methodology for designing accelerated convex optimisation algorithms in continuous-time domain. The two key enablers are the classical concept of passivity in control theory and the time-dependent change of variables that maps the output of the internal dynamic system to the optimisation variables. The Lyapunov function associated with the optimisation dynamics is obtained as a natural consequence of specifying the internal dynamics that drives the state evolution as a passive linear time-invariant system. The passivity-based methodology provides a general framework that has the flexibility to generate convex optimisation algorithms with the guarantee of different convergence rate bounds on the objective function value. The same principle applies to the design of online parameter update algorithms for adaptive control by re-defining the output of internal dynamics to allow for the feedback interconnection with tracking error dynamics.
|
|
14:50-15:10, Paper WeB08.5 | |
>Online Nonstochastic Control vs. Retrospective Cost Adaptive Control |
|
Syed, Usman Ahmed | University of Illinois Urbana Champaign |
Li, Yingying | UIUC |
Hu, Bin | University of Illinois at Urbana-Champaign |
Keywords: Optimization, Adaptive control, Machine learning
Abstract: Recently, online optimization methods have been leveraged to develop the online nonstochastic control framework which is capable of learning online gradient perturbation controllers in the presence of nonstochastic adversarial disturbances. Interestingly, using online optimization for adapting controllers in the presence of unknown disturbances is not a completely new idea, and a similar algorithmic framework called Retrospective Cost Adaptive Control (RCAC) has already appeared in the controls literature in 2000s. In this letter, we present the connections between online nonstochastic control and RCAC, and discuss the different strengths of both approaches: i.e., RCAC is able to stabilize unknown unstable plants via the use of target models, while online nonstochastic control enjoys provably near optimal regret bounds given a stabilizing policy a priori. We further propose an integration of these two approaches. We hope that our insights will help the development of new algorithms that complement the two approaches.
|
|
15:10-15:30, Paper WeB08.6 | |
>Wasserstein Distributionally Robust Regret Minimization |
|
Cho, Youngchae | Seoul National University |
Yang, Insoon | Seoul National University |
Keywords: Optimization
Abstract: We propose a decision-making model under uncertainty to minimize the ex-ante regret in a distributionally robust manner using the Wasserstein metric, where the regret is defined as the difference of the expected cost achieved and the best achievable expected cost for any given distribution of uncertainty. First, we formulate a minimization problem of the worst-case ex-ante regret over a Wasserstein ball. Subsequently, we derive its surrogate as the proposed model using the minimax inequality, whose objective function is above the worst-case ex-ante regret on the decision space. Our main contributions are to show that (i) the approximation error of our model is uniformly bounded, and it vanishes depending on the cost function, the uncertainty set, and the Wasserstein ball’s radius; (ii) our model provides a finite-sample performance guarantee and is asymptotically optimal; and (iii) an optimal solution of our model for a class of two-stage linear programs can be obtained using the cutting-plane method. Simulation results demonstrate the effectiveness of our model.
|
|
WeB09 |
Amber 8 |
Predictive Control for Nonlinear Systems II |
Regular Session |
Chair: Diehl, Moritz | University of Freiburg |
Co-Chair: Stellato, Bartolomeo | Princeton University |
|
13:30-13:50, Paper WeB09.1 | |
>A Secure Communication Framework Based on Chaotic Synchronization Via Approximate Nonlinear Model Predictive Control |
|
Xie, Hongyang | Beihang University |
Li, Dongyu | Beihang University |
Dong, Fei | Beihang University |
Hu, Qinglei | Beihang University |
Keywords: Predictive control for nonlinear systems, Networked control systems, Neural networks
Abstract: Chaotic synchronization control has shown a great potential in the field of secure communications. Due to chaotic behaviors and restricted computational resources, the efficient implementation of synchronization control remains a significant challenge and an open problem. In this letter, an active model predictive controller is developed to address the synchronization errors of the master-slave system. To meet the requirement of computational efficiency, a constrained neural network is taken as the approximation control law of the nonlinear model predictive controller. Furthermore, a secure communication framework is proposed for the networked control system. Numerical simulations illustrate the synchronization performance of the proposed method and its practical applications in secure communication.
|
|
13:50-14:10, Paper WeB09.2 | |
>Tube-Based MPC for Two-Timescale Discrete-Time Nonlinear Processes with Robust Control Contraction Metrics |
|
Guo, Yang | Technische Universität Chemnitz |
Sauerteig, Philipp | Technische Universität Chemnitz |
Streif, Stefan | Technische Universität Chemnitz |
Keywords: Predictive control for nonlinear systems, Optimal control, Nonlinear systems
Abstract: A computationally efficient model predictive control scheme is proposed for constrained nonlinear processes with inherent slow and fast dynamics. Specifically, the nonlinear process is approximated by a surrogate model, whose slow state is sampled with a larger time interval than the fast one. This reduces optimization variables, on the other hand, introduces prediction errors and henceforth may induce constraint violation without further treatments. To mitigate these issues, we tighten constraints by leveraging Robust Control Contraction Metrics. Furthermore, a back-up strategy is employed to ensure feasibility over time. The numerical efficiency of this proposed scheme is illustrated with a crops growth process, e.g., in indoor farming.
|
|
14:10-14:30, Paper WeB09.3 | |
>Advanced-Step Real-Time Iterations with Four Levels -- New Error Bounds and Fast Implementation in Acados |
|
Frey, Jonathan | University of Freiburg |
Nurkanovic, Armin | University of Freiburg |
Diehl, Moritz | University of Freiburg |
Keywords: Predictive control for nonlinear systems, Optimal control, Optimization algorithms
Abstract: The Real-Time Iteration (RTI) is an online nonlinear model predictive control algorithm that performs a single Sequential Quadratic Programming (SQP) per sampling time. The algorithm is split into a preparation and a feedback phase, where the latter one performs as little computations as possible solving a single prepared quadratic program. To further improve the accuracy of this method, the Advanced-Step RTI (AS-RTI) performs additional Multi-Level Iterations (MLI) in the preparation phase, such as inexact or zero-order SQP iterations on a problem with a predicted state estimate. This paper extends and streamlines the existing local convergence analysis of AS-RTI, such as analyzing MLI of level A and B for the first time, and significantly simplifying the proofs for levels C and D. Moreover, this paper provides an efficient open-source implementation in acados, making it widely accessible to practitioners.
|
|
14:30-14:50, Paper WeB09.4 | |
>An Execution-Time-Certified Riccati-Based IPM Algorithm for RTI-Based Input-Constrained NMPC |
|
Wu, Liang | Massachusetts Institute of Technology |
Ganko, Krystian | Massachusetts Institute of Technology |
Wang, Shimin | Massachusetts Institute of Technology |
Braatz, Richard D. | Massachusetts Institute of Technology |
Keywords: Predictive control for nonlinear systems, Optimization algorithms, Optimal control
Abstract: Establishing an execution time certificate in deploying model predictive control (MPC) is a pressing and challenging requirement. As nonlinear MPC (NMPC) results in nonlinear programs, differing from quadratic programs encountered in linear MPC, deriving an execution time certificate for NMPC seems an impossible task. Our prior work cite{wu2023direct} introduced an input-constrained MPC algorithm with the exact and only textit{dimension-dependent} (textit{data-independent}) number of floating-point operations ([flops]). This paper extends it to input-constrained NMPC problems via the real-time iteration (RTI) scheme, which results in textit{data-varying} (but textit{dimension-invariant}) input-constrained MPC problems. Therefore, applying our previous algorithm can certify the execution time based on the assumption that processors perform fixed [flops] in constant time. As the RTI-based scheme generally results in MPC with a long prediction horizon, this paper employs the efficient factorized Riccati recursion, whose computational cost scales linearly with the prediction horizon, to solve the Newton system at each iteration. The execution-time certified capability of the algorithm is theoretically and numerically validated through a case study involving nonlinear control of the chaotic Lorenz system.
|
|
14:50-15:10, Paper WeB09.5 | |
>Improved Quasi-Min-Max MPC for Constrained LPV Systems Via Nonlinearly Parameterized State Feedback Control |
|
Yan, Jin | Institut Polytechnique De Paris |
Nguyen, Hoai-Nam | Telecom SudParis |
Samama, Nel | Telecom SudParis, Institute Polytechnique De Paris |
Keywords: Predictive control for nonlinear systems, Constrained control, LMIs
Abstract: We consider the regulation problem of linear parameter varying systems with input and state constraints. It is assumed that the time-varying parameters are available at the current time, but their future behavior is unknown and contained in a polytopic set. The aim is to design new stabilizing quasi-min-max MPC algorithms based on a nonlinearly parameterized state feedback control law. It is shown that the use of such a control law leads to less conservative results compared to those derived from linearly parameterized state feedback control laws. At each time instant, a convex semi-definite optimization problem is required to solved. Two numerical examples, including a non-quadratically stabilizable system, are given with comparison to earlier solutions from the literature to illustrate the effectiveness of the proposed approaches.
|
|
15:10-15:30, Paper WeB09.6 | |
>Learning Hierarchical Control for Multi-Agent Capacity-Constrained Systems |
|
Vallon, Charlott | University of California, Berkeley |
Pinto, Alessandro | NASA Jet Propulsion Laboratory |
Stellato, Bartolomeo | Princeton University |
Borrelli, Francesco | Unversity of California at Berkeley |
Keywords: Predictive control for nonlinear systems, Hierarchical control, Data driven control
Abstract: This paper introduces a data-driven hierarchical control scheme for a fleet of nonlinear, capacity-constrained autonomous agents in an iterative environment. The proposed control framework consists of a high-level dynamic task assignment and routing layer and low-level motion planning and tracking layer. Each layer uses a data-driven Model Predictive Control (MPC) policy for efficient calculation of new task assignments and actuation. We use collected data to iteratively refine estimates of agent capacity usage, and update MPC policy parameters accordingly. We leverage tools from iterative learning control to integrate learning at both hierarchy levels, and coordinate learning between levels to maintain closed-loop feasibility and performance improvement at each iteration.
|
|
WeB10 |
Brown 1 |
Context-Aware Analysis and Design of Genetic Circuits |
Invited Session |
Chair: Yeung, Enoch | University of California Santa Barbara |
Co-Chair: Sontag, Eduardo | Northeastern University |
Organizer: Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Organizer: Sontag, Eduardo | Northeastern University |
|
13:30-13:50, Paper WeB10.1 | |
>Guaranteeing System-Level Properties in Genetic Circuits Subject to Context Effects (I) |
|
Incer, Inigo | California Institute of Technology |
Pandey, Ayush | University of California, Merced |
Nolan, Nicholas | Massachusetts Institute of Technology |
Peterman, Emma | Massachusetts Institute of Technology |
Galloway, Kate | Massachusetts Institute of Technology |
Sontag, Eduardo | Northeastern University |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Biological systems, Formal Verification/Synthesis, Algebraic/geometric methods
Abstract: The identification of constraints on system parameters that will ensure that a system achieves desired requirements remains a challenge in synthetic biology, where components unintendedly affect one another by perturbing the cellular environment in which they operate. This paper shows how to solve this problem optimally for a class of input/output system-level specifications, and for unintended interactions due to resource sharing. Specifically, we show how to solve the problem based on the input/output properties of the subsystems and on the unintended interaction map. Our approach is based on the elimination of quantifiers in monotone properties of the system. We illustrate applications of this methodology to guaranteeing system-level performance of multiplexed and sequential biosensing and of bistable genetic circuits.
|
|
13:50-14:10, Paper WeB10.2 | |
>Noise Reduction in Resource-Coupled Multi-Module Gene Circuits through Antithetic Feedback Control (I) |
|
Chakravarty, Suchana | Arizona State University |
Zhang, Rong | Arizona State University |
Tian, Xiaojun | Arizona State University |
Keywords: Systems biology, Genetic regulatory systems, Stochastic systems
Abstract: Gene circuits within the same host cell often experience coupling, stemming from the competition for limited resources during transcriptional and translational processes. This resource competition introduces an additional layer of noise to gene expression. Here we present three multi-module antithetic control strategies: negatively competitive regulation (NCR) controller, alongside local and global controllers, aimed at reducing the gene expression noise within the context of resource competition. Through stochastic simulations and fluctuation-dissipation theorem (FDT) analysis, our findings highlight the superior performance of the NCR antithetic controller in reducing noise levels. Our research provides an effective control strategy for attenuating resource-driven noise and offers insight into the development of robust gene circuits.
|
|
14:10-14:30, Paper WeB10.3 | |
>Selection of Control Inputs for Enhancing the Long-Term Performance of Synthetic Gene Circuits (I) |
|
Byrom, Daniel Peter | University of Warwick |
Bates, Declan G. | Univ. of Warwick |
Darlington, Alexander P. S. | University of Warwick |
Keywords: Genetic regulatory systems, Biomolecular systems, Cellular dynamics
Abstract: Engineered biological systems fail when mutations arise that inhibit their intended function. Such mutations introduce uncertainty into the system parameters, making negative feedback control an attractive strategy to improve the evolutionary longevity of synthetic gene circuits. Here we propose three classes of controller that improve evolutionary performance in repeated batch culture. Each controller takes a different biological input: (i) the gene product in the cell, (ii) the host growth rate and (iii) the total population output. Using a multi-scale model of host-circuit interactions, we demonstrate that these different modes of action differentially influence the growth dynamics between mutant strains, driving significant differences in the long-term performance of gene circuits. We show that population-based feedback is least effective and that, whilst direct feedback inhibition is effective in the short-term, growth-based feedback can enhance long-term performance to a greater degree. We propose a novel control strategy which combines two of the strategies and improves both short- and long-term performance.
|
|
14:30-14:50, Paper WeB10.4 | |
>Competition for Binding Targets Results in Paradoxical Effects for Simultaneous Activator and Repressor Action (I) |
|
Ali Al-Radhawi, Muhammad | Northeastern University |
Manoj, Krishna | Massachusetts Institute of Technology |
Jatkar, Dhruv D. | Northeastern University |
Duvall, Alon | Northeastern University |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Sontag, Eduardo | Northeastern University |
Keywords: Biological systems, Systems biology, Genetic regulatory systems
Abstract: In the context of epigenetic transformations in cancer metastasis, a puzzling effect was recently discovered, in which the elimination (knock-out) of an activating regulatory element leads to increased (rather than decreased) activity of the element being regulated. It has been postulated that this paradoxical behavior can be explained by activating and repressing transcription factors competing for binding to other possible targets. It is very difficult to prove this hypothesis in mammalian cells, due to the large number of potential players and the complexity of endogenous intracellular regulatory networks. Instead, this paper analyzes this issue through an analogous synthetic biology construct which aims to reproduce the paradoxical behavior using standard bacterial gene expression networks. The paper first reviews the motivating cancer biology work, and then describes a proposed synthetic construct. A mathematical model is formulated, and basic properties of uniqueness of steady states and convergence to equilibria are established, as well as an identification of parameter regimes which should lead to observing such paradoxical phenomena (more activator leads to less activity at steady state). A proof is also given to show that this is a steady-state property, and for initial transients the phenomenon will not be observed. This work adds to the general line of work of resource competition in synthetic circuits.
|
|
14:50-15:10, Paper WeB10.5 | |
>Modeling Control of Supercoiling Dynamics and Transcription Using DNA-Binding Proteins |
|
Clark, Harris | UC Santa Barbara |
Taylor, Aleczander | University of California, Santa Barbara |
Yeung, Enoch | University of California Santa Barbara |
Keywords: Biomolecular systems, Systems biology, Biological systems
Abstract: Nearly all natural and synthetic gene networks rely on the fundamental process of transcription to enact biological feedback, genetic programs, and living circuitry. In this work, we investigate the efficacy of controlling transcription using a new biophysical mechanism, control of localized supercoiling near a gene of interest. We postulate a basic reaction network model for describing the general phenomenon of transcription and introduce a separate set of equations to describe the dynamics of supercoiling. We show that supercoiling and transcription introduce a shared reaction flux term in the model dynamics and illustrate how the modulation of supercoiling can be used to control transcription rates. We show the supercoiling-transcription model can be written as a nonlinear state-space model, with a radial basis function nonlinearity to capture the empirical relationship between supercoiling and transcription rates. We show the system admits a single, globally exponentially stable equilibrium point. Notably, we show that mRNA steady-state levels can be controlled directly by increasing a length-scale parameter for genetic spacing. Finally, we build a mathematical model to explore the use of a DNA binding protein, to define programmable boundary conditions on supercoiling propagation, which we show can be used to control transcriptional bursting or pulsatile transcriptional response. We show there exists a stabilizing control law for mRNA tracking, using the method of control Lyapunov functions and illustrate these results with numerical simulations.
|
|
15:10-15:30, Paper WeB10.6 | |
>The Impact of Plasmid Copy Number on Leaky Gene Expression and on the Behavior of an Activator-Based Genetic Switch (I) |
|
Yong, Chentao | New York University |
Zhou, Yiren | New York University |
Gyorgy, Andras | New York University Abu Dhabi |
Keywords: Biomolecular systems, Cellular dynamics, Genetic regulatory systems
Abstract: Plasmid copy number (PCN) is traditionally considered a static design parameter in synthetic biology applications. However, recent tools enable its dynamic regulation, thus opening up a novel dimension of gene expression control that complements well-established transcriptional and translational techniques. Therefore, here we characterize how tuning this crucial parameter impacts promoter leakiness both when relying on positive and negative regulation. We demonstrate both analytically and experimentally that in the former case, greater PCN yields elevated leakiness in protein expression, whereas this basal level can surprisingly decrease as PCN increases in repressor-based regulation, and that multi-level gene expression control can amplify this effect. Considering a genetic switch as a concrete application example, we further characterize how the interplay of PCN and promoter leakiness together determine the number of stable fixed points and their robustness to noise. Finally, we reveal how the metabolic burden that originates within the switch and its context shapes the dynamics and behavior of this ubiquitous gene circuit.
|
|
WeB11 |
Brown 2 |
Switched Systems I |
Regular Session |
Chair: Kreiss, Jérémie | Université De Lorraine, CRAN, ENSEM, |
Co-Chair: Jungers, Raphaël M. | University of Louvain |
|
13:30-13:50, Paper WeB11.1 | |
>Using Symbolic Dynamics to Compare Path-Complete Lyapunov Functions |
|
Singh, Somya | U C Louvain |
Jungers, Raphaël M. | University of Louvain |
Keywords: Switched systems, Hybrid systems, Automata
Abstract: In this paper, we study discrete-time linear switched systems by leveraging tools from symbolic dynamics and language theory. More specifically, viewing path-complete Lyapunov functions (PCLF) associated with the switched system as finite automata, we investigate the coverings of bi-infinite words generated via a PCLF and develop a framework for comparing two different PCLFs via these coverings. However, in most of the cases PCLFs are not comparable with regards to one corresponding to a better stability criterion than the other. For this purpose, we utilize the notion of support sets, which is a subset of paths in a PCLF that is sufficient to obtain the same performance index as that of the entire set of bi-infinite words generated by the PCLF, to obtain partial relations between two coverings. We also illustrate a numerical example to justify the study of support sets via the covering framework.
|
|
13:50-14:10, Paper WeB11.2 | |
>Towards Stochastic Realization Theory for Generalized Linear Switched Systems with Inputs: Decomposition into Stochastic and Deterministic Components and Existence and Uniqueness of Innovation Form |
|
Rouphael, Elie | University of Lille |
Mejari, Manas | University of Applied Sciences and Arts of Southern Switzerland |
Petreczky, Mihaly | UMR CNRS 9189, Ecole Centrale De Lille |
Belkoura, Lotfi | Université De Lille |
Keywords: Switched systems, Linear parameter-varying systems, Identification
Abstract: In this paper, we study a class of stochastic Generalized Linear Switched System (GLSS), which includes subclasses of jump-Markov, piecewide-linear and Linear Parameter-Varying (LPV) systems. We prove that the output of such systems can be decomposed into deterministic and stochastic components. Using this decomposition, we show existence of state-space representation in innovation form, and we provide sufficient conditions for such representations to be minimal and unique up to isomorphism.
|
|
14:10-14:30, Paper WeB11.3 | |
>Input Redundancy of Switched Linear Systems Via Polynomial Parameter-Dependent Systems |
|
Valentim Viana, Valessa | Université De Lorraine |
Kreiss, Jérémie | Université De Lorraine, CRAN, ENSEM, |
Jungers, Marc | CNRS - Université De Lorraine |
Keywords: Switched systems, Linear systems, Linear parameter-varying systems
Abstract: In this paper, we introduce a definition of input redundancy for switched systems, particularly focusing on the continuous input. We establish a criterion for assessing whether a switched linear system is input redundant with respect to the continuous input. Our approach initially involves transforming the switched linear system into a linear system depending polynomially on a univariate parameter through Lagrange polynomial interpolation. This allows us to leverage recent results in geometric control theory and input redundancy for parameter-dependent systems, and adapt them to the context of switched systems in order to obtain the desired conditions. To illustrate the application of the proposed strategy, we provide three numerical examples.
|
|
14:30-14:50, Paper WeB11.4 | |
>Necessary Conditions for the Stability of Singularly Perturbed Linear Systems with Switching Slow-Fast Behaviors |
|
Chitour, Yacine | Universit'e Paris-Sud, CNRS, Supelec |
Daafouz, Jamal | Université De Lorraine, CRAN, CNRS |
Haidar, Ihab | ENSEA |
Mason, Paolo | CNRS, Laboratoire Des Signaux Et Systèmes, Supélec |
Sigalotti, Mario | INRIA Paris |
Keywords: Switched systems, Linear systems, Stability of linear systems
Abstract: We consider linear singularly perturbed dynamics in which the set of fast variables is a switching parameter. We introduce auxiliary switching systems (in a single time-scale) whose instability implies the instability of the original dynamics. This translates into necessary conditions for the stability of the linear singularly perturbed dynamics.
|
|
14:50-15:10, Paper WeB11.5 | |
>Stability of Nonlinear Systems with Slow and Fast Time Variation and Switching: The Common Equilibrium Case |
|
Liberzon, Daniel | Univ of Illinois, Urbana-Champaign |
Shim, Hyungbo | Seoul National University |
Keywords: Switched systems, Nonlinear systems, Time-varying systems
Abstract: In this paper we consider a class of nonlinear systems with two kinds of inputs: one is slowly-varying, the other is fast-varying and periodic, and both are only piecewise continuous. Under the assumption that the origin is a common equilibrium for all values of the input signals, we provide sufficient conditions under which this equilibrium is semi-globally exponentially stable. Our approach is based on considering the (partial) average system which averages out the fast variation but retains the slow variation, and which can be used to approximate the original system in a certain sense. The stability conditions involve the existence of a suitable Lyapunov function for this average system, along with a bound on the total variation of the slowly-varying input.
|
|
15:10-15:30, Paper WeB11.6 | |
>On Sampled-Data Control of Nonlinear Asynchronous Switched Systems |
|
Di Ferdinando, Mario | University of L'Aquila |
Pola, Giordano | University of L'Aquila |
Di Gennaro, Stefano | University of L'Aquila |
Pepe, Pierdomenico | University of L' Aquila |
Keywords: Switched systems, Sampled-data control, Lyapunov methods
Abstract: In this paper, the sampled-data stabilization problem of nonlinear asynchronous switched systems is studied. In particular, a new methodology for the design of sampled-data controllers is provided for fully nonlinear asynchronous switched systems (i.e. not necessarily affine in the control inputs) described by locally Lipschitz functions. Firstly, the new notion of Steepest Descent Switching Feedback (SDSF) is introduced. Then, it is proved the existence of a suitably fast sampling such that the digital implementation of SDSFs (continuous or not) ensures the semi-global practical stability property with arbitrarily small final target ball of the related sampled-data closed-loop system under any kind of switching with arbitrarily pre-fixed dwell time. The stabilization in the sample-and-hold sense theory is used as a tool to prove the results. Possible discontinuities in the function describing the controller at hand are also managed. The case of aperiodic sampling is included in the theory here developed. The proposed theoretical results are validated through a numerical example.
|
|
WeB12 |
Brown 3 |
Machine Learning I |
Regular Session |
Chair: Alamo, Teodoro | Universidad De Sevilla |
Co-Chair: Wei, Ermin | Northwestern Univeristy |
|
13:30-13:50, Paper WeB12.1 | |
>One-Shot Averaging for Distributed TD(lambda) under Markov Sampling |
|
Tian, Haoxing | Boston University |
Paschalidis, Ioannis Ch. | Boston University |
Olshevsky, Alexander | Boston University |
Keywords: Machine learning
Abstract: We consider a distributed setup for reinforcement learning, where each agent has a copy of the same Markov Decision Process but transitions are sampled from the corresponding Markov chain independently by each agent. We show that in this setting, we can achieve a linear speedup for TD(lambda), a family of popular methods for policy evaluation, in the sense that N agents can evaluate a policy N times faster provided the target accuracy is small enough. Notably, this speedup is achieved by ``one shot averaging,'' a procedure where the agents run TD(lambda) with Markov sampling independently and only average their results after the final step. This significantly reduces the amount of communication required to achieve a linear speedup relative to previous work.
|
|
13:50-14:10, Paper WeB12.2 | |
>Temporal-Logic-Based Causal Fairness Analysis |
|
Baharisangari, Nasim | Arizona State University |
Pydah, Anirudh | Arizona State University |
Wu, Shiyou | Arizona State University |
Xu, Zhe | Arizona State University |
Keywords: Machine learning, Statistical learning, Learning
Abstract: Recently, there has been growing reliance on AI-based decision-making algorithms and frameworks in different areas such as health-related policies and credit scoring. The AI-based algorithms and frameworks may have been trained using data that is biased against a specific subpopulation or group, e.g., a certain race or gender. Hence, it is crucial to incorporate fairness measures into the decision-making process. However, most of the existing fairness analysis methods do not consider the inherent temporal dependencies in real-world situations. In this paper, we propose Temporal-Logic-Based Causal Fairness Analysis (TL-CFA) as a novel framework that integrates causal reasoning and temporal logic to address fairness concerns in algorithmic decision-making processes. This framework contributes to fairness-aware machine learning by offering a comprehensive approach that considers the temporal dimension, thus promoting fair and unbiased decision-making. We demonstrate the usefulness of the proposed method in health-inequity and gender pay gap case studies. We then compare the results obtained by TL-CFA by Path Analysis and Mediation Analysis.
|
|
14:10-14:30, Paper WeB12.3 | |
>Bayesian Meta Learning for Trustworthy Uncertainty Quantification |
|
Yuan, Zhenyuan | Virginia Tech |
Doan, Thinh T. | University of Texas at Austin |
Keywords: Machine learning, Statistical learning, Intelligent systems
Abstract: We consider the problem of Bayesian regression with trustworthy uncertainty quantification. We define that the uncertainty quantification is trustworthy if the ground truth can be captured by intervals dependent on the predictive distributions with a pre-specified probability. Furthermore, we propose, Trust-Bayes, a novel optimization framework for Bayesian meta learning which is cognizant of trustworthy uncertainty quantification without explicit assumptions on the prior model/distribution of the functions. We characterize the lower bounds of the probabilities of the ground truth being captured by the specified intervals and analyze the sample complexity with respect to the feasible probability for rustworthy uncertainty quantification. Monte Carlo simulation of a case study using Gaussian process regression is conducted for verification and comparison with the Meta-prior algorithm.
|
|
14:30-14:50, Paper WeB12.4 | |
>Fast Reinforcement Learning for Optimal Control of Nonlinear Systems Using Transfer Learning |
|
Wang, Yujia | National University of Singapore |
Xiao, Ming | National University of Singapore |
Wu, Zhe | National University of Singapore |
Keywords: Machine learning, Reinforcement learning, Chemical process control
Abstract: Traditional reinforcement learning (RL) methods for optimal control of nonlinear processes often face challenges such as substantial demands on computational resources and training time, and the difficulty of ensuring the safety of the closed-loop system during training. To overcome these obstacles, this work proposes a safe transfer reinforcement learning (TRL) framework. The algorithm leverages knowledge obtained from pre-trained source tasks to expedite learning in a new yet related target task, thereby significantly reducing both learning time and computational overhead for optimizing a control policy. Additionally, the proposed TRL method collects data and optimizes the control policy within a control invariant set (CIS) to ensure the safety of the system throughout the learning process. Furthermore, we develop a theoretical analysis for the TRL algorithm that establishes an error bound between the approximate control policy and the optimal ones, accounting for the discrepancy between the target and source tasks. Finally, we validate our approach using an example of optimal control of a chemical reactor, showcasing its effectiveness in solving the optimal control problem with improved computational efficiency and safety guarantees.
|
|
14:50-15:10, Paper WeB12.5 | |
>Dissimilarity Function Based Classification |
|
Palomo, Jaime | University of Seville |
Muñoz de la Peña, David | Universidad De Sevilla |
Alamo, Teodoro | Universidad De Sevilla |
Keywords: Machine learning, Randomized algorithms, Data driven control
Abstract: Most employed Bayesian algorithms, such as quadratic discriminant analysis, linear discriminant analysis or naive Bayes, rely on Gaussian assumptions. In this work, we introduce a novel approach to Bayesian classification based on dissimilarity functions. Dissimilarity functions provide a measure of the distance of a sample to a given set and can be used to obtain empirical probability distributions that allows the fitting of non-Gaussian probability distributions. We prove that the resulting set of classifiers, based on dissimilarity functions, includes, but is not limited to, quadratic discriminant analysis classifiers. We also demonstrate the properties of the proposed approach analyzing their performance over publicly available datasets.
|
|
15:10-15:30, Paper WeB12.6 | |
>A Stochastic Quasi-Newton Method for Non-Convex Optimization with Non-Uniform Smoothness |
|
Sun, Zhenyu | Northwestern University |
Wei, Ermin | Northwestern Univeristy |
Keywords: Machine learning, Optimization algorithms
Abstract: Classical convergence analyses for optimization algorithms rely on the widely-adopted uniform smoothness assumption. However, recent experimental studies have demonstrated that many machine learning problems exhibit non-uniform smoothness, meaning the smoothness factor is a function of the model parameter instead of a universal constant. In particular, it has been observed that the smoothness grows with respect to the gradient norm along the training trajectory. Motivated by this phenomenon, the recently introduced (L_0, L_1)-smoothness is a more general notion, compared to traditional L-smoothness, that captures such positive relationship between smoothness and gradient norm. Under this type of non-uniform smoothness, existing literature has designed stochastic first-order algorithms by utilizing gradient clipping techniques to obtain the optimal mathcal{O}(epsilon^{-3}) sample complexity for finding an epsilon-approximate first-order stationary solution. Nevertheless, the studies of quasi-Newton methods are still lacking. Considering higher accuracy and more robustness for quasi-Newton methods, in this paper we propose a fast stochastic quasi-Newton method when there exists non-uniformity in smoothness. Leveraging gradient clipping and variance reduction, our algorithm can achieve the best-known mathcal{O}(epsilon^{-3}) sample complexity and enjoys convergence speedup with simple hyperparameter tuning. Our numerical experiments show that our proposed algorithm outperforms the state-of-the-art approaches.
|
|
WeB13 |
Suite 1 |
Microscopic Traffic Modelling and Control |
Invited Session |
Chair: Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Co-Chair: Delle Monache, Maria Laura | University of California, Berkeley |
Organizer: Cicic, Mladen | University of California, Berkeley |
Organizer: Nick Zinat Matin, Hossein | University of California, Berkeley |
Organizer: Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Organizer: Delle Monache, Maria Laura | University of California, Berkeley |
|
13:30-13:50, Paper WeB13.1 | |
>Bidirectional Cruise Controller for Platoons with Constant Spacing and Collision Avoidance (I) |
|
Theodosis, Dionysios | Technical University of Crete |
Karafyllis, Iasson | National Technical University of Athens |
Papageorgiou, Markos | Technical Univ. of Crete |
Keywords: Lyapunov methods, Nonlinear systems, Stability of nonlinear systems
Abstract: In this paper we introduce a bidirectional cruise controller for platoons of vehicles of different lengths that uses spacing and relative-speed measurements from the preceding and following vehicles, in order to select the proper control action (acceleration). The proposed controllers are decentralized and do not need vehicle-to-vehicle communication other than the desired speed which can be provided by the leading vehicle. We rigorously prove that vehicles avoid collisions and attain a desired speed and desired spacing. Finally, by introducing appropriate metrics, we show by simulations the strong dissipation of disturbances along the string of vehicles.
|
|
13:50-14:10, Paper WeB13.2 | |
>On the Analytical Properties of a Nonlinear Microscopic Dynamical Model for Connected and Automated Vehicles |
|
Nick Zinat Matin, Hossein | University of California, Berkeley |
Yeo, Yuneil | University of California at Berkeley |
Gong, Xiaoqian | Amherst College |
Delle Monache, Maria Laura | University of California, Berkeley |
Keywords: Autonomous vehicles, Traffic control, Transportation networks
Abstract: In this paper, we propose an integrated dynamical model of Connected and Automated Vehicles (CAVs) which incorporates CAV technologies and a microscopic car-following model to improve safety, efficiency, and convenience. We rigorously investigate the analytical properties such as well-posedness, maximum principle, perturbation, and stability of the proposed model in some proper functional spaces. Furthermore, we prove that the model is collision-free and derive an explicit lower bound on the distance as a safety measure.
|
|
14:10-14:30, Paper WeB13.3 | |
>A Feasibility Analysis at Signal-Free Intersections |
|
Tzortzoglou, Filippos | Cornell University |
Beaver, Logan E. | Old Dominion University |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Traffic control, Autonomous vehicles, Transportation networks
Abstract: In this letter, we address the problem of improving the feasible domain of the solution of a control framework for coordinating connected and automated vehicles (CAVs) at signal-free intersections. The framework provides the optimal trajectories of CAVs to cross the intersection safely without stop-and-go driving. However, when traffic volume exceeds a certain level, finding a feasible solution for a CAV may become unattainable. We use concepts from numerical mathematics to identify appropriate polynomials that can serve as alternative trajectories of the CAVs, expanding the domain of the feasible CAV trajectories. We then select the final trajectories through an optimization problem that aims at minimizing jerk. Finally, we demonstrate the efficacy of our approach through numerical simulations.
|
|
14:30-14:50, Paper WeB13.4 | |
>Safe Merging of Autonomous Vehicles under Temporal Logic Task (I) |
|
Zhao, Chenguang | The Hong Kong University of Science and Technology (Guangzhou) |
Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Keywords: Traffic control, Formal Verification/Synthesis, Autonomous vehicles
Abstract: Autonomous vehicles are increasingly recognized for their potential to enhance traffic stability and safety through longitudinal control. However, a significant challenge remains in controlling AVs at merging ramps due to increased risks of congestion and accidents. This paper tackles merging control of an AV from an accelerated lane of finite length to mainline traffic of Human-Driven Vehicles (HV). Compared with safe car-following control, the merging control needs to satisfy multiple constraints due to complexity of the task. A novel dynamical model for merging and car-following behaviors is developed for mixed merging traffic. The complex temporal behaviors of the merging AV and car-following HVs are described with signal temporal logic. We integrate safety, task-specific, and physical constraints into a single control barrier function, which provides input constraints to synthesize a safety-critical controller via quadratic programming. Simulations on real traffic merging trajectories validate that the designed controller enhances traffic stability and efficiency with the merging task completed safely within the temporal and spatial constraints.
|
|
14:50-15:10, Paper WeB13.5 | |
>Optimal Sequencing and Motion Control in a Roundabout with Safety Guarantees (I) |
|
Chen, Yingqing | Boston University |
Cassandras, Christos G. | Boston University |
Xu, Kaiyuan | Boston University |
Keywords: Autonomous vehicles, Transportation networks, Traffic control
Abstract: We develop controllers for Connected and Automated Vehicles (CAVs) traversing a single-lane roundabout so as to simultaneously determine the optimal sequence and associated optimal motion control jointly minimizing travel time and energy consumption while providing speed-dependent safety guarantees, as well as satisfying velocity and acceleration constraints. This is achieved by integrating (a) Model Predictive Control (MPC) to enable receding horizon optimization with (b) Control Lyapunov-Barrier Functions (CLBFs) to guarantee convergence to a safe set in finite time, thus providing an extended stability region compared to the use of classic Control Barrier Functions (CBFs). The proposed MPC-CLBF framework addresses both infeasibility and myopic control issues commonly encountered when controlling CAVs over multiple interconnected control zones in a traffic network, which has been a limitation of prior work on CAVs going through roundabouts, while still providing safety guarantees. Simulations under varying traffic demands demonstrate the controller's effectiveness and stability.
|
|
15:10-15:30, Paper WeB13.6 | |
>Mesoscopic Digital Control for Practical String Stability of Vehicular Platoons (I) |
|
Bonsanto, Pietro | Università Degli Studi Dell'Aquila |
Mattioni, Mattia | Università Degli Studi Di Roma La Sapienza |
Iovine, Alessio | CNRS |
De Santis, Elena | University of L'Aquila |
Di Benedetto, Maria Domenica | University of L'Aquila |
Keywords: Traffic control, Sampled-data control, Autonomous vehicles
Abstract: In this paper, we propose a new piecewise constant feedback for string-stability of a platoon under sampled and quantized measurements. The design is based on a mesoscopic approach and is carried out over the sampled-data model associated to each vehicle. The proposed feedback ensures string-stability in the practical sense independently of the effect of sampling and quantization. Simulations show the effectiveness of the results.
|
|
WeB14 |
Suite 2 |
Estimation IV |
Regular Session |
Chair: Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
Co-Chair: Dokoupil, Jakub | Brno University of Technology |
|
13:30-13:50, Paper WeB14.1 | |
>A Deep-Learning Model of Virtual Test Drivers |
|
Pallacci, Thomas | Maserati |
Mimmo, Nicola | University of Bologna |
Sessa, Pasquale | Maserati |
Rabbeni, Roberto | Maserati |
Keywords: Identification, Machine learning, Modeling
Abstract: Virtual test drivers, conceived as automatic control systems, are becoming a paramount automatic verification tool which enables car makers to test new and advanced vehicle functionalities in a standardised, repeatable, high-quality, and cost-saving way. In this paper, we use modern machine-learning methodologies to build a virtual driver able to test the hill-descent control, one of the driver assistance systems equipping modern cars. The experimental results show that our virtual driver performs as a human driver involved in the same test conditions.
|
|
13:50-14:10, Paper WeB14.2 | |
>Merging Parameter Estimation and Classification Using LASSO |
|
Wang, Le | KTH Royal Institute of Technology, Shanghai Jiao Tong University |
Wang, Ying | KTH Royal Institute of Technology |
Qiu, Yu | SAIC Motor RD Innovation Headquarters |
Li, Mian | Shanghai Jiao Tong University |
Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
Keywords: Identification, Model Validation, Sensor fusion
Abstract: Soft sensing is a way to indirectly obtain information of signals for which direct sensing is difficult or prohibitively expensive. It may not a priori be evident which sensors provide useful information about the target signal. There may be sensors irrelevant for the estimation as well as sensors for which the information is very poor. It is often required that the soft sensor should cover a wide range of operating points. This means that some sensors may be useful in certain operating conditions while irrelevant in others, while yet others may have no bearing on the target signal whatsoever. However, this type of structural information is typically not available but has to be deduced from data. A further compounding issue is that multiple operating conditions may be described by the same model, but which ones are not known in advance either. In this contribution, we provide a systematic method to construct a soft sensor that can deal with these issues. While different models can be used, we adopt multi-input single-output finite impulse response models since they are linear in the parameters. We propose a single estimation criterion, where the objectives are encoded in terms of model fit, model sparsity (reducing the number of different models), and model parameter coefficient sparsity (to exclude irrelevant sensors). A post-processing model clustering step is also included. As proof of concept, the method is tested on field test datasets from a prototype vehicle.
|
|
14:10-14:30, Paper WeB14.3 | |
>Rate-Optimal Non-Asymptotics for the Quadratic Prediction Error Method |
|
Stamouli, Charis | University of Pennsylvania |
Ziemann, Ingvar | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Identification, Statistical learning
Abstract: We study the quadratic prediction error method---i.e., nonlinear least squares---for a class of time-varying parametric predictor models satisfying a certain identifiability condition. While this method is known to asymptotically achieve the optimal rate for a wide range of problems, there have been no non-asymptotic results matching these optimal rates outside of a select few, typically linear, model classes. By leveraging modern tools from learning with dependent data, we provide the first rate-optimal non-asymptotic analysis of this method for our more general setting of nonlinearly parametrized model classes. Moreover, we show that our results can be applied to a particular class of identifiable AutoRegressive Moving Average (ARMA) models, resulting in the first optimal non-asymptotic rates for identification of ARMA models.
|
|
14:30-14:50, Paper WeB14.4 | |
>Convergence of Adaptive Identification for Phase Retrieval Problem |
|
Liu, Yujing | Academy of Mathematics and Systems Science, Chinese Academy of Sc |
Liu, Zhixin | Academy of Mathematics and Systems Science, ChineseAcademyof Scie |
Guo, Lei | Academy of Mathematics and Systems Science, Chinese Academy of S |
Keywords: Identification, Stochastic systems, Adaptive systems
Abstract: The phase retrieval problem is of fundamental importance in various fields including computer science, physics, and engineering, where only the magnitude measurements are variable. For this NP-hard problem, previous work has mainly focused on the noiseless case, off-line algorithms, i.i.d. standard Gaussian assumptions on the regressor and convergence results in a high probability sense. To overcome these limitations, we consider the phase retrieval problem in the presence of noise disturbances. Based on the maximum likelihood (ML) criterion, we propose a novel adaptive identification algorithm to estimate the true signal vector. For the first time, we establish the global convergence and convergence rate results for the proposed algorithm without relying on the i.i.d. assumption on the regressor. Finally, we provide a numerical example to illustrate the effectiveness of our adaptive algorithm.
|
|
14:50-15:10, Paper WeB14.5 | |
>Total Least Squares from a Bayesian Perspective: Incorporating Data-Informed Forgetting |
|
Dokoupil, Jakub | Brno University of Technology |
Vaclavek, Pavel | Brno University of Technology |
Keywords: Identification, Time-varying systems, Estimation
Abstract: The real-time estimation of error-in-variables (EIV) models with unknown time-varying parameters is considered and resolved using a Bayesian framework. The stochastic model under consideration is a regression-type model that accounts for inherently inaccurate measurements, which are corrupted by the normal noise. The EIV model identification is traditionally performed via total least squares (TLS), relying on computationally intensive methods to numerically obtain a point estimate. Such a concept, despite its theoretical appeal, nevertheless lacks the ability to quantify the uncertainty associated with the parameter estimates. Thus, this limitation hinders the concept from being combined with the statistical decision-making strategies. The paper opens the way towards enriching the standard TLS in this respect. The enrichment is achieved by projecting the unnormalized posterior generated by the EIV parametric models onto the normal-Wishart distribution. This projection is made optimal by minimizing the Kullback-Leibler distance between the unnormalized and the normal-Wishart posteriors while imposing a hard equality constraint on the mean parameter scalar product. By establishing credible intervals for both the regression parameters and the noise precision, the resultant procedure is additionally endowed with Bayesian data-informed forgetting, which allows for effective operation in nonstationary environments.
|
|
15:10-15:30, Paper WeB14.6 | |
>Scalable Reachset-Conformant Identification of Linear Systems |
|
Lützow, Laura | Technical University Munich |
Althoff, Matthias | Technische Universität München |
Keywords: Identification, Uncertain systems, Time-varying systems
Abstract: By monitoring the set of reachable outputs, safety can be verified. However, to compute the reachable set of real-world systems, we require models that are able to produce all possible system behaviors. These kinds of models are called reachset-conformant, and their identification is a promising new research direction. While many existing reachset-conformant identification techniques require the computation of the halfspace representation of the zonotopic reachable sets, we propose an approach that leads to the same optimal identification results using the more scalable generator representation. Thus, our approach offers greater efficiency for high-dimensional systems and long time horizons. The scalability and accuracy of both approaches are compared in numerical experiments with linear time-variant systems.
|
|
WeB15 |
Suite 3 |
Power Systems II |
Regular Session |
Chair: Lestas, Ioannis | University of Cambridge |
Co-Chair: Ratnam, Elizabeth | The Australian National University |
|
13:30-13:50, Paper WeB15.1 | |
>Stability-Constrained Learning for Frequency Regulation in Power Grids with Variable Inertia |
|
Feng, Jie | University of California San Diego |
Muralidharan, Manasa | University of California San Diego |
Henriquez-Auba, Rodrigo | University of California at Berkeley |
Hidalgo-Gonzalez, Patricia | University of California, San Diego |
Shi, Yuanyuan | University of California San Diego |
Keywords: Power systems, Data driven control, Time-varying systems
Abstract: The increasing penetration of converter-based renewable generation has resulted in faster frequency dynamics, and low and variable inertia. As a result, there is a need for frequency control methods that are able to stabilize a disturbance in the power system at timescales comparable to the fast converter dynamics. This paper proposes a combined linear and neural network controller for primary frequency control that is stable at time-varying levels of inertia. We model the time-variance in inertia via a switched affine hybrid system model. We derive stability certificates for the proposed controller via a quadratic candidate Lyapunov function. We test the proposed control on a 12-bus 3-area test network, and compare its performance with a base case linear controller, optimized linear controller, and model predictive controller (MPC). Our proposed controller achieves a 0.5 s faster mean settling time and a 56% reduction in average control cost across 100 inertia scenarios compared to the optimized linear controller. Unlike MPC which requires complete knowledge of the inertia trajectories and system dynamics over the entire control time horizon, our proposed controller is real-time tractable, and achieves comparable performance to MPC.
|
|
13:50-14:10, Paper WeB15.2 | |
>A-Priori Reduction of Scenario Approximation for Automated Generation Control in High-Voltage Power Grids with Renewable Energy |
|
Lukashevich, Aleksandr | Skolkovo Institute of Science and Technology |
Bulkin, Alexander | LLC Interdata |
Maximov, Yury | Los Alamos National Laboratory |
Keywords: Power systems, Data driven control
Abstract: Renewable energy sources (RES) are increasingly integrated into power systems to achieve decarbonization and energy security United Nations Sustainable Development Goals. However, their low inertia and high uncertainty pose challenges to grid stability and increase blackout risks. Stochastic chance-constrained optimization, particularly data-driven methods, offer solutions but can be time-consuming, especially for multiple system snapshots. This paper addresses a dynamic joint chance-constrained Direct Current Optimal Power Flow (DC-OPF) problem with Automated Generation Control (AGC) for cost-effective power generation while ensuring balance and security constraints. We propose an approach for data-driven approximation with a-priori reduction of samples, maintaining solution reliability while reducing data-driven approximation size. Both theoretical analysis and empirical results demonstrate the approach's superiority in handling generation uncertainty, requiring up to 2 times less data and keeping solution reliability.
|
|
14:10-14:30, Paper WeB15.3 | |
>Information Structures in AC/DC Grids |
|
Taylor, Josh | New Jersey Institute of Technology |
Keywords: Power systems, Decentralized control, Hierarchical control
Abstract: The converters in an AC/DC grid form actuated boundaries between the AC and DC subgrids. In both simple linear and balanced dq-frame models, the states on either side of these boundaries are coupled only by control inputs. In this paper, we show how this topological property imparts all AC/DC grids with poset-causal information structures. A practical benefit is that certain decentralized control problems that are hard in general are tractable for poset-causal systems. We also show that special cases like multi-terminal DC grids can have coordinated and leader-follower information structures.
|
|
14:30-14:50, Paper WeB15.4 | |
>Frequency Control and Power Sharing in Combined Heat and Power Networks (I) |
|
Qin, Xin | University of Cambridge |
Lestas, Ioannis | University of Cambridge |
Keywords: Power systems, Decentralized control, Nonlinear systems
Abstract: We consider the problem of using district heating systems as ancillary services for primary frequency control in power networks. We propose a novel power sharing scheme for heating systems based on the average temperature, which enables an optimal power allocation among the diverse heat sources without having a prior knowledge of the disturbances. We then discuss two approaches for heating systems to contribute to frequency regulation in power networks. We show that both approaches ensure stability in the combined heat and power network and facilitate optimal power allocation among the different energy sources.
|
|
14:50-15:10, Paper WeB15.5 | |
>Event-Triggered Pricing-Based Frequency Control in Power System Via Passivity Analysis |
|
Shibata, Yasutomo | Keio University |
Namerikawa, Toru | Keio University |
Keywords: Power systems, Stability of linear systems, Discrete event systems
Abstract: This study addresses the problem of frequency control in a power system with the participation of demand response (DR). The pricing-based controller is applied to the power system so that DR participants and suppliers can contribute to the frequency regulation via electricity price. However, the high frequency of communication between frequency dynamics and pricing-based controllers is thought to be a problem. To address this problem, we derive event-triggered conditions that reduce the communication frequency while maintaining stability based on the passivity property of the system, and the asymptotical stability around the equilibrium point of the overall system is derived under the proposed event-triggered conditions. Furthermore, numerical simulation with simplified power network system is conducted so the effectiveness of the proposed system design and system stability is confirmed.
|
|
15:10-15:30, Paper WeB15.6 | |
>Unified Control of Voltage, Frequency and Angle in Electrical Power Systems: A Passivity and Negative-Imaginary Based Approach |
|
Chen, Yijun | University of Sydney |
Shi, Kanghong | Australian National University |
Petersen, Ian R. | Australian National University |
Ratnam, Elizabeth | The Australian National University |
Keywords: Power systems, Stability of nonlinear systems, Nonlinear output feedback
Abstract: This paper proposes a unified methodology for voltage regulation, frequency synchronization, and rotor angle control in power transmission systems considering a one-axis generator model with time-varying voltages. First, we formulate an output consensus problem with a passivity and negative-imaginary (NI) based control framework. We establish output consensus results for both networked passive systems and networked NI systems. Next, we apply the output consensus problem by controlling large-scale batteries co-located with synchronous generators that use real-time voltage phasor measurements. By controlling the battery storage systems so as to dispatch real and reactive power, we enable simultaneous control of voltage, frequency, and power angle differences across a transmission network. Validation through numerical simulations on a four-area transmission network confirms the robustness of our unified control framework.
|
|
WeB16 |
Suite 4 |
Constrained Control II |
Regular Session |
Chair: Ushirobira, Rosane | Inria |
Co-Chair: Katayama, Hitoshi | The University of Shiga Prefecture |
|
13:30-13:50, Paper WeB16.1 | |
>Globally Asymptotically Stable Control of Integrators with Long Dead Time in the Presence of Actuator Constraints |
|
Hölzl, Stefan Lambert | Graz University of Technology |
Keywords: Delay systems, Predictive control for linear systems, Constrained control
Abstract: Different concepts to control integrators with long dead time have been presented over the last 40 years. Although this particular plant model is of practical interest, another practical issue of high interest, i.e. actuator constraints, has often enough been neglected in the treatises. This paper presents a control scheme (the so-called Conditioned Smith-Astrom Predictor) that guarantees global asymptotic stability for integrators with long dead time. Furthermore, it offers a straightforward design procedure and—yet another practical aspect of increasing interest—an explicit disturbance estimate (independent of constraints). A highlight of the proposed stability proof is that it uses well-established graphical methods that allow for an easy verification of closed-loop stability even in the presence of unmodelled dynamics and dead time mismatch, the latter of which is illustrated in the paper. This makes it an appealing alternative to other methods found in literature.
|
|
13:50-14:10, Paper WeB16.2 | |
>Anti-Windup-Like Compensator Design for Continuous-Time Systems Affected by Unknown Nonlinearities and Input Saturation |
|
Giorgetti, Folco | Università Degli Studi Di Perugia |
Ferrante, Francesco | Universita Degli Studi Di Perugia |
Fravolini, Mario Luca | Universita' Di Perugia |
Keywords: LMIs, Lyapunov methods, Nonlinear systems
Abstract: This paper addresses the stabilization of a particular class of continuous-time systems affected by unknown sector bound nonlinearities and input saturation. An observer is designed to provide an estimate of the unknown nonlinearity. Such an estimate is used by an additional compensation loop with the purpose of mitigating the effect of the nonlinearity and enlarge the basin of attraction of the system. Stability conditions are given as a set of matrix inequalities and quadratic Lyapunov functions are exploited. An optimal compensator design algorithm based on semidefinite programming is proposed. Finally a numerical example shows the results of the proposed approach.
|
|
14:10-14:30, Paper WeB16.3 | |
>On the Constrained Feedback Linearization Control Based on the MILP Representation of a ReLU-ANN |
|
Do, Huu-Thinh | Grenoble Institute of Technology (Grenoble INP) |
Prodan, Ionela | Grenoble Institute of Technology (Grenoble INP) - Esisar |
Keywords: Feedback linearization, Constrained control, Neural networks
Abstract: In this work, we explore the efficacy of rectified linear unit artificial neural networks in addressing the intricate challenges of convoluted constraints arising from feedback linearization mapping. Our approach involves a comprehensive procedure, encompassing the approximation of constraints through a regression process. Subsequently, we transform these constraints into an equivalent representation of mixed-integer linear constraints, seamlessly integrating them into other stabilizing control architectures. The advantage resides in the compatibility with the linear control design and the constraint satisfaction in the model predictive control setup, even for forecasted trajectories. Simulations are provided to validate the proposed constraint reformulation.
|
|
14:30-14:50, Paper WeB16.4 | |
>Motion Planning of 3D Nonholonomic Robots Via Curvature-Constrained Vector Fields |
|
Qiao, Yike | Peking University |
He, Xiaodong | University of Science and Technology Beijing |
Li, Zhongkui | Peking University |
Keywords: Nonholonomic systems, Autonomous robots
Abstract: Vector-field-based methods are typical feedback planning algorithms, especially eligible for the motion planning of nonholonomic robots. Nevertheless, most existing vector fields (VF) do not account for the prevalent constraints on robot’s kinematics. This paper addresses the motion planning problem for 3D nonholonomic robots with trajectories featuring upper bounded curvature. To this end, a curvature-constrained VF over R3 is proposed, whose integral curves guarantee an upperbound of curvature as well as an almost-global attraction region towards the desired position with a specified heading direction. Moreover, a control strategy is presented to determine the robot’s control inputs subject to the curvature constraint. Under the designed control laws, the robot is guaranteed to track the VF while ensuring that the actual trajectory adheres to the curvature constraint. Finally, the efficacy of the presented motion planning algorithm is validated by numerical simulations.
|
|
14:50-15:10, Paper WeB16.5 | |
>Sampled-Data Circular Path Following Control of Four Wheeled Mobile Robots with Steering Angle Saturation |
|
Katayama, Hitoshi | The University of Shiga Prefecture |
Hayashi, Kohei | The University of Shiga Prefecture |
Imamura, Yuya | The University of Shiga Prefecture |
Keywords: Sampled-data control, Automotive control, Constrained control
Abstract: Sampled-data circular path following control of four wheeled mobile robots with steering angle saturation is considered. The line-of-sight guidance algorithm is used to drive a tracking error dynamics and sampled-data path following control is formulated as sampled-data stabilization of the tracking error dynamics. Both single-rate and multi-rate sampled-data steering angle controllers are designed on the basis of the Euler model of the tracking error dynamics. The closed-loop tracking error dynamics given by the designed controllers is represented by a cascade system with the disturbance input induced by steering angle saturation. Then we show that the designed controllers achieve sampled-data circular path following control in the semiglobally practically input-to-state stable sense. Simulation and experimental results are also given to show the effectiveness of the designed controllers.
|
|
15:10-15:30, Paper WeB16.6 | |
>An Event-Triggered Robust Control for Unicycle Mobile Robots with Communication, State, and Input Constraints |
|
Gutiérrez, Ariana | Tecnológico Nacional De México/I.T. De La Laguna |
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Mera, Manuel | Esime Upt Ipn |
Efimov, Denis | Inria |
Ushirobira, Rosane | Inria |
Keywords: Sampled-data control, Robust control, Autonomous robots
Abstract: This paper contributes to designing a sampled robust controller for the trajectory tracking problem in constrained unicycle mobile robots. The proposed controller comprises the design of an aperiodic control law based on an event–triggered controller and a periodic control law based on a constant sampled state–feedback controller. The state-feedback–based event-triggered control is designed by means of the attractive ellipsoid method and the barrier Lyapunov function and provides a safe set, where the state constraints are not transgressed, and a switching set that defines the region where each part of the controller is active. The periodic sampled control part is designed taking into account a maximum sampling time and is active inside the switching set. The proposed strategy ensures the input–to–state stability of the tracking error dynamics with respect to external disturbances. A constructive and simple method based on linear matrix inequalities is proposed to compute the controller gains. Simulation results illustrate the feasibility of the proposed approach.
|
|
WeB17 |
Suite 6 |
Autonomous Robots II |
Regular Session |
Chair: Le Ny, Jerome | Polytechnique Montréal |
Co-Chair: Puig, Vicenc | Universitat Politècnica De Catalunya |
|
13:30-13:50, Paper WeB17.1 | |
>Coverage Control with Heterogeneous Robotic Teams Via Multi-Marginal Optimal Transport |
|
Le Ny, Jerome | Polytechnique Montréal |
Keywords: Autonomous robots, Optimization algorithms, Stochastic systems
Abstract: Coverage control refers to the problem of simultaneously deploying a mobile robotic network and assigning tasks distributed in the environment to each robot. We focus on a natural extension of this problem, where tasks must be serviced by teams of robots from different classes. We leverage the connection between the assignment part of the coverage control problem and the theory of optimal transport to formulate and study a general coverage control problem by heterogeneous robotic teams, with possibly constraints on the utilization rate of each robot. The optimization of the assignment maps and of the utilization rates are shown to be convex problems, amenable to finite-dimensional deterministic or stochastic optimization methods. The optimization of the robot states or locations is subject to local minima as in the standard coverage control problem, but can be performed locally using deterministic or stochastic gradient descent, in a manner similar to Lloyd's method. Numerical simulations illustrate the flexibility of the formulation and the behavior of the algorithms.
|
|
13:50-14:10, Paper WeB17.2 | |
>A Wake Avoidance Method for Unmanned Maritime Vehicles Using a Trajectory-Circle-Based Control Barrier Function |
|
Fan, Yexin | University of Warwick |
Yang, Haoyang | University of Warwick |
Dong, Hongyang | University of Warwick |
Zhao, Xiaowei | University of Warwick |
Yue, Weitao | University of Warwick |
Keywords: Autonomous systems, Constrained control, Autonomous robots
Abstract: Efficient obstacle avoidance while controlling unmanned maritime vehicles (UMVs) is an important topic. Evading obstacles that are both time-varying and shape changing, such as avoiding moving boats and their wake regions that vary with the boat’s motion and propagate along time, is essential for UMV’s safe operation yet a challenging task. To address this challenge, this paper proposes a novel trajectory-circle-based control barrier function (CBF) method for variable-area obstacle avoidance. As a stepping stone, the boundaries of obstacles are defined by modeling the relationship between the Kelvin wake pattern and the boat’s motions, thereby defining the boundaries of obstacles. We propose a novel CBF formulation based on the geometric relationship between the collision area and UMV motion states. Particularly, we leverage the geometrical properties between the predictive circular trajectory and the collision area to construct invariant sets. Compared to traditional CBF-based methods, our design achieves smoother avoidance trajectories and requires smaller control corrections, guaranteeing a safe operation for UMVs.
|
|
14:10-14:30, Paper WeB17.3 | |
>Asymmetric Risk-Field Based Spatio-Temporal Trajectory Planning for Autonomous Driving Considering Game Interaction |
|
Chen, Zihao | Shanghai Jiao Tong University |
Pang, Hui | Xi'an University of Technology |
Hu, Chuan | Shanghai Jiao Tong University |
Zhang, Xi | Shanghai Jiao Tong University |
Keywords: Autonomous vehicles, Autonomous systems, Control applications
Abstract: Aiming at the trajectory planning problem of autonomous vehicles, a spatio-temporal joint planning framework considering both multi-vehicle interactions through a game theoretic approach and asymmetric risk field theory was proposed in this article. Through game theoretic forward propagation, the predicted future trajectory of the surrounding vehicle is acquired and coupled into the framework of the ego vehicle decision-making and planning, so that the ego vehicle trajectory planning considering multi-vehicle interaction can be realized. The trajectory is derived from a spatio-temporal planning approach to integrate the velocity planning and path planning and the safety of the trajectory is guaranteed. Furthermore, the trajectory points generated by forward propagation can effectively consider the asymmetric risk field generated by surrounding vehicles and integrate it into the solution of the numerical optimization problem, comprehensively considering the impact of different types of surrounding vehicles, together with their states and other factors, so that the calculated route is safer and in line with human decision-making. The simulation results show that in the dense traffic flow with frequent interactions with surrounding vehicles, the autonomous ego vehicle can reasonably change lanes and achieve efficient and safe driving.
|
|
14:30-14:50, Paper WeB17.4 | |
>Symbolically Synthesized Motion Primitives for Autonomous Navigation |
|
Zhao, Zhixin | University Paris Saclay |
Girard, Antoine | CNRS |
Olaru, Sorin | CentraleSupélec |
Keywords: Autonomous vehicles, Formal Verification/Synthesis, Computational methods
Abstract: This paper proposes a novel approach to navigation for autonomous vehicles that leverages symbolic control methods and system translational and rotational invariance properties. By decomposing in-plane motions into translations and rotations, the approach constructs corresponding motion primitives that enable efficient offline controller design and avoid computationally expensive discretization of the whole state space. The resulting controllers achieve complex trajectories through concatenation of motion primitives. At the same time the safe corridor given by this method will provide safety guarantee for the whole mission.
|
|
14:50-15:10, Paper WeB17.5 | |
>Integrated Planning and Control: A Crucial Path Nodes-Based Piecewise Model Predictive Control Strategy |
|
Deng, Yuxiang | Tianjin University |
Zuo, Zhiqiang | Tianjin University |
Wang, Haoyu | Tianjin University |
Wang, Yijing | Tianjin University |
Keywords: Autonomous vehicles, Optimization, Control applications
Abstract: In this paper, an integrated planning and control strategy based on piecewise model predictive control (MPC) is developed to improve the efficiency of planningcontrol framework. First, a penalty term characterizing the influence of obstacles is merged into the heuristic function of A* algorithm. This enables the planned path to maintain navigability in narrow areas while deviating from obstacles in open areas. Second, to rapidly identify safe regions around the original path, several crucial path nodes (CPNs) are selected, including heading change nodes and constraint change nodes. With the safe regions around these CPNs, a novel method for constructing safe travel corridors (STCs) is developed. Finally, a piecewise MPC is put forward to drive the unmanned ground vehicle (UGV) towards the target node within STCs, which eliminates the requirement for the reference path. This promotes the efficiency of planning-control framework and achieves stronger traveling stability. The proposed strategy is deployed on a UGV platform, and its superiority and effectiveness are demonstrated via real vehicle experiments.
|
|
15:10-15:30, Paper WeB17.6 | |
>Linear Quadratic Zonotopic Control of Switched Systems: Application to Autonomous Vehicle Path-Tracking |
|
Zhang, Shuang | Univeritat Politecnica De Catalunya |
Ifqir, Sara | CRIStAL Laboratory, Centrale Lille Institut |
Puig, Vicenc | Universitat Politècnica De Catalunya |
Keywords: Autonomous vehicles, Switched systems, Uncertain systems
Abstract: This paper proposes a zonotopic approach for the state feedback control problem of a class of uncertain switched systems subject to unknown but bounded disturbances and measurement noises. The proposed approach is the zonotopic analogous case of the switched Linear Quadratic Gaussian (LQG) control, in which the feedback loop is closed using the optimal estimates of a Switched Zonotopic Kalman Filter (SZKF) leading to a Switched Linear Quadratic Zonotopic (SLQZ) control scheme. In this context, first, a SZKF with offline filter gains design is proposed so that the unmeasurable system states can be estimated. Then, to tackle the synthesis of the SZKF and the state feedback controller, separation principle is proved so that the computation of the optimal controller and estimator can be done separately by finding the solutions to a finite set of LMIs. At last, a reference path tracking controller of the vehicle lateral dynamics is designed to demonstrate the validity and performance of the proposed method.
|
|
WeB18 |
Suite 7 |
Stability of Nonlinear Systems II |
Regular Session |
Chair: Heath, William Paul | Bangor University |
Co-Chair: Jungers, Marc | CNRS - Université De Lorraine |
|
13:30-13:50, Paper WeB18.1 | |
>Multiplier Analysis of Lurye Systems with Power Signals |
|
Heath, William Paul | Bangor University |
Carrasco, Joaquin | University of Manchester |
Keywords: Stability of nonlinear systems, Constrained control, Nonlinear systems
Abstract: Multipliers can be used to guarantee both the Lyapunov stability and input-output stability of Lurye systems with time-invariant memoryless slope-restricted nonlinearities. If a dynamic multiplier is used there is no guarantee the closed- loop system has finite incremental gain. It has been suggested in the literature that without this guarantee such a system may be critically sensitive to time-varying exogenous signals including noise. We show that multipliers guarantee the power gain of the system to be bounded and quantifiable. Furthermore power may be measured about an appropriate steady state bias term, provided the multiplier does not require the nonlinearity to be odd. Hence dynamic multipliers can be used to guarantee Lurye systems have low sensitivity to noise, provided other exogenous systems have constant steady state. We illustrate the analysis with an example where the exogenous signal is a power signal with non-zero mean.
|
|
13:50-14:10, Paper WeB18.2 | |
>On Static O'Shea-Zames-Falb Multipliers for Idempotent Nonlinearities |
|
Yuno, Tsuyoshi | Kyushu Univ |
Nishinaka, Shingo | Kyushu University |
Saeki, Rin | Kyushu University |
Ebihara, Yoshio | Kyushu University |
Keywords: Stability of nonlinear systems, Neural networks, LMIs
Abstract: In this paper, we investigate static O'Shea-Zames-Falb (OZF) multipliers for slope-restricted and idempotent nonlinearities. For the analysis of nonlinear feedback systems, the powerful framework of integral quadratic constraint has been frequently employed, where the core of this framework is capturing the behavior of nonlinearities by multipliers. Among such multipliers, OZF multipliers are known to be effective for slope-restricted nonlinearities. However, OZF multipliers only grasp the rough slope properties of the nonlinearities, and hence cannot distinguish nonlinearities with the same slope property. To address this issue, we focus on the fact that some nonlinearities that are important in control engineering satisfy idempotence. By actively using the idempotence property, we first show that we can enlarge the set of the standard OZF multipliers for slope-restricted nonlinearities. In particular, for slope-restricted and idempotent nonlinearities with slope [0,1], we can provide a clear understanding on how the set of the standard OZF multipliers is enlarged. We finally illustrate the effectiveness of the newly proposed multipliers by numerical examples on stability analysis of nonlinear feedback systems.
|
|
14:10-14:30, Paper WeB18.3 | |
>Ensuring Both Positivity and Stability Using Sector-Bounded Nonlinearity for Systems with Neural Network Controllers |
|
Montazeri Hedesh, Hamidreza | Northeastern University |
Siami, Milad | Northeastern University |
Keywords: Stability of nonlinear systems, Neural networks, Nonlinear output feedback
Abstract: This paper introduces a novel method for the stability analysis of positive feedback systems with a class of fully connected feedforward neural networks (FFNN) controllers. By establishing sector bounds for fully connected FFNNs without biases, we present a stability theorem that demonstrates the global exponential stability of linear systems under fully connected FFNN control. Utilizing principles from positive Lur’e systems and the positive Aizerman conjecture, our approach effectively addresses the challenge of ensuring stability in highly nonlinear systems. The crux of our method lies in maintaining sector bounds that preserve the positivity and Hurwitz property of the overall Lur’e system. We showcase the practical applicability of our methodology through its implementation in a linear system managed by a FFNN trained on output feedback controller data, highlighting its potential for enhancing stability in dynamic systems.
|
|
14:30-14:50, Paper WeB18.4 | |
>On the Generalization of the Multivariable Popov Criterion for Slope-Restricted Nonlinearities |
|
Li, Mengmou | Hiroshima University |
Hatanaka, Takeshi | Tokyo Institute of Technology |
Nagahara, Masaaki | Hiroshima University |
Keywords: Stability of nonlinear systems, Robust control, Nonlinear output feedback
Abstract: The Popov criterion has proved to be a powerful tool for analyzing the absolute stability of Lur’e systems, where a linear time-invariant system is in feedback interconnection with a nonlinear operator. However, its applicability is limited by the stringent requirements that the linear system is strictly proper and the input of the nonlinear operator has a bounded time derivative. In this paper, we relax these requirements for the Popov stability criterion on MIMO systems with nonrepeated diagonal slope-restricted nonlinearities by demonstrating the phase containment of Popov multipliers within the well-established O’Shea-Zames-Falb multipliers.
|
|
14:50-15:10, Paper WeB18.5 | |
>Control Design for Linear Systems with Asymmetric Input Backlash and Dead-Zone through LMI Conditions |
|
Kreiss, Jérémie | Université De Lorraine, CRAN, ENSEM, |
Jungers, Marc | CNRS - Université De Lorraine |
Pierron, Aurelien | SPIE INDUSTRIE and Université De Lorraine - CNRS |
Dupont, JÉrÉmy | SPIE INDUSTRIE |
Millerioux, Gilles | Lorraine University |
Martig, Martial | SPIE INDUSTRIE |
Keywords: Lyapunov methods, Stability of nonlinear systems, LMIs
Abstract: The interconnection between a linear time invariant system and a nonlinear operator gathering an asymmetric backlash and an asymmetric dead-zone is studied, which are relevant in mechanics and hydraulics. This paper aims to design a controller as a static linear state feedback to ensure Uniform Ultimately Bounded (UUB) property for the closed-loop system. A UUB-Lyapunov-based approach is used to provide sufficient conditions as Linear Matrix Inequalities (LMIs) for this control design and a resulting optimization problem is offered to minimize the UUB set. A numerical illustration presents the efficiency of our contribution and related discussions.
|
|
15:10-15:30, Paper WeB18.6 | |
>On Integral and Local Input-To-State Stability for Discrete-Time Systems |
|
Dashkovskiy, Sergey | University of Würzburg |
Schroll, Andreas | University of Würzburg |
Keywords: Stability of nonlinear systems, Lyapunov methods, Nonlinear systems
Abstract: In this paper we consider several ISS-like properties for infinite-dimensional discrete-time systems with inputs. Characterizations of these properties are developed and illustrative examples are provided. Relations between these properties are discussed.
|
|
WeB19 |
Suite 8 |
Markov Jump Systems |
Regular Session |
Chair: Kieffer, Michel | CNRS - Univ Paris-Sud - CentraleSupelec |
Co-Chair: Manes, Costanzo | Universita' Dell'Aquila |
|
13:30-13:50, Paper WeB19.1 | |
>State Estimation for Discrete-Time Semi-Markovian Jump Piecewise-Affine Systems with Complex Semi-Markovian Kerne |
|
Tang, Qi | The University of Hong Kong |
Liu, Tao | The University of Hong Kong |
Sun, Xi-Ming | Dalian University of Technology |
Keywords: Markov processes, Estimation, Nonlinear systems
Abstract: This paper addresses state estimation for discrete-time semi-Markovian jump piecewise-affine (PWA) systems. To handle uncertainties caused by unknown disturbances, we propose an analysis method that considers all transfer paths be- tween different regions within the same mode. We then examine the mean-square exponential stability and L∞ performance of the error dynamic system by constructing Lyapunov functions that depend on mode, region, and sojourn time. The related criteria are derived using ellipsoidal outer approximation. Based on these criteria, we design L∞ observers. Finally, we demonstrate the applicability of our method through a numerical example.
|
|
13:50-14:10, Paper WeB19.2 | |
>H2 State-Feedback Control of Continuous-Time Markov Jump Lur'e Systems with Partial Mode Information and Sector Bound Optimization |
|
Porrelli Moreira, da Silva, Lucas | Universidade Federal De Mato Grosso |
Oliveira, Ricardo C. L. F. | University of Campinas - UNICAMP |
Keywords: Markov processes, Stability of nonlinear systems, Optimization
Abstract: This paper investigates the problem of Hd state-feedback control of continuous-time Markov jump Lur'e systems with sector-bounded nonlinearity. It is provided synthesis conditions for the design of robust stabilizing controllers in which the Markov mode theta(t) is not available but is estimated by a detector hat{theta}(t). As performance criteria, it is proposed a multi-objective optimization problem where an Hd guaranteed cost is minimized and the amplitude of the sector is maximized. The synthesis framework is provided in terms of linear matrix inequalities, and a numerical example based on a practical system is provided to illustrate the results.
|
|
14:10-14:30, Paper WeB19.3 | |
>On Component-Wise Asymptotic Moment Stability of Continuous-Time Markov Jump Linear Systems |
|
Liu, Shenyu | Beijing Institute of Technology |
Wen, Penghui | Beijing Institute of Technology |
Keywords: Stability of hybrid systems, Markov processes, Switched systems
Abstract: In addition to the known stochastic stability properties of asymptotic moment stability and almost sure global asymptotic stability for continuous-time Markov jump linear systems, in this work, we propose component-wise asymptotic moment stability and study the relations between these stochastic stability properties. Next, we show that the component-wise 1st and 2nd moments of a Markov jump linear system can be precisely computed by solving linear ordinary differential equations. Consequently, necessary and sufficient conditions for component-wise asymptotic 1st and 2nd moment stability are obtained. Lastly, we test stochastic stability of several numerical examples via our criteria, one of which consists of all unstable flow and all unstable jumps, yet has all the stochastic stability properties aforementioned.
|
|
14:30-14:50, Paper WeB19.4 | |
>First-Moment Stability Conditions for Continuous-Time Markov Jump Linear Systems with Stationary and Time-Varying Transition Rates |
|
De Iuliis, Vittorio | Università Degli Studi Dell'Aquila |
Manes, Costanzo | Universita' Dell'Aquila |
Keywords: Stability of linear systems, Markov processes, Compartmental and Positive systems
Abstract: This work deals with 1-moment stability for continuous-time Markov Jump Linear Systems under stationary and time-varying transition rates. The analysis leverages comparison systems to exploit stability results from the context of positive systems. For the case of stationary transition rates, we introduce novel sufficient conditions of 1-moment stability that only involve Metzler matrices and can be thus checked via linear programming. Such conditions offer a computationally simpler and less restrictive stability characterization with respect to mean square stability. For the general case of time-varying transition rates, a novel approach that leverages recent results on positive time-varying systems is adopted, providing 1-moment stability conditions given in form of linear inequalities, albeit infinite-many in the general case. The theoretical findings are illustrated by means of numerical examples.
|
|
14:50-15:10, Paper WeB19.5 | |
>A New Approach to the Energy-To-Peak Performance Analysis of Continuous-Time Markov Jump Linear Systems |
|
Todorov, Marcos | LNCC |
Keywords: Stochastic systems, Switched systems, Markov processes
Abstract: The energy-to-peak (L2 to L∞) performance of continuous-time Markov jump linear systems (MJLS) is studied from a new perspective: by considering an output norm which is different from the one treated in the existing MJLS literature, we devise upper and lower bounds to the worst-case energy- to-peak gain (L2 to L∞ induced norm). The lower bound can be efficiently computed by solving an ordinary differential equation, or, if the jump process is irreducible, a Lyapunov (algebraic) equation. The upper bound is obtained from the controllability Gramian of the MJLS (a set of coupled Lyapunov equations). As a by-product of this result, we are also able to show the consistence of the corresponding L2 to L∞ system norm vis-à-vis the H2 norm of MJLS: it is proven that the H2 norm is an upper bound to the worst-case energy-to-peak gain, a feature which was not proven in other works devoted to the MJLS case. In the case without jumps, it is shown that both bounds coincide with the actual L2 to L∞ system norm.
|
|
15:10-15:30, Paper WeB19.6 | |
>Model-Based Forecasting of the Load of Parcel Pickup Points Using a Markov Jump Process |
|
Nguyen, Thi Thu Tam | Université Paris-Saclay |
Cabani, Adnane | Esigelec / Irseem |
Cabani, Iyadh | PickUp Services |
De Turck, Koen | Unervisty Gent |
Kieffer, Michel | CNRS - Univ Paris-Sud - CentraleSupelec |
Keywords: Markov processes, Modeling, Smart cities/houses
Abstract: The growth of e-commerce has resulted in a surge in parcel deliveries, increasing transportation costs and pollution issues. Alternatives to home delivery have emerged, such as the delivery to parcel pick-up points (PUPs), which eliminates delivery failure due to customers not being at home. Nevertheless, parcels reaching overloaded PUPs may need to be redirected to alternative PUPs, sometimes far from the chosen ones, which generates customer dissatisfaction. Consequently, predicting the PUP load is critical for a PUP management company to infer the availability of PUPs for future orders, better balance parcel flows between PUPs, and avoid parcel redirection. This paper proposes a new approach to forecasting the PUP load evolution using a Markov jump process that models the parcel life cycle. The latest known status of each parcel is considered to estimate its contribution to the future load of its target PUP. This approach can account for the variability of activity, the various parcel preparation delays by sellers, and the diversity of parcel carriers with specific delivery delays. Here, we predict the load of a PUP due to parcels ordered from online retailers by customers (Business-to-Customer, B2C). The proposed approach is generic and can also be applied to other parcel flows to PUPs, such as second-hand products (Customer-to-Customer, C2C) sent via a PUP network.
|
|
WeB20 |
Suite 9 |
Optimization for Nonlinear Systems |
Regular Session |
Chair: Cunis, Torbjørn | University of Stuttgart |
Co-Chair: Patrinos, Panagiotis | KU Leuven |
|
13:30-13:50, Paper WeB20.1 | |
>Breaking Limitation of Convergence in Continuous Optimization: Predefined-Time Gradient Flows |
|
Zhang, Renyongkang | Northeastern University |
Guo, Ge | Northeastern University |
Gao, Zhenyu | Northeastern University |
Yu, Miao | Northeastern University |
Zhang, Lu | Northeastern University |
Wang, Zhengsong | Northeastern University |
Han, Meng | Northeastern University |
Yin, Yanyan | Curtin University |
Keywords: Optimization algorithms, Lyapunov methods, Nonlinear systems
Abstract: This paper investigates the continuous-time unconstrained optimization via dynamical gradient flow systems. For strongly convex objective functions, a gradient flow with predefined time convergence is proposed, which is suitable for relaxing the objective function to only satisfy the Polyak-Łojasiewicz (P-Ł) condition. A predefined-time convergent Newton's flow is designed for the minimization of strictly convex functions. The proposed result is extended for strongly convex objective functions with time-varying parameters. In our schemes, the upper bound of convergence time can be used as a prior choice without involving system initialization and complex parameter calculations. The convergence of algorithms is rigorously proved by using the Lyapunov method. Numerical simulation illustrates the effectiveness of the proposed methods.
|
|
13:50-14:10, Paper WeB20.2 | |
>From Exponential to Finite/Fixed-Time Stability: Applications to Optimization |
|
Ozaslan, Ibrahim Kurban | University of Southern California |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Optimization algorithms, Lyapunov methods, Stability of nonlinear systems
Abstract: The development of finite/fixed-time stable optimization algorithms typically involves the study of specific problem instances. The lack of a unified framework hinders the understanding of more sophisticated algorithms, e.g., primal-dual gradient flow dynamics. The purpose of this paper is to address the following question: Given an exponentially stable optimization algorithm, can it be modified to obtain a finite/fixed-time stable algorithm? We provide an affirmative answer, demonstrate how the solution can be computed on a finite-time interval via a simple scaling of the right-hand side of the original dynamics, and certify the desired properties of the modified algorithm using the Lyapunov function that proves the exponential stability of the original system. Finally, we examine nonsmooth composite optimization problems and smooth problems with linear constraints to demonstrate the merits of our approach.
|
|
14:10-14:30, Paper WeB20.3 | |
>Input-To-State Stability of Newton Methods for Generalized Equations in Nonlinear Optimization |
|
Cunis, Torbjørn | University of Stuttgart |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Optimization algorithms, Stability of nonlinear systems, Numerical algorithms
Abstract: We show that Newton methods for generalized equations are input-to-state stable with respect to perturbations such as due to inexact computations. We then use this result to obtain convergence and robustness of a multistep Newton-type method for multivariate generalized equations. We demonstrate the usefulness of the results with other applications to nonlinear optimization. In particular, we provide a new proof for robust local convergence of the augmented Lagrangian method.
|
|
14:30-14:50, Paper WeB20.4 | |
>QPALM-OCP: A Newton-Type Proximal Augmented Lagrangian Solver Tailored for Quadratic Programs Arising in Model Predictive Control |
|
Lowenstein, Kristoffer | Politecnico Di Milano, ODYS Srl |
Bernardini, Daniele | ODYS S.r.l |
Patrinos, Panagiotis | KU Leuven |
Keywords: Optimization algorithms, Predictive control for nonlinear systems, Optimal control
Abstract: In model predictive control fast and reliable quadratic programming solvers are of fundamental importance. The inherent structure of the subsequent optimal control problems can lead to substantial performance improvements if exploited. Therefore, we present a structure-exploiting solver based on proximal augmented Lagrangian, extending the general-purpose quadratic programming solver QPALM. Our solver relies on semismooth Newton iterations to solve the inner sub-problem while directly accounting for the optimal control problem structure via efficient and sparse matrix factorizations. The matrices to be factorized depend on the active-set and therefore lowrank factorization updates can be employed like in activeset methods resulting in cheap iterates. We compare our solver with QPALM and other well-known solvers and show its benefits in a numerical example.
|
|
14:50-15:10, Paper WeB20.5 | |
>GPU-Accelerated Dynamic Nonlinear Optimization with ExaModels and MadNLP |
|
Pacaud, Francois | Mines Paris - PSL |
Shin, Sungho | Argonne National Laboratory |
Keywords: Optimization algorithms, Predictive control for nonlinear systems, Optimal control
Abstract: We investigate the potential of Graphics Processing Units (GPUs) to solve large-scale nonlinear programs with a dynamic structure. Using ExaModels, a GPU-accelerated automatic differentiation tool, and the interior-point solver MadNLP, we significantly reduce the time to solve dynamic nonlinear optimization problems. The sparse linear systems formulated in the interior-point method is solved on the GPU using a hybrid solver combining an iterative method with a sparse Cholesky factorization, which harness the newly released NVIDIA cuDSS solver. Our results on the classical distillation column instance show that despite a significant pre-processing time, the hybrid solver allows to reduce the time per iteration by a factor of 25 for the largest instance.
|
|
15:10-15:30, Paper WeB20.6 | |
>Accelerated Saddle Flow Dynamics for Bilinearly Coupled Minimax Problems |
|
Liu, Yingzhu | Peking University |
Mallada, Enrique | Johns Hopkins University |
Li, Zhongkui | Peking University |
You, Pengcheng | Peking University |
Keywords: Optimization algorithms, Stability of nonlinear systems
Abstract: Minimax problems have attracted much attention due to various applications in constrained optimization problems and zero-sum games. Identifying saddle points within these problems is crucial, and saddle flow dynamics offer a straightforward yet useful approach. This study focuses on a class of bilinearly coupled minimax problems with strongly convex-linear objective functions. We design an accelerated algorithm based on saddle flow dynamics, achieving a convergence rate beyond the stereotype limit (the strong convexity constant). The algorithm is derived from a sequential two-step transformation of a given objective function. First, a change of variables is applied to render the objective function better-conditioned, introducing strong concavity (from linearity) while preserving strong convexity. Second, proximal regularization, when staggered with the first step, further enhances the strong convexity of the objective function by shifting some of the obtained strong concavity. After these transformations, saddle flow dynamics based on the new objective function can be tuned for accelerated exponential convergence. Besides, such an approach can be extended to weakly convex-weakly concave functions and still guarantees exponential convergence to one stationary point. The theory is verified by a numerical test on an affine equality-constrained convex optimization problem.
|
|
WeC02 |
Amber 1 |
Learning, Optimization, and Game Theory III |
Invited Session |
Chair: Sayin, Muhammed Omer | Bilkent University |
Co-Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Doan, Thinh T. | University of Texas at Austin |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Sayin, Muhammed Omer | Bilkent University |
|
16:00-16:20, Paper WeC02.1 | |
>On the Effect of Bounded Rationality in Electricity Markets (I) |
|
Yi, Lihui | Northwestern University |
Wei, Ermin | Northwestern Univeristy |
Keywords: Game theory, Power systems, Modeling
Abstract: Nash equilibrium is a common solution concept that captures the strategic interaction in electricity market analysis. However, it requires a fundamental but impractical assumption that all market participants are fully rational, which implies unlimited computational resources and cognitive abilities. To tackle the limitation, level-k reasoning is proposed and studied to model the bounded rational behaviors. In this paper, we consider a Cournot competition in electricity markets with two suppliers both following level-k reasoning. One is a self-interested firm and the other serves as a benevolent social planner. First, we observe that the optimal strategy of the social planner is to be of a particular rationality level. Being less or more rational may both result in reduced social welfare. Then, we investigate the effect of bounded rationality on social welfare performance and find that it could largely deviate from that at the Nash equilibrium point. Finally, we characterize optimal, mean maximizing and max-min strategies for the benevolent social planner, when having access to different information. The numerical experiments further demonstrate and validate our findings.
|
|
16:20-16:40, Paper WeC02.2 | |
>Higher-Order Gradient Play Leading to Nash Equilibrium in the Bandit Setting (I) |
|
Toonsi, Sarah | University of Illinois Urbana-Champaign |
Shamma, Jeff S. | University of Illinois at Urbana-Champaign |
Keywords: Game theory, Learning
Abstract: We investigate learning in games in the bandit setting, where players only have access to their own realized payoffs. Players do not observe actions of others and do not know the functional form of their own utility functions. Of particular interest is learning mixed-strategy Nash Equilibria (NE). Prior work has shown that learning mixed-strategy NE can be impossible for broad classes of learning dynamics. Follow-up work showed that higher-order learning can overcome such limitations. In particular, for any isolated completely mixed-strategy NE in a polymatrix game, there exist continuous-time uncoupled higher-order gradient play dynamics that converge locally to that NE. Using the ODE method of stochastic approximation, we leverage these results to address the bandit setting. As an interim step, we first address a stochastic discrete-time setting where players observe actions of others. We then modify the same setup to cover the bandit case. Our primary focus will be on isolated mixed-strategy NE that can be stabilized by higher-order learning dynamics that are internally stable, or what we refer to as strongly stabilizable mixed-strategy NE. For both the action observation and the bandit case, we show that if x^* is an isolated completely mixed-strategy NE in a polymatrix game, and if x^* is strongly stabilizable, then there exist higher-order uncoupled learning algorithms that guarantee a positive probability of convergence to that NE for the original game and to perturbed NE in nearby games. We then treat the unnatural case where internally unstable dynamics are required for stabilization. We show that the same results hold under minor modifications of the stochastic algorithms. These results do not imply universal convergence by specific dynamics for all games. Rather, the implication is to establish that isolated completely mixed-strategy NE are learnable.
|
|
16:40-17:00, Paper WeC02.3 | |
>Linear Time-Invariant Solutions for LQ Optimal Control Problems with Terminal-State Affine Constraints (I) |
|
Tarantino, Lorenzo | University of Rome, Tor Vergata |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Optimal control, Optimization, Linear systems
Abstract: We study the problem of steering the state of a system from a given initial condition towards a prescribed affine set while minimizing a quadratic cost functional. While the optimal solution is defined in terms of a time-varying open-loop control law, herein the problem is solved by limiting the search for the optimal input in the space of linear time-invariant feedback control laws. This choice preserves the LTI nature of the original plant in closed loop. Within this framework, the solution of the underlying optimal control problem hinges upon the solution of a nonlinear constrained optimization problem. Constructive algorithms and comparison with the time-varying optimal control law are discussed.
|
|
17:00-17:20, Paper WeC02.4 | |
>The Forward-Reverse Greedy Algorithm for Distributed Submodular Maximization (I) |
|
Tackett, Justin | Brigham Young University |
Grimsman, David | Brigham Young University |
Keywords: Agents-based systems
Abstract: This work addresses submodular maximization problems, a widely-used mathematical tool to model many real-world decisions. Though this set of problems is NP-Hard, a well-known result is that a distributed greedy algorithm, wherein agents sequentially make greedy decisions, is guaranteed to approximate the optimal solution with a multiplicative factor of 1/2. This work explores whether running the greedy algorithm with different agent orderings can provide stronger guarantees. In particular, we show that for two-agent systems, running the distributed greedy algorithm with both possible sequences ensures a guarantee of 2/3. We also show that this method does not benefit systems of more than two agents, but opens the door to future work for exploring other ways to sequence these systems.
|
|
17:20-17:40, Paper WeC02.5 | |
>Non-Convex Learning with Guaranteed Convergence: Perspectives on Stochastic Optimal Control (I) |
|
Moreschini, Alessio | Imperial College London |
Scandella, Matteo | University of Bergamo |
Parisini, Thomas | Imperial C., Aalborg U. & Univ. of Trieste |
Keywords: Machine learning, Optimization, Stochastic optimal control
Abstract: We present and analyze a novel functional learning paradigm that operates on Reproducing Kernel Hilbert Spaces (RKHSs) without relying on the Representer Theorem, demonstrating its potential to learn stochastic optimal control policies in feedback form. Our methodology, based on the newly introduced concept of the Fréchet discrete derivative on RKHS, ensures that sequences generated by the iterative method remain within the intersection of all sublevel sets of the cost function, regardless of the chosen learning rate. In this way, we guarantee a consistent decrease in the cost function evaluation with each iteration until convergence, which is a significant finding in machine learning. By further decomposing the overall functional optimization problem into a suitable sequence of sub-problems, we create a cascade iterative method that computes each function in a domino effect. We briefly address the Witsenhausen counterexample problem, validating our convergence to a local minimum. Although the numerical solution obtained for the Witsenhausen counterexample problem is not yet on par with established ad-hoc methods, the preliminary results are promising and offer a fresh perspective in the field of stochastic optimal control.
|
|
17:40-18:00, Paper WeC02.6 | |
>Distributed Stochastic Optimal Control of Nonlinear Systems Based on ADMM |
|
Pierer von Esch, Maximilian | Institute of Automatic Control, Friedrich-Alexander-Universität |
Landgraf, Daniel | Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) |
Steffel, Matthias | Friedrich-Alexander-Universität Erlangen-Nürnberg |
Völz, Andreas | Friedrich-Alexander-University Erlangen-Nürnberg |
Graichen, Knut | University Erlangen-Nürnberg (FAU) |
Keywords: Distributed control, Stochastic optimal control, Predictive control for nonlinear systems
Abstract: This paper presents an algorithm based on the alternating direction method of multipliers (ADMM) for the distributed solution of optimal control problems of stochastic multi-agent systems with nonlinear dynamics and state/input couplings as they arise, for instance, in distributed model predictive control of uncertain systems. The solution is based on a deterministic reformulation of the original stochastic problem in which certain covariances are exchanged between agents within the ADMM iterations resulting in a scalable algorithm. The algorithm is evaluated numerically for a nonlinear example system and for energy-optimal building automation control.
|
|
WeC03 |
Amber 2 |
Networks, Games and Learning I |
Invited Session |
Chair: Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Co-Chair: Bastopcu, Melih | University of Illinois Urbana Champaign |
Organizer: Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Organizer: Zaman, Muhammad Aneeq uz | UIUC |
Organizer: Bastopcu, Melih | Bilkent University |
Organizer: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
|
16:00-16:20, Paper WeC03.1 | |
>On the Convergence of Policy Gradient for Designing a Linear Quadratic Regulator by Leveraging a Proxy System (I) |
|
Ye, Lintao | Huazhong University of Science and Technology |
Mitra, Aritra | North Carolina State University |
Gupta, Vijay | Purdue University |
Keywords: Optimal control, Data driven control, Optimization algorithms
Abstract: Policy gradient algorithms have been shown to converge to the optimal controller in a linear quadratic regulator (LQR) design problem. Calculating policy gradients using the true system such as for a robot may, however, be costly. We consider a formulation in which an ample number of gradient calculations from a proxy or approximate system such as a simulator are mixed with gradient calculations from the true system at a few time steps. Under the simplifying assumption of exact gradient calculation, we show that policy gradient in this framework can still converge to a neighborhood of the desired optimal controller. In this sense, it can reduce both the burden and cost of experimenting only on the true system and potentially obtain a less conservative controller than a purely robust control approach that relies only on the proxy system for designing the controller.
|
|
16:20-16:40, Paper WeC03.2 | |
>Linear Quadratic Zero-Sum Differential Games with Intermittent and Costly Sensing |
|
Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Maity, Dipankar | University of North Carolina at Charlotte |
Keywords: Optimal control, Game theory, Networked control systems
Abstract: In this letter, we revisit the two-player continuous-time infinite-horizon linear quadratic differential game problem, where one of the players can sample the state of the system only intermittently due to a sensing constraint while the other player can do so continuously. Under these asymmetric sensing limitations between the players, we analyze the optimal sensing and control strategies for the player at a disadvantage while the other player continues to play its security strategy. We derive an optimal sensor policy within the class of stationary randomized policies. Finally, using simulations, we show that the expected cost accrued by the first player approaches its security level as its sensing limitation is relaxed.
|
|
16:40-17:00, Paper WeC03.3 | |
>Could Anticipating Gaming Incentivize Improvement in (Fair) Strategic Classification? (I) |
|
Alhanouti, Sura | The Ohio State University |
Naghizadeh, Parinaz | University of California, San Diego |
Keywords: Game theory, Machine learning, Optimization
Abstract: As machine learning algorithms increasingly influence crucial decisions in areas like loan approvals and hiring, understanding human strategic behavior in response to these systems becomes vital. We explore strategic manipulation and improvement actions by individuals facing algorithmic decisions, the algorithm designer's role in shaping these strategic responses, and the fairness implications. We formulate these interactions as a Stackelberg game, where a firm deploys a (fair) classifier, and individuals strategically respond. Unlike previous research, our model incorporates both different costs and stochastic efficacy for manipulation and improvement. The analysis reveals different potential classes of agent responses, and characterizes optimal classifiers. Based on these, we highlight the impact of the firm's anticipation of strategic behavior, identifying cases when a (fair) strategic policy can motivate improvement while reducing manipulation.
|
|
17:00-17:20, Paper WeC03.4 | |
>Hypergame Theory for Decentralized Resource Allocation in Multi-User Semantic Communications (I) |
|
Kurisummoottil Thomas, Christo | Virginia Tech |
Saad, Walid | Virginia Tech |
Keywords: Game theory, Communication networks, Distributed control
Abstract: Semantic communications (SC) is an emerging communication paradigm in which wireless devices can send only relevant information from a source of data while relying on computing resources to regenerate missing data points. However, the design of a multi-user SC system becomes more challenging because of the computing and communication overhead required for coordination. Existing solutions for learning the semantic language and performing resource allocation often fail to capture the computing and communication tradeoffs involved in multiuser SC. To address this gap, a novel framework for decentralized computing and communication resource allocation in multi-user SC systems is proposed. The challenge of efficiently allocating communication and computing resources (for reasoning) in a decentralized manner to maximize the quality of task experience for the end users is addressed through the application of Stackelberg hypergame theory. Leveraging the concept of second-level hypergames, novel analytical formulations are developed to model misperceptions of the users about each other’s communication and control strategies. Further, equilibrium analysis of the learned resource allocation protocols examines the convergence of the computing and communication strategies to a local Stackelberg equilibria, considering misperceptions. Simulation results show that the proposed Stackelberg hypergame results in efficient usage of communication and computing resources while maintaining a high quality of experience for the users compared to state-of-the-art that does not incorporate perceptions about incomplete information.
|
|
17:20-17:40, Paper WeC03.5 | |
>Multi-Agent Fact-Checker: Adaptive Estimators (I) |
|
Verma, Ashwin | University of California San Diego |
Mohajer, Soheil | University of Minnesota |
Touri, Behrouz | University of California San Diego |
Keywords: Agents-based systems
Abstract: We consider a multi-agent dynamics for distributed fact-checking that validates the truth of a statement based on the labels of an ensemble of inexpert agents. Each agent in the system is modeled as a Binary Symmetric Channel (BSC) that incorrectly judges the veracity of each true/false statement with some probability pi_i in (0,1) which we refer to as the unreliability parameter of the agent. We introduce a class of adaptive estimators for the unreliability parameters of the agents. For the class of estimators we provide necessary conditions for the adaptive estimator to converge to the true unreliability parameters. We show that the estimators for ensembles of two and three agents eventually adhere to a consistent (fixed) update rule. Furthermore, we also show that, surprisingly, the estimator for the unreliability parameters based on the hard-decoded estimate of the statement truths fails to converge to the true unreliability parameters for any number of agents.
|
|
17:40-18:00, Paper WeC03.6 | |
>Reinforcement Learning for Joint Resource and Power Allocation in D2D Communications (I) |
|
Saeed, Ifrah | The University of Melbourne |
Cullen, Andrew Craig | University of Melbourne |
Zaidi, Zainab Razia | Monash University |
Erfani, Sarah M. | University of Melbourne |
Alpcan, Tansu | The University of Melbourne |
Keywords: Communication networks, Reinforcement learning
Abstract: Device-to-Device (D2D) communication is critical in many public safety scenarios where access to network infrastructure is not guaranteed. In such out-of-coverage situations, random resource allocation is a straightforward way for direct communication by each D2D pair. However, such a resource allocation plan often results in severe interference and low throughput, which can degrade crucial communications performance. In contrast, the problem of centralised joint resource block selection and power control optimisation to maximise the total data rate of all D2D pairs, known as the sum-rate, is non-convex and NP-hard but can effectively reduce interference. To address the inability to solve this non-convex centralised problem setting due to limited information in resource-constrained out-of-coverage scenarios, we propose two distributed reinforcement learning schemes. Our methods allow D2D pairs to autonomously make decisions on their joint resource block and power allocation to minimise their mutual interference and maximise their sum-rate, while maintaining the quality of service constraints. To evaluate these methods in out-of-coverage scenarios, we conduct extensive performance evaluations using the ns-3 network simulator designed for public safety LTE-D2D. We demonstrate that our algorithms, in the absence of any centralised coordination and neighbouring information, autonomously reach time-averaged sum-rates that are within 98.2% and 98.6% of the rates achieved by the centralised optimisation solution.
|
|
WeC04 |
Amber 3 |
Security and Resilience in Distributed Optimization and Control |
Invited Session |
Chair: Ballotta, Luca | Delft University of Technology |
Co-Chair: Bastianello, Nicola | KTH Royal Institute of Technology |
Organizer: Ballotta, Luca | Delft University of Technology |
Organizer: Pezzutto, Matthias | University of Padova |
Organizer: Bastianello, Nicola | KTH Royal Institute of Technology |
Organizer: Ferrari, Riccardo M.G. | Delft University of Technology |
Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
|
16:00-16:20, Paper WeC04.1 | |
>Multi-Agent Resilient Consensus under Intermittent Faulty and Malicious Transmissions (I) |
|
Aydin, Sarper | Harvard University |
Akgün, Orhan Eren | Harvard University |
Gil, Stephanie | Harvard University |
Nedich, Angelia | Arizona State University |
Keywords: Attack Detection, Resilient Control Systems, Communication networks
Abstract: In this work, we consider the consensus problem in which legitimate agents share their values over an undirected communication network in the presence of malicious or faulty agents. Different from the previous works, we characterize the conditions that generalize to several scenarios such as intermittent faulty or malicious transmissions, based on trust observations. As the standard trust aggregation approach based on a constant threshold fails to distinguish intermittent malicious/faulty activity, we propose a new detection algorithm utilizing time-varying thresholds and the random trust values available to legitimate agents. Under these conditions, legitimate agents almost surely determine their trusted neighborhood correctly with geometrically decaying misclassification probabilities. We further prove that the consensus process converges almost surely even in the presence of malicious agents. We also derive the probabilistic bounds on the deviation from the nominal consensus value that would have been achieved with no malicious agents in the system. Numerical results verify the convergence among agents and exemplify the deviation under different scenarios.
|
|
16:20-16:40, Paper WeC04.2 | |
>Resilience Quantification Control of Uncertain Nonlinear Interconnected Systems under Consecutive Replay Attacks (I) |
|
Li, Ying | Beijing University of Chemical Technology |
Wang, Youqing | Beijing University of Chemical Technology |
Parisini, Thomas | Imperial College & Univ. of Trieste |
Zhao, Dong | Beihang University |
Keywords: Resilient Control Systems, Neural networks, Distributed control
Abstract: We address the resilience quantification in terms of control for a class of uncertain nonlinear interconnected systems subjected to consecutive replay attacks. The distributed controllers and parameter estimators in the framework of adaptive approximation are designed to stabilize the uncertain nonlinear interconnected systems. The explicit resilience condition for the resting time between two consecutive attacks is characterized by taking into account the stability requirements. Specifically, if the resting time between two consecutive replay attacks is larger than a suitable boundary constant, then all signals of the interconnected system controlled by the adaptive distributed controllers remain bounded. Simulation results are given to show the effectiveness of our theoretical analysis.
|
|
16:40-17:00, Paper WeC04.3 | |
>Compression-Based Privacy Preservation for Distributed Nash Equilibrium Seeking in Aggregative Games |
|
Huo, Wei | HKUST |
Chen, Xiaomeng | Hong Kong University of Science and Technology |
Ding, Kemi | Southern University of Science and Technology |
Dey, Subhrakanti | Uppsala University |
Shi, Ling | Hong Kong University of Science and Technology |
Keywords: Game theory, Agents-based systems
Abstract: This paper explores distributed aggregative games in multi-agent systems. Current methods for finding distributed Nash equilibrium require players to send original messages to their neighbors, leading to communication burden and privacy issues. To jointly address these issues, we propose an algorithm that uses stochastic compression to save communication resources and conceal information through random errors induced by compression. Our theoretical analysis shows that the algorithm guarantees convergence accuracy, even with aggressive compression errors used to protect privacy. We prove that the algorithm achieves differential privacy through a stochastic compression scheme. Simulation results for energy consumption games support the effectiveness of our approach.
|
|
17:00-17:20, Paper WeC04.4 | |
>Data-Driven Command Governors for Discrete-Time LTI Systems with Linear Constraints (I) |
|
El Qemmah, Ayman | Università Della Calabria |
Casavola, Alessandro | Universita' Della Calabria |
Tedesco, Francesco | Università Della Calabria |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Predictive control for linear systems, Data driven control, Constrained control
Abstract: The Command Governor (CG) approach effectively addresses the problem of enforcing constraints on pre-compensated systems without modifying existing controllers. However, the prediction model dependence limits its use in cost-sensitive parameter identification applications. Inspired by the recent development of several Data-driven Predictive Control (DPC) algorithms and leveraging behavioral systems theory, this paper proposes a novel data-driven Command Governor scheme that bypasses explicit modeling and does not rely on a parametric system representation. By means of using an input/output trajectory of the plant and a representation of the controller, the proposed data-driven CG handles explicitly both input and output constraints. The effectiveness of the proposed approach is validated through an illustrative example.
|
|
17:20-17:40, Paper WeC04.5 | |
>Asynchronous Distributed Learning with Quantized Finite-Time Coordination (I) |
|
Bastianello, Nicola | KTH Royal Institute of Technology |
Rikos, Apostolos I. | The Hong Kong University of Science and Technology (Gz) |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Optimization algorithms, Agents-based systems, Machine learning
Abstract: In this paper we address distributed learning problems over peer-to-peer networks. In particular, we focus on the challenges of quantized communications, asynchrony, and stochastic gradients that arise in this set-up. We first discuss how to turn the presence of quantized communications into an advantage, by resorting to a finite-time, quantized coordination scheme. This scheme is combined with a distributed gradient descent method to derive the proposed algorithm. Secondly, we show how this algorithm can be adapted to allow asynchronous operations of the agents, as well as the use of stochastic gradients. Finally, we propose a variant of the algorithm which employs zooming-in quantization. We analyze the convergence of the proposed methods and compare them to state-of-the-art alternatives.
|
|
17:40-18:00, Paper WeC04.6 | |
>Model Predictive Control with Adaptive Resilience for Denial-Of-Service Attacks Mitigation on a Regulated Dam |
|
Cestari, Raffaele Giuseppe | Politecnico Di Milano |
Longari, Stefano | Politecnico Di Milano |
Zanero, Stefano | Politecnico Di Milano |
Formentin, Simone | Politecnico Di Milano |
Keywords: Cyber-Physical Security, Resilient Control Systems, Attack Detection
Abstract: In recent years, SCADA (Supervisory Control and Data Acquisition) systems have increasingly become the target of cyber attacks. SCADAs are no longer isolated, as web-based applications expose strategic infrastructures to the outside world connection. In a cyber-warfare context, we propose a Model Predictive Control (MPC) architecture with adaptive resilience, capable of guaranteeing control performance in normal operating conditions and driving towards resilience against DoS (controller-actuator) attacks when needed. Since the attackers’ goal is typically to maximize the system damage, we assume they solve an adversarial optimal control problem. An adaptive resilience factor is then designed as a function of the intensity function of a Hawkes process, a point process model estimating the occurrence of random events in time, trained on a moving window to estimate the return time of the next attack. We demonstrate the resulting MPC strategy’s effectiveness in 2 attack scenarios on a real system with actual data, the regulated Olginate dam of Lake Como.
|
|
WeC05 |
Amber 4 |
Sparsity in Systems and Control: Trends and New Challenges |
Invited Session |
Chair: Joseph, Geethu | TU Delft |
Co-Chair: Siami, Milad | Northeastern University |
Organizer: Joseph, Geethu | TU Delft |
Organizer: Siami, Milad | Northeastern University |
|
16:00-16:20, Paper WeC05.1 | |
>Invertibility of Discrete-Time Linear Systems with Sparse Inputs (I) |
|
Poe, Kyle | University of Pennsylvania |
Mallada, Enrique | Johns Hopkins University |
Vidal, Rene | University of Pennsylvania |
Keywords: Linear systems, Information theory and control, Switched systems
Abstract: One of the fundamental problems of interest for discrete-time linear systems is whether its input sequence may be recovered given its output sequence, a.k.a. the left inversion problem. Many conditions on the state space geometry, dynamics, and spectral structure of a system have been used to characterize the well-posedness of this problem, without assumptions on the inputs. However, certain structural assumptions, such as input sparsity, have been shown to translate to practical gains in the performance of inversion algorithms, surpassing classical guarantees. Establishing necessary and sufficient conditions for left invertibility of systems with sparse inputs is therefore a crucial step toward understanding the performance limits of system inversion under structured input assumptions. In this work, we provide the first necessary and sufficient characterizations of left invertibility for linear systems with sparse inputs, echoing classic characterizations for standard linear systems. The key insight in deriving these results is in establishing the existence of two novel geometric invariants unique to the sparse-input setting, the weakly unobservable and strongly reachable subspace arrangements. By means of a concrete example, we demonstrate the utility of these characterizations. We conclude by discussing extensions and applications of this framework to several related problems in sparse control.
|
|
16:20-16:40, Paper WeC05.2 | |
>Design of Sparse Control with Minimax Concave Penalty |
|
Hayashi, Naoki | Hiroshima University |
Ikeda, Takuya | The University of Kitakyushu |
Nagahara, Masaaki | Hiroshima University |
Keywords: Optimization algorithms, Constrained control, Optimal control
Abstract: In this paper, we propose a novel computational method for sparse control, also known as maximum hands-off control, using the minimax concave penalty. The sparse control problem is formulated as an L0-optimal control problem, which is known to be hard to solve. To overcome this difficulty, we propose using the minimax concave penalty as a surrogate for the L0 norm. We demonstrate the equivalence between the original and proposed control problems without relying on the normality assumption, which is typically required when approximating the L0 norm with the L1 norm. Furthermore, we present an effective numerical algorithm for the proposed optimal control based on the Alternating Direction Method of Multipliers (ADMM). A design example is shown to illustrate the effectiveness of the proposed method.
|
|
16:40-17:00, Paper WeC05.3 | |
>Parametric PDE Control with Deep Reinforcement Learning and L0 Sparse Polynomial Policies (I) |
|
Botteghi, Nicolò | University of Twente |
Fasel, Urban | University of Washington-Seattle Campus |
Keywords: Reinforcement learning, Machine learning, Chaotic systems
Abstract: Optimal control of parametric partial differential equations (PDEs) is crucial in many applications in engineering and science, such as robotics, aeronautics, chemisty, and biomedicine. In recent years, the progress in scientific machine learning has opened up new frontiers for the control of parametric PDEs. In particular, deep reinforcement learning (DRL) has the potential to solve high-dimensional and complex control problems in a large variety of applications. Most DRL methods rely on deep neural network (DNN) control policies. However, for many dynamical systems, DNN-based control policies tend to be over-parametrized, which means they need large amounts of training data, show limited robustness, and lack interpretability. In this work, we leverage dictionary learning and differentiable L_0 regularization to learn sparse, robust, and interpretable control policies for parametric PDEs. Our sparse policy architecture is agnostic to the DRL method and can be used in different policy-gradient and actor-critic DRL algorithms without changing their policy-optimization procedure. We test our approach on the challenging task of controlling a parametric Kuramoto-Sivashinsky PDE. We show that our method (1) outperforms baseline DNN-based DRL policies, (2) allows for the derivation of interpretable equations of the learned optimal control laws, and (3) generalizes to unseen parameters of the PDE without retraining the policies.
|
|
17:00-17:20, Paper WeC05.4 | |
>Event-Triggered Hierarchical Control of Distributed Network Systems under DoS Attacks (I) |
|
Yanai, Masaki | Hokkaido University |
Kobayashi, Koichi | Hokkaido University |
Yamashita, Yuh | Hokkaido University |
Keywords: Networked control systems
Abstract: A distributed network system is a dynamical system in which multiple subsystems are connected through a physical/communication network. There are many applications such as power systems. In this paper, a new method of hierarchical control for distributed network systems with switching topologies is proposed. In the proposed method, each subsystem is controlled by state-feedback using its state and the state of neighbors. Its gain is calculated by the upper controller, only when a certain event-triggering condition is satisfied. The design problem of state-feedback gains is reduced to an LMI (linear matrix inequality) optimization problem. Furthermore, focusing on the average dwell time (ADT) in the switching of topologies, the condition of ADT for the closed-loop system to be asymptotically stable under DoS (denial-of-service) attacks is derived. Finally, the proposed method is demonstrated by a numerical example.
|
|
17:20-17:40, Paper WeC05.5 | |
>Distributed Adaptive Control of Disturbed Interconnected Systems with High-Order Tuners |
|
Wafi, Moh. Kamalul | Northeastern University |
Siami, Milad | Northeastern University |
Keywords: Adaptive control, Distributed control, Network analysis and control
Abstract: This paper addresses the challenge of network synchronization under limited communication, involving heterogeneous agents with different dynamics and various network topologies, to achieve a consensus. We investigate the distributed adaptive control for interconnected unknown linear subsystems with a leader and followers, with the presence of input-output disturbance. We enhance the communication within multi-agent systems to achieve consensus under the leadership's guidance. While the measured variable is similar among the followers, the incoming measurements are weighted and constructed based on their proximity to the leader. We also explore the convergence rates across various balanced topologies (Star-like, Cyclic-like, Path, Random), featuring different numbers of agents, using distributed first and high-order tuners. Moreover, we conduct several numerical simulations across various networks, agents and tuners to evaluate the effects of sparsity in the interaction between subsystems using the L_2-norm and L_infty-norm. Some networks exhibit a trend where an increasing number of agents results in smaller errors, although this is not universally the case. Additionally, patterns observed at initial times may not reliably predict overall performance across different networks. Finally, we demonstrate that the proposed modified high-order tuners outperform its counterpart, and we provide related insights along with our conclusions.
|
|
17:40-18:00, Paper WeC05.6 | |
>Resilient Infrastructure Network: Sparse Edge Change Identification Via L1-Regularized Least Squares (I) |
|
Anguluri, Rajasekhar | University of Maryland, Baltimore County |
Keywords: Identification, Cyber-Physical Security, Smart grid
Abstract: Adversaries and rapidly changing climate disrupt operations of infrastructure networks including energy, water, manufacturing, and transportation. Unaddressed disruptions lead to system wide shutdowns, emphasizing the need for quick and robust identification methods. One significant disruption arises from edge changes (addition or deletion) in networks. We present an L1-norm regularized least-squares framework to identify multiple but sparse edge changes using noisy data. Our focus is on networks that obey equilibrium equations, as commonly observed in the above sectors. The presence or lack of edges in these networks is captured by the sparsity pattern of the weighted, symmetric Laplacian matrix, while noisy data are node injections and potentials. Our proposed framework systematically leverages the inherent structure within the Laplacian matrix, effectively avoiding overparameterization. Through a series of representative examples, with a primary emphasis on power networks, we demonstrate the robustness and efficacy of the proposed approach.
|
|
WeC06 |
Amber 5 |
Agent-Based Systems III |
Regular Session |
Chair: Charalambous, Themistoklis | University of Cyprus |
Co-Chair: Restrepo, Esteban | CNRS, INRIA Rennes – Bretagne Atlantique |
|
16:00-16:20, Paper WeC06.1 | |
>A Generalized Partitioning Strategy for Distributed Control |
|
Riccardi, Alessandro | Delft University of Technology |
Laurenti, Luca | TU Delft |
De Schutter, Bart | Delft University of Technology |
Keywords: Agents-based systems, Distributed control, Power systems
Abstract: The partitioning problem is a key problem for distributed control techniques. The problem consists in the definition of the subnetworks of a dynamical system that can be considered as individual control agents in the distributed control approach. Despite its relevance and the different approaches proposed in the literature, no generalized technique to perform the partitioning of a network of dynamical systems is present yet. In this article, we introduce a general approach to partitioning for distributed control. This approach is composed by an algorithmic part selecting elementary subnetworks, and by an integer program, which aggregates the elementary components according to a global index. We empirically evaluated our approach on a distributed predictive control problem in the context of power systems, obtaining promising performances in terms of reduction of computation speed and resource cost, while retaining a good level of performance.
|
|
16:20-16:40, Paper WeC06.2 | |
>Non-Bayesian Social Learning with Multiview Observations |
|
Sui, Dongyan | Fudan University |
Cao, Weichen | Fudan University |
Vlaski, Stefan | Imperial College London |
Guan, Chun | Fudan University |
Leng, Siyang | Fudan University |
Keywords: Agents-based systems, Distributed parameter systems, Networked control systems
Abstract: Non-Bayesian social learning enables multiple agents to conduct networked signal and information processing through observing environmental signals and information aggregating. Traditional non-Bayesian social learning models only consider single signals, limiting their applications in scenarios where multiple viewpoints of information are available. In this work, we exploit, in the information aggregation step, the independently learned results from observations taken from multiple viewpoints and propose a novel non-Bayesian social learning model for scenarios with multiview observations. We prove the convergence of the model under traditional assumptions and provide convergence conditions for the algorithm in the presence of misleading signals. Through theoretical analyses and numerical experiments, we validate the strong reliability and robustness of the proposed algorithm, showcasing its potential for real-world applications.
|
|
16:40-17:00, Paper WeC06.3 | |
>A Fully Distributed LTI Estimation Scheme Over Directed Graph Topologies |
|
Makridis, Evagoras | University of Cyprus |
Fioravanti, Camilla | University Campus Bio-Medico of Rome |
Oliva, Gabriele | University Campus Bio-Medico of Rome |
Vrakopoulou, Maria | University of Melbourne |
Charalambous, Themistoklis | University of Cyprus |
Keywords: Agents-based systems, Estimation, Decentralized control
Abstract: This letter introduces a distributed state estimation scheme for linear time-invariant (LTI) discrete-time systems, where observers, with partial observation of the system, communicate with each other over a directed and strongly connected (but not necessarily balanced) graph topology and execute a predefined number of average consensus steps in-between estimation steps. Our methodology departs from previous works in the literature in that it does not require any degree of centralized design nor relies on procedures that might be prone to numerical instability. By leveraging ratio consensus and matrix perturbation theory, we establish a convergence-guaranteeing condition for the number of consensus iterations needed between the steps of the distributed estimation process. This condition becomes the blueprint for a distributed initialization procedure, which allows the agents to collectively select an adequate number of ratio consensus steps.
|
|
17:00-17:20, Paper WeC06.4 | |
>New Results on Finite Convergence Time Mode Consensus: A Summary |
|
Huang, Chao | Tongji University |
Shim, Hyungbo | Seoul National University |
Yu, Siliang | Tongji University |
Anderson, Brian D.O. | Australian National University |
Keywords: Agents-based systems, Estimation, Networked control systems
Abstract: This paper studies the distributed mode consensus problem in a multi-agent system. Three algorithms are proposed to find the most frequent attribute (the mode) owned by the agents via distributed computation. The first algorithm computes the frequency of each attribute using consensus protocols rooted in blended dynamics, then identifies the most frequent attribute as the mode. The second algorithm, under the assumption that each agent possesses a priori knowledge of a minimum frequency for the mode, can decrease the frequency computations required at each agent for large lower bounds. In contrast, the third algorithm eliminates the necessity for such information by implementing an adaptive updating mechanism. These algorithms successfully determine the mode within a finite time frame, and predictive estimates for convergence time are included. Moreover, the first and second algorithms demonstrate plug-and-play property with a dwell time.
|
|
17:20-17:40, Paper WeC06.5 | |
>Understanding the Impact of Coalitions between EV Charging Stations |
|
Kudva, Sukanya | UC Berkeley |
Kulkarni, Kshitij | University of California, Berkeley |
Maheshwari, Chinmay | University of California Berkeley |
Aswani, Anil | UC Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Agents-based systems, Game theory, Power systems
Abstract: The rapid growth of electric vehicles (EVs) is driving the expansion of charging infrastructure globally. As charging stations become ubiquitous, their substantial electricity consumption can influence grid operation and electricity pricing. Naturally, textit{some} groups of charging stations, which could be jointly operated by a company, may coordinate to decide their charging profile. While coordination among all charging stations is ideal, it is unclear if coordination of some charging stations is better than no coordination. In this paper, we analyze this intermediate regime between no and full coordination of charging stations. We model EV charging as a non-cooperative aggregative game, where each station's cost is determined by both monetary payments tied to reactive electricity prices on the grid and its sensitivity to deviations from a desired charging profile. We consider a solution concept that we call mathcal{C}-Nash equilibrium, which is tied to a coalition mathcal{C} of charging stations coordinating to reduce their costs. We provide sufficient conditions, in terms of the demand and sensitivity of charging stations, to determine when independent (aka uncoordinated) operation of charging stations could result in lower overall costs to charging stations, coalition and charging stations outside the coalition. Somewhat counter to common intuition, we come up with scenarios where allowing charging stations to operate independently is better than coordinating as a coalition. Jointly, these results provide operators of charging stations insights into how to coordinate their charging behavior, and open several research directions.
|
|
17:40-18:00, Paper WeC06.6 | |
>Simultaneous Topology Estimation and Synchronization of Dynamical Networks with Time-Varying Topology |
|
Wang, Nana | Royal Institute of Technology (KTH) |
Restrepo, Esteban | CNRS, INRIA Rennes – Bretagne Atlantique |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Agents-based systems, Identification for control, Cooperative control
Abstract: We propose an adaptive control strategy for the simultaneous estimation of topology and synchronization in complex dynamical networks with unknown, time-varying topology. Our approach transforms the problem of time-varying topology estimation into a problem of estimating the time-varying weights of a complete graph, utilizing an edge-agreement framework. We introduce two auxiliary networks: one that satisfies the persistent excitation condition to facilitate topology estimation, while the other, a uniform-delta persistently exciting network, ensures the boundedness of both weight estimation and synchronization errors, assuming bounded time-varying weights and their derivatives. A relevant numerical example shows the efficiency of our methods.
|
|
WeC07 |
Amber 6 |
Cooperative Control II |
Regular Session |
Chair: Maestre, Jose Maria (Pepe) | University of Seville |
Co-Chair: Aranda, Miguel | Universidad De Zaragoza |
|
16:00-16:20, Paper WeC07.1 | |
>Cooperative Nonlinear Distributed Model Predictive Control with Dissimilar Control Horizons |
|
Chanfreut, Paula | Eindhoven University of Technology |
Maestre, Jose Maria (Pepe) | University of Seville |
Zhu, Quanyan | New York University |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Keywords: Cooperative control, Distributed control, Predictive control for nonlinear systems
Abstract: In this paper, we introduce a nonlinear distributed model predictive control (DMPC) algorithm, which allows for dissimilar and time-varying control horizons among agents, thereby addressing a common limitation in current DMPC schemes. We consider cooperative agents with varying computational capabilities and operational objectives, each willing to manage varying numbers of optimization variables at each time step. Recursive feasibility and a non-increasing evolution of the optimal cost are proven for the proposed algorithm. Through numerical simulations on systems with three agents, we show that our approach effectively approximates the performance of traditional DMPC, while reducing the number of variables to be optimized. This advancement paves the way for a more decentralized yet coordinated control strategy in various applications, including power systems and traffic management.
|
|
16:20-16:40, Paper WeC07.2 | |
>Adaptive Observer from Body-Frame Relative Position Measurements |
|
De Carli, Nicola | CNRS, Université De Rennes, IRISA Laboratory |
Restrepo, Esteban | CNRS, INRIA Rennes – Bretagne Atlantique |
Robuffo Giordano, Paolo | Centre National De La Recherche Scientifique (CNRS) |
Keywords: Cooperative control, Estimation, Distributed control
Abstract: In this work, we propose an observer scheme to estimate in a common frame the position and yaw orientation of a group of robots from body-frame relative position measurements. The state of the robots is represented by their position and yaw orientation (R3 × S1). The graph representing the sensing interaction among the robots is directed and it is only required to be weakly connected in addition to satisfy certain persistency of excitation conditions. Our proposed observer scheme consists of three distinct components. Firstly, an adaptive observer is used to estimate the relative yaw corresponding to each persistently exciting edge. Subsequently, an observer is used to estimate the yaw orientation in a common frame based on these estimated relative yaws. Finally, a third observer is employed to estimate the positions in this common frame, using the estimated yaw orientations. The stability of the whole system is investigated and numerical simulations validate the theoretical findings.
|
|
16:40-17:00, Paper WeC07.3 | |
>Fast Consensus Topology Design Via Minimizing Laplacian Energy |
|
Lu, Susie | Stanford OHS |
Liu, Ji | Stony Brook University |
Keywords: Cooperative control, Distributed control, Network analysis and control
Abstract: This paper characterizes the graphical properties of an optimal topology with minimal Laplacian energy under the constraint of fixed numbers of vertices and edges, and devises an algorithm to construct such connected optimal graphs. These constructed graphs possess maximum vertex and edge connectivity, and more importantly, generically exhibit large algebraic connectivity of an optimal order provided they are not sparse. These properties guarantee fast and resilient consensus processes over these graphs.
|
|
17:00-17:20, Paper WeC07.4 | |
>Cooperative Control of Constrained Discrete-Time Multi-Train Systems: A Fully Distributed Approach |
|
Zhang, Zhixin | Central South University |
Chen, Zhiyong | The University of Newcastle |
Fang, Wentuo | Central South University, China |
Keywords: Cooperative control, Constrained control, Transportation networks
Abstract: The paper investigates fully distributed cooperative control for discrete-time multi-train systems, focusing on managing transient constraints on position differences between neighboring trains, as well as on individual train velocities and inputs. Existing research on this topic is limited and typically requires a linear network structure, with each train having access to network structural information, such as the Laplacian matrix. However, travel routes may vary significantly, and the network structure is prone to change. Consequently, this study introduces a fully distributed control law that solely relies on relative position information from neighboring trains, offering adaptability to changes in the network structure of actual multi-train systems. Technically, the paper ingeniously converts both position and velocity constraints into input constraints, guiding the controller design to achieve the objectives of transient constraints.
|
|
17:20-17:40, Paper WeC07.5 | |
>Distributed Formation Control for Unicycle Agents Using Fourier Descriptors |
|
Aranda, Miguel | Universidad De Zaragoza |
Aldana-López, Rodrigo | Universidad De Zaragoza |
Moya-Lasheras, Eduardo | Universidad De Zaragoza |
Aragues, Rosario | Universidad De Zaragoza |
Sagues, Carlos | Universidad De Zaragoza |
Keywords: Cooperative control, Distributed control, Autonomous robots
Abstract: This paper studies the control of multiagent formations in which the Fourier descriptors of the sequence of agent positions are constrained to be zero outside a selected set of low frequencies. This specification enables the mobile agents to form a low-frequency discretized planar closed curve without sharp variation while accommodating high flexibility of the curve's shape. We propose a novel approach for controlling this type of formation assuming a motion model with unicycle kinematics and velocity saturation. Our approach uses a distributed estimator to compute the values of the Fourier descriptors at the selected set of low frequencies, and employs these estimates in a gradient-based motion control strategy. We exploit existing techniques to define a convergent gradient-based control law under the considered motion constraints. Compared to prior work in this problem, our approach only requires distributed interactions between agents, and explicitly considers realistic motion constraints. We also study how to combine descriptors from different sets of frequencies, which enhances transient control performance. We present simulation examples to illustrate the properties of the proposed approach.
|
|
17:40-18:00, Paper WeC07.6 | |
>Circumnavigation of UAVs Integrating QP-Based CLF and CBF in a Cooperative Path-Following Framework |
|
Anand, Pallov | Faculty of Engineering, University of Porto |
Aguiar, A. Pedro | Faculty of Engineering, University of Porto |
Pb, Sujit | IISER Bhopal |
Keywords: Cooperative control, Lyapunov methods, Optimization
Abstract: Unmanned Aerial Vehicles (UAVs) are increasingly utilized in various applications often requiring coordinated operations of multiple UAVs. Circular formation control has garnered significant attention due to its diverse relevance in many situations. In this paper, we propose a Quadratic Programming (QP) based Control Lyapunov Function (CLF) and Control Barrier Function (CBF) framework for circumnavigating stationary and moving targets for single and multiple UAVs while avoiding collision with a known obstacle. By integrating CLF and CBF constraints in a cooperative path-following framework, our approach offers robustness and scalability for complex multi-agent systems, and ensures asymptotic tracking of the target while maintaining closed-loop stability of the error dynamics and safety margins around the obstacle. Through theoretical analysis and simulation studies, we demonstrate the effectiveness and versatility of our proposed framework in challenging environments.
|
|
WeC08 |
Amber 7 |
Optimization III |
Regular Session |
Chair: Sojoudi, Somayeh | UC Berkeley |
Co-Chair: Hendrickx, Julien M. | UCLouvain |
|
16:00-16:20, Paper WeC08.1 | |
>Spatio-Temporal Communication Compression in Distributed Prime-Dual Flows |
|
Ren, Zihao | Zhejiang University |
Wang, Lei | Zhejiang University |
Yuan, Deming | Nanjing University of Science and Technology |
Su, Hongye | Zhejiang Univ |
Shi, Guodong | The University of Sydney |
Keywords: Optimization, Communication networks, Networked control systems
Abstract: In this paper, we study distributed prime-dual flows for multi-agent optimization with spatio-temporal compressions. The central aim of multi-agent optimization is for a network of agents to collaboratively solve a system-level optimization problem with local objective functions and node-tonode communication by distributed algorithms. The scalability of such algorithms crucially depends on the complexity of the communication messages, and a number of communication compressors for distributed optimization have recently been proposed in the literature. First of all, we introduce a general spatio-temporal compressor characterized by the stability of the resulting dynamical system along the vector field of the compressor. We show that several important distributed optimization compressors such as the greedy sparsifier, the uniform quantizer, and the scalarizer all fall into the category of this spatio-temporal compressor. Next, we propose two distributed prime-dual flows with the spatio-temporal compressors being applied to local node states and local error states, respectively, and prove (exponential) convergence of the node trajectories to the global optimizer for (strongly) convex cost functions. Finally, a few numerical examples are present to illustrate our theoretical results.
|
|
16:20-16:40, Paper WeC08.2 | |
>On the Set of Possible Minimizers of a Sum of Convex Functions |
|
Zamani, Moslem | UCLouvain |
Glineur, François | UCLouvain |
Hendrickx, Julien M. | UCLouvain |
Keywords: Optimization, Decentralized control
Abstract: Consider a sum of convex functions, where the only information known about each individual summand is the location of a minimizer. In this work, we give an exact characterization of the set of possible minimizers of the sum. Our results cover several types of assumptions on the summands, such as smoothness or strong convexity. Our main tool is the use of necessary and sufficient conditions for interpolating the considered function classes, which leads to shorter and more direct proofs in comparison with previous work. We also address the setting where each summand minimizer is assumed to lie in a unit ball, and prove a tight bound on the norm of any minimizer of the sum.
|
|
16:40-17:00, Paper WeC08.3 | |
>Tikhonov Regularized Exterior Penalty Dynamics for Constrained Variational Inequalities |
|
Staudigl, Mathias | Universität Mannheim |
Qu, Siqi | University of Mannheim |
Keywords: Optimization, Distributed control, Constrained control
Abstract: Solving equilibrium problems under constraints is an important problem in optimization and optimal control. In this context an important practical challenge is the efficient incorporation of constraints. We develop a continuous-time method for solving constrained variational inequalities based on a new penalty regulated dynamical system in a general potentially infinite-dimensional Hilbert space. In order to obtain strong convergence of the issued trajectory of our method, we incorporate an explicit Tikhonov regularization parameter in our method, leading to a class of time-varying monotone inclusion problems featuring multiscale aspects. Besides strong convergence, we illustrate the practical efficiency of our developed method in solving constrained min-max problems.
|
|
17:00-17:20, Paper WeC08.4 | |
>Approximately Gaussian Replicator Flows: Nonconvex Optimization As a Nash-Convergent Evolutionary Game |
|
Anderson, Brendon G. | California Polytechnic State University |
Pfrommer, Samuel | University of California, Berkeley |
Sojoudi, Somayeh | UC Berkeley |
Keywords: Optimization, Evolutionary computing
Abstract: This work leverages tools from evolutionary game theory to solve unconstrained nonconvex optimization problems. Specifically, we lift such a problem to an optimization over probability measures, whose minimizers exactly correspond to the Nash equilibria of a particular population game. To algorithmically solve for such Nash equilibria, we introduce approximately Gaussian replicator flows (AGRFs) as a tractable alternative to simulating the corresponding infinite-dimensional replicator dynamics. Our proposed AGRF dynamics can be integrated using off-the-shelf ODE solvers when considering objectives with closed-form integrals against a Gaussian measure. We theoretically analyze AGRF dynamics by explicitly characterizing their trajectories and stability on quadratic objective functions, in addition to analyzing their descent properties. Our methods are supported by illustrative experiments on a range of canonical nonconvex optimization benchmark functions.
|
|
17:20-17:40, Paper WeC08.5 | |
>Feedforward Control of Flat Hybrid Automata: A Behavioral Systems Theory Approach |
|
Zahn, Frederik | Karlsruhe Institute of Technology |
Kleinert, Tobias | RWTH Aachen University |
Hagenmeyer, Veit | Karlsruhe Institute of Technology (KIT) |
Keywords: Optimization, Nonlinear systems, Hybrid systems
Abstract: Differential flatness has been defined in the literature for continuous time dynamical systems and for discrete time systems. We define flatness of automata from the perspective of behavioral systems theory, and synthesize a flatness definition for hybrid automata. These new definitions are allow for feedforward trajectory generation for flat hybrid automata. Additionally, we provide necessary and sufficient conditions of hybrid automaton flatness for different types of continuous state constraints and switching conditions, and how to compute whether a hybrid automaton is flat. We illustrate feedforward trajectory generation based on our findings for a robotic manipulator example.
|
|
17:40-18:00, Paper WeC08.6 | |
>An Integer Programming Approach for Angular Coverage under Uncertainty |
|
Zhang, Yuntian | Beijing Institute of Technology |
Chen, Chen | Beijing Institute of Technology |
Ding, Shuxin | China Academy of Railway Sciences Corporation Limited |
Deng, Fang | Beijing Institute of Technology |
Keywords: Optimization, Modeling, Sensor networks
Abstract: This paper investigates angular coverage under uncertainty (ACU). A compact integer programming (IP) formulation is developed to model the angular field-of-view (FoV) of sensors and probabilistic coverage under uncertainty. The IP formulation minimizes the weighted non-coverage probability over the target set as well as considering the practical co-location and budget constraints. Recognizing the non-linearity, non-convexity, and non-separability of ACU, we first introduce the reformulation-linearisation technique (RLT) to obtain a tractable mixed-integer linear programming model which provides a tight lower bound for the original problem. Further, we exploit the structure of the mathematical model and customize a branch-and-cut (B&C) algorithm to solve the derived problem exactly. We show that the solution for the derived problem can also solve the original problem based on the bounding scheme. Computational experiments on a series of problem instances ranging from moderate to large size scaling up to 4,000 dimensional decision variables reveal the effectiveness and efficiency of the proposed exact approach.
|
|
WeC09 |
Amber 8 |
Extremum Seeking Control |
Regular Session |
Chair: Weber, Marc | RWTH Aachen University |
Co-Chair: Fridman, Emilia | Tel-Aviv Univ |
|
16:00-16:20, Paper WeC09.1 | |
>Delay-Robust Unbiased Extremum Seeking for 1D Static Maps Via a Delay-Free Transformation |
|
Jbara, Adam | Tel-Aviv University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Extremum seeking, Delay systems, Stability of nonlinear systems
Abstract: In this paper, we consider the recently introduced extremum seeking (ES) with unbiased convergence of an uncertain 1D static map with a constant and known measurement delay. We present a novel efficient constructive approach for averaging-based stability of the unbiased ES algorithm via a delay-free transformation. The transformed ES system is presented in the form of an averaged linear time-delayed system with perturbations. Then, to find a lower bound on the dither frequency for exponential stability, we employ the variation of constants formula, and exploit tight bounds on the fundamental solution of the averaged system with delay. Explicit quantitative conditions in terms of simple scalar inequalities are established which ensure the exponential unbiased convergence of the original ES system. Moreover, we obtain a simple inequality that guarantees the practical stability of the classical ES control system. The results are semi-global and ES parameters can be found for any large delay. A numerical example from the literature demonstrates the efficiency of the proposed results with the essential improvement of the previous results for practical stability.
|
|
16:20-16:40, Paper WeC09.2 | |
>Bounded Extremum Seeking for Single-Variable Static Map Using State Transformation |
|
Mazenc, Frederic | Inria Saclay |
Malisoff, Michael | Louisiana State University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Extremum seeking, Delay systems, Uncertain systems
Abstract: We solve a gradient based bounded extremum seeking problem for single-variable static maps in the presence of time-varying piecewise continuous measurement uncertainty. Instead of using previously reported averaging-based methods, we introduce a state transformation, allowing us to use new comparison function and generalized Lyapunov function approaches to obtain our ultimate bounds on the parameter estimation error. We illustrate significant advantages of our new method, including less restrictive conditions on the extremum seeking parameters, as compared with previous methods.
|
|
16:40-17:00, Paper WeC09.3 | |
>Extremum-Seeking Policy Iteration for Data-Driven LQR |
|
Carnevale, Guido | University of Bologna |
Mimmo, Nicola | University of Bologna |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Extremum seeking, Optimal control, Linear systems
Abstract: In this paper, we propose a data-driven strategy to iteratively find the state feedback gain matrix solving a Linear Quadratic Regulator (LQR) problem in a model-free fashion, i.e., under unknown system and cost matrices. In our setup, we assume that, at each iteration, an oracle provides the LQR cost of the tentative policy, e.g., by running the system or a simulator. Based on this information, we develop an algorithm based on Extremum-Seeking to iteratively refine our tentative solution without any additional knowledge on the system and cost models. By using a Lyapunov-based approach exploiting averaging theory for time-varying systems, we show that the proposed algorithm exponentially converges to an arbitrarily small ball containing the optimal gain matrix. We corroborate the theoretical results by testing the proposed strategy via numerical simulations.
|
|
17:00-17:20, Paper WeC09.4 | |
>Using Artificial Delays for Stabilization of Linear Second-Order Systems under Unknown Control Directions by Extremum Seeking Controller |
|
Zhang, Jin | Shanghai University |
Meng, Fu | Shanghai University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Extremum seeking, Stability of linear systems, Linear systems
Abstract: We consider a linear second-order system subject to unknown control directions under the measurements of the position whereas the velocity is not available for measurements. Such system can be stabilized by an extremum seeking (ES) controller using the position and velocity. In this paper, the velocity is approximated via a finite difference leading to a delay-dependent ES controller. By applying the recently proposed time-delay approach to Lie-Brackets-based averaging method, we transform the closed-loop system to a time-delay (neutral type) one, which has a form of perturbed Lie brackets system. The input-to-state stability (ISS) of the time-delay system guarantees the same for the original one. Then, by employing variation of constants formula we derive explicit conditions in terms of simple inequalities for finding the quantitative bounds on the dither period and delay that ensure the regional ISS. An example is provided to illustrate the efficiency of the results.
|
|
17:20-17:40, Paper WeC09.5 | |
>Inferring Global Exponential Stability Properties Using Lie-Bracket Approximations |
|
Weber, Marc | RWTH Aachen University |
Gharesifard, Bahman | University of California, Los Angeles |
Ebenbauer, Christian | RWTH Aachen University |
Keywords: Extremum seeking, Stability of nonlinear systems, Algebraic/geometric methods
Abstract: In the present paper, a novel result for inferring uniform global, not semi-global, exponential stability in the sense of Lyapunov with respect to input-affine systems from global uniform exponential stability properties with respect to their associated Lie-bracket systems is shown. The result is applied to adapt dither frequencies to find a sufficiently high gain in adaptive control of linear unknown systems, and a simple numerical example is provided to support the theoretical findings.
|
|
17:40-18:00, Paper WeC09.6 | |
>Extremum Seeking Is Stable for Scalar Maps That Are Strictly but Not Strongly Convex |
|
McNamee, Patrick | San Diego State University |
Krstic, Miroslav | University of California, San Diego |
Nili Ahmadabadi, Zahra | San Diego State University |
Keywords: Extremum seeking, Stability of nonlinear systems
Abstract: For a map that is strictly but not strongly convex, a model-based gradient descent method has an eigenvalue of zero at the extremum, i.e., it fails at exponential convergence. Interestingly, perturbation-based model-free extremum seeking has a negative Jacobian, in the average, meaning that its (practical) convergence is exponential, even though the map’s Hessian is zero at the extremum. While these observations for the gradient algorithm are not trivial, we focus in this paper on an even more nontrivial study of the same phenomenon for Newton-based extremum seeking control (NESC). NESC is a second-order method which corrects for the unknown Hessian of the unknown map, not only in order to speed up parameter convergence, but also (1) to make the convergence rate user-assignable in spite of the unknown Hessian, and (2) to equalize the convergence rates in different directions for multivariable maps. Previous NESC work established stability only for maps whose Hessians are strictly positive definite everywhere, so the Hessian is invertible everywhere. For a scalar map, we establish the rather unexpected property that, even when the map is strictly convex but not strongly convex, i.e., when the Hessian may be zero, NESC guarantees practical asymptotic stability, semiglobally. While a model-based Newton method would run into non-invertibility of the Hessian, the perturbation-based NESC, surprisingly, avoids this challenge by leveraging the fact that the average of the perturbation-based Hessian estimate is always positive, even though the actual Hessian may be zero.
|
|
WeC10 |
Brown 1 |
Modeling, Analysis and Control for Systems and Synthetic Biology |
Invited Session |
Chair: Bandiera, Lucia | University of Edinburgh, School of Engineering, IBioE |
Co-Chair: Salzano, Davide | Scuola Superiore Meridionale |
Organizer: Bandiera, Lucia | University of Edinburgh, School of Engineering, IBioE |
Organizer: Bellato, Massimo | Università Di Padova |
Organizer: Fiore, Davide | University of Naples Federico II |
Organizer: Salzano, Davide | Scuola Superiore Meridionale |
|
16:00-16:20, Paper WeC10.1 | |
>Design of a Sequestration-Based Network with Tunable Pulsing Dynamics (I) |
|
Nakamura, Eiji | University of California, Los Angeles |
Cuba Samaniego, Christian | University of California Los Angeles |
Blanchini, Franco | Univ. Degli Studi Di Udine |
Giordano, Giulia | University of Trento |
Franco, Elisa | University of California a Los Angeles |
Keywords: Biological systems, Biomolecular systems, Systems biology
Abstract: Incoherent feedforward networks exhibit the abil- ity to generate temporal pulse behavior. However, exerting control over specific dynamic properties, such as amplitude and rise time, poses a challenge and is intricately tied to the network’s implementation. In this study, we focus on analyzing sequestration-based networks capable of exhibiting pulse behavior. By employing time-scale separation in the fast sequestration regime, we approximate the temporal dynamics of these networks. This approach allows us to establish a mapping that elucidates the impact of varying the kinetic rates and pulse specifications, including amplitude and rise time. Furthermore, we introduce a positive feedback mechanism to regulate the amplitude of the pulsing response.
|
|
16:20-16:40, Paper WeC10.2 | |
>Neural Networks Built from Enzymatic Reactions Can Operate As Linear and Nonlinear Classifiers (I) |
|
Cuba Samaniego, Christian | University of California Los Angeles |
Wallace, Emily | University of Michigan |
Blanchini, Franco | Univ. Degli Studi Di Udine |
Franco, Elisa | University of California a Los Angeles |
Giordano, Giulia | University of Trento |
Keywords: Biological systems, Biomolecular systems, Genetic regulatory systems
Abstract: The engineering of molecular programs capable of processing patterns of multi-input biomarkers holds great potential in applications ranging from in vitro diagnostics (e.g., viral detection, including COVID-19) to therapeutic interventions (e.g., discriminating cancer cells from normal cells). For this reason, mechanisms to design molecular networks for pattern recognition are highly sought after. In this work, we explore how enzymatic networks can be used for both linear and nonlinear classification tasks. By leveraging steady-state analysis and showing global stability, we demonstrate that these networks can function as molecular perceptrons, fundamental units of artificial neural networks—capable of processing multiple inputs associated with positive and negative weights to achieve linear classification. Furthermore, by composing orthogonal enzymatic reactions, we show that multi-layer networks can be constructed to achieve nonlinear classification.
|
|
16:40-17:00, Paper WeC10.3 | |
>SynthEvo: A Gradient-Guided Evolutionary Approach for Synthetic Circuit Design (I) |
|
Rossi, Nicolo' | ETH Zurich |
Gupta, Ankit | ETH Zürich |
Khammash, Mustafa H. | ETH Zurich |
Keywords: Computational methods, Biomolecular systems, Machine learning
Abstract: It is well-established that complex intracellular mechanisms often comprise simpler ``motifs" specializing in specific cellular functions. Consequently, it is crucial to systematically explore various motif topologies for a chosen function. Achieving this topological characterization can be facilitated by evolutionary algorithms, but the main challenge is to adequately explore the vast topological search space to identify optimal configurations for specific functions. In this paper, we aim to address this challenge by initially employing a ``fully-connected" topology and then evolving connection strengths (i.e. reaction rates), through gradient-descent to optimise for both connection sparsity as well as effectiveness in fulfilling the desired function. We call this method SynthEvo and we illustrate its effectiveness in discovering circuit topologies for two important synthetic biology functions: near-perfect adaptation and ultrasensitivity.
|
|
17:00-17:20, Paper WeC10.4 | |
>Designing High Performance Whole-Cell Biosensors by Integrating Synthetic Biology with Control Engineering (I) |
|
Fusco, Virginia | University of Naples Federico II |
Fiore, Davide | University of Naples Federico II |
di Bernardo, Mario | University of Naples Federico II |
di Bernardo, Diego | Telethon Institute of Genetics and Medicine |
Keywords: Genetic regulatory systems, Biotechnology, Biomolecular systems
Abstract: Transcription-based whole-cell biosensors (WCBs) are cells engineered with an analyte-responsive promoter that regulates the expression of a reporter gene, enabling continuous monitoring of specific biomolecules in vitro and in vivo. Despite their significant potential, the design of WCBs often involves a time-consuming trial-and-error process, resulting in suboptimal sensor performance. To overcome these challenges, we propose a theory-driven approach that integrates Synthetic Biology and Control Theory to enhance the biosensor performance by means of biomolecular circuits. Specifically, we first defined five main features of a biosensor from its input-output response (leaki- ness, fold-change, sensitivity, operating range and linearity). We then show how to optimize each feature by designing alternative biomolecular circuits to obtain a final configuration that closely approximates the characteristics of an ideal biosensor.
|
|
17:20-17:40, Paper WeC10.5 | |
>Integrating Kinetic Models with Genome-Scale Constraints to Explain Yeast-Based Batch Fermentation (I) |
|
Rodríguez Moimenta, Artai | Biosystems and Bioprocess Engineering Group. IIM-CSIC |
Troitiño-Jordedo, Diego | Instituto De Investigacións Mariñas (CSIC) |
Balsa-Canto, Eva | Instituto De Investigaciones Marinas |
Keywords: Systems biology, Modeling, Biological systems
Abstract: Biotechnological processes utilizing cell factories hold great promise for producing various compounds through fermentation. Yeast species have emerged as crucial players in this field, contributing to producing pharmaceuticals, food ingredients, and beverages through batch fermentation. However, the effectiveness of these processes relies on optimizing fermentation parameters, including yeast species or strains, medium composition, and extracellular abiotic conditions. This work addresses the multi-scale modeling of batch fermentation, highlighting the need for suitable approaches to describe primary and secondary metabolism. We show that combining dynamic mechanistic and genome-scale constrained models can do the job, and we illustrate our point with some recent results on modeling batch fermentation processes led by Saccharomyces yeast species. The proposed in silico modeling may facilitate fine-tuning fermentation conditions and manipulating metabolic pathways, thus generating sustainable alternatives in pharmaceuticals and food production and driving the progression towards a greener future.
|
|
17:40-18:00, Paper WeC10.6 | |
>Ratiometric Control of Two Microbial Populations Via a Dual Chamber Bioreactor |
|
Brancato, Sara Maria | University of Naples Federico II |
Salzano, Davide | Scuola Superiore Meridionale |
Fiore, Davide | University of Naples Federico II |
Russo, Giovanni | University of Salerno |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Systems biology, Process Control, Biological systems
Abstract: Maintaining stable coexistence in microbial consortia, particularly when one species grows faster than another (i.e. the species are non-complementary), poses significant challenges. We introduce a novel control architecture that employs two bioreactors. In this system, the slower-growing species is cultivated separately before being introduced into the main mixing chamber. We analyze the open-loop dynamics of this setup and propose a switching feedback mechanism that controls the dilution rates to ensure robust regulation of population density and composition within the microbial consortium. Validated in silico using parameters from real experiments, our approach demonstrates effective and robust maintenance of microbial balance across various strains without requiring genetic modifications.
|
|
WeC11 |
Brown 2 |
Switched Systems II |
Regular Session |
Chair: Deaecto, Grace S. | FEM/UNICAMP |
Co-Chair: Incremona, Gian Paolo | Politecnico Di Milano |
|
16:00-16:20, Paper WeC11.1 | |
>Feedback Stabilization of Discrete-Time Switched Systems under Büchi-Constrained Signals |
|
Della Rossa, Matteo | University of Udine |
Alves Lima, Thiago | L2S, CentraleSupelec |
Girard, Antoine | CNRS |
Keywords: Switched systems, Automata, LMIs
Abstract: This manuscript studies the feedback stabilization problem for a class of discrete-time switched systems. The goal is the design, via semidefinite optimization techniques, of feedback control rules depending only on the current state variable and on the past values of the underlying switching sequence. The resulting control policy achieves uniform exponential stabilization over a pre-constructed class of switching signals. The overall construction generalizes known approaches for stabilization over arbitrary switching sequences, but it is able to stabilize systems for which none of the defining subsystems is stabilizable. This extension is obtained employing graph-theoretic tools, introducing the Büchi automata formalism in order to specify the considered classes of admissible sequences, seen here in the general setting of omega-regular languages. The proposed construction is finally illustrated with the help of a numerical example.
|
|
16:20-16:40, Paper WeC11.2 | |
>A Switching Model Predictive Control for Collision-Free Path-Tracking of Mobile Robots |
|
Yuca Huanca, Chrystian Pool Edmundo | Politecnico Di Milano |
Incremona, Gian Paolo | Politecnico Di Milano |
Colaneri, Patrizio | Politecnico Di Milano |
Keywords: Switched systems, Autonomous robots, Predictive control for nonlinear systems
Abstract: This paper addresses the problem of collision-free path-tracking of a team of mobile robots, which move in the environment in the presence of obstacles. A model predictive control approach is presented, relying on the switched-system formalism to capture each robot model. Specifically, a minimal description of their dynamics, consisting of two modes (rototraslation around a fixed pivot and rotation on spot) capable of efficiently capturing most of the possible motions on the plane, is considered to reduce the curse of dimensionality. Moreover, in order to enable robots to roam around while avoiding collisions and maintaining low computational complexity, obstacle avoidance constraints are relaxed to linear form and included in the optimization problem. The proposal is finally assessed both in simulation and experimentally on the Robotarium remote arena, in comparison with an intrusion-based obstacle avoidance strategy.
|
|
16:40-17:00, Paper WeC11.3 | |
>Stabilizability of Switched Differential Algebraic Equations: A Solvability-Based Approach |
|
Harivanam, Phani Raj | Indian Institute of Technology Bombay |
Pal, Debasattam | Indian Institute of Technology Bombay |
Keywords: Switched systems, Differential-algebraic systems, Stability of hybrid systems
Abstract: In this paper, we establish a Lie algebraic solvability-based criterion for stabilizability of switched differential algebraic equations (switched DAEs) consisting of individually unstable subsystems. It is well known that the Lie algebra generated by commuting matrices is trivially solvable. Thus, the Lie algebraic stabilizability criterion of this paper generalizes the existing commutativity-based criterion in the literature. Furthermore, for switched DAEs satisfying the Lie algebraic criterion, we explicitly provide a simple method to construct a stabilizing switching signal and demonstrate the same with the help of an example.
|
|
17:00-17:20, Paper WeC11.4 | |
>Formal Stabilization of a Coupled ODE-PDE Switched System |
|
Le Coent, Adrien | Université Paris-Est Créteil |
Vacher, Jonathan | Université Paris Cité, CNRS |
Kergrene, Kenan | Université De Technologie De Compiègne, Laboratoire Roberval |
Keywords: Switched systems, Formal Verification/Synthesis, Stability of hybrid systems
Abstract: Partial Differential Equations (PDEs) are a ubiquitous model that describes a wide range of dynamical systems. While stabilization of systems involving PDEs is an important problem, little work has been done on formal verification and synthesis for such systems. In this paper, we explain how a coupled ODE-PDE control problem can be formally stabilized using a tiling based control synthesis algorithm associated to set based reachability, which is usually used on finite dimensional problems. To formally prove the stabilization of the PDE, the original infinite dimensional problem is transformed into a finite dimensional one using, among other tools, model order reduction. The strength of our approach relies on the fact that we never explicitly discretize the PDE state using {em e.g.} a finite element approximation, and consequently, we provide stability guarantees directly on the infinite dimensional state.
|
|
17:20-17:40, Paper WeC11.5 | |
>Stabilization of a Limit Cycle for Discrete-Time Switched Nonlinear Systems |
|
Deaecto, Grace S. | FEM/UNICAMP |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Switched systems, Stability of nonlinear systems, LMIs
Abstract: This paper studies global exponential stabilization of a limit cycle of interest for discrete-time switched nonlinear systems, in which the subsystems may have different equilibria. As a first step, a set of candidate limit cycles is determined according to a criterion related to the steady-state behavior of the system trajectories. Afterwards, a state-dependent switching function, based on sufficient conditions derived from a time-periodic Lyapunov function, is proposed to ensure global exponential stability of the limit cycle and a guaranteed performance level for the overall system. A class of polynomial switched systems is used to illustrate the main results. For this class, new LMI conditions are obtained that ensure local exponential stability of the limit cycle, inside a polyhedral set given by the designer. An ellipsoidal set of maximum volume is determined such that any trajectory starting inside it does not leave the polyhedron. The main features of this methodology are illustrated by academic examples.
|
|
17:40-18:00, Paper WeC11.6 | |
>Minimal Covariance Realization and System Identification Algorithm for a Class of Stochastic Linear Switched Systems with I.i.d. Switching |
|
Rouphael, Elie | University of Lille |
Mejari, Manas | University of Applied Sciences and Arts of Southern Switzerland |
Petreczky, Mihaly | UMR CNRS 9189, Ecole Centrale De Lille |
Belkoura, Lotfi | Université De Lille |
Keywords: Switched systems, Subspace methods, Identification
Abstract: In this paper, we consider stochastic realization theory of Linear Switched Systems (LSS) with i.i.d. switching. We characterize minimality of stochastic LSSs and show existence and uniqueness (up to isomorphism) of minimal LSSs in innovation form. We present a realization algorithm to compute a minimal LSS in innovation form from output and input covariances. Finally, based on this realization algorithm, by replacing true covariances with empirical ones, we propose a statistically consistent system identification algorithm.
|
|
WeC12 |
Brown 3 |
Machine Learning II |
Regular Session |
Chair: Gharesifard, Bahman | University of California, Los Angeles |
Co-Chair: Moothedath, Shana | Iowa State University |
|
16:00-16:20, Paper WeC12.1 | |
>Denoising Diffusion-Based Control of Nonlinear Systems |
|
Elamvazhuthi, Karthik | Los Alamos National Laboratory |
Gadginmath, Darshan | University of California, Riverside |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Machine learning, Data driven control, Large-scale systems
Abstract: We propose a novel approach based on Denoising Diffusion Probabilistic Models (DDPMs) to control nonlinear dynamical systems. DDPMs are the state-of-the-art of generative models that have achieved success in a wide variety of sampling tasks. In our framework, we pose the feedback control problem as a generative task of drawing samples from a target set under control system constraints. The forward process of DDPMs constructs trajectories originating from a target set by adding noise. We learn to control a dynamical system in reverse such that the terminal state belongs to the target set. For control-affine systems without drift, we prove that the control system can exactly track the trajectory of the forward process in reverse, whenever the the Lie-bracket based condition for controllability holds. We numerically study our approach on various nonlinear systems and verify our theoretical results.
|
|
16:20-16:40, Paper WeC12.2 | |
>Approximate Controllability of Continuity Equation of Transformers |
|
Adu, Daniel O | University of Georgia |
Gharesifard, Bahman | Queen's University |
Keywords: Machine learning, Distributed parameter systems, Neural networks
Abstract: Building on the recent work in~cite{GB-LC-PY:23} which provides an interacting particle system interpretation of Transformers with a continuous-time evolution, we study the controllability attributes of the corresponding continuity equation across the landscape of probability space curves. In particular, we consider the parameters of the Transformer's continuous-time evolution as control inputs. We prove that given an absolutely continuous probability measure and a non-local Lipschitz velocity field that satisfy a continuity equation, there exist control inputs such that the measure and the non-local velocity field of the Transformer's continuous-time evolution approximates them, respectively, in the p-Wasserstein and L^p-sense, where 1leq p
|
|
16:40-17:00, Paper WeC12.3 | |
>Fast and Sample Efficient Relevance-Based Multi-Task Representation Learning |
|
Lin, Jiabin | Iowa State University |
Moothedath, Shana | Iowa State University |
Keywords: Machine learning, Estimation, Optimization algorithms
Abstract: This paper explores an approach for task-relevant multi-task representation learning when the amount of data is limited for both source tasks and target tasks. Specifically, we consider a low-dimensional setting where the goal is to sample source task data based on their relevance so as to utilize task-relevant information effectively. We present a novel learning algorithm based on an alternating projected gradient descent (GD) and minimization estimator. We present the convergence guarantee of our algorithm, excess risk, and the sample complexity of our approach. We evaluated the effectiveness of our algorithm via numerical experiments and compared it empirically against three benchmark approaches.
|
|
17:00-17:20, Paper WeC12.4 | |
>Synthetic Data Generation for System Identification: Leveraging Knowledge Transfer from Similar Systems |
|
Piga, Dario | University of Applied Sciences and Arts of Southern Switzerland |
Rufolo, Matteo | USI-SUPSI |
Maroni, Gabriele | IDSIA USI-SUPSI |
Mejari, Manas | University of Applied Sciences and Arts of Southern Switzerland |
Forgione, Marco | USI-SUPSI |
Keywords: Machine learning, Neural networks, Learning
Abstract: This paper addresses the challenge of overfitting in the learning of dynamical systems by introducing a novel approach for the generation of synthetic data, aimed at enhancing model generalization and robustness in scenarios characterized by data scarcity. Central to the proposed methodology is the concept of knowledge transfer from systems within the same class. Specifically, synthetic data is generated through a pre-trained meta-model that describes a broad class of systems to which the system of interest is assumed to belong. Training data serves a dual purpose: firstly, as input to the pre-trained meta model to discern the system's dynamics, enabling the prediction of its behavior and thereby generating synthetic output sequences for new input sequences; secondly, in conjunction with synthetic data, to define the loss function used for model estimation. A validation dataset is used to tune a scalar hyper-parameter balancing the relative importance of training and synthetic data in the definition of the loss function. The same validation set can be also used for other purposes, such as early stopping during the training, fundamental to avoid overfitting in case of small-size training datasets. The efficacy of the approach is shown through a numerical example that highlights the advantages of integrating synthetic data into the system identification process.
|
|
17:20-17:40, Paper WeC12.5 | |
>Domain Adaptive Safety Filters Via Deep Operator Learning |
|
Manda, Lakshmideepakreddy | Department of Electrical and Computer Engineering, Johns Hopkins |
Chen, Shaoru | Microsoft Corporation |
Fazlyab, Mahyar | Johns Hopkins University |
Keywords: Machine learning, Neural networks, Adaptive control
Abstract: Learning-based approaches for constructing Control Barrier Functions (CBFs) are increasingly being explored for safety-critical control systems. However, these methods typically require complete retraining when applied to unseen environments, limiting their adaptability. To address this, we propose a self-supervised deep operator learning framework that learns the mapping from environmental parameters to the corresponding CBF, rather than learning the CBF directly. Our approach leverages the residual of a parametric Partial Differential Equation (PDE), where the solution defines a parametric CBF approximating the maximal control invariant set. This framework accommodates complex safety constraints, higher relative degrees, and actuation limits. We demonstrate the effectiveness of the method through numerical experiments on navigation tasks involving dynamic obstacles.
|
|
17:40-18:00, Paper WeC12.6 | |
>Exploiting Symmetry in Dynamics for Model-Based Reinforcement Learning with Asymmetric Rewards |
|
Sonmez, Yasin | UC Berkeley |
Junnarkar, Neelay | University of of California Berkeley |
Arcak, Murat | University of California, Berkeley |
Keywords: Machine learning, Nonlinear systems identification, Data driven control
Abstract: Recent work in reinforcement learning has leveraged symmetries in the model to improve sample efficiency in training a policy. A commonly used simplifying assumption is that the dynamics and reward both exhibit the same symmetry. However, in many real-world environments, the dynamical model exhibits symmetry independent of the reward model: the reward may not satisfy the same symmetries as the dynamics. In this paper, we investigate scenarios where only the dynamics are assumed to exhibit symmetry, extending the scope of problems in reinforcement learning and learning in control theory where symmetry techniques can be applied. We use Cartan's moving frame method to introduce a technique for learning dynamics which, by construction, exhibit specified symmetries. We demonstrate through numerical experiments that the proposed method learns a more accurate dynamical model.
|
|
WeC13 |
Suite 1 |
Control for Smart Transportation Systems with Incomplete Information |
Invited Session |
Chair: Jin, Li | Shanghai Jiao Tong University |
Co-Chair: Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Organizer: Jin, Li | Shanghai Jiao Tong University |
Organizer: Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
|
16:00-16:20, Paper WeC13.1 | |
>PI Boundary Consensus of Networked Hyperbolic Systems with Application to Multi-Lane Traffic Synchronization (I) |
|
Cao, Lei | Beijing University of Technology |
Zhan, Jingyuan | Beijing University of Technology |
Zhang, Liguo | Beijing University of Technology |
Keywords: Distributed parameter systems, Cooperative control, Traffic control
Abstract: This paper studies the proportional-integral (PI) boundary feedback control problem for consensus of hyperbolic multi-agent systems (MASs), and provides an application to the synchronization of a multi-lane road traffic flow system. Firstly, we propose a PI boundary consensus protocol for the hyperbolic MASs of conservation laws in the presence of unknown constant input disturbances. Secondly, we present the consensus analysis under undirected communication topologies by employing the Lyapunov approach, obtaining sufficient conditions w.r.t. the PI boundary control matrices and Laplacian matrices for ensuring the asymptotic consensus. We further integrate the spectral decomposition technique with Lyapunov approach to derive the sufficient conditions related to Laplacian eigenvalues, which are more tractable, under the assumption that the undirected graph is connected. Finally, we provide an application to the synchronization of a multi-lane road traffic flow system described by the Aw-Rascle equation, and give numerical simulation results to demonstrate the effectiveness of the PI boundary consensus protocol.
|
|
16:20-16:40, Paper WeC13.2 | |
>Safe Stabilizing Control of Traffic Systems with Simultaneous State and Actuator Delays |
|
Zhao, Chenguang | The Hong Kong University of Science and Technology (Guangzhou) |
Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Keywords: Traffic control, Delay systems, Optimization
Abstract: The integration of connectivity and autonomy in vehicles has shown great promise in smoothing traffic by a leading connected automated vehicle (CAV), which can be achieved by designing cruising control strategies to regulate the velocity and spacing of a string of follower vehicles. Safety verification of these stabilizing traffic controllers in real-world traffic systems, where delays stem from human reaction times and sensor/actuator latency, remains an open research question. In this paper, we solve the safe stabilization of a class of traffic systems with simultaneous state and actuator delays, using the control barrier function (CBF). A predictor with bounded prediction error is designed to compensate for the actuator delay and state delays. Delay-compensating CBF constraints are designed to guarantee formal safety under simultaneous state and actuator delays. We synthesize a safety-critical controller by solving a Quadratic Programming problem that minimizes deviation from a nominal stabilizing traffic controller. Numerical simulations discuss the safety impact of simultaneous reaction delay of human drivers and the actuator delay of CAV in two safety-critical scenarios and validate the proposed CBF safety constraints.
|
|
16:40-17:00, Paper WeC13.3 | |
>On Joint Convergence of Traffic State and Weight Vector in Learning-Based Dynamic Routing with Value Function Approximation (I) |
|
Wu, Yidan | Shanghai Jiao Tong University |
Zhang, Jianan | Google |
Jin, Li | Shanghai Jiao Tong University |
Keywords: Reinforcement learning, Lyapunov methods, Traffic control
Abstract: Learning-based approaches are increasingly popular for traffic control problems, but they are applied typically as black boxes with limited theoretical guarantees and interpretability. In this paper, we address these challenges by analyzing dynamic routing over parallel servers, a representative traffic control task, through a semi-gradient on-policy control algorithm, a key reinforcement learning method. We consider a linear value function approximation on an unbounded state space and derive a Lyapunov function from the approximator. In particular, the structure of the approximator naturally enables idling policies, which is an interesting and useful advantage over existing dynamic routing schemes. Our results demonstrate that the convergence of the approximation weights is coupled with the convergence of the traffic state. Specifically, we show that if the system is stabilizable, then (i) the weight vector converges to a bounded region, and (ii) the traffic state is bounded in the mean. Additionally, empirical evidence shows that our proposed algorithm is computationally efficient with an insignificant optimality gap, which is effectively practical in real-world applications.
|
|
17:00-17:20, Paper WeC13.4 | |
>Event-Triggered Cooperative Adaptive Cruise Control of Mixed Vehicular Platoons (I) |
|
Zhao, Fu-Yu | Northeastern University |
Gao, Weinan | Northeastern University |
Jiang, Zhong-Ping | New York University |
Liu, Tengfei | Northeastern University |
Keywords: Reinforcement learning, Optimal control, Adaptive control
Abstract: This paper presents a novel policy iteration (PI) based event-triggered cooperative adaptive cruise control (CACC) approach for vehicular platoons, which are composed of autonomous vehicles. An observer is established to observe the signal of leading vehicle. To reduce computation and communication resources, event-triggered based adaptive optimal controllers are designed by using the PI algorithm without relying on the knowledge of system dynamics. Sufficient conditions are introduced to achieve the leader-to formation stability (LFS) of the vehicular platoons. Simulation results are presented to verify the effectiveness of the proposed design.
|
|
17:20-17:40, Paper WeC13.5 | |
>Event-Triggered Boundary Control of Mixed-Autonomy Traffic (I) |
|
Zhang, Yihuai | The Hong Kong University of Science and Technology(Guangzhou) |
Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Keywords: Traffic control, Backstepping, Distributed parameter systems
Abstract: Control problems of mixed-autonomy traffic systems that consist of both human-driven vehicles (HV) and autonomous vehicles (AV), have gained increasing attention. This paper focuses on suppressing traffic oscillations in the mixed-autonomy traffic system using boundary control design. The mixed traffic dynamics are described by 4times 4 hyperbolic partial differential equations (PDEs), governing the propagation of four waves of traffic, including the density of HV, the density of AV, the friction between the two vehicle classes from driving interactions and the averaged velocity. We propose an event-triggered boundary control design since control signals of the traffic light on ramp or the varying speed limit cannot be continuously updated. We apply the event-triggered mechanism for a PDE backstepping controller and obtain a dynamic triggering condition. Lyapunov analysis is performed to prove the exponential stability of the closed-loop system with the event-triggered controller. Numerical simulation demonstrates the efficiency of the proposed event-trigger control design. We analyzed how the car-following spacing of AV affects the event-triggering mechanism of the control input in mixed-autonomy traffic.
|
|
17:40-18:00, Paper WeC13.6 | |
>Model Predictive Control for Turning Vehicle Platoons Guaranteeing String Stability |
|
Zhang, Qihang | University of Groningen |
Qiu, Meng | Southeast University |
Cao, Ming | University of Groningen |
Keywords: Traffic control, Predictive control for linear systems, Optimization
Abstract: This paper presents a detailed study on the application of Model Predictive Control (MPC) in managing a platoon of vehicles that makes turns, focusing on ensuring asymptotic stability and maintaining leader-follower string stability. String stability is essential in vehicle platooning, as it ensures that perturbations do not amplify as they propagate through the platoon, thereby preventing unsafe vehicle behaviors. Many studies have focused on one-dimensional platooning, emphasizing longitudinal stability; however, real-world driving scenarios frequently involve scenarios such as turning and lane changing that require to satisfy both lateral and longitudinal stability. Therefore, this work introduces an MPC algorithm designed for turning platoons. The algorithm enables the platoon to follow curved trajectories while ensuring string stability. To demonstrate the MPC algorithm's performance in maintaining stability and string stability, we conducted a simulation involving a platoon of five vehicles.
|
|
WeC14 |
Suite 2 |
Estimation V |
Regular Session |
Chair: Martinez, Sonia | University of California at San Diego |
Co-Chair: Meslem, Nacim | GIPSA-LAB, CNRS |
|
16:00-16:20, Paper WeC14.1 | |
>Guaranteed Privacy-Preserving H∞-Optimal Interval Observer Design for Bounded-Error LTI Systems |
|
Khajenejad, Mohammad | The University of Tulsa |
Martinez, Sonia | University of California at San Diego |
Keywords: Estimation, Linear systems, Observers for Linear systems
Abstract: This paper furthers current research into the notion of guaranteed privacy, which provides a deterministic characterization of the privacy of output signals of a dynamical system or mechanism. Unlike stochastic differential privacy, guaranteed privacy offers strict bounds on the proximity between the ranges of two sets of estimated data. Our approach relies on synthesizing an interval observer for a perturbed linear time-invariant (LTI) bounded-error system. The design procedure incorporates a bounded noise perturbation factor computation and observer gains synthesis. Consequently, the observer simultaneously provides guaranteed private and stable interval-valued estimates for a desired variable. We demonstrate the optimality of our design by minimizing the H∞ norm of the observer error system. Furthermore, we assess the accuracy of our proposed mechanism by quantifying the loss incurred when considering guaranteed privacy specifications. Finally, we illustrate the outperformance of the proposed approach to differential privacy through simulations.
|
|
16:20-16:40, Paper WeC14.2 | |
>Set-Valued State Estimator with Sparse and Delayed Measurements for Uncertain Discrete-Time Linear Systems |
|
Meslem, Nacim | GIPSA-LAB, CNRS |
Hably, Ahmad | GIPSA-Lab |
Wang, Zhenhua | Harbin Institute of Technology |
Raïssi, Tarek | Conservatoire National Des Arts Et Métiers |
Keywords: Estimation, Linear systems
Abstract: This work deals with online set-membership estimation problem of the state vector of linear discrete-time linear systems with sparse and delayed data. This problem is addressed in a bounded error context where both state disturbance and measurements noise are modeled by bounded boxes. A prediction-correction strategy is adopted to design a Luenberger-like observer where an outer approximation of the reachable set of the estimation error can be computed offline. In addition, by applying interval analysis and under a nonrestrictive observability assumption, both bounding and convergence features of the introduced interval-based state estimator are demonstrated. A numerical case study example is exposed to support the theoretical results by simulation results and to highlight the performance of the proposed estimation approach in the presence of sparse and delayed data.
|
|
16:40-17:00, Paper WeC14.3 | |
>Koopman-Based Deep Learning for Nonlinear System Estimation |
|
Sun, Zexin | Boston University |
Chen, Mingyu | Boston University |
Baillieul, John | Boston Univ |
Keywords: Estimation, Nonlinear systems, Learning
Abstract: Nonlinear differential equations are encountered as models of fluid flow, spiking neurons, and many other systems of interest in the real world. Common features of these systems are that their behaviors are difficult to describe exactly and invariably unmodeled dynamics present challenges in making precise predictions. In this paper, we present a novel data-driven linear estimator based on Koopman operator theory to extract meaningful finite-dimensional representations of complex non- linear systems. The Koopman model is used together with deep reinforcement networks that learn the optimal stepwise actions to predict future states of nonlinear systems. Our estimator is also adaptive to a diffeomorphic transformation of the estimated nonlinear system, which enables it to compute optimal state estimates without re-learning.
|
|
17:00-17:20, Paper WeC14.4 | |
>Generic Interval Filter for Continuous Time Nonlinear Systems with Bounded State Condition |
|
Lu, Quoc Hung | UPS, LAAS-CNRS |
Fergani, Soheib | LAAS-CNRS, Laboratory for Analysis and Architecture of Systems |
Jauberthie, Carine | LAAS-CNRS |
Keywords: Estimation, Nonlinear systems, Uncertain systems
Abstract: State estimation for continuous time nonlinear systems, in bounded error framework, is concerned in this paper. Guaranteed state estimation is obtained without assumption of differentiability on the dynamical function. The proposed filter is mainly based on interval computations, bounded state assumption and with or without Lipschitz smoothness condition on the dynamical function. This filter is applied on a case study, highlighting the potential of the proposed filtering method.
|
|
17:20-17:40, Paper WeC14.5 | |
>Sparse Topology Estimation for Consensus Network Systems Via Minimax Concave Penalty |
|
Matsuzaki, Fumiya | Kyushu University |
Ikeda, Takuya | The University of Kitakyushu |
Keywords: Estimation, Optimization, Networked control systems
Abstract: This paper proposes an optimization method to estimate the network topology in continuous-time consensus systems. Assuming the network topology is non-negative weighted and undirected, we formulate an optimization problem based on the sparse maximum likelihood estimation using the minimax concave penalty function as the sparsity promoting term. We show that the problem belongs to the class of difference of convex functions (DC) optimization problems and quote the best-known DC algorithm for the computation. The effectiveness of the proposed method is demonstrated through numerical simulations and experiments using flight data measured from a multi-agent system of drones. We confirm that our method outperforms the conventional l1 regularization methods.
|
|
17:40-18:00, Paper WeC14.6 | |
>Multi-Frequency Tracking Via Group-Sparse Optimal Transport |
|
Haasler, Isabel | École Polytechnique Fédérale De Lausanne |
Elvander, Filip | Lund University |
Keywords: Estimation, Optimization algorithms, Large-scale systems
Abstract: In this work, we introduce an optimal transport framework for inferring power distributions over both spatial location and temporal frequency. Recently, it has been shown that optimal transport is a powerful tool for estimating spatial spectra that change smoothly over time. In this work, we consider the tracking of the spatio-temporal spectrum corresponding to a small number of moving broad-band signal sources. Typically, such tracking problems are addressed by treating the spatio-temporal power distribution in a frequency-by-frequency manner, allowing to use well-understood models for narrow-band signals. This however leads to decreased target resolution due to inefficient use of the available information. We propose an extension of the optimal transport framework that exploits information from several frequencies simultaneously by estimating a spatio-temporal distribution penalized by a group-sparsity regularizer. This approach finds a spatial spectrum that changes smoothly over time, and at each time instance has a small support that is similar across frequencies. To the best of the authors’ knowledge, this is the first formulation combining optimal transport and sparsity for solving inverse problems. As is shown on simulated and real data, our method can successfully track targets in scenarios where information from separate frequency bands alone is insufficient.
|
|
WeC15 |
Suite 3 |
Power Systems III |
Regular Session |
Chair: Chaudhuri, Nilanjan Ray | Penn State University |
Co-Chair: Taha, Ahmad | Vanderbilt University |
|
16:00-16:20, Paper WeC15.1 | |
>Analysis of Backward Euler Method in Presence of Saturation Nonlinearity and Applications in Power Systems Simulation |
|
Gangopadhyay, Soumyajit | The Pennsylvania State University |
Chaudhuri, Nilanjan Ray | Penn State University |
Keywords: Power systems, Power electronics, Energy systems
Abstract: Dynamic simulation of a power system involves the solutions of many nonlinear differential-algebraic equations that are computationally expensive. While quasi steady state approximation methods are computationally efficient, they cannot capture many power system phenomena such as controller-induced instabilities. Recently, Backward Euler Method (BEM) has been used to produce a coarser approximation of the ground truth (obtained from Trapezoidal method) at a lower computational effort. However, no fundamental analysis exists in the literature for understanding the properties of BEM in presence of saturation nonlinearity in a dynamical system. This paper mathematically investigates the properties of BEM when applied to a 1−dimensional and a 2−dimensional system with saturation nonlinearity. Our analyses show that besides hyperstability, unlike in a linear time-invariant system, BEM can also suffer from hyperinstability in a system with saturation. Based on the mathematical analyses, qualitative recommendations are presented for adaptively varying the stepsizes of BEM such that the solution can resemble the ground truth in an averaged sense at a significantly lower computational cost. BEM with adaptive stepsize variation is applied to simulate (i) a single-generator system (with saturation nonlinearity in the governor’s dynamics) feeding a standalone load and (ii) a 6-bus system with a synchronous generator and inverter-based resources having saturation nonlinearity. It is shown that by adaptively varying the stepsizes based on the presented recommendations, BEM can produce the same end result as in the ground truth while consuming significantly less cpu time.
|
|
16:20-16:40, Paper WeC15.2 | |
>Market Power and Withholding Behavior of Energy Storage Units |
|
Wu, Yiqian | Columbia University |
Xu, Bolun | Columbia University |
Anderson, James | Columbia University |
Keywords: Power systems, Smart grid, Energy systems
Abstract: Electricity markets are experiencing a rapid increase in energy storage participation. Unlike conventional generation resources, quantifying the competitive operation status and identifying market power exercise by energy storage is challenging, particularly in the context of multi-interval strategies. We present a framework to differentiate strategic capacity withholding behaviors attributed to market power exercise from inherent competitive bidding in storage strategies. Our framework evaluates the profitability of strategic storage participation, analyzing bidding behaviors as both price takers and price makers using a self-scheduling model, and investigate how they leverage market inefficiencies. Specifically, we propose a price sensitivity model derived from the linear supply function equilibrium model to examine the price-anticipating bidding strategy, effectively capturing the influence of market power exercise. Utilizing this framework, we introduce a sufficient ex-post analysis standard for market operators to identify potential exploitative behaviors by monitoring instances of withholding within the bidding profiles, ensuring market resilience and competitiveness. We discuss and verify applicability of the proposed framework to realistic settings. Our analysis substantiates commonly observed economic bidding behaviors of storage participants. Furthermore, it demonstrates that significant price volatility offers considerable profit opportunities not only for participants possessing market power but also for typical strategic profit seekers, underscoring the importance of sound market planning and resource allocation.
|
|
16:40-17:00, Paper WeC15.3 | |
>Real-Time Assessment of Distribution Grid Security through Adaptive Smart Meter Measurements |
|
Chen, Jiaqi | University of Wisconsin-Madison |
Roald, Line | University of Wisconsin Madison |
Keywords: Power systems, Smart grid, Energy systems
Abstract: The rapid expansion of distributed energy resources is heightening uncertainty and variability in distribution system operations, potentially leading to power quality challenges such as voltage magnitude violations and excessive voltage unbalance. Ensuring the dependable and secure operation of distribution grids requires system real-time assessment. However, constraints in sensing, measurement, and communication capabilities within distribution grids result in limited awareness of the system’s state. To achieve better real-time estimates of distribution system security, we propose a real-time security assessment based on data from smart meters, which are already prevalent in most distribution grids. Assuming that it is possible to obtain a limited number of voltage magnitude measurements in real time, we design an iterative algorithm to adaptively identify a subset of smart meters whose real-time measurements allow us to certify that all voltage magnitudes remain within bounds. This algorithm iterates between (i) solving optimization problems to determine the worst possible voltage magnitudes, given a limited set of voltage magnitude measurements, and (ii) leveraging the solutions and sensitivity information from these problems to update the measurement set. Numerical tests on the IEEE 123 distribution feeder demonstrate that the proposed algorithm consistently identifies and tracks the nodes with the highest and lowest voltage magnitude, even as the load changes over time.
|
|
17:00-17:20, Paper WeC15.4 | |
>Fault Detection, Isolation, and Estimation for a Three-Phase Grid Connected DC-AC Inverter with LCL Filters |
|
Marouane, Laaziz | LCCPS, ENSAM, Hassan II University of Casablanca |
Nicolau, Florentina | Ensea Cergy |
Ghanes, Malek | Centrale Nantes |
Barbot, Jean Pierre | Ecole Centrale Nantes & CNRS |
Boisliveau, Robert | LS2N |
Machkour, Nadia | LCCPS, ENSAM, Hassan II University of Casablanca, Morocco |
Keywords: Power systems, Fault detection, Fault diagnosis
Abstract: The paper is dedicated to symmetric and asymmetric fault detection, isolation, and estimation of a three-phase DC-AC (Direct Current-Alternating Current) inverter connected to the grid by three LCL filters. The system modeling and the faults estimation algorithm are performed in the abc frame allowing to take into account phase by phase asymmetrical faults. It is based on a left inversion technique and uses the super-twisting differentiator in order to estimate the state components of the system from the given measurements. Simulations to support the proposed approach are presented.
|
|
17:20-17:40, Paper WeC15.5 | |
>On the Stability of Power Transmission Systems under Persistent Inverter Attacks: A Bi-Linear Matrix Approach |
|
Colot, Antonin | University of Liège |
Shenoy, Vishal | University of California, San Diego |
Cavraro, Guido | National Renewable Energy Laboratory |
Dall'Anese, Emiliano | Boston University |
Poveda, Jorge I. | University of California, San Diego |
Keywords: Power systems, Hybrid systems, Stability of hybrid systems
Abstract: We investigate the stability and robustness properties of a power transmission system under persistent deceiving attacks to inverter-interfaced energy resources. The attacks can corrupt the damping coefficients in the inverters' controllers and measurements of the frequency at the points of coupling. The dynamics of the primary and of the secondary frequency controllers are modeled using a linear time-invariant dynamical system. Leveraging tools from the theory of hybrid dynamical systems, we characterize a broad family of persistent (and not necessarily periodic) attacks acting on the inverters, under which the stability properties of the transmission system can be shown to not be compromised. In particular, sufficient conditions on the average activation time of the attacks are identified via Lyapunov theory, with bounds obtained through the formulation and solution of a class of bilinear matrix inequalities that allow for a reduction in the conservativeness of some of the estimates. The results are obtained for both constant and slowly time-varying loads using input-to-state stability (ISS) tools for hybrid dynamical systems. Numerical simulations on the IEEE 39-bus test system are also presented.
|
|
17:40-18:00, Paper WeC15.6 | |
>Wide-Area Damping Controller Via Reinforcement Learning for Power Networks with Wind and Solar Farms |
|
Nadeem, Muhammad | Vanderbilt University |
Bahavarnia, MirSaleh | Vanderbilt University |
Taha, Ahmad | Vanderbilt University |
Keywords: Reinforcement learning, Power systems, Differential-algebraic systems
Abstract: Accurately predicting power system behavior is becoming more complex with the increased penetration of uncertain wind and solar-based renewable resources. Hence, there is a growing motivation to transition from model-based feedback control strategies to completely model-free counterparts. Reinforcement learning (RL) is a key methodology in designing a model-free controller. Various studies have been carried out to study voltage/frequency control strategies via RL. However, they usually consider a simplified power system model either by completely neglecting system differential equations (and thus only modeling the system via power balance equations) or considering simplified generator models. Furthermore, damping system-wide oscillations after large disturbances are usually ignored in the controller design. In contrast, we propose an RL-based wide-area damping controller (WADC) for an advanced power system model with comprehensive higher-order generator dynamics, power electronics-based wind and solar models, and composite load dynamics. The presented controller sends control signals to synchronous generators, wind, and solar power plants to actively adjust their power and voltage setpoints---thereby providing damping to the system oscillations after large disturbances. Case studies demonstrate that the system's transient stability can be significantly improved after large disturbances.
|
|
WeC16 |
Suite 4 |
Robust and Constrained Control |
Regular Session |
Chair: Oliveira, Vilma A. | Universidade De Sao Paulo |
Co-Chair: Valmorbida, Giorgio | L2S, CentraleSupelec |
|
16:00-16:20, Paper WeC16.1 | |
>Robust Model Predictive Control-Based Energy Management System for the Island Power System of Suðuroy, Faroe Islands |
|
Alferink, Marco | University of Bremen |
Reus, Lucas | Leibniz University Hannover |
Hofmann, Lutz | Leibniz University Hannover |
Michels, Kai | University of Bremen |
Keywords: Robust control, Power systems, Predictive control for nonlinear systems
Abstract: In this paper a robust Model Predictive Control (MPC)-based Energy Management System (EMS) is developed and investigated for the case study of Suðuroy, Faroe Islands, under uncertainties resulting from real data. This island power system comprises volatile renewable energy sources, synchronous machines and a Battery Energy Storage System (BESS). In the presented robust MPC both uncertainties from the wind farm power and load demand forecasts are considered. Simulation results are derived from data of one week focusing on the diesel generators’ operational costs and the BESS usage. A comparison against a deterministic MPC-based EMS as well as historical grid operation data is performed. In the presented case study it is shown that both the deterministic MPC and the robust MPC provide significant economic benefits compared to the current grid operation. It is also found that the robust MPC achieves similar performance to the deterministic MPC regarding operational costs. Increased robustness gained through higher state of charge of the BESS is not traded for higher operational costs, highlighting the advantages of the robust MPC-based EMS.
|
|
16:20-16:40, Paper WeC16.2 | |
>On Stabilizing Terminal Costs and Regions for Configuration-Constrained Tube MPC |
|
Houska, Boris | ShanghaiTech University |
Müller, Matthias A. | Leibniz University Hannover |
Villanueva, Mario E. | IMT School for Advanced Studies Lucca |
Keywords: Robust control, Predictive control for linear systems, Constrained control
Abstract: This paper introduces a novel class of terminal regions and cost functions for tube model predictive control (TMPC). Our focus is on polytopic configuration-constrained TMPC schemes, which offer flexibility by introducing a significant amount of variables to model the shape of the propagated sets. This flexibility, however, comes with a challenge, namely, to enforce stability efficiently. To address this challenge, we propose tailored terminal regions and cost functions that enable efficient and stable TMPC implementations. Our approach does not rely on regularity assumptions about the control system or configuration templates. Instead, it directly addresses the computational tractability of the end cost and terminal region, which is crucial for TMPC with high-dimensional set parameterizations. Numerical case studies demonstrate the effectiveness and performance of the proposed control scheme.
|
|
16:40-17:00, Paper WeC16.3 | |
>Robust H∞ Control for Uncertain Linear Systems Via Static Output Feedback |
|
Faria, Flávio A. | UNESP - Univ Estadual Paulista |
Elias, Leandro J. | Instituto Federal De Educação, Ciência E Tecnologia De São Paulo |
Cardim, Rodrigo | UNESP - São Paulo State University |
Solis Oncoy, Dante Javier | UNESP - São Paulo State University |
Oliveira, Vilma A. | Universidade De Sao Paulo |
Keywords: Robust control, Stability of linear systems, LMIs
Abstract: This work proposes a switched control strategy to attenuate the effect of L2 type disturbances on the output performance of uncertain linear systems. The switched controller improve the control performance and reduces the conservatism of existing solutions. Stabilization conditions are given in terms of LMIs which are designed such that the proposed solution guarantees an H∞ guaranteed cost and be free of chattering. The efficiency of the stabilizing conditions are evaluated by numerical examples.
|
|
17:00-17:20, Paper WeC16.4 | |
>The Distributionally Robust Infinite-Horizon LQR |
|
Hajar, Joudi | Caltech |
Kargin, Taylan | California Institute of Technology |
Malik, Vikrant | California Institute of Technology |
Hassibi, Babak | Caltech |
Keywords: Robust control, Stochastic optimal control, Optimization algorithms
Abstract: We explore the infinite-horizon Distributionally Robust (DR) linear-quadratic control. While the probability distribution of disturbances is unknown and potentially correlated over time, it is confined within a Wasserstein-2 ball of a radius r around a known nominal distribution. Our goal is to devise a control policy that minimizes the worst-case expected Linear-Quadratic Regulator (LQR) cost among all probability distributions of disturbances lying in the Wasserstein ambiguity set. We obtain the optimality conditions for the optimal DR controller and show that it is non-rational. Despite lacking a finite-order state-space representation, we introduce a computationally tractable fixed-point iteration algorithm. Our proposed method computes the optimal controller in the frequency domain to any desired fidelity. Moreover, for any given finite order, we use a convex numerical method to compute the best rational approximation (in H_infty-norm) to the optimal non-rational DR controller. This enables efficient time-domain implementation by finite-order state-space controllers and addresses the computational hurdles associated with the finite-horizon approaches to DR-LQR problems, which typically necessitate solving a Semi-Definite Program (SDP) with a dimension scaling with the time horizon. We provide numerical simulations to showcase the effectiveness of our approach.
|
|
17:20-17:40, Paper WeC16.5 | |
>Controller Design for Constrained Discrete-Time Uncertain Linear Systems Using Implicit Functions |
|
Muñoz-Carpintero, Diego | Universidad De O'Higgins |
Valmorbida, Giorgio | L2S, CentraleSupelec |
Keywords: LMIs, Robust control, Constrained control
Abstract: This manuscript shows a controller design procedure for constrained discrete-time uncertain linear systems, where the control law and piece-wise functions implicitly defined by ramps give the associated invariant set and Lyapunov function. This structure enables non-symmetric invariant sets of larger sizes than other structures, yielding symmetric invariant sets. The procedure is iterative, and each iteration solves a semi-definite program. A simulation example illustrates the attributes of the proposed design procedure.
|
|
17:40-18:00, Paper WeC16.6 | |
>Robust Control of Constrained Linear Systems Using Online Convex Optimization and a Reference Governor |
|
Nonhoff, Marko | Leibniz University Hannover |
Al Torshan, Mohammad Taher | Leibniz Universität Hannover |
Müller, Matthias A. | Leibniz University Hannover |
Keywords: Constrained control, Optimal control, Linear systems
Abstract: This article develops a control method for linear time-invariant systems subject to time-varying and a priori unknown cost functions, that satisfies state and input constraints, and is robust to exogenous disturbances. To this end, we combine the online convex optimization framework with a reference governor and a constraint tightening approach. The proposed framework guarantees recursive feasibility and robust constraint satisfaction. Its closed-loop performance is studied in terms of its dynamic regret, which is bounded linearly by the variation of the cost functions and the magnitude of the disturbances. The proposed method is illustrated by a numerical case study of a tracking control problem.
|
|
WeC17 |
Suite 6 |
Autonomous Systems |
Regular Session |
Chair: di Bernardo, Mario | University of Naples Federico II |
Co-Chair: Tron, Roberto | Boston University |
|
16:00-16:20, Paper WeC17.1 | |
>Formation Control of Double Integrators Over Directed Graphs Using Bearings and Bearing Rates |
|
Rayabagi, Susmitha T | Indian Institute of Technology Bombay |
Pal, Debasattam | Indian Institute of Technology Bombay |
Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Keywords: Autonomous systems, Mechatronics, Networked control systems
Abstract: Bearing-only formation control problem for agents over directed graphs, due to the loss of symmetry in the sensing graph, has been explored sparingly in comparison to the undirected counterpart. The few existing results mainly consider single intergrator agents that achieve stationary formations. This paper studies the problem of bearing-only formation control for double integrator agents under a leader-first follower (LFF) structure where the underlying graph is directed acyclic in nature. We characterize the equilibrium points under our proposed control law and present local stability analysis around them. We further show that scale of the formation can be controlled purely through bearings by considering the leader’s physical dimensions to be non-trivial.
|
|
16:20-16:40, Paper WeC17.2 | |
>High-Dimensional Continuification Control of Large-Scale Multi-Agent Systems under Limited Sensing and Perturbations |
|
Maffettone, Gian Carlo | Scuola Superiore Meridionale |
di Bernardo, Mario | University of Naples Federico II |
Porfiri, Maurizio | New York University Tandon School of Engineering |
Keywords: Autonomous systems, Distributed parameter systems, Large-scale systems
Abstract: This paper investigates the robustness of a novel high-dimensional continuification control method for complex multi-agent systems. We begin by formulating a partial differential equation describing the spatio-temporal density dynamics of swarming agents. A stable control action for the density is then derived and validated under nominal conditions. Subsequently, we discretize this macroscopic strategy into actionable velocity inputs for the system's agents. Our analysis demonstrates the robustness of the approach beyond idealized assumptions of unlimited sensing and absence of perturbations.
|
|
16:40-17:00, Paper WeC17.3 | |
>Spline Trajectory Tracking and Obstacle Avoidance for Mobile Agents Via Convex Optimization |
|
Dickson, Akua | Boston University |
Cassandras, Christos G. | Boston University |
Tron, Roberto | Boston University |
Keywords: Autonomous systems, Optimization, Lyapunov methods
Abstract: We propose a motion planning technique based on output feedback that enables agents to converge to a specified polynomial trajectory while avoiding collisions within a polygonal environment. To achieve this, we 1) decompose the polygonal environment into overlapping cells, 2) express the polynomial trajectories as the output of a reference dynamical system with given initial conditions, 3) formulate convergence and safety (collision avoidance) constraints as Linear Matrix Inequalities (LMIs) on our controller using Control Lyapunov Functions (CLFs) and Control Barrier Functions (CBFs), and 4) synthesize a controller for each convex cell via a semi-definite programming (SDP) problem that includes the derived constraints. We test our method with simulations. We find that the synthesized controller is robust to changes in initial conditions, and maintains safety relative to the boundaries of the polygonal environment even in the presence of significant amounts of noise.
|
|
17:00-17:20, Paper WeC17.4 | |
>Safe Motion Along Harmonic Potential Surfaces with Cardinal Spline Dirichlet Boundary Conditions |
|
Nicu, Theodor-Gabriel | University Politehnica of Bucharest |
Stoican, Florin | Universitatea Nationala De Stiinta Si Tehnologie POLITEHNICA BUC |
Ioan, Daniel-Mihail | UNSTPB |
Prodan, Ionela | Grenoble Institute of Technology (Grenoble INP) - Esisar |
Keywords: Autonomous systems, Numerical algorithms, Optimization
Abstract: This work pertains to the generation and use of harmonic functions for obstacle avoidance and target tracking. For each cell of a polyhedral complex, we generate a harmonic potential surface from a Dirichlet-type condition on its boundary. The condition is expressed as a smooth cardinal B-spline curve, thus allowing an efficient computation of the potential which avoids the large gradients characteristic to the piecewise constant boundary conditions usually employed in the literature. Enumerating all pairs of "in" and "out" facets for each feasible cell (one not part of the obstacles) and attaching a relevant cost to each such pair (e.g., proportional to the path length inside the cell), we arrive at a directed graph where, by construction, all possible paths linking the source and the target avoid the obstacles. The optimal path results from a variation of Dijkstra's algorithm.
|
|
17:20-17:40, Paper WeC17.5 | |
>On Consensus Over a Network of Dynamical Descriptor Systems: Synthesis of Feedback Controllers |
|
Hazarika, Hemanta | Indian Institute of Technology, Bombay |
Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Pal, Debasattam | Indian Institute of Technology Bombay |
Keywords: Autonomous systems, Networked control systems, Differential-algebraic systems
Abstract: In this paper, we first investigate the regularity conditions for a network of dynamical descriptor systems (NDDS). It is shown that the regularity (or lack thereof) of the NDDS depends on the choice of the edge feedback gain. For an undirected homogeneous NDDS, when individual agents are regular, we derive a necessary and sufficient condition for the loss of regularity of the overall networked system. Furthermore, we present a method for designing a suitable distributed feedback controller to ensure that the descriptor agents achieve consensus.
|
|
17:40-18:00, Paper WeC17.6 | |
>Safe Returning FaSTrack with Robust Control Lyapunov-Value Functions |
|
Gong, Zheng | University of California San Diego |
Li, Boyang | UC San Diego |
Herbert, Sylvia | UC San Diego (UCSD) |
Keywords: Autonomous systems, Robust control, Control applications
Abstract: Real-time navigation in a priori unknown environment remains a challenging task, especially when an unexpected (unmodeled) disturbance occurs. In this paper, we propose the framework Safe Returning Fast and Safe Tracking (SR-F) that merges concepts from 1) Robust Control Lyapunov-Value Functions (R-CLVF), and 2) the Fast and Safe Tracking (FaSTrack) framework. The SR-F computes an R-CLVF offline between a model of the true system and a simplified planning model. Online, a planning algorithm is used to generate a trajectory in the simplified planning space, and the R-CLVF is used to provide a tracking controller that exponentially stabilizes to the planning model. When an unexpected disturbance occurs, the proposed SR-F algorithm provides a means for the true system to recover to the planning model. We take advantage of this mechanism to induce an artificial disturbance by ``jumping'' the planning model in open environments, forcing faster navigation. Therefore, this algorithm can both reject unexpected true disturbances and accelerate navigation speed. We validate our framework using a 10D quadrotor system and show that SR-F is empirically 20% faster than the original FaSTrack while maintaining safety.
|
|
WeC18 |
Suite 7 |
Stability of Nonlinear Systems III |
Regular Session |
Chair: Dashkovskiy, Sergey | University of Wuerzburg |
Co-Chair: Zuyev, Alexander | Max Planck Institute for Dynamics of Complex Systems |
|
16:00-16:20, Paper WeC18.1 | |
>ISS of Finite Dimensional Dynamic Systems on Time Scales |
|
Dashkovskiy, Sergey | University of Würzburg |
Hütter, Gianluca | Institute of Mathematics, University of Würzburg |
Keywords: Stability of hybrid systems, Stability of nonlinear systems, Lyapunov methods
Abstract: In this paper, we introduce the notion of input-to-state stability (ISS) for systems on time scales. We then generalize some known ISS results for time continuous and discrete systems for the case of time scales. In particular we will show that for linear systems the ISS is equivalent to the uniform exponential stability for zero input. Furthermore we will give a sufficient condition for ISS of nonlinear systems using a Lyapunov function.
|
|
16:20-16:40, Paper WeC18.2 | |
>Roughness of Dichotomy for Interconnected Systems of Operator-Differential Equations |
|
Dashkovskiy, Sergey | University of Würzburg |
Pokutnyi, Aleksande | Institute of Mathematics of NANU |
Slyn'ko, Vitalii | S.P. Timoshenko Institute of Mechanics |
Keywords: Stability of nonlinear systems, Lyapunov methods, Network analysis and control
Abstract: We consider linear abstract dynamical systems on Banach spaces possessing the exponential dichotomy property and ask whether a linear or nonlinear interconnection of these systems preserves the dichotomy. We derive conditions applied to the interconnected terms guaranteeing preservation of the dichotomy.
|
|
16:40-17:00, Paper WeC18.3 | |
>On Discrete-Time Polynomial Dynamical Systems on Hypergraphs |
|
Cui, Shaoxuan | University of Groningen |
Zhang, Guofeng | The Hong Kong Polytechnic University |
Jardón-Kojakhmetov, Hildeberto | University of Groningen |
Cao, Ming | University of Groningen |
Keywords: Stability of nonlinear systems, Network analysis and control, Lyapunov methods
Abstract: This paper studies the stability of discrete-time polynomial dynamical systems on hypergraphs by utilizing the Perron–Frobenius theorem for nonnegative tensors with respect to the tensors' Z-eigenvalues and Z-eigenvectors. First of all, for a multilinear polynomial system on a uniform hypergraph, we study the stability of the origin of the corresponding systems. Afterward, we extend our results to non-homogeneous polynomial systems on non-uniform hypergraphs. We confirm that the local stability of any discrete-time polynomial system is in general dominated by pairwise terms. In particular, given the origin is locally stable, we construct a conservative (but explicit) region of attraction from the system parameters. Finally, we validate our results via some numerical examples.
|
|
17:00-17:20, Paper WeC18.4 | |
>Asymptotically 2-Contractive Systems: Converging Properties and Applications |
|
Villani, Gianluca | University of Toronto |
Scardovi, Luca | University of Toronto |
Keywords: Stability of nonlinear systems, Network analysis and control, Time-varying systems
Abstract: In this paper we show that the positive limit sets of asymptotically 2-contractive systems, i.e. systems with a vector field that converges uniformly to a 2-contractive vector field, consist of isolated equilibria. This result is established by combining a known result about asymptotically autonomous systems with a characterization of the chain recurrent sets of 2-contractive systems with hyperbolic equilibria. We apply this result to characterize the limit sets of cascades of 2-contractive systems, synchronizing networks, and systems with converging inputs.
|
|
17:20-17:40, Paper WeC18.5 | |
>Local Stabilisation of Nonlinear Systems with Time and State-Dependent Perturbations Using Sliding-Mode Model-Following Control |
|
Tietze, Niclas | Technische Universität Ilmenau |
Wulff, Kai | TU Ilmenau |
Reger, Johann | TU Ilmenau |
Keywords: Stability of nonlinear systems, Variable-structure/sliding-mode control
Abstract: We study the problem of local stabilisation of a class of nonlinear systems subject to time- and state-dependant perturbations using a two-degree-of-freedom model-following control (MFC) scheme, which consist of a model control loop (MCL) and a process control loop (PCL). We consider a stabilising linear state feedback for the MCL with a first-order as well as a super-twisting sliding-mode controller for the PCL. Our stability analysis of the closed-loop system shows that the estimate of the region of attraction depends on the initialisation of the MCL. It turns out that this initialisation can be utilised to increase the region of attraction compared to a single-loop sliding-mode design. We analyse the robustness of this initialisation with respect to the true initial process state and provide according bounds that guarantee stability. Moreover we provide guaranteed bounds for the process state. We illustrate our results by a numerical example.
|
|
17:40-18:00, Paper WeC18.6 | |
>Design of Stabilizing Feedback Controllers for High-Order Nonholonomic Systems |
|
Grushkovskaya, Victoria | University of Klagenfurt |
Zuyev, Alexander | Max Planck Institute for Dynamics of Complex Systems |
Keywords: Lyapunov methods, Nonholonomic systems, Stability of nonlinear systems
Abstract: This paper presents a novel stabilizing control design strategy for driftless control-affine systems with an arbitrary degree of nonholonomy. The proposed approach combines a time-varying control component that generates motion in the direction of prescribed Lie brackets with a state-dependent component, ensuring the stability of the equilibrium. The coefficients of the state-dependent component are derived in such a way that the trajectories of the resulting closed-loop system approximate the gradient flow of a Lyapunov-like function. In the case of a quadratic Lyapunov function, this guarantees the exponential stability of the equilibrium. The usability of this approach is demonstrated on general two-input systems having the fourth degree of nonholonomy. The proposed stabilization scheme is illustrated with several examples.
|
|
WeC19 |
Suite 8 |
Markov Processes |
Regular Session |
Chair: Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Co-Chair: Nuzzo, Pierluigi | University of Southern California |
|
16:00-16:20, Paper WeC19.1 | |
>Convergence Guarantee of Dynamic Programming for LTL Surrogate Reward |
|
Xuan, Zetong | University of Florida |
Wang, Yu | University of Florida |
Keywords: Markov processes, Automata, Reinforcement learning
Abstract: Linear Temporal Logic (LTL) is a formal way of specifying complex objectives for planning problems modeled as Markov Decision Processes (MDPs). The planning problem aims to find the optimal policy that maximizes the satisfaction probability of the LTL objective. One way to solve the planning problem is to use the surrogate reward with two discount factors and dynamic programming, which bypasses the graph analysis used in traditional model-checking. The surrogate reward is designed such that its value function represents the satisfaction probability. However, in some cases where one of the discount factors is set to 1 for higher accuracy, the computation of the value function using dynamic programming is not guaranteed. This work shows that a multi-step contraction always exists during dynamic programming updates, guaranteeing that the approximate value function will converge exponentially to the true value function. Thus, the computation of satisfaction probability is guaranteed.
|
|
16:20-16:40, Paper WeC19.2 | |
>Compositional Planning for Logically Constrained Multi-Agent Markov Decision Processes |
|
Kalagarla, Krishna Chaitanya | University of New Mexico |
Low, Matthew | University of Southern California |
Jain, Rahul | University of Southern California |
Nayyar, Ashutosh | University of Southern California |
Nuzzo, Pierluigi | University of Southern California |
Keywords: Markov processes, Decentralized control, Constrained control
Abstract: Designing control policies for large, distributed systems is challenging, especially in the context of critical, temporal logic based specifications (e.g., safety) that must be met with high probability. Compositional methods for such problems are needed for scalability, yet relying on worst-case assumptions for decomposition tends to be overly conservative. In this work, we use the framework of Constrained Markov Decision Processes (CMDPs) to provide an assume-guarantee based decomposition for synthesizing decentralized control policies, subject to logical constraints in a multi-agent setting. The returned policies are guaranteed to satisfy the constraints with high probability and provide a lower bound on the achieved objective reward. We empirically find the returned policies to achieve near-optimal rewards while enjoying an order of magnitude reduction in problem size and execution time.
|
|
16:40-17:00, Paper WeC19.3 | |
>How to Pass an Audit: Decisions on Sequential Investment |
|
Huang, Ziyuan | University of Michigan |
Biczok, Gergely | Budapest University of Technology and Economics |
Liu, Mingyan | University of Michigan |
Keywords: Markov processes, Stochastic optimal control, Game theory
Abstract: We study the following sequential decision problem: a vendor with a product or process needs to pass a mandatory audit in order to be able to release the product onto the market; it is allowed to go through the audit repeatedly, and thus the vendor needs to determine what level of effort to put into the product (e.g., to enhance its quality/performance) following each failed audit. We examine the vendor's optional decision process and fully characterize its properties under mild technical assumptions. We next examine what happens if the audit is optional, and to incentivize the vendor to voluntarily subject itself to the audit, the auditor offers a waiver from future product liabilities provided the audit is successful. We examine what type of audit might be the most effective in not only incentivizing participation but also more desirable effort from the vendor.
|
|
17:00-17:20, Paper WeC19.4 | |
>Multiparametric Analysis of Multi-Task Markov Decision Processes: Structure, Invariance, and Reducibility |
|
Shin, Jaeuk | Seoul National University |
Yang, Insoon | Seoul National University |
Keywords: Markov processes, Stochastic optimal control, Optimization
Abstract: Modern sequential decision-making problems such as robotic control require on-the-fly adaptation to a reward function encoding a novel task, particularly when there is no luxury of performing costly online planning procedures. In this article, we present a multiparametric linear programming (mp-LP) approach that addresses this problem. The key idea is to characterize the optimal return of a multi-task Markov decision process as the optimal value of an mp-LP problem having reward functions (or vectors) as parameters. This mp-LP characterization enables the simultaneous computation of the optimal returns for multiple reward functions. Our multiparametric analysis provides geometric structures of the mp-LP problem by essentially identifying all polyhedral sets of reward functions that lead to the same optimal policy. This allows us to identify the types of reward function transformations that guarantee the invariance of optimality, effortlessly yielding the potential function-based reward shaping technique as a corollary. Finally, our analysis enables us to address the reducibility problem that seeks the minimal dimensions to capture essential information about any reward function.
|
|
17:20-17:40, Paper WeC19.5 | |
>Predictable Interval MDPs through Entropy Regularization |
|
van Zutphen, Menno Johannes Theodorus Cornelis | Eindhoven University of Technology |
Delimpaltadakis, Giannis | Eindhoven University of Technology |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Markov processes, Uncertain systems, Robust control
Abstract: Regularization of control policies using entropy can be instrumental in adjusting predictability of real-world systems. Applications benefiting from such approaches range from, e.g., cybersecurity, which aims at maximal unpredictability, to human-robot interaction, where predictable behavior is highly desirable. Interval Markov decision processes (IMDPs) are uncertain MDPs, where transition probabilities are only known to belong to intervals. Lately, IMDPs have gained significant popularity in the context of abstracting stochastic systems for control design. In this work, we address robust minimization of the linear combination of entropy and a standard cumulative cost in IMDPs, thus establishing a trade-off between optimality and predictability. We show that optimal deterministic policies exist, and devise a value-iteration algorithm to compute them. The algorithm solves a number of convex programs at each step. Finally, through an illustrative example we show the benefits of penalizing entropy in IMDPs.
|
|
17:40-18:00, Paper WeC19.6 | |
>An Optimal-Control Approach to Infinite-Horizon Restless Bandits: Achieving Asymptotic Optimality with Minimal Assumptions |
|
Yan, Chen | University of Michigan, Ann Arbor |
Keywords: Stochastic optimal control, Markov processes, Optimal control
Abstract: We adopt an optimal-control framework for addressing the undiscounted infinite-horizon discrete-time restless N-armed bandit problem. Unlike most studies that rely on constructing policies based on the relaxed single-armed Markov Decision Process (MDP), we propose relaxing the entire bandit MDP as an optimal-control problem through the certainty equivalence control principle. Our main contribution is demonstrating that the reachability of an optimal stationary state within the optimal-control problem is a sufficient condition for the existence of an asymptotically optimal policy. Such a policy can be devised using an "align and steer" strategy. This reachability assumption is less stringent than any prior assumptions imposed on the arm-level MDP, notably the unichain condition is no longer needed.
|
|
WeC20 |
Suite 9 |
Numerical Algorithms |
Regular Session |
Chair: Sepulchre, Rodolphe | University of Cambridge |
Co-Chair: Reissig, Gunther | University of the Bundeswehr Munich |
|
16:00-16:20, Paper WeC20.1 | |
>Reachable Sets of Homogeneous Polynomial Dynamical Systems Using Exact Solutions |
|
Purohit, Soham Sachin | University of Michigan |
Chen, Can | University of North Carolina at Chapel Hill |
Vasudevan, Ramanarayan | University of Michigan |
Keywords: Numerical algorithms, Algebraic/geometric methods, Computational methods
Abstract: Reachability analysis is a powerful tool to analyze the behavior of dynamical systems. Typically, these tools are used to evaluate whether the dynamics of a system beginning from some initial set reaches some unsafe region of state space in a finite amount of time. To answer this question, these tools often construct over-approximations to the reachable sets of the dynamical systems, which can be overly conservative when applied to arbitrary systems. To address this challenge, this letter develops a novel technique for reachability analysis of Homogeneous Polynomial Dynamical Systems (HPDSs) by computing their exact solutions using tensor theory. In addition, this letter illustrates how to build tight over-approximations of the reachable set for HPDSs with constant control inputs. Simulation results highlight a significant improvement in the accuracy of reachable set estimates compared to established methods for HPDSs.
|
|
16:20-16:40, Paper WeC20.2 | |
>Leveraging Control Inputs to Enforce Constraints in Differential Dynamic Programming for Nonlinear Optimization |
|
Dastan, Zahed | University of New Brunswick |
Sensinger, Jonathon | University of New Brunswick |
Keywords: Numerical algorithms, Control applications, Robotics
Abstract: Differential Dynamic Programming (DDP) has become a popular strategy for optimizing nonlinear dynamic systems due to its algorithmic efficiency and precision in complex control tasks. The goal to integrate inequality constraints into DDP has sparked considerable interest, aiming to extend its utility to more demanding situations with strict operational constraints. This study provides an extension to the conventional control-limited DDP framework, introducing a methodology that leverages control inputs during the backward pass to incorporate inequality constraints. Our methodology enhances the efficiency of DDP and expedites its convergence in a variety of scenarios. We present our method in two variants: the first handles inequality constraints that are functions of both state and control variables, and the second leverages the concept of relative degree from nonlinear control theory to handle inequality constraints that are solely dependent on state variables. Through simulations on an inverted pendulum and a nonholonomic 2D car, we benchmark our approach against established methods such as Constrained DDP (CDDP) and primal–dual interior-point DDP (IPDDP). The results showcase our method's superior convergence rate and trajectory efficiency, particularly highlighting the efficacy of employing control inputs for constraint enforcement.
|
|
16:40-17:00, Paper WeC20.3 | |
>A Parallel in Time Algorithm Based on ParaExp for Optimal Control Problems |
|
Kwok, Felix | Université Laval |
Tognon, Djahou N. | INRIA Paris, Sorbonne Université |
Keywords: Numerical algorithms, Distributed control
Abstract: We propose a new parallel-in-time algorithm for solving optimal control problems constrained by discretized partial differential equations. Our approach, which is based on a deeper understanding of ParaExp, considers an overlapping time-domain decomposition in which we combine the solution of homogeneous problems using exponential propagation with the local solutions of inhomogeneous problems. The algorithm yields a linear system whose matrix-vector product can be fully performed in parallel. We then propose a preconditioner to speed up the convergence of GMRES in the special cases of the heat and wave equations. Numerical experiments are provided to illustrate the efficiency of our preconditioners.
|
|
17:00-17:20, Paper WeC20.4 | |
>Second Order Approximation of Reachable Sets of LTI Systems |
|
Reissig, Gunther | University of the Bundeswehr Munich |
Keywords: Numerical algorithms, Uncertain systems, Formal Verification/Synthesis
Abstract: We present a novel method to approximate reachable sets at time points, of continuous-time LTI systems, in which initial states are subject to compact convex uncertainty and the input may arbitrarily vary over time within a zonotopic uncertainty set. We prove a priori bounds on the approximation error, which are of second order depending on a discretization parameter and can be used to subsequently obtain over- and under-approximations rather than mere approximations. In contrast to competing approaches, our method does not iteratively propagate over- or under-approximations, and it does not reduce the complexity of any of the zonotopes internally produced at intermediate stages. We compare the performance of our method to that of competing approaches on examples.
|
|
17:20-17:40, Paper WeC20.5 | |
>Guaranteed Pseudospectral Sequential Convex Programming for Accurate Solutions to Constrained Optimal Control Problems |
|
Yamamoto, Keitaro | Kyoto University |
Fujimoto, Kenji | Kyoto University |
Maruta, Ichiro | Kyoto University |
Keywords: Optimal control, Numerical algorithms
Abstract: This paper proposes an algorithm for solving finite-time nonlinear optimal control problems. The proposed method employs the Gauss pseudospectral method to transform the optimal control problem into a nonlinear programming problem, and sequential convex programming (SCP) to solve it. Furthermore, by applying the information of the solution obtained by SCP to the indirect shooting method, a more accurate optimal solution can be obtained. There was an attempt to solve a similar class of optimal control problems, but it was only applicable to a restrictive class of problems without state constraints. In contrast, the proposed method can solve a general class of optimal control problems, including those with state constraints, while ensuring the numerical stability of the algorithm. This objective is achieved without losing the numerical stability of the algorithm by introducing a slack variable and incorporating state constraints into the dynamics. Additionally, the proposed method guarantees quadratic convergence by appropriately limiting the update step size of the optimization variables. To demonstrate the effectiveness of the proposed method, we apply the proposed method to an L1/L2-optimal control problem of a two-wheeled rover.
|
|
17:40-18:00, Paper WeC20.6 | |
>An Operator-Theoretic Framework to Simulate Neuromorphic Circuits |
|
Shahhosseini, Amir | KU Leuven |
Chaffey, Thomas | University of Cambridge |
Sepulchre, Rodolphe | University of Cambridge |
Keywords: Large-scale systems, Numerical algorithms, Optimization
Abstract: Splitting algorithms are well-established in convex optimization and are designed to solve large-scale problems. Using such algorithms to simulate the behavior of nonlinear circuit networks provides scalable methods for the simulation and design of neuromorphic systems. For circuits made of linear capacitors and inductors with nonlinear resistive elements, we propose a splitting that breaks the network into its LTI lossless component and its static resistive component. This splitting has both physical interpretability and algorithmic tractability and allows for separate calculations in the time domain and in the frequency domain. To demonstrate the scalability of this approach, a network made from one hundred neurons modeled by the FitzHugh-Nagumo circuit with all-to-all diffusive coupling is simulated.
|