| |
Last updated on December 12, 2024. This conference program is tentative and subject to change
Technical Program for Tuesday December 17, 2024
|
TuP1 Plenary Session, Auditorium |
Add to My Program |
Towards Safe and Resilient Autonomy Using Synergistic Control, Observation
and Learning |
|
|
Chair: Zaccarian, Luca | LAAS-CNRS |
Co-Chair: Prandini, Maria | Politecnico Di Milano |
|
08:30-09:30, Paper TuP1.1 | Add to My Program |
Towards Safe and Resilient Autonomy Using Synergistic Control, Observation and Learning |
|
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Cyber-Physical Security, Robotics
Abstract: Enabling autonomy for robotic and cyber-physical systems with provable safety and resilience guarantees has been an ongoing area of research. Despite significant progress over the years, there are still open challenges due to constraints (e.g., safety and time specifications; sensing, computation and communication limitations), and environmental uncertainty. This plenary talk will present some of our recent results and ongoing work on a framework that interconnects control, planning and learning methods towards provably-correct safety-critical robotic and aerospace systems under constraints and uncertainty.
|
|
TuA01 Tutorial Session, Auditorium |
Add to My Program |
Half a Century of Multi-Pass Systems: Iterative Learning Control and
Repetitive Processes - a Retrospective Tutorial |
|
|
Chair: Moore, Kevin L. | Colorado School of Mines |
Co-Chair: Rogers, Eric | University of Southampton |
Organizer: Moore, Kevin L. | Colorado School of Mines |
Organizer: Rogers, Eric | University of Southampton |
Organizer: Tan, Ying | University of Melbourne |
Organizer: Oomen, Tom | Eindhoven University of Technology |
Organizer: Chu, Bing | University of Southampton |
Organizer: Wu, Yuxin | Beihang University (BUAA) |
|
10:00-10:20, Paper TuA01.1 | Add to My Program |
Iterative Learning Control — Algorithms, Applications and Future Research Directions (I) |
|
Rogers, Eric | University of Southampton |
Chu, Bing | University of Southampton |
Moore, Kevin L. | Colorado School of Mines |
Oomen, Tom | Eindhoven University of Technology |
Tan, Ying | University of Melbourne |
Keywords: Iterative learning control, Adaptive systems, Learning
Abstract: This paper gives a tutorial on iterative learning control nearly five decades after what is widely regarded as the first substantive paper in the literature. The focus is on algorithm development under a number of general headings (linear, optimization, frequency domain, and nonlinear), together with supporting experimental validation/industrial applications and also applications in healthcare.
|
|
10:20-10:40, Paper TuA01.2 | Add to My Program |
Fundamentals of Multi-Pass Systems (I) |
|
Rogers, Eric | University of Southampton |
|
|
10:40-11:00, Paper TuA01.3 | Add to My Program |
Frequency Domain Methods in ILC (I) |
|
Oomen, Tom | Eindhoven University of Technology |
Keywords: Iterative learning control, Adaptive systems, Learning
Abstract: Controller design depends on the particular application at hand. The aim of this tutorial is to provide an overview of the design, analysis, and applications of frequency-domain iterative learning controller (ILC) design. The method is based on frequency response functions, hence it is particularly suitable for systems that are typically modelled in the frequency domain. It is shown how to develop ILC algorithms that converge in a fast and robust way to the limits of the system performance. Indeed, the achievable performance is up to measurement noise level.
|
|
11:00-11:20, Paper TuA01.4 | Add to My Program |
Norm-Optimal ILC (I) |
|
Chu, Bing | University of Southampton |
Keywords: Iterative learning control, Adaptive systems, Learning
Abstract: This presentation discusses how to design optimization-based ILC algorithms, with an emphasis on the norm-optimal paradigm. An experimental case study of a robotic arm is used to illustrate the effectiveness of the ideas.
|
|
11:20-11:40, Paper TuA01.5 | Add to My Program |
Nonlinear ILC (I) |
|
Tan, Ying | University of Melbourne |
Keywords: Iterative learning control, Adaptive systems, Learning
Abstract: This paper introduces the concept of using composite energy functions to address nonlinear dynamic systems. To illustrate these ideas, we consider a nonlinear dynamic system that satisfies the global Lipschitz continuity condition. Additionally, the paper explores applications in healthcare, with a focus on stroke rehabilitation.
|
|
11:40-12:00, Paper TuA01.6 | Add to My Program |
Emerging Areas in ILC (I) |
|
Chu, Bing | University of Southampton |
Wu, Yuxin | Beijing Institute of Technology |
|
|
TuA02 Invited Session, Amber 1 |
Add to My Program |
Learning and Control Approaches for Human-AI Collaboration |
|
|
Chair: Dave, Aditya Deepak | Cornell University |
Co-Chair: Malikopoulos, Andreas A. | Cornell University |
Organizer: Dave, Aditya Deepak | Cornell University |
Organizer: Malikopoulos, Andreas A. | Cornell University |
|
10:00-10:20, Paper TuA02.1 | Add to My Program |
Optimal Function and Attention Allocation for Human-AI Collaboration Using Computational Cognition-Work Model (I) |
|
Byeon, Sooyung | Purdue University |
Tian, Danyang | Honda Research Institute USA, Inc |
Ayoub, Jackie | Honda Research Institute USA |
Song, Miao | Honda Research Institute USA |
Moradi Pari, Ehsan | Honda Research Institute USA, Inc |
Hwang, Inseok | Purdue University |
Keywords: Human-in-the-loop control, Modeling
Abstract: The paper presents a computational model-based optimization framework for function and attention allocation in collaborative control and decision-making between a human and artificial intelligence (AI). Effective human-AI collaboration (HAC) may depend on structured adaptive function allocation among team members to enhance performance while managing human cognitive limitations, especially attention. Integrating attention allocation is vital for maintaining situation awareness and managing human workload. Various allocation methods rely on heuristics and experimental studies that demand significant resources and domain expertise. To address the function and attention allocation problem in HAC in a systematic way, we propose a computational cognition-work model (CCWM)-based framework. The framework can integrate a qualitative work model and cognitive models to simulate complex team dynamics in temporal semantics. An optimization technique can then improve any task-oriented metrics by exploring the team structure and simulated episodes without requiring exhaustive experimental studies. We present numerical evaluations to demonstrate the proposed framework using a disaster relief drone fleet operation scenario, which provides valuable insights into the early stages of HAC design and the broader domain of HAC.
|
|
10:20-10:40, Paper TuA02.2 | Add to My Program |
Look-Ahead Analysis of Mixed Traffic Flows on Rings |
|
Baldi, Simone | Southeast University |
Li, Jing | Southeast University |
Liu, Di | Imperial College London |
Keywords: Autonomous vehicles, Transportation networks, Human-in-the-loop control
Abstract: Mixed traffic flows on rings are flows composed of human-driven vehicles and automated vehicles driving on a circular path. A ring is meant to approximate long strings with automated vehicles sparsely inserted into the traffic. Different from existing ring analysis methods based on the equilibrium inter-vehicle distance and equilibrium velocity, this work presents an analysis based on the spacing error and relative velocity error of each vehicle with respect to its predecessor. We refer to such analysis as look-ahead analysis. The benefit of the proposed look-ahead analysis is to directly reflect the feedback variables needed to perform the driving task: indeed, no matter if estimated using visual feedback or available through on-board sensing, the human-driven and automated vehicles exploit feedback variables measured by each vehicle with respect to the preceding one. The analysis focuses on stability of the flow, controllability via the automated vehicle, and impact of disturbances on the flow.
|
|
10:40-11:00, Paper TuA02.3 | Add to My Program |
A Framework for Effective AI Recommendations in Cyber-Physical-Human Systems |
|
Dave, Aditya Deepak | Cornell University |
Bang, Heeseung | University of Delaware |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Human-in-the-loop control, Stochastic optimal control, Emerging control applications
Abstract: Many cyber-physical-human systems (CPHSs) involve a human decision-maker who acts using recommendations from an artificial intelligence (AI) platform. In such CPHS applications, the human decision-maker may depart from an optimal recommended decision and instead implement a different one for various reasons, resulting in a loss in performance. In this letter, we develop a rigorous framework to overcome this challenge. In our framework, humans may deviate from AI recommendations as they interpret the system's state differently to the AI platform. We establish the structural properties of optimal recommendation strategies and develop an approximate human model (AHM) used by the AI. We provide theoretical bounds on the optimality gap that arises from an AHM and illustrate the efficacy of our results in a numerical example.
|
|
11:00-11:20, Paper TuA02.4 | Add to My Program |
Identifying Hate Speech Peddlers in Online Platforms. a Bayesian Social Learning Approach for Large Language Model Driven Decision-Makers (I) |
|
Jain, Adit | Cornell University |
Krishnamurthy, Vikram | Cornell University |
Keywords: Autonomous systems, Stochastic optimal control, Neural networks
Abstract: This paper studies the problem of autonomous agents performing Bayesian social learning for sequential detection when the observations of the state belong to a high-dimensional space and are expensive to analyze. Specifically, when the observations are textual, the Bayesian agent can use a large language model (LLM) as a map to get a low-dimensional private observation. The agent performs Bayesian learning and takes an action that minimizes the expected cost and is visible to subsequent agents. We prove that a sequence of such Bayesian agents herd in finite time to the public belief and take the same action disregarding the private observations. We propose a stopping time formulation for quickest time herding in social learning and optimally balance privacy and herding. Structural results are shown on the threshold nature of the optimal policy to the stopping time problem. We illustrate the application of our framework when autonomous Bayesian detectors aim to sequentially identify if a user is a hate speech peddler on an online platform by parsing text observations using an LLM. We numerically validate our results on real-world hate speech datasets. We show that autonomous Bayesian agents designed to flag hate speech peddlers in online platforms herd and misclassify the users when the public prior is strong. We also numerically show the effect of a threshold policy in delaying herding.
|
|
11:20-11:40, Paper TuA02.5 | Add to My Program |
Enhancing Human-In-The-Loop Learning for Binary Sentiment Word Classification |
|
Martin Urcelay, Belen | Georgia Institute of Technology |
Rozell, Christopher | Georgia Institute of Technology |
Bloch, Matthieu | Georgia Institute of Technology |
Keywords: Human-in-the-loop control, Machine learning, Learning
Abstract: While humans intuitively excel at classifying words according to their connotation, transcribing this innate skill into algorithms remains challenging. We present a human-guided methodology to learn binary word sentiment classifiers from fewer interactions with humans. We introduce a human perception model that relates the perceived sentiment of a word to the distance between the word and the unknown classifier. Our model informs the design of queries that capture more nuanced information than traditional queries solely requesting labels. Together with active learning strategies, our approach reduces human effort without sacrificing learning fidelity. We validate our method through experiments with human data, demonstrating improved accuracy in binary sentiment word classification.
|
|
11:40-12:00, Paper TuA02.6 | Add to My Program |
Motion Planning with Adversary Avoidance Using Gaussian Process Classification (I) |
|
Netter, Josh | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Automotive systems, Autonomous vehicles, Flight control
Abstract: In this paper, we propose an optimal motion planning framework to allow a player agent to navigate a multi-agent environment while simultaneously identifying and avoiding potential adversaries attempting to intercept it. The behavior of an adversarial agent pursuing the player agent is defined, and a method for identifying these adversaries in real-time using Gaussian process classification is formulated. A method of real-time path replanning is then proposed and proven to avoid collisions with likely adversarial agents. The algorithm includes functionality to identify non-adversarial independent agents to avoid unnecessary replanning or the freezing robot problem. The efficacy is then shown in simulations.
|
|
TuA03 Invited Session, Amber 2 |
Add to My Program |
Seeking and Leveraging Nash Equilibria in Large Learning-Agent Populations |
|
|
Chair: Martins, Nuno C. | University of Maryland |
Co-Chair: Pavel, Lacra | University of Toronto |
Organizer: Martins, Nuno C. | University of Maryland |
|
10:00-10:20, Paper TuA03.1 | Add to My Program |
Learning Nash Equilibria in Large Populations with Constrained Strategy Switching |
|
Kara, Semih | University of Illinois at Urbana-Champaign |
Martins, Nuno C. | University of Maryland |
Keywords: Game theory, Networked control systems, Decentralized control
Abstract: We consider a large population of learning agents that interact noncooperatively by selecting strategies from a common set. Each strategy has a payoff, assigned by a strictly concave potential game. The agents repeatedly revise their strategies according to a learning rule that models how they seek alternatives with higher payoffs. Our objective is to determine when the population learns the Nash equilibrium of the game, meaning its strategy profile asymptotically converges to this equilibrium. Unlike previous work assuming unrestricted strategy switching, here we tackle the case where only certain strategies are accessible from certain others, characterized by a strategy graph that is connected but possibly incomplete. Through Lyapunov's method, we prove that modifications based on KL-divergence to either the payoffs or the learning rules ensure the strategy profile's near-global convergence to the Nash equilibrium. We highlight the practical significance of our findings and provide a numerical validation.
|
|
10:20-10:40, Paper TuA03.2 | Add to My Program |
Passivity-Based Gradient-Play Dynamics for Distributed GNE Seeking Via Parallel Feedforward Compensation (I) |
|
Li, Weijian | University of Toronto |
Pavel, Lacra | University of Toronto |
Keywords: Agents-based systems, Game theory, Nonlinear systems
Abstract: We consider seeking generalized Nash equilibria for games with coupled nonlinear constraints over networks. We first revisit a well-known gradient-play dynamics to solve the problem from a passivity-based perspective, and address that the strict monotonicity on pseudo-gradients is a critical assumption to guarantee its convergence. Then we develop a novel passivity-based gradient-play dynamics by introducing parallel feedforward compensators. We prove that the dynamics achieves asymptotic convergence in merely monotone regimes. Moreover, in the absence of coupled constraints, we surprisingly find that the dynamics can deal with hypomonotone games.
|
|
10:40-11:00, Paper TuA03.3 | Add to My Program |
Generalized Nash Equilibrium Seeking in a Class of Contractive Population Games Over Networks (I) |
|
Martinez-Piazuelo, Juan | Universitat Politècnica De Catalunya |
Ocampo-Martinez, Carlos | Universitat Politècnica De Catalunya (UPC) |
Quijano, Nicanor | Universidad De Los Andes |
Keywords: Game theory, Optimization, Large-scale systems
Abstract: In this paper, we consider the problem of generalized Nash equilibrium (GNE) seeking in a class of contractive population games under a partial-decision information setup and subject to affine equality constraints. Namely, we consider multiple populations, each comprised of a large number of payoff-driven decision makers, and we embed a network topology ruling the exchange of information between the multiple populations. Conceptually, we consider that each population has an associated payoff provider entity, which yields the payoff signals to the agents of its corresponding population. The multiple payoff providers communicate through a (possibly non-complete) network to estimate the non-local information relevant to compute the payoff signals. As the main contribution, we formulate the dynamics of the payoff providers and we provide sufficient conditions to guarantee the asymptotic stability of the set of generalized Nash equilibria of the underlying game. To the best of our knowledge, this is the first paper to address the problem of GNE seeking in population games under partial-decision information.
|
|
11:00-11:20, Paper TuA03.4 | Add to My Program |
Learning Communities from Equilibria of Nonlinear Opinion Dynamics (I) |
|
Xing, Yu | KTH Royal Institute of Technology |
Bizyaeva, Anastasia | Cornell University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Network analysis and control, Identification, Agents-based systems
Abstract: This paper studies community detection for a nonlinear opinion dynamics model from its equilibria. It is assumed that the underlying network is generated from a stochastic block model with two communities, where agents are assigned with community labels and edges are added independently based on these labels. Agents update their opinions following a nonlinear rule that incorporates saturation effects on interactions. It is shown that clustering based on a single equilibrium can detect most community labels (i.e., achieving almost exact recovery), if the two communities differ in size and link probabilities. When the two communities are identical in size and link probabilities, and the inter-community connections are denser than intra-community ones, the algorithm can achieve almost exact recovery under negative influence weights but fails under positive influence weights. Utilizing fixed point equations and spectral methods, we also propose a detection algorithm based on multiple equilibria, which can detect communities with positive influence weights. Numerical experiments demonstrate the performance of the proposed algorithms.
|
|
11:20-11:40, Paper TuA03.5 | Add to My Program |
Credit vs. Discount-Based Congestion Pricing: A Comparison Study |
|
Chiu, Chih-Yuan | University of California, Berkeley |
Jalota, Devansh | Stanford University |
Pavone, Marco | Stanford University |
Keywords: Transportation networks, Game theory, Optimization
Abstract: Congestion pricing is a promising traffic congestion management policy, but has also been criticized for placing outsized financial burdens on low-income users. Credit-based congestion pricing (CBCP) and discount-based congestion pricing (DBCP) policies, which respectively provide travel credits and toll discounts to subsidize low-income users' access to tolled roads, are promising mechanisms for reducing traffic congestion without worsening societal inequities. However, the optimal design and relative merits of CBCP and DBCP policies remain poorly understood. This work studies the effects of deploying CBCP and DBCP policies to route users on multi-lane highway networks with tolled express lanes. We formulate a non-atomic routing game in which a subset of eligible users is granted toll relief via a fixed budget or toll discount, while the remaining ineligible users must pay out-of-pocket. We prove that Nash equilibrium traffic flow patterns exist under any CBCP or DBCP policy. For the setting in which eligible users have time-invariant values of time (VoTs), we provide a convex program to efficiently compute these equilibria. Moreover, for single-edge networks, we establish conditions under which DBCP policies outperform CBCP policies in improving eligible users' express lane access, an equity objective often neglected by existing congestion pricing schemes that price low-income users out of express lanes. Specifically, we identify user and network-dependent parameters that play a key role in determining whether DBCP or CBCP policies are more effective at expanding eligible users' express lane access. Finally, we present empirical results from a CBCP pilot study of the San Mateo 101 Express Lane Project in California. Our empirical results corroborate our theoretical analysis of the impact of deploying credit-based and discount-based policies.
|
|
11:40-12:00, Paper TuA03.6 | Add to My Program |
Optimal Seeding in Large-Scale Super-Modular Network Games |
|
Messina, Sebastiano | Politecnico Di Torino |
Cianfanelli, Leonardo | Politecnico Di Torino |
Como, Giacomo | Politecnico Di Torino |
Fagnani, Fabio | Politecnico Di Torino |
Keywords: Game theory, Network analysis and control, Control of networks
Abstract: We study optimal seeding problems for binary super-modular network games. The system planner’s objective is to design a minimal cost seeding guaranteeing that at least a predefined fraction of the players adopt a certain action in every Nash equilibrium. Since the problem is known to be NPhard and its exact solution would require full knowledge of the network structure, we focus on approximate solutions for large-scale networks with given statistics. In particular, we build on a local mean-field approximation of the linear threshold dynamics that is known to hold true on large-scale locally treelike random networks. We first reduce the optimal intervention design problem to a linear program with an infinite set of constraints. We then show how to approximate the solution of the latter by standard linear programs with finitely many constraints. Our solutions are then numerically validated.
|
|
TuA04 Invited Session, Amber 3 |
Add to My Program |
Cyber-Physical Systems: Resilience, Cybersecurity, and Privacy I: Attack
Design and Cybersecurity |
|
|
Chair: Ferrari, Riccardo M.G. | Delft University of Technology |
Co-Chair: Sadabadi, Mahdieh S. | University of Manchester |
Organizer: Selvi, Daniela | Università Di Pisa |
Organizer: Sadabadi, Mahdieh S. | The University of Manchester |
Organizer: Murguia, Carlos | Eindhoven University of Technology |
Organizer: Ferrari, Riccardo M.G. | Delft University of Technology |
Organizer: Soudjani, Sadegh | Max Planck Institute for Software Systems |
|
10:00-10:20, Paper TuA04.1 | Add to My Program |
A Quantal Response Analysis of Defender-Attacker Sequential Security Games (I) |
|
Azim, Md Reya Shad | Purdue University Indianapolis |
Abdallah, Mustafa | Purdue University in Indianapolis |
Keywords: Game theory, Cyber-Physical Security, Mean field games
Abstract: We explore a scenario involving two sites and a sequential game between a defender and an attacker, where the defender is responsible for securing the sites while the attacker aims to attack them. Each site holds a loss value for the defender when compromised, along with a probability of successful attack. The defender can reduce these probabilities through security investments at each site. The attacker's objective is to target the site that maximizes the expected loss for the defender, taking into account the defender's security investments. While previous studies have examined security investments in such scenarios, our work investigates the impact of bounded rationality exhibited by the defender, as identified in behavioral economics. Specifically, we consider quantal behavioral bias, where humans make errors in selecting efficient (pure) strategies. We demonstrate the existence of a quantal response equilibrium in our sequential game and analyze how this bias affects the defender's choice of optimal security investments. Additionally, we quantify the inefficiency of equilibrium investments under quantal decision-making compared to an optimal solution devoid of behavioral biases. We provide numerical simulations to validate our main findings.
|
|
10:20-10:40, Paper TuA04.2 | Add to My Program |
Optimal Controller Realizations against False Data Injections in Cooperative Driving (I) |
|
Huisman, Mischa | Eindhoven University of Technology |
Murguia, Carlos | Eindhoven University of Technology |
Lefeber, Erjen | Eindhoven University of Technology |
Van De Wouw, Nathan | Eindhoven University of Technology |
Keywords: Cyber-Physical Security, Resilient Control Systems, Autonomous vehicles
Abstract: To enhance the robustness of cooperative driving to cyberattacks, we study a controller-oriented approach to mitigate the effect of a class of False-Data Injection (FDI) attacks. By reformulating a given dynamic Cooperative Adaptive Cruise Control scheme (the base controller), we show that a class of new but equivalent controllers (base controller realizations) can represent the base controller. This controller class exhibits the same platooning behavior in the absence of attacks, but in the presence of attacks, their robustness varies with the realization. We propose a prescriptive synthesis framework where the base controller and the system dynamics are written in new coordinates via an invertible coordinate transformation on the controller state. Because the input-output behavior is invariant under coordinate transformations, the input-output behavior is unaffected (so controller realizations do not change the system's closed-loop performance). However, each controller realization may require a different combination of sensors. Subsequently, we obtain the optimal combination of sensors that minimizes the effect of FDI attacks by solving a linear matrix inequality while quantifying the FDI's attack impact through reachability analysis. Through simulation studies, we demonstrate that this approach enhances the robustness of cooperative driving without relying on a detection scheme and maintaining all system properties.
|
|
10:40-11:00, Paper TuA04.3 | Add to My Program |
Structural Conditions for Leak Localization in Potential Flow Networks (I) |
|
Molnö, Victor | KTH Royal Insitute of Technology |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Fault diagnosis, Smart cities/houses
Abstract: In this paper, we analyze the leak localization problem in potential flow networks. We present a general model encompassing various physical systems, for example, water distribution networks. In contrast to conventional water distribution network models, our model is neither restricted to any particular type of potential loss function nor to vertex leaks. We consider leak localization via vertex potential analysis, a general description of methods that work by comparing vertex potentials calculated under a leak location hypothesis to measured vertex potentials. We derive conditions on the graph structure of the network and the potential sensor placement, under which it is guaranteed that the leak can be limited, via vertex potential analysis, to a small set of locations. We suggest a bisection method to utilize our conditions. Our conditions are based on one-way edges in graphs, a concept that we introduce. In extension, our results can be used for sensor placement.
|
|
11:00-11:20, Paper TuA04.4 | Add to My Program |
Unpredictable Switching for Cyber-Physical Security against Worst-Case Attackers (I) |
|
Kanellopoulos, Aris | KTH Royal Institute of Technology |
Fotiadis, Filippos | The University of Texas at Austin |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Cyber-Physical Security, Switched systems, Game theory
Abstract: In this paper, we investigate the problem of securing a system against actuator attacks. Specifically, we employ an unpredictability-based defense algorithm according to the principles of Moving Target Defense, while explicitly considering the worst-case attack that can be launched into the system. Consequently, we allow the system to alternate between different actuating modes, but also between optimal and robust control schemes. Via game-theoretic methods, we derive mixed strategies dictating the probabilities of utilizing different controllers for a given mode of operation, whereas an entropy-augmented optimization problem gives the probabilities of using these different modes. Finally, the overall stability of the system is analyzed, and simulation results showcase the efficacy of our proposed method.
|
|
11:20-11:40, Paper TuA04.5 | Add to My Program |
Stealthy Cyber Attack Filter and Its Impact on Control of Grid-Forming Inverter |
|
Rezaeizadeh, Amin | FHNW |
Smith, Roy S. | ETH Zurich |
Mastellone, Silvia | University of Applied Science Northwestern Switzerland FHNW |
Keywords: Cyber-Physical Security, Power systems, Robust control
Abstract: Sustainability and environmental requirements have driven the shift in electric grids towards renewable and distributed generation, characterized by inverter-based resources (IBRs) including grid-forming inverters (GFMIs). These inverters are critical components of the energy infrastructure, and their cyber-security is crucial with the growing standardization of grid support services. Analyzing the vulnerability of the converter to cyber-attacks is the first step towards developing strategies for a cyber-secure system. This paper presents a novel filter scheme for generating the smallest stealthy damaging false data injection (FDI) undetectable by model-based intrusion detection methods. Simulation results verify the undetectability and effectiveness of the designed FDI method in destabilizing currents and voltages at the Point of Common Coupling.
|
|
11:40-12:00, Paper TuA04.6 | Add to My Program |
Unknown Input Observers Breaking Confidentiality of Controller States (I) |
|
Breukelman, Christian Enno | KTH Royal Institute of Technology |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Cyber-Physical Security, Control Systems Privacy, Kalman filtering
Abstract: Driven by ubiquitous digitalization and cyberattacks on critical infrastructure, there is a high interest in research on the security of cyber-physical systems. If an attacker gains access to protected and sensitive information, such as the internal states of a control system, this is considered a breach of confidentiality. Access to sensitive information can be the first step in a larger cyber-attack scheme, such as a stealthy false data injection attack. Considering process and measurement noise in the plant, existing research investigated when an attacker equipped with a Kalman filter can perfectly estimate the internal controller states if the attacker has access to plant measurements and all model parameters. For this estimate to converge, the controller is required to have stable poles. In this paper, we show that if the attacker has access to the control inputs instead of the plant measurements, the controller needs to have stable zeros. Additionally, we demonstrate that an attacker equipped with an Unknown Input Observer, using tools from delayed system inversion, can get a delayed yet perfect estimate of the controller states from the control inputs without knowledge of the plant's parameters and noise characteristics. Lastly, we present simulation results from a three-tank system to showcase the differences in controller state estimation.
|
|
TuA05 Invited Session, Amber 4 |
Add to My Program |
Projected Dynamics in Control |
|
|
Chair: Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Co-Chair: Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Organizer: Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Organizer: Cortes, Jorge | University of California, San Diego |
|
10:00-10:20, Paper TuA05.1 | Add to My Program |
Hybrid Integrator-Gain System Based Integral Resonant Controllers for Negative Imaginary Systems (I) |
|
Shi, Kanghong | Australian National University |
Petersen, Ian R. | Australian National University |
Keywords: Switched systems, Robust control, Uncertain systems
Abstract: We introduce a hybrid control system called a hybrid integrator-gain system (HIGS) based integral resonant controller (IRC) to stabilize negative imaginary (NI) systems. A HIGS-based IRC has a similar structure to an IRC, with the integrator replaced by a HIGS. We show that a HIGS-based IRC is an NI system. Also, for a SISO NI system with a minimal realization, we show there exists a HIGS-based IRC such that their closed-loop interconnection is asymptotically stable. Also, we propose a proportional-integral-double-integral resonant controller and a HIGS-based proportional-integral-double-integral resonant controller, and we show that both of them can be applied to asymptotically stabilize an NI system. An example is provided to illustrate the proposed results.
|
|
10:20-10:40, Paper TuA05.2 | Add to My Program |
Advanced Safety Filter for Smooth Transient Operation of a Battery Energy Storage System (I) |
|
Schneeberger, Michael | ETH Zürich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Mastellone, Silvia | University of Applied Science Northwestern Switzerland FHNW |
Keywords: Lyapunov methods, Constrained control, Smart grid
Abstract: This paper presents an advanced safety filter based on Control Barrier and Control Lyapunov Functions to smoothly limit the current of an inverter-based Battery Energy Storage System, avoiding converter tripping, operational loss, and potential component failure. The task involves finding Control Barrier and Control Lyapunov Function via Sum-of-Squares optimization to ensure feasibility of the safety filter at every state along the trajectory. We showcase the effectiveness of the implementation through simulations involving a load step at the Point of Common Coupling, and we compare the outcomes with those obtained using a standard vector current controller.
|
|
10:40-11:00, Paper TuA05.3 | Add to My Program |
Filtering in Projection-Based Integrators for Improved Phase Characteristics (I) |
|
Chu, Hoang | Eindhoven University of Technology |
van den Eijnden, Sebastiaan | Eindhoven University of Technology |
Heertjes, Marcel | Eindhoven University of Technology |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Keywords: Hybrid systems, Switched systems, Mechatronics
Abstract: Projection-based integrators are effectively employed in high-precision systems with growing industrial success. By utilizing a projection operator, the resulting projection-based integrator keeps its input-output pair within a designated sector set, leading to unique freedom in control design that can be directly translated into performance benefits. This paper aims to enhance projection-based integrators by incorporating well-crafted linear filters into its structure, resulting in a new class of projected integrators that includes the earlier ones, such as the hybrid-integrator gain systems (HIGS) as special cases. The extra design freedom in the form of two filters in the input paths to the projection operator and the internal dynamics allows the controller to break away from the inherent limitations of a linear control design. The enhanced performance properties of the proposed structure are formally demonstrated through i) a describing function analysis, ii) mitigation of the gain-loss problem (occurring in e.g., HIGS), and iii) numerical case studies showcasing improved time-domain properties. The describing function analysis is supported by rigorously showing incremental properties of the new filtered projection-based integrators, thereby guaranteeing the steady-state responses to be unique and asymptotically stable.
|
|
11:00-11:20, Paper TuA05.4 | Add to My Program |
Characterization of the Dynamical Properties of Safety Filters for Linear Planar Systems (I) |
|
Chen, Yiting | Boston University |
Mestres, Pol | University of California, San Diego |
Dall'Anese, Emiliano | Boston University |
Cortes, Jorge | University of California, San Diego |
Keywords: Stability of nonlinear systems, Autonomous systems
Abstract: This paper studies the dynamical properties of closed-loop systems obtained from control barrier function-based safety filters. We provide a sufficient and necessary condition for the existence of undesirable equilibria and show that the Jacobian matrix of the closed-loop system evaluated at an undesirable equilibrium always has a nonpositive eigenvalue. In the special case of linear planar systems and ellipsoidal obstacles, we give a complete characterization of the dynamical properties of the corresponding closed-loop system. We show that for underactuated systems, the safety filter always introduces a single undesirable equilibrium, which is a saddle-point. We prove that all trajectories outside the global stable manifold of such equilibrium converge to the origin. In the fully actuated case, we discuss how the choice of nominal controller affects the stability properties of the closed-loop system. Various simulations illustrate our results.
|
|
11:20-11:40, Paper TuA05.5 | Add to My Program |
Online Feedback Optimization Over Networks: A Distributed Model-Free Approach (I) |
|
Wang, Wenbin | EPFL |
He, Zhiyu | ETH Zurich |
Belgioioso, Giuseppe | ETH Zürich |
Bolognani, Saverio | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Optimization, Optimization algorithms, Control of networks
Abstract: Online feedback optimization (OFO) enables optimal steady-state operations of a physical system by employing an iterative optimization algorithm as a dynamic feedback controller. When the plant consists of several interconnected sub-systems, centralized implementations become impractical due to the heavy computational burden and the need to pre-compute system-wide sensitivities, which may not be easily accessible in practice. Motivated by these challenges, we develop a fully distributed model-free OFO controller, featuring consensus-based tracking of the global objective value and local iterative (projected) updates that use stochastic gradient estimates. We characterize how the closed-loop performance depends on the size of the network, the number of iterations, and the level of accuracy of consensus. Numerical simulations on a voltage control problem in a direct current power grid corroborate the theoretical findings.
|
|
11:40-12:00, Paper TuA05.6 | Add to My Program |
Stability Mechanisms for Predictive Safety Filters |
|
Milios, Elias Lido Celestino | Robert Bosch GmbH |
Wabersich, Kim Peter | Robert Bosch GmbH |
Berkel, Felix | Robert Bosch GmbH |
Schwenkel, Lukas | University of Stuttgart |
Keywords: Predictive control for nonlinear systems, Constrained control, Intelligent systems
Abstract: Predictive safety filters enable the integration of potentially unsafe learning-based control approaches and humans into safety-critical systems. In addition to simple constraint satisfaction, many control problems involve additional stability requirements that may vary depending on the specific use case or environmental context. In this work, we address this problem by augmenting predictive safety filters with stability guarantees, ranging from bounded convergence to uniform asymptotic stability. The proposed framework extends well-known stability results from model predictive control (MPC) theory while supporting commonly used design techniques. As a result, straightforward extensions to dynamic trajectory tracking problems can be easily adapted, as outlined in this article. The practicality of the framework is demonstrated using an automotive advanced driver assistance scenario, involving a reference trajectory stabilization problem.
|
|
TuA06 Regular Session, Amber 5 |
Add to My Program |
Network Analysis and Control I |
|
|
Chair: Tassiulas, Leandros | Yale University |
Co-Chair: Luo, Rui | City University of Hong Kong |
|
10:00-10:20, Paper TuA06.1 | Add to My Program |
Dissipativity-Based Decentralized Control and Topology Co-Design for Vehicular Platoons with Disturbance String Stability |
|
Song, Zihao | University of Notre Dame |
Welikala, Shirantha | Stevens Institute of Technology |
Antsaklis, Panos J. | University of Notre Dame |
Lin, Hai | University of Notre Dame |
Keywords: Network analysis and control, Autonomous vehicles, Decentralized control
Abstract: Merging and splitting of vehicles in a platoon is a basic maneuvering that makes the platoons more scalable and flexible. The main challenges lie in simultaneously ensuring the compositionality of the distributed controllers and the string stability of the platoon. To handle this problem, we propose a control and topology co-design method for vehicular platoons, which enables seamless merging and splitting of vehicular platoons. In particular, we first present a centralized linear matrix inequality (LMI)-based control and topology co-design optimization for vehicular platoons with formal (centralized) disturbance string stability (DSS) guarantee. Then, these centralized DSS constraints are made decentralized by developing an alternative set of sufficient conditions. Using these decentralized DSS constraints and Sylvester’s criterion-based techniques, the said centralized LMI problem is decomposed into a set of smaller decentralized LMI problems that can be solved at each vehicle in a compositional manner, enabling seamless vehicular merging/splitting. Finally, simulation examples are provided to validate the proposed co-design method through a specifically developed simulator.
|
|
10:20-10:40, Paper TuA06.2 | Add to My Program |
Detecting Structural Shifts in Multivariate Hawkes Processes with Fréchet Statistic |
|
Luo, Rui | City University of Hong Kong |
Krishnamurthy, Vikram | Cornell University |
Keywords: Network analysis and control, Computational methods
Abstract: This paper proposes a new approach for change point detection in multivariate Hawkes processes using Fréchet statistic of a network. The method splits the point process into overlapping windows, estimates kernel matrices in each window, and reconstructs the signed Laplacians by treating the kernel matrices as the adjacency matrices of the causal network. We demonstrate the effectiveness of our method through experiments on both simulated and real-world cryptocurrency datasets. Our results show that our method is capable of accurately detecting and characterizing changes in the causal structure of multivariate Hawkes processes, and may have potential applications in fields such as finance and neuroscience. The proposed method is an extension of previous work on Fréchet statistics in point process settings and represents an important contribution to the field of change point detection in multivariate point processes.
|
|
10:40-11:00, Paper TuA06.3 | Add to My Program |
Throughput-Optimal Scheduling Via Rate Learning |
|
Promponas, Panagiotis | Yale University |
Valls, Victor | IBM Research Europe – Dublin |
Nikolakakis, Konstantinos | Yale University |
Kalogerias, Dionysios | Yale University |
Tassiulas, Leandros | Yale University |
Keywords: Network analysis and control, Control of networks, Communication networks
Abstract: We study the problem of designing scheduling policies for communication networks. This problem is often addressed with max-weight-type approaches since they are throughput-optimal. However, max-weight policies make scheduling decisions based on the network congestion, which can be sometimes unnecessarily restrictive. In this paper, we present a ``schedule as you learn'' (SYL) approach, where we learn an average rate vector, and then select schedules that generate such a rate in expectation. This approach is interesting because scheduling decisions do not depend on the size of the queue backlogs, and so it provides increased flexibility to select schedules based on other criteria or rules, such as serving high-priority queues. We illustrate the results with numerical experiments for a cross-bar switch and show that, compared to max-weight, SYL can achieve lower latency to certain flows without compromising throughput optimality.
|
|
11:00-11:20, Paper TuA06.4 | Add to My Program |
Multipolar Opinion Evolution in Biased Networks |
|
Baković, Luka | Lund University |
Ohlin, David | Lund University |
Como, Giacomo | Politecnico Di Torino |
Tegling, Emma | Lund University |
Keywords: Network analysis and control, Cooperative control
Abstract: Motivated by empirical research on bias and opinion formation, we formulate a multidimensional nonlinear opinion-dynamical model where agents have individual biases, which are fixed, as well as opinions, which evolve. The dimensions represent competing options, of which each agent has a relative opinion, and are coupled through normalization of the opinion vector. This can capture, for example, an individual’s relative trust in different media. In special cases including where biases are uniform across agents our model achieves consensus, but in general, behaviors are richer and capture multipolar opinion distributions. We examine general fixed points of the system, as well as special cases such as zero biases toward certain options or partitioned decision sets. Lastly, we demonstrate that our model exhibits polarization when biases are spatially correlated across the network, while, as empirical research suggests, a mixed community can mediate biases.
|
|
11:20-11:40, Paper TuA06.5 | Add to My Program |
Adaptive Bias for Dissensus in Nonlinear Opinion Dynamics with Application to Evolutionary Division of Labor Games |
|
Paine, Tyler | Massachusetts Institute of Technology |
Bizyaeva, Anastasia | Cornell University |
Benjamin, Michael | Massachusetts Institute of Technology |
Keywords: Network analysis and control, Distributed control, Control of networks
Abstract: This paper addresses the problem of adaptively controlling the bias parameter in nonlinear opinion dynamics (NOD) to allocate agents into groups of arbitrary sizes for the purpose of maximizing collective rewards. In previous work, an algorithm based on the coupling of NOD with an multi-objective behavior optimization was successfully deployed as part of a multi-robot system in an autonomous task allocation field experiment. Motivated by the field results, in this paper we propose and analyze a new task allocation model that synthesizes NOD with an evolutionary game framework. We prove sufficient conditions under which it is possible to control the opinion state in the group to a desired allocation of agents between two tasks through an adaptive bias using decentralized feedback. We then verify the theoretical results with a simulation study of a collaborative evolutionary division of labor game.
|
|
11:40-12:00, Paper TuA06.6 | Add to My Program |
Optimal Pricing for Linear-Quadratic Games with Nonlinear Interaction between Agents |
|
Cai, Jiamin | The Chinese University of Hong Kong |
Zhang, Chenyue | The Chinese University of Hong Kong |
Wai, Hoi-To | The Chinese University of Hong Kong |
Keywords: Network analysis and control, Game theory, Optimization
Abstract: This paper studies a class of network games with linear-quadratic payoffs and externalities exerted through a strictly concave interaction function. This class of game is motivated by the diminishing marginal effects with peer influences. We analyze the optimal pricing strategy for this class of network game. First, we prove the existence of a unique Nash Equilibrium (NE). Second, we study the optimal pricing strategy of a monopolist selling a divisible good to agents. We show that the optimal pricing strategy, found by solving a bilevel optimization problem, is strictly better when the monopolist knows the network structure as opposed to the best strategy agnostic to network structure. Numerical experiments demonstrate that in most cases, the maximum revenue is achieved with an asymmetric network. These results contrast with the previously studied case of linear interaction function, where a network-independent price is proven optimal with symmetric networks. Lastly, we describe an efficient algorithm for finding the optimal pricing strategy.
|
|
TuA07 Regular Session, Amber 6 |
Add to My Program |
Sensors Fusion and Networks |
|
|
Chair: Aghdam, Amir G. | Concordia University |
Co-Chair: van Goor, Pieter | Australian National University |
|
10:00-10:20, Paper TuA07.1 | Add to My Program |
A Fusion Estimation Approach for Robust Output Feedback MPC of Multi-Sensor Systems |
|
Chen, Bo | Zhejiang University of Technology |
Xu, Chenteng | Zhejiang University of Technology |
Wang, Zheming | Zhejiang University of Technology |
Qiu, Xiang | Zhejiang University of Technology |
Du, Shuwang | Zhejiang University of Technology |
Keywords: Sensor fusion, Estimation, Predictive control for linear systems
Abstract: This paper presents a fusion-based output feedback MPC (FOFMPC) approach for multi-sensor systems with state and control constraints. Our approach consists of a fusion estimation procedure and a robust output feedback MPC scheme. For fusion estimation, we adopt a two-layer structure where local observers are designed for all the sensors and operate in parallel and a fusion center collects the states of the individual observers and produces a fusion estimate based on certain weighted fusion criterion. The weights are computed by minimizing the weighted Minkowski sum of the local robust positively invariant (RPI) sets. This fusion estimation procedure is then integrated into the framework of robust output feedback MPC (ROFMPC). We verify the effectiveness of the proposed approach using a double-zone building thermal model.
|
|
10:20-10:40, Paper TuA07.2 | Add to My Program |
A Geometric Perspective on Fusing Gaussian Distributions on Lie Groups |
|
Ge, Yixiao | Australian National University |
van Goor, Pieter | Australian National University |
Mahony, Robert | Australian National University, |
Keywords: Sensor fusion, Kalman filtering, Algebraic/geometric methods
Abstract: Stochastic inference on Lie groups plays a key role in state estimation problems, such as inertial navigation, visual inertial odometry, pose estimation in virtual reality, etc. A key problem is fusing independent concentrated Gaussian distributions defined at different reference points on the group. In this paper we approximate distributions at different points in the group in a single set of exponential coordinates and then use classical Gaussian fusion to obtain the fused posteriori in those coordinates. We consider several approximations including the exact Jacobian of the change of coordinate map, first and second order Taylor's expansions of the Jacobian, and parallel transport with and without curvature correction associated with the underlying geometry of the Lie group. Preliminary results on SO(3) demonstrate that a novel approximation using parallel transport with curvature correction achieves similar accuracy to the state-of-the-art optimisation based algorithms at a fraction of the computational cost.
|
|
10:40-11:00, Paper TuA07.3 | Add to My Program |
Legged Robot State Estimation within Non-Inertial Environments |
|
He, Zijian | Purdue University |
Teng, Sangli | University of Michigan |
Lin, Tzu-Yuan | University of Michigan |
Ghaffari, Maani | University of Michigan |
Gu, Yan | Purdue University |
Keywords: Sensor fusion, Robotics, Kalman filtering
Abstract: This paper investigates the robot state estimation problem within a non-inertial environment. The proposed state estimation approach overcomes the common assumption of static ground in the system modeling. The process and measurement models explicitly treat the movement of the non-inertial environments without requiring knowledge of its motion in the inertial frame or relying on GPS or sensing environmental landmarks. Further, the proposed state estimator is formulated as an invariant extended Kalman filter (InEKF) with the deterministic part of its process model obeying the group-affine property, leading to log-linear error dynamics. The observability analysis of the filter confirms that the robot's pose (i.e., position and orientation) and velocity relative to the non-inertial environment are observable. Hardware experiments on a humanoid robot moving on a rotating and translating treadmill demonstrate the high convergence rate and accuracy of the proposed InEKF even under significant treadmill pitch sway, as well as large estimation errors.
|
|
11:00-11:20, Paper TuA07.4 | Add to My Program |
A Distributed Reinforcement Learning Strategy to Maximize Coverage in a Hybrid Heterogeneous Sensor Network |
|
Mosalli, Hesam | Concordia University |
Aghdam, Amir G. | Concordia University |
Keywords: Sensor networks, Agents-based systems, Reinforcement learning
Abstract: This paper introduces an efficient distributed deployment strategy for a network of mobile and stationary sensors with nonidentical sensing and communication radii. A collaborative distributed multi-agent deep reinforcement learning method is proposed to find the best moving direction and step size for each sensor considering the coverage priority. The gradient of the local coverage function is used to generate a fast-converging solution as well as a learning-inspired arbitrary input to enable the network to avoid the local optima. The sensors use their partial observation of the network and field to iteratively relocate themselves to explore the field and learn the optimal policy to increase their local coverage. The efficiency of the proposed strategy in different scenarios is demonstrated by simulations.
|
|
11:20-11:40, Paper TuA07.5 | Add to My Program |
OIDM: An Observability-Based Intelligent Distributed Edge Sensing Method for Industrial Cyber-Physical Systems |
|
Wang, Shigeng | Shanghai Jiao Tong University |
Jin, Tiankai | Shanghai Jiao Tong University |
Ma, Yehan | Shanghai Jiao Tong University |
Chen, Cailian | Shanghai Jiao Tong University |
Keywords: Sensor networks, Estimation
Abstract: Industrial cyber-physical systems (ICPS) integrate physical processes with computational and communication technologies in industrial settings. With the support of edge computing technology, it is feasible to schedule large-scale sensors for efficient distributed sensing. In the sensing process, observability is the key to obtaining complete system states, and stochastic scheduling is more suitable considering uncertain factors in wireless communication. However, existing works have limited research on observability in stochastic scheduling. Targeting this issue, we propose an observability-based intelligent distributed edge sensing method (OIDM). Deep reinforcement learning (DRL) methods are adopted to optimize sensing accuracy and power efficiency. Based on the system's ability to achieve observability, we establish a bridge between observability and the number of successful sensor transmissions. Novel linear approximations of observability criteria are provided, and probabilistic bounds on observability are derived. Furthermore, these bounds guide the design of action space to achieve a probabilistic observability guarantee in stochastic scheduling. Finally, our proposed method is applied to the estimation of slab temperature in industrial hot rolling process, and simulation results validate its effectiveness.
|
|
TuA08 Regular Session, Amber 7 |
Add to My Program |
Optimal Control IV |
|
|
Chair: Motta, Monica | University of Padua, Italy |
Co-Chair: Baldi, Simone | Southeast University |
|
10:00-10:20, Paper TuA08.1 | Add to My Program |
Resilient Learning-Based Control under Denial-Of-Service Attacks |
|
Chakraborty, Sayan | New York University |
Gao, Weinan | Northeastern University |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Jiang, Zhong-Ping | New York University |
Keywords: Optimal control, Reinforcement learning, Resilient Control Systems
Abstract: In this paper, we have proposed a resilient reinforcement learning method for discrete-time linear systems with unknown parameters, under denial-of-service (DoS) attacks. The proposed method is based on policy iteration that learns the optimal controller from input-state data amidst DoS attacks. We achieve an upper bound for the DoS duration to ensure closed-loop stability. The resilience of the closed-loop system, when subjected to DoS attacks with the learned controller and an internal model, has been thoroughly examined. The effectiveness of the proposed methodology is demonstrated on an inverted pendulum on a cart.
|
|
10:20-10:40, Paper TuA08.2 | Add to My Program |
A Note on Impulsive Solutions to Nonlinear Control System |
|
Motta, Monica | University of Padua, Italy |
Fusco, Giovanni | Lousiana State University |
Keywords: Optimal control, Switched systems
Abstract: In the last decades, many authors provided different notions of impulsive process, seen as a suitably defined limit of a sequence of ordinary processes for a nonlinear control-affine system with unbounded, vector-valued controls. In particular, we refer to the impulsive processes introduced by Karamzin et al. --in which the control is given by a vector measure, a non-negative scalar measure, and a family of so-called attached controls that univocally determine the jumps of the corresponding trajectory-- and to the graph completion processes developed by Bressan and Rampazzo et al. --in which an impulsive trajectory is seen as a spatial projection of a Lipschitzian trajectory in space-time. The equivalence between these notions is the crucial assumption of most results on optimal impulsive control problems, such as existence of an optimal process and necessary/sufficient optimality conditions. In this note we exhibit a counterexample which shows that, in presence of state constraints and endpoint constraints involving the total variation of the impulsive control, this equivalence may fail. Thus, we propose to replace the set of impulsive processes with a smaller class of impulsive processes, that we call admissible, which turns out to be actually in one-to-one correspondence with the set of graph completion processes.
|
|
10:40-11:00, Paper TuA08.3 | Add to My Program |
Off-Policy Reinforcement Learning for a Robust Optimal Control Problem with Real Parametric Uncertainty |
|
Mullachery, Athira | Indian Institute of Technology Palakkad |
Chitraganti, Shaikshavali | Indian Institute of Technology - Palakkad |
Keywords: Optimal control, Uncertain systems, Reinforcement learning
Abstract: This paper addresses an off-policy Reinforcement learning algorithm for robust linear quadratic regulator (R-LQR) problem of continuous-time linear dynamical systems with parametric uncertainties based on policy iteration framework. A modified algebraic Riccati equation is presented for the R-LQR problem and is further transformed into standard linear quadratic regulator problem. The proposed model-free off policy R-LQR algorithm learns the control policy using generated data samples that obviate the requirement of system dynamics. Numerical simulation examples of spring-mass system with uncertain stiffness are provided to illustrate effectiveness of the approach.
|
|
11:00-11:20, Paper TuA08.4 | Add to My Program |
Sensitivity Analysis for Piecewise-Affine Approximations of Nonlinear Programs with Polytopic Constraints |
|
Gharavi, Leila | Delft University of Technology |
Liu, Changrui | Delft University of Technology |
De Schutter, Bart | Delft University of Technology |
Baldi, Simone | Southeast University |
Keywords: Optimization, Optimal control, Optimization algorithms
Abstract: This paper investigates the sensitivity of the solutions of NonLinear Programs (NLPs) with polytopic constraints when the nonlinear continuous objective function is approximated by a PieceWise-Affine (PWA) counterpart. General NLPs are prevalent in optimization-based control of nonlinear systems, but solving them can be computationally expensive, requiring either fast hardware or tractable suboptimal approximations. By leveraging perturbation analysis using a convex modulus, we derive guaranteed bounds on the distance between the optimal solution of the original polytopically-constrained NLP and that of its approximated formulation. Our approach aids in determining criteria for achieving desired solution bounds. Two case studies on the Eggholder function and nonlinear model predictive control of an inverted pendulum demonstrate the theoretical results.
|
|
11:20-11:40, Paper TuA08.5 | Add to My Program |
Policy Iteration for Discrete-Time Systems with Discounted Costs: Stability and Near-Optimality Guarantees |
|
de Brusse, Jonathan | University of Lorraine |
Granzotto, Mathieu | University of Melbourne |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Nesic, Dragan | University of Melbourne |
Keywords: Optimal control, Stability of nonlinear systems, Lyapunov methods
Abstract: Given a discounted cost, we study deterministic discrete-time systems whose inputs are generated by policy iteration (PI). We provide novel near-optimality and stability properties, while allowing for non stabilizing initial policies. That is, we first give novel bounds on the mismatch between the value function generated by PI and the optimal value function, which are less conservative in general than those encountered in the dynamic programming literature for the considered class of systems. Then, we show that the system in closed-loop with policies generated by PI are stabilizing under mild conditions, after a finite (and known) number of iterations.
|
|
11:40-12:00, Paper TuA08.6 | Add to My Program |
Critic As Lyapunov Function (CALF): A Model-Free, Stability-Ensuring Agent |
|
Osinenko, Pavel | Skoltech Institute of Science and Technology |
Yaremenko, Grigory | Skolkovo Institute of Science and Technology |
Zashchitin, Roman | Deggendorf Institute of Technology |
Bolychev, Anton | Skolkovo Institute of Science and Technology |
Ibrahim, Sinan | Skolkovo Institute for Science and Technology |
Dobriborsci, Dmitrii | Deggendorf Institute of Technology |
Keywords: Optimal control, Stability of nonlinear systems, Adaptive control
Abstract: This work presents and showcases a novel reinforcement learning agent called Critic As Lyapunov Function (CALF) which is model-free and ensures online environment, in other words, dynamical system stabilization. Online means that in each learning episode, the said environment is stabilized. This, as demonstrated in a case study with a mobile robot simulator, greatly improves the overall learning performance. The base actor-critic scheme of CALF is analogous to SARSA. The latter did not show any success in reaching the target in our studies. However, a modified version thereof, called SARSA-m here, did succeed in some learning scenarios. Still, CALF greatly outperformed the said approach. CALF was also demonstrated to improve a nominal stabilizer provided to it. In summary, the presented agent may be considered a viable approach to fusing classical control with reinforcement learning. Its concurrent approaches are mostly either offline or model-based, like, for instance, those that fuse model-predictive control into the agent.
|
|
TuA09 Regular Session, Amber 8 |
Add to My Program |
Optimization Algorithms I |
|
|
Chair: Davanloo Tajbakhsh, Sam | Ohio State University |
Co-Chair: Ruiz, Fredy | Politecnico Di Milano |
|
10:00-10:20, Paper TuA09.1 | Add to My Program |
A Cutting Plane-Based Distributed Algorithm for Non-Smooth Optimisation with Coupling Constraints |
|
Zhong, Tianyi | Imperial College London |
Angeli, David | Imperial College |
Keywords: Optimization algorithms, Optimization, Control of networks
Abstract: In this paper, we study a general setup for constrained convex optimisation over time-varying networks. We propose a distributed algorithm, based on the cutting plane method, to address non-smooth optimisation challenges. Cutting plane-based approaches require constraint consensus which is structurally different from established consensus schemes. We bridge this gap by linking the cutting plane-based algorithm with a dynamic average tracking scheme. The distributed cutting plane algorithm is presented and its convergence is analysed. Its performance is investigated through a numerical example.
|
|
10:20-10:40, Paper TuA09.2 | Add to My Program |
Distributed Optimization with Finite Bit Adaptive Quantization for Efficient Communication and Precision Enhancement |
|
Rikos, Apostolos I. | The Hong Kong University of Science and Technology (Gz) |
Jiang, Wei | Aalto University, Finland |
Charalambous, Themistoklis | University of Cyprus |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Optimization algorithms, Optimization, Networked control systems
Abstract: In realistic distributed optimization scenarios, individual nodes possess only partial information and communicate over bandwidth constrained channels. For this reason, the development of efficient distributed algorithms is essential. In our paper we addresses the challenge of unconstrained distributed optimization. In our scenario each node's local function exhibits strong convexity with Lipschitz continuous gradients. The exchange of information between nodes occurs through 3-bit bandwidth-limited channels (i.e., nodes exchange messages represented by a only 3-bits). Our proposed algorithm respects the network's bandwidth constraints by leveraging zoom-in and zoom-out operations to adjust quantizer parameters dynamically. We show that during our algorithm's operation nodes are able to converge to the exact optimal solution. Furthermore, we show that our algorithm achieves a linear convergence rate to the optimal solution. We conclude the paper with simulations that highlight our algorithm's unique characteristics.
|
|
10:40-11:00, Paper TuA09.3 | Add to My Program |
A Feedback Control Approach to Convex Optimization with Inequality Constraints |
|
Cerone, Vito | Politecnico Di Torino |
Fosson, Sophie | Politecnico Di Torino |
Pirrera, Simone | Politecnico Di Torino |
Regruto, Diego | Politecnico Di Torino |
Keywords: Optimization algorithms, Optimization, Stability of nonlinear systems
Abstract: We propose a novel continuous-time algorithm for inequality-constrained convex optimization inspired by proportional-integral control. Unlike the popular primal-dual gradient dynamics, our method includes a proportional term to control the primal variable through the Lagrange multipliers. This approach has both theoretical and practical advantages. On the one hand, it simplifies the proof of the exponential convergence in the case of smooth, strongly convex problems, with a more straightforward assessment of the convergence rate concerning prior literature. On the other hand, through several examples, we show that the proposed algorithm converges faster than primal-dual gradient dynamics. This paper aims to illustrate these points by thoroughly analyzing the algorithm convergence and discussing some numerical simulations.
|
|
11:00-11:20, Paper TuA09.4 | Add to My Program |
Multi-Agent Global Optimization with Decision Variable Coupling |
|
Sabug, Lorenzo Jr. | Politecnico Di Milano |
Ruiz, Fredy | Politecnico Di Milano |
Fagiano, Lorenzo | Politecnico Di Milano |
Keywords: Optimization algorithms, Optimization
Abstract: A cooperative, multi-agent global optimization problem is considered, where the global cost function is the sum of the agents' private, non-convex costs. In contrast to all previously considered setups, evaluating the private costs involves a global experiment, using a common instance of the decision vector. This is relevant when each agent can only control a part ("subvariable") of the decision vector, but its private cost is also affected by the other subvariables. A novel cooperative optimization method using Set Membership identification and consensus-based techniques is proposed, to make all agents agree on the next global decision vector to be tested. A trade-off between exploitation close to the best point found and exploration around the search set is achieved, even without explicitly sharing the private costs' information. Statistical tests show that the proposed distributed method is competitive with respect to a centralized one.
|
|
11:20-11:40, Paper TuA09.5 | Add to My Program |
Accelerating Gradient Tracking with Periodic Global Averaging |
|
Feng, Shujing | Lehigh University |
Jiang, Xin | Cornell University |
Keywords: Optimization algorithms, Optimization
Abstract: Decentralized optimization algorithms have recently attracted increasing attention due to its wide applications in all areas of science and engineering. In these algorithms, a collection of agents collaborate to minimize the average of a set of heterogeneous cost functions in a decentralized manner. State-of-the-art decentralized algorithms like Gradient Tracking (GT) and Exact Diffusion (ED) involve communication at each iteration. Yet, communication between agents is often expensive, resource intensive, and can be very slow. To this end, several strategies have been developed to balance between communication overhead and convergence rate of decentralized methods. In this paper, we introduce GT-PGA, which incorporates~GT with periodic global averaging. With the additional PGA, the influence of poor network connectivity in the GT algorithm can be compensated or controlled by a careful selection of the global averaging period. Under the stochastic, nonconvex setup, our analysis quantifies the crucial trade-off between the connectivity of network topology and the PGA period. Thus, with a suitable design of the PGA period, GT-PGA improves the convergence rate of vanilla GT. Numerical experiments are conducted to support our theory, and simulation results reveal that the proposed GT-PGA accelerates practical convergence, especially when the network is sparse.
|
|
11:40-12:00, Paper TuA09.6 | Add to My Program |
Riemannian Stochastic Gradient Method for Nested Composition Optimization |
|
Zhang, Dewei | The Ohio State University |
Davanloo Tajbakhsh, Sam | Ohio State University |
Keywords: Optimization algorithms, Optimization
Abstract: This work considers the optimization of the composition of functions in a nested form over Riemannian manifolds where each function contains an expectation. This problem type is gaining popularity in applications such as policy evaluation in reinforcement learning or model customization in meta-learning. The standard Riemannian stochastic gradient methods for non-compositional optimization cannot be directly applied as the stochastic approximation of the inner functions creates biases in the gradients of the outer functions. For two-level composition optimization, we present a Riemannian Stochastic Composition Gradient Descent (R-SCGD) method that finds an approximate stationary point, with expected squared Riemannian gradient smaller than epsilon, in O(epsilon^{-2}) calls to the stochastic gradient oracle of the outer function and stochastic function and gradient oracles of the inner function.
|
|
TuA10 Invited Session, Brown 1 |
Add to My Program |
Advances in Stochastic Control I: Optimization Methods |
|
|
Chair: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Co-Chair: Yuksel, Serdar | Queen's University |
Organizer: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Organizer: Yuksel, Serdar | Queen's University |
|
10:00-10:20, Paper TuA10.1 | Add to My Program |
Near-Optimal Performance of Stochastic Economic MPC (I) |
|
Schießl, Jonas | University of Bayreuth |
Ou, Ruchuan | Hamburg University of Technology |
Faulwasser, Timm | Hamburg University of Technology |
Baumann, Michael Heinrich | University of Bayreuth |
Gruene, Lars | University of Bayreuth |
Keywords: Stochastic optimal control, Markov processes, Predictive control for nonlinear systems
Abstract: This paper presents first results for near optimality in expectation of the closed-loop solutions for stochastic economic MPC. The approach relies on a recently developed turnpike property for stochastic optimal control problems at an optimal stationary process, combined with techniques for analyzing time-varying economic MPC schemes. We obtain near optimality in finite time as well as overtaking and average near optimality on infinite time horizons.
|
|
10:20-10:40, Paper TuA10.2 | Add to My Program |
Dynamic Watermarking for Cyber-Security of Nonlinear Stochastic Systems (I) |
|
Lin, Tzu-Hsiang | Texas A&M University |
Kumar, P. R. | Texas A&M University |
Keywords: Cyber-Physical Security, Attack Detection, Nonlinear systems
Abstract: As cyber-physical systems form the core of many critical infrastructures, ensuring their safety is essential. The sensor measurements of networked cyber-physical systems can potentially be compromised, resulting in the misbehavior of the overall system. Indeed there have been a number of such attacks. Dynamic Watermarking is a proactive method whose goal is to detect such cyber attacks. It superimposes a small secret stochastic excitation onto signals in the system, such as control inputs of actuators or sensor measurements. Based on an examination of the signals purportedly returned, for example, by the sensors, it determines if the measurements have been tampered with. Previous theory for detection guarantees provided by Dynamic Watermarking has been restricted to linear stochastic systems. This paper examines the Dynamic Watermarking method for nonlinear stochastic systems. We show that Dynamic Watermarking for detecting attacks can be extended to certain systems in backstepping form. We present the analytical proofs, as well as simulation results.
|
|
10:40-11:00, Paper TuA10.3 | Add to My Program |
Joint Chance Constrained Optimal Control Via Linear Programming |
|
Schmid, Niklas | ETH Zürich |
Fochesato, Marta | ETH Zurich |
Sutter, Tobias | University of Konstanz |
Lygeros, John | ETH Zurich |
Keywords: Stochastic optimal control, Hybrid systems, Stochastic systems
Abstract: We establish a linear programming formulation for the solution of joint chance constrained optimal control problems over finite time horizons. The joint chance constraint may represent an invariance, reachability or reach-avoid specification that the trajectory must satisfy with a predefined probability. For finite state and action spaces, the solution is exact and our method computationally superior to approaches in the literature. For continuous state or action spaces, our linear programming formulation enables basis function approximations.
|
|
11:00-11:20, Paper TuA10.4 | Add to My Program |
Sparse Network Mean Field Games: Ring Structures and Related Topologies (I) |
|
Caines, Peter E. | McGill University |
Huang, Minyi | Carleton University |
Keywords: Mean field games, Stochastic systems, Large-scale systems
Abstract: For sequences of networks embedded in the unit cube [0, 1]^m, (weak) measure limits of sequences of empirical measures of vertex densities (vertexon functions) exist, and the associated (weak) measure limits of sequences of empirical measures of edge densities (graphexon functions) in [0, 1]^{2m} exist, regardless of the sparsity or density of the limit graphs. This paper presents an extension of Graphon Mean Field Game (GMFG) theory to the vertexon-graphexon MFG set-up (denoted GXMFG). Specific second order dynamics are introduced for the inter-node influence mediated by the singular part of a network graphexon measure; this is analyzed in the particular cases of a network limit ring topology and a limit rectangular lattice topology. Existence and uniqueness results are presented for the corresponding GXMFG equations.
|
|
11:20-11:40, Paper TuA10.5 | Add to My Program |
Controlled Diffusions under Full, Partial and Decentralized Information: Optimal Policies and Discrete-Time Approximations (I) |
|
Pradhan, Somnath | Queen's University |
Yuksel, Serdar | Queen's University |
Keywords: Stochastic optimal control, Decentralized control, Stochastic systems
Abstract: We present existence and discrete-time approximation results on optimal control policies for continuous-time stochastic control problems under a variety of information structures. These include fully observed models, partially observed models and multi-agent models with decentralized information structures. While there exist comprehensive existence and approximations results for the fully observed setup in the literature, few prior research exists on discrete-time approximation results for partially observed models. For decentralized models, even existence results have not received much attention except for specialized models and approximation has been an open problem. Our existence and approximations results lead to the applicability of well-established partially observed Markov decision processes and the relatively more mature theory of discrete-time decentralized stochastic control to be applicable for computing near optimal solutions for continuous-time stochastic control.
|
|
11:40-12:00, Paper TuA10.6 | Add to My Program |
Time-Reversal of Stochastic Maximum Principle (I) |
|
Taghvaei, Amirhossein | University of Washington Seattle |
Keywords: Stochastic optimal control, Stochastic systems, Optimization
Abstract: Stochastic maximum principle (SMP) specifies a necessary condition for the solution of a stochastic optimal control problem. The condition involves a coupled system of forward and backward stochastic differential equations (FBSDE) for the state and the adjoint processes. Numerical solution of the FBSDE is challenging because the boundary condition of the adjoint process is specified at the terminal time, while the solution should be adaptable to the forward in time filtration of a Wiener process. In this paper, a "time-reversal" of the FBSDE system is proposed that involves integration with respect to a backward in time Wiener process. The time-reversal is used to propose an iterative Monte-Carlo procedure to solves the FBSDE system and its time-reversal simultaneously. The procedure involves approximating the Föllmer's drift and solving a regression problem between the state and its adjoint at each time. The procedure is illustrated for the linear quadratic (LQ) optimal control problem with a numerical example.
|
|
TuA11 Regular Session, Brown 2 |
Add to My Program |
Data Driven Control IV |
|
|
Chair: Smith, Roy S. | ETH Zurich |
Co-Chair: Alamo, Teodoro | Universidad De Sevilla |
|
10:00-10:20, Paper TuA11.1 | Add to My Program |
Data-Enabled Predictive Iterative Control |
|
Zhang, Kai | ETH Zurich |
Zuliani, Riccardo | Automatic Control Laboratory (IfA), ETH Zurich |
Balta, Efe C. | Inspire AG |
Lygeros, John | ETH Zurich |
Keywords: Data driven control, Iterative learning control, Predictive control for linear systems
Abstract: This work introduces the Data-Enabled Predictive iteRative Control (DeePRC) algorithm, a direct data-driven approach for iterative LTI systems. The DeePRC learns from previous iterations to improve its performance and achieves the optimal cost. By utilizing a tube-based variation of the DeePRC scheme, we propose a two-stage approach that enables safe active exploration using a left-kernel-based input disturbance design. This method generates informative trajectories to enrich the historical data, which extends the maximum achievable prediction horizon and leads to faster iteration convergence. In addition, we present an end-to-end formulation of the two-stage approach, integrating the disturbance design procedure into the planning phase. We showcase the effectiveness of the proposed algorithms on a numerical experiment.
|
|
10:20-10:40, Paper TuA11.2 | Add to My Program |
Data-Driven Architecture to Encode Information in the Kinematics of Robots and Artificial Avatars |
|
De Lellis, Francesco | University of Naples Federico II |
Coraggio, Marco | Scuola Superiore Meridionale |
Foster, Nathan, Charles | University Medical Center Hamburg Eppendorf |
Villa, Riccardo | University Medical Center Hamburg Eppendorf |
Cristina, Becchio | University Medical Center Hamburg Eppendorf |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Data driven control, Machine learning
Abstract: We present a data-driven control architecture for modifying the kinematics of robots and artificial avatars to encode specific information such as the presence or not of an emotion in the movements of an avatar or robot driven by a human operator. We validate our approach on an experimental dataset obtained during the reach-to-grasp phase of a pick-and-place task.
|
|
10:40-11:00, Paper TuA11.3 | Add to My Program |
Prediction for Dynamical Systems Via Transfer Learning |
|
Du, Zhe | University of Michigan |
Oymak, Samet | University of Michigan |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Data driven control, Learning, Machine learning
Abstract: In robotics and reinforcement learning, transfer learning helps to pass the knowledge obtained in the design phase to the deployment phase, allowing for faster adaptation. However, this practice is yet to be fundamentally and theoretically understood for dynamical systems. In this work, we approach the prediction problems for dynamical systems under the transfer learning setup. Specifically, we propose methods that are able to utilize data from different training systems to learn a predictive model for an unknown target system. Feasibility condition and finite sample guarantees are developed. Particularly, through experimental studies, our methods demonstrate the ability to handle hidden model structures such as parameterization redundancy and nonlinearity, which outperforms the system identification method.
|
|
11:00-11:20, Paper TuA11.4 | Add to My Program |
A Data-Driven CLF Controller Based on a Kriged Predictor |
|
Carnerero, A. Daniel | Osaka University |
Ramirez, Daniel R. | Univ. of Sevilla |
Hatanaka, Takeshi | Tokyo Institute of Technology |
Alamo, Teodoro | Universidad De Sevilla |
Keywords: Data driven control, Machine learning, Lyapunov methods
Abstract: In this paper, we present a data-driven methodology to predict and control the behaviour of nonlinear and non-autonomous systems based on kernel functions. The technique computes the forecasting by means of a linear combination of past data. The weights used to compute the prediction are obtained by solving a convex optimization problem that stems from a novel kriging formulation. A Control Lyapunov Function (CLF) based controller using the presented predictor is also built. Finally, numerical examples of both prediction and control are presented, showing the efficacy of the proposed approach.
|
|
11:20-11:40, Paper TuA11.5 | Add to My Program |
Towards eXplainable Data-Driven Control (XDDC): The Property-Preserving Framework |
|
Riva, Giorgio | Politecnico Di Milano |
Formentin, Simone | Politecnico Di Milano |
Keywords: Data driven control, Machine learning
Abstract: As Artificial Intelligence (AI) techniques continue to advance, the need for explainability becomes increasingly crucial, especially in sensitive or safety-critical domains. eXplainable AI (XAI) has emerged to address this need, aiming to enhance transparency in complex models. While XAI has gained traction in mainstream machine learning, its application in data-driven control systems remains relatively unexplored. This paper introduces a novel concept of explainability tailored for data-driven control, allowing one to design feedback loops from data incorporating prior knowledge and preserving important system properties. Through two case studies, we demonstrate the efficacy of this property-preserving framework in direct and indirect data-driven control system design. This work lays the foundation for further research at the intersection of AI and data-driven control, offering insights into enhancing transparency in complex control systems.
|
|
11:40-12:00, Paper TuA11.6 | Add to My Program |
Data-Driven Formulation of the Kalman Filter and Its Application to Predictive Control |
|
Smith, Roy S. | ETH Zurich |
Abdalmoaty, Mohamed | ETH Zurich |
Yin, Mingzhou | Leibniz University Hannover |
Keywords: Data driven control, Kalman filtering, Predictive control for linear systems
Abstract: Data-driven methods for predictive control rely on input-output data to give a Hankel matrix representation of the space of trajectories. They are poorly suited to situations where both process noise and measurement noise dominate the behaviour whereas Kalman filters optimally estimate system states in this scenario. We derive a data-driven Kalman filter formulation based on the dynamic evolution of Hankel matrix output predictions. This leads to an extended state space model that describes the evolution of both the future inputs and outputs. By applying measurement feedback one arrives at a Kalman filter for the system. The Kalman filter design is performed purely on the basis of the input and output signals and without the need for a specific state-space representation. A benchmark simulation illustrates that the resulting prediction- based control significantly out-performs predictive controllers based on current data-driven methods.
|
|
TuA12 Regular Session, Brown 3 |
Add to My Program |
Reinforcement Learning III |
|
|
Chair: McAllister, Robert D. | Delft University of Technology |
Co-Chair: Giuliani, Matteo | Politecnico Di Milano |
|
10:00-10:20, Paper TuA12.1 | Add to My Program |
SYNLOCO: Synthesizing Central Pattern Generator and Reinforcement Learning for Quadruped Locomotion |
|
Zhang, Xinyu | Sun Yat-Sen University |
Xiao, Zhiyuan | Sun Yat-Sen University |
Zhang, Qingrui | Sun Yat-Sen University |
Pan, Wei | The University of Manchester |
Keywords: Reinforcement learning, Robotics, Learning
Abstract: The Central Pattern Generator (CPG) is adept at generating rhythmic gait patterns characterized by consistent timing and adequate foot clearance. Yet, its open-loop configuration often fails to adjust the system's control performance in response to environmental variations. On the other hand, Reinforcement Learning (RL), celebrated for its model-free properties, has gained significant traction in robotics due to its inherent adaptability and robustness. However, initiating traditional RL approaches from the ground up presents a risk of converging to suboptimal local minima and slow learning convergence. In this paper, we propose a quadruped locomotion framework--called SYNLOCO--by synthesizing CPG and RL, which can ingeniously integrate the strengths of both methods, enabling the development of a locomotion controller that is both stable and natural with partial state observations (e.g., no velocity measurements). To optimize the learning trajectory of SYNLOCO, a two-phase training strategy is presented. Both ablation analysis and experimental comparison are performed using a real quadruped robot under varied conditions, including distinct velocities, terrains, and payload capacities. The experiments showcase SYNLOCO's efficiency in producing consistent and clear-footed gaits across diverse scenarios, despite no velocity measurements. The developed controller exhibits resilience against substantial parameter variations, underscoring its potential for robust real-world applications.
|
|
10:20-10:40, Paper TuA12.2 | Add to My Program |
Harmonic RL: A Frequency-Domain Approach to Reinforcement Learning with Application to Active Knee Prosthesis |
|
Cetindag, Can | Delft University of Technology |
McAllister, Robert D. | Delft University of Technology |
Mohajerin Esfahani, Peyman | TU Delft |
Keywords: Reinforcement learning, Robotics, Simulation
Abstract: We propose a frequency-domain state representation to improve the performance and reduce the computation and data requirements of reinforcement learning. This approach is tailored to tracking tasks of periodic trajectories. We apply the proposed methodology to an active knee prosthesis application. Using the high-fidelity simulator~MuJoCo, we demonstrate significant performance improvements (in terms of Bellman error) for the proposed frequency-domain state representation relative to the current state-of-the-art time-domain state representation used in these applications.
|
|
10:40-11:00, Paper TuA12.3 | Add to My Program |
Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making |
|
Chen, Xin | Texas A&M University |
Hou, I-Hong | Texas A&M University |
Keywords: Reinforcement learning, Smart grid, Optimization algorithms
Abstract: This paper introduces a novel multi-armed bandits framework, termed Contextual Restless Bandits (CRB), for complex online decision-making. This CRB framework incorporates the core features of contextual bandits and restless bandits, so that it can model both the internal state transitions of each arm and the influence of external global environmental contexts. Using the dual decomposition method, we develop a scalable index policy algorithm for solving the CRB problem, and theoretically analyze the asymptotical optimality of this algorithm. In the case when the arm models are unknown, we further propose a model-based online learning algorithm based on the index policy to learn the arm models and make decisions simultaneously. Furthermore, we apply the proposed CRB framework and the index policy algorithm specifically to the demand response decision-making problem in smart grids. The numerical simulations demonstrate the performance and efficiency of our proposed CRB approaches.
|
|
11:00-11:20, Paper TuA12.4 | Add to My Program |
Towards Fast Rates for Federated and Multi-Task Reinforcement Learning |
|
Zhu, Feng | North Carolina State University |
Heath, Robert Wendell | University of California, San Diego |
Mitra, Aritra | North Carolina State University |
Keywords: Large-scale systems, Reinforcement learning, Optimization
Abstract: Motivated by the emerging paradigm of federated reinforcement learning, we consider a setting involving N agents, where each agent interacts with an environment modeled as a Markov Decision Process (MDP). The agents' MDPs differ in their reward functions, capturing heterogeneous objectives/tasks. The collective goal of the agents is to communicate via a central server to find a policy that maximizes the average of long-term cumulative rewards across environments. The limited existing work on this topic either only provide asymptotic rates, or generate biased policies, or fail to establish any benefits of collaboration. In response, we propose texttt{Fast-FedPG} - a novel federated policy gradient algorithm with a carefully designed bias-correction mechanism. Under a gradient-domination condition, we prove that our algorithm guarantees (i) fast linear convergence with exact gradients, and (ii) sub-linear rates that enjoy a linear speedup w.r.t. the number of agents with noisy, truncated policy gradients. In each case, the convergence is to a globally optimal policy with no heterogeneity-induced bias. In the absence of gradient-domination, we establish convergence to a first-order stationary point at a rate that continues to benefit from collaboration.
|
|
11:20-11:40, Paper TuA12.5 | Add to My Program |
Integrating Inverse Reinforcement Learning and Direct Policy Search for Modeling Multipurpose Water Reservoir Systems |
|
Giuliani, Matteo | Politecnico Di Milano |
Castelletti, Andrea | Politecnico Di Milano |
Keywords: Machine learning, Human-in-the-loop control, Control applications
Abstract: System identification and optimal control have always contributed to water resources systems planning and management. Although water control problems are commonly formulated as multi-objective Markov Decision Processes, accurately modeling reservoir systems controlled by human operators remains challenging due to the absence of a formal definition of the objective function guiding their behavior. In this paper, we introduce a mixed Reinforcement Learning approach to model the dynamics of multipurpose reservoir systems. Specifically, our method first uses Inverse Reinforcement Learning to extract the tradeoff among competing objectives from historical observations of the reservoir system dynamics. The identified objective function is then used in the formulation of an optimal control problem returning a closed-loop policy which allows the simulation of the observed dynamics of the reservoir system. We demonstrate the potential of the proposed method in a real-world application involving the multipurpose regulation of Lake Como in northern Italy. Results show that our approach effectively infers the tradeoff between flood control and water supply adopted in the observed system’s operation, and yields a control policy that closely approximates the observed system dynamics.
|
|
11:40-12:00, Paper TuA12.6 | Add to My Program |
Boosting Fairness and Robustness in Over-The-Air Federated Learning |
|
Oksuz, Halil Yigit | TU Berlin |
Molinari, Fabio | TU Berlin |
Sprekeler, Henning | TU Berlin |
Raisch, Joerg | Technical University Berlin |
Keywords: Machine learning, Large-scale systems, Communication networks
Abstract: Over-the-Air Computation is a beyond-5G communication strategy that has recently been shown to be useful for the decentralized training of machine learning models due to its efficiency. In this letter, we propose an Over-the-Air federated learning algorithm that aims to provide fairness and robustness through minmax optimization. By using the epigraph form of the problem at hand, we show that the proposed algorithm converges to the optimal solution of the minmax problem. Moreover, the proposed approach does not require reconstructing channel coefficients by complex encoding-decoding schemes as opposed to state-of-the-art approaches. This improves both efficiency and privacy.
|
|
TuA13 Invited Session, Suite 1 |
Add to My Program |
Estimation and Control of Distributed Parameter Systems IV |
|
|
Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Co-Chair: Hu, Weiwei | University of Georgia |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
10:00-10:20, Paper TuA13.1 | Add to My Program |
Structure-Preserving Discretization of Multidimensional Linear Port-Hamiltonian Systems Using FEM Approaches (I) |
|
Ponce, Cristobal | Universidad Tecnica Federico Santa Maria |
Wu, Yongxin | FEMTO-ST/ENSMM |
Le Gorrec, Yann | Cnrs, Ensmm, Femto-St / As2m |
Ramirez, Hector | Universidad Tecnica Federico Santa Maria |
Keywords: Distributed parameter systems, Large-scale systems, Flexible structures
Abstract: This study introduces a novel control oriented structure-preserving scheme for discretizing a class of multidimensional linear port-Hamiltonian systems, preserving their inherent structure while enabling the imposition of diverse combinations of boundary inputs, such as generalized velocities, displacements, and tractions. The proposed approach is grounded on the modified Linked Lagrange Multiplier method and the mixed Finite Element Method (FEM), where Dirichlet and Neumann boundary conditions are weakly enforced. Connections with other standard and mixed FEM approaches are also discussed. The proposed scheme is validated through comparisons with commercial software and simulations using a 2D elasticity model as a demonstrative example.
|
|
10:20-10:40, Paper TuA13.2 | Add to My Program |
Model Reference Adaptive Controller Design for a Multi-State Reparable System (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Hu, Weiwei | University of Georgia |
Keywords: Distributed parameter systems, Adaptive control
Abstract: This work considers the adaptive repair rate design of a multi-state reparable system modeled by coupled transport and integro-differential equations. A reparable system is one which can be restored to satisfactory operation by repair actions whenever a failure occurs. The model describes the probabilities of the system in good and failure modes. The objective is to design the adaptive repair functions so that the probability of the system in good mode can be steered to a target distribution. Rigorous analysis on the convergence of tracking error between the plant and the target states is addressed.
|
|
10:40-11:00, Paper TuA13.3 | Add to My Program |
Computing Solutions to the Polynomial-Polynomial Regulator Problem (I) |
|
Corbin, Nicholas | University of California San Diego |
Kramer, Boris | University of California San Diego |
Keywords: Computational methods, Large-scale systems, Nonlinear systems
Abstract: We consider the optimal regulation problem for nonlinear control-affine dynamical systems. Whereas the linear-quadratic regulator (LQR) considers optimal control of a linear system with quadratic cost function, we study polynomial systems with polynomial cost functions; we call this problem the polynomial-polynomial regulator (PPR). The resulting polynomial feedback laws provide two potential improvements over linear feedback laws: 1) they more accurately approximate the optimal control law, resulting in lower control costs, and 2) for some problems they can provide a larger region of stabilization. We derive explicit formulas---and a scalable, general purpose software implementation---for computing the polynomial approximation to the value function that solves the optimal control problem. The method is illustrated first on a low-dimensional aircraft stall stabilization example, for which PPR control recovers the aircraft from more severe stall conditions than LQR control. Then we demonstrate the scalability of the approach on a semidiscretization of dimension n=129 of a partial differential equation, for which the PPR control reduces the control cost by approximately 75% compared to LQR for the initial condition of interest.
|
|
11:00-11:20, Paper TuA13.4 | Add to My Program |
Manifold Clustering Based Nonlinear Model Reduction with Application to Nonlinear Convection (I) |
|
Wu, Tumin | University of Tennessee |
Wilson, Dan | University of Tennessee |
Djouadi, Seddik, M. | University of Tennessee |
Keywords: Reduced order modeling, Nonlinear systems, Nonlinear systems identification
Abstract: This paper proposes a new cluster method combined with Dynamic Mode Decomposition with Control (DMDc), and the Proper Orthogonal Decomposition (POD) to construct more accurate reduced order models. DMDc and POD are popular data-driven techniques that extract low-order models from high-dimensional complex dynamic systems. However, these methods are inherently linear, i.e., the data is assumed to belong to linear manifolds. However, this may lead to inaccuracies in the reduced models commensurate with the presence of nonlinearities. To capture the nonlinear behavior, manifold clustering is introduced to group the snapshots obtained by experiments or numerical simulation into several sub-regions based on the underlying non-linear structure. Manifold clustering is a powerful approach for exploratory data analysis, allowing the discovery of patterns and structures that are not apparent in raw high-dimensional data. It does not require knowing the number of clusters and the intrinsic manifold dimensions in advance. Manifold clustering is combined with DMDc and POD to construct the local reduced-order models. Time clustering is applied to the snapshots generated by a nonlinear convective flow governed by the 2D Burgers' equations with boundary actuation. The manifold cluster reduced order model outperforms standard and other cluster-based (K-means) reduced order models.
|
|
11:20-11:40, Paper TuA13.5 | Add to My Program |
Sensor Selection with Correlated Observations Via Convex Relaxation (I) |
|
Ucinski, Dariusz | University of Zielona Gora |
Patan, Maciej | University of Zielona Gora |
Keywords: Distributed parameter systems, Optimization algorithms, Identification
Abstract: An extremely efficient technique is developed for maximizing the parameter estimation accuracy for systems modelled by partial differential equations in the presence of correlated measurement noise. The trace of the covariance matrix of the weighted least-squares estimator is employed as the measure of the resulting estimation accuracy. This design criterion is to be minimized by choosing a set of spatiotemporal measurement locations from among a given finite set of candidate locations. To make this inherently combinatorial problem tractable, a novel relaxed convex formulation is proposed. The pivotal role here is played by the decomposition of the noise into uncorrelated and correlated components. Optimal solutions are found via simplicial decomposition which alternates between updating the design weights using a very simple multiplicative algorithm and computing a closed-form solution to a linear programming problem. The sequence of the produced iterates monotonically decreases the value of the original A-optimality design criterion. Since the resulting relaxed solution is a measure on the set of candidate measurement locations and not its specific subset, a sequential conversion to a nearly optimal subset of selected sensors is discussed. A nontrivial simulation experiment is reported to validate the proposed approach.
|
|
11:40-12:00, Paper TuA13.6 | Add to My Program |
Linear Port-Hamiltonian Boundary Control Models and Their Equivalence |
|
van der Schaft, Arjan | Univ. of Groningen |
Maschke, Bernhard | University Claude Bernard of Lyon |
Keywords: Distributed parameter systems, Modeling, Flexible structures
Abstract: Systems of partial differential equations often admit different Hamiltonian representations, leading to different boundary variables that are either power or energy conjugate. It is shown that any infinite-dimensional Hamiltonian system can be transformed into one with constant symplectic matrix. Alternatively, any passive Hamiltonian system can be converted into one with constant energy storage matrix. The consideration of energy boundary variables points towards a new approach to control by interconnection. All this is illustrated on the example of the elastic rod.
|
|
TuA14 Regular Session, Suite 2 |
Add to My Program |
Filtering and Estimation |
|
|
Chair: Charalambous, Charalambos D. | University of Cyprus |
Co-Chair: Smith, Malcolm C. | University of Cambridge |
|
10:00-10:20, Paper TuA14.1 | Add to My Program |
Simultaneous Input and State Estimation for Systems with Arbitrary Inherent Delay |
|
Gakis, Grigorios | University of Cambridge |
Smith, Malcolm C. | University of Cambridge |
Keywords: Filtering, Estimation, Linear systems
Abstract: The purpose of this paper is to present a filtering algorithm for simultaneous input and state estimation for linear discrete-time systems of any inherent delay. It generalises previous work which was restricted to systems with an inherent delay of less than or equal to one. The algorithm includes the Kalman filter which is itself a special case of a system with an inherent delay of zero. The approach uses a standard characterisation of a system's inherent delay in terms of its rank indices. Two judiciously chosen rank decompositions of a Toeplitz matrix of the system's Markov parameters are used to develop the algorithm. The filter is illustrated on a simple two-mass system with an inherent delay of two.
|
|
10:20-10:40, Paper TuA14.2 | Add to My Program |
A Novel Logarithmic Transformed Deep Galerkin Approach to Optimal Filtering Problem |
|
Shi, Ji | Capitial Normal University |
Jiao, Xiaopei | Yanqi Lake Beijing Institute of Mathematical Science and Applica |
Yau, Stephen S.-T. | Tsinghua University |
Keywords: Filtering, Estimation, Stochastic systems
Abstract: The optimal filtering problem for general nonlinear state-observation systems has garnered significant attention in control theory. At its core, optimal filtering involves determining the probability density function of the system state conditioned on historical observations. The Yau-Yau method cite{yau2008real}, a pioneering framework, offers a viable approach with comprehensive theoretical guarantees and practical numerical implementation. Specifically, the Yau-Yau framework comprises two key components: offline solution of the forward Kolmogorov equation (FKE) and online data assimilation updates. The primary challenge lies in efficiently and accurately solving the FKE, as it directly impacts the real-time filtering process. To address this fundamental obstacle, we propose a highly efficient filtering algorithm that combines a FKE solver based on deep neural networks and a PDF approximator using generalized Legendre polynomials. By integrating advanced deep learning techniques with Galerkin approximation, we introduce the logarithmic transformed deep Galerkin approach (LTDG). The numerical simulations showcase the effectiveness and accuracy of our newly proposed algorithm. LTDG demonstrates superior performance compared to other methods, such as the extended Kalman filter and particle filter, and it successfully maintains the high accuracy of the Galerkin spectral method while having fewer online computational burdens.
|
|
10:40-11:00, Paper TuA14.3 | Add to My Program |
Data-Driven Approximation of Stationary Nonlinear Filters with Optimal Transport Maps |
|
Al-Jarrah, Mohammad | University of Washington Seattle |
Hosseini, Bamdad | University of Washington Seattle |
Taghvaei, Amirhossein | University of Washington Seattle |
Keywords: Filtering, Machine learning, Variational methods
Abstract: The nonlinear filtering problem is concerned with finding the conditional probability distribution (posterior) of the state of a stochastic dynamical system, given a history of partial and noisy observations. This paper presents a data-driven nonlinear filtering algorithm for the case when the state and observation processes are stationary. The posterior is approximated as the push-forward of an optimal transport (OT) map from a given distribution, that is easy to sample from, to the posterior conditioned on a truncated observation window. The OT map is obtained as the solution to a stochastic optimization problem that is solved offline using recorded trajectory data from the state and observations. An error analysis of the algorithm is presented under the stationarity and filter stability assumptions, which decomposes the error into two parts related to the truncation window during training and the error due to the optimization procedure. The performance of the proposed method, referred to as optimal transport data-driven filter (OT-DDF), is evaluated for several numerical examples, highlighting its significant computational efficiency during the online stage while maintaining the flexibility and accuracy of OT methods in nonlinear filtering.
|
|
11:00-11:20, Paper TuA14.4 | Add to My Program |
Indirect Rate Distortion Functions with Side Information: Structural Properties and Multivariate Gaussian Sources |
|
Stylianou, Evagoras | Technical University of Munich |
Gkagkos, Michail | Texas A&M University |
Charalambous, Charalambos D. | University of Cyprus |
Keywords: Information theory and control, Filtering, Optimization
Abstract: In this paper, we analyze the indirect source coding problem with side information at both the encoder and decoder, as well as only at the decoder. We first derive structural properties of the two rate distortion functions (RDFs) for general abstract spaces and identify conditions under which the RDFs coincide. For multivariate jointly Gaussian random variables with square-error fidelity, we establish structural properties of the optimal test channels, show that side information at both the encoder and decoder does not reduce compression, and provide water-filling solutions using parallel Gaussian channel realizations. This paper uses a novel realization theory approach to establish achievability of the converse coding theorem lower bounds of the two RDFs.
|
|
11:20-11:40, Paper TuA14.5 | Add to My Program |
Decentralized Input and State Estimation for Multi-Agent System with Dynamic Topology and Heterogeneous Sensor Network |
|
Wu, Zida | University of California, Los Angeles |
Mehta, Ankur | University of California Los Angeles |
Keywords: Estimation, Sensor networks, Kalman filtering
Abstract: A crucial challenge in decentralized systems is state estimation in the presence of unknown inputs, particularly within heterogeneous sensor networks with dynamic topologies. While numerous consensus algorithms have been introduced, they often require extensive information exchange or multiple communication iterations to ensure estimation accuracy. This paper proposes an efficient algorithm that achieves an unbiased and optimal solution comparable to filters with full information about other agents. This is accomplished through the use of information filter decomposition and the fusion of inputs via covariance intersection. Our method requires only a single communication iteration for exchanging individual estimates between agents, instead of multiple rounds of information exchange, thus preserving agents' privacy by avoiding the sharing of explicit observations and system equations. Furthermore, to address the challenges posed by dynamic communication topologies, we propose two practical strategies to handle issues arising from intermittent observations and incomplete state estimation, thereby enhancing the robustness and accuracy of the estimation process. Experiments and ablation studies conducted in both stationary and dynamic environments demonstrate the superiority of our algorithm over other baselines. Notably, it performs as well as, or even better than, algorithms that have a global view of all neighbors.
|
|
11:40-12:00, Paper TuA14.6 | Add to My Program |
Synchronization of Atomic Clock Ensemble for Time-Scale Generation: Steering Based on Observable Subspace Selection |
|
Dey, Priyanka | Tokyo Institute of Technology, Japan |
Kawaguchi, Takahiro | Gunma University |
Yano, Yuichiro | National Institute of Information and Communications Technology |
Hanado, Yuko | National Institute of Information and Communications Technology |
Kurata, Yosuke | Seiko Solutions Inc |
Ishizaki, Takayuki | Tokyo Institute of Technology |
Keywords: Stochastic systems, Linear systems, Filtering
Abstract: In this paper, we develop a novel time synchronization algorithm for an ensemble of atomic clocks containing a mixture of second order and third order atomic clocks. To design the feedback control, we consider the Kalman filter to estimate the states of the observable subsystem derived from the observable canonical decomposition of the atomic clock ensemble. We prove that, under certain conditions, the time deviation of each clock from the ideal clock behaviour follows the dynamics of the first state of the unobservable subsystem, which is regarded as the synchronization destination of the clocks. In the algorithm, we can adjust the synchronization destination by choosing the basis of the observable subspace in a special manner for the observable canonical decomposition. An example is included to illustrate the efficacy of the algorithm. It has been revealed numerically that altering the control interval appropriately can improve the frequency stability of the clocks.
|
|
TuA15 Regular Session, Suite 3 |
Add to My Program |
Electrical Systems |
|
|
Chair: Schiffer, Johannes | Brandenburg University of Technology |
Co-Chair: Jiang, Ruiwei | University of Michigan |
|
10:00-10:20, Paper TuA15.1 | Add to My Program |
Reliability and Lifetime Optimal Control for Electric Vehicle Power Converters |
|
Rezaeizadeh, Amin | FHNW |
Mastellone, Silvia | University of Applied Science Northwestern Switzerland FHNW |
Keywords: Optimal control, Predictive control for linear systems, Power electronics
Abstract: Currently the large scale adoption of Battery Electric Vehicles (BEVs) is limited due to cost, reliability and lifetime considerations. The power converters, and more specifically their semiconductor switching devices, are the second most likely component to fail in a BEV because of the damage caused by the current-induced temperature cycling. In this paper we propose a novel hybrid frequency and time domain control approach that integrates time-domain performance requirements and frequency-domain reliability requirements, based on a frequency model of the damage. The method is applied to control the motor of an BEVs and minimize the damage experienced by the power converter.
|
|
10:20-10:40, Paper TuA15.2 | Add to My Program |
A Novel Hamiltonian Approach for Modeling and Control of Quasi-Resonant Buck Converters |
|
Sanchez-Contreras, Agustin | Universidad Nacional Autonoma De Mexico |
Ortega-Velázquez, Isaac | Universidad Nacional Autonoma De Mexico |
Rodriguez-Benitez, Oscar | Universidad Nacional Autonoma De Mexico |
Espinosa-Perez, Gerardo | Universidad Nacional Autonoma De Mexico |
Keywords: Power electronics, Modeling, Nonlinear systems
Abstract: In this paper are presented a novel modeling approach and a passivity-based control scheme to solve the output voltage regulation control problem of a class of Quasi-resonant Converters. Instead of consider classical order reduction arguments, the proposed full order model recovers the Port-Controlled Hamiltonian structure naturally exhibited by the converters. This feature leads to the possibility to propose the implementation of a passive PI control scheme which has been widely recognized to achieve high performances while proving in a formal way its stability properties. In addition, the controller structure is complemented by the inclusion of a static map to use both the frequency and the duty-cycle of the square input signal as control input, guaranteeing a Zero Current Switching operation mode which drastically improve the efficiency of the circuit. The usefulness of the proposed model and control are validated in a numerical setting.
|
|
10:40-11:00, Paper TuA15.3 | Add to My Program |
Optimal Space Vector Modulation for Enhancing Reliability of the DC-Link Capacitor in a 2-Level Converter |
|
Bartels, Lars | ETH Zürich |
Rezaeizadeh, Amin | FHNW |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Mastellone, Silvia | University of Applied Science Northwestern Switzerland FHNW |
Keywords: Optimal control, Power electronics
Abstract: In power conversion applications, the DC-link capacitor is a critical component prone to degradation and its reliability affects the whole converter lifespan. Two-levels con- verters are commonly operated using pulse patterns designed according to conventional modulation schemes to generate a desired AC voltage. The effect of the switching on the DC-link capacitor current is not considered in the planned pulse pattern. This work proposes an alternative modulation strategy where the patterns are optimized to minimize the RMS current and thus reduce the degradation effect of current ripples on the capacitor. A comparative performance analysis of the reliability-based modulation scheme with respect to the standard practice is carried out based on simulation. The analysis results demonstrate that the proposed method generates the desired voltage output with low levels of harmonic distortion, while significantly decreasing RMS current values and thus enhancing capacitor lifetime.
|
|
11:00-11:20, Paper TuA15.4 | Add to My Program |
Sequential Charging Station Location Optimization under Uncertain Charging Behavior and User Growth |
|
Shen, Wenjia | Nanjing University |
Zhou, Bo | University of Michigan |
Jiang, Ruiwei | University of Michigan |
Shen, Siqian | University of Michigan |
Keywords: Optimization, Power systems, Computational methods
Abstract: Charging station availability is crucial for a thriving electric vehicle market. Due to budget constraints, locating these stations usually proceeds in phases, which calls for careful consideration of the (random) charging demand growth throughout the planning horizon. This paper integrates user choice behavior into two-stage and multi-stage stochastic programming models for intracity charging station planning under demand uncertainty. We derive a second-order conic representation for the nonlinear, nonconvex formulation by taking advantage of the binary nature of location variables and propose subgradient inequalities to accelerate computation. Numerical results demonstrate the value of employing multi-stage models, particularly in scenarios of high demand fluctuations, increased demand dispersion, and high user sensitivity to the distance-to-recharge.
|
|
11:20-11:40, Paper TuA15.5 | Add to My Program |
Adaptive Rectification Current Observer for Slim DC-Link AC Drives with Unknown Input Voltage |
|
Ghousein, Mohammad | Triskell Consulting |
Devos, Thomas | Schneider Electric |
Henwood, Nicolas | Schneider Electric |
Jebai, Al Kassem | Schneider Electric |
Keywords: Electrical machine control, Adaptive systems, Power electronics
Abstract: The paper focuses on estimating the rectifier current in slim DC-Link AC drives, known for their reduced DC-Link capacitance. Particularly, at full load, the capacitor's filtering effect on the input voltage diminishes. This allows the estimation of the input voltage from DC-Link voltage measurements. We adopt a simplified model, treating the drive as a DC-Link that supplies a constant power load. Notably, the model accounts for the equivalent series resistance (ESR) of the capacitor, adding intricacy to the design. We propose an adaptive observer to estimate both the rectification current and the input voltage. We design a Luenberger state observer with output injection terms. The study on the observer stability is based on decoupling the state estimation errors from the input estimation errors through swapping design. Theoretical developments are validated through numerical simulations.
|
|
11:40-12:00, Paper TuA15.6 | Add to My Program |
Sufficient Conditions for Global Boundedness of Solutions for Two Coupled Synchronverters |
|
Mercado Uribe, José Angel | Brandenburgische Technische Universität Cottbus-Senftenberg |
Mendoza-Avila, Jesus | INRIA Lille-Nord Europe |
Efimov, Denis | Inria |
Schiffer, Johannes | Brandenburg University of Technology |
Keywords: Lyapunov methods, Nonlinear systems, Power systems
Abstract: This paper analyzes two synchronverters connected in parallel to a common capacitive-resistive load through resistive-inductive power lines. This system is conceptualized as a microgrid with two renewable energy sources controlled using the synchronverter algorithm. It is modeled as an interconnection of three port-Hamiltonian systems, and the dq-coordinates model is derived by averaging the frequencies. Applying the recent Leonov function theory, sufficient conditions to guarantee the global boundedness of the whole system's trajectories are provided. This is necessary to reach the global synchronization of microgrids. Additionally, a numerical example illustrates the potential resonance behavior of the microgrid.
|
|
TuA16 Regular Session, Suite 4 |
Add to My Program |
Adaptive Control I |
|
|
Chair: Landau, Ioan Dore | CNRS GIPSA-LAB |
Co-Chair: Oliveira, Tiago Roux | State University of Rio De Janeiro |
|
10:00-10:20, Paper TuA16.1 | Add to My Program |
Can Dynamic Adaptation Gain Speed up Recursive Least Squares Algorithm? |
|
Landau, Ioan Dore | CNRS GIPSA-LAB |
Airimitoaie, Tudor-Bogdan | University of Bordeaux |
Vau, Bernard | IXBLUE |
Buche, Gabriel | GIPSA-Lab |
Keywords: Adaptive control, Mechatronics, Identification
Abstract: Dynamic adaptation gain/learning rate have been introduced in the context of adaptation/learning algorithms using scalar adaptation gains/learning rates to accelerate the adaptation transients. This paper shows by means of theoretical analysis, simulations and, experimental results (on an active noise control system) that inserting a dynamic adaptation gain into the recursive least squares algorithm speeds up the adaptation transients in a deterministic environment and the asymptotic convergence in the stochastic case.
|
|
10:20-10:40, Paper TuA16.2 | Add to My Program |
Adaptive Bipartite Time-Varying Output Formation Tracking of Heterogeneous Multi-Agent Systems |
|
Zhao, Xueqing | Shandong University of Science and Technology |
Zhang, Liping | Shandong University of Science and Technology |
Keywords: Adaptive control, Networked control systems, Observers for Linear systems
Abstract: This paper investigates the bipartite time-varying output formation (BTVOF) tracking problem of heterogeneous discrete-time multi-agent systems (MASs). An adaptive distributed observer is constructed for each follower to estimate the state and system dynamics of the leader. Then, two distributed formation controllers with state feedback control and output feedback control are presented based on the proposed observer, respectively. Furthermore, based on Lyapunov stability theory, the sufficient solvability condition for achieving expected formation tracking is obtained. Finally, we perform a numerical simulation involving six quadrotors, and the effectiveness of the proposed BTVOF tracking protocol is verified by comparing the simulation results.
|
|
10:40-11:00, Paper TuA16.3 | Add to My Program |
A Novel Parameter-Dependent Input Normalization-Based Direct MRAC with Unknown Control Direction |
|
Gong, Yizhou | ShanghaiTech University |
Pin, Gilberto | Electrolux |
Zhang, Yanjun | Beijing Institute of Technology |
Wang, Yang | Shanghai Technology Unversity |
Keywords: Adaptive control, Direct adaptive control, Uncertain systems
Abstract: In this paper, we endow the model adaptive reference control (MRAC) with a novel parameter-dependent input normalization (PIN) to completely eliminate the conventional assumption of the high-frequency gain. Specifically, neither the sign nor the prior knowledge of the upper or lower bounds is required. To this end, we resort to an error augmentation together with a smart design of an adaptive law with a dead zone operation. Global stability in the mean square sense is established with the conventional proof concepts of the augmented error approach. In this way, no persistent excitation requirement is required. Although the system in question is assumed to be unity-relative-degree, the proposed technique can be easily extended to systems of arbitrary relative degrees. Finally, compared with the Nussbaum function-based methods in a numerical experiment, we show that transient behavior in our method is significantly improved.
|
|
11:00-11:20, Paper TuA16.4 | Add to My Program |
Second-Order Newton-Based Extremum Seeking for Multivariable Static Maps |
|
Ghaffari, Azad | Christopher Newport University |
Oliveira, Tiago Roux | State University of Rio De Janeiro |
Keywords: Adaptive control, Estimation, Optimization
Abstract: A second-order Newton-based extremum seeking (SONES) algorithm is presented to estimate directional inflection points for multivariable static maps. The design extends the first-order Newton-based extremum seeking algorithm that drives the system toward its peak point. This work provides perturbation matrices to estimate the second- and third-order derivatives necessary for implementing the SONES. A set of conditions are provided for the probing frequencies that ensure accurate estimation of the derivatives. A differential Riccati filter calculates the inverse of the third-order derivative. The local stability of the new algorithm is proven for general multivariable static maps using averaging analysis. The proposed algorithm ensures uniform convergence toward directional inflection points without requiring information about the curvature of the map and its gradient. Simulation results show the effectiveness of the proposed algorithm.
|
|
11:20-11:40, Paper TuA16.5 | Add to My Program |
Continuous-Time Adaptive Control with Dynamic Adaptation Gain |
|
Chen, Kaiwen | Imperial College London |
Zhang, Kangkang | Imperial College London |
Landau, Ioan Dore | CNRS GIPSA-LAB |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Adaptive control, Adaptive systems, Lyapunov methods
Abstract: This paper investigates a class of continuous-time parameter identifiers with the so-called "dynamic adaptation gain (DAG)", inspired by its discrete-time counterpart [1]. A modular control law is first presented: this helps re-parametrize the system and highlight the required properties for the identifier. Then, a dynamic adaptation gain is constructed using a strictly passive system, strengthened by a feedthrough path: the resulting DAG is input strictly passive. Analysis shows that the identifier with the proposed DAG satisfies specific signal properties. An integrated analysis of both the control law and the identifier establishes boundedness of all closed-loop signals and state convergence. The convergence of the parameter estimate is also established under a persistence of excitation condition. Finally, an example inspired by a path-following problem demonstrates the convergence improvement achieved by the proposed DAG.
|
|
11:40-12:00, Paper TuA16.6 | Add to My Program |
Immersion and Invariance Design for Adaptive Longitudinal Platooning |
|
Liu, Di | Imperial College London |
Baldi, Simone | Southeast University |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Adaptive control, Autonomous vehicles, Adaptive systems
Abstract: This work shows how widely adopted longitudinal platooning protocols can be made adaptive via an immersion and invariance (I&I) approach. Such an I&I approach advances the state of the art on adaptive longitudinal platooning, which mostly relies on model reference adaptive control, while also extending the standard I&I design by compensating for exogenous effects from the preceding vehicle in the adaptive law. Two I&I designs are presented and compared with the corresponding state-of-the-art model reference adaptive control designs to demonstrate their effectiveness.
|
|
TuA17 Regular Session, Suite 6 |
Add to My Program |
Aerospace |
|
|
Chair: Pajic, Miroslav | Duke University |
Co-Chair: Weiss, Avishai | Mitsubishi Electric Research Labs |
|
10:00-10:20, Paper TuA17.1 | Add to My Program |
Rigid-Body Attitude Control on SO(3) Using Nonlinear Dynamic Inversion |
|
Khan, Hafiz Zeeshan Iqbal | Institute of Space Technology, Islamabad |
Aslam, Farooq | Institute of Space Technology |
Haydar, Muhammad Farooq | Animal Dynamics Ltd |
Riaz, Jamshed | Department of Aeronautics and Astronautics, Institute of Space T |
Keywords: Aerospace, Feedback linearization, LMIs
Abstract: This paper presents a cascaded control architecture, based on nonlinear dynamic inversion (NDI), for rigid body attitude control. The proposed controller works directly with the rotation matrix parameterization, that is, with elements of the Special Orthogonal Group SO{3}, and avoids problems related to singularities and non-uniqueness which affect other commonly used attitude representations such as Euler angles, unit quaternions, modified Rodrigues parameters, etc. The proposed NDI-based controller is capable of imposing desired linear dynamics of any order for the outer attitude loop and the inner rate loop, and gives control designers the flexibility to choose higher-order dynamic compensators in both loops. In addition, sufficient conditions are presented in the form of linear matrix inequalities (LMIs) which ensure that the outer loop controller renders the attitude loop almost globally asymptotically stable (AGAS) and the rate loop globally asymptotically stable (GAS). Furthermore, the overall cascaded control architecture is shown to be AGAS in the case of attitude error regulation. Lastly, the proposed scheme is compared with an Euler angles-based NDI scheme from literature for a tracking problem involving agile maneuvering of a multicopter in a high-fidelity nonlinear simulation.
|
|
10:20-10:40, Paper TuA17.2 | Add to My Program |
Nonlinear Control Allocation: A Learning Based Approach |
|
Khan, Hafiz Zeeshan Iqbal | Institute of Space Technology, Islamabad |
Mobeen, Surrayya | Toronto Metropolitan University |
Rajput, Jahanzeb | Centers of Excellence in Science and Applied Technologies, Natio |
Riaz, Jamshed | Department of Aeronautics and Astronautics, Institute of Space T |
Keywords: Aerospace, Flight control, Neural networks
Abstract: Modern aircraft are designed with redundant control effectors to cater for fault tolerance and maneuverability requirements. This leads to aircraft being over-actuated and requires control allocation schemes to distribute the control commands among control effectors. Traditionally, optimization-based control allocation schemes are used; however, for nonlinear allocation problems, these methods require large computational resources. In this work, an artificial neural network (ANN) based nonlinear control allocation scheme is proposed. The proposed scheme is composed of learning the inverse of the control effectiveness map through ANN, and then implementing it as an allocator instead of solving an online optimization problem. Stability conditions are presented for closed-loop systems incorporating the allocator, and computational challenges are explored with piece-wise linear effectiveness functions and ANN-based allocators. To demonstrate the efficacy of the proposed scheme, it is compared with a standard quadratic programming-based method for control allocation.
|
|
10:40-11:00, Paper TuA17.3 | Add to My Program |
Dynamics-Constrained Graph Learning Approach for Hypersonic Vehicle Path Planning Problem |
|
Zhang, Yuan | Beihang University |
Zhang, Ran | Beihang University |
Li, Huifeng | Beihang University |
Keywords: Aerospace, Learning, Flight control
Abstract: This paper proposes a new dynamics-constrained path planning method for hypersonic vehicles. Due to vehicles’ limited maneuverability, path and dynamics are tightly coupled. To ensure executability, it is required to consider dynamic constraints and trajectory indexes in path planning. This leads to a more intricate problem formulation, and the path search space is high-dimensional and sparse. Existing methods mainly target low-velocity vehicles and rely on simple motion models, making them cannot be directly applied here. We address the problem with a graph learning method. This method begins by modeling the graph-search Markov decision process (MDP) on a Graph Attention Network (GAT) to find a path in the topological graph, which guides the subsequent trajectory generation. On top of the path, it uses a three-dimensional waypoint-crossing navigation (3D WCN) law to generate a trajectory under full dynamics and uncertainties. The GAT is trained using reinforcement learning, where the devised cost function includes both trajectory and path indexes. The trajectory index punishes trajectories that fail safety checks to ensure both the performance and executability of the path. The path index, aimed at eliminating the erratic impact of implicit trajectory representations and uncertainties, is calculated from the mean square error between the path and an optimal reference, thereby improving learning efficiency. Simulation results show superior optimality, adaptability, and milli-second-level computation speed.
|
|
11:00-11:20, Paper TuA17.4 | Add to My Program |
Bayesian Measurement Masks for GNSS Positioning |
|
Greiff, Marcus Carl | Mitsubishi Electric Research Laboratries |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Berntorp, Karl | Mitsubishi Electric Research Labs |
Keywords: Aerospace, Sensor fusion, Kalman filtering
Abstract: We propose a Bayesian measurement masking method for global navigation satellite system (GNSS) positioning to mitigate disturbances from multi-path biases and modeling errors. The method removes erroneous GNSS observations to improve performance in downstream positioning algorithms. The measurement masking is posed as a binary classification problem, and solved by sequentially determining the noise statistics of individual pseudo-range measurements in the GNSS observations. Bayesian probabilities of mismatching noise models inform when outlier events such as multipath or non-line-of-sight (NLOS) events occur. We report a classification F1-score of >0.99 when the modeling assumptions are satisfied, and >0.97 when realistic modeling errors are included, both for dynamic and static receiver motion models.
|
|
11:20-11:40, Paper TuA17.5 | Add to My Program |
Almost Global Asymptotic Trajectory Tracking for Fully-Actuated Mechanical Systems on Homogeneous Riemannian Manifolds |
|
Welde, Jake | University of Pennsylvania |
Kumar, Vijay | University of Pennsylvania |
Keywords: Algebraic/geometric methods, Robotics, Aerospace
Abstract: In this work, we address the design of tracking controllers that drive a mechanical system’s state asymptotically towards a reference trajectory. Motivated by aerospace and robotics applications, we consider fully-actuated systems evolving on the broad class of homogeneous spaces (encompassing all vector spaces, Lie groups, and spheres of any dimension). In this setting, the transitive action of a Lie group on the configuration manifold enables an intrinsic description of the tracking error as an element of the state space, even in the absence of a group structure on the configuration manifold itself (e.g., for S2). Such an error state facilitates the design of a generalized control policy depending smoothly on state and time that drives this geometric tracking error to a designated origin from almost every initial condition, thereby guaranteeing almost global convergence to the reference trajectory. Moreover, the proposed controller simplifies naturally when specialized to a Lie group or the n-sphere. In summary, we propose a unified, intrinsic controller guaranteeing almost global asymptotic trajectory tracking for fully-actuated mechanical systems evolving on a broader class of manifolds. We apply the method to an axisymmetric satellite and an omnidirectional aerial robot.
|
|
11:40-12:00, Paper TuA17.6 | Add to My Program |
Black-Box Stealthy GPS Attacks on Unmanned Aerial Vehicles |
|
Khazraei, Amir | Duke University |
Meng, Haocheng | Duke University |
Pajic, Miroslav | Duke University |
Keywords: Attack Detection, Aerospace, Flight control
Abstract: This work focuses on analyzing the vulnerability of unmanned aerial vehicles (UAVs) to stealthy black-box false data injection attacks on GPS measurements. We assume that the quadcopter is equipped with IMU and GPS sensors, and an arbitrary sensor fusion and controller are used to estimate and regulate the system’s states, respectively. We consider the notion of stealthiness in the most general form, where the attack is defined to be stealthy if it cannot be detected by any existing anomaly detector. Then, we show that if the closed-loop control system is incrementally exponentially stable, the attacker can cause arbitrarily large deviation in the position trajectory by compromising only the GPS measurements. We also show that to conduct such stealthy impact-full attack values, the attacker does not need to have access to the model of the system. Finally, we illustrate our results in a UAV case study.
|
|
TuA18 Regular Session, Suite 7 |
Add to My Program |
Nonlinear Systems I |
|
|
Chair: Shames, Iman | Australian National University |
Co-Chair: Furtat, Igor | Institute of Problems of Mechanical Engineering Russian Academy of Sciences |
|
10:00-10:20, Paper TuA18.1 | Add to My Program |
Density Function Based Control of Dynamical Systems |
|
Furtat, Igor | Institute of Problems of Mechanical Engineering Russian Academy |
Keywords: Nonlinear systems, Adaptive control, Robust adaptive control
Abstract: The paper considers some class of dynamical systems that called density systems. For such systems the derivative of quadratic function depends on so-called density function. The density function is used to set the properties of phase space, therefore, it influences the behaviour of investigated systems. A particular class of such systems is previously considered for (in)stability study of dynamical systems using the flow and divergence of a phase vector. In this paper, a more general class of such systems is considered, and it is shown that the density function can be used not only to study (in)stability, but also to set the properties of space in order to change the behaviour of dynamical systems. The development of control laws based on use the density function for systems with known parameters is considered. All obtained results are accompanied by the simulations illustrating the theoretical conclusions.
|
|
10:20-10:40, Paper TuA18.2 | Add to My Program |
An Evader Controller for the Pursuit–Evasion Problem Involving Single Integrator Dynamics |
|
Mera, Manuel | Esime Upt Ipn |
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Efimov, Denis | Inria |
Keywords: Nonlinear systems, Agents-based systems, Switched systems
Abstract: This paper contributes to the design of an evader controller for the pursuit–evasion problem, where both agents are described by single integrator dynamics. The proposed evader controller only requires a slight knowledge of the pursuer control law and state. The control law guarantees the evasion of the pursuer under a proper design of the evader controller parameters. Moreover, the synthesis of the evader controller is constructive and simple–to–tune since it is in terms of a linear matrix inequality. The stability analysis of the tracking error dynamics is based on a Lyapunov–like function approach. The effectiveness of the proposed evader control design is illustrated through some simulation results.
|
|
10:40-11:00, Paper TuA18.3 | Add to My Program |
Extensions of the Path-Integral Formula for Computation of Koopman Eigenfunctions |
|
Deka, Shankar | School of Electrical Engineering, Aalto University |
Vaidya, Umesh | Clemson University |
Keywords: Nonlinear systems, Data driven control, Learning
Abstract: Representing nonlinear dynamical systems using the Koopman Operator and its spectrum has distinct advantages in terms of linear interpretability of the model as well as in analysis and control synthesis through the use of well-studied techniques from linear systems theory. As such, efficient computation of Koopman eigenfunctions is of paramount importance towards enabling such Koopman-based constructions. To this end, several approaches have been proposed in literature, including data-driven, convex optimization, and Deep Learning-based methods. In our recent work, we proposed a novel approach based on path-integrals that allowed eigenfunction computations using a closed-form formula. In this paper, we present several important developments such as finite-time computations, relaxation of assumptions on the distribution of the principal Koopman eigenvalues, as well as extension towards saddle point systems, which greatly enhance the practical applicability of our method.
|
|
11:00-11:20, Paper TuA18.4 | Add to My Program |
Dynamic Partial State-Feedback Revisited for Output Tracking Using Lyapunov Redesign and Model-Following Control |
|
Tietze, Niclas | Technische Universität Ilmenau |
Wulff, Kai | TU Ilmenau |
Reger, Johann | TU Ilmenau |
Keywords: Nonlinear systems, Feedback linearization, Lyapunov methods
Abstract: We study the problem of dynamic partial state-feedback trajectory tracking for a class of minimumphase nonlinear systems in Byrnes-Isidori form using the Lyapunov redesign technique. We use an estimate of the unknown internal state in the feedback linearising control resulting in a dynamic control law. We consider three different approaches, namely, a simulation of the internal dynamics, an observer for the internal state, and model following control (MFC). Each design achieves asymptotic tracking. The observer-based approach and the MFC do so while overcoming the typical problem of unbounded discontinuous control signals if a sufficiently accurate estimate of the initial state of the process is available. Both approaches allow for asymptotic tracking with a continuous control signal if the initial state of the process is known. We demonstrate the tracking capabilities by a numerical example.
|
|
11:20-11:40, Paper TuA18.5 | Add to My Program |
Extended Object Tracking under a State-Coupled Model with Gaussian Mixture Distribution |
|
Li, Zhifei | Space Engineering University |
Liu, Chunsheng | National University of Defense Technology |
Ma, Shuli | Space Engineering University |
Wang, Hongyan | Space Engineering University |
Keywords: Nonlinear systems, Filtering, Stochastic optimal control
Abstract: This work proposes a state-coupled model (SCM) for extended object tracking, which treats the orientation and velocity as two dependent variables. With this model, the distribution of multiple measurements is modeled via Gaussian mixture density to match the actual automotive radar or Lidar data. As a result, SCM becomes a highly nonlinear model with multiplicative noise. To handle this challenge, we use the deterministic sampling approach to update the kinematics and orientation information, followed by a constraint condition. And the parameter of extent is estimated under a Bayesian framework with pseudo-measurement. An evaluation is conducted on simulated data, which illustrates that the proposed model and filter are effective.
|
|
11:40-12:00, Paper TuA18.6 | Add to My Program |
Joint Weakly Regularly Persistent Nonlinear Observability for Landmark-Based SLAM with Nonlinear Relatives Measurements (I) |
|
Flayac, Emilien | ISAE Supaero |
Shames, Iman | Australian National University |
Keywords: Nonlinear systems, Identification for control, Robotics
Abstract: In this paper, we give sufficient conditions on the input our system for weak regular observability in the general case of landmark-based Simultaneous Localisation and Mapping (SLAM) both with a world-centric and a sensor-centric point of view. We show notably that in the sensor-centric point of view, the dynamics of the robot is not important for our concept of observability and only its state and input trajectories matter. Besides, we prove that tracking circular trajectories imply weak regular observability jointly for 2D systems with several types of commonly used measurements in SLAM.
|
|
TuA19 Regular Session, Suite 8 |
Add to My Program |
Output Regulation |
|
|
Chair: Colaneri, Patrizio | Politecnico Di Milano |
Co-Chair: Ferrante, Francesco | Universita Degli Studi Di Perugia |
|
10:00-10:20, Paper TuA19.1 | Add to My Program |
Adaptive Observer-Based Output Regulation with Non-Smooth Non-Periodic Exogenous Signals |
|
Niu, Zirui | Imperial College London |
Chen, Kaiwen | Imperial College London |
Scarciotti, Giordano | Imperial College London |
Keywords: Output regulation, Adaptive control, Linear systems
Abstract: In this paper, we address the error-feedback output regulation problem for linear systems with non-smooth non-periodic exogenous signals. We design an adaptive observer under a relaxed persistence of excitation (PE) condition and we solve the error-feedback problem in the non-smooth case. In addition, we show that this relaxed PE condition is equivalent to a complete observability condition that can be checked textit{a priori} by means of an exogenous excitation (EE) condition. We finally show that, if the exogenous signals are generated by a traditional linear time-invariant implicit model, the EE condition is equivalent to a non-resonance-like condition.
|
|
10:20-10:40, Paper TuA19.2 | Add to My Program |
Data-Driven Gaussian Process Output Regulation for a Class Nonlinear Systems |
|
Harry, Telema | Queen's University |
Guay, Martin | Queen's University |
Keywords: Output regulation, Data driven control, Adaptive systems
Abstract: In this paper, we solve the robust output regulation problem (RORP) for a class of nonlinear systems using a data-driven approach to reconstruct the internal model unknown nonlinear continuous map online from some input and output data. The data-driven model is then used to estimate the ideal feed-forward steady-state control inputs obtained by solving the regulator equation instead of implementing it with an extended observer as in previous studies. Secondly, we implement an output feedback stabilizer that does not rely on the complete knowledge of the system but on output measurement of the regulated output, making the proposed approach suitable for systems with modelling errors. Finally, we showed through detailed Lyapunov analysis that under certain conditions the closed-loop system achieves practical asymptotic stability.
|
|
10:40-11:00, Paper TuA19.3 | Add to My Program |
A Descriptor Approach to Compensating Internal Models |
|
Colaneri, Patrizio | Politecnico Di Milano |
Incremona, Gian Paolo | Politecnico Di Milano |
Mirkin, Leonid | Technion - IIT |
Keywords: Output regulation, Linear systems
Abstract: The compensation of internal model is an approach to reduce the stabilization of a plant augmented with an (unstable) internal model to an equivalent stabilization problem for a fictitious plant, whose complexity is the same as that of the plant itself. This note presents a novel approach to construct this fictitious plant in state space, which results in simpler and more transparent procedures than those previously available. The main idea is to employ descriptor systems formalism, which facilitates manipulating systems with potentially non-proper transfer functions. The proposed procedure includes also an additional degree of freedom, which does not affect the complexity of the solution and can be used to tune its properties.
|
|
11:00-11:20, Paper TuA19.4 | Add to My Program |
Setpoint Tracking for a Class of Lur’e Discrete-Time Systems |
|
Ferrante, Francesco | Universita Degli Studi Di Perugia |
Leomanni, Mirko | University of Perugia |
Fravolini, Mario Luca | Universita' Di Perugia |
Tarbouriech, Sophie | LAAS-CNRS |
Keywords: LMIs, Output regulation, Lyapunov methods
Abstract: Setpoint regulation for a class of discrete-time nonlinear plants is addressed. The plant under consideration is a feedback interconnection of a square linear system (with the same number of input and output) and a nonlinearity obeying to an incremental quadratic constraint. Within this setting, we design a dynamic feedback controller ensuring, for the closed-loop system, internal exponential stability and output tracking of constant setpoints with tunable performance guarantees. A proportional-integral feedback controller is proposed. Combining Lyapunov theory and the use of a quadratic Lyapunov function, sufficient conditions for the design of the controller are stated in terms of linear matrix inequalities. An optimal controller design algorithm based on semidefinite programming is proposed to design the controller gains to improve transient performance and limit the control effort. Two numerical examples illustrate the application of the proposed methodology.
|
|
11:20-11:40, Paper TuA19.5 | Add to My Program |
Maneuvering Control of Uncertain Nonlinear Systems: An Output Regulation Viewpoint |
|
Zhang, Yibo | Shanghai Jiaotong University |
Wu, Wentao | Shanghai Jiao Tong University |
Xie, Tao | Shanghai Jiao Tong University |
Cheng, Peng | Shanghai Jiao Tong University |
Wu, Di | Hainan University |
Zhang, Weidong | Shanghai Jiaotong Univ |
Keywords: Output regulation, Uncertain systems, Adaptive control
Abstract: Maneuvering control aims to drive the controlled system to achieve a desired motion along a parameterized path, where the path variable, a scalar, tunes additional dynamic tasks. In this paper, we investigate the maneuvering control problem of uncertain nonlinear systems from the perspective of output regulation. Initially, we demonstrate that the maneuvering control problem can be transformed into an output regulation problem. Then, we design a controller for maneuvering control based on the internal model principle and a neural predictor-based dynamic surface control method. We prove that the resulting closed-loop system is input-to-state stable. The effectiveness of proposed theoretical results is verified via a simulation example.
|
|
11:40-12:00, Paper TuA19.6 | Add to My Program |
Parameter Adaptation for General Regulator Problems with Compensation of Internal Model |
|
Incremona, Gian Paolo | Politecnico Di Milano |
Colaneri, Patrizio | Politecnico Di Milano |
Mirkin, Leonid | Technion - IIT |
Keywords: Output regulation, Linear systems
Abstract: This paper investigates the general regulator problem with an internal model capable of adapting to the unknown parameters of persistent disturbances and/or reference signals affecting the measured output. Specifically, the proposed architecture based on stable compensators of internal model (CIM) allows to reduce the stabilization problem for an augmented system (the plant plus internal model) to that of a process without the internal model and with the complexity of the plant. Then, it is shown that, under certain conditions on the plant model, the proposed scheme makes the parameters of the internal model affect the closed-loop dynamics affinely. This, in turn, facilitates the incorporation of simple adaptation mechanisms with a global convergence.
|
|
TuA20 Regular Session, Suite 9 |
Add to My Program |
Stability Analysis |
|
|
Chair: Guerra, Thierry Marie | Polytechnic University Hauts-De-France |
Co-Chair: Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
|
10:00-10:20, Paper TuA20.1 | Add to My Program |
A Unified Framework for Convergence Analysis in Social Networks |
|
Lee, Ti-Chung | National Sun Yat-Sen University |
Huang, Jun-Kai | National Sun Yat-Sen University, NSYSU |
Su, Youfeng | Fuzhou University |
Keywords: Agents-based systems, Lyapunov methods, Stability of nonlinear systems
Abstract: This paper proposes a unified framework for the stability analysis of discrete-time nonlinear systems from social networks, including the Friedkin-Johnsen opinion model, two opinion dynamics models in the study of social power, and a general nonlinear polar opinion model. Three novel convergence results are proposed to treat various conditions based on LaSalle invariance principle. Several applications are provided to illustrate the power of the proposed framework.
|
|
10:20-10:40, Paper TuA20.2 | Add to My Program |
Stability Analysis of Boolean Networks: Exploring the Influence of Multi-Bit Function Perturbations |
|
Liu, Aixin | Shanghai Jiao Tong University |
Wang, Lin | Shanghai Jiao Tong University |
Chen, Guanrong | City University of Hong Kong |
Guan, Xin-Ping | Shanghai Jiao Tong University |
Keywords: Boolean control networks and logic networks, Stability of nonlinear systems, Discrete event systems
Abstract: This study investigates the impact of multi-bit function perturbations (MFPs) on the steady-state distribution of Boolean networks (BNs), with a focus on robust stability. First, the algebraic formulation of BNs under MFPs is introduced. Then, a novel stability criterion is proposed, demonstrating that monotonically decreasing perturbations can effectively ensure the stability of BNs against MFPs. A significant advantage of this approach is that it enables robust stability assessment without remodeling the perturbed BNs. Applying this methodology to a simplified Boolean model of the lactose operon in emph{Escherichia coli} reveals that MFPs can serve as strategic tools to promote stability within BNs, rather than being merely disruptive.
|
|
10:40-11:00, Paper TuA20.3 | Add to My Program |
Convex Modeling with Vertex and Overbounding Reduction for Stability Analysis of Nonlinear Systems |
|
Guerra, Thierry Marie | Polytechnic University Hauts-De-France |
Estrada-Manzo, Víctor | Universidad Politécnica De Pachuca |
Nguyen, Anh-Tu | Université Polytechnique Des Hauts-De-France |
Keywords: Fuzzy systems, LMIs
Abstract: This paper presents a new method for stability analysis of nonlinear systems through convex modeling. The proposed method consists of two main steps. First, we develop a quadratic optimization program to obtain the scheduling functions as well as the matrix vertices of an exact convex representation of a given nonlinear system. The induced numerical complexity of this new convex modeling can be significantly reduced with respect to the classical sector nonlinearity approach. Second, based on Lyapunov stability theory, we derive relaxed LMI sufficient conditions for nonlinear stability analysis, taking into account the characteristics of the new convex representation. A numerical example is given to illustrate the advantages of the proposal over related existing results.
|
|
11:00-11:20, Paper TuA20.4 | Add to My Program |
Aggregating Multi-Criteria Decision Analysis Results with a Novel Fuzzy Ranking Approach |
|
Więckowski, Jakub | National Institute of Telecommunications |
Salabun, Wojciech | National Institute of Telecommunications |
Keywords: Fuzzy systems, Uncertain systems, Robust control
Abstract: Aggregating results in Multi-Criteria Decision Analysis (MCDA) poses a significant challenge. The common practice within MCDA evaluation is to apply multiple techniques to a single decision problem. This leads to the necessity of determining aggregated results. Despite the development of various aggregation approaches, simplifying the representation of rankings after evaluation, such as calculating mean results or compromise solutions, often neglects critical differences in results. This simplification undermines the value of knowledge presented to decision-makers, providing an incomplete problem overview. To this end, this paper introduces a novel approach to aggregating MCDA results using a fuzzy ranking. Presenting rankings as fuzzy sets, this innovative approach allows for determining membership degrees, revealing the robustness of placing alternatives in specific ranking positions. The proposed aggregating procedure facilitates more informed decision-making without compromising information from various methods or measures, offering comprehensive insights into the influence of different assessment approaches on robustness. The study focuses on a supplier selection evaluation problem, validating the approach using the COmbinative Distance-based ASsessment (CODAS) method and diverse normalization techniques, showcasing its effectiveness in providing decision-makers with a holistic understanding of evaluation results and alternative robustness.
|
|
11:20-11:40, Paper TuA20.5 | Add to My Program |
Input-To-State Stable Integral Line-Of-Sight Guidance for Curved Paths with Anti-Windup Guarantees |
|
Schmidt-Didlaukies, Henrik M. | Norwegian University of Science and Technology |
Basso, Erlend Andreas | Norwegian University of Science and Technology |
Pettersen, Kristin Y. | Norwegian University of Science and Technology (NTNU) |
Keywords: Maritime control, Stability of nonlinear systems
Abstract: This letter considers integral {line-of-sight} (LOS) guidance for curved path following for underactuated marine vehicles. {The} proposed guidance scheme renders the resulting closed-loop system input-to-state stable (ISS) with respect to a function of the vehicle's velocities. Moreover, if the forward and sideways velocities are proportional, we show that the origin of the closed-loop system is uniformly globally asymptotically stable (UGAS). Remarkably, these results are derived without the standard assumption of a small crab angle. Furthermore, we discuss how the path parameter should be selected to ensure that the along-track error remains zero for all time, and we show the connection between selecting the path parameter through differential equations and optimization. Finally, we demonstrate the effectiveness of the proposed approach through numerical simulations of an underwater vehicle.
|
|
11:40-12:00, Paper TuA20.6 | Add to My Program |
Finite Sample Analysis for a Class of Subspace Identification Methods |
|
He, Jiabao | KTH Royal Institute of Technology |
Ziemann, Ingvar | University of Pennsylvania |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
Keywords: Subspace methods, Statistical learning
Abstract: While subspace identification methods (SIMs) are appealing due to their simple parameterization for MIMO systems and robust numerical realizations, a comprehensive statistical analysis of SIMs remains an open problem, especially in the non-asymptotic regime. In this work, we provide a finite sample analysis for a class of SIMs, which reveals that the convergence rates for estimating Markov parameters and system matrices are mathcal{O}(1/sqrt{N}), in line with classical asymptotic results. Based on the observation that the model format in classical SIMs is non-causal because of a projection step, we choose a parsimonious SIM that bypasses the projection step and strictly enforces a causal model to facilitate the analysis, where a bank of ARX models are estimated in parallel. Leveraging recent results from a finite sample analysis of an individual ARX model, we obtain a union error bound for an array of ARX models and proceed to derive error bounds for system matrices using robustness results for the singular value decomposition.
|
|
TuLuSp1 Special Session, Brown 3 |
Add to My Program |
AI in the Time of Control: An A-Political Rally |
|
|
Chair: Bitmead, Robert R. | University of California San Diego |
Co-Chair: Solo, Victor | University of New South Wales |
|
12:00-13:30, Paper TuLuSp1.1 | Add to My Program |
AI in the Time of Control: An A-Political Rally (I) |
|
Solo, Victor | University of New South Wales |
Khargonekar, Pramod | Univ. of California, Irvine |
Bitmead, Robert R. | University of California San Diego |
Egerstedt, Magnus | University of California, Irvine |
Keywords:
Abstract: The strong promotion of Artificial Intelligence (AI) as a panacea/menace/smokescreen across so many areas of human endeavor has captured the attention of scientists, policymakers, granting agencies and large fractions of the public. And yet, there is very serious concern both from inside and outside the AI community that the basis for optimism is misplaced, needs rigorous examination, sophisticated regulation, and public oversight. Primarily, there are unmet requirements of quantified performance, interpretation, generalization, etc, etc. Control theory and practice intersects many of the target domains pronounced for AI, and yet has both a more formal methodology and metrics of behavior, bringing the closed-loop firmly into the picture. The rally is aimed at exploring how to achieve this in order to meet these collective challenges and realize the potential opportunities in the control space. The central themes for consideration are: a) Reclaiming Expertise: Reclaiming agency over Robotics, Automation and technical Decision Sciences. b) Supplying Expertise: Reconstituting AI for Safety using the tools from Control. c) Solving the PhD student dilemma: Why Engineering Science trumps purely ‘Data Driven’ Engineering and can help deliver on some of the promise. d) Stopping the ‘brain drain’ from Engineering to Computer Science and attract talent to work on this more material agenda. *The lunch sessions are scheduled to start at 12:10 and conclude at 13:10. This timing is intended to allow for a smooth transition between the morning and afternoon sessions, giving participants sufficient time to navigate between consecutive events.*
|
|
TuLuSp2 Special Session, Amber 1 |
Add to My Program |
Early Career Lunch |
|
|
Chair: Pare, Philip E. | Purdue University |
Co-Chair: Bizyaeva, Anastasia | Cornell University |
|
12:00-13:30, Paper TuLuSp2.1 | Add to My Program |
Early Career Lunch (I) |
|
Pare, Philip E. | Purdue University |
Bizyaeva, Anastasia | Cornell University |
Butler, Brooks A. | University of California, Irvine |
Keywords:
Abstract: Graduate students, postdoctoral scholars, and early career researchers are warmly invited to a special lunch session. This lunch will be a perfect opportunity to meet new peers and to make a game plan for the rest of your conference agenda. It is also a chance to meet members of the newly formed NextCom committee within the IEEE Control Systems Society (CSS) and learn about upcoming resources, workshops, and networking opportunities aimed at supporting early career members of our community. All are welcome! Please join us to learn about all the exciting things happening in CSS and explore getting more involved! *The lunch sessions are scheduled to start at 12:10 and conclude at 13:10. This timing is intended to allow for a smooth transition between the morning and afternoon sessions, giving participants sufficient time to navigate between consecutive events.*
|
|
TuB01 Tutorial Session, Auditorium |
Add to My Program |
Model Predictive Control for Tracking Using Artificial References:
Fundamentals, Recent Results and Practical Implementation |
|
|
Chair: Limon, Daniel | Universidad De Sevilla |
Co-Chair: Köhler, Johannes | ETH Zurich |
Organizer: Limon, Daniel | Universidad De Sevilla |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
Organizer: Ferramosca, Antonio | Univeristy of Bergamo |
Organizer: Köhler, Johannes | ETH Zurich |
Organizer: Krupa, Pablo | Gran Sasso Science Institute |
Organizer: Alvarado, Ignacio | University of Seville |
Organizer: Alamo, Teodoro | Universidad De Sevilla |
|
13:30-13:50, Paper TuB01.1 | Add to My Program |
Model Predictive Control for Tracking Using Artificial References: Fundamentals, Recent Results and Practical Implementation (I) |
|
Krupa, Pablo | Gran Sasso Science Institute |
Köhler, Johannes | ETH Zurich |
Ferramosca, Antonio | Univeristy of Bergamo |
Alvarado, Ignacio | University of Seville |
Zeilinger, Melanie N. | ETH Zurich |
Alamo, Teodoro | Universidad De Sevilla |
Limon, Daniel | Universidad De Sevilla |
Keywords: Predictive control for linear systems, Predictive control for nonlinear systems
Abstract: This paper provides a comprehensive tutorial on a family of Model Predictive Control (MPC) formulations, known as MPC for tracking, which are characterized by including an artificial reference as part of the decision variables in the optimization problem. These formulations have several benefits with respect to the classical MPC formulations, including guaranteed recursive feasibility under online reference changes, as well as asymptotic stability and an increased domain of attraction. This tutorial paper introduces the concept of using an artificial reference in MPC, presenting the benefits and theoretical guarantees obtained by its use. We then provide a survey of the main advances and extensions of the original linear MPC for tracking, including its non-linear extension. Additionally, we discuss its application to learning-based MPC, discuss optimization aspects related to its implementation.
|
|
13:50-14:10, Paper TuB01.2 | Add to My Program |
Extensions of MPC for Tracking (I) |
|
Ferramosca, Antonio | Univeristy of Bergamo |
Keywords: Predictive control for linear systems, Stability of nonlinear systems, Optimization
Abstract: As with classical MPC, the linear MPC for tracking formulation has been extended to many other control paradigms. This talk presents extensions of the formulation to tracking periodic reference signals, the harmonic MPC for tracking formulation, robust control, economic control and zone control. We highlight how the use of artificial references can also provide substantial benefits to some of the major MPC paradigms.
|
|
14:10-14:30, Paper TuB01.3 | Add to My Program |
MPC for Tracking with Non-Linear Systems (I) |
|
Köhler, Johannes | ETH Zurich |
Keywords: Predictive control for nonlinear systems
Abstract: This talk shows how MPC for tracking and its variants (periodic, robust) can be naturally extended to non-linear systems. Theoretical properties and design aspects for nonlinear systems are covered, including terminal costs, convexity (or lack thereof), tractability of long horizon planning, and robust designs.
|
|
14:30-14:50, Paper TuB01.4 | Add to My Program |
Application to Learning-Based MPC (I) |
|
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Predictive control for nonlinear systems, Machine learning, Optimization
Abstract: This talk highlights how MPC for tracking formulations can be a key enabler to address problems beyond tracking in learning-based MPC. We discuss safe exploration in unknown environments, where a large region of attraction is crucial. Furthermore, application to distributed coverage problem in experiments is demonstrated, where tracking MPC enables efficient distributed coordination. We highlight how tracking MPC plays a crucial role to enable such applications, which go beyond tracking.
|
|
14:50-15:10, Paper TuB01.5 | Add to My Program |
Optimization Aspects of MPC for Tracking (I) |
|
Alamo, Teodoro | Universidad De Sevilla |
Keywords: Predictive control for linear systems, Stability of nonlinear systems, Optimization
Abstract: This talk presents optimization aspects related to how the MPC for tracking formulations presented in the previous talks can be solved online. We discuss the application and tuning of first-order methods, such as ADMM, to efficiently implement these formulations. We focus on leveraging the specific semi-banded structure of the problem and addressing practical aspects such as soft-constrained formulations, warm-start techniques, or unfeasibility detection.
|
|
15:10-15:30, Paper TuB01.6 | Add to My Program |
Implementation and Applications of MPC for Tracking (I) |
|
Krupa, Pablo | Gran Sasso Science Institute |
Keywords: Predictive control for linear systems, Stability of nonlinear systems, Optimization
Abstract: This talk provides first-hand insight and examples on the implementation of MPC for tracking. We show how to implement some of the formulations presented in the previous talks using currently available software tools (including both linear and non-linear). We also present results of applications of MPC for tracking formulations, including some real-world applications. The talk highlights both the ease of use of the formulations as well as its good practical performance.
|
|
TuB02 Invited Session, Amber 1 |
Add to My Program |
Learning-Based Control I: Model Learning and System Identification |
|
|
Chair: Müller, Matthias A. | Leibniz University Hannover |
Co-Chair: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Schoellig, Angela P | Technical University of Munich & University of Toronto |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
13:30-13:50, Paper TuB02.1 | Add to My Program |
A Least-Square Method for Non-Asymptotic Identification in Linear Switching Control (I) |
|
Sun, Haoyuan | Massachusetts Institute of Technology |
Jadbabaie, Ali | Massachusetts Institute of Technology |
Keywords: Identification, Switched systems, Learning
Abstract: In this paper, we study linear system identification in the setting of switched linear systems, where it is known that the underlying partially-observed linear dynamical system lies within a finite collection of known candidate models. We first consider the problem of identification from a given trajectory, which in this setting reduces to identifying the index of the true model with high probability. We characterize the finite-time sample complexity of this problem by leveraging recent advances in the non-asymptotic analysis of linear least-square methods in the literature. In comparison to the earlier results that assume no prior knowledge of the system, our approach takes advantage of the smaller hypothesis class and leads to the design of a learner with a dimension-independent sample complexity bound. Next, we consider the switching control of linear systems, where there is a candidate controller for each of the candidate models and data is collected through interacting the system with a collection of potentially destabilizing controllers. We develop a criterion that can detect those destabilizing controllers in finite time. By leveraging these results, we propose a data-driven switching strategy that identifies the unknown parameters of the underlying system. We then provide a non-asymptotic analysis of its performance and discuss its implications on the classical method of estimator-based supervisory control.
|
|
13:50-14:10, Paper TuB02.2 | Add to My Program |
Learning-Based Prescribed-Time Safety for Control of Unknown Systems with Control Barrier Functions |
|
Huang, Tzu-Yuan | Technical University of Munich(TUM) |
Zhang, Sihua | Beijing Institute of Technology |
Dai, Xiaobing | Technical University of Munich |
Capone, Alexandre | Technical University of Munich |
Todorovski, Velimir | University of California San Diego |
Sosnowski, Stefan | Technical University of Munich |
Hirche, Sandra | Technische Universität München |
Keywords: Machine learning, Statistical learning, Data driven control
Abstract: In many control system applications, state constraint satisfaction needs to be guaranteed within a prescribed time. While this issue has been partially addressed for systems with known dynamics, it remains largely unaddressed for systems with unknown dynamics. In this paper, we propose a Gaussian process-based time-varying control method that leverages backstepping and control barrier functions to achieve safety requirements within prescribed time windows. It can be used to keep a system within a safe region or to make it return to a safe region within a limited time window. These properties are cemented by rigorous theoretical results. The effectiveness of the proposed controller is demonstrated in a simulation of a robotic manipulator.
|
|
14:10-14:30, Paper TuB02.3 | Add to My Program |
Modeling and Predictive Control of Networked Systems Via Physics-Informed Neural Networks (I) |
|
Boca de Giuli, Laura | Politecnico Di Milano |
La Bella, Alessio | Politecnico Di Milano |
Farina, Marcello | Politecnico Di Milano |
Scattolini, Riccardo | Politecnico Di Milano |
Keywords: Neural networks, Nonlinear systems identification, Control of networks
Abstract: This article addresses the data-driven modeling and predictive control of networked systems. The main contribution is a novel methodology to develop a physics-informed recurrent neural network (PI-RNN) model with guaranteed stability properties. The idea consists in interconnecting multiple RNNs consistently with the known physical topology of the networked system, and in jointly training them while enforcing conditions guaranteeing the input-to-state stability of the PI-RNN model. The stability properties of the proposed PI-RNN model pave the way for the design of (i) a decentralized state observer and (ii) a Nonlinear Model Predictive Control (NMPC) regulator with convergence guarantees. The presented strategies are tested on a realistic large-scale networked system, i.e., a district heating network, demonstrating promising results from both the modeling and the control design perspective.
|
|
14:30-14:50, Paper TuB02.4 | Add to My Program |
Active Learning for Control-Oriented Identification of Nonlinear Systems (I) |
|
Lee, Bruce | University of Pennsylvania |
Ziemann, Ingvar | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Keywords: Statistical learning, Learning, Reinforcement learning
Abstract: Model-based reinforcement learning is an effective approach for controlling an unknown system. It is based on a longstanding pipeline familiar to the control community in which one performs experiments on the environment to collect a dataset, uses the resulting dataset to identify a model of the system, and finally performs control synthesis using the identified model. As interacting with the system may be costly and time consuming, targeted exploration is crucial for developing an effective control-oriented model with minimal experimentation. Motivated by this challenge, recent work has begun to study finite sample data requirements and sample efficient algorithms for the problem of optimal exploration in model-based reinforcement learning. However, existing theory and algorithms are limited to model classes which are linear in the parameters. Our work instead focuses on models with nonlinear parameter dependencies, and presents the first finite sample analysis of an active learning algorithm suitable for a general class of nonlinear dynamics. We find that after a short burn-in time, the excess control cost of our algorithm achieves the optimal rate, up to logarithmic factors. We validate our approach in simulation, showcasing the advantage of active, control-oriented exploration in controlling nonlinear systems.
|
|
14:50-15:10, Paper TuB02.5 | Add to My Program |
Single Trajectory Conformal Prediction |
|
Lee, Brian | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Keywords: Statistical learning, Learning, Data driven control
Abstract: We study the performance of risk-controlling prediction sets (RCPS), an empirical risk minimization-based formulation of conformal prediction, on a single trajectory of data from an unknown stochastic process. Our analysis characterizes the graceful degradation in RCPS performance as data becomes nearly arbitrarily dependent and nonstationary, subject only to a mild requirement that the underlying process is causal. By specializing this analysis, we find that RCPS attains guarantees comparable to those enjoyed on independent and identically distributed data whenever data is generated by an asymptotically stationary and mixing process. We then relate these conditions to system-theoretic properties like contractivity.
|
|
15:10-15:30, Paper TuB02.6 | Add to My Program |
Finite Sample Frequency Domain Identification (I) |
|
Tsiamis, Anastasios | ETH Zurich |
Abdalmoaty, Mohamed | ETH Zurich |
Smith, Roy S. | ETH Zurich |
Lygeros, John | ETH Zurich |
Keywords: Statistical learning, Machine learning, Identification
Abstract: We study non-parametric frequency-domain system identification from a finite-sample perspective. We assume an open loop scenario where the excitation input is periodic and consider the Empirical Transfer Function Estimate (ETFE), where the goal is to estimate the frequency response at certain desired (evenly-spaced) frequencies, given input-output samples. We show that under sub-Gaussian colored noise (in time-domain) and stability assumptions, the ETFE estimates are concentrated around the true values. The error rate is of the order of square root of M/Ntot where Ntot is the total number of samples and M is the number of desired frequencies. It also depends multiplicatively on du+(dudy)^(1/2), where du and dy are the dimensions of the input and output signals respectively. This rate remains valid for general irrational transfer functions and does not require a finite order state-space representation. By tuning M, we obtain a Ntot^(-1/3) finite-sample rate for learning the frequency response over all frequencies in the H infinity norm. Our result draws upon an extension of the Hanson-Wright inequality to semi-infinite matrices. We study the finite-sample behavior of ETFE in simulations.
|
|
TuB03 Invited Session, Amber 2 |
Add to My Program |
Open Multi-Agent Systems: Theory and Applications |
|
|
Chair: Franceschelli, Mauro | University of Cagliari |
Co-Chair: Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
Organizer: Franceschelli, Mauro | University of Cagliari |
Organizer: Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
Organizer: Hendrickx, Julien M. | UCLouvain |
Organizer: Oliva, Gabriele | University Campus Bio-Medico of Rome |
|
13:30-13:50, Paper TuB03.1 | Add to My Program |
Stability of Paracontractive Open Multi-Agent Systems (I) |
|
Deplano, Diego | University of Cagliari |
Franceschelli, Mauro | University of Cagliari |
Giua, Alessandro | University of Cagliari |
Keywords: Agents-based systems, Autonomous systems, Cooperative control
Abstract: In this paper, we examine networks consisting of multiple interacting agents that have the flexibility to join or leave the network at any moment, which we term open multi-agent systems (OMASs). Expanding upon the recently introduced theoretical framework for analyzing the dynamic characteristics of OMASs, we extend our study to encompass agents with vector states and discrete-time evolution. A key point of our work is the employment of the concept of ”open stability” w.r.t. the infinity norm, which naturally makes the distance between two points in the state independent of the number of agents. This obviates the necessity for distance normalization, as required by the standard Euclidean norm. Within this framework, the main contribution of our work is that of establishing sufficient conditions for the open stability of an OMAS, which include the boundedness of the arrival/departure process and the paracontractivity of the OMAS in the absence of arrivals/departures, thus generalizing existing results for contractive OMASs. To underscore the practical relevance of our theoretical framework, we present the formulation of the dynamic max-consensus protocol for OMASs. Through numerical simulations, we demonstrate the alignment of this protocol with the theoretical findings outlined in this manuscript.
|
|
13:50-14:10, Paper TuB03.2 | Add to My Program |
Distributed Average Consensus in Open Multi-Agent Systems (I) |
|
Hadjicostis, Christoforos N. | University of Cyprus |
Dominguez-Garcia, Alejandro D. | University of Illinois at Urbana-Champaign |
Keywords: Agents-based systems, Autonomous systems, Distributed control
Abstract: In this paper, we consider the problem of distributed average consensus in multi-agent systems, where each agent can come in or move out of the system, possibly multiple times. In the literature, such systems are referred to as open multi-agent systems. A typical goal in such settings is to use an iterative distributed algorithm to calculate the average of some quantities of interest each agent possesses, which can be crucial in many estimation, control, or optimization applications. We consider an open multi-agent setting and propose a distributed algorithm that allows the participating agents to track their average. More specifically, if the set of agents remaining in the computation eventually settles to a certain subset of agents, then the proposed algorithm allows them (under some mild connectivity conditions) to asymptotically reach consensus to the average of the quantities of interest these remaining agents hold. Analysis and numerical examples to illustrate the operation of the proposed algorithm are also provided.
|
|
14:10-14:30, Paper TuB03.3 | Add to My Program |
A Distributed Strategy for Generalized Biconnectivity Maintenance in Open Multi-Robot Systems (I) |
|
Restrepo, Esteban | CNRS, INRIA Rennes – Bretagne Atlantique |
Robuffo Giordano, Paolo | Centre National De La Recherche Scientifique (CNRS) |
Keywords: Distributed control, Cooperative control, Autonomous robots
Abstract: Preserving the connectivity of the underlying interaction graph in a multi-robot system is a necessary condition for allowing the group of robots to achieve a common task by resorting to only local information. However, in the context of open multi-robot systems, that is, when the number of robots in the team is not fixed, merely preserving connectivity of the current graph does not prevent the loss of connectivity after a robot joins/leaves the group. We present a distributed strategy to achieve biconnectivity, instead of simple connectivity, for a group of robots that allows establishment/deletion of interaction links as well as addition/removal of agents at anytime while guaranteeing that the connectivity, and thus functionality, of the team is always preserved. The proposed approach is completely distributed and embeds into a unique gradient-based control multiple constraints and requirements: (i) limited inter-robot communication ranges, (ii) limited field of view, (iii) desired inter-agent distances, and (iv) collision avoidance. Numerical simulations illustrate the effectiveness of our approach.
|
|
14:30-14:50, Paper TuB03.4 | Add to My Program |
Average Consensus Over Directed Networks in Open Multi-Agent Systems with Acknowledgement Feedback (I) |
|
Makridis, Evagoras | University of Cyprus |
Grammenos, Andreas | University of Cambridge |
Oliva, Gabriele | University Campus Bio-Medico of Rome |
Kalyvianaki, Evangelia | University of Cambridge |
Hadjicostis, Christoforos N. | University of Cyprus |
Charalambous, Themistoklis | University of Cyprus |
Keywords: Cooperative control, Distributed control, Agents-based systems
Abstract: In this paper, we address the distributed average consensus problem over directed networks in open multi-agent systems (OMAS), where the stability of the network is disrupted by frequent agent arrivals and departures, leading to a time-varying average consensus target. To tackle this challenge, we introduce a novel ratio consensus algorithm (OPENRC) based on acknowledgement feedback, designed to be robust to agent arrivals and departures, as well as to unbalanced directed network topologies. We demonstrate that when all active agents execute the OPENRC algorithm, the sum of their state variables remains constant during quiescent epochs when the network remains unchanged. By assuming eventual convergence during such quiescent periods following persistent variations in system composition and size, we prove the convergence of the OPENRC algorithm using column-stochasticity and mass- preservation properties. Finally, we apply and evaluate our proposed algorithm in a simulated environment, where agents are departing from and arriving in the network to highlight its resilience against changes in the network size and topology.
|
|
14:50-15:10, Paper TuB03.5 | Add to My Program |
Steering Opinions Over N-Dimensional Spherical Manifolds |
|
Chakravarthy, Animesh | University of Texas at Arlington |
Ghose, Debasish | Indian Institute of Science |
Keywords: Agents-based systems, Optimal control, Control applications
Abstract: This paper considers opinion dynamics on an n-sphere, where leaders steer their followers' opinions to a desirable region on the sphere. In the process of doing so, the leaders may encounter other leaders who influence the opinion of these followers and entice them away. To prevent this, the leader will need to exert additional effort to keep the followers together and away from the influence of the adversarial leaders. This paper formulates this problem as an optimal control problem of choosing a piecewise geodesic path to the opinion goal region while optimizing the cost incurred due to the additional effort required in order to negate the influence of the adversarial leaders. The paper builds upon several well-known concepts such as representation of opinions as vectors on an n-sphere, bounded confidence model that defines the interaction of proximal opinions, and the methods by which leaders try to influence the opinion of their followers and strengthen their hold on them. However, instead of looking at consensus and related concepts, the paper produces novel results on how leaders can successfully steer the opinion of their followers over the spherical manifold. Simulation results are given to illustrate the concepts presented in the paper.
|
|
15:10-15:30, Paper TuB03.6 | Add to My Program |
Resource Sharing with Autonomous Agents in Cloud-Edge Computing Networks Via Mechanism Design |
|
Parvizi, Sajad | University of Tehran |
Montazeri, Mina | Empa |
Kebriaei, Hamed | University of Tehran |
Keywords: Agents-based systems, Autonomous systems, Optimal control
Abstract: Executing computation tasks through cloud–edge collaboration has emerged as a promising method to enhance the quality of service for applications. Typically, cloud servers and edge servers are selfish and rational. Therefore, it is crucial to develop incentive mechanisms that maximize cloud server profit and simultaneously motivate idle edge servers to participate in the task executing process while edge servers, as autonomous agents, choose their resource-sharing levels by themselves. This paper addresses the challenges of resource limitations and heterogeneity in edge computing by proposing a novel mechanism that integrates contract theory with Stackelberg game properties considering asymmetric information and the autonomous nature of edge servers. To propose an optimal mechanism, we design a linear form of reward function such that the mechanism’s goals are met. The mechanism allows edge servers to autonomously decide their level of resource contribution while ensuring the maximization of the cloud server's utility. The proposed mechanism not only facilitates efficient resource utilization but also guarantees the truthful and rational participation of edge servers. Initially, the proposed mechanism is conceptualized as a non-convex functional optimization with a dual continuum of constraints. However, we illustrate that by deriving an equivalent representation of the constraints, it can be transformed into a convex optimal control problem. Simulation results demonstrate the efficiency of our proposed incentive mechanism approach.
|
|
TuB04 Invited Session, Amber 3 |
Add to My Program |
Cyber-Physical Systems: Resilience, Cybersecurity, and Privacy II:
Resilience and Privacy |
|
|
Chair: Selvi, Daniela | Università Di Pisa |
Co-Chair: Soudjani, Sadegh | Newcastle University |
Organizer: Selvi, Daniela | Università Di Pisa |
Organizer: Sadabadi, Mahdieh S. | The University of Manchester |
Organizer: Murguia, Carlos | Eindhoven University of Technology |
Organizer: Ferrari, Riccardo M.G. | Delft University of Technology |
Organizer: Soudjani, Sadegh | Max Planck Institute for Software Systems |
|
13:30-13:50, Paper TuB04.1 | Add to My Program |
A Verifiable Computing Scheme for Encrypted Control Systems |
|
Stabile, Francesca | University of Calabria |
Lucia, Walter | Concordia University |
Youssef, Amr | Concordia University |
Franze, Giuseppe | Universita' Della Calabria |
Keywords: Networked control systems
Abstract: The proliferation of cloud computing technologies has paved the way for deploying networked encrypted control systems, offering high performance, remote accessibility and privacy. However, in scenarios where the control algorithms run on third-party cloud service providers, the control's logic might be changed by a malicious agent on the cloud. Consequently, it is imperative to verify the correctness of the control signals received from the cloud. Traditional verification methods, like zero-knowledge proof techniques, are computationally demanding in both proof generation and verification, may require several rounds of interactions between the prover and verifier and, consequently, are inapplicable in real-time control system applications. In this paper, we present a novel computationally inexpensive verifiable computing solution inspired by the probabilistic cut-and-choose approach. The proposed scheme allows the plant's actuator to validate the computations accomplished by the encrypted cloud-based networked controller without compromising the control scheme’s performance. We showcase the effectiveness and real-time applicability of the proposed verifiable computation scheme using a remotely controlled Khepera-IV differential-drive robot.
|
|
13:50-14:10, Paper TuB04.2 | Add to My Program |
Event-Triggered Resilient Control Design in Cyber-Physical Systems Subject to Covert Attacks (I) |
|
Khorasani, Khashayar | Concordia University |
Eslami, Ali | Concordia University |
Keywords: Cyber-Physical Security, Resilient Control Systems, Attack Detection
Abstract: This paper investigates the resilient control design problem of event-triggered Cyber-Physical Systems (CPS) when the system is subject to covert attacks. To avoid continuous communication and save resources, event-triggered schemes are considered on the communication channels between the plant and the Command and Control (C&C) center, on both the input and output channels. By utilizing the notion of auxiliary systems, we have provided an approach to estimate the cyberattack signals and design a resilient controller which can achieve stability of the CPS system. Finally, illustrative examples are provided featuring a Vertical Take-Off and Landing (VTOL) aircraft to demonstrate the effectiveness and capabilities of our approach in mitigating covert attacks.
|
|
14:10-14:30, Paper TuB04.3 | Add to My Program |
Security-Enhancing Filters against Internal Frequency Principle-Based Integrity Attacks (I) |
|
Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Sadabadi, Mahdieh S. | The University of Manchester |
Zamai, Eric | Institut National Polytechnique De Grenoble, Laboratoire |
Keywords: Cyber-Physical Security, Resilient Control Systems, Linear systems
Abstract: This paper presents a novel frequency-domain approach for control input filtering in networked control systems subject to integrity attacks on control inputs. The proposed approach relies on adding linear time-invariant filters to control loops, placed between received control actions and plant actuators, whose aim is to mitigate the impact of attacks on control performance. The filter synthesis problem can be cast in terms of a multi-objective synthesis problem subject to multiple frequency-domain inequality specifications in finite and infinite frequency ranges. The problem is formulated as a convex optimization problem subject to a set of Linear Matrix Inequalities. Simulation results demonstrate the effectiveness of the proposed security-enhancing approach.
|
|
14:30-14:50, Paper TuB04.4 | Add to My Program |
Optimal Privacy-Preserving Transmission Schedule against Eavesdropping Attacks on Remote State Estimation |
|
Zou, Jiaying | Shanghai University |
Liu, Hanxiao | Shanghai University |
Liu, Chun | Shanghai University |
Ren, Xiaoqiang | Shanghai University |
Wang, Xiaofan | Shanghai University |
Keywords: Estimation, Networked control systems, Sensor networks
Abstract: This paper is concerned with the preservation of privacy in remote state estimation of cyber-physical systems. A privacy-preserving transmission scheduling strategy against eavesdropping is proposed, incorporating three operational modes for sensors: silence, direct transmission, and noise-injected transmission. This strategy is designed to minimize the transmission cost and estimation error covariance for the remote estimator while maximizing the estimation error covariance for eavesdroppers. Threshold structures are demonstrated for optimal transmission schedules in different scenarios. Additionally, a novel correlation between the optimal transmission choice and the magnitude of injection noise is presented, particularly pertinent to scenarios involving direct transmission and transmission with injection noise. This correlation is important in balancing transmission information integrity against privacy concerns. Finally, several numerical examples are presented to demonstrate the effectiveness of the theoretical results.
|
|
14:50-15:10, Paper TuB04.5 | Add to My Program |
Resilient Control Design for EV Charging Unit with Multiple Modules Consisting of Single Active Bridge (SAB) Converters (I) |
|
Manasis, Apostolos | University of Patras |
Papageorgiou, Panos | University of Patras |
Konstantopoulos, George | University of Patras |
Keywords: Power systems, Resilient Control Systems, Nonlinear systems
Abstract: Fast charging stations for electric vehicles (EVs) often consist of charging units with multiple modules connected in parallel to achieve high power ratings and can suffer from cyber-attacks in the modern smart grid architecture or measurement/sensor/communication failure. In this paper, an EV charging unit with multiple single active bridge (SAB) converters connected in parallel is considered and a novel nonlinear resilient controller is proposed to achieve accurate power sharing among the converters with inherent current limitation, while enhancing the entire charging unit resilience under abnormal scenarios. By taking into account the dynamic model of the parallel SAB converters and using invariant set and nonlinear ultimate boundedness theories, it is proven that the current injected to the EV battery by each converter is limited below a maximum value at all times. Furthermore, based on the novel dynamic structure of the controller, it is shown that when an unexpected event (e.g. setpoint attack, sensor fault) occurs at one or more converters, the inherent current limiting property protects the corresponding devices and the remaining non-compromised units automatically share the remaining power to the battery, thus enhancing the EV charger resilience. An EV charging unit with 4 modules is simulated to verify the effectiveness and increased resilience of the proposed control approach under setpoint attacks and sensor/measurement faults.
|
|
15:10-15:30, Paper TuB04.6 | Add to My Program |
Secondary Defense Strategies of AC Microgrids under Polynomially Unbounded FDI Attacks and Communication Link Faults |
|
Wang, Yichao | University of Connecticut |
Rajabinezhad, Mohamadamin | University of Connecticut (UCONN) |
Zuo, Shan | University of Connecticut |
Keywords: Cooperative control, Lyapunov methods, Distributed control
Abstract: This paper presents fully distributed, resilient secondary defense strategies for AC microgrids considering both communication link faults and a broader spectrum of unbounded false data injection (FDI) attacks on control input channels. In contrast to existing solutions that address bounded faults or unbounded attacks on the input channels with bounded first-order time derivatives, the proposed strategies aim to enhance the defense capabilities against polynomially unbounded FDI attacks while the communication links are under faults. Resilient defense strategies for AC microgrids are developed to mitigate the adverse effects of the polynomially unbounded FDI attacks on control input channels and communication link faults, ensuring the stable and resilient operation of AC microgrids. Through rigorous Lyapunov-based stability analysis, the formal certification of the proposed strategies is demonstrated in achieving uniformly ultimately bounded convergence in frequency regulation, voltage containment, and active power sharing in multi-inverter-based AC microgrids. The effectiveness of these resilient strategies is further validated on a modified IEEE 34-bus test feeder system with four inverter-based distributed energy resources.
|
|
TuB05 Invited Session, Amber 4 |
Add to My Program |
Recent Advances in Distributed Optimization and Learning Algorithms |
|
|
Chair: Pu, Shi | Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong, Shenzhen |
Co-Chair: Wai, Hoi-To | The Chinese University of Hong Kong |
Organizer: Pu, Shi | The Chinese University of Hong Kong, Shenzhen |
Organizer: Xu, Jinming | Zhejiang University |
Organizer: Wai, Hoi-To | The Chinese University of Hong Kong |
Organizer: Nedich, Angelia | Arizona State University |
|
13:30-13:50, Paper TuB05.1 | Add to My Program |
Zeroth-Order Algorithm Design with Orthogonal Direction for Distributed Weakly Convex Optimization (I) |
|
Wang, Renyi | Anhui University |
Fan, Yuan | Anhui University |
Cheng, Songsong | Anhui University |
Keywords: Optimization algorithms, Network analysis and control, Learning
Abstract: This paper investigates a zeroth-order algorithm to solve a distributed weakly convex optimization problem over a multi-agent network, where each agent in the network has access to a local weakly convex objective function. We utilize a pseudo-gradient estimation scheme with orthogonal random directions to estimate the gradient information, which is more general than the existing coordinate descent, discretized gradient descent, and spherical smoothing methods. Moreover, we design a projected pseudo-gradient algorithm with a diminishing step size to achieve the optimal solution. Furthermore, we show the proposed algorithm converges to the optimal solution with an O({ln k}/{sqrt k}) convergence rate from the perspective of the Moreau envelope. Finally, we provide a numerical example to illustrate the effectiveness of the proposed algorithm.
|
|
13:50-14:10, Paper TuB05.2 | Add to My Program |
A Zeroth-Order Proximal Algorithm for Consensus Optimization (I) |
|
Wang, Chengan | Shanghaitech Univeristy |
Ou, Zichong | ShanghaiTech University |
Lu, Jie | ShanghaiTech University |
Keywords: Optimization algorithms, Optimization
Abstract: This paper considers a consensus optimization problem, where all the nodes in a network, with access to the zeroth-order information of its local objective function only, attempt to cooperatively achieve a common minimizer of the sum of their local objectives. To address this problem, we develop ZoPro, a zeroth-order proximal algorithm, which incorporates a zeroth-order oracle for approximating Hessian and gradient into a recently proposed, high-performance distributed second-order proximal algorithm. We show that the proposed ZoPro algorithm, equipped with a dynamic stepsize, converges linearly to a neighborhood of the optimum in expectation, provided that each local objective function is strongly convex and smooth. Extensive simulations demonstrate that ZoPro converges faster than several state-of-the-art distributed zeroth-order algorithms and outperforms a few distributed second-order algorithms in terms of running time for reaching given accuracy.
|
|
14:10-14:30, Paper TuB05.3 | Add to My Program |
Distributed Optimization and Learning with Automated Stepsizes (I) |
|
Chen, Ziqin | Clemson University |
Wang, Yongqiang | Clemson University |
Keywords: Optimization algorithms, Machine learning, Cooperative control
Abstract: The selection of stepsizes has always been an elusive task in distributed optimization and learning. Although some stepsize-automation approaches have been proposed in centralized optimization, these approaches are inapplicable in the distributed setting. This is because in distributed optimization/learning, letting individual agents adapt their own stepsizes unavoidably results in stepsize heterogeneity, which can easily lead to algorithmic divergence. To solve this issue, we propose an approach that enables agents to adapt their individual stepsizes without any manual adjustments or global knowledge of the objective function. To the best of our knowledge, this is the first algorithm to successfully automate stepsize selection in distributed optimization/learning. Its performance is validated using several machine learning applications, including logistic regression, matrix factorization, and image classification.
|
|
14:30-14:50, Paper TuB05.4 | Add to My Program |
Surplus-Based ADMM for Distributed Constrained Optimization Over Directed Graphs (I) |
|
Ji, Qiutong | Southeast University |
Wen, Guanghui | Southeast University |
Yang, Tao | Northeastern University |
Keywords: Optimization
Abstract: This paper introduces a distributed parallel Alternating Direction Method of Multipliers (ADMM) algorithm for solving the distributed constrained optimization problem over directed graphs. To effectively handle the effect of asymmetric information communication on the convergence of the optimization algorithm, a surplus-based averaging consensus algorithm is integrated into the ADMM-based optimization algorithm. Unlike existing distributed ADMM algorithms over directed graphs that focus on the case with solely local constraints, the proposed algorithm can deal with both local constraints and coupling constraints. Under the assumption that the objective function is convex and the underlying graph is strongly connected, the convergence of the surplus-based ADMM to an optimal solution of the distributed constrained problem is proved. Finally, numerical simulations are conducted to validate the effectiveness of the proposed algorithm.
|
|
14:50-15:10, Paper TuB05.5 | Add to My Program |
Decentralized Multi-Armed Bandit Can Outperform Classic Upper Confidence Bound: A Homogeneous Case Over Strongly Connected Graphs (I) |
|
Zhu, Jingxuan | Zhejiang Lab |
Liu, Ji | Stony Brook University |
Keywords: Cooperative control, Distributed control, Agents-based systems
Abstract: This paper studies a homogeneous decentralized multi-armed bandit problem, in which a network of multiple agents faces the same set of arms, and each agent aims to minimize its own regret. A fully decentralized upper confidence bound (UCB) algorithm is proposed for a multi-agent network whose neighbor relations are described by a directed graph. It is shown that the decentralized algorithm guarantees each agent to achieve a lower logarithmic asymptotic regret compared to the classic UCB algorithm, provided the neighbor graph is strongly connected. The improved asymptotic regret upper bound is reciprocally related to the maximal size of a local neighborhood within the network. The roles of graph connectivity, maximum local degree, and network size are analytically elucidated in the expression of regret.
|
|
15:10-15:30, Paper TuB05.6 | Add to My Program |
Differentially Private Online Federated Learning with Correlated Noise (I) |
|
Zhang, Jiaojiao | KTH Royal Institute of Technology |
Zhu, Linglingzhi | The Chinese University of Hong Kong |
Johansson, Mikael | KTH - Royal Institute of Technology |
Keywords: Optimization algorithms, Machine learning, Computer/Network Security
Abstract: We introduce a novel differentially private algorithm for online federated learning that employs temporally correlated noise to enhance utility while ensuring privacy of continuously released models. To address challenges posed by DP noise and local updates with streaming non-iid data, we develop a perturbed iterate analysis to control the impact of the DP noise on the utility. Moreover, we demonstrate how the drift errors from local updates can be effectively managed under a quasi-strong convexity condition. Subject to an (epsilon, delta)-DP budget, we establish a dynamic regret bound over the entire time horizon, quantifying the impact of key parameters and the intensity of changes in dynamic environments. Numerical experiments confirm the efficacy of the proposed algorithm.
|
|
TuB06 Regular Session, Amber 5 |
Add to My Program |
Network Analysis and Control II |
|
|
Chair: Hendrickx, Julien M. | UCLouvain |
Co-Chair: Bazanella, Alexandre S. | Univ. Federal Do Rio Grande Do Sul |
|
13:30-13:50, Paper TuB06.1 | Add to My Program |
Extending Identifiability Results from Isolated Networks to Embedded Networks |
|
Mapurunga, Eduardo | Universidade Federal Do Rio Grande Do Sul |
Gevers, Michel | Univ. Catholique De Louvain |
Bazanella, Alexandre S. | Univ. Federal Do Rio Grande Do Sul |
Keywords: Network analysis and control, Identification, Networked control systems
Abstract: This paper deals with the design of Excitation and Measurement Patterns (EMPs) for the identification of dynamic networks, when the objective is to identify only a subnetwork embedded in a larger network. Recent results have shown how to construct EMPs that guarantee identifiability for a range of networks with specific graph topologies, such as trees, loops, parallel networks, or Directed Acyclic Graphs (DAGs). However, an EMP that is valid for the identification of a subnetwork taken in isolation may no longer be valid when that subnetwork is embedded in a larger network. Our main contribution is to exhibit conditions under which it does remain valid, and to propose ways to enhance such EMP when these conditions are not satisfied.
|
|
13:50-14:10, Paper TuB06.2 | Add to My Program |
Nonlinear Identifiability of Directed Acyclic Graphs with Partial Excitation and Measurement |
|
Vizuete, Renato | UCLouvain |
Hendrickx, Julien M. | UCLouvain |
Keywords: Network analysis and control, Identification
Abstract: We analyze the identifiability of directed acyclic graphs in the case of partial excitation and measurement. We consider an additive model where the nonlinear functions located in the edges depend only on a past input, and we analyze the identifiability problem in the class of pure nonlinear functions satisfying f(0)=0. We show that any identification pattern (set of measured nodes and set of excited nodes) requires the excitation of sources, measurement of sinks and the excitation or measurement of the other nodes. Then, we show that a directed acyclic graph (DAG) is identifiable with a given identification pattern if and only if it is identifiable with the measurement of all the nodes. Next, we analyze the case of trees where we prove that any identification pattern guarantees the identifiability of the network. Finally, by introducing the notion of a generic nonlinear network matrix, we provide sufficient conditions for the identifiability of DAGs based on the notion of vertex-disjoint paths.
|
|
14:10-14:30, Paper TuB06.3 | Add to My Program |
Belief Samples Are All You Need for Social Learning |
|
Jafari, Mahyar | Massachusetts Institute of Technology |
Ajorlou, Amir | Massachusetts Institute of Technology |
Jadbabaie, Ali | Massachusetts Institute of Technology |
Keywords: Network analysis and control, Learning, Iterative learning control
Abstract: In this paper, we consider the problem of social learning, where a group of agents embedded in a social network are interested in learning an underlying state of the world. Agents have incomplete, noisy, and heterogeneous sources of information, providing them with recurring private observations of the underlying state of the world. Agents can share their learning experience with their peers by taking actions observable to them, with values from a finite feasible set of states. Actions can be interpreted as samples from the beliefs which agents may form and update on what the true state of the world is. Sharing samples, in place of full beliefs, is motivated by the limited communication, cognitive, and information-processing resources available to agents especially in large populations. Previous work pose the question as to whether learning with probability one is still achievable if agents are only allowed to communicate samples from their beliefs. We provide a definite positive answer to this question, assuming a strongly connected network and a ``collective distinguishability'' assumption, which are both required for learning even in full-belief-sharing settings. In our proposed belief update mechanism, each agent's belief is a normalized weighted geometric interpolation between a fully Bayesian private belief --- aggregating information from the private source --- and an ensemble of empirical distributions of the samples shared by her neighbors over time. By carefully constructing asymptotic almost-sure lower/upper bounds on the frequency of shared samples matching the true state/or not, we rigorously prove the convergence of all the beliefs to the true state, with probability one.
|
|
14:30-14:50, Paper TuB06.4 | Add to My Program |
Kron Reduction of Nonlinear Networks |
|
van der Schaft, Arjan | Univ. of Groningen |
Besselink, Bart | University of Groningen |
Huijzer, Anne-Men | University of Groningen |
Keywords: Network analysis and control, Model/Controller reduction, Large-scale systems
Abstract: Kron reduction is concerned with the elimination of interior nodes of physical network systems such as linear resistor electrical circuits. In this paper it is shown how this can be extended to networks with nonlinear static relations between the variables associated to the edges of the underlying directed graph.
|
|
14:50-15:10, Paper TuB06.5 | Add to My Program |
Practical Synchronization of Perturbed Networks of Semi-Passive Systems |
|
Lazri, Anes | PARIS SACLAY |
Maghenem, Mohamed Adlene | Gipsa Lab, CNRS, France |
Panteley, Elena | CNRS |
Loria, Antonio | CNRS |
Keywords: Network analysis and control, Networked control systems, Nonlinear systems
Abstract: We study practical synchronization for heterogeneous networks of nonlinear systems in the presence of bounded perturbations. Under the assumption that the nodes are state semi-passive and the interconnection graph admits a spanning tree, we establish uniform ultimate boundedness of the solutions and, consequently, practical synchronization. That is, we show that the systems’ trajectories approach each other up to a steady state error. The magnitude of this steady-state error can be made arbitrarily small by increasing a scalar coupling gain. The results are shown to hold under the assumption that (at least) a single “well-located” node in the network enjoys some robustness properties. Our theoretical results are fairly general in regards to the topology and are illustrated in simulation on a case-study of networked mobile robots.
|
|
15:10-15:30, Paper TuB06.6 | Add to My Program |
Maintaining Strong Structural Controllability for Multi-Agent-Systems with Varying Number of Agents |
|
Schmidtke, Vincent | University of Kassel |
Al-Maqdad, Rami | University of Kassel |
Stursberg, Olaf | University of Kassel |
Keywords: Network analysis and control, Networked control systems, Time-varying systems
Abstract: This paper investigates strong structural controllability for multi-agent systems (MAS) with dynamically joining and leaving agents. Changing sets of agents can compromise controllability if only a subset of agents is equipped with control inputs. Based on the graph-theoretical concept of zero forcing, conditions are derived under which MAS with joining and leaving agents can maintain strong structural controllability without needing to reconfigure the leader set. Two methods are proposed: The first, employs so-called zero forcing chains, to provide definitive statements on maintaining SSC. The second is based on forcing agents and offers a quick assessment of feasibility for joining or leaving, but confirms SSC only in specific cases.
|
|
TuB07 Regular Session, Amber 6 |
Add to My Program |
Game Theory IV |
|
|
Chair: Lavaei, Javad | UC Berkeley |
Co-Chair: Mu, Yifen | Chinese Academy of Sciences |
|
13:30-13:50, Paper TuB07.1 | Add to My Program |
Optimal Contract Design for End-Of-Life Care Payments |
|
Jiang, Muyan | UC Berkeley |
Chen, Ying | UC Berkeley |
Chen, Xin | UC Berkeley |
Lavaei, Javad | UC Berkeley |
Aswani, Anil | UC Berkeley |
Keywords: Game theory, Modeling, Optimization
Abstract: A large fraction of total healthcare expenditure occurs due to end-of-life (EOL) care, which means it is important to study the problem of more carefully incentivizing necessary versus unnecessary EOL care because this has the potential to reduce overall healthcare spending. This paper introduces a principal-agent model that integrates a mixed payment system of fee-for-service and pay-for-performance in order to analyze whether it is possible to better align healthcare provider incentives with patient outcomes and cost-efficiency in EOL care. The primary contributions are to derive optimal contracts for EOL care payments using a principal-agent framework under three separate models for the healthcare provider, where each model considers a different level of risk tolerance for the provider. We derive these optimal contracts by converting the underlying principal-agent models from a bilevel optimization problem into a single-level optimization problem that can be analytically solved. Our results are demonstrated using a simulation where an optimal contract is used to price intracranial pressure monitoring for traumatic brain injuries.
|
|
13:50-14:10, Paper TuB07.2 | Add to My Program |
Distributed Generalized Nash Equilibrium Seeking for Constrained N-Cluster Games with Second-Order Dynamics |
|
Zhao, Yan | Tongji University |
Meng, Min | Tongji University |
Li, Xiuxian | Tongji University |
Xu, Jia | Tongji University |
Keywords: Game theory, Distributed control, Optimization algorithms
Abstract: This paper studies N-cluster games with second-order dynamics, wherein both local convex set constraints and nonlinear coupled inequality constraints are took into account. The presence of second-order dynamics coupled with constraints puts up obstacles to algorithm design and analysis, since it may be impossible to directly determine the decisions of players based on their control inputs. In order to facilitate autonomous execution of N-cluster game tasks by second-order players, this paper employs state feedback, projection, primal-dual, dynamic average consensus, and passivity methods to design a distributed algorithm, which can regulate the decisions of players to satisfy the set constraints all the time. Additionally, rigorous analysis of the convergence of the proposed algorithm is provided in this paper. Finally, a simulation example is presented to validate the effectiveness of the proposed algorithm.
|
|
14:10-14:30, Paper TuB07.3 | Add to My Program |
Two Competing Populations with a Common Environmental Resource |
|
Paarporn, Keith | University of Colorado, Colorado Springs |
Nelson, James | University of Colorado, Colorado Springs |
Keywords: Game theory, Nonlinear systems, Optimization
Abstract: Feedback-evolving games is a framework that models the co-evolution between payoff functions and an environmental state. It serves as a useful tool to analyze many social dilemmas such as natural resource consumption, behaviors in epidemics, and the evolution of biological populations. However, it has primarily focused on the dynamics of a single population of agents. In this paper, we consider the impact of two populations of agents that share a common environmental resource. We focus on a scenario where individuals in one population are governed by an environmentally ``responsible" incentive policy, and individuals in the other population are environmentally ``irresponsible". An analysis on the asymptotic stability of the coupled system is provided, and conditions for which the resource collapses are identified. We then derive consumption rates for the irresponsible population that optimally exploit the environmental resource, and analyze how incentives should be allocated to the responsible population that most effectively promote the environment via a sensitivity analysis.
|
|
14:30-14:50, Paper TuB07.4 | Add to My Program |
Periodicity in Dynamical Games Driven by the Hedge Algorithm and Myopic Best Response |
|
Guo, Xinxiang | Academy of Mathematics and Systems Science, Chinese Academy of S |
Mu, Yifen | Chinese Academy of Sciences |
Yang, Xiaoguang | Academy of Mathematics and Systems Science, Chinese Academy of S |
Keywords: Game theory, Nonlinear systems, Reinforcement learning
Abstract: In this paper, we consider the n times n two-payer zero-sum repeated game in which one player (player X) employs the popular Hedge (also called multiplicative weights update) learning algorithm while the other player (player Y) adopts the myopic best response. The theoretical analysis on the dynamics of such game system is still rare, which is however of promising interests. We investigate the dynamics of such Hedge-myopic system by defining a metric Q(textbf{x}_t), which measures the distance between the stage strategy textbf{x}_t and Nash Equilibrium (NE) strategy of player X. We analyze the trend of Q(textbf{x}_t) and prove that it is bounded and can only take finite values on the evolutionary path when the payoff matrix is rational and the game has an interior NE. Based on this, we prove that the stage strategy sequence of both players are periodic after finite stages and the time-averaged strategy of player Y within one period is an exact NE strategy.
|
|
14:50-15:10, Paper TuB07.5 | Add to My Program |
Generalized Individual Q-Learning for Polymatrix Games with Partial Observations |
|
Donmez, Ahmed Said | Bilkent University |
Sayin, Muhammed Omer | Bilkent University |
Keywords: Game theory, Learning, Reinforcement learning
Abstract: This paper addresses the challenge of limited observations in non-cooperative multi-agent systems where agents can have partial access to other agents' actions. We present the generalized individual Q-learning dynamics that combine belief-based and payoff-based learning for the networked interconnections of more than two self-interested agents. This approach leverages access to opponents' actions whenever possible, demonstrably achieving a faster (guaranteed) convergence to quantal response equilibrium in multi-agent zero-sum and potential polymatrix games. Notably, the dynamics reduce to the well-studied smoothed fictitious play and individual Q-learning under full and no access to opponent actions, respectively. We further quantify the improvement in convergence rate due to observing opponents' actions through numerical simulations.
|
|
15:10-15:30, Paper TuB07.6 | Add to My Program |
Distributed Learning Dynamics Converging to the Core of B-Matchings |
|
Hamed, Aya | University of Illinois Urbana-Champaign |
Shamma, Jeff S. | University of Illinois at Urbana-Champaign |
Keywords: Game theory, Learning
Abstract: B-Matching is a special case of matching problems where nodes can join multiple matchings with the degree of each node constrained by an upper bound, the node's B-value. The core solution of a bipartite B-matching is both a matching between the nodes respecting the upper bound constraint and an allocation of the weights of the edges among the nodes such that no group of nodes can deviate and collectively gain higher allocation. We present two learning dynamics that converge to the core of the bipartite B-matching problems. The first dynamics are centralized dynamics in the nature of the Hungarian method, which converge to the core in a polynomial time. The second dynamics are distributed dynamics, which converge to the core with probability one. For the distributed dynamics, a node maintains only a state consisting of (i) the aspiration levels for all of its possible matches and (ii) the matches, if any, to which it belongs. The node does not keep track of its history nor is it aware of the environment state. In each stage, a randomly activated node proposes to form a new match and changes its aspiration based on the success or failure of its proposal. At this stage, the proposing node inquires about the aspiration of the node it wants to match with to calculate the feasibility of the match. The environment matching structure changes whenever a proposal succeeds. A state is absorbing for the distributed dynamics if and only if it is in the core of the B-matching.
|
|
TuB08 Regular Session, Amber 7 |
Add to My Program |
Optimal Control V |
|
|
Chair: Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Co-Chair: Li, Yan | Northwestern Polytechnical University |
|
13:30-13:50, Paper TuB08.1 | Add to My Program |
Robust Adaptive Control Using Nonlinear Quadratic Optimal Controllers |
|
Li, Yan | Northwestern Polytechnical University |
Wang, Zhong | Northwestern Polytechnical University |
Liu, Yuxuan | University of Science and Technology Beijing |
Lang, Jinxi | Northwestern Polytechnical University |
Liu, Kai | Military Research Institute |
Keywords: Optimal control, Adaptive control, Nonlinear systems
Abstract: Optimal control-based robust controllers have been extensively studied to mitigate the impacts of disturbances. However, a common limitation of existing methodologies is the incorporation of disturbance information, such as upper bounds, directly into the cost function to design the controller. This practice restricts the applicability of such controllers primarily to problems where disturbance bounds are specified and known a priori. Recognizing the limitation, this paper proposes a novel robust optimal adaptive controller for nonlinear uncertain systems which is formulated without requiring knowledge of the bounds of uncertainty. By employing online parameter estimation, the considered design problem is converted into a nonlinear quadratic optimal control problem. Then an indirect Legendre-Gauss pseudospectral method (ILGPM) is developed to enable efficient numerical implementation. Numerical simulations illustrate the effectiveness of the proposed controller.
|
|
13:50-14:10, Paper TuB08.2 | Add to My Program |
Control Signal Switching in Constrained Control of Averages |
|
Yamalova, Diana | Ericsson AB |
Wigren, Torbjorn | Uppsala University |
Keywords: Optimal control, Control applications
Abstract: Constrained control of the average of outputs of nonlinear systems is sometimes needed to meet regulatory requirements, e.g., to guarantee radio frequency electromagnetic field levels from wireless transmitters, or to guarantee ammonium levels from wastewater treatment plants. Switching control signals have become a concern in some of these applications. To analyze the reason for this, the paper first formulates a general constrained optimal control problem that embeds the average control problem as a special case. The solution to the problem is then outlined using Pontryagin’s maximum principle. The optimal control is computed analytically using logarithmic barrier functions for the control signal constraints. Simulation of the optimizing Euler–Lagrange equations is then used to characterize the constrained average control and output signals over time, which shows that switching occurs due the slow dynamics and the constraints. The paper also contributes by proving that the computed control is optimal, since the second derivative of the Hamiltonian is positive.
|
|
14:10-14:30, Paper TuB08.3 | Add to My Program |
Practical Implementation and Experimental Validation of an Optimal Control Based Eco-Driving System |
|
Lakshmanan, Vinith Kumar | IFPEN |
Sciarretta, Antonio | IFP Energies Nouvelles |
Lemaire, Olivier | IFP Energies Nouvelles |
Keywords: Optimal control, Automotive systems, Autonomous vehicles
Abstract: The main goal of Eco-Driving (ED) is to maximize for energy efficiency. This study evaluates the energy gains of an ED system for an electric vehicle, obtained from a predictive optimal controller, in a real-world traffic scenario. To this end, a Visual driver Advisory System (VAS) in the form of a personal tablet is used to advise the driver to follow a target eco-speed via a screen. Two Renault Zoe electric cars, one equipped with the different modules for ED and one without, are used to perform field tests on a route between Rueil-Malmaison and Bougival in France. Overall, the eco-driven consumed, on average, 4.6 % less energy than the non-eco-driven car with a maximum of 2 % change in average speed.
|
|
14:30-14:50, Paper TuB08.4 | Add to My Program |
Nonlinear Feedback Control Design Via NEOC |
|
Rai, Ayush | Purdue University |
Mou, Shaoshuai | Purdue University |
Anderson, Brian D.O. | Australian National University |
Keywords: Optimal control, Autonomous systems, Agents-based systems
Abstract: Quadratic performance indices associated with linear plants offer simplicity and lead to linear feedback control laws, but they may not adequately capture the complexity and flexibility required to address various practical control problems. One notable example is to improve, by using possibly nonlinear laws, on the trade-off between rise time and overshoot commonly observed in classical regulator problems with linear feedback control laws. To address these issues, non-quadratic terms can be introduced into the performance index, resulting in nonlinear control laws. In this study, we tackle the challenge of solving optimal control problems with non-quadratic performance indices using the closed-loop neighboring extremal optimal control (NEOC) approach and homotopy method. Building upon the foundation of the Linear Quadratic Regulator (LQR) framework, we introduce a parameter associated with the non-quadratic terms in the cost function, which is continuously adjusted from 0 to 1. We propose an iterative algorithm based on a closed-loop NEOC framework to handle each gradual adjustment. Additionally, we discuss and analyze the classical work of Bass and Webber, whose approach involves including additional non-quadratic terms in the performance index to render the resulting Hamilton-Jacobi equation analytically solvable. Our findings are supported by numerical examples.
|
|
14:50-15:10, Paper TuB08.5 | Add to My Program |
High-Value Target Escort and Defense Scenario with Differential Subgame Cost |
|
Judy, Rachael L | University of Cincinnati |
Fuchs, Zachariah E. | University of Cincinnati |
Casbeer, David W. | Air Force Research Laboratory |
Keywords: Optimal control, Autonomous systems, Game theory
Abstract: An Escort and Defense Scenario is presented in which a high-value Target maneuvers through a high-risk region while being escorted by a mobile, defensive agent. Along this trajectory, an Attacker may be launched from one of several possible launch locations. If a launch occurs, the three agents play out a Target-Attacker-Defender (TAD) differential game in which the Defender attempts to intercept the Attacker at maximal distance from the Target while the Attacker strives to maneuver as close as possible to the Target. Prior to launch, the Target and Defender strive to preposition themselves in advantageous positions to effectively respond to a potential threat while simultaneously moving towards a safety region. An augmented collocation-based direct optimal control method is developed to solve the escort problem by solving and utilizing the value of the TAD differential subgame at each collocation point while simultaneously optimizing the primary optimal control problem.
|
|
15:10-15:30, Paper TuB08.6 | Add to My Program |
Infinite-Horizon Linear Quadratic Path-Following |
|
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Optimal control, Autonomous vehicles, Stochastic optimal control
Abstract: This paper proposes a path-following policy for linear systems subject to stochastic disturbances. The problem is framed as that of choosing both the control inputs and the trajectory's speed to minimize an infinite-horizon expected quadratic cost taking into account the state and the input. The path is modeled by an exosystem. It is shown that certainty equivalence holds when the path is a straight line. The proposed path-following policy improves the cost of the optimal constant speed trajectory-tracking policy associated with the exosystem. This policy guarantees that the tracking error converges to zero in the absence of disturbances. Numerical examples highlight the advantages of the method with respect to trajectory-tracking.
|
|
TuB09 Regular Session, Amber 8 |
Add to My Program |
Optimization Algorithms II |
|
|
Chair: Okano, Kunihisa | Ritsumeikan University |
Co-Chair: Gasparri, Andrea | Roma Tre University |
|
13:30-13:50, Paper TuB09.1 | Add to My Program |
Fully Distributed EV Charging Scheduling for Load Flattening in V2G Systems |
|
Heo, Jinwook | Seoul National University |
Hyeon, Soojeong | Seoul National University, |
Shim, Hyungbo | Seoul National University |
Kim, Jinsung | Hyundai Motor Company |
Keywords: Optimization algorithms, Distributed control, Smart grid
Abstract: This paper presents a fully distributed algorithm for scheduling electric vehicle (EV) charging and discharging to flatten the total load of the grid, while considering constraints on grid transmission capacity. As a fully distributed solution, the proposed algorithm operates without the need for a central unit. Instead, each agent only communicates a single dual variable with its neighboring agents based on a communication graph, and thus no private information is shared. In particular, the algorithm does not rely on initial conditions, ensuring robustness in online changes of operational conditions. Simulation results verify the effectiveness of the proposed algorithm.
|
|
13:50-14:10, Paper TuB09.2 | Add to My Program |
A Distributed Algorithm for Large-Scale Multi-Agent MINLPs |
|
Dong, Sheng | Dalian University of Technology |
Xia, Weiguo | Dalian University of Technology |
Keywords: Optimization algorithms, Distributed control, Large-scale systems
Abstract: In this paper, we focus on the optimization of large-scale multi-agent systems, where agents collaboratively optimize the sum of local objective functions through their own continuous and/or discrete decision variables, subject to global coupling constraints and local constraints. The resulting Mixed-Integer Nonlinear Programmings (MINLPs) are NPhard, non-convex, and large-scale. Therefore, this paper aims to design distributed algorithms to find feasible suboptimal solutions with a guaranteed bound. To this end, considering dual decomposition as an effective method to decompose largescale constraint-coupled optimization problems, we first show, based on the convexification effects of large-scale MINLPs, that the primal solutions from the dual are near-optimal under certain conditions. This expands recent results in Mixed- Integer Linear Programmings (MILPs) to the nonlinear case but requires additional efforts on the proof. Utilizing this result to tighten the coupling constraints, we develop a fully distributed algorithm for the tightened problem, based on dual decomposition and consensus protocols. The algorithm is guaranteed to provide feasible solutions for the original MINLP. Moreover, asymptotic suboptimality bounds are established for the obtained solution. Finally, the efficacy of the method is verified through numerical simulations.
|
|
14:10-14:30, Paper TuB09.3 | Add to My Program |
Accelerated Alternating Direction Method of Multipliers Gradient Tracking for Distributed Optimization |
|
Sebastián, Eduardo | Universidad De Zaragoza |
Franceschelli, Mauro | University of Cagliari |
Gasparri, Andrea | Roma Tre University |
Montijano, Eduardo | Universidad De Zaragoza |
Sagues, Carlos | Universidad De Zaragoza |
Keywords: Optimization algorithms, Distributed control, Cooperative control
Abstract: This paper presents a novel accelerated distributed algorithm for unconstrained consensus optimization over static undirected networks. The proposed algorithm combines the benefits of acceleration from momentum, the robustness of the alternating direction method of multipliers, and the computational efficiency of gradient tracking to surpass existing state-of-the-art methods in convergence speed, while preserving their computational and communication cost. First, we prove that, by applying momentum on the average dynamic consensus protocol over the estimates and gradient, we can study the algorithm as an interconnection of two singularly perturbed systems: the outer system connects the consensus variables and the optimization variables, and the inner system connects the estimates of the optimum and the auxiliary optimization variables. Next, we prove that, by adding momentum to the auxiliary dynamics, our algorithm always achieves faster convergence than the achievable linear convergence rate for the non-accelerated alternating direction method of multipliers gradient tracking algorithm case. Through simulations, we numerically show that our accelerated algorithm surpasses the existing accelerated and non-accelerated distributed consensus first-order optimization protocols in convergence speed.
|
|
14:30-14:50, Paper TuB09.4 | Add to My Program |
Sensitivity Function-Based Identification of Solid Oxide Electrolyzer Parameters |
|
Yazbeck, Zaman | Laboratoire Ampère |
Bribiesca-Argomedo, Federico | INSA Lyon, Laboratoire Ampere |
Pham, Minh Tu | INSA De Lyon |
Morel, Bertrand | CEA-Liten |
Panda, Ronit | Genvia |
Dimitriou, Vincent | Genvia |
Keywords: Optimization algorithms, Electrochemical processes, Nonlinear systems
Abstract: This paper presents a framework for parameter estimation of Solid Oxide Electrolyzer Stack (SOES) model. The complexity of multi-physics in SOES models poses a unique challenge for parameter identification due to the presence of nonlinearities, the large number of parameters, and few available measurements. Consequently, this study presents an enhanced method of parameter estimation, based on the Gauss-Newton optimization algorithm, incorporating a truncated Singular Value Decompostion (SVD) of a normalized sensitivity matrix. This modification prioritizes the update of parameters in the directions of high sensitivity while limiting the condition number of the matrix inverted to choose the step size, thus attenuating the adverse effects of noise and model errors unavoidable in the estimation process. This departure from the conventional approaches, allows a more nuanced and effective identification strategy tailored to the intricacies of SOESs. The proposed method is validated using data from an experimental test bench and compared to other identification methods.
|
|
14:50-15:10, Paper TuB09.5 | Add to My Program |
Fast Generation of Feasible Trajectories in Direct Optimal Control |
|
Kiessling, David | KU Leuven |
Baumgärtner, Katrin | University of Freiburg |
Frey, Jonathan | University of Freiburg |
Decre, Wilm | KU Leuven |
Swevers, Jan | KU Leuven |
Diehl, Moritz | University of Freiburg |
Keywords: Optimization algorithms, Optimal control, Optimization
Abstract: This letter examines the question of finding feasible points to discrete-time optimal control problems. The optimization problem of finding a feasible trajectory is transcribed to an unconstrained optimal control problem. An efficient algorithm, called FP-DDP, is proposed that solves the resulting problem using Differential Dynamic Programming preserving feasibility with respect to the system dynamics in every iteration. Notably, FP-DDP admits global and rapid local convergence properties induced by a combination of a Levenberg-Marquardt method and an Armijo-type line search. An efficient implementation of FP-DDP within acados is compared to established methods such as Direct Multiple Shooting, Direct Single Shooting, and state-of-the-art solvers.
|
|
15:10-15:30, Paper TuB09.6 | Add to My Program |
Feedback Control Balancing Quadratic Performance and Input Sparsity |
|
Nishida, Shumpei | Ritsumeikan University |
Okano, Kunihisa | Ritsumeikan University |
Keywords: Optimization algorithms, Networked control systems, Linear systems
Abstract: This study focuses on the simultaneous minimiza- tion of the quadratic cost and the number of actuations in the control of discrete-time linear systems. The quadratic cost measures conventional transient performance, while penalizing the actuation frequency promotes sparse control and improves energy efficiency. The main challenge in this optimization arises from the intricate interplay between the two optimization variables; control inputs and actuation timings. To avoid this complexity, we introduce a computationally feasible approximation and divide the problem into two stages. The proposed control scheme consists of a feedback controller, whose gains are determined offline, and an online actuation scheduler that decides whether to apply the input at each time step. We also introduce two extensions to further refine the optimization process. Through numerical examples, we evaluate the effectiveness of our method and its extensions. We show that these methods allow to balance the transient performance and the sparsity of the control input.
|
|
TuB10 Invited Session, Brown 1 |
Add to My Program |
Advances in Stochastic Control II: Reinforcement Learning |
|
|
Chair: Yuksel, Serdar | Queen's University |
Co-Chair: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Organizer: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Organizer: Yuksel, Serdar | Queen's University |
|
13:30-13:50, Paper TuB10.1 | Add to My Program |
Reproducing Kernel Approach to Linear-Quadratic Mean Field Control Problems with Additive Noise (I) |
|
Aubin-Frankowski, Pierre-Cyril | TU Wien |
Bensoussan, Alain | UTD University of Texas at Dallas |
Keywords: Stochastic optimal control, Mean field games, Linear systems
Abstract: We show in this work how to develop a kernel approach to solve linear-quadratic mean field control problems. We use operator-valued kernels, which is consistent with the fact that we are dealing with an infinite dimensional control problem due to the mean-field term. But the stochastic aspect of the problem brings also a difficulty of a different nature. The kernel is defined over the time variable, and conversely to the deterministic case, information must be considered. Thus the kernel acts on random processes, even for ordinary stochastic control problems. This type of kernels has not appeared previously in the literature. Extensions, like partially observable systems or multiplicative noise, will be considered in the future.
|
|
13:50-14:10, Paper TuB10.2 | Add to My Program |
Constant Step-Size Stochastic Approximation with Delayed Updates (I) |
|
Mahajan, Aditya | McGill University |
Niculescu, Silviu-Iulian | University Paris-Saclay, CNRS, CentraleSupelec, Inria |
Vidyasagar, Mathukumalli | Indian Institute of Technology Hyderabad |
Keywords: Iterative learning control, Stochastic systems, Delay systems
Abstract: In this paper, we consider constant step-size stochastic approximation with delayed updates. For the non-delayed case, it is well known that under appropriate conditions, the discrete-time iterates of stochastic approximation track the trajectory of a continuous-time ordinary differential equation (ODE). For the delayed case, we show in this paper that, under appropriate conditions, the discrete-time iterates track the trajectory of a delay-differential equation (DDE) rather than an ODE. Thus, delayed updates lead to a qualitative change in the behavior of constant step-size stochastic approximation. We present multiple examples to illustrate the qualitative affect of delay and show that increasing the delay is generally destabilizing but, for some systems, it can be stabilizing as well.
|
|
14:10-14:30, Paper TuB10.3 | Add to My Program |
Reinforcement Learning Design for Quickest Change Detection (I) |
|
Cooper, Austin | University of Florida |
Meyn, Sean P. | Univ. of Florida |
Keywords: Reinforcement learning, Stochastic optimal control
Abstract: The field of quickest change detection (QCD) concerns design and analysis of algorithms to estimate in real time the time at which an important event takes place, and identify properties of the post-change behavior. It is shown in this paper that approaches based on reinforcement learning (RL) can be adapted based on any "surrogate information state" that is adapted to the observations. Hence we are left to choose both the surrogate information state process and the algorithm. For the former, it is argued that there are many choices available, based on a rich theory of asymptotic statistics for QCD. Two approaches to RL design are considered: (i) Stochastic gradient descent based on an actor-critic formulation. Theory is largely complete for this approach: the algorithm is unbiased, and will converge to a local minimum. However, it is shown that variance of stochastic gradients can be very large, necessitating the need for commensurately long run times. (ii) Q-learning algorithms based on a version of the projected Bellman equation. It is shown that the algorithm is stable, in the sense of bounded sample paths, and that a solution to the projected Bellman equation exists under mild conditions. Numerical experiments illustrate these findings, and provide a roadmap for algorithm design in more general settings.
|
|
14:30-14:50, Paper TuB10.4 | Add to My Program |
Optimality of Decentralized Symmetric Policies for Stochastic Teams with Mean-Field Information Sharing (I) |
|
Sanjari, Sina | Royal Military College |
Saldi, Naci | Bilkent University |
Yuksel, Serdar | Queen's University |
Keywords: Stochastic optimal control, Mean field games, Machine learning
Abstract: We study a class of stochastic exchangeable teams with a finite number of decision makers (DMs) and their mean-field limits with infinite DMs. We study teams in the finite population regime under the centralized information structure (IS) and teams in the infinite population setting under the decentralized mean-field information sharing. The paper makes the following main contributions: i) For finite population exchangeable teams, we show existence of an optimal policy that is exchangeable (permutation invariant) and Markovian, obtained via value iterations for an equivalent measure-valued controlled Markov decision problem (MDP); ii) We show that a sequence of exchangeable optimal policies for a finite population setting converges to a symmetric (identical), independent, and decentralized randomized policy for the infinite population problem, guaranteeing existence of an optimal policy that is symmetric, independent, and decentralized optimal policy for the infinite population problem. This certifies optimality of the limiting measure-valued MDP for the representative DM. We also provide a finite approximation model for this MDP; iii) We show that symmetric, independent, decentralized optimal policies are approximately optimal for the corresponding finite-population team with many DMs under the centralized IS; iv) We show that a finite model approximation is near optimal for the mean-field MDP of the representative DM.
|
|
14:50-15:10, Paper TuB10.5 | Add to My Program |
A Variational Approach to Sampling in Diffusion Processes (I) |
|
Raginsky, Maxim | University of Illinois at Urbana-Champaign |
Keywords: Stochastic optimal control, Stochastic systems, Markov processes
Abstract: We revisit the work of Mitter and Newton on an information-theoretic interpretation of Bayes' formula through the Gibbs variational principle. This formulation allowed them to pose nonlinear estimation for diffusion processes as a problem in stochastic optimal control, so that the posterior density of the signal given the observation path could be sampled by adding a drift to the signal process. We show that this control-theoretic approach to sampling provides a common mechanism underlying several distinct problems involving diffusion processes, specifically importance sampling using Feynman--Kac averages, time reversal, and Schrödinger bridges.
|
|
15:10-15:30, Paper TuB10.6 | Add to My Program |
Supervised Learning for Stochastic Optimal Control |
|
Kurtz, Vincent | California Institute of Technology |
Burdick, Joel W. | California Inst. of Tech |
Keywords: Machine learning, Stochastic optimal control, Robotics
Abstract: Supervised machine learning is powerful. In recent years, it has enabled massive breakthroughs in computer vision and natural language processing. But leveraging these advances for optimal control has proved difficult. Data is a key limiting factor. Without access to the optimal policy, value function, or demonstrations, how can we fit a policy? In this paper, we show how to automatically generate supervised learning data for a class of continuous-time nonlinear stochastic optimal control problems. In particular, applying the Feynman-Kac theorem to a linear reparameterization of the Hamilton-Jacobi-Bellman PDE allows us to sample the value function by simulating a stochastic process. Hardware accelerators like GPUs could rapidly generate a large amount of this training data. With this data in hand, stochastic optimal control becomes supervised learning.
|
|
TuB11 Regular Session, Brown 2 |
Add to My Program |
Data Driven Control V |
|
|
Chair: Yan, Yitao | University of New South Wales |
Co-Chair: Saoud, Adnane | University Mohammed VI Polytechnic |
|
13:30-13:50, Paper TuB11.1 | Add to My Program |
A Q-Learning Approach to Model-Free Infinite Horizon Control for Linear Time Delay Systems |
|
Benhmidouch, Zineb | Electrical Engineering Department, Mohammadia School of Engineer |
Belamfedel Alaoui, Sadek | University Mohammed VI Polytechnic |
Saoud, Adnane | University Mohammed VI Polytechnic |
Abbou, Ahmed | Mohammadia School of Engineers, Mohammed V University in Rabat |
Keywords: Data driven control, LMIs, Power systems
Abstract: In this paper, an online Q-learning algorithm is proposed to address the infinite-horizon guaranteed cost control problem for linear time delay systems with completely unknown dynamics. The developed approach leverages a Lyapunov-Krasovskii functional as the state value function and integrates guaranteed cost control principles. Specifically, based on Bessel-Legendre integral inequality, a Q-function tailored for handling guaranteed cost control in time delay systems is formulated. Furthermore, an integral reinforcement learning method based on an actor/critic approximator framework is used to dynamically estimate the Q-function parameters. Finally, the proposed approach is successfully applied to an interconnected power system.
|
|
13:50-14:10, Paper TuB11.2 | Add to My Program |
Robust Model Reference Gaussian Process Regression: Enhancing Adaptability through Domain Randomization |
|
Kim, Hyuntae | ASRI, Seoul National University |
Keywords: Data driven control, Nonlinear systems identification, Nonlinear output feedback
Abstract: Nonlinear data-driven control strategies, particularly Model Reference Gaussian Process Regression (MR-GPR), have been effective in designing controllers directly from system input/output data, bypassing the need for explicit system modeling. This approach is advantageous for complex nonlinear systems where traditional modeling methods may be inadequate. MR-GPR employs Gaussian Process Regression to provide a non-parametric control method, enhancing adaptability and performance. However, real-world applications present challenges due to variability in system parameters, such as ensuring robustness and consistent performance. To address these challenges, this paper proposes a robust MR-GPR controller incorporating domain randomization to improve adaptability to varying operational conditions. This extension aims to maintain stable performance across diverse settings, mitigating the impact of parameter changes on control efficacy.
|
|
14:10-14:30, Paper TuB11.3 | Add to My Program |
Model Reference Gaussian Process Regression: Data-Driven State Feedback Controller for Strongly Controllable Systems |
|
Kim, Hyuntae | ASRI, Seoul National University |
Keywords: Data driven control, Nonlinear systems identification, Nonlinear systems
Abstract: Data-driven control methods are gaining importance in control engineering, particularly for nonlinear systems where traditional models fall short. Many approaches rely on predefined libraries of functions, such as polynomials, to approximate system dynamics. These methods assume the selected functions can effectively represent the underlying system, but this assumption may not always hold, potentially impacting the robustness of control designs. The Model Reference Gaussian Process Regression (MR-GPR) controller was introduced to address these issues by using Gaussian Process Regression (GPR) with flexible kernels that adapt to observed data, eliminating the need for predefined function libraries. In this paper, we extend the MR-GPR controller to handle (n)-step controllable systems, allowing for more complex multi-step control tasks. We propose a state feedback control strategy based on real-time system data. Theoretical analysis confirms the stability of the closed-loop system under the MR-GPR controller, and numerical simulations validate its effectiveness.
|
|
14:30-14:50, Paper TuB11.4 | Add to My Program |
IMFlow: Inverse Modeling with Conditional Normalizing Flows for Data-Driven Model Predictive Control |
|
Zhang, Yi | University of Augsburg |
Mikelsons, Lars | University of Augsburg |
Keywords: Data driven control, Nonlinear systems identification, Predictive control for nonlinear systems
Abstract: Inverse modeling is the process uncovering the relationships from the system observations to its inputs. It is essential in various fields such as control, robotics, and signal processing. We propose an underline{i}nverse underline{m}odeling method using amortized variational inference based on conditional normalizing underline{flow}s (IMFlow). IMFlow is data-driven and can therefore be applied to black-box environments with limited observability and unknown complexity. Besides, the probabilistic modeling characteristics of conditional normalizing flows allow IMFlow to cope with unknown system uncertainties. We deploy IMFlow as a probabilistic model predictive controller, which estimates the control inputs as stochastic processes based on reference signals and system responses. In addition, we also adjust IMFlow to an online model-free reinforcement learning setting. We demonstrate our proposed method achieves the same accuracy in comparison to the standard model predictive control method using white-box models.
|
|
14:50-15:10, Paper TuB11.5 | Add to My Program |
Learning-Based Pareto Optimal Control of Large-Scale Systems with Unknown Slow Dynamics |
|
Tajik Hesarkuchak, Saeed | Virginia Tech |
Boker, Almuatazbellah | Virginia Tech |
Baddam, Vasanth Reddy | Virginia Tech |
Mili, Lamine | Virginia Tech |
Eldardiry, Hoda | Virginia Tech |
Keywords: Data driven control, Optimal control, Large-scale systems
Abstract: We develop a data-driven approach to Pareto optimal control of large-scale systems, where decision-makers know only their local dynamics. Using reinforcement learning, we design a control strategy that optimally balances multiple objectives. The proposed method guarantees the stability of all the system dynamics and scales well with the total dimension of the system. Experimental results demonstrate the effectiveness of our approach in managing multi-area power systems.
|
|
15:10-15:30, Paper TuB11.6 | Add to My Program |
An Approach to Data-Based Linear Quadratic Optimal Control |
|
Yan, Yitao | University of New South Wales |
Bao, Jie | The University of New South Wales |
Huang, Biao | Univ. of Alberta |
Keywords: Data driven control, Optimal control, Linear systems
Abstract: This paper presents a data-based approach to linear quadratic optimal control design. The system manipulated variable is assumed to have a zero mean uncertainty with a certain covariance, and the true system trajectory is measurable subject to measurement noise. The separation principle in the data-based context is investigated, which reveals that the original problem can be decomposed into an optimal quadratic control problem and an interval-wise trajectory estimation problem that can be designed separately. Algorithms are developed for both the finite and infinite horizon control problem, with the latter proven to be able to asymptotically stabilize the expected value of all trajectories in the controlled behavior. An illustrative example is provided to demonstrate the effectiveness of the proposed approach.
|
|
TuB12 Regular Session, Brown 3 |
Add to My Program |
Learning and Control |
|
|
Chair: Iannelli, Andrea | University of Stuttgart |
Co-Chair: Dogan, Kadriye Merve | Embry-Riddle Aeronautical University |
|
13:30-13:50, Paper TuB12.1 | Add to My Program |
On the Regret of Recursive Methods for Discrete-Time Adaptive Control with Matched Uncertainty |
|
Karapetyan, Aren | ETH Zürich |
Balta, Efe C. | Inspire AG |
Tsiamis, Anastasios | ETH Zurich |
Iannelli, Andrea | University of Stuttgart |
Lygeros, John | ETH Zurich |
Keywords: Learning, Adaptive control, Optimization algorithms
Abstract: Continuous-time adaptive controllers for systems with a matched uncertainty often comprise an online parameter estimator and a corresponding parameterized controller to cancel the uncertainty. However, such methods are often impossible to implement directly, as they depend on an unobserved estimation error. We consider the equivalent discrete-time setting with a causal information structure, and propose a novel, online proximal point method-based adaptive controller, that under a sufficient excitation (SE) condition is asymptotically stable and achieves finite regret, scaling only with the time required to fulfill the SE. We show the same also for the widely-used recursive least squares with exponential forgetting controller under a stronger persistence of excitation condition.
|
|
13:50-14:10, Paper TuB12.2 | Add to My Program |
Symbiotic Control of Uncertain Dynamical Systems |
|
Yucelen, Tansel | University of South Florida |
Sarsilmaz, Selahattin Burak | Utah State University |
Yildirim, Emre | University of South Florida |
Keywords: Learning, Adaptive systems
Abstract: In this paper, we consider both the fixed-gain control and adaptive learning architectures to suppress the effects of uncertainties. We note that fixed-gain control provides more predictable closed-loop system behavior, but it comes at the cost of knowing uncertainty bounds. On the other hand, adaptive learning removes the requirement of this knowledge at the expense of less predictable closed-loop system behavior compared to fixed-gain control. To this end, this paper presents a novel symbiotic control framework that integrates the advantages of both fixed-gain control and adaptive learning architectures. In particular, the proposed framework utilizes both control architectures to suppress the negative effects of uncertainties with more predictable closed-loop system behavior and without the knowledge of uncertainty bounds. Both parametric and nonparametric uncertainties are considered, where we use neural networks to approximate the unknown uncertainty basis for the latter case. Several illustrative numerical examples are provided to demonstrate the efficacy of the proposed approach.
|
|
14:10-14:30, Paper TuB12.3 | Add to My Program |
Policy Optimization Finds Nash Equilibrium in Regularized General-Sum LQ Games |
|
Zaman, Muhammad Aneeq uz | UIUC |
Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Bastopcu, Melih | University of Illinois Urbana Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Learning, Game theory, Stochastic systems
Abstract: In this paper, we investigate the impact of introducing relative entropy regularization on the Nash Equilibria (NE) of General-Sum N-agent games, revealing the fact that the NE of such games conform to linear Gaussian policies. Moreover, it delineates sufficient conditions, contingent upon the adequacy of entropy regularization, for the uniqueness of the NE within the game. As Policy Optimization serves as a foundational approach for Reinforcement Learning (RL) techniques aimed at finding the NE, in this work we prove the linear convergence of a policy optimization algorithm which (subject to the adequacy of entropy regularization) is capable of provably attaining the NE. Furthermore, in scenarios where the entropy regularization proves insufficient, we present a delta-augmentation technique, which facilitates the achievement of an epsilon-NE within the game.
|
|
14:30-14:50, Paper TuB12.4 | Add to My Program |
Online Residual Learning from Offline Experts for Pedestrian Tracking |
|
Vlachos, Anastasios | ETH Zurich |
Tsiamis, Anastasios | ETH Zurich |
Karapetyan, Aren | ETH Zürich |
Balta, Efe C. | Inspire AG |
Lygeros, John | ETH Zurich |
Keywords: Learning, Machine learning, Control applications
Abstract: In this paper, we consider the problem of predicting unknown targets from data. We propose Online Residual Learning (ORL), a method that combines online adaptation with offline-trained predictions. At a lower level, we employ multiple offline predictions generated before or at the beginning of the prediction horizon. We augment every offline prediction by learning their respective residual error concerning the true target state online, using the recursive least squares algorithm. At a higher level, we treat the augmented lower-level predictors as experts, adopting the Prediction with Expert Advice framework. We utilize an adaptive softmax weighting scheme to form an aggregate prediction and provide guarantees for ORL in terms of regret. We employ ORL to boost performance in the setting of online pedestrian trajectory prediction. Based on data from the Stanford Drone Dataset, we show that ORL can demonstrate best-of-both-worlds performance.
|
|
14:50-15:10, Paper TuB12.5 | Add to My Program |
Decision Boundary Estimation Using Reinforcement Learning for Complex Classification Problems |
|
Netter, Josh | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Walsh, Timothy | Sandia National Laboratories |
Ray, Jaideep | Sandia National Laboratories, Livermore, CA |
Keywords: Learning, Reinforcement learning, Machine learning
Abstract: In this paper, we propose a method for quickly training a binary support vector machine (SVM) classifier for recognizing valid input spaces in high-dimensional, highly constrained systems by using reinforcement learning to find inputs along the decision boundary of the classifier while minimizing the number of runs of a simulation representing the system. We find training points by first defining an optimization problem where the action space consists of points to test, and the reward is a function that searches for points that are close to violating the given constraints and are a sufficient distance from one another. After formulating this process, we use a Q-learning framework to find inputs that maximize the reward, and then use these inputs to train the classifier so the decision boundary is quickly well-defined. The efficacy of this approach is then shown in simulations.
|
|
15:10-15:30, Paper TuB12.6 | Add to My Program |
Gait Switching and Enhanced Stabilization of Walking Robots with Deep Learning-Based Reachability: A Case Study on Two-Link Walker |
|
Xia, Xingpeng | Tsinghua University |
Choi, Jason J. | University of California, Berkeley |
Agrawal, Ayush | University of California, Berkeley |
Sreenath, Koushil | University of California, Berkeley |
Tomlin, Claire J. | UC Berkeley |
Bansal, Somil | University of Southern California |
Keywords: Learning, Stability of hybrid systems, Robotics
Abstract: Learning-based approaches have recently shown notable success in legged locomotion. However, these approaches often lack accountability, necessitating empirical tests to determine their effectiveness. In this work, we are interested in designing a learning-based locomotion controller whose stability can be examined and guaranteed. This can be achieved by verifying regions of attraction (RoAs) of legged robots to their stable walking gaits. This is a non-trivial problem for legged robots due to their hybrid dynamics. Although previous work has shown the utility of Hamilton-Jacobi (HJ) reachability to solve this problem, its practicality was limited by its poor scalability. The core contribution of our work is the employment of a deep learning-based HJ reachability solution to the hybrid legged robot dynamics, which overcomes the previous work's limitation. With the learned reachability solution, first, we can estimate a library of RoAs for various gaits. Second, we can design a one-step predictive controller that effectively stabilizes to an individual gait within the verified RoA. Finally, we can devise a strategy that switches gaits, in response to external perturbations, whose feasibility is guided by the RoA analysis. We demonstrate our method in a two-link walker simulation, whose mathematical model is well established. Our method achieves improved stability than previous model-based methods, while ensuring transparency that was not present in the existing learning-based approaches.
|
|
TuB13 Invited Session, Suite 1 |
Add to My Program |
Estimation and Control of Distributed Parameter Systems V |
|
|
Chair: Hu, Weiwei | University of Georgia |
Co-Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
13:30-13:50, Paper TuB13.1 | Add to My Program |
Encirclement Control for PDE-Based Leader-Follower Multi-Agent Systems with Targets in a Sphere: Part 1 – Target Enclosing (I) |
|
Hasanzadeh, Milad | Texas Tech University |
Tang, Shuxia | Texas Tech University |
Keywords: Distributed parameter systems, Agents-based systems, Backstepping
Abstract: This paper introduces an innovative target-enclosing control designed for parabolic Partial Differential Equation (PDE)-based multi-agent systems, emphasizing multi-step control within the domain and boundaries consisting of 2 parts. Diverging from prior research, our study delves into a three-dimensional enclosing control and employs PDE control for the system. At part 1, the targets in focus are dynamic, constrained to movement within a three-dimensional neighborhood. Our approach involves overcoming multi-step challenges by utilizing backstepping control for boundary agents acting as leaders. The three consecutive challenges involve prompting agent movement to reach to targets, reforming their formation to establish an appropriate distance for surrounding all targets, and finally achieving a successful enclosing in a distributed manner. Upon a successful target-enclosing in part 1, which can be treated as a standalone result, the next steps are to keep encircling them in part 2. Stability analysis of the closed-loop system is conducted using the Lyapunov technique. Finally, we conduct simulation to evaluate the effectiveness of our proposed methodology.
|
|
13:50-14:10, Paper TuB13.2 | Add to My Program |
Galerkin Approximation for H-Infinity-Control of the Stable Parabolic System under Dirichlet Boundary Control (I) |
|
Guo, Bao-Zhu | Academia Sinica |
Tan, Zheng-Qiang | Academy of Mathematics and Systems Science, Academia Sinica |
Keywords: Distributed parameter systems, Linear systems, Robust control
Abstract: In this paper, we study the state feedback control for the H^infty disturbance-attenuation problem pertaining to stable parabolic systems. We elucidate that the formulation of state feedback control can be achieved by solving an operator algebraic Riccati equation. Furthermore, we harness the Galerkin approximation method, which yields a succession of finite-dimensional systems that act as surrogates for the original infinite-dimensional system. We rigorously prove that the solutions to the corresponding finite-dimensional algebraic Riccati equations converge in norm to the solution of the infinite-dimensional operator algebraic Riccati equation. Additionally, we demonstrate that the state feedback controls derived from these finite-dimensional algebraic Riccati equations are gamma-admissible for the original system, thereby underscoring the efficacy of our approximation methodology.
|
|
14:10-14:30, Paper TuB13.3 | Add to My Program |
Enhanced Battery State Estimation: Part 1 – Electrolyte Lithium-Ion Concentration Observer (I) |
|
Sepasiahooyi, Sara | Texas Tech |
Tang, Shuxia | Texas Tech University |
Keywords: Energy systems, Distributed parameter systems, Backstepping
Abstract: This paper introduces backstepping closed-loop observers designed to estimate the concentration of lithium ions in the electrolyte within the negative electrode, positive electrode, and separator. The battery model employed is the Single Particle Model with electrolyte (SPMe) dynamics. Partial Differential Equation (PDE) backstepping observers are proposed to estimate electrolyte lithium concentration in the negative and positive electrodes, followed by a separator observer based on these results. The electrolyte lithium concentration observers for the positive and negative electrodes require the concentration at the boundaries. As direct measurement of this concentration is challenging, an alternative measurable parameter will be identified for indirect calculation. While Part 1 of this study operates independently, Part 2 complements it by conducting reverse sensitivity analysis to determine the most appropriate measurable parameter. The convergence of the estimation error for the designed observers is ensured through Lyapunov stability analysis. A simulation on a LiFePO4 battery cell under an Urban Dynamometer Driving Schedule (UDDS) current profile is included to confirm the theoretical aspects and the convergence of the state estimation error.
|
|
14:30-14:50, Paper TuB13.4 | Add to My Program |
Finite-Dimensional Boundary Control of 2D Linear Parabolic Stochastic PDEs under Boundary Measurement (I) |
|
Wang, Pengfei | Tel Aviv University |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Distributed parameter systems, Stability of linear systems, Stochastic systems
Abstract: This paper addresses finite-dimensional observer-based boundary control of 2D linear stochastic heat equation with multiplicative noise on a bounded connected set. We consider the Neumann actuation and boundary measurement. We design the controller with the shape functions in the form eigenfunctions corresponding to the unstable N_0 eigenvalues. We suggest an appropriate change of variables leading to homogeneous boundary conditions and employ N_0-dimensional dynamic extension with the corresponding proportional-integral controller. Both the observer and controller are designed on the basis of the first N (Ngeq N_0) modes. By suggesting a direct Lyapunov method and employing It^{o}'s formula, we provide mean-square L^2 exponential stability analysis of the full-order closed-loop system. We derive linear matrix inequality (LMI) conditions for finding controller/observer dimension, the controller and observer gains and the maximum admissible noise intensity. Numerical example demonstrates the efficiency of our method and shows that controller design from the first N modes allows larger noise intensity than the design from the first N_0 modes as studied in previous works on stochastic PDEs.
|
|
14:50-15:10, Paper TuB13.5 | Add to My Program |
Superposition Theorems for Input-To-Output Stability of Infinite Dimensional Systems |
|
Bachmann, Patrick | Julius-Maximilians-Universität Würzburg |
Dashkovskiy, Sergey | University of Würzburg |
Mironchenko, Andrii | University of Klagenfurt |
Keywords: Distributed parameter systems, Stability of nonlinear systems, Nonlinear systems
Abstract: We characterize input-to-output stability of a general class of both continuous-time and discrete-time infinite dimensional systems in terms of weaker stability properties. Our results generalize the corresponding criteria for ordinary differential equations achieved by Ingalls et al. [1] and those for infinite dimensional systems for which the output equals the state [2]. This way, we investigate the relation between several stability and attractivity properties for infinite dimensional systems with outputs by providing the according implications and giving counterexamples, respectively.
|
|
15:10-15:30, Paper TuB13.6 | Add to My Program |
Forwarding-Based Controller Design for a General Cascade of a Linear ODE and a Wave Equation |
|
Tsubakino, Daisuke | Nagoya University |
Krstic, Miroslav | University of California, San Diego |
Keywords: Backstepping, Distributed parameter systems, Lyapunov methods
Abstract: This paper proposes a method to design a stabilizing control law for a cascade of a possibly unstable linear ordinary differential equation (ODE) and a wave equation. The ODE has connection terms that depend on the state of the wave equation. The terms represent a large class of linear cascade structures. The control input appears in both equations. The proposed method for designing a control law exploits the concept of the forwarding approach. The wave equation is first stabilized using a boundary damping control law. Then, we seek a stable subspace under the damping control law. The ODE is converted into another ODE through a state transformation constructed based on the expression for the stable subspace. With the new state, the damping control law is augmented so that the ODE is stabilized. The stability of the closed-loop system and the inverse optimality of the obtained control law are theoretically proved.
|
|
TuB14 Regular Session, Suite 2 |
Add to My Program |
Estimation I |
|
|
Chair: Classens, Koen | Eindhoven University of Technology |
Co-Chair: Ushirobira, Rosane | Inria |
|
13:30-13:50, Paper TuB14.1 | Add to My Program |
Efficient Batch and Recursive Least Squares for Matrix Parameter Estimation |
|
Lai, Brian | University of Michigan, Ann Arbor |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Identification, Adaptive systems, Identification for control
Abstract: Traditionally, batch least squares (BLS) and recursive least squares (RLS) are used for identification of a vector of parameters which form a linear model. In some situations however, it is of interest to identify parameters in a matrix structure. In this case, a common approach is to transform the problem into standard vector form using the vectorization (vec) operator and the Kronecker product, known as vec-permutation. However, the use of the Kronecker product introduces extraneous zero terms in the regressor, resulting in unnecessary additional computational and space requirements. This work derives matrix BLS and RLS formulations which, under mild assumptions, minimize the same cost as the vec-permutation approach. This new approach requires less computational complexity and space complexity than vec-permutation in both BLS and RLS identification. It is also shown that persistent excitation guarantees convergence to the true matrix parameters. This method can used to improve computation time in the online identification of multiple-input, multiple-output systems for indirect adaptive model predictive control.
|
|
13:50-14:10, Paper TuB14.2 | Add to My Program |
Constant Parameter Identification: An Accelerated Heavy-Ball-Based Approach |
|
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Efimov, Denis | Inria |
Ushirobira, Rosane | Inria |
Keywords: Identification, Adaptive systems, Stability of nonlinear systems
Abstract: This paper contributes to designing a parameter identification algorithm for linear regression systems with constant unknown parameters. The proposed algorithm is based on an accelerated version of the heavy--ball method and uses a nonlinear version of Kreisselmeier's regressor extension. Moreover, it can identify constant parameters in a finite time under a persistent excitation condition. The local stability analysis is developed using a Lyapunov function approach. The applicability and effectiveness of the proposed parameter identification algorithm are illustrated through simulation results.
|
|
14:10-14:30, Paper TuB14.3 | Add to My Program |
On the Controllability Preservation of Koopman Bilinear Surrogate Model |
|
Choi, Joonwon | Purdue University |
Cho, Minhyun | Purdue University |
Park, Hyunsang | Purdue University |
Vijay, Vishnu | Purdue University |
Hwang, Inseok | Purdue University |
Keywords: Identification, Data driven control, Control applications
Abstract: In this paper, we analyze the controllability of the Koopman bilinear surrogate model of a controllable control affine system. The Koopman operator is a linear operator that can describe the evolution of an original (nonlinear) system by lifting the state using an observable. However, it has been proven that the lifted system may not necessarily be full-state controllable even if the original system is. Moreover, the infinite-dimensional nature of the Koopman operator means that a finite-dimensional approximation is often required in practice and thus, one cannot simply guarantee the lifted system to preserve the same controllability property of the original system. Motivated by this, we investigate how the controllability property of the original system affects that of the lifted system. We specifically focus on control affine systems, where one can construct a Koopman bilinear surrogate model using the infinitesimal generator of the Koopman operator. We assume there exists an admissible controller that can drive the state of the original control affine system to a desired state. Then, we present the controllability property of the corresponding Koopman bilinear surrogate model, constructed by the data-driven infinitesimal generator using generator extended dynamic mode decomposition (gEDMD). A numerical simulation example using a quadrotor model is presented to demonstrate the proposed results.
|
|
14:30-14:50, Paper TuB14.4 | Add to My Program |
Robust Local Basis Function Algorithms for Identification of Time-Varying FIR Systems in Impulsive Noise Environments |
|
Niedzwiecki, Maciej | Gdansk University of Technology |
Gancza, Artur | Gdansk University of Technology |
Wylomanska, Agnieszka | Wroclaw University of Science and Technology |
Zulawinski, Wojciech | Wrocław University of Science and Technology |
Keywords: Identification, Estimation, Filtering
Abstract: While local basis function (LBF) estimation algorithms, commonly used for identifying/tracking systems with time-varying parameters, demonstrate good performance under the assumption of normally distributed measurement noise, the estimation results may significantly deviate from satisfactory when the noise distribution is of impulsive nature, for example, heavy-tailed or corrupted by outliers. This paper introduces a computationally efficient method to make LBF estimator robust, enhancing its resistance to impulsive noise. The study illustrates that, for polynomial basis functions, this modified LBF estimator can be computed recursively. Furthermore, it demonstrates that the proposed algorithm can undergo online tuning through parallel estimation and leave-one-out cross-validation.
|
|
14:50-15:10, Paper TuB14.5 | Add to My Program |
Statistical Analysis of Block Coordinate Descent Algorithms for Linear Continuous-Time System Identification |
|
González, Rodrigo A. | Eindhoven University of Technology |
Classens, Koen | Eindhoven University of Technology |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Welsh, James S. | University of Newcastle |
Oomen, Tom | Eindhoven University of Technology |
Keywords: Identification, Estimation, Linear systems
Abstract: Block coordinate descent is an optimization technique that is used for estimating multi-input single-output (MISO) continuous-time models, as well as single-input single output (SISO) models in additive form. Despite its widespread use in various optimization contexts, the statistical properties of block coordinate descent in continuous-time system identification have not been covered in the literature. The aim of this paper is to formally analyze the bias properties of the block coordinate descent approach for the identification of MISO and additive SISO systems. We characterize the asymptotic bias at each iteration, and provide sufficient conditions for the consistency of the estimator for each identification setting. The theoretical results are supported by simulation examples.
|
|
15:10-15:30, Paper TuB14.6 | Add to My Program |
A Nonlinear Hawkes Model Using Laguerre Orthogonal Polynomials |
|
Pasha, Syed Ahmed | Air University |
Solo, Victor | University of New South Wales |
Keywords: Identification, Estimation, Nonlinear systems identification
Abstract: Point process data arise in a number of application areas including neural coding, genomics, high-frequency finance, and more recently, streaming data. The Hawkes process is a flexible modeling approach for such data that exhibit self-exciting behavior. However, the Hawkes process does not accommodate inhibitory effects also observed in such data. In this paper, we present a nonlinear Hawkes model which accommodates inhibition while guaranteeing positivity of the point process intensity. The positivity preserving property allows a parsimonious representation of the Hawkes impulse response using Laguerre orthogonal polynomials. We develop a fast algorithm to perform maximum likelihood estimation and demonstrate the approach with a simulation and also on some neural data.
|
|
TuB15 Regular Session, Suite 3 |
Add to My Program |
Smart Grid |
|
|
Chair: Bianchini, Gianni | Università Di Siena |
Co-Chair: Sadabadi, Mahdieh S. | University of Manchester |
|
13:30-13:50, Paper TuB15.1 | Add to My Program |
Distributed Secondary Control for Battery Energy Storage Systems in AC Microgrids under Multiple Time-Varying Communication Delays |
|
Gholami, Milad | University of Siena |
Bianchini, Gianni | Università Di Siena |
Vicino, Antonio | Univ. Di Siena |
Keywords: Smart grid, Agents-based systems, Distributed control
Abstract: This paper addresses the challenge of designing a fully distributed secondary control strategy for heterogeneous battery energy storage systems in a microgrid with the objective of achieving consensus in frequency and active power, while preserving a balanced state of charge, subject to multiple timevarying communication delays. The problem is addressed in a multi-agent fashion where the local controllers of the distributed generators play the role of agents, and communication is affected by time-varying delays. The proposed approach exploits a combination of integral sliding mode control and a linear consensus protocol. Lyapunov analysis is presented to assess the stability properties of the closed loop. Delay-dependent stability conditions are expressed as a set of linear matrix inequalities whose solution yields appropriate control gains such that the control objectives are achieved despite multiple time-varying delays. The effectiveness of the proposed control strategy is assessed through simulations.
|
|
13:50-14:10, Paper TuB15.2 | Add to My Program |
Distributed Data-Driven Predictive Frequency Control for Networked Microgrids |
|
Wu, Jinhui | University College London |
Tao, Haochen | University College London |
Yang, Fuwen | Griffith University |
Boem, Francesca | University College London |
Keywords: Smart grid, Distributed control, Data driven control
Abstract: This paper proposes a novel distributed datadriven predictive control scheme to address the frequency control problem for Networked Microgrids (NMGs) in the presence of model uncertainty and disturbances. Firstly, the distributed data-based frequency model of NMGs is formulated according to input-output data. A suitable distributed datadriven controller is then proposed, and the convergence and stability of the system are analysed. A comparison with a modelbased predictive control and with a state-of-the-art data-driven control methods is finally presented in simulation, showing the effectiveness and the ability of the proposed method to deal with a coupled frequency control problem without requiring accurate models of the system.
|
|
14:10-14:30, Paper TuB15.3 | Add to My Program |
Neural Spectral Clustering Based Voltage Area Partition of Active Distribution Systems |
|
Wang, Shengyi | Temple University |
Du, Liang | Temple University |
Zhu, Lianren | SHANGHAI UNIVERSITY |
Li, Yan | The Pennsylvania State University |
Keywords: Smart grid, Power systems, Energy systems
Abstract: As distribution systems evolve to accommodate large-scale renewable energy sources, maintaining voltage stability becomes increasingly challenging. Network partitioning plays a pivotal role in voltage control tasks, especially in active distribution systems (ADSs). By partitioning the network into manageable small sub-networks, i.e., voltage area partition (VAP), fine-grained, decentralized, and coordinated voltage control can be realized, which prevents over-voltage or under-voltage issues and facilitates the integration and absorption of renewable energies. However, because of the weak ability to extract complicated voltage relationships, existing naive graph clustering VAP methods are likely to suffer a performance bottleneck in voltage cohesiveness for large-sized distribution networks. Therefore, this paper proposes a neural spectral clustering-based VAP method for ADSs. Specifically, a network partition problem is solved by clustering a neural spectral mapping of multi-phase voltage coupling features. Theoretical and experimental results show that the proposed method can partition the network with voltage cohesiveness higher than that of the standard spectral clustering method while bringing certain advantages in computational efficiency.
|
|
14:30-14:50, Paper TuB15.4 | Add to My Program |
Comparative Study of Aging-Aware Control Strategies for Grid-Connected Photovoltaic Battery Systems |
|
Zhang, Huang | Chalmers University of Technology |
Altaf, Faisal | Volvo Group |
Wik, Torsten | Chalmers University of Technology |
Keywords: Smart grid, Reinforcement learning
Abstract: Various strategies with different objectives have been proposed to control grid-connected photovoltaic (PV) battery systems where electric vehicle (EV) batteries can be used as stationary energy storage. As the first attempt to enable aging-aware decision-making under various uncertainties, an economically motivated stage cost function is proposed to account for both the grid and the battery degradation cost. Historical operational data and "fixed" forecasted electricity price data are utilized to improve the economic performance of an implicit (or time-varying) optimal policy. Simulation results show that an implicit optimal policy achieved better economic performance (i.e., lowest grid and battery degradation cost) with smaller fluctuation amplitudes than an explicit one. Thus, to improve the aging-aware decision-making under uncertainties for EV batteries further, the implicit optimal policy will be further developed with consideration of other forecasts.
|
|
14:50-15:10, Paper TuB15.5 | Add to My Program |
Optimizing Parameter Design with Frequency and Clock Drift Constraints in Microgrids |
|
Yu, Chang | Wuhan University of Science and Technology |
Lu, Xiaoqing | Hunan University |
Keywords: Smart grid
Abstract: Microgrids subjected to secondary cooperative control encounter significant challenges, including operational constraints and clock drifts, adversely affecting their stability and efficiency. This paper provides conditions that assure optimal microgrid performance in both transient and steadystate scenarios, focusing on the effects of clock drifts and fluctuations in load. Furthermore, we introduce a novel approach for designing secondary control parameters, specifically engineered to minimize steady-state discrepancies attributable to clock drifts while ensuring adherence to standards for transient operations. Comprehensive experimental validations corroborate the effectiveness of our proposed solutions.
|
|
15:10-15:30, Paper TuB15.6 | Add to My Program |
Retrofit Control of DC Microgrids: A Reliability-Oriented Control Approach (I) |
|
Sadabadi, Mahdieh S. | The University of Manchester |
Keywords: Power generation, Power electronics, Control system architecture
Abstract: This paper focuses on improving reliability in DC microgrids interconnected with power-electronics-interfaced distributed generation (DG) units. We propose a novel primary voltage control approach, based on projection-based operators, referred to as retrofit control, by which one can control DG units to stabilize their voltage in the presence of actuator uncertainties. The proposed control approach only requires the model of DG units for controller design and does not depend on other DG units' models. We derive decentralized conditions for the voltage stability of the closed-loop microgrids that are not dependent on line parameters, power loads, and interconnections among DG units. Theoretical results are backed up by simulation case studies in MATLAB/Simscape Electrical.
|
|
TuB16 Regular Session, Suite 4 |
Add to My Program |
Adaptive Control II |
|
|
Chair: Baldi, Simone | Southeast University |
Co-Chair: Grushkovskaya, Victoria | University of Klagenfurt |
|
13:30-13:50, Paper TuB16.1 | Add to My Program |
An Observer-Based Extremum Seeking Controller Design for a Class of Second-Order Nonlinear Systems |
|
Mousavi, Seyed Mohammadmoein | Queen's University |
Guay, Martin | Queen's University |
Keywords: Adaptive control, Nonlinear output feedback, Robust adaptive control
Abstract: In this paper, an extremum seeking controller (ESC) is designed for stabilization and output minimization for a class of second-order control-affine nonlinear systems. The main difficulty with such design lays in the fact that the relative degree between the measured output and the system's input is two. As a result, the classical ESC approaches which use a high-pass filter for differentiation, are not suitable. We propose a perturbation-based controller in the feedback loop that utilizes a high-gain like observer with bounded derivatives of first and second-order. The closed-loop system is shown to be practically stable while maintaining the output in a small neighbourhood of its optimum value. Simulation results demonstrate the effectiveness of the proposed approach.
|
|
13:50-14:10, Paper TuB16.2 | Add to My Program |
Step-Size Rules for Lie Bracket-Based Extremum Seeking with Asymptotic Convergence Guarantees |
|
Grushkovskaya, Victoria | University of Klagenfurt |
Ebenbauer, Christian | RWTH Aachen University |
Keywords: Adaptive control, Optimization algorithms
Abstract: In this paper, we derive a class of step-size rules (time-varying gains) for gradient-based extremum seeking algorithms that guarantee classical asymptotic convergence rather than practical convergence. The obtained step-size rule conditions are similar to the classical step-size rules known for stochastic approximation theory.
|
|
14:10-14:30, Paper TuB16.3 | Add to My Program |
An Integral Reinforcement Learning Approach for Temperature Regulation in a Reverse Flow Reactor |
|
Binid, Abdellaziz | Faculty of Sciences, Chouaib Doukkali University |
Aksikas, Ilyasse | Qatar University |
Mabrok, Mohamed | Australian College of Kuwait |
Meskin, Nader | Qatar University |
Keywords: Adaptive control, Reinforcement learning, Chemical process control
Abstract: This paper is centered on designing an adaptive optimal control framework for regulating the temperature in a catalytic flow reversal reactor (CFRR), employing an integral reinforcement learning (IRL) technique. Initially, a policy iteration (PI) algorithm is stated to dynamically learn the optimal solution to the related linear-quadratic control (LQC) problem in real-time. The convergence of the presented algorithm is investigated. Furthermore, an improved scheme is introduced to boost the practical execution of the IRL approach. Numerical simulations are executed to demonstrate the impact of the devised scheme.
|
|
14:30-14:50, Paper TuB16.4 | Add to My Program |
Adaptive Motion Planning for Safe VTOL Aircraft Landings on Vertically Moving Marine Platforms |
|
Kholosi, Hazem | Middle East Technical University, ESEN System Integration LLC |
Yayla, Metehan | Middle East Technical University |
Kutlu, Aykut | Esen System Integration, Ltd |
Soken, Halil Ersin | Middle East Technical University |
Keywords: Adaptive systems, Autonomous systems, Lyapunov methods
Abstract: This paper presents a methodology for enhancing the safety and reliability of Vertical Take-off and Landing (VTOL) aircraft operations on vertically moving marine platforms, such as ship decks and offshore oil rigs. Building upon prior work that introduced predictive scheduling algorithms for safe landing on such platforms, this study advances the field by integrating prediction-based motion planning with a real-time aircraft bandwidth estimator. Unlike previous approaches that required detailed pre-landing assessments of aircraft dynamics, this method simplifies the implementation process by eliminating the need for prior knowledge of aircraft vertical dynamics and sea-state conditions. The bandwidth estimator dynamically adjusts to environmental variations and operational uncertainties, offering a more adaptive and responsive solution. Furthermore, by anticipating future deck movements, the predictive motion planning component minimizes reliance on rapid, high-risk maneuvers, thereby enhancing safety margins during the critical landing phase. The efficacy of this enhanced approach is validated through comprehensive numerical examples, comparative analyses, and software-in-the-loop simulations using the X-Plane flight simulator. These results demonstrate a significant improvement in operational safety and adaptability compared to traditional methods and earlier scheduling algorithms, marking a substantial progression in the field of VTOL aircraft landings on moving platforms.
|
|
14:50-15:10, Paper TuB16.5 | Add to My Program |
Composite Learning Exponential Parameter Estimation for Discrete-Time Nonlinear Systems |
|
Wang, Qian | Sun Yat-Sen University |
Shi, Tian | Sun Yat-Sen University |
Nikiforov, Vladimir O. | ITMO University |
Pan, Yongping | Sun Yat-Sen University |
Keywords: Adaptive systems, Estimation, Identification
Abstract: As a significant part of system modeling and control, parameter identification of continuous and discrete-time systems has been extensively studied in the past decades. However, most existing parameter identifiers cannot guarantee exponential parameter convergence without a strict condition termed persistent excitation (PE). This paper presents a composite learning-based parameter estimator for discrete-time nonlinear systems with linear-in-the-parameters uncertainties. A generalized prediction error based on regressor extension with online data memory is incorporated into the normal prediction error to accelerate parameter estimation. The storage and forgetting of online data are determined by only active regressor channels, which removes the restriction that all regressor channels need to be activated simultaneously for parameter estimation. Exponential parameter convergence under the proposed estimator is achieved under an interval excitation (IE) or even partial IE condition that is strictly weaker than the PE condition. Simulation results have verified the effectiveness and superiority of the proposed estimator compared with state-of-the-art estimators.
|
|
15:10-15:30, Paper TuB16.6 | Add to My Program |
A Recursive Implementation of Sparse Regression |
|
Baldi, Simone | Southeast University |
Liu, Di | Imperial College London |
Keywords: Adaptive systems, Estimation, Statistical learning
Abstract: Sparse regression deals with the problem of representing a dataset using only a few non-zero basis elements. This work presents a recursive implementation of sparse regression, with the dataset being processed sequentially rather than as a batch. The algorithm, named sparse regularized fused recursive least squares (SP-RF-RLS), uses a re-weighting technique and a smooth approximation to deal with the discontinuous l0-norm and the non-differentiable l1-norm, standard norms for sparsity. Inspired by fused least absolute shrinkage and selection operator (fused-LASSO), the algorithm aims to capture structures in the locations of the non-zero elements by including a term depending on the difference between the estimated elements. Comparative experiments in both sparse and non-sparse scenarios show that SP-RF-RLS outperforms several state-of-the-art recursive algorithms.
|
|
TuB17 Regular Session, Suite 6 |
Add to My Program |
Robotics I |
|
|
Chair: Hatton, Ross | Oregon State University |
Co-Chair: Casini, Marco | Universita' Di Siena |
|
13:30-13:50, Paper TuB17.1 | Add to My Program |
A Family of Switching Pursuit Strategies for a Multi-Pursuer Single-Evader Game |
|
Casini, Marco | Universita' Di Siena |
Garulli, Andrea | Università Di Siena |
Keywords: Robotics, Autonomous systems, Game theory
Abstract: A new family of pursuit strategies is introduced for a multi-pursuer single-evader game. By exploiting the optimal solution of the game involving two pursuers, conditions are derived under which the multi-pursuer game becomes equivalent to the two-pursuer one. This opens the possibility of designing a number of pursuit strategies in which the pursuers first try to enforce the satisfaction of the aforementioned condition and then switch to a two-pursuer game as soon as it is verified. The contribution is useful in two ways. First, new winning pursuit strategies can be devised starting from simple plans, such as pure pursuit. Moreover, the performance of existing pursuit strategies, like those based on Voronoi partitions, can be significantly improved by resorting to the corresponding switching version.
|
|
13:50-14:10, Paper TuB17.2 | Add to My Program |
An Effective Two-Time Scale Composite Control Contraction Based Chaotic Trajectory Tracking of Two-Link Flexible Manipulator |
|
Lochan, Kshetrimayum | Khalifa University |
Khan, Asim | Khalifa University |
Krishna Roy, Binoy | National Institute of Technology Silchar |
Subudhi, Bidyadhar | NIT Rourkela |
Seneviratne, Lakmal | Khalifa University |
Hussain, Irfan | Khalifa University |
Keywords: Robotics, Control applications, Backstepping
Abstract: In this paper, a two-link flexible manipulator n-dimensional model is developed using the assumed mode method. Based on this model, the manipulator dynamics are segregated into two subsystems by the two-time scale decomposition method of singular perturbation. Subsequently, a contraction based control theory and a backstepping control of composite controller are investigated for the desired chaotic trajectory tracking along with tip deflection vibration suppression. In the two subsystems, the slow subsystem is involved in the modelling of the joint angles, and the fast subsystem is for corrected flexible modes of vibration suppression. In order to guarantee strict stability, Lyapunov’s stability is realized for closed-loop system uniform boundedness. Thus, by choosing the control parameters appropriately, the system states converge to a neighborhood of asymptotic stability. Eventually, extensive validation by comparative simulations of the Quanser model of the two-link flexible manipulator is carried out to demonstrate and indicate the effectiveness of the proposed composite controller in terms of faster tip deflection vibration suppression and better trajectory tracking.
|
|
14:10-14:30, Paper TuB17.3 | Add to My Program |
A Control Theoretic Study on Omnidirectional MAVs with Minimum Number of Actuators and No Internal Forces at Any Orientation |
|
Ali, Ahmed | University of Twente |
Gabellieri, Chiara | University of Twente |
Franchi, Antonio | University of Twente |
Keywords: Robotics, Feedback linearization, Modeling
Abstract: We propose a new multirotor aerial vehicle class of designs composed of a multi-body structure in which a main body is connected by passive joints to links equipped with propellers. We have investigated some instances of such class, some of which are shown to achieve omnidirectionality while having a minimum number of inputs equal to the main body Degrees of Freedom DoF’s, only uni-directional positive thrust propellers, and no internal forces generated at steady state. After dynamics are derived following the Euler-Lagrange approach, an I/O dynamic feedback linearization strategy is then used to show the controllability of any desired pose with stable zero dynamics. We finally verify the developed controller with closed-loop simulations.
|
|
14:30-14:50, Paper TuB17.4 | Add to My Program |
Patching Approximately Safe Value Functions Leveraging Local Hamilton-Jacobi Reachability Analysis |
|
Tonkens, Sander | University of California - San Diego |
Toofanian, Alex | University of California San Diego |
Qin, Zhizhen | UC San Diego |
Gao, Sicun | UCSD |
Herbert, Sylvia | UC San Diego (UCSD) |
Keywords: Robotics, Lyapunov methods, Machine learning
Abstract: Safe value functions, such as control barrier functions, characterize a safe set and synthesize a safety filter, overriding unsafe actions, for a dynamic system. While function approximators like neural networks can synthesize approximately safe value functions, they typically lack formal guarantees. In this paper, we propose a local dynamic programming-based approach to ``patch'' approximately safe value functions to obtain a safe value function. This algorithm, HJ-Patch, produces a novel value function that provides formal safety guarantees, yet retains the global structure of the initial value function. HJ-Patch modifies an approximately safe value function at states that are both (i) near the safety boundary and (ii) may violate safety. We iteratively update both this set of ``active'' states and the value function until convergence. This approach bridges the gap between value function approximation methods and formal safety through Hamilton-Jacobi (HJ) reachability, offering a framework for integrating various safety methods. We provide simulation results on analytic and learned examples, demonstrating HJ-Patch reduces the computational complexity by 2 orders of magnitude with respect to standard HJ reachability. Additionally, we demonstrate the perils of using approximately safe value functions directly and showcase improved safety using HJ-Patch.
|
|
14:50-15:10, Paper TuB17.5 | Add to My Program |
Optimal Control Approach for Gait Transition with Riemannian Splines |
|
Choi, Jinwoo | Oregon State University |
Hatton, Ross | Oregon State University |
Keywords: Robotics, Nonholonomic systems, Optimal control
Abstract: Robotic locomotion often relies on sequenced gaits to efficiently convert control input into desired motion. Despite extensive studies on gait optimization, achieving smooth and efficient gait transitions remains challenging. In this paper, we propose a general solver based on geometric optimal control methods, leveraging insights from previous works on gait efficiency. Building upon our previous work, we express the effort to execute the trajectory as distinct geometric objects, transforming the optimization problems into boundary value problems. To validate our approach, we generate gait transition trajectories for three-link swimmers across various fluid environments. This work provides insights into optimal trajectory geometries and mechanical considerations for robotic locomotion.
|
|
15:10-15:30, Paper TuB17.6 | Add to My Program |
Structural Properties and Control of Soft Robots Modeled As Discrete Cosserat Rods |
|
Ogunmolu, Olalekan | Microsoft Research |
Chen, Shaoru | Microsoft Corporation |
Keywords: Robotics, PID control, Nonlinear systems
Abstract: Soft robots featuring approximate finite-dimensional reduced-order models (undergoing small deformations) are increasingly becoming paramount in literature and applications. In this paper, we consider the piecewise constant strain (PCS) discrete Cosserat model whose dynamics admit the standard Newton-Euler dynamics for a kinetic model. Contrary to popular convention that soft robots under these modeling assumptions admit similar mechanical characteristics to rigid robots, the schemes employed to arrive at the properties for soft robots under finite deformation show a far dissimilarity to those for rigid robots. We set out to first correct the false premise behind this syllogism: from first principles, we established the structural properties of soft slender robots undergoing finite deformation under a discretized PCS assumption; we then utilized these properties to prove the stability of designed proportional-derivative controllers for manipulating the strain states of a prototypical soft robot under finite deformation. Our newly derived results are illustrated by numerical examples on a single arm of the Octopus robot and demonstrate the efficacy of our designed controller based on the derived kinetic properties. This work rectifies previously disseminated kinetic properties of discrete Cosserat-based soft robot models with greater accuracy in proofs and clarity.
|
|
TuB18 Regular Session, Suite 7 |
Add to My Program |
Nonlinear Systems II |
|
|
Chair: Tron, Roberto | Boston University |
Co-Chair: Ratchford, Jasmine | Software Engineering Institute - Carnegie Mellon University |
|
13:30-13:50, Paper TuB18.1 | Add to My Program |
Stable Dynamic Residual-Neural-Network-Based Estimator for Unknown Nonlinearities and Disturbances |
|
Yu, Pan | Beijing University of Technology |
Wu, Qiang | Beijing University of Technology |
Keywords: Nonlinear systems, Learning, Neural networks
Abstract: Using only the system output, a stable dynamic residual-neural-network-based (DRNN-based) estimator with excellent learning ability is devised to deal with unknown nonlinearities and disturbances of control systems. First, the unknown nonlinearities and disturbances are treated as a lumped disturbance, and an auxiliary variable is introduced to indicate the adverse impact on the system output. Then, to suppress this impact, a stable DRNN is organically integrated into a conventional equivalent-input-disturbance (EID) estimator to enhance the learning or estimation ability for the lumped disturbance. As for interpretability, the feed-forward neural network (NN) term can be viewed as a dynamic learning compensator, which is optimized by the backpropagation algorithm, and the residual term can be viewed as a performance-oriented adaptive learning gain. The stability of the DRNN-based estimator is guaranteed. Finally, comparisons with a conventional EID-based method show the developed learning method has an incomparable dynamic performance in a case study of a single-joint robot.
|
|
13:50-14:10, Paper TuB18.2 | Add to My Program |
Some Converse Lyapunov-Like Results for Strong Forward Invariance |
|
Goebel, Rafal | Loyola University Chicago |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Teel, Andrew R. | Univ. of California at Santa Barbara |
Keywords: Nonlinear systems, Lyapunov methods, Variational methods
Abstract: In the setting of a differential inclusion, strong forward invariance of a closed or a compact set is studied. Main results are novel necessary Lyapunov-like conditions for this property. They involve time-varying and autonomous Lyapunov/barrier functions that are smooth everywhere or at least outside the invariant set and are decreasing or at least not increasing faster than exponentially.
|
|
14:10-14:30, Paper TuB18.3 | Add to My Program |
Building Hybrid B-Spline and Neural Network Operators |
|
Romagnoli, Raffaele | Duquesne University |
Ratchford, Jasmine | Software Engineering Institute - Carnegie Mellon University |
Klein, Mark H. | Software Engineering Institute - Carnegie Mellon University |
Keywords: Nonlinear systems, Machine learning, Modeling
Abstract: Control systems are critical in ensuring the safety of cyber-physical systems (CPS) across domains like airplanes and missiles. Safeguarding CPS necessitates runtime methodologies that continuously monitor safety-critical conditions and respond in a verifiably safe manner. Many real-time safety approaches require predicting the future behavior of systems. However, achieving this requires accurate models that can operate in real time. Inspired by DeepONets, we propose a novel approach that combines B-splines’ inductive bias with data-driven neural networks (NNs). Our hybrid B-spline neural operator serves as a universal approximator, validated on a 6-DOF quadrotor.
|
|
14:30-14:50, Paper TuB18.4 | Add to My Program |
Algebraic Prescribed-Time KKL Observer for Continuous-Time Autonomous Nonlinear Systems |
|
Marani, Yasmine | King Abdullah University of Science and Technology |
N'Doye, Ibrahima | King Abdullah University of Science and Technology (KAUST) |
Laleg-Kirati, Taous-Meriem | National Institute for Research in Digital Science and Technolog |
Keywords: Nonlinear systems, Observers for nonlinear systems, Estimation
Abstract: Designing observers for nonlinear systems is challenging, especially when prescribed convergence is required. Such a convergence is crucial for some applications, such as tactical missile guidance, communication networks, and robot assembly lines. The nonlinear prescribed time observers reported in the literature focus on specific classes of nonlinear systems with mainly linear outputs and rely on a scaling function or a time-varying gain that goes to infinity as the time approaches the prescribed convergence time, rendering the observer highly sensitive to measurement noise. This paper proposes an algebraic prescribed time observer for a general class of nonlinear systems that does not require any scaling function or exploding gain. The observer design relies on the KKL (Kazantzis-Kravaris/Luenberger) transformation that writes the system in a linear form in another set of coordinates. Then modulating functions, combined with an integral operator are applied over a window specified by the desired convergence time to provide a closed-form solution of the state estimate at the prescribed time. Moreover, we study the performance of the proposed algebraic prescribed time observer to guarantee a disturbance attenuation level in the presence of measurement noise. The effectiveness of the proposed observer is evaluated in numerical simulations and its performance is further assessed in the presence of measurement noise.
|
|
14:50-15:10, Paper TuB18.5 | Add to My Program |
Navigating the Noise: A CBF Approach for Nonlinear Control with Integral Constraints |
|
Seidu, Idris | Boston University |
Tron, Roberto | Boston University |
Keywords: Nonlinear systems, Optimization, Control applications
Abstract: Many physical phenomena involving mobile agents involve time-varying scalar fields, e.g., quadrotors that emit noise. As a consequence, agents can influence and can be influenced by various environmental factors such as acoustic noise. This paper delves into the challenges of controlling such agents, focusing on scenarios where we would like to prevent excessive accumulation of some quantity over select regions and extended trajectories. We use quadrotors that produce acoustic noise as a primary example, to regulate the trajectory of such agents in the presence of obstacles. First, we consider constraints that are defined over accumulated quantities, i.e. functionals of the entire trajectory, as opposed to those that depend solely on the current state as in traditional Higher-order Control Barrier Functions (HOCBF). Second, we propose a method to extend constraints from individual points to lines and sets using efficient over-approximations. The efficacy of the implemented strategies is verified using simulations. Although we use quadrotors as an example, the same principles can equally apply to other scenarios, such as light emission microscopy or vehicle pollution dispersion.
|
|
15:10-15:30, Paper TuB18.6 | Add to My Program |
Optimization-Based Verification of Discrete-Time Control Barrier Functions: A Branch-And-Bound Approach |
|
Shakhesi, Erfan | Eindhoven University of Technology |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Katriniok, Alexander | Eindhoven University of Technology |
Keywords: Nonlinear systems, Optimization algorithms, Constrained control
Abstract: Discrete-time Control Barrier Functions (DTCBFs) form a powerful control theoretic tool to guarantee safety and synthesize safe controllers for discrete-time dynamical systems. In this paper, we provide an optimization-based algorithm, inspired by the αBB algorithm, for the verification of a candidate DTCBF, i.e., either verifying a given candidate function as a valid DTCBF or falsifying it by providing a counterexample for a general nonlinear discrete-time system with input constraints. This method is applicable whether a corresponding control policy is known or unknown. We apply our method to a numerical case study to illustrate its efficacy.
|
|
TuB19 Regular Session, Suite 8 |
Add to My Program |
Uncertain Systems I |
|
|
Chair: Mammarella, Martina | CNR-IEIIT |
Co-Chair: Biannic, Jean-Marc | ONERA |
|
13:30-13:50, Paper TuB19.1 | Add to My Program |
Tractable Approximations of LMI Robust Feasibility Sets |
|
Alamo, Teodoro | Universidad De Sevilla |
Mammarella, Martina | CNR-IEIIT |
Dabbene, Fabrizio | CNR-IEIIT |
Sznaier, Mario | Northeastern University |
Keywords: Uncertain systems, Robust control, LMIs
Abstract: In this letter, we introduce novel tractable approximations for robust Linear Matrix Inequality (LMI) problems. We present various Quadratic Matrix Inequalities (QMIs) that enable us to characterize the effect of ellipsoidal uncertainty in the robust problem. These formulations are expressed in terms of a set of auxiliary decision variables, which facilitate the derivation of a generalized S-procedure result. This generalization significantly reduces the conservatism of the obtained results, compared with conventional approaches.
|
|
13:50-14:10, Paper TuB19.2 | Add to My Program |
Pearson Coefficient Degradation in a Wasserstein/Gelbrich Ambiguity Set |
|
Borelle, Matthieu | University Paris-Saclay |
Alamo, Teodoro | Universidad De Sevilla |
Stoica, Cristina | CentraleSupélec/L2S, Univ. Paris-Saclay |
Bertrand, Sylvain | ONERA |
Camacho, Eduardo F. | Univ. of Sevilla |
Keywords: Uncertain systems
Abstract: This paper presents new results on the Gelbrich distance and the corresponding ambiguity sets, to analyze the correlation between two scalar random variables. A closed expression of the minimum disturbance in the Gelbrich metric necessary to achieve a specified correlation between two random variables is proposed. This expression allows us to analytically compute the minimal absolute correlation in a Gelbrich ball. This analysis facilitates the assessment of the robustness of the Pearson coefficient within an ambiguity set. Two numerical examples illustrate the validity of the proposed results.
|
|
14:10-14:30, Paper TuB19.3 | Add to My Program |
Robustifying Model Predictive Control of Uncertain Linear Systems with Chance Constraints |
|
Wang, Kai | Norwegian University of Science and Technology |
Hoang, Kiet Tuan | Norwegian University of Science and Technology |
Gros, Sebastien | NTNU |
Keywords: Uncertain systems, Predictive control for linear systems, Stochastic systems
Abstract: This paper proposes a model predictive controller for discrete-time linear systems with additive, possibly unbounded, stochastic disturbances and subject to chance constraints. By computing a polytopic probabilistic positively invariant set for constraint tightening with the help of the computation of the minimal robust positively invariant set, the chance constraints are guaranteed, assuming only the mean and covariance of the disturbance distribution are given. The resulting online optimization problem is a standard strictly quadratic programming, just like in conventional model predictive control with recursive feasibility and stability guarantees and is simple to implement. A numerical example is provided to illustrate the proposed method.
|
|
14:30-14:50, Paper TuB19.4 | Add to My Program |
Computation of Maximal Admissible Robust Positive Invariant Sets for Linear Systems with Parametric and Additive Uncertainties |
|
Dey, Anchita | Indian Institute of Technology Delhi |
Bhasin, Shubhendu | Indian Institute of Technology Delhi |
Keywords: Uncertain systems, Linear systems, Constrained control
Abstract: In this letter, we design an efficient algorithm for computing the maximal admissible robust positive invariant (MARPI) set for discrete-time linear time-varying systems with parametric uncertainty and additive disturbances. The system state and input are subject to hard constraints, and the uncertainty in the system parameters and the exogenous disturbance are assumed to belong to known bounded sets. The proposed design hinges on computation of backward reachable sets using suitably defined halfspaces that capture the effect of both the uncertainties, and their intersections with a state constraint polytope. Necessary and sufficient conditions for obtaining the MARPI set are provided along with detailed proofs. A numerical example is used to validate the theoretical development.
|
|
14:50-15:10, Paper TuB19.5 | Add to My Program |
LFT Modelling and Mu-Based Robust Performance Analysis of Hybrid Multi-Rate Control Systems |
|
Biannic, Jean-Marc | ONERA |
Roos, Clément | ONERA |
Cumer, Christelle | ONERA |
Keywords: Uncertain systems, Sampled-data control, Modeling
Abstract: This paper focuses on robust stability and Hinfinity performance analyses of hybrid continuous/discrete time linear multi-rate control systems in the presence of parametric uncertainties. These affect the continuous-time plant in a rational way which is then modeled as a Linear Fractional Transformation (LFT). Based on a zero-order-hold (ZOH) LFT discretization process at the cost of bounded quantifiable approximations, and then using LFT-preserving down-sampling operations, a single-rate discrete-time closed-loop LFT model is derived. Interestingly, for any step-inputs, and any admissible values of the uncertain parameters, the outputs of this model cover those of the hybrid multi-rate closed-loop system at every sampling time of the slowest control loop. Such an LFT model which also captures the discretization errors can then be used to evaluate both robust stability and guaranteed Hinfinity performance with a mu-based approach. The proposed methodology is illustrated on a realistic and easily reproducible example inspired by the validation of multi-rate attitude control systems.
|
|
15:10-15:30, Paper TuB19.6 | Add to My Program |
Guaranteed Lower and Upper Bounds on the Finite-Frequency H2 Norm of Uncertain Linear Systems |
|
Casati, Tommaso | ONERA |
Roos, Clément | ONERA |
Biannic, Jean-Marc | ONERA |
Evain, Hélène | CNES |
Keywords: Uncertain systems, LMIs, Robust control
Abstract: The H2 norm plays a key role in control applications when the input of the system is a random noise. It is therefore important to evaluate how this metric is affected by uncertainties in the model. Different approaches have been proposed in the literature to compute an upper bound on the worst-case H2 norm of an uncertain system. No method, however, is available to check if the H2 norm remains larger than a given threshold on an entire set of uncertainties. In this context, the present paper introduces a sufficient condition to compute a guaranteed lower bound on the H2 norm of an uncertain system based on Linear Matrix Inequalities (LMIs). G-scaling matrices are also introduced to improve the accuracy of the computed bounds in the presence of real parametric uncertainties. The theoretical results are eventually implemented and applied to a test example.
|
|
TuB20 Regular Session, Suite 9 |
Add to My Program |
Discrete Event Systems I |
|
|
Chair: Hadjicostis, Christoforos N. | University of Cyprus |
Co-Chair: Zamani, Majid | University of Colorado Boulder |
|
13:30-13:50, Paper TuB20.1 | Add to My Program |
Fault Diagnosis and Prognosis in Partially-Observed Discrete Event Systems with Delayed Observations |
|
Wang, Jiwei | University of Groningen |
Baldi, Simone | Southeast University |
Yu, Wenwu | Southeast University |
Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Automata, Fault diagnosis
Abstract: Fault diagnosis and prognosis in discrete event systems are studied in the scenario where the observations are possibly received with delay. To address this scenario, two conditions for diagnosis and prognosis with delayed observations are proposed, where we show that the state-of-the-art notion of prognosability must be revised to avoid conservativeness. Diagnosability and prognosability conditions are then verified by introducing a delay observer and a new verification function. Theoretical analysis indicates the effectiveness of the verification method for fault diagnosis and prognosis in the system.
|
|
13:50-14:10, Paper TuB20.2 | Add to My Program |
On the Existence of Simulation for Max-Plus Automata |
|
Daviaud, Bérangère | Université D'Angers |
Lahaye, Sébastien | Université D'Angers |
Lhommeau, Mehdi | Université D'Angers |
Komenda, Jan | Czech Academy of Sciences |
Keywords: Automata, Discrete event systems
Abstract: The comparison of behaviors for max-plus automata is a proven undecidable problem in the general case. The concept of weighted simulation has recently been defined and it provides a sufficient condition to compare behaviors of max-plus automata. This paper contributes to identifying the structural conditions for which a weighted simulation might exist. In particular, it provides verifiable sufficient conditions for the existence of a weighted simulation.
|
|
14:10-14:30, Paper TuB20.3 | Add to My Program |
Verification of Strong Detectability of Labeled Real-Time Automata — a Concurrent-Composition Method |
|
Zhang, Kuize | University of Surrey |
Giua, Alessandro | University of Cagliari |
Keywords: Automata, Discrete event systems
Abstract: Real-time automata are a widely-used class of real-time systems. In this paper, two strong versions of detectability are formulated for a labeled real-time automaton (LRTA) which means after some delay (the number of observed labels or real-time delay), along every generated infinite run, one can determine the current and subsequent states by observing the generated timed label sequence. By using the concurrent composition defined and computed in one of the authors’ previous papers, necessary and sufficient conditions for the negations of the two strong versions of detectability are given. It is also proven that the verification problems for the two definitions of strong detectability are both coNP-complete.
|
|
14:30-14:50, Paper TuB20.4 | Add to My Program |
Veriffcation of Approximate Prognosability Via Barrier Certiffcates |
|
Dong, Weijie | Shanghai Jiao Tong University |
Zhong, Bingzhuo | The Hong Kong University of Science and Technology (Guangzhou) |
Yin, Xiang | Shanghai Jiao Tong University |
Zamani, Majid | University of Colorado Boulder |
Keywords: Discrete event systems, Automata, Formal Verification/Synthesis
Abstract: In this paper, we investigate the verification of approximate prognosability for discrete-time control systems with continuous state set in the context of fault prognosis. Existing works on this topic rely on constructing finite abstractions, which lead to significant computation burden. To address this challenge, we propose an abstraction-free method via barrier certificates. Specifically, we consider a notion of so-called approximate (M,delta)-prognosability requiring that every fault, characterized by entering a fault region, can be predicted before its occurrence under observation precision delta and once an alarm is issued, fault will occur for sure within M time instants. To check this notion, we propose a verification scheme based on an M-deterministic finite automaton over an augmented system of the original system. Then, we reduce the verification of (the lack of) approximate (M,delta)-prognosability to a safety verification problem, which can be checked effectively by barrier certificates. Furthermore, a counter-example guided inductive synthesis framework is proposed to compute these barrier certificates.
|
|
14:50-15:10, Paper TuB20.5 | Add to My Program |
Optimal Control Synthesis of Markov Decision Processes for Efficiency with Surveillance Tasks |
|
Chen, Yu | Shanghai Jiao Tong University |
Yin, Xunyuan | Nanyang Technological University |
Ye, Hao | Tsinghua University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Automata, Markov processes
Abstract: We investigate the problem of optimal control synthesis for Markov Decision Processes (MDPs), addressing both qualitative and quantitative objectives. Specifically, we require the system to fulfill a qualitative surveillance task in the sense that a specific region of interest can be visited infinitely often with probability one. Furthermore, to quantify the performance of the system, we consider the concept of efficiency, which is defined as the ratio between rewards and costs. This measure is more general than the standard long- run average reward metric as it aims to maximize the reward obtained per unit cost. Our objective is to synthesize a control policy that ensures the surveillance task while maximizes the efficiency. We provide an effective approach to synthesize a stationary control policy achieving ε-optimality by integrating state classifications of MDPs and perturbation analysis in a novel manner. Our results generalize existing works on efficiency-optimal control synthesis for MDP by incorporating qualitative surveillance tasks.
|
|
15:10-15:30, Paper TuB20.6 | Add to My Program |
Enforcing Detectability in Discrete Event Systems Via Adaptive Control Sequences of Minimum Total Length |
|
Christou, Martha | University of Cyprus |
Hadjicostis, Christoforos N. | University of Cyprus |
Keywords: Discrete event systems, Automata, Supervisory control
Abstract: This paper studies the problem of eventually enforcing and maintaining exact knowledge of the current state of a given DES, modeled as a nondeterministic finite automaton with inputs and outputs. We assume that the system model is known and that we can choose the inputs to the system (i.e., the inputs are controllable and observable) but not the outputs; the latter are observable but not controllable as they depend on the unknown initial state and any nondeterministic actions taken by the system. Our goal is to devise an adaptive strategy that chooses the inputs to the system based on the outputs generated, in a way that allows us, after a minimal number of inputs, to (i) determine the exact current state of the system, and (ii) maintain knowledge of the exact state of the system for all its future behavior.
|
|
TuC01 Invited Session, Auditorium |
Add to My Program |
Contraction Theory in Systems and Control II |
|
|
Chair: Astolfi, Daniele | Cnrs - Lagepp |
Co-Chair: Bullo, Francesco | Univ of California at Santa Barbara |
Organizer: Giaccagli, Mattia | Cnrs - Ul |
Organizer: Russo, Giovanni | University of Salerno |
Organizer: Astolfi, Daniele | Cnrs - Lagepp |
Organizer: Bullo, Francesco | Univ of California at Santa Barbara |
|
16:00-16:20, Paper TuC01.1 | Add to My Program |
Bearing-Only Distance Estimator for Localization and Mapping (I) |
|
Marcantoni, Matteo | University of Groningen |
Langeveld, Emma | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Bunte, Kerstin | University of Groningen |
Keywords: Observers for nonlinear systems, Estimation, Autonomous vehicles
Abstract: Bearing-Only Simultaneous Localisation and Mapping (BO-SLAM) refers to the estimation problem where the goal is to map and localize oneself in an unknown environment when only relative bearing and local displacement measurements are available. This problem arises in several real-world scenarios, for instance when cameras are used as bearing sensors. When highly precise egomotion data can be obtained, BO-SLAM reduces to a BO-mapping task. In this paper, a novel observer design is proposed to solve the above problem in the 2D case. We use contraction analysis to obtain conservative bounds on the observer gain, followed by an analysis of convergence. Numerical simulations are further performed to evaluate the obtained theoretical results and the performance of the estimator in a more general multi-landmark scenario.
|
|
16:20-16:40, Paper TuC01.2 | Add to My Program |
Control Barrier Proximal Dynamics: A Contraction Theoretic Approach for Safety Verification |
|
Marvi, Zahra | University of Minnesota |
Bullo, Francesco | Univ of California at Santa Barbara |
Alleyne, Andrew G. | University of Minnesota |
Keywords: Constrained control, Optimal control, Lyapunov methods
Abstract: In this letter, we present a computationally-efficient barrier function-based contraction-theoretic approach for safety verification. We adopt a dynamical system approach towards Control Barrier Function (CBF)-based Quadratic Programming (QP). To mitigate the computational complexity of online solutions to time-varying convex optimization, we integrate tools from contraction theory and proximal primal-dual gradient dynamics (PDGD) to provide an arbitrarily close approximation of the optimal solution. Subsequently, we adopt this result for the CBF-based QP, offering a computationally-efficient and scalable safe control design termed Control Barrier Proximal Dynamics (CBPD). The contractivity of the CBPD is then leveraged to characterize the safety of the system. We demonstrate that adopting CBPD under a technical assumption guarantees the safety specifications of the system with a bounded violation margin, which can be made arbitrarily small. Additionally, a computational analysis depicts substantial improvements in efficiency and scalability compared to the state-of-the-art. Finally, we evaluate the effectiveness of the proposed method through the simulation of a battery management problem with electro-thermal constraints.
|
|
16:40-17:00, Paper TuC01.3 | Add to My Program |
Stabilizing Control Design by Integrable Differential Passive Outputs (I) |
|
Nazari Monfared, Morteza | University of Pavia, |
Kawano, Yu | Hiroshima University |
Cucuzzella, Michele | University of Groningen |
Keywords: Stability of nonlinear systems, Lyapunov methods, Algebraic/geometric methods
Abstract: Differential passivity of a nonlinear system has been introduced as passivity of its variational system. Applying standard passivity-based control techniques to differentially passive systems leads to controllers for the corresponding variational systems. However, it is challenging to construct controllers for the original nonlinear systems if the differential passive outputs are non-exact differential one-forms. In this letter, our objective is to provide a systematic procedure to address this issue when differential passive outputs are integrable, i.e., when non-exact differential passive outputs can be made exact by multiplying them by suitable integrating factors. In particular, under suitable detectability assumptions, we propose one static and two dynamic state feedback stabilizing controllers, where each dynamic controller has a form of input- and output-shaping, respectively. We illustrate their effectiveness by stabilization of counter-current heat exchangers.
|
|
17:00-17:20, Paper TuC01.4 | Add to My Program |
On the Relaxation Property of Nonlinear Circuit Elements (I) |
|
Sepulchre, Rodolphe | University of Cambridge |
Chaffey, Thomas | University of Cambridge |
Forni, Fulvio | University of Cambridge |
Huo, Yongkang | University of Cambridge |
Keywords: Modeling, Nonlinear systems, Emerging control applications
Abstract: The purpose of this paper is to explore a nonlinear generalization of the LTI theory of relaxation systems. LTI relaxation systems have the property that their Hankel operator is the gradient of a quadratic functional. We use this property as a defining property of nonlinear relaxation systems, generalizing the functional from quadratic to convex. Relaxation systems are shown to be special fading memory systems, characterized by strong positivity properties. It is suggested that relaxation systems and their duals define the elements of fading memory systems that admit a physical circuit representation.
|
|
17:20-17:40, Paper TuC01.5 | Add to My Program |
Control Contraction Metrics on Submanifolds (I) |
|
Wu, Dongjun | Lund University |
Yi, Bowen | Polytechnique Montreal, University of Montreal |
Manchester, Ian R. | University of Sydney |
Keywords: Nonlinear systems, Stability of nonlinear systems, Sampled-data control
Abstract: In this paper, we extend the control contraction metrics (CCM) approach, which was originally proposed for the universal tracking control of nonlinear systems, to those that evolves on submanifolds. We demonstrate that the search for CCM on submanifolds can be reformulated as convex conditions. In particular, since Lie groups can be viewed as submanifolds in Euclidean space, the results are directly applicable to this setting.
|
|
17:40-18:00, Paper TuC01.6 | Add to My Program |
Quadratic Abstractions for K-Contraction (I) |
|
Zoboli, Samuele | LAAS-CNRS, University of Toulouse III |
Cecilia, Andreu | Universitat Politècnica De Catalunya |
Tarbouriech, Sophie | LAAS-CNRS |
Keywords: LMIs, Nonlinear systems, Stability of nonlinear systems
Abstract: k-contraction is a generalization of the classical contraction property. It allows the study of complex behaviors in partially stable systems. However, existing conditions for k-contraction are often intractable. This work proposes efficiently solvable sufficient conditions for k-contraction verification in partially linear systems. Our findings are derived by exploiting particular quadratic abstractions arising from classical Lur’e systems analysis. We specialize our result to nonlinearities satisfying shifted monotonicity and differential sector-bound properties. We showcase the potential of our method by designing nonlinear controllers for linear systems, achieving complex closed-loop behaviors.
|
|
TuC02 Invited Session, Amber 1 |
Add to My Program |
Learning-Based Control II: Control Policy Learning |
|
|
Chair: Trimpe, Sebastian | RWTH Aachen University |
Co-Chair: Zeilinger, Melanie N. | ETH Zurich |
Organizer: Müller, Matthias A. | Leibniz University Hannover |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Schoellig, Angela P | Technical University of Munich & University of Toronto |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
16:00-16:20, Paper TuC02.1 | Add to My Program |
Early Stopping Bayesian Optimization for Controller Tuning (I) |
|
Stenger, David | RWTH Aachen University |
Scheurenberg, Dominik | RWTH Aachen University |
Vallery, Heike | ETH Zürich |
Trimpe, Sebastian | RWTH Aachen University |
Keywords: Optimization algorithms, Machine learning, Data driven control
Abstract: Manual tuning of performance-critical controller parameters can be tedious and sub-optimal. Bayesian Optimization (BO) is an increasingly popular practical alternative to automatically optimize controller parameters from few experiments. Standard BO practice is to evaluate the closed-loop performance of parameters proposed during optimization on an episode with a fixed length. However, fixed-length episodes can be wasteful. For example, continuing an episode where already the start shows undesirable behavior such as strong oscillations seems pointless. Therefore, we propose a BO method that stops an episode early if suboptimality becomes apparent before an episode is completed. Such early stopping results in partial observations of the controller’s performance, which cannot directly be included in standard BO. We propose three heuristics to facilitate partially observed episodes in BO. Through five numerical and one hardware experiment, we demonstrate that early stopping BO can substantially reduce the time needed for optimization.
|
|
16:20-16:40, Paper TuC02.2 | Add to My Program |
Markovian Foundations for Quasi-Stochastic Approximation in Two Timescales (I) |
|
Kalil Lauand, Caio | University of Florida |
Meyn, Sean P. | Univ. of Florida |
Keywords: Markov processes, Machine learning
Abstract: Many machine learning and optimization algorithms can be cast as instances of stochastic approximation (SA). The convergence rate of these algorithms is known to be slow, with the optimal mean squared error (MSE) of order O(n^{-1}). In prior work it was shown that MSE bounds approaching O(n^{-4}) can be achieved through the framework of quasi-stochastic approximation (QSA); essentially SA with careful choice of deterministic exploration. These results are extended to two time-scale algorithms, as found in policy gradient methods of reinforcement learning and extremum seeking control. The extensions are made possible in part by a new approach to analysis, grounded in the theory of Lyapunov exponents, allowing for the interpretation of two timescale algorithms as instances of single timescale QSA. The general theory is illustrated with applications to extremum seeking control.
|
|
16:40-17:00, Paper TuC02.3 | Add to My Program |
Coordinating Planning and Tracking in Layered Control Policies Via Actor-Critic Learning (I) |
|
Yang, Fengjun | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Keywords: Learning, Hierarchical control, Reinforcement learning
Abstract: We propose a reinforcement learning (RL)-based algorithm to jointly train (1) a trajectory planner and (2) a tracking controller in a layered control architecture. Our algorithm arises naturally from a rewrite of the underlying optimal control problem that lends itself to an actor-critic learning approach. By explicitly learning a textit{dual} network to coordinate the interaction between the planning and tracking layers, we demonstrate the ability to achieve an effective consensus between the two components, leading to an interpretable policy. We theoretically prove that our algorithm converges to the optimal dual network in the Linear Quadratic Regulator (LQR) setting and empirically validate its applicability to nonlinear systems through simulation experiments on a unicycle model.
|
|
17:00-17:20, Paper TuC02.4 | Add to My Program |
Meta-Learning of Data-Driven Controllers with Automatic Model Reference Tuning: Theory and Experimental Case Study (I) |
|
Busetto, Riccardo | Politecnico Di Milano |
Breschi, Valentina | Eindhoven University of Technology |
Baracchi, Federica | Politecnico Di Milano |
Formentin, Simone | Politecnico Di Milano |
Keywords: Data driven control, Learning, Electrical machine control
Abstract: Data-driven control offers a viable option for control scenarios where constructing a system model is expensive or time-consuming. Nonetheless, many of these algorithms are not entirely automated, often necessitating the adjustment of multiple hyperparameters through trial-and-error processes and demanding significant amounts of data. In this work, we explore a meta-learning approach to leverage prior knowledge about analogous (though not identical) systems, aiming to reduce both the experimental workload and ease the tuning of the available degrees of freedom. We validate this methodology through an experimental case study involving the tuning of proportional, integral (PI) controllers for brushless DC (BLDC) motors with variable loads and architectures.
|
|
17:20-17:40, Paper TuC02.5 | Add to My Program |
Optimal Distributed Control with Stability Guarantees by Training a Network of Neural Closed-Loop Maps (I) |
|
Saccani, Danilo | EPFL |
Massai, Leonardo | Politecnico Di Torino |
Furieri, Luca | EPFL |
Ferrari-Trecate, Giancarlo | Ecole Polytechnique Fédérale De Lausanne |
Keywords: Distributed control, Optimal control, Networked control systems
Abstract: This paper proposes a novel approach to improve the performance of distributed nonlinear control systems while preserving stability by leveraging Deep Neural Networks (DNNs). We build upon the Neural System Level Synthesis (Neur-SLS) framework and introduce a method to parameterize stabilizing control policies that are distributed across a network topology. A distinctive feature is that we iteratively minimize an arbitrary control cost function through an unconstrained optimization algorithm, all while preserving the stability of the overall network architecture by design. This is achieved through two key steps. First, we establish a method to parameterize interconnected Recurrent Equilibrium Networks (RENs) that guarantees a bounded L2 gain at the network level. This ensures stability. Second, we demonstrate how the information flow within the network is preserved, enabling a fully distributed implementation where each subsystem only communicates with its neighbors. To showcase the effectiveness of our approach, we present a simulation of a distributed formation control problem for a fleet of vehicles. The simulation demonstrates how the proposed neural controller enables the vehicles to maintain a desired formation while navigating obstacles and avoiding collisions, all while guaranteeing network stability.
|
|
17:40-18:00, Paper TuC02.6 | Add to My Program |
Lipschitz Safe Bayesian Optimization for Automotive Control |
|
Menn, Johanna | Institute for Data Science in Mechanical Engineering, RWTH Aache |
Pelizzari, Pietro | ZF Engineering Solutions, ZF Friedrichshafen AG |
Fleps-Dezasse, Michael | ZF Friedrichshafen AG |
Trimpe, Sebastian | RWTH Aachen University |
Keywords: Optimization algorithms, Autonomous vehicles, Data driven control
Abstract: Controller tuning is a labor-intensive process that requires human intervention and expert knowledge. Bayesian optimization has been applied successfully in different fields to automate this process. However, when tuning on hardware, such as in automotive applications, strict safety requirements often arise. To obtain safety guarantees, many existing safe Bayesian optimization methods rely on assumptions that are hard to verify in practice. This leads to the use of unjustified heuristics in many applications, which invalidates the theoretical safety guarantees. Furthermore, applications often require multiple safety constraints to be satisfied simultaneously. Building on recently proposed Lipschitz-only safe Bayesian optimization, we develop an algorithm that relies on readily interpretable assumptions and satisfies multiple safety constraints at the same time. We apply this algorithm to the problem of automatically tuning a trajectory-tracking controller of a self-driving car. Results both from simulations and an actual test vehicle underline the algorithm’s ability to learn tracking controllers without leaving the track or violating any other safety constraints.
|
|
TuC03 Invited Session, Amber 2 |
Add to My Program |
Event-Triggered and Self-Triggered Control |
|
|
Chair: Lopez, Victor G. | Leibniz University Hannover |
Co-Chair: Aspeel, Antoine | University of Michigan |
Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
Organizer: Nowzari, Cameron | George Mason University |
Organizer: Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Organizer: Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Organizer: Hirche, Sandra | Technische Universität München |
|
16:00-16:20, Paper TuC03.1 | Add to My Program |
On Lp-Gains in Noisy Event-Triggered Control (I) |
|
Scheres, Koen | Eindhoven University of Technology |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Keywords: Networked control systems, Hybrid systems, Stability of hybrid systems
Abstract: This paper investigates the design of event-triggered control strategies that guarantee Lp-stability properties when measurement noise is present. It is well known that in many event-based transmission schemes, the inclusion of measurement noise can lead to the so-called Zeno phenomenon, where an infinite number of transmissions occur in a finite amount of time, even when a minimum inter-event time is guaranteed in absence of noise. In this paper, we present a solution to the open problem of designing triggering rules, which ensure bounded Lp-gains from the exogenous inputs to a desired performance output in the closed-loop system, in the presence of measurement noise. We guarantee a global minimum inter-event time by design. Additionally, we show that suitable choices of the tuning parameters allow us to affect the "steady-state" inter-event times (when close to the attractor) by exploiting the design freedom in the parameter selection, which may result in improved behavior when close to the attractor. We showcase our results through a consensus example.
|
|
16:20-16:40, Paper TuC03.2 | Add to My Program |
Minimal L2-Consistent Data-Transmission (I) |
|
Aspeel, Antoine | University of Michigan |
Bako, Laurent | Ecole Centrale De Lyon |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Control over communications, Networked control systems, Communication networks
Abstract: In this work, we consider non-collocated sensors and actuators, and we address the problem of minimizing the number of sensor-to-actuator transmissions while ensuring that the L2 gain of the system remains under a threshold. By using causal factorization and system level synthesis, we reformulate this problem as a rank minimization problem over a convex set. When heuristics like nuclear norm minimization are used for rank minimization, the resulting matrix is only numerically low rank and must be truncated, which can lead to an infeasible solution. To address this issue, we introduce approximate causal factorization to control the factorization error and provide a bound on the degradation of the L2 gain in terms of the factorization error. The effectiveness of our method is demonstrated using a benchmark.
|
|
16:40-17:00, Paper TuC03.3 | Add to My Program |
Event-Triggered Moving Horizon Estimation for Nonlinear Systems (I) |
|
Krauss, Isabelle | Leibniz University Hannover |
Schiller, Julian D. | Leibniz University Hannover |
Lopez, Victor G. | Leibniz University Hannover |
Müller, Matthias A. | Leibniz University Hannover |
Keywords: Estimation, Observers for nonlinear systems
Abstract: This work proposes an event-triggered moving horizon estimation (ET-MHE) scheme for general nonlinear systems. The key components of the proposed scheme are a novel event-triggering mechanism (ETM) and the suitable design of the MHE cost function. The main characteristic of our method is that the MHE’s nonlinear optimization problem is only solved when the ETM triggers the transmission of measured data to the remote state estimator. If no event occurs, then the current state estimate results from an open-loop prediction using the system dynamics. Furthermore, we show robust global exponential stability of the ET-MHE under a suitable detectability condition. Finally, we illustrate the applicability of the proposed method in terms of a nonlinear benchmark example, where we achieved similar estimation performance compared to standard MHE using 86% less computational resources.
|
|
17:00-17:20, Paper TuC03.4 | Add to My Program |
Event-Triggered Parameterized Control of Nonlinear Systems |
|
Rajan, Anusree | Indian Institute of Science, Bangalore |
Tallapragada, Pavankumar | Indian Institute of Science |
Keywords: Control over communications, Networked control systems, Sampled-data control
Abstract: This paper deals with event-triggered parameterized control (ETPC) of nonlinear systems with external disturbances. In this control method, between two successive events, each control input to the plant is a linear combination of a set of linearly independent scalar functions. At each event, the controller updates the coefficients of the parameterized control input so as to minimize the error in approximating a continuous time control signal and communicates the same to the actuator. We design an event-triggering rule (ETR) that guarantees global uniform ultimate boundedness of trajectories of the closed loop system. We also ensure the absence of Zeno behavior by showing the existence of a uniform positive lower bound on the inter-event times (IETs). We illustrate our results through numerical examples.
|
|
17:20-17:40, Paper TuC03.5 | Add to My Program |
Event-Triggered Extended State Observer Based Platoon Control of Heterogeneous Vehicles Using Only Inter-Vehicle Distances (I) |
|
Liu, Anquan | University of Shanghai for Science and Technology |
Chen, Yan | East China Normal University |
Li, Tao | East China Normal University / New York University Shanghai |
Keywords: Cooperative control, Adaptive control, Autonomous vehicles
Abstract: In this paper, the platoon control of heterogeneous vehicles with input saturations, uncertain parameters and external perturbations is studied and a distributed control law that relies only on the inter-vehicle distance is proposed.Firstly, a model of state differences between neighbouring vehicles is given, where the saturation function of the control input is approximated by a differentiable function. Secondly, based on the state difference model, an event-triggered extended state observer (ESO) is presented, whose input is the inter-vehicle distance and whose output is the estimates of the state differences between neighbouring vehicles and unmodeled dynamics in the state difference model. Finally, an anti-saturation auxiliary system is introduced, by using which and the output of the ESO, a dynamic surface control-based distributed control law is proposed. The stability analysis of the vehicle platoon is performed and the design schemes of control parameters are given to ensure the stability of the closed-loop system and to avoid the Zeno behavior. The feasibility of our approach is demonstrated by some numerical simulations.
|
|
17:40-18:00, Paper TuC03.6 | Add to My Program |
Reward Drops in Learning-Based Control with an Experimental Validation on Microdrones (I) |
|
Tukenmez, Nejat | Georgia Institute of Technology |
Fotiadis, Filippos | The University of Texas at Austin |
Magalhaes Junior, Jose Messias | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Bogosyan, O. Seta | Istanbul Tech. Univ |
Keywords: Learning, Data driven control, Discrete event systems
Abstract: In this paper, we consider a computationally efficient learning-based control mechanism dealing with dense reward processing for a zero-sum game. The problem has been formulated as the online learning of the Nash equilibrium without requiring any information on the system dynamics. It has firstly been constructed as an infinite horizon optimal control problem, then as an online model-free Q-learning framework, which is composed of critic and actor networks (i.e., for the control and disturbance input). The closed-loop system is also proved to have a stable equilibrium point even in the presence of reward drops. The efficacy of the learning-based controller has been validated through simulations and experiments on micro drones.
|
|
TuC04 Invited Session, Amber 3 |
Add to My Program |
Cyber-Physical Systems: Resilience, Cybersecurity, and Privacy III:
Resilience in Multi-Agent Systems |
|
|
Chair: Soudjani, Sadegh | Newcastle University |
Co-Chair: Ferrari, Riccardo M.G. | Delft University of Technology |
Organizer: Selvi, Daniela | Università Di Pisa |
Organizer: Sadabadi, Mahdieh S. | The University of Manchester |
Organizer: Murguia, Carlos | Eindhoven University of Technology |
Organizer: Ferrari, Riccardo M.G. | Delft University of Technology |
Organizer: Soudjani, Sadegh | Max Planck Institute for Software Systems |
|
16:00-16:20, Paper TuC04.1 | Add to My Program |
Resilient Projection-Based Distributed Leader-Follower Consensus against Integrity Cyberattacks (I) |
|
Sadabadi, Mahdieh S. | University of Manchester |
Keywords: Cooperative control, Resilient Control Systems, Distributed control
Abstract: This paper focuses on the problem of distributed leader-follower consensus in multi-agent systems in which some agents are subject to adversarial attacks. We develop a resilient distributed leader-follower control strategy subject to integrity attacks, where agents' updates of their states can be compromised by injecting false signals to control inputs. Under such a threat model, we design a resilient distributed leader-follower framework for agents with continuous-time dynamics to resiliently track a reference state propagated by a leader. In the design of the resilient framework, projection-based operators are used as dynamic controllers to estimate the dynamics of uncertainties on the control inputs of each agent. By use of the properties of projection operators and Lyapunov stability theory, the uniform ultimate boundedness of the closed-loop multi-agent system in the presence of integrity attacks is guaranteed. The proposed resilient distributed scheme does not impose any limitations on the maximum tolerable number of cyberattacks and does not require high network connectivity. The effectiveness of the proposed resilient distributed consensus algorithm is verified by a numerical example.
|
|
16:20-16:40, Paper TuC04.2 | Add to My Program |
A Dynamic Coding Scheme for Preventing Controllable Cyber-Attacks in Cyber-Physical Systems (I) |
|
Khorasani, Khashayar | Concordia University |
Meskin, Nader | Qatar University |
Taheri, Mahdi | Concordia University |
Keywords: Cyber-Physical Security, Attack Detection, Resilient Control Systems
Abstract: Controllable attacks are considered as perfectly undetectable cyber-attacks that are performed by compromising input communication channels of cyber-physical systems (CPS). They are referred to as perfectly undetectable since they have zero impact on the sensor measurements of the system. In this paper, we investigate conditions under which adversaries are capable of performing controllable cyber-attacks and develop methods for designing these attack signals. Moreover, under certain assumptions, conditions for designing controllable attacks in terms of the Markov parameters of the CPS are derived. In order to analyze the vulnerability of the CPS to controllable attacks from the system operators’ point of view, a security metric designated as the security effort (SE) for controllable attacks is formally defined and proposed. The SE for controllable attacks denotes the minimum number of input communication channels that need to be secured to prevent adversaries from executing this type of cyber-attack. Consequently, as a countermeasure, we develop a coding scheme on the input communication channels that increases the minimum number of required input communication channels for performing controllable attacks to its maximum possible value. Consequently, in presence of the proposed coding scheme, adversaries need to compromise all the input communication channels to execute controllable attacks. Therefore, securing only one input channel prevents adversaries from performing controllable cyber-attacks. Finally, an illustrative numerical case study is provided to demonstrate the effectiveness and capabilities of our derived conditions and proposed methodologies.
|
|
16:40-17:00, Paper TuC04.3 | Add to My Program |
Robust Average Consensus under Byzantine Attacks (I) |
|
Yuan, Liwei | Hunan University |
Ishii, Hideaki | University of Tokyo |
Wang, Yaonan | Hunan University |
Keywords: Agents-based systems, Networked control systems, Cyber-Physical Security
Abstract: We study the problem of average consensus in multi-agent systems where some of the agents may malfunction. The object of robust average consensus is for non-faulty agents to converge to the average value of their initial values despite the erroneous effects from adversarial agents. To this end, we propose a surplus-based consensus algorithm that can achieve robust average consensus under Byzantine attacks in the multi-agent networks with directed topologies. The key idea is to equip each normal agent with a running-sum variable so that it can record the effects from/to neighbors across iterations. Moreover, compared to the existing secure broadcast and retrieval approach where each agent keeps track of the initial values of all agents in the network, our algorithm saves massive storage especially for large-scale networks as each agent only requires the values and the correct detection of neighbors. Finally, numerical examples are given for verifying the effectiveness of our algorithm.
|
|
17:00-17:20, Paper TuC04.4 | Add to My Program |
A Totally Asynchronous Nesterov's Accelerated Gradient Method for Convex Optimization (I) |
|
Pond, Ellie | Georgia Institute of Technology |
Sebok, April | University of Florida |
Bell, Zachary I. | Air Force |
Hale, Matthew | Georgia Institute of Technology |
Keywords: Optimization algorithms, Optimization
Abstract: We present a totally asynchronous algorithm for convex optimization that is based on a novel generalization of Nesterov's accelerated gradient method. This algorithm is developed for fast convergence under "total asynchrony" which allows arbitrarily long delays between agents' computations and communications. These conditions may arise, for example, due to jamming by adversaries. Our framework is block-based, in the sense that each agent is only responsible for computing and communicating updates to a small subset of the network-level decision variables. In our main result, we present bounds on the algorithm's parameters that guarantee linear convergence to an optimizer. Then, we quantify the relationship between (i) the total number of computations and communications executed by the agents and (ii) the agents' collective distance to an optimum. Numerical simulations show that this algorithm requires 28% fewer iterations than the heavy ball algorithm and 61% fewer iterations than gradient descent under total asynchrony.
|
|
17:20-17:40, Paper TuC04.5 | Add to My Program |
Cyber-Attack Detection and Isolation of Nonlinear Cyber-Physical Systems: An Auxiliary Filter Approach |
|
Kazemi, Hamed | Concordia University |
Khorasani, Khashayar | Concordia University |
Keywords: Cyber-Physical Security, Nonlinear systems, Flight control
Abstract: This paper introduces a novel framework designed to bolster security and develop cyber-attack detection and isolation of nonlinear cyber-physical systems (CPS), focusing specifically on discrete-time CPS vulnerable to actuator and sensor cyber-attacks. The framework is particularly applicable in scenarios involving data exchanges among controllers in the command-and-control (C&C) center and the plant. It operates under the assumption that adversaries can inject false data into communication networks. The paper presents design and development of auxiliary filters that are tailored for cyber-attacks detection and isolation, along with a complete stability analysis and demonstration of their efficacy through associated design algorithms. Simulation studies are conducted by using a high-fidelity Unmanned Aerial Vehicle (UAV) model, that highlight the framework’s effectiveness in detecting and isolating false data injection (FDI) and covert attacks.
|
|
17:40-18:00, Paper TuC04.6 | Add to My Program |
Optimal Linear Deception Attacks on Remote State Estimation with Constrained Alarm Rates: A Low Dimensional Case |
|
Shang, Jun | Tongji University |
Zhang, Hanwen | University of Science and Technology Beijing |
Zhou, Jing | University of Alberta |
Chen, Tongwen | University of Alberta |
Keywords: Cyber-Physical Security, Kalman filtering, Attack Detection
Abstract: This study addresses linear attacks on remote state estimation within the context of a constrained alarm rate. Smart sensors, which are equipped with local Kalman filters, transmit innovations instead of raw measurements through a wireless communication network. This transmission is vulnerable to malicious data interception and manipulation by attackers. The aim of this research is to identify the optimal attack strategy that degrades the system performance while adhering to stealthiness constraints. A notable innovation of this paper is the direct association of the attack's stealthiness with the alarm rate, diverging from traditional approaches that rely on the covariance of the innovation or the Kullback--Leibler divergence, which are conventional metrics that have been extensively explored in previous studies. Our findings reveal that the optimal attack strategy exhibits some structural characteristics in systems of low dimensions. The performance of the proposed attack strategy is demonstrated through numerical examples.
|
|
TuC05 Invited Session, Amber 4 |
Add to My Program |
Recent Advances in Theoretical Advances in Reinforcement Learning |
|
|
Chair: Mahajan, Aditya | McGill University |
Co-Chair: Niculescu, Silviu-Iulian | University Paris-Saclay, CNRS, CentraleSupelec |
Organizer: Mahajan, Aditya | McGill University |
Organizer: Niculescu, Silviu-Iulian | University Paris-Saclay, CNRS, CentraleSupelec, Inria |
Organizer: Vidyasagar, Mathukumalli | Indian Institute of Technology Hyderabad |
|
16:00-16:20, Paper TuC05.1 | Add to My Program |
Convergence of Monte Carlo Exploring Starts with TD-Learning (I) |
|
Winnicki, Anna | University of Illinois at Urbana Champaign |
Srikant, R | Univ of Illinois, Urbana-Champaign |
Keywords: Reinforcement learning, Markov processes, Stochastic optimal control
Abstract: The use of TD-learning has been widely employed in reinforcement learning algorithms due to its efficiency and practicality. Herein, we study the convergence of a variant of Monte Carlo Exploring Starts when TD(lambda) is used in policy evaluation and policy improvement, and lookahead is used in the policy improvement step. Our results provide a threshold for the amount of lookahead that ensures convergence of Monte Carlo Exploring Starts with TD(lambda) as a function of lambda in [0,1].
|
|
16:20-16:40, Paper TuC05.2 | Add to My Program |
Langevin DQN (I) |
|
Dwaracherla, Vikranth | Google |
Van Roy, Benjamin | Stanford University |
Keywords: Reinforcement learning, Stochastic systems, Neural networks
Abstract: Algorithms that tackle deep exploration -- an important challenge in reinforcement learning -- have relied on epistemic uncertainty representation through ensembles or other hypermodels, exploration bonuses, or visitation count distributions. An open question is whether deep exploration can be achieved by an incremental reinforcement learning algorithm that tracks a single point estimate, without additional complexity required to account for epistemic uncertainty. We answer this question in the affirmative. In particular, we develop Langevin DQN, a variation of DQN that differs only in perturbing parameter updates with Gaussian noise and demonstrate through a computational study that the presented algorithm achieves deep exploration.
|
|
16:40-17:00, Paper TuC05.3 | Add to My Program |
A Vector Almost-Supermartingale Convergence Theorem and Its Applications (I) |
|
Mahajan, Aditya | McGill University |
Niculescu, Silviu-Iulian | University Paris-Saclay, CNRS, CentraleSupelec, Inria |
Vidyasagar, Mathukumalli | Indian Institute of Technology Hyderabad |
Keywords: Reinforcement learning, Stochastic systems, Iterative learning control
Abstract: The almost-supermartingale convergence theorem of Robbins and Siegmund (1971) is a fundamental tool for establishing the convergence of various stochastic iterative algorithms including system identification, adaptive control, and reinforcement learning. The theorem is stated for non-negative scalar valued stochastic processes. In this paper, we generalize the theorem to non-negative vector valued stochastic processes and provide two set of sufficient conditions for such processes to converge almost surely. We present several applications of vector almost-supermartingale convergence theorem, including convergence of autoregressive supermartingales, delayed supermartingales, and stochastic approximation with delayed updates.
|
|
17:00-17:20, Paper TuC05.4 | Add to My Program |
Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players (I) |
|
Alatur, Pragnya | ETH Zurich |
Barakat, Anas | ETH Zurich |
He, Niao | ETH Zurich |
Keywords: Decentralized control, Game theory, Reinforcement learning
Abstract: Markov Potential Games (MPGs) form an important sub-class of Markov games, which are a common framework to model multi-agent reinforcement learning problems. In particular, MPGs include as a special case the identical-interest setting where all the agents share the same reward function. Scaling the performance of Nash equilibrium learning algorithms to a large number of agents is crucial for multi-agent systems. To address this important challenge, we focus on the independent learning setting where agents can only have access to their local information to update their own policy. In prior work on MPGs, the iteration complexity for obtaining epsilon-Nash regret scales linearly with the number of agents N. In this work, we investigate the iteration complexity of an independent policy mirror descent (PMD) algorithm for MPGs. We show that PMD with KL regularization, also known as natural policy gradient, enjoys a better sqrt{N} dependence on the number of agents, improving over PMD with Euclidean regularization and prior work. Furthermore, the iteration complexity is also independent of the sizes of the agents' action spaces.
|
|
17:20-17:40, Paper TuC05.5 | Add to My Program |
DASA: Delay-Adaptive Multi-Agent Stochastic Approximation |
|
Dal Fabbro, Nicolò | University of Pennsylvania |
Adibi, Arman | Princeton University |
Poor, H. Vincent | Princeton Univ |
Kulkarni, Sanjeev R. | Princeton University |
Mitra, Aritra | North Carolina State University |
Pappas, George J. | University of Pennsylvania |
Keywords: Reinforcement learning, Optimization algorithms, Markov processes
Abstract: We consider a setting in which N agents aim to speedup a common Stochastic Approximation (SA) problem by acting in parallel and communicating with a central server. We assume that the up-link agents' transmissions are subject to asynchronous and potentially unbounded time-varying delays. To mitigate the effect of delays and stragglers while reaping the benefits of distributed computation, we propose texttt{DASA}, a Delay-Adaptive algorithm for multi-agent Stochastic Approximation. We provide a finite-time analysis of texttt{DASA} assuming that the agents' stochastic observation processes are independent Markov chains. In sharp contrast with existing results, texttt{DASA} is the first algorithm whose convergence rate depends only on the mixing time tmix and on the average delay tau_{avg} while jointly achieving an N-fold convergence speedup under Markovian sampling. Our work is relevant for various SA applications, including for example multi-agent and distributed temporal difference (TD) learning, Q-learning and stochastic optimization with correlated data.
|
|
17:40-18:00, Paper TuC05.6 | Add to My Program |
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach |
|
Jeong, Narim | KAIST |
Lee, Donghwan | KAIST |
Keywords: Reinforcement learning, Switched systems
Abstract: Soft Q-learning is a variation of Q-learning designed to solve entropy regularized Markov decision problems where an agent aims to maximize the entropy regularized value function. Despite its empirical success, there have been limited theoretical studies of soft Q-learning to date. This paper aims to offer a novel and unified finite-time, control-theoretic analysis of soft Q-learning algorithms. We focus on two types of soft Q-learning algorithms: one utilizing the log-sum-exp operator and the other employing the Boltzmann operator. By using dynamical switching system models, we derive novel finite-time error bounds for both soft Q-learning algorithms. We hope that our analysis will deepen the current understanding of soft Q-learning by establishing connections with switching system models and may even pave the way for new frameworks in the finite-time analysis of other reinforcement learning algorithms.
|
|
TuC06 Regular Session, Amber 5 |
Add to My Program |
Network Analysis and Control III |
|
|
Chair: Margellos, Kostas | University of Oxford |
Co-Chair: Rizzo, Alessandro | Politecnico Di Torino |
|
16:00-16:20, Paper TuC06.1 | Add to My Program |
Complex Equilibria Patterns in Weakly-Coupled Competitive Bivirus Epidemic Networks |
|
Ye, Mengbin | Centre for Optimisation and Decision Science, Curtin University |
Anderson, Brian D.O. | Australian National University |
Keywords: Network analysis and control, Nonlinear systems, Large-scale systems
Abstract: In this paper, we study an epidemic spreading process using a bivirus Susceptible--Infected--Susceptible (SIS) network model, which examines two competing viruses spreading through a network of populations. We consider the scenario where two separate bivirus networks are weakly coupled to form a single system. We derive analytical results on how the equilibria pattern of the joined system, in particular the number of equilibria and their stability properties, is determined by the equilibria patterns of the two separate systems. In particular, we account for every possible pairing of one equilibrium from each of the two separate systems, and provide conditions governing when one can associate this pair with an equilibrium of the joined system, as well as the associated stability characteristics. A numerical example illustrates the complex patterns that can emerge from weak coupling of two bivirus networks.
|
|
16:20-16:40, Paper TuC06.2 | Add to My Program |
On a Susceptible-Infected-Susceptible Epidemic Model with Reactive Behavioral Response on Higher-Order Temporal Networks |
|
Zino, Lorenzo | Politecnico Di Torino |
Rizzo, Alessandro | Politecnico Di Torino |
Keywords: Network analysis and control
Abstract: We characterize the spread of epidemic diseases on higher-order temporal networks to shed light on the impact of large gatherings, where superspreading events occur and pairwise interactions are not sufficient to model the dynamics of infection. We propose a novel analytically-tractable continuous-time formalism for higher-order temporal networks based on the paradigm of activity-driven networks and we study a susceptible-infected-susceptible model spreading on top of it. By using a mean-field approach, we compute the epidemic threshold, characterizing a phase transition between a regime where the system converges to a disease-free equilibrium and one in which all trajectories converge to an endemic equilibrium. Using such a threshold, we quantify the role of higher-order interactions in favoring the spread of epidemic diseases, providing analytical support to restricting large gatherings during an epidemic outbreak. Finally, we incorporate a reactive behavioral response in the network formation process.
|
|
16:40-17:00, Paper TuC06.3 | Add to My Program |
Distributed Equilibrium Seeking in Aggregative Games: Linear Convergence under Singular Perturbations Lens |
|
Carnevale, Guido | University of Bologna |
Fabiani, Filippo | IMT School for Advanced Studies Lucca |
Fele, Filiberto | University of Seville |
Margellos, Kostas | University of Oxford |
Notarstefano, Giuseppe | University of Bologna |
Keywords: Network analysis and control, Smart grid, Game theory
Abstract: We present a fully-distributed algorithm for Nash equilibrium seeking in aggregative games over networks. The proposed scheme endows each agent with a gradient-based scheme equipped with a tracking mechanism to locally reconstruct the aggregative variable, which is not available to the agents. We show that our method falls into the framework of singularly perturbed systems, as it involves the interconnection between a fast subsystem -- the global information reconstruction dynamics -- with a slow one concerning the optimization of the local strategies. This perspective plays a key role in analyzing the scheme with a constant stepsize, and in proving its linear convergence to the Nash equilibrium in strongly monotone games with local constraints. By exploiting the flexibility of our aggregative variable definition (not necessarily the arithmetic average of the agents' strategy), we show the efficacy of our algorithm on a realistic voltage support case study for the smart grid.
|
|
17:00-17:20, Paper TuC06.4 | Add to My Program |
Network Learning with Directional Sign Patterns |
|
Dong, Anqi | University of California, Irvine |
Chen, Can | University of North Carolina at Chapel Hill |
Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Network analysis and control, Optimization algorithms, Biological systems
Abstract: Complex systems can be effectively modeled via graphs that encode networked interactions, where relations between entities or nodes are often quantified by signed edge weights, e.g., promotion/inhibition in gene regulatory networks, or encoding political of friendship differences in social networks. However, it is often the case that only an aggregate consequence of such edge weights that characterize relations may be directly observable, as in protein expression of in gene regulatory networks. Thus, learning edge weights poses a significant challenge that is further exacerbated for intricate and large-scale networks. In this article, we address a model problem to determine the strength of sign-indefinite relations that explain marginal distributions that constitute our data. To this end, we develop a paradigm akin to that of the Schrödinger bridge problem and an efficient Sinkhorn type algorithm (more properly, Schrödinger-Fortet-Sinkhorn algorithm) that allows fast convergence to parameters that minimize a relative entropy/likelihood criterion between the sought signed adjacency matrix and a prior. The formalism that we present represents a novel generalization of the earlier Schro ̈dinger formalism in that marginal computations may incorporate weights that model directionality in underlying relations, and further, that it can be extended to high-order networks – the Schrödinger-Fortet-Sinkhorn algorithm that we derive is applicable all the same and allows geometric convergence to a sought sign-indefinite adjacency matrix or tensor, for high-order networks.
|
|
17:20-17:40, Paper TuC06.5 | Add to My Program |
Learning in Memristive Electrical Circuits |
|
Heidema, Marieke | University of Groningen |
van Waarde, Henk J. | University of Groningen |
Besselink, Bart | University of Groningen |
Keywords: Nonlinear systems, Network analysis and control, Modeling
Abstract: Memristors are nonlinear two-terminal circuit elements whose resistance at a given time depends on past electrical stimuli. Recently, networks of memristors have received attention in neuromorphic computing since they can be used as a tool to perform linear algebraic operations, like matrix-vector multiplication, directly in hardware. In this paper, the aim is to resolve two fundamental questions pertaining to a specific, but relevant, class of memristive circuits called crossbar arrays. In particular, we show (1) how the resistance values of the memristors at a given time can be determined from external (voltage and current) measurements, and (2) how the resistances can be steered to desired values by applying suitable external voltages to the network. The results will be applied to solve a prototypical learning problem, namely linear least squares, by applying and measuring voltages and currents in a suitable memristive circuit.
|
|
17:40-18:00, Paper TuC06.6 | Add to My Program |
Solving the Convex Flow Problem |
|
Diamandis, Theo | MIT |
Angeris, Guillermo | Bain Capital |
Keywords: Optimization, Network analysis and control, Modeling
Abstract: In this paper, we introduce the solver ConvexFlows for the convex flow problem first defined in the authors' previous work. In this problem, we aim to optimize a concave utility function depending on the flows over a graph. However, unlike the classic network flows literature, we also allow for a concave relationship between the input and output flows of edges. This nonlinear gain describes many physical phenomena, including losses in power network transmission lines. We outline an efficient algorithm for solving this problem which parallelizes over the graph edges. We provide an open source implementation of this algorithm in the Julia programming language package ConvexFlows.jl. This package includes an interface to easily specify these flow problems. We conclude by walking through an example of solving the optimal power flow using ConvexFlows.
|
|
TuC07 Regular Session, Amber 6 |
Add to My Program |
Game Theory V |
|
|
Chair: Brown, Philip N. | University of Colorado Colorado Springs |
Co-Chair: Cherukuri, Ashish | University of Groningen |
|
16:00-16:20, Paper TuC07.1 | Add to My Program |
Pursuit-Evasion Game with Asymmetric Information |
|
Yang, Danjie | Zhejiang University of Technology |
Feng, Yu | Zhejiang University of Technology |
Li, Yongqiang | Zhejiang University of Technology |
Luo, Biao | Central South University |
Keywords: Game theory, Reinforcement learning
Abstract: This paper studies the problem of multiple pursuers and single evader with asymmetric information, where only the leader of pursuit group can measure the relative distance to the evader, while the latter has a global view. Due to the lack of information, the pursuers introduce an imaginary circle to estimate the position of the evader. A continuous stochastic pursuit game is established and the existence of a stationary Nash equilibrium is shown. With the information advantage, a full states Markov decision process (MDP) for the evader is then constructed and the existence of a pure stationary optimal strategy is demonstrated. Moreover, an algorithm based on fictitious self-play and reinforcement learning is presented to obtain stationary strategies. An experiment with quadruped robots is also included to show the effectiveness of the results.
|
|
16:20-16:40, Paper TuC07.2 | Add to My Program |
Bayesian Hypergame Approach to Equilibrium Stability and Robustness in Moving Target Defense |
|
Zhang, Hanzheng | Tongji University |
Cheng, Zhaoyang | Academy of Mathematics and System Science |
Chen, Guanpu | KTH Royal Institute of Technology |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Game theory, Uncertain systems
Abstract: We investigate the equilibrium stability and robustness in a class of moving target defense problems, where players have both incomplete information and asymmetric cognition. We first establish a Bayesian Stackelberg game model for the incomplete information, and then employ a hypergame reformulation to handle the asymmetric cognition. With the core concept of the hyper Bayesian Nash equilibrium (HBNE), a condition for equilibria to achieve both strategic and cognitive stability can be realized by solving linear equations. Moreover, to deal with players' underlying perturbed knowledge, we study the equilibrium robustness by presenting a condition of robust HBNE under the given configuration. Experiments evaluate our theoretical results.
|
|
16:40-17:00, Paper TuC07.3 | Add to My Program |
Do More Bad Choices Benefit Social Learning? |
|
Poojary, Pawan | Northwestern University |
Berry, Randall A. | Northwestern University |
Keywords: Game theory, Statistical learning, Markov processes
Abstract: Online markets can enable agents to learn from the actions of others. Such social learning can lead agents to eventually ``follow the crowd'' and ignore their own private information. This type of behavior has been well studied for agents faced with two possible actions - one ``good'' action and one ``bad'' action. In this paper, we consider a scenario where agents have more than two actions and only one of these is good. We show that sequential learning in such settings has substantially different properties compared to the binary action case and further show that increasing the number of ``bad'' choices from 1 to 2, improves the agents' learning. Whereas, if they are increased from 1 to more than 2, we find that learning can be improved if the private signals are sufficiently strong.
|
|
17:00-17:20, Paper TuC07.4 | Add to My Program |
Incentivizing Strategies in Dynamic Games Using Information Design |
|
Sun, Renyan | University of Southern California |
Nayyar, Ashutosh | University of Southern California |
Keywords: Game theory, Stochastic systems
Abstract: We consider a finite-horizon discrete-time dynamic system that is jointly controlled by two agents. There is a system designer that can influence agents' behavior by selectively revealing some information to the agents. Specifically, at each time step, the designer sends messages to the agents based on its private information. The agents use the received messages (and their own information) to choose their actions. The agents' actions influence the evolution of the underlying dynamic system and the costs incurred by the agents. We are interested in the setting where the designer would like to send messages in a way that incentivizes the two agents to play a specific pair of strategies. Under certain assumptions on the information structure of the designer and the agents, we provide an algorithm for finding a emph{messaging} strategy for the designer that incentivizes agents to play the desired strategies. Our algorithm requires solving a family of linear program feasibility problems in a backward inductive manner.
|
|
17:20-17:40, Paper TuC07.5 | Add to My Program |
Inferring the Prior Using Public Signalling in Bayesian Persuasion Routing Games |
|
Verbree, Jasper | University of Groningen |
Cherukuri, Ashish | University of Groningen |
Keywords: Game theory, Transportation networks, Uncertain systems
Abstract: This paper considers Bayesian persuasion for routing games where information about the uncertain state of the network is provided by a traffic information system (TIS) using public signals. In this setup, the TIS commits to a signalling scheme and participants form a posterior belief about the state of the network based on prior beliefs and received signal. They subsequently select routes minimizing their individual expected travel time under their posterior beliefs, giving rise to a Wardrop equilibrium. We investigate how the TIS can infer the prior held by the participants by designing suitable signalling schemes, and observing the equilibrium flows under different signals. We provide an iterative algorithm that finds such a scheme in a finite number of steps. Examples illustrate our results.
|
|
17:40-18:00, Paper TuC07.6 | Add to My Program |
Altruism Improves Congestion in Series-Parallel Nonatomic Congestion Games |
|
Hill, Colton | University of Colorado Colorado Springs |
Brown, Philip N. | University of Colorado Colorado Springs |
Keywords: Game theory, Transportation networks
Abstract: Self-interested routing policies from individual users in a system can collectively lead to poor aggregate congestion in routing networks. The introduction of altruistic agents (referred to as altruists), whose goal is to minimize other agents' routing time in addition to their own, can seemingly improve aggregate congestion. However, in some network routing problems, it is known that altruists can actually worsen congestion compared to that which would arise if all agents had simply behaved selfishly. This paper provides a thorough investigation into the necessary conditions for altruists to be guaranteed to improve total congestion. In particular, we study the class of series-parallel nonatomic congestion games, where one sub-population is selfish and the other is altruistic. We find that a game is guaranteed to have improved congestion in the presence of altruists (regardless of their population size) compared to if all agents route selfishly, provided the path set for the network is symmetric (all agents can access all paths), and the path set cannot exhibit Braess's paradox (a phenomenon we refer to as a Braess-resistant path set). Our results appear to be the most complete characterization of when behavior that is designed to improve total congestion (which we refer to as altruism) is guaranteed to do so.
|
|
TuC08 Regular Session, Amber 7 |
Add to My Program |
Optimal Control VI |
|
|
Chair: Schenato, Luca | University of Padova |
Co-Chair: Spinello, Davide | University of Ottawa |
|
16:00-16:20, Paper TuC08.1 | Add to My Program |
Distributed Optimization of Heterogeneous Agents by Adaptive Dynamic Programming |
|
Yang, Haizhou | Beijing Institute of Technology |
Xie, Kedi | Beijing Institute of Technology |
Yu, Xiao | Xiamen University |
Guan, Jinting | Xiamen University |
Lu, Maobin | Beijing Institute of Technology |
Deng, Fang | Beijing Institute of Technology |
Keywords: Optimal control, Distributed control, Data driven control
Abstract: In this paper, we study the distributed optimization problem of general linear multi-agent systems with heterogeneous dynamics under directed weight-unbalanced communication topologies. Compared with existing studies, we focus on the case when the dynamics of agents are unknown, which possesses higher application value. To tackle the issues brought by unknown system dynamics, the adaptive dynamic programming method is adopted to design the control law. The feedback gain in the control law and the system dynamics are derived from the input data, the state data, and the output data of the agents. Then, the remaining parameters in the control law are obtained by solving a series of matrix equations based on the identified system dynamics. Based on the certainty equivalence principle, the distributed optimization problem is solved in the sense that the outputs of all agents converge to the optimal solution of the global cost function. Finally, a simulation example concerning a group of resistor-inductorcapacitor (RLC) circuits is presented to verify the effectiveness of the proposed method.
|
|
16:20-16:40, Paper TuC08.2 | Add to My Program |
Humans-In-The-Building: Getting Rid of Thermostats in Comfort-Based Energy Management Control Systems |
|
Wang, Jiali | East China University of Science and Technology |
Tang, Yang | East China University of Science and Technology |
Schenato, Luca | University of Padova |
Keywords: Optimal control, Energy systems, Modeling
Abstract: The Energy Management Control Systems present in today's Building Automation Systems (BASs) typically adopt thermostats, potentially integrated with humidity, CO2, and occupancy sensors, to regulate internal temperature within a predefined range to optimize energy consumption. However direct feedback of occupants' personal comfort is rarely considered. Differently, this work proposes a new thermal control paradigm, referred to as "humans-in-the-buildings", where individuals can directly signal their discomfort to the energy management system which thus adjusts the HVAC control inputs accordingly. We provide a mathematical formulation for this novel comfort-based control and demonstrate that neither temperature sensors nor knowledge of the occupants' discomfort profiles is required, as each room's temperature is regulated based solely on the real-time discomfort signals from individuals. Finally, the effectiveness of the proposed comfort-based control is substantiated through simulations conducted on multiple adjacent rooms established building dynamics modeling.
|
|
16:40-17:00, Paper TuC08.3 | Add to My Program |
A Receding Horizon Optimal Control Method Using Neural Network HJB for Signal Model Estimation |
|
Zhu, Yancheng | Boston University |
Andersson, Sean B. | Boston University |
Keywords: Optimal control, Estimation, Information theory and control
Abstract: We consider the problem of localizing a set of nodes in a wireless sensor network when both their positions and the parameters of their communication model are unknown. We assume that a single mobile agent moves through the environment, taking measurements of the Received Signal Strength (RSS), and seek a controller that optimizes a performance metric based on the Fisher Information Matrix (FIM). Our approach involves two stages. In the first stage, we apply a discrete-time receding horizon controller that determines a sequence of positions to move to so as to maximize a metric based on the FIM. In the second stage, we formulate an optimal control problem to move the agent to the first position in that sequence, using a neural-network based controller to approximate the solution to the Hamilton-Jacobi-Bellman (HJB) equation and from that define a feedback control policy to execute that move. Measurements are collected along the way and after completing the move, the data are used to estimate the parameters, after which the process begins again. Through simulations we demonstrate that our approach outperforms a baseline based on random search, as well three other optimization approaches: a greedy approach, an approach that uses only the first stage, and a solution that moves the agent in a straight line to the next target location rather than along an optimal path.
|
|
17:00-17:20, Paper TuC08.4 | Add to My Program |
Further Results about Linfty/L1 Duality and Applications to the SIR Epidemiological Model |
|
Goreac, Dan | Université Laval |
Rapaport, Alain | INRAE & Univ. Montpellier |
Keywords: Optimal control, Healthcare and medical systems
Abstract: The Linfty/L1 duality in optimal control problems consists in studying how to link solutions minimizing the Linfty norm of an output function under an upper L1 constraint on an input function (primal problem), with solutions minimizing the L1 norm of the input function under an upper Linfty constraint on the output function (dual problem). In this work, we bring insights on recent results on Linfty/L1 duality in optimal control problems. In particular, we exhibit an example for which duality does not apply, and we revisit the application to the epidemiological SIR problem.
|
|
17:20-17:40, Paper TuC08.5 | Add to My Program |
Optimal Phase Control of Limit-Cycle Oscillators with Strong Inputs through Phase-Amplitude Reduction |
|
Namura, Norihisa | Tokyo Institute of Technology |
Nakao, Hiroya | Tokyo Institute of Technology |
Keywords: Optimal control, Model/Controller reduction
Abstract: We present a method for optimal phase control of limit-cycle oscillators using strong inputs. Based on the phase-amplitude reduction, which provides a concise representation of the oscillator dynamics, we design an optimal control input that quickly realizes the target phase while keeping the oscillator state close to the original limit cycle by penalizing the amplitude deviations. The derived scheme requires only a single one-dimensional phase equation even for the control of high-dimensional oscillators. We demonstrate the effectiveness of the proposed method by comparing the control performance with other methods, using the van der Pol oscillator and Willamowski-Rossler oscillator as examples.
|
|
17:40-18:00, Paper TuC08.6 | Add to My Program |
Formation Control of Multi-Agent Systems Via Voronoi Tessellation and Kullback-Leibler Divergence |
|
Zheng, Ruiming | University of Ottawa |
Spinello, Davide | University of Ottawa |
Keywords: Optimal control, Networked control systems, Lyapunov methods
Abstract: We present an algorithm to control the spatial distribution of kinematic multi-agent systems in two-dimensional workspace. Leveraging on the coverage control framework, we formulate the problem as a multi-objective optimization with a performance index composed of the area coverage metric and of the Kullback–Leibler (KL) divergence. The KL term drives the statistical spatial distribution of the agents to a desired, user-defined density in the workspace, whereas the coverage term drives the agents to a centroidal Voronoi configuration. The two terms are connected by setting the target distribution to be also the risk density in the area coverage term. The risk density for the coverage metric weights points in the area based on their relative importance. We prove that the proposed control law minimizes the multi-objective metric by driving the agents to a generalized centroidal Voronoi configuration along the trajectories generated by the gradient of the performance index, while minimizing the distance between the moments of the agents' distribution and of the target distribution. The proposed control allows to use the target distribution to drive the system's formation. Theoretical predictions are illustrated in simulation.
|
|
TuC09 Regular Session, Amber 8 |
Add to My Program |
Predictive Control Applications |
|
|
Chair: Ferramosca, Antonio | Univeristy of Bergamo |
Co-Chair: Lazar, Mircea | Eindhoven University of Technology |
|
16:00-16:20, Paper TuC09.1 | Add to My Program |
Frequency-Domain-Based Regularisation of Energy-Maximising Control for Wave Energy Conversion |
|
Papini, Guglielmo | Politecnico Di Torino |
Mattiazzo, Giuliana | Politecnico Di Torino |
Faedo, Nicolás | Politecnico Di Torino |
Keywords: Predictive control for linear systems, Optimal control, Power generation
Abstract: Maximising energy through optimal control strategies is crucial in developing ocean wave energy harvesting technologies. Among these strategies, direct optimal control techniques, including model predictive control, are widely employed for energy maximisation. The specific energy-based objective function might result in non-convex formulations according to the discretisation procedure employed, directly affecting real-time feasibility. To address this, modifications are commonly made to the overall objective function to ensure convexity. These modifications are virtually always based on numerical computation of a set of eigenvalues, and depend on the designer's ability to balance numerical accuracy with the objective of energy maximisation. In this study, leveraging the duality between time-domain and frequency-domain control formulations, a condition based on the spectral characteristics of the model, characterising the energy harvester, is proposed, to determine an accurate regularisation condition for the problem. The developed methodology is demonstrated with numerical examples.
|
|
16:20-16:40, Paper TuC09.2 | Add to My Program |
Robust Maneuver Planning with Scalable Prediction Horizons: A Move Blocking Approach |
|
Schitz, Philipp | German Aerospace Center (DLR) |
Dauer, Johann C. | DLR (German Aerospace Center) |
Mercorelli, Paolo | Leuphana University of Lüneburg |
Keywords: Predictive control for linear systems, Computational methods, Aerospace
Abstract: Implementation of Model Predictive Control (MPC) on hardware with limited computational resources remains a challenge. Especially for long-distance maneuvers that require small sampling times, the necessary horizon lengths prevent its application on onboard computers. In this paper, we propose a computationally efficient tube-based shrinking horizon MPC that is scalable to long prediction horizons. Using move blocking, we ensure that a given number of decision inputs is efficiently used throughout the maneuver. Next, a method to substantially reduce the number of constraints is introduced. The approach is demonstrated with a helicopter landing on an inclined platform using a prediction horizon of 300 steps. The constraint reduction decreases the computation time by an order of magnitude with a slight increase in trajectory cost.
|
|
16:40-17:00, Paper TuC09.3 | Add to My Program |
Artificial Pancreas under Stable Pulsatile Model Predictive Control: Including the Physical Activity Effect |
|
Licini, Nicola | University of Bergamo |
Sonzogni, Beatrice | University of Bergamo |
Abuin, Pablo | CONICET-INTEC |
Previdi, Fabio | Università Degli Studi Di Bergamo |
González, Alejandro H. | CONICET-Universidad Nacional Del Litoral |
Ferramosca, Antonio | Univeristy of Bergamo |
Keywords: Predictive control for linear systems, Metabolic systems, Biomedical
Abstract: This study presents the application of a pulsatile Zone Model Predictive Control (pZMPC) aimed at controlling blood glucose concentration in individuals affected by Type 1 Diabetes Mellitus (T1DM), considering physical activity effects, i.e. one of the main critical aspects of the disease. The glycemic control relies on a physiological individualized long-term model. Leveraging this model, the algorithm performs prediction and estimation of the patient’s future states, employing a disturbance observer to compensate for plant-model mismatches. The efficacy of this approach is evaluated in a cohort of in-silico patients from the FDA-approved UVA/Padova simulator.
|
|
17:00-17:20, Paper TuC09.4 | Add to My Program |
Model Predictive Control for Closed-Loop Deep Brain Stimulation |
|
Steffen, Sebastian | University of Oxford |
Cannon, Mark | University of Oxford |
Tan, Huiling | University of Oxford |
Debarros, Jean | University of Oxford |
Keywords: Predictive control for nonlinear systems, Biomedical, Identification for control
Abstract: This paper describes a model predictive control (MPC) algorithm for Deep Brain Stimulation (DBS) implants that are used to treat common movement disorders. DBS is currently used in clinical practice in open-loop with constant stimulation, which shortens the effective lifespan of the treatment and can lead to unpleasant side-effects. The goal of closed-loop control is to alleviate symptoms with minimal stimulation. The controller is based on a model of the amplitude of beta-band (13-30 Hz) oscillations of population-level neural activity at the site of the implant, which is a bio-marker related to the presence of symptoms of Parkinson's Disease. We present a two-stage approach in which a dynamic model for bio-marker activity is identified from data after applying a linearizing transformation, followed by a regulation stage using the identified model together with a model of response to stimulation based on average patient data. A Kalman filter is used to estimate the state of both the stimulation response and the nominal beta activity. The controller is compared to thresholded on/off (bang-bang) and proportional-integral (PI) feedback controllers, which are the most advanced form of control tested textit{in vivo} to date. Simulations demonstrate reductions in control input for similar levels of tracking error.
|
|
17:20-17:40, Paper TuC09.5 | Add to My Program |
Nonlinear Data-Driven Predictive Control Design for Water Distribution Networks |
|
Verheijen, Peter | Eindhoven University of Technology |
Goswami, Dip | Eindhoven University of Technology |
Lazar, Mircea | Eindhoven University of Technology |
Keywords: Predictive control for nonlinear systems, Data driven control, Control applications
Abstract: In this paper, we present a novel method for controlling Water Distribution Networks (WDNs) using Data-driven Predictive Control (DPC). First, we identify through physical first-principle knowledge that a standard linear predictor is insufficient. However, by mapping the control input as a nonlinear function to a measurable intermediate variable, we can obtain an accurate data-driven predictor. This furthermore allows us to retain the standard cost function and constraints employed for the control of WDNs. The proposed algorithm is implemented and simulated on a small example WDN. The resulting nonlinear data-driven predictive control algorithm performs well on the network, showing the expected response.
|
|
17:40-18:00, Paper TuC09.6 | Add to My Program |
Autoencoder-Based and Physically Motivated Koopman Lifted States for Wind Farm MPC: A Comparative Case Study |
|
Sharan, Bindu | Hamburg University of Technology |
Dittmer, Antje | German Aerospace Center |
Werner, Herbert | Hamburg University of Technology |
Xu, Yongyuan | Hamburg University of Technology |
Keywords: Power generation, Neural networks, Predictive control for nonlinear systems
Abstract: This paper explores the use of Autoencoder (AE) models to identify Koopman-based linear representations for designing model predictive control (MPC) for wind farms. Wake interactions in wind farms are challenging to model, and have previously been addressed with Koopman lifted states. In this study we investigate the performance of two AE models: The first AE model estimates the wind speeds acting on the turbines these are affected by changes in turbine control inputs. The wind speeds estimated by this AE model are then used in a second step to calculate the power output via a simple turbine model based on physical equations. The second AE model directly estimates the wind farm output, i.e., both turbine and wake dynamics are modelled. The primary inquiry of this study is whether either of these two AE-based models can surpass previously identified Koopman models based on physically motivated lifted states. We find that the first AE model, which estimates the wind speed and hence includes the wake dynamics, but excludes the turbine dynamics outperforms the existing physically motivated Koopman model. However, the second AE model, which estimates the farm power directly, underperforms when the turbines' underlying physical assumptions are correct. We also investigate specific conditions under which the second, purely data-driven AE model can excel: Notably, when modelling assumptions, such as the wind turbine power coefficient, are erroneous and remain unchecked within the MPC controller. In such cases, the data-driven AE models, when updated with recent data reflecting changed system dynamics, can outperform physics-based models operating under outdated assumptions.
|
|
TuC10 Invited Session, Brown 1 |
Add to My Program |
Advances in Stochastic Control III: Partial and Decentralized Information |
|
|
Chair: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Co-Chair: Yuksel, Serdar | Queen's University |
Organizer: Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Organizer: Yuksel, Serdar | Queen's University |
|
16:00-16:20, Paper TuC10.1 | Add to My Program |
Sufficient Conditions for Solving Statistical Filtering Problems by Dynamic Programming (I) |
|
Feinberg, Eugene A. | Stony Brook University |
Ishizawa, Sayaka | Stony Brook University |
Kasyanov, Pavlo | National Technical University of Ukraine "KPI", NAS of Ukraine |
Kraemer, David | Stony Brook University |
Keywords: Filtering, Stochastic optimal control, Stochastic systems
Abstract: The paper studies discrete-time statistical filtering problems with the goal to minimize expected total costs. Such problems are usually defined by pairs of stochastic equations and by one-step cost functions. Stochastic equations describe the state and observation processes, and these equations are defined by transition and observation functions. This paper provides sufficient conditions on observation, transition, and one-step cost functions for convergence of value-iteration algorithms for problems with finite and infinite horizons. It is well-known that nonlinear and linear filtering problems can be presented as Partially Observable Markov Decision Processes (POMDPs). The paper applies contemporary results on convergence of value iterations for Markov Decision Processes (MDPs) and for POMDPs to filtering problems. It formulates conditions on observation and transition functions which imply weak continuity of the filter. Weak continuity of the filter means weak continuity of transition probabilities between belief states. The sufficient condition on one-step functions is their K-inf-compactness. The described conditions hold for broad classes of nonlinear filters and for Kalman filters.
|
|
16:20-16:40, Paper TuC10.2 | Add to My Program |
An Excursion Onto Schrödinger's Bridges: Stochastic Flows between Spatio-Temporal Marginals |
|
Eldesoukey, Asmaa | University of California at Irvine |
Movilla Miangolarra, Olga | University of Calfornia, Irvine |
Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Stochastic optimal control, Stochastic systems, Optimization
Abstract: In a gedanken experiment, in 1931/32, Erwin Schrödinger sought to understand how unlikely events can be reconciled with prior laws dictated by the underlying physics. In the process, he posed and solved a celebrated problem that is now named after him -- the Schrödinger's bridge problem (SBP). In this, one seeks to determine the ``most likely'' path that stochastic particles took while transitioning between states incompatible with the prior. Schrödinger's problem proved to have yet another interpretation, that of the stochastic control problem to steer diffusive particles by suitable control action so as to match specified marginals -- a soft probabilistic constraint. Interestingly, the SBP is convex and can be solved by an efficient iterative algorithm known as the {em Fortet-Sinkhorn} algorithm. The dual interpretation of the SBP, as an estimation and a control problem, as well as its computational tractability, are at the heart of an ever-expanding range of its applications in controls. The purpose of the present work is to expand substantially the type of control and estimation problems that can be addressed following Schrödinger's dictum, by incorporating random stopping (freezing) times for a given stochastic flow. Specifically, in the context of estimation, we seek the most likely evolution realizing spatio-temporal marginals. In the context of control, we seek the {em optimal} control action directing the stochastic flow toward spatio-temporal probabilistic constraints. To this end, we derive a new Schrödinger system of coupled, in space and time, partial differential equations to construct the solution to the proposed problem, and further, we show that a Fortet-Sinkhorn type of algorithm is once again available to attain the associated bridge. A key feature
|
|
16:40-17:00, Paper TuC10.3 | Add to My Program |
On Analyzing Filters with Bayesian Parameter Inference and Poisson-Sampled Observations (I) |
|
Tanwani, Aneel | Laas -- Cnrs |
Keywords: Filtering, Estimation, Statistical learning
Abstract: The problem of state estimation in continuous-time linear stochastic systems is considered with several constraints on the available information. It is stipulated that the model of the system contains some unknown parameters and the observation process is randomly time-sampled. The classical solution due to Kalman-Bucy cannot be implemented in that case, and we revisit the idea of partitioning the set of unknown parameters, and consider multiple filters corresponding to each possible value of the unknown parameter. The posterior distribution of the unknown parameters conditioned upon available observations is computed from Bayes' rule. The resulting state estimate is a weighted sum of the state estimates generated by multiple Kalman filters, where the weights are determined by the posterior distribution of the unknown parameters. We analyze the performance of the algorithm by looking at its asymptotic behavior and establishing boundedness of the error covariance matrix.
|
|
17:00-17:20, Paper TuC10.4 | Add to My Program |
Backward Map for Filter Stability Analysis (I) |
|
Kim, Jin Won | Hongik University |
Joshi, Anant A. | University of Illinois at Urbana Champaign |
Mehta, Prashant G. | Univ of Illinois, Urbana-Champaign |
Keywords: Stochastic systems, Filtering, Markov processes
Abstract: In this paper, a backward map is introduced for the purposes of analysis of the nonlinear (stochastic) filter stability. The backward map is important because the filter-stability in the sense of chi-squared-divergence follows from showing a certain variance decay property for the backward map. To show this property requires additional assumptions on the model properties of the hidden Markov model (HMM). The analysis in this paper is based on introducing a Poincare Inequality (PI) for HMMs with white noise observations. In finite state-space settings, PI is related to both the ergodicity of the Markov process as well as the observability of the HMM. It is shown that the Poincare constant is positive if the HMM is detectable.
|
|
17:20-17:40, Paper TuC10.5 | Add to My Program |
Information Compression in Dynamic Information Disclosure Games (I) |
|
Tang, Dengwang | University of Southern California |
Subramanian, Vijay G. | University of Michigan |
Keywords: Game theory, Stochastic optimal control, Stochastic systems
Abstract: We consider a two-player dynamic information design problem for a game played between a principal and a receiver on top of a Markovian system controlled by the receiver's actions. The principal strategically obtains and shares some information about the underlying system with the receiver in order to influence their actions, and agents' instantaneous rewards depend only on the system state and receiver actions. In our game, both players have long-term objectives, and the principal sequentially commits to their strategies instead of at the beginning---at every turn the principal can choose randomized experiments to observe the system partially. The principal can share details about the experiments to the receiver. In our analysis the emph{truthful disclosure} rule is assumed---the principal is required to truthfully announce each experiment detail and result to the receiver immediately after the result is revealed. Based on the received information, when its their turn the receiver takes an action which influences the state of the underlying system. Using a constructive backward inductive procedure, we show that there exists a Perfect Bayesian Equilibrium in this game where both agents play Canonical Belief Based (CBB) strategies using a compressed version of their information, rather than their full information, to choose experiments (for the principal) or actions (for the receiver).
|
|
17:40-18:00, Paper TuC10.6 | Add to My Program |
A Class of Linear Quadratic Collective Discrete Choice Models with Congestion Effects (I) |
|
Toumi, Noureddine | Polytechnique Montreal |
Malhame, Roland P. | Ecole Poly. De Montreal |
Le Ny, Jerome | Polytechnique Montréal |
Keywords: Cooperative control, Optimization, Stochastic systems
Abstract: This paper introduces a new class of linear quadratic dynamic collective choice model with congestion effects. Agents choose between multiple destinations and cooperate to minimize a collective average cost. To preserve the model's analytical tractability, we employ a quadratic negative term in the cost function to simulate congestion avoidance. We show that identifying a solution to our model is equivalent to solving a number of linear quadratic regulator problems equal to the number of destinations, followed by an optimal transport problem parameterized by the fraction of agents choosing each destination. This contrasts with the brute-force approach consisting of solving a series of linear quadratic regulator problems whose number increases exponentially with the population size. To further reduce the complexity of the solution search, we also define an appropriate system of limiting equations, whose solution can be used to efficiently approximate the optimal solution of the model as the number of agents increases.
|
|
TuC11 Regular Session, Brown 2 |
Add to My Program |
Data Driven Control VI |
|
|
Chair: Mauroy, Alexandre | University of Namur |
Co-Chair: Anantharaman, Ramachandran | University of Namur |
|
16:00-16:20, Paper TuC11.1 | Add to My Program |
Dynamic Mode Decomposition with Non-Uniform Sampling |
|
Anantharaman, Ramachandran | Eindhoven University of Technology |
Mauroy, Alexandre | University of Namur |
Keywords: Data driven control
Abstract: Dynamic Mode Decomposition (DMD) and its extensions (EDMD) have been at the forefront of data-based approaches to Koopman operators. Most (E)DMD algorithms assume that the entire state is sampled at a uniform sampling rate. In this paper, we provide an algorithm where the entire state is not uniformly sampled, with individual components of the states measured at individual (but known) sampling rates. We propose a two-step DMD algorithm where the first step performs Hankel DMD on individual state components to estimate them at specified time instants. With the entire state reconstructed at the same time instants, we compute the (E)DMD for the system with the estimated data in the second step.
|
|
16:20-16:40, Paper TuC11.2 | Add to My Program |
Learning a Formally Verified Control Barrier Function in Stochastic Environment |
|
Tayal, Manan | Indian Institute of Science, Bengaluru |
Zhang, Hongchao | Washington University in St. Louis |
Jagtap, Pushpak | Indian Institute of Science |
Clark, Andrew | Washington University in St. Louis |
Nadubettu Yadukumar, Shishir | Indian Institute of Science |
Keywords: Data driven control, Stochastic systems, Neural networks
Abstract: Safety is a fundamental requirement of control systems. Control Barrier Functions (CBFs) are proposed to ensure the safety of the control system by constructing safety filters or synthesizing control inputs. However, the safety guarantee and performance of safe controllers rely on the construction of valid CBFs. Inspired by universal approximatability, CBFs are represented by neural networks, known as neural CBFs (NCBFs). This paper presents an algorithm for synthesizing formally verified continuous-time neural Control Barrier Functions in stochastic environments in a single step. The proposed training process ensures efficacy across the entire state space with only a finite number of data points by constructing a sample-based learning framework for Stochastic Neural CBFs (SNCBFs). Our methodology eliminates the need for post hoc verification by enforcing Lipschitz bounds on the neural network, its Jacobian, and Hessian terms. We demonstrate the effectiveness of our approach through case studies on the inverted pendulum system and obstacle avoidance in autonomous driving, showcasing larger safe regions compared to baseline methods.
|
|
16:40-17:00, Paper TuC11.3 | Add to My Program |
Feedforward Controllers from Learned Dynamic Local Model Networks with Application to Excavator Assistance Functions |
|
Greiser, Leon Constantin | RWTH Aachen University |
Demir, Ozan | Ruhr-University Bochum |
Hartmann, Benjamin | Robert Bosch GmbH |
Hose, Henrik | RWTH Aachen University |
Trimpe, Sebastian | RWTH Aachen University |
Keywords: Control applications, Data driven control, Feedback linearization
Abstract: Complicated first principles modelling and controller synthesis can be prohibitively slow and expensive for high-mix, low-volume products such as hydraulic excavators. Instead, in a data-driven approach, recorded trajectories from the real system can be used to train local model networks (LMNs), for which feedforward controllers are derived via feedback linearization. However, previous works required LMNs without zero dynamics for feedback linearization, which restricts the model structure and thus modelling capacity of LMNs. In this paper, we overcome this restriction by providing a criterion for when feedback linearization of LMNs with zero dynamics yields a valid controller. As a criterion we propose the bounded-input bounded-output stability of the resulting controller. In two additional contributions, we extend this approach to consider measured disturbance signals and multiple inputs and outputs. We illustrate the effectiveness of our contributions in a hydraulic excavator control application with hardware experiments. To this end, we train LMNs from recorded, noisy data and derive feedforward controllers used as part of a leveling assistance system on the excavator. In our experiments, incorporating disturbance signals and multiple inputs and outputs enhances tracking performance of the learned controller. A video of our experiments is available at https://youtu.be/lrrWBx2ASaE.
|
|
17:00-17:20, Paper TuC11.4 | Add to My Program |
A Unified Non-Strict Finsler Lemma |
|
Meijer, Tomas | Eindhoven University of Technology |
Scheres, Koen | Eindhoven University of Technology |
van den Eijnden, Sebastiaan | Eindhoven University of Technology |
Holicki, Tobias | University of Stuttgart |
Scherer, Carsten W. | University of Stuttgart |
Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
Keywords: Data driven control, LMIs, Optimization
Abstract: In this paper, we present a unified general non-strict Finsler lemma. This result is general in the sense that it does not impose any restrictions on the involved matrices and, thereby, it encompasses all existing non-strict versions of Finsler's lemma that do impose such restrictions. To further illustrate its usefulness, we showcase applications of the non-strict Finsler's lemma in deriving a structured solution to a special case of the non-strict projection lemma, and we use the unified non-strict Finsler's lemma to prove a more general version of the matrix Finsler's lemma.
|
|
17:20-17:40, Paper TuC11.5 | Add to My Program |
Model-Based and Data-Based Output Feedback for External Positivity |
|
Al Makdah, Abed AlRahman | University of California Riverside |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Compartmental and Positive systems, Data driven control, LMIs
Abstract: In this work, we derive output-feedback controllers that render the closed-loop system externally positive. We begin by expressing the class of discrete-time, linear, time-invariant systems and the class of dynamic controllers in the space of input-output behaviors, where a dynamic controller can be expressed as a static behavioral feedback gain. We leverage the static form of the controller to derive output-feedback controllers that achieve monotonic output tracking of a constant non-negative reference output. Further, we provide a direct data-driven approach to derive monotonic tracking output-feedback controllers for single-input-single-output (SISO) systems. Our approaches, model-based and data-based, allow us to obtain output-feedback controllers that render the closed-loop system externally positive. Finally, we validate our results numerically in a drone landing control problem.
|
|
17:40-18:00, Paper TuC11.6 | Add to My Program |
Controller Synthesis for Input-State Data with Measurement Errors |
|
Bisoffi, Andrea | Politecnico Di Milano |
Li, Lidong | University of Groningen |
De Persis, Claudio | University of Groningen |
Monshizadeh, Nima | University of Groningen |
Keywords: Data driven control, Uncertain systems, Robust control
Abstract: We consider the problem of designing a state-feedback controller for a linear system, based only on noisy input-state data. We focus on input-state data corrupted by measurement errors, which, albeit less investigated, are as relevant as process disturbances in applications. For energy and instantaneous bounds on these measurement errors, we derive linear matrix inequalities for controller design where the one for the energy bound is equivalent to robust stabilization of all systems consistent with the noisy data points via a common Lyapunov function.
|
|
TuC12 Regular Session, Brown 3 |
Add to My Program |
Learning and Iterative Learning Control |
|
|
Chair: Seel, Thomas | Leibniz Universität Hannover |
Co-Chair: Margellos, Kostas | University of Oxford |
|
16:00-16:20, Paper TuC12.1 | Add to My Program |
Robust Optimization for Adversarial Learning with Finite Sample Complexity Guarantees |
|
Bertolace, André | University of Oxford |
Gatsis, Konstantinos | University of Southampton |
Margellos, Kostas | University of Oxford |
Keywords: Learning, Statistical learning, Machine learning
Abstract: Decision making and learning under uncertainty, especially with adversarial attacks, is crucial for reliable operations. This paper introduces a novel adversarial training method for robust linear and nonlinear classifiers, inspired by Support Vector Machine (SVM) margins. We derive finite sample complexity bounds for binary and multi-class classifiers, which align with those of natural classifiers. Our algorithm uses Linear Programming (LP) and Second Order Cone Programming (SOCP) for linear and nonlinear models, respectively. Experiments on MNIST and CIFAR10 datasets demonstrate performance comparable to state-of-the-art methods, without requiring adversarial examples during training. Our approach provides a robust framework for enhancing classifier resilience against adversaries.
|
|
16:20-16:40, Paper TuC12.2 | Add to My Program |
Repetitive T-S Fuzzy Model-Based Iterative Learning Control Law Design |
|
Maniarski, Robert | University of Zielona Góra |
Paszke, Wojciech | University of Zielona Gora |
Tao, Hongfeng | Jiangnan University |
Rogers, Eric | University of Southampton |
Keywords: Iterative learning control, LMIs, Fuzzy systems
Abstract: This paper develops an iterative learning control law for a class of nonlinear systems. The approach used to represent the nonlinear system dynamics is a Takagi-Sugeno fuzzy repetitive process that considers the two directions of information propagation. Then, the control action investigated is a state feedback control law combined with a PD-type feed-forward learning control law. Consequently, linear matrix inequality techniques can be used for control design. Furthermore, this approach allows the design of control action to satisfy the requirements on both the error convergence and the transient dynamics. Finally, an example demonstrates the properties of the new design.
|
|
16:40-17:00, Paper TuC12.3 | Add to My Program |
Reference-Adapting Iterative Learning Control for Motion Optimization in Constrained Environments |
|
Meindl, Michael | Leibniz University Hannover, Institute of Mechatronic Systems |
Bachhuber, Simon | FAU Erlangen-Nürnberg |
Seel, Thomas | Leibniz Universität Hannover |
Keywords: Iterative learning control, Constrained control, Autonomous robots
Abstract: Optimizing controllers for reference tracking in real-world environments typically requires laborious manual tuning of a control policy to ensure safe operation under constraints. In this work, a Reference-Adapting Iterative Learning Control (RAILC) scheme is proposed that enables autonomous motion optimization for multi-input/multi-output systems with linear, inequality constraints. The proposed method consists of a standard ILC system that iteratively updates an input feedforward trajectory to learn to perform the desired, optimal motion which is encoded as a reference trajectory. To also ensure compliance with the constraints on every single trial, the standard ILC is modularly extended by a reference adaptation scheme. Both feasibility and constraint compliance of the proposed RAILC method are formally proven. Furthermore, it is shown that monotonic convergence of the underlying ILC scheme guarantees stability and monotonic convergence of the proposed RAILC method. The method's capability to solve reference tracking and motion optimization problems for constrained MIMO systems is validated by two simulation examples including a two-link robot that - by means of the proposed method - learns to increase the execution speed of a desired motion by a factor of five.
|
|
17:00-17:20, Paper TuC12.4 | Add to My Program |
Trackability-Based Tracking Control for Stochastic Learning Systems: A Two-Dimensional System Method |
|
Lv, Wenjin | Beihang University |
Zhang, Jingyao | Beihang University (BUAA) |
Meng, Deyuan | Beihang University (BUAA) |
Keywords: Iterative learning control, Stochastic systems, Time-varying systems
Abstract: This paper focuses on exploring a novel trackability-based framework for a class of iterative learning control (ILC) systems subject to stochastic disturbances by using a two-dimensional (2-D) system method. By examining the fundamental trackability property of the stochastic ILC systems, the trackability-based stochastic ILC design and analysis framework is developed, eliminating the need for the common realizability assumption. Under this framework, thanks to the 2-D system method with the Roesser systems, the convergence results for both the output and input errors can be established under a unified condition, regardless of the full row or column rank of the input-output coupling matrix. A simulation example is included to demonstrate the validity of our proposed stochastic ILC design framework for ILC systems.
|
|
17:20-17:40, Paper TuC12.5 | Add to My Program |
Efficient Online Inference and Learning in Partially Known Nonlinear State-Space Models by Learning Expressive Degrees of Freedom Offline |
|
Ewering, Jan-Hendrik | Leibniz Universität Hannover |
Volkmann, Björn | Leibniz University Hannover, Institute of Mechatronic Systems |
Ehlers, Simon F. G. | Leibniz University Hannover |
Seel, Thomas | Leibniz Universität Hannover |
Meindl, Michael | Leibniz University Hannover, Institute of Mechatronic Systems |
Keywords: Estimation, Learning, Grey-box modeling
Abstract: Intelligent real-world systems critically depend on expressive information about their system state and changing operation conditions, e.g., due to variation in temperature, location, wear, or aging. To provide this information, online inference and learning attempts to perform state estimation and (partial) system identification simultaneously. Current works combine tailored estimation schemes with flexible learning-based models but suffer from convergence problems and computational complexity due to many degrees of freedom in the inference problem (i.e., parameters to determine). To resolve these issues, we propose a procedure for data-driven offline conditioning of a highly flexible Gaussian Process (GP) formulation such that online learning is restricted to a subspace, spanned by expressive basis functions. Due to the simplicity of the transformed problem, a standard particle filter can be employed for Bayesian inference. In contrast to most existing works, the proposed method enables online learning of target functions that are nested nonlinearly inside a first-principles model. Moreover, we provide a theoretical quantification of the error, introduced by restricting learning to a subspace. A Monte-Carlo simulation study with a nonlinear battery model shows that the proposed approach enables rapid convergence with significantly fewer particles compared to a baseline and a state-of-the-art method.
|
|
17:40-18:00, Paper TuC12.6 | Add to My Program |
Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare |
|
Konti, Xenia | Duke University |
Riess, Hans | Duke University |
Giannopoulos, Manos | Duke University |
Shen, Yi | Duke University |
Pencina, Michael | Duke University |
Economou-Zavlanos, Nicoleta | Duke University |
Zavlanos, Michael M. | Duke University |
Keywords: Healthcare and medical systems, Machine learning, Optimization algorithms
Abstract: In this paper, we address the challenge of hetero- geneous data distributions in cross-silo federated learning by introducing a novel algorithm, which we term Cross-silo Robust Clustered Federated Learning (CS-RCFL). Our approach leverages the Wasserstein distance to construct ambiguity sets around each client’s empirical distribution that capture possible distribution shifts in the local data, enabling evaluation of worst-case model performance. We then propose a model-agnostic integer fractional program to determine the optimal distributionally robust clustering of clients into coalitions so that possible biases in the local models caused by statistically heterogeneous client datasets are avoided, and analyze our method for linear and logistic regression models. Finally, we discuss a federated learning protocol that ensures the privacy of client distributions, a critical consideration, for instance, when clients are healthcare institutions. We evaluate our algorithm on synthetic and real-world healthcare data.
|
|
TuC13 Invited Session, Suite 1 |
Add to My Program |
Network Traffic Modelling and Control |
|
|
Chair: Nick Zinat Matin, Hossein | University of California, Berkeley |
Co-Chair: Delle Monache, Maria Laura | University of California, Berkeley |
Organizer: Cicic, Mladen | University of California, Berkeley |
Organizer: Nick Zinat Matin, Hossein | University of California, Berkeley |
Organizer: Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Organizer: Delle Monache, Maria Laura | University of California, Berkeley |
|
16:00-16:20, Paper TuC13.1 | Add to My Program |
On the Impact of Coordinated Fleets Size on Traffic Efficiency (I) |
|
Toso, Tommaso | Université Grenoble Alpes, CNRS, Inria, Grenoble INP, GIPSA-Lab |
Parise, Francesca | Cornell University |
Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
Kibangou, Alain | Univ. Grenoble Alpes |
Keywords: Transportation networks, Game theory, Traffic control
Abstract: We investigate a traffic assignment problem on a transportation network, considering both the demands of individual drivers and of a large fleet controlled by a central operator (minimizing the fleet's average travel time). We formulate this problem as a two-player convex game and we study how the size of the coordinated fleet, measured in terms of share of the total demand, influences the Price of Anarchy (PoA). We show that, for two-terminal networks, there are cases in which the fleet must reach a minimum share before actually affecting the PoA, which otherwise remains unchanged. Moreover, for parallel networks, we prove that the PoA is monotonically non-increasing in the fleet share.
|
|
16:20-16:40, Paper TuC13.2 | Add to My Program |
Improving Social Cost in Traffic Routing with Bounded Regret Via Second-Best Tolls (I) |
|
Alanqary, Arwa | University of California, Berkeley |
Kreidieh, Abdul Rahman | University of California, Berkeley |
Samaranayake, Samitha | Cornell University |
Bayen, Alexandre | University of California, Berkeley |
Keywords: Traffic control, Optimization
Abstract: In this work, we investigate algorithmic improvements that navigation services can implement to steer road networks toward a system-optimal state while retaining high levels of user compliance. We model the compliance of users using marginal regret, and we extend the definition of the social cost function to account for various traffic congestion externalities. We propose a routing algorithm for the static traffic assignment problem that improves the social cost with guarantees on the worst-case regret in the network. This algorithm leverages the connection we establish between this problem and that of second-best toll pricing. We present numerical experiments on different networks to illustrate the trade-off between regret and efficiency of the resulting assignment for arbitrary social costs.
|
|
16:40-17:00, Paper TuC13.3 | Add to My Program |
A Dynamic Programming Approach for Road Traffic Estimation (I) |
|
Laurini, Mattia | University of Parma |
Saccani, Irene | Università Di Parma |
Ardizzoni, Stefano | University of Parma |
Consolini, Luca | Università Di Parma |
Locatelli, Marco | University of Parma |
Keywords: Estimation, Transportation networks, Stochastic systems
Abstract: We consider a road network represented by a directed graph. We assume to collect many measurements of traffic flows on all the network arcs, or on a subset of them. We assume that the users are divided into different groups. Each group follows a different path. The flows of all user groups are modeled as a set of independent Poisson processes. Our focus is estimating the paths followed by each user group, and the means of the associated Poisson processes. We present a possible solution based on a Dynamic Programming algorithm. The method relies on the knowledge of high–order cumulants. We discuss the theoretical properties of the introduced method. Finally, we present some numerical tests on well–known benchmark networks, using synthetic data.
|
|
17:00-17:20, Paper TuC13.4 | Add to My Program |
Optimal Pricing Strategies for Charging Station Operators in the Frequency Containment Reserves Market (I) |
|
Gasnier, Guillaume | GIPSA-Lab, CNRS |
Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
Keywords: Control applications, Optimization, Nonlinear systems
Abstract: We present an innovative strategy leveraging electric vehicles and their charging station infrastructure to provide grid-balancing services in the ancillary market. Our study focuses on the competition between two charging station operators for customer attraction and participation in the frequency containment reserves market. Our model tracks electric vehicles state-of-charge dynamics considering variables such as driver behavior, state-of-charge levels, and charging/discharging costs. Charging station operators engaged in the frequency containment reserves market in collaboration with aggregators. We introduce an optimization framework to determine pricing strategies that maximize profits for charging station operators. Our simulations demonstrate the benefits of charging stations, whether competing or collaborating, participating in the frequency containment reserves market.
|
|
17:20-17:40, Paper TuC13.5 | Add to My Program |
A Model Predictive Control Approach for Smooth Traffic Flow of Connected Autonomous Vehicles at Signalized Junctions |
|
Sen, Rudra | Indian Institute of Technology Delhi |
Datta, Subashish | Indian Institute of Technology Delhi |
Keywords: Autonomous vehicles, Traffic control, Predictive control for linear systems
Abstract: In this article, we consider the problem of controlling a group of autonomous vehicles, which are maneuvering on a road, such that smooth traffic flow can happen. For this, we propose that every autonomous vehicle maneuvers with a pre-specified velocity profile. The velocity profiles are determined in such a way that either none of the vehicles wait near a signalized junction or only a limited number of vehicles are accumulated when the red light is ON. To maneuver the autonomous vehicles with a pre-specified velocity profile, a control sequence is generated using a receding horizon control approach in the model predictive control framework. The associated problems, such as the feasibility of the underlying constrained optimization problem at every stage and the convergence analysis, are also considered. The efficacy of the proposed approach is evinced with numerical simulations.
|
|
17:40-18:00, Paper TuC13.6 | Add to My Program |
Controlling Traffic Flow for Electric Fleets Via Optimal Transport |
|
Mascherpa, Michele | KTH Royal Institute of Technology |
Karlsson, Johan | KTH Royal Institute of Technology |
Keywords: Computational methods, Transportation networks, Optimal control
Abstract: In this paper we consider the problem of optimally steering an ensemble of battery-powered agents over a network. This is an important problem in applications such as traffic flow control for electric vehicles, where both capacity constraints from the roads and the locations of charging stations need to be taken into account. We extend previous work where origin-destination problems have been formulated using optimal transport. By introducing a state representing the charge level, we can formulate the steering problem as a structured multi-marginal optimal transport problem. The computational method is based on a dual coordinate ascent algorithm applied to the the entropy regularized problem, in which we can exploit the decomposable structure of the cost tensor for efficient computations. In this formulation the capacity constraints are represented in terms of certain linear operators, and we derive explicit expressions for the corresponding updates of blocks of the dual variables. Finally, the method is illustrated with a numerical example where commodities having different charges are required to travel over a grid from origin to destination while minimizing the total energy consumed.
|
|
TuC14 Regular Session, Suite 2 |
Add to My Program |
Estimation II |
|
|
Chair: Mercère, Guillaume | University of Poitiers |
Co-Chair: Raïssi, Tarek | Conservatoire National Des Arts Et Métiers |
|
16:00-16:20, Paper TuC14.1 | Add to My Program |
Parameter Identification of Linear Error Equations: Guaranteeing Output Error Boundedness |
|
Pin, Gilberto | Electrolux |
Gong, Yizhou | ShanghaiTech University |
Wang, Yang | Shanghai Technology Unversity |
Serrani, Andrea | The Ohio State University |
Keywords: Estimation, Identification, Adaptive control
Abstract: In this paper, we tackle the classical problem of estimating the parameters of an algebraic linear parameter model with the objective of solving the long-standing problem of guaranteeing boundedness of the output error independently from the growth of the regressors. Two solutions are presented. The first solution provides global results under the assumption that the time derivative of the regressor is available. The other solution disposes of the knowledge of the derivative of the regressor, and yields results that are valid in a semi-global sense, under the assumption that the regressor has a bounded growth. Simulation results provides an illustration of the proposed techniques in comparison with standard unnormalized and normalized gradient laws.
|
|
16:20-16:40, Paper TuC14.2 | Add to My Program |
Consistent Rigid Body Localization from Range Measurements with Anchor Position Uncertainty |
|
Zhao, Hongxu | The Chinese Unviersity of Hong Kong, Shenzhen |
Zeng, Guangyang | The Chinese University of Hong Kong, Shenzhen |
Jiang, Haodong | Chinese University of Hong Kong, Shenzhen |
Ren, Xiaoqiang | Shanghai University |
Yang, Wen | East China University of Science and Technology |
Wu, Junfeng | The Chinese Unviersity of Hong Kong, Shenzhen |
Keywords: Estimation, Identification, Nonlinear systems identification
Abstract: Rigid Body Localization (RBL) using range measurements has recently attracted much attention. In some large and complicated scenarios, we may not obtain the accurate positions for anchors deployed in the environment. However, few works have considered the anchor position uncertainty. In this paper, we formulate the Maximum Likelihood (ML) RBL problem with anchor position uncertainty and find that the ML estimate is not necessarily consistent. As an alternative, we propose a two-step estimator MGN-CULS, which is both consistent and computationally efficient. In the first step, we develop a closed-form initial estimate with consistency using bias-eliminating techniques. In the second step, we design a modified Gauss-Newton iteration to refine the initial estimate without destabilizing the consistency. Simulation results demonstrate the stable and accurate performance of our proposed algorithm.
|
|
16:40-17:00, Paper TuC14.3 | Add to My Program |
Estimation of Self-Similarity Index for Rosenblatt Models |
|
Coupek, Petr | Charles University, Faculty of Mathematics and Physics |
Kriz, Pavel | Charles University |
Maslowski, Bohdan | Charles University |
Keywords: Estimation, Identification, Stochastic systems
Abstract: In this paper, we address the self-similarity index estimation problem. For observations of a Rosenblatt process, we propose a new estimator based on the logarithmic variation with adjustment for bias and perform numerical simulations that show good performance of this estimator. We also address the problem of estimation of the self-similarity index for SDEs driven by a Rosenblatt process. A robust (against drift and noise intensity misspecification) estimator is described and its (asymptotic) behavior is analyzed both theoretically and in simulations.
|
|
17:00-17:20, Paper TuC14.4 | Add to My Program |
Stable, Consistent, Closed-Form Estimators for VAR(1) |
|
Rong, Xinhui | University of New South Wales |
Solo, Victor | University of New South Wales |
Keywords: Estimation, Identification, Subspace methods
Abstract: Guaranteeing the stability of the estimator of the system matrix of a vector autoregression of order one VAR(1), is of great importance in many applications. However, almost all existing algorithms that do so are iterative, require many tuning parameters and are computationally expensive. Here we extend our recent work to derive two sets of new closed- form estimators that require no tuning parameters and are computationally cheap. We prove their stability and statistical consistency and compare them in simulations.
|
|
17:20-17:40, Paper TuC14.5 | Add to My Program |
Enhanced Recursive Total Least Squares Method with Subspace Tracking and Noise Covariance Estimation |
|
Elsherbiny, Mohamed | Université De Poitiers |
Mercère, Guillaume | University of Poitiers |
Arvis, Vincent | Michelin |
Biesse, Frederic | Michelin |
Keywords: Estimation, Identification, Subspace methods
Abstract: In this study, we present a novel recursive approach aimed at refining parameter estimation within the Errors-in-Variables (EIV) framework when noisy data are available online. Our method integrates the Weighted Total Least Squares (WTLS) technique with a Subspace Tracking algorithm and a dynamic Noise Covariance Adaptation solution, offering improved accuracy and precision in estimating parameters, particularly when errors affect both input and output variables. By merging WTLS and Subspace Tracking methodologies, our model adeptly adapts to variations in system dynamics. Moreover, the integration of a real-time Noise Covariance Adaptation mechanism into our parameter estimation strategy effectively addresses uncertainties stemming from input and output measurement noise. Through different simulations and comparative analyses, we validate the efficacy of our approach, underscoring its potential to significantly advance parameter estimation within EIV models across diverse applications.
|
|
17:40-18:00, Paper TuC14.6 | Add to My Program |
Interval Estimation for Continuous-Time Linear Systems with Parametric Uncertainties |
|
Zhang, Fan | Harbin Institute of Technology |
Wang, Zhenhua | Harbin Institute of Technology |
Meslem, Nacim | GIPSA-LAB, CNRS |
Raïssi, Tarek | Conservatoire National Des Arts Et Métiers |
Shen, Yi | Harbin Institute of Technology |
Keywords: Estimation, Linear systems, LMIs
Abstract: This paper proposes a novel two-step interval estimation method for continuous-time linear systems with parametric uncertainties. In the first step, a robust augmented Luenberger-like observer with two observer gains is proposed. More formally, this state observer can be viewed as a cascade of two estimators, where the second one is supplied by the data provided by the first one. The first estimator is able to attenuate the effect of the system uncertainty (uncertain parameters, process disturbance and measurement noise) on the accuracy of the estimation error. Based on the output equation of the system, the second estimator improves the precision of the estimated state vector (provided by the first estimator) by solving a Frobenius-norm optimization problem on the feasible domain of the system output. In the second step, by allying Lyapunov stability theory with ellipsoidal analysis, guaranteed bounds on the estimation error are established. Throughout a numerical example, we show that the proposed approach outperforms some existing methods in the literature.
|
|