| |
Last updated on September 22, 2023. This conference program is tentative and subject to change
Technical Program for Friday December 15, 2023
|
FrA01 Invited Session, Melati Junior 4010A-4111 |
Add to My Program |
Learning, Optimization, and Game Theory III |
|
|
Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Co-Chair: Zhang, Kaiqing | University of Maryland |
Organizer: Doan, Thinh T. | Virginia Tech |
Organizer: Sayin, Muhammed Omer | Bilkent University |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Zhang, Kaiqing | University of Maryland |
|
10:00-10:20, Paper FrA01.1 | Add to My Program |
Approximate Optimal Indirect Regulation of an Uncertain Agent with a Lyapunov-Based Deep Neural Network |
|
Makumi, Wanjiku A. | University of Florida |
Bell, Zachary I. | Air Force |
Dixon, Warren E. | University of Florida |
Keywords: Nonlinear systems identification, Lyapunov methods, Machine learning
Abstract: An approximate optimal policy is developed for a pursuing agent to indirectly regulate an evading agent coupled by an uncertain interaction dynamic. Approximate dynamic programming is used to design a controller for the pursuing agent to optimally influence the evading agent to a goal location. Since the interaction dynamic between the agents is unknown, integral concurrent learning is used to update a Lyapunov-based deep neural network to facilitate sustained learning and system identification. A Lyapunov-based stability analysis is used to show uniformly ultimately bounded convergence. Simulation results demonstrate the performance of the developed method.
|
|
10:20-10:40, Paper FrA01.2 | Add to My Program |
Neural Operators for Hyperbolic PDE Backstepping Feedback Laws (I) |
|
Bhan, Luke | University of California, San Diego |
Shi, Yuanyuan | University of California San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Nonlinear output feedback, Learning, Nonlinear systems
Abstract: We introduce a framework for accelerating the computation of a backstepping controller in PDE control. We learn the nonlinear operator from the plant parameter and PDE solution to the boundary control with a (deep) neural network. We provide closed-loop stability guarantees (semiglobal exponential) under a NN-approximation of the feedback law. While, in the existing PDE backstepping, finding a feedback law requires the solution to multiple integral equations and operations for both the gain and control input value, the neural operator (NO) approach we propose learns the mapping from the functional coefficients of the plant PDE and PDE system state to the boundary control value by employing a sufficiently high number of offline numerical solutions to the analytical feedback control law. We prove the existence of a DeepONet approximation with arbitrarily high accuracy, of the exact nonlinear continuous operator mapping between the PDE coefficient functions and PDE system state into a control feedback law. Once proven to exist, learning of the NO is standard, completed "once and for all" (never online) and the control feedback equation doesn't need to be solved ever again, for both any new functional coefficient and PDE system state that does not exceed the magnitude of the coefficients and states used in training. Simulation illustrations are provided and the code is available on github.
|
|
10:40-11:00, Paper FrA01.3 | Add to My Program |
On the Effect of the Presence of an Opponent in a Class of LQ Differential Games (I) |
|
Tarantino, Lorenzo | Università Degli Studi Di Roma Tor Vergata |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Sassano, Mario | University of Rome, Tor Vergata |
Keywords: Optimal control, Optimization, Nonlinear systems
Abstract: The aim of this paper is to assess the effect of the presence of an opponent in a class of finite-horizon differential games described by scalar linear differential equations and quadratic cost functionals in which the state is penalized only at the terminal time. The contribution of the other player is quantitatively characterized by comparing the solutions of the underlying Riccati differential equations for the optimal control (in the absence of the opponent) and of the differential game. In the case of open-loop Nash equilibria, this effect can be characterized in closed form, since an analytic expression for the solutions of the coupled asymmetric differential Riccati equations can be computed. For feedback Nash equilibria a closed-form solution to the related coupled symmetric differential Riccati equations cannot be determined. Therefore an estimate of the solution is provided by relying on a functional approximation approach, allowing to characterize the effect of the presence of an opponent also in this setting.
|
|
11:00-11:20, Paper FrA01.4 | Add to My Program |
Learning Switched Koopman Models for Control of Entity-Based Systems (I) |
|
Blischke, Madeline | The University of California, Santa Barbara |
Hespanha, Joao P. | Univ. of California, Santa Barbara |
Keywords: Data driven control, Learning, Optimal control
Abstract: One problem that arises in control engineering is that of controlling a system for which the dynamics are unknown. Such a problem favors a data-driven approach, such as can be done through the use of the Koopman operator. We present a switched Koopman model, applicable to systems with discrete sets of inputs, that gives rise to an optimal control problem with a piecewise affine value function. This structure provides an efficient representation and enables a heuristic pruning algorithm that avoids the exponential complexity of finding the true optimal solution. We use density-like observables that are defined through the notion of entity-based systems: systems whose state is composed of a possibly varying number of entities that can be grouped into classes of like-entities. This encompasses many systems, including arcade games, a common benchmark used in reinforcement learning. We find that our approach requires much less training than commonly used for reinforcement learning. The Koopman approach also has the advantage of being agnostic to the control objective, which allows the objective to be changed without needing to retrain the model.
|
|
11:20-11:40, Paper FrA01.5 | Add to My Program |
A Physics-Informed Neural Networks Framework to Solve the Infinite-Horizon Optimal Control Problem (I) |
|
Fotiadis, Filippos | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Neural networks, Optimal control, Learning
Abstract: In this work, we leverage physics-informed neural networks (PINNs) to approximately solve the infinite-horizon optimal control problem for nonlinear systems. Specifically, since PINNs are generally able to solve a class of partial differential equations, they can be employed to approximate the value function in the infinite-horizon optimal control problem, via solving the associated steady-state Hamilton-Jacobi-Bellman (HJB) equation. However, the issue with such a direct approach is that the steady HJB equation generally yields more than one solution, hence directly employing PINNs to solve it can lead to divergence of the method. To tackle this problem, we instead apply PINNs to a finite-horizon variant of the steady-state HJB equation which has a unique solution, and which uniformly approximates the infinite-horizon optimal value function as the horizon increases. A method to verify whether the selected horizon is large enough is also provided, as well as an algorithm to increase it with reduced computations if it is not. Unlike conventional methods, the proposed approach does not require knowledge of a stabilizing controller, the execution of computationally expensive iterations, or polynomial basis functions for approximation.
|
|
11:40-12:00, Paper FrA01.6 | Add to My Program |
Worst-Case Control and Learning Using Partial Observations Over an Infinite Time Horizon (I) |
|
Dave, Aditya | Cornell University |
Faros, Ioannis | University of Delaware |
Senthil Kumar, Nishanth Venkatesh | Cornell University |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Machine learning, Robust control, Uncertain systems
Abstract: Safety-critical cyber-physical systems require control strategies whose worst-case performance is robust against adversarial disturbances and modeling uncertainties. In this paper, we present a framework for approximate control and learning in partially observed systems to minimize the worst-case discounted cost over an infinite time horizon. We model disturbances to the system as finite-valued uncertain variables with unknown probability distributions. For problems with known system dynamics, we construct a dynamic programming (DP) decomposition to compute the optimal control strategy. Our first contribution is to define information states that improve the computational tractability of this DP for a class of problems with observable incurred costs at each time instance. Our second contribution proposes approximate information states that can be constructed or learned directly from observed data for these problems. We derive bounds on the performance loss of the resulting approximate control strategy and illustrate the effectiveness of our approach in partially observed decision-making problems with a numerical example.
|
|
FrA02 Invited Session, Orchid Main 4202-4303 |
Add to My Program |
Data-Driven Verification and Control of Cyber-Physical Systems I |
|
|
Chair: Lavaei, Abolfazl | Newcastle University |
Co-Chair: Jungers, Raphaël M. | University of Louvain |
Organizer: Lavaei, Abolfazl | Newcastle University |
Organizer: Jungers, Raphaël M. | University of Louvain |
Organizer: Abate, Alessandro | University of Oxford |
|
10:00-10:20, Paper FrA02.1 | Add to My Program |
Data-Driven Stability Certificate of Interconnected Homogeneous Networks Via ISS Properties |
|
Lavaei, Abolfazl | Newcastle University |
Angeli, David | Imperial College |
Keywords: Stability of nonlinear systems, Network analysis and control, Large-scale systems
Abstract: This letter is concerned with a compositional data-driven approach for stability certificate of interconnected homogeneous networks with (partially) unknown dynamics while providing 100% correctness guarantees (as opposed to probabilistic confidence). The proposed framework enjoys input-to-state stability (ISS) properties of subsystems described by ISS Lyapunov functions. In our data-driven scheme, we first reformulate the corresponding conditions of ISS Lyapunov functions as a robust optimization program (ROP). Due to appearing unknown dynamics of subsystems in the constraint of ROP, we propose a scenario optimization program (SOP) by collecting data from trajectories of each unknown subsystem. We solve SOP and construct an ISS Lyapunov function for each subsystem with unknown dynamics. We accordingly leverage a compositional technique based on max-type small-gain reasoning and construct a Lyapunov function for an unknown interconnected network based on ISS Lyapunov functions of individual subsystems. We demonstrate the efficacy of our data-driven approach over a room temperature network containing 1000 rooms with unknown dynamics. Given collected data from each unknown room, we verify that the unknown interconnected network is globally asymptotically stable (GAS) with 100% correctness guarantee.
|
|
10:20-10:40, Paper FrA02.2 | Add to My Program |
Data-Driven Control with Inherent Lyapunov Stability (I) |
|
Min, Youngjae | MIT |
Richards, Spencer M. | Stanford University |
Azizan, Navid | MIT |
Keywords: Data driven control, Lyapunov methods, Machine learning
Abstract: Recent advances in learning-based control leverage deep function approximators, such as neural networks, to model the evolution of controlled dynamical systems over time. However, the problem of learning a dynamics model and a stabilizing controller persists, since the synthesis of a stabilizing feedback law for known nonlinear systems is a difficult task, let alone for complex parametric representations that must be fit to data. To this end, we propose Control with Inherent Lyapunov Stability (CoILS), a method for jointly learning parametric representations of a nonlinear dynamics model and a stabilizing controller from data. To do this, our approach simultaneously learns a parametric Lyapunov function which intrinsically constrains the dynamics model to be stabilizable by the learned controller. In addition to the stabilizability of the learned dynamics guaranteed by our novel construction, we show that the learned controller stabilizes the true dynamics under certain assumptions on the fidelity of the learned dynamics. Finally, we demonstrate the efficacy of CoILS on a variety of simulated nonlinear dynamical systems.
|
|
10:40-11:00, Paper FrA02.3 | Add to My Program |
Data-Driven Abstractions Via Adaptive Refinements and a Kantorovich Metric (I) |
|
Banse, Adrien | UCLouvain |
Romao, Licio | University of Oxford |
Abate, Alessandro | University of Oxford |
Jungers, Raphaël M. | University of Louvain |
Keywords: Sampled-data control, Iterative learning control, Behavioural systems
Abstract: We introduce an adaptive refinement procedure for smart and scalable abstraction of dynamical systems. Our technique relies on partitioning the state space depending on the observation of future outputs. However, this knowledge is dynamically constructed in an adaptive, asymmetric way. In order to learn the optimal structure, we define a Kantorovich-inspired metric between Markov chains, and we use it to guide the state partition refinement. Our technique is prone to data-driven frameworks, but not restricted to. We also study properties of the above mentioned metric between Markov chains, which we believe could be of broader interest. We propose an algorithm to approximate it, and we show that our method yields a much better computational complexity than using classical linear programming techniques.
|
|
11:00-11:20, Paper FrA02.4 | Add to My Program |
Contraction-Guided Adaptive Partitioning for Reachability Analysis of Neural Network Controlled Systems (I) |
|
Harapanahalli, Akash | Georgia Institute of Technology |
Jafarpour, Saber | Georgia Institute of Technology |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Neural networks, Autonomous systems, Nonlinear systems
Abstract: In this paper, we present a contraction-guided adaptive partitioning algorithm for improving interval-valued robust reachable set estimates in a nonlinear feedback loop with a neural network controller and disturbances. Based on an estimate of the contraction rate of over-approximated intervals, the algorithm chooses when and where to partition. Then, by leveraging a decoupling of the neural network verification step and reachability partitioning layers, the algorithm can provide accuracy improvements for little computational cost. This approach is applicable with any sufficiently accurate open-loop interval-valued reachability estimation technique and any method for bounding the input-output behavior of a neural network. Using contraction-based robustness analysis, we provide guarantees of the algorithm's performance with mixed monotone reachability. Finally, we demonstrate the algorithm's performance through several numerical simulations and compare it with existing methods in the literature. In particular, we report a sizable improvement in the accuracy of reachable set estimation in a fraction of the runtime as compared to state-of-the-art methods.
|
|
11:20-11:40, Paper FrA02.5 | Add to My Program |
Certified Vision-Based State Estimation for Autonomous Landing Systems Using Reachability Analysis (I) |
|
Santa Cruz Leal, Ulices | University of California Irvine |
Shoukry, Yasser | University of California, Irvine |
Keywords: Formal Verification/Synthesis, Vision-based control, Machine learning
Abstract: This paper studies the problem of designing a certified vision-based state estimator for autonomous landing systems. In such a system, a neural network (NN) processes images from a camera to estimate the aircraft's relative position with respect to the runway. We propose an algorithm to design such NNs with certified properties in terms of their ability to detect runways and provide accurate state estimation. At the heart of our approach is the use of geometric models of perspective cameras to obtain a mathematical model that captures the relation between the aircraft states and the inputs. We show that such geometric models enjoy mixed monotonicity properties that can be used to design state estimators with certifiable error bounds. We show the effectiveness of the proposed approach using an experimental testbed on data collected from event-based cameras.
|
|
11:40-12:00, Paper FrA02.6 | Add to My Program |
MDP Abstractions from Data: Large-Scale Stochastic Networks (I) |
|
Lavaei, Abolfazl | Newcastle University |
Keywords: Stochastic systems, Data driven control, Network analysis and control
Abstract: This work proposes a compositional data-driven technique for the construction of finite Markov decision processes (MDPs) for large-scale stochastic networks with unknown mathematical models. Our proposed framework leverages dissipativity properties of subsystems and their finite MDPs using a notion of stochastic storage functions (SStF). In our data-driven scheme, we first build an SStF between each unknown subsystem and its data-driven finite MDP with a certified probabilistic confidence. We then derive dissipativity-type compositional conditions to construct a stochastic bisimulation function (SBF) between an interconnected network and its finite MDP using data-driven SStF of subsystems. Accordingly, we formally quantify the probabilistic distance between trajectories of an unknown large-scale stochastic network and those of its finite MDP with a guaranteed confidence. We illustrate the efficacy of our data-driven results over a room temperature network composing 100 rooms with unknown models.
|
|
FrA03 Invited Session, Orchid Main 4204-4305 |
Add to My Program |
Safe Planning and Control with Uncertainty Quantification III |
|
|
Chair: Gao, Yulong | University of Oxford |
Co-Chair: Motee, Nader | Lehigh University |
Organizer: Gao, Yulong | University of Oxford |
Organizer: Lindemann, Lars | University of Southern California |
Organizer: Fan, Chuchu | Massachusetts Institute of Technology |
Organizer: Abate, Alessandro | University of Oxford |
Organizer: Pappas, George J. | University of Pennsylvania |
|
10:00-10:20, Paper FrA03.1 | Add to My Program |
Safe Navigation of Networked Robots under Localization Uncertainty Using Robust Control Barrier Functions (I) |
|
Miksits, Adam | Ericsson Research |
Barbosa, Fernando | Ericsson Research |
Lindhé, Magnus | Ericsson Research |
Araujo, Jose | Ericsson Research |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Robotics, Autonomous robots, Uncertain systems
Abstract: 5G networks have the potential to provide external sensor data and offload computations for future industrial mobile robots. To enable these benefits while maintaining safety, we propose a modular architecture, including an onboard safety filter for the velocity control loop. The safety filter leverages robust control barrier functions to guarantee safety from collisions under bounded localization uncertainty. Initial experiments are performed to quantify the localization uncertainty and generate suitable bounds for the safety filter. We then derive the safety filter, and analyze its conservatism numerically. Finally, the method is demonstrated in experiments using an ABB Mobile YuMi® Research Platform robot.
|
|
10:20-10:40, Paper FrA03.2 | Add to My Program |
Safe Perception-Based Control under Stochastic Sensor Uncertainty Using Conformal Prediction (I) |
|
Yang, Shuo | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Mangharam, Rahul | University of Pennsylvania |
Lindemann, Lars | University of Southern California |
Keywords: Vision-based control, Machine learning, Autonomous systems
Abstract: We consider perception-based control using state estimates that are obtained from high-dimensional sensor measurements via learning-enabled perception maps. However, these perception maps are not perfect and result in state estimation errors that can lead to unsafe system behavior. Stochastic sensor noise can make matters worse and result in estimation errors that follow unknown distributions. We propose a perception-based control framework that i) quantifies estimation uncertainty of perception maps, and ii) integrates these uncertainty representations into the control design. To do so, we use conformal prediction to compute valid state estimation regions, which are sets that contain the unknown state with high probability. We then devise a sampled-data controller for continuous-time systems based on the notion of measurement robust control barrier functions. Our controller uses idea from self-triggered control and enables us to avoid using stochastic calculus. Our framework is agnostic to the choice of the perception map, independent of the noise distribution, and to the best of our knowledge the first to provide probabilistic safety guarantees in such a setting. We demonstrate the effectiveness of our proposed perception-based controller for a LiDAR-enabled F1/10th car.
|
|
10:40-11:00, Paper FrA03.3 | Add to My Program |
Data-Driven IQC-Based Robust Control Design for Hybrid Micro-Disturbance Isolation Platform (I) |
|
Gupta, Vaibhav | École Polytechnique Fédérale De Lausanne (EPFL) |
Klauser, Elias | CSEM SA |
Karimi, Alireza | EPFL |
Keywords: Control applications, Data driven control, Uncertain systems
Abstract: A novel approach for robust controller synthesis, which models uncertainty as an elliptical set, is proposed in the paper. Given a set of frequency response functions of linear time-invariant (LTI) multiple-input multiple-output (MIMO) systems, the approach determines the `best' linear nominal model and the corresponding elliptical uncertainty set, which is consistent with the data. Using a novel split representation, the uncertainty set is represented as an equivalent integral quadratic constraint (IQC). Finally, this IQC is integrated into a data-driven frequency-domain controller synthesis method using convex optimisation. The proposed method is used to design a controller, which is robust against mechanical uncertainties, for a hybrid micro-disturbance isolation platform for space applications. The experimental results show that the proposed method provides a `tighter' uncertainty set and improves attenuation performance compared to classical methods that use disk uncertainty.
|
|
11:00-11:20, Paper FrA03.4 | Add to My Program |
Impact of Misperception on Emergence of Risk in Platoon of Autonomous Vehicles (I) |
|
Amini, Arash | The University of Texas at Austin |
Liu, Guangyi | Lehigh University |
Pandey, Vivek | Lehigh University |
Motee, Nader | Lehigh University |
Keywords: Network analysis and control, Networked control systems, Autonomous vehicles
Abstract: The emergence of advanced perception algorithms has opened the possibility of achieving long-term autonomy in vehicle platooning. We develop a framework to assess the risk of misperception due to noisy observations from the environment. Each vehicle is assumed to rely on a perception unit to comprehend its surrounding environment and estimate other vehicles' positions and velocity. The Expected Shortfall (Average Value-at-Risk) measure is employed to assess the risk of collision between vehicle pairs and the risk of violating traffic laws for each vehicle under possible misperceptions. Obtaining an explicit expression for the risk measure allows us to further explore the potential trade-offs between the overall misperception-induced risks and network architecture. Using our framework, we demonstrate how misperception of highway traffic signs may cause phenomena similar to tailgating and quantify how it affects the risk of such events. Our result can also be applied to identify vehicles with high fragility to misperception and to increase the robustness of the platoon concerning overall risk measures in the presence of misperception. Our theoretical results are validated by extensive simulation.
|
|
11:20-11:40, Paper FrA03.5 | Add to My Program |
Formal Verification of Attitude Control Systems Using Geometric Barrier Functions (I) |
|
Xu, Chencheng | Zhejiang University |
Zhao, Chengcheng | Zhejiang University |
Shi, Zhiguo | Zhejiang Univesity |
Chen, Jiming | Zhejiang University |
Keywords: Formal Verification/Synthesis, Autonomous robots
Abstract: Compared to safe obstacle avoidance in the position space, ensuring the safety of the attitude system in today's aerial vehicle operations is more challenging due to the non-Euclidean nature of the attitude space and the underactuated nature of the system. To address this issue, we first propose a Geometric Exponential Barrier Condition (GEBC) to produce barrier certificates on the manifold, by which attitude safety requirements can be encoded globally into the verification problems of attitude control systems. Then, we use exponential coordinates to characterize GEBCs, which makes them descriable in terms of Quantifier Free Real Arithmetic Logic (QF-NRA) and efficiently solvable by SMT solvers. A performance criterion is further discussed where we propose an effective algorithm to construct safe operational regions with different controllers, which can help with nominal controller selection and tuning. Finally, we demonstrate our approach in a quadrotor system and analyze the safe performance of two PD controllers on the proposed safe operation criteria.
|
|
11:40-12:00, Paper FrA03.6 | Add to My Program |
Data-Driven Model Reference Control: A Geometric Approach (I) |
|
Padoan, Alberto | ETH Zürich |
Coulson, Jeremy | University of Wisconsin-Madison |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
|
FrA04 Regular Session, Simpor Junior 4913 |
Add to My Program |
Autonomous Vehicles II |
|
|
Chair: Axehill, Daniel | Linköping University |
Co-Chair: Chen, Ben M. | Chinese University of Hong Kong |
|
10:00-10:20, Paper FrA04.1 | Add to My Program |
Scenario-Based Hybrid Model Predictive Design for Cooperative Adaptive Cruise Control in Mixed-Autonomy Environments |
|
Mosharafian, Sahand | University of Georgia |
Bao, Yajie | The University of Georgia |
Mohammadpour Velni, Javad | Clemson University |
Keywords: Autonomous vehicles, Hybrid systems, Stochastic optimal control
Abstract: This paper presents a scenario-based hybrid model predictive control (MPC) design approach for cooperative adaptive cruise control (CACC) in mixed-autonomy traffic environments with uncertainties stemming from unexpected maneuvers of human-driven vehicles. Different from the past works that consider one possible realization of uncertainty, the proposed approach here considers multiple scenarios based on the uncertainty description that varies with the relative location of the human-driven vehicles (HVs) and the connected and automated vehicles (CAVs). For each scenario, a mixed integer quadratic programming problem is formulated for the control of CAVs with four operating modes, namely free following, braking, danger, and lane change. Each CAV's operating mode is determined based on the predictive information it receives from its predecessors and the anticipated behaviors of surrounding HVs. All the scenarios are handled simultaneously using the scenario-based MPC approach for a robust CACC. Simulations in a mixed-autonomy traffic system including two lanes demonstrate that the proposed scenario-based hybrid MPC approach significantly reduces deviations from the desired spacing policy and the desired velocity in the platoon during unexpected human-driven vehicle maneuvers, compared with the past work, particularly, a discrete hybrid stochastic (DHSA) MPC approach.
|
|
10:20-10:40, Paper FrA04.2 | Add to My Program |
Energy-Optimal Trajectory-Based Traveling Salesman Problem for Multi-Rotor Unmanned Aerial Vehicles |
|
Gao, Chuanxiang | The Chinese University of Hong Kong |
Ding, Wendi | The Chinese University of Hong Kong |
Zuoquan, Zhao | The Chinese University of Hong Kong |
Chen, Ben M. | Chinese University of Hong Kong |
Keywords: Autonomous vehicles, Optimization, Evolutionary computing
Abstract: The trajectory-based traveling salesman problem represents an extension of the classical traveling salesman problem, aimed at determining the most optimal trajectory that passes through a designated set of points. This paper introduces a novel formulation, termed the Energy-Optimal Trajectory-Based Traveling Salesman Problem (EOTB-TSP), which is grounded in an innovative energy assessment model. This model takes into account the intricate dynamics of unmanned aerial vehicles (UAVs). In addition, the EOTB-TSP is cast as a bilevel optimization challenge. To tackle this complex problem, we introduce a modified genetic algorithm specifically tailored for its resolution. To validate the effectiveness of our proposed approach, we conduct a series of experiments and apply it to real-world scenarios. Our evaluation and comparative analyses unequivocally demonstrate the high efficiency of our method in minimizing energy consumption.
|
|
10:40-11:00, Paper FrA04.3 | Add to My Program |
On Integrated Optimal Task and Motion Planning for a Tractor-Trailer Rearrangement Problem |
|
Hellander, Anja | Linköping University |
Bergman, Kristoffer | Linköping University |
Axehill, Daniel | Linköping University |
Keywords: Autonomous vehicles, Optimal control, Optimization algorithms
Abstract: In this work, a combined task and motion planner for a tractor and a set of trailers is proposed and it is shown that it is resolution complete and resolution optimal. The proposed planner consists of a task planner and a motion planner that are both based on heuristically guided graph-search. As a step towards tighter integration of task and motion planning, we use the same heuristic that is used by the motion planner in the task planner as well. We further propose to use the motion planner heuristic to give an initial underestimate of the motion costs that are used as costs during the task planning search, and increase this estimate gradually by using the motion planner to verify the cost and feasibility of actions along paths of interest. To limit the time spent in the motion planner, the use of time and cost limits to pause or prematurely abort the motion planner is proposed, which does not affect the resolution completeness or resolution optimality. The planner is evaluated on numerical examples and the results show that the proposed planner can significantly reduce the execution time compared to a baseline resolution optimal task and motion planner.
|
|
11:00-11:20, Paper FrA04.4 | Add to My Program |
Interaction-Aware Trajectory Prediction and Planning in Dense Highway Traffic Using Distributed Model Predictive Control |
|
Börve, Erik | Chalmers University of Technology |
Murgovski, Nikolce | Chalmers University of Technology |
Laine, Leo | Chalmers |
Keywords: Autonomous vehicles, Optimal control, Predictive control for nonlinear systems
Abstract: In this paper we treat optimal trajectory planning for an autonomous vehicle (AV) operating in dense traffic, where vehicles closely interact with each other. To tackle this problem, we present a novel framework that couples trajectory prediction and planning in multi-agent environments, using distributed model predictive control. A demonstration of our framework is presented in simulation, employing a trajectory planner using non-linear model predictive control. We analyze performance and convergence of our framework, subject to different prediction errors. The results indicate that the obtained locally optimal solutions are improved, compared with decoupled prediction and planning.
|
|
11:20-11:40, Paper FrA04.5 | Add to My Program |
Model Predictive Control with Collision Avoidance for Unknown Environment |
|
Silvestre, Daniel | NOVA University of Lisbon |
Ramos, Guilherme | Instituto De Telecomunicações, 1049-001 Lisbon, Portugal |
Keywords: Autonomous vehicles, Predictive control for linear systems
Abstract: This paper proposes a model predictive control framework to design autonomous rendezvous operations for terrestrial and space missions in the scope of spacecraft and drones, in the presence of obstacles in an unknown environment. Vehicles equipped with a LiDAR collecting points associated with the obstacles are considered. By proposing an efficient method to compute the smallest ellipse that contains the collected LiDAR points, this work is able to correct, in run time, the vehicle trajectory to avoid collision and reach the rendezvous target. Finally, simulations of the proposed approach to show its effectiveness are presented.
|
|
FrA05 Invited Session, Simpor Junior 4912 |
Add to My Program |
Policy Optimization Methods and Data-Driven Learning-Based Control |
|
|
Chair: Tang, Yujie | Peking University |
Co-Chair: You, Keyou | Tsinghua University |
Organizer: Tang, Yujie | Peking University |
Organizer: You, Keyou | Tsinghua University |
|
10:00-10:20, Paper FrA05.1 | Add to My Program |
Learning to Control under Communication Constraints |
|
Aggarwal, Shubham | University of Illinois, Urbana Champaign |
Velicheti, Raj Kiriti | University of Illinois at Urbana Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Networked control systems, Optimization, Constrained control
Abstract: How to effectively communicate over wireless networks characterized by link failures is central to understanding the fundamental limits in the performance of a networked control system. In this paper, we study the online remote control of linear-quadratic Gaussian systems over unreliable wireless channels (with random packet drops), where the controller is a priori oblivious to the cost parameters. We first reformulate the problem using a semi-definite program and consequently compute a stabilizing policy from its solution. We then derive a O(sqrt(T)) regret bound (against a best offline policy in hindsight) for a projected online gradient algorithm, where T is the length of the horizon of interest. In the process, we introduce finite time notions of the classical mean-square stability, which may be of independent interest. Finally, we provide a numerical example to validate the theoretical results, demonstrating the limitations induced by lossy communication on the control performance.
|
|
10:20-10:40, Paper FrA05.2 | Add to My Program |
On the Global Optimality of Direct Policy Search for Nonsmooth H∞ Output-Feedback Control (I) |
|
Tang, Yujie | Peking University |
Zheng, Yang | University of California San Diego |
Keywords: Optimization, Robust control, Optimal control
Abstract: Direct policy search has achieved great empirical success in reinforcement learning. Recently, there has been increasing interest in studying its theoretical properties for continuous control, and fruitful results have been established for linear quadratic regulator (LQR) and linear quadratic Gaussian (LQG) control that are smooth and nonconvex. In this paper, we consider the standard H∞ robust control for output feedback systems and investigate the global optimality of direct policy search. Unlike LQR or LQG, the H∞ cost function is nonsmooth in the policy space. Despite the lack of smoothness and convexity, our main result shows that for a class of non-degenerate stabilizing controllers, all Clarke stationary points of H∞ robust control are globally optimal and there is no spurious local minimum. Our proof technique is motivated by the idea of differentiable convex liftings (DCL), and we extend DCL to analyze the nonsmooth and nonconvex H∞ robust control via convex reformulation. Our result sheds some light on the analysis of direct policy search for solving nonsmooth and nonconvex robust control problems.
|
|
10:40-11:00, Paper FrA05.3 | Add to My Program |
Data-Driven Self-Triggering Mechanism for State Feedback Control |
|
Liu, Wenjie | Beijing Institute of Technology, Beijing, China |
Li, Yifei | Beijing Institute of Technology |
Sun, Jian | Beijing Institute of Technology |
Wang, Gang | Beijing Institute of Technology |
Chen, Jie | Beijing Institute of Technology |
Keywords: Data driven control, Predictive control for linear systems, Networked control systems
Abstract: This paper presents a novel approach for data-driven self-triggered state feedback control of unknown linear systems using noisy data gathered offline. The self-triggering mechanism determines the next triggering time by checking whether the difference between the predicted state and the current state is significant or not. However, when the system matrices are unknown, the challenge lies in characterizing the distance between future states and the current state using only data. To address this, we put forth a data-driven online optimization problem for trajectory prediction by using noisy input-state data. Its optimal solution, together with another unknown parameter that reflects the open-loop divergence rate, is shown sufficient for explicitly quantifying the distance. Moreover, a data-driven set-based over-approximating algorithm using matrix zonotopes is subsequently proposed to upper-bound the {open-loop} divergence rate. Leveraging the optimal solution and the upper bound, a self-triggering mechanism is devised for state feedback control systems, which is proven to ensure input-to-state stability. Numerical examples are presented to validate the effectiveness of the proposed method.
|
|
11:00-11:20, Paper FrA05.4 | Add to My Program |
Data-Enabled Policy Optimization for the Linear Quadratic Regulator (I) |
|
Zhao, Feiran | Tsinghua University |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
You, Keyou | Tsinghua University |
Keywords: Learning, Optimization, Sampled-data control
Abstract: Policy optimization (PO), an essential approach of reinforcement learning for a broad range of system classes, requires significantly more system data than indirect (identification-followed-by-control) methods or behavioral-based direct methods even in the simplest linear quadratic regulator (LQR) problem. In this paper, we take an initial step towards bridging this gap by proposing the data-enabled policy optimization (DeePO) method, which requires only a finite number of sufficiently exciting data to iteratively solve the LQR problem via PO. Based on a data-driven closed-loop parameterization, we are able to directly compute the policy gradient from a batch of persistently exciting data. Next, we show that the nonconvex PO problem satisfies a projected gradient dominance property by relating it to an equivalent convex program, leading to the global convergence of DeePO. Moreover, we apply regularization methods to enhance the certainty-equivalence and robustness of the resulting controller and show an implicit regularization property. Finally, we perform simulations to validate our results.
|
|
11:20-11:40, Paper FrA05.5 | Add to My Program |
Toward Understanding State Representation Learning in MuZero: A Case Study in Linear Quadratic Gaussian Control (I) |
|
Tian, Yi | MIT |
Zhang, Kaiqing | University of Maryland, College Park |
Tedrake, Russ | MIT |
Sra, Suvrit | MIT |
Keywords: Data driven control, Observers for Linear systems, Vision-based control
Abstract: We study the problem of representation learning for control from partial and potentially high-dimensional observations. We approach this problem via direct latent model learning, where one directly learns a dynamical model in some latent state space by predicting costs. In particular, we establish finite-sample guarantees of finding a near-optimal representation function and a near-optimal controller using the directly learned latent model for infinite-horizon time-invariant Linear Quadratic Gaussian (LQG) control. A part of our approach to latent model learning closely resembles MuZero, a recent breakthrough in empirical reinforcement learning, in that it learns latent dynamics implicitly by predicting cumulative costs. A key technical contribution of this work is to prove persistency of excitation for a new stochastic process that arises from our analysis of quadratic regression in our approach.
|
|
11:40-12:00, Paper FrA05.6 | Add to My Program |
Exact Subspace Diffusion for Decentralized Multitask Learning (I) |
|
Wadehra, Shreya | Imperial College London |
Nassif, Roula | Universite Cote D'Azur |
Vlaski, Stefan | Imperial College London |
Keywords: Learning, Optimization algorithms, Adaptive systems
Abstract: Classical paradigms for distributed learning, such as federated or decentralized gradient descent, employ consensus mechanisms to enforce homogeneity among agents. While these strategies have proven effective in i.i.d. scenarios, they can result in significant performance degradation when agents follow heterogeneous objectives or data. Distributed strategies for multitask learning, on the other hand, induce relationships between agents in a more nuanced manner, and encourage collaboration without enforcing consensus. We develop a generalization of the exact diffusion algorithm for subspace constrained multitask learning over networks, and derive an accurate expression for its mean-squared deviation when utilizing noisy gradient approximations. We verify numerically the accuracy of the predicted performance expressions, as well as the improved performance of the proposed approach over alternatives based on approximate projections.
|
|
FrA06 Invited Session, Simpor Junior 4911 |
Add to My Program |
Estimation and Control of Infinite Dimensional Systems II |
|
|
Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Co-Chair: Burns, John A | Virginia Tech |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Burns, John A | Virginia Tech |
|
10:00-10:20, Paper FrA06.1 | Add to My Program |
Adaptive Spatial PID and PD Coupling in Synchronization Control of Collocated Infinite and Finite Dimensional Systems (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems
Abstract: This paper presents an entirely different approach for the synchronization of identical networked infinite dimensional systems. With a large class of infinite dimensional systems representing partial differential equations (PDEs), the concept of a functional form of the consensus protocol used for synchronization is applied here and incorporates spatial derivatives and spatial averages of the differences of the PDE states. This leads to spatial PD-type of consensus protocols for synchronization of PDEs. When the networked PDEs are tasked with following a leader, also described by a PDE of the same type, an added component of the controller is incorporated to ensure leader following. The proposed PD-coupling in the synchronization control of infinite dimensional systems attains a new form for the finite dimensional case, where now a temporal PID coupling in the consensus protocol is implemented. Simulation studies for both the infinite and the finite dimensional cases are included to demonstrate the effects of the non-traditional coupling in the synchronization control of networked systems.
|
|
10:20-10:40, Paper FrA06.2 | Add to My Program |
Consensus of Networked Hyperbolic Systems Via Event-Triggered Boundary Feedback Control (I) |
|
Lu, Mengyao | Beijing University of Technology |
Zhan, Jingyuan | Beijing University of Technology |
Zhang, Liguo | Beijing University of Technology |
Keywords: Distributed parameter systems, Cooperative control, Traffic control
Abstract: This paper investigates the consensus problem of networked hyperbolic partial differential equation (PDE) systems for reaching an agreement over the whole spatial domain via event-triggered boundary feedback control. Consensus controllers are proposed for each PDE system based on the boundary information of its neighboring systems, where both centralized and distributed event-triggered strategies are designed in order to reduce the controller updating frequency. By employing the Lyapunov technique, sufficient conditions with respect to system matrices, event-triggered conditions and the undirected communication topology are obtained to ensure consensus of the networked systems, and it is proved that the Zeno behavior can also be avoided. Finally, the consensus control of a three-lane freeway traffic flow system modeled by Aw-Rascle-Zhang Equations is given as an application example, and the numerical simulation is carried out to validate the theoretical results.
|
|
10:40-11:00, Paper FrA06.3 | Add to My Program |
Stabilization by 1D Boundary Actuation of Distal 1D Reaction-Diffusion PDE through Heat PDE on a Rectangle (I) |
|
Guan, Dandan | Donghua University |
Qi, Jie | Donghua University |
Krstic, Miroslav | University of California, San Diego |
Keywords: Backstepping, Distributed parameter systems
Abstract: This paper presents a backstepping control design method of stabilization unstable 1D reaction-diffusion system, where the input is a 1D function on an edge of a rectangle, the distal system is a 1D reaction-diffusion PDE on the opposite edge of the rectangle, and the actuator dynamics in between are a 2D heat PDE on the rectangle between the opposite edges. A novel invertible integral transformation is introduced and the resulting controller with feedback of both PDEs' states (the distal 1D and the interior 2D states). We define a new Lyapunov function that contains cosine coefficients to prove the exponential stability in H^2 norm of the closed-loop system. Finally, the theoretical result is illustrated by simulations on a numerical example.
|
|
11:00-11:20, Paper FrA06.4 | Add to My Program |
Stability of Linear KdV Equation in a Network with Bounded and Unbounded Lengths (I) |
|
Parada, Hugo | Université Grenoble Alpes, Laboratoire Jean Kunzmann |
Crépeau, Emmanuelle | Université De Versailles Saint Quentin |
Prieur, Christophe | CNRS |
Keywords: Distributed parameter systems, Stability of linear systems, Network analysis and control
Abstract: In this work, we study the exponential stability of a system of linear Korteweg-de Vries (KdV) equations interconnected through the boundary conditions on a star-shaped network structure. On each branch of the network, we define a linear KdV equation defined on a bounded domain (0,ℓj) or the half-line (0,∞). We start by proving well-posedness using semigroup theory and then some hidden regularity results. Then, we state the exponential stability of the linear KdV equation by acting with a damping term on not all the branches. This is proved by using compactness argument deriving a suitable observability inequality.
|
|
11:20-11:40, Paper FrA06.5 | Add to My Program |
Pressure Stabilized POD Reduced Order Model for Control of Viscous Incompressible Flows (I) |
|
Ravindran, S.S. | University of Alabama in Huntsville |
Keywords: Reduced order modeling, Computational methods, Fluid flow systems
Abstract: In this paper, we propose a new pressure-stabilized proper orthogonal decomposition reduced order model (POD- ROM) for the control of viscous incompressible flows. It is a velocity-pressure ROM that uses pressure modes as well to compute the reduced order pressure needed for instance in the control drag and lift forces on bodies in the flow. We also propose and analyze a decoupled time-stepping scheme that uncouples the computation of velocity and pressure variables. It allows us at each time step to solve linear problems, uncoupled in pressure and velocity, which can greatly improve computational efficiency. Numerical studies are performed to discuss the accuracy and performance of the new pressure- stabilized ROM in the simulation of control of flow past a forward-facing step channel
|
|
11:40-12:00, Paper FrA06.6 | Add to My Program |
Delta-Method Induced Confidence Bands for a Parameter-Dependent Evolution System with Application to Transdermal Alcohol Concentration Monitoring (I) |
|
Liu, Haoxing | University of Southern California |
Goldstein, Larry | University of Southern California |
Luczak, Susan | University of Southern California |
Rosen, I. Gary | Univ. of Southern California |
Keywords: Estimation, Distributed parameter systems, Healthcare and medical systems
Abstract: Uncertainty caused by parameter randomness in a system modeling the relationship between alcohol concentration level in the blood (BAC) or breath (BrAC) and transdermal alcohol concentration (TAC) measured on the surface of the skin by a wearable, non-invasive, electro-chemical biosensor is considered. The parameter-dependent impulse response function (IRF) and system output in the form of a convolution are expressed in terms of an analytic semigroup of operators with regularly dissipative generator set in a Gelfand triple of Hilbert spaces. The Fr'echet derivative of the analytic semigroup in this setting is used to study the variation in the response function resulting from the uncertainties in the parameters described by probability distributions whose statistics depend on regression models with covariates as predictor variables. Finite dimensional approximation of the infinite dimensional state space system and the multi-variate delta method for nonlinear functions of random vectors with asymptotically normal distributions are used to obtain approximating uniform confidence bands for the IRF and the TAC output signal. Convergence of the approximations and three different techniques for obtaining the confidence bands are analyzed and compared.
|
|
FrA07 Invited Session, Simpor Junior 4813 |
Add to My Program |
Identification, Optimization, and Games for Stochastic Systems |
|
|
Chair: Zhao, Wenxiao | Academy of Mathematics and Systems Science, Chinese Academy of Sciences |
Co-Chair: Mu, Biqiang | Chinese Academy of Sciences |
Organizer: Zhao, Wenxiao | Academy of Mathematics and Systems Science, Chinese Academy of Sciences |
Organizer: Qi, Hongsheng | AMSS, Chinese Academy of Sciences |
Organizer: Mu, Biqiang | Chinese Academy of Sciences |
|
10:00-10:20, Paper FrA07.1 | Add to My Program |
Sparse Plus Low-Rank Identification of Latent-Variable Graphical ARMA Models (I) |
|
You, Junyao | Beijing Institute of Technology |
Yu, Chengpu | Beijing Institute of Technology |
Keywords: Identification, Estimation
Abstract: This paper deals with the identification of graphical autoregressive moving-average (ARMA) models with latent variables. Combining sparse structural characteristics of the graphical model with low-rank modeling of the latent variables, a sparse plus low-rank based iterative identification algorithm is proposed. The topological information embedded in the sparse AR dynamics is estimated from a regularized Yule-Walker optimization problem, which is then treated as prior graphical structure constraint. The latent-variable plus MA part is identified by solving a convex constrained trace norm minimization problem. Based on the MA part estimate and the structural constraint, the graphical AR estimates are updated by the sparse plus low-rank optimization framework and are then used for the update of the latent-variable plus MA part. The effectiveness of the proposed method is illustrated through a simulation study.
|
|
10:20-10:40, Paper FrA07.2 | Add to My Program |
A Family of Hyper-Parameter Estimators for Regularized Linear System Identification (I) |
|
Zhang, Meng | Academy of Mathematics and System Science, Chinese Academy of Sc |
Chen, Tianshi | The Chinese University of Hong Kong, Shenzhen, China |
Mu, Biqiang | Chinese Academy of Sciences |
Keywords: Identification, Linear systems
Abstract: Hyper-parameter estimation is one of the fundamental issues for kernel-based regularized system identification methods. Empirical Bayes (EB) estimator and Stein's unbiased risk estimator (SURE) are two popular hyper-parameter estimators, but they both have advantages and disadvantages. Specifically, EB is not asymptotically optimal in the mean squared error (MSE) sense but SURE is, while SURE is more sensitive to ill-conditioned regression matrices but EB is more robust. In this paper, to find a better estimator by combining their strength and mitigating their weakness, we propose a family of hyper-parameter estimators by linking EB and SURE estimators together through an index. The finite sample and asymptotic properties of this family of estimators have been established. The Monte Carlo simulation results show that there does exist a ``middle" hyper-parameter estimator in this family that is superior to the EB and SURE.
|
|
10:40-11:00, Paper FrA07.3 | Add to My Program |
New Approach to Variable Selection for Nonparametric Nonlinear Systems (I) |
|
Ren, Xiaotao | Key Laboratory of Systems and Control, Academy of Mathematics An |
Zhao, Wenxiao | Academy of Mathematics and Systems Science, Chinese Academy of S |
Gao, Jinwu | Jilin University |
Keywords: Nonlinear systems identification
Abstract: Let the observation be generated by the nonlinear ARX (NARX) systems. A new method for variable selection of the nonlinear function within the system at any interested points is introduced. In contrast to most of the existing results, the new method is not based on optimizing a certain criterion, and estimates from the variable selection algorithm given in this paper are easy to update computationally in comparison with the criterion-optimization-based methods when new data arrive. Under reasonable conditions the estimates are proved to converge to the true contributing variables with probability one.
|
|
11:00-11:20, Paper FrA07.4 | Add to My Program |
An Efficient Implementation for Bayesian Manifold Regularization Method (I) |
|
Zhang, Junpeng | The Chinese University of Hong Kong, Shenzhen |
Ju, Yue | KTH Royal Institute of Technology |
Wahlberg, Bo | KTH Royal Institute of Technology |
Mu, Biqiang | Chinese Academy of Sciences |
Chen, Tianshi | The Chinese University of Hong Kong, Shenzhen, China |
Keywords: Identification, Estimation, Computational methods
Abstract: When applying the Bayesian manifold regularization method to function estimation problem with manifold constraints, the direct implementation has computational complexity mathcal{O}(N^3), where N is the number of input-output data measurements. This becomes particularly costly when N is large. In this paper, we propose a more efficient implementation based on the Kalman filter and smoother using a state-space model realization of the underlying Gaussian process. Moreover, we explore the sequentially semi-separable structure of the Laplacian matrix and the posterior covariance matrix. Our proposed implementation has computational complexity mathcal{O}(N) and thus can be applied to large data problems. We exemplify the effectiveness of our proposed implementation through numerical simulations.
|
|
11:20-11:40, Paper FrA07.5 | Add to My Program |
Multi-Agent Deep Reinforcement Learning for Large-Scale Platoon Coordination with Partial Information at Hubs (I) |
|
Wei, Dixiao | Tongji University |
Yi, Peng | Tongji University |
Lei, Jinlong | Tongji University |
Keywords: Machine learning, Agents-based systems, Traffic control
Abstract: This paper considers the hub-based platoon coordination problem in the large-scale transportation network, to promote cooperation among trucks and optimize the overall efficiency of the transportation network. We design a distributed communication model for transportation networks and transform the problem into a Dec-POMDP (Decentralized-Partial Observable Markov Decision Process). We then propose the A-QMIX deep reinforcement learning algorithm to solve the problem, which adopts centralized training and distributed execution and hence provides a reliable model for trucks to make quick decisions using only partial information. Finally, we carry out experiments with 100 and 1000 trucks in the transportation network of the Yangtze River Delta region in China to demonstrate the effectiveness and real-time performance of the proposed algorithm.
|
|
11:40-12:00, Paper FrA07.6 | Add to My Program |
Compressibility of Voter-Model State Snapshots in the Graph Spectral Basis (I) |
|
Zhu, Chenyan | Texas A&M University |
Roy, Sandip | Washington State University |
Keywords: Network analysis and control, Stochastic systems, Control of networks
Abstract: The sparsity or compressibility of network spread states in a graph-spectrum basis is examined, in the context of a stochastic model for influence/spread known as the voter model. In particular, first- and second- moments are characterized, for the graph-spectrum basis components contained in the voter-model state. These formal characterizations, as well as an asymptotic analysis of the voter model, are used to relate compressibility with the network's graph. An illustrative example as well as a simulation of a larger-scale model are included.
|
|
FrA08 Regular Session, Simpor Junior 4812 |
Add to My Program |
Optimal Control VII |
|
|
Chair: Khani, Alireza | University of Minnesota-Twin Cities |
Co-Chair: Shvartsman, Ilya | Penn State Harrisburg |
|
10:00-10:20, Paper FrA08.1 | Add to My Program |
Near-Optimal Control of Nonlinear Systems with Hybrid Inputs and Dwell-Time Constraints |
|
Lal-Fediuc, Ioana | Technical University of Cluj-Napoca |
Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Daafouz, Jamal | Université De Lorraine, CRAN, CNRS |
Busoniu, Lucian | Technical University of Cluj-Napoca |
Keywords: Optimal control, Switched systems, Numerical algorithms
Abstract: We propose two new optimistic planning algorithms for nonlinear hybrid-input systems, in which the input has both a continuous and a discrete component, and the discrete component must respect a dwell-time constraint. Both algorithms select sets of input sequences for refinement at each step, along with a continuous or discrete step to refine (split). The dwell-time constraint means that the discrete splits must keep the discrete mode constant if the required dwell-time is not yet reached. Convergence rate guarantees are provided for both algorithms, which show the dependency between the near-optimality of the sequence returned and the computational budget. The rates depend on a novel complexity measure of the dwell-time constrained problem. We present simulation results for two problems, an adaptive-quantization networked control system and a model for the COVID pandemic.
|
|
10:20-10:40, Paper FrA08.2 | Add to My Program |
Continuous Optimization for Control of Finite-State Machines with Cascaded Hysteresis Via Time-Freezing |
|
Van Roy, Wim | KU Leuven |
Nurkanovic, Armin | University of Freiburg |
Abbasi Esfeden, Ramin | KU Leuven |
Frey, Jonathan | University of Freiburg |
Pozharskiy, Anton | University of Freiburg |
Swevers, Jan | K. U. Leuven |
Diehl, Moritz | University of Freiburg |
Keywords: Optimal control, Switched systems, Optimization
Abstract: Control problems with Finite-State Machines (FSM) are often solved using integer variables, leading to a mixed-integer optimal control problem (MIOCP). This paper proposes an alternative method to describe a subclass of FSMs using complementarity constraints and time-freezing. The FSM from this subclass is built up by a sequence of states where a transition between the states is triggered by a single switching function. This can be looked at as a cascade of hysteresis loops where a memory effect is used to maintain the active state of the state machine. Based on the reformulation for hybrid systems with a hysteresis loop, a method is developed to reformulate this subclass in a similar fashion. The approach transforms the original problem into a Piecewise Smooth System (PSS), which can be discretized using the recently developed Finite Elements with Switch Detection, allowing for high-accuracy solutions. The reformulation is compared to a mixed-integer formulation from the literature on a time-optimal control problem. This work is a first step towards the general reformulation of FSMs into nonsmooth systems without integer states.
|
|
10:40-11:00, Paper FrA08.3 | Add to My Program |
Convexification of Robust Trajectory Planning Problems with Nominal State and Control Dependent Uncertainties |
|
Sheridan, Oliver | University of Washington |
Acikmese, Behcet | University of Washington |
Keywords: Optimal control, Uncertain systems, Aerospace
Abstract: We consider the problem of robust trajectory optimization of constrained discrete-time linear systems under bounded uncertainty. This paper extends our previous work on robust trajectory planning for linear systems with control-dependent uncertainties to a more general class of uncertainties, including nominal-state-dependent uncertainties. In particular, we show that if strong duality holds for the robust state constraints, and if the uncertainties are bounded by convex elementwise nonnegative functions of the nominal state and control, the robust constraints can be equivalently reformulated as deterministic convex constraints, enabling globally optimal solutions with no conservatism. We first use convex duality theory to reformulate robust linear inequality state constraints as deterministic biconvex constraints . We then exploit the elementwise nonnegativity of the uncertainty bounds to remove the biconvexity in closed form, resulting in convex deterministic constraints that can be handled by off-the-shelf solvers. These two lemmas are our main contribution, and lead to our final result, and equivalent convex reformulation of the original robust optimization problem which allows efficient trajectory optimization solutions under control and state dependent uncertainties. We then demonstrate the practical applicability of our method via numerical simulations.
|
|
11:00-11:20, Paper FrA08.4 | Add to My Program |
Optimality Conditions in Infinite-Horizon Optimal Control Problem with Vanishing Discounting |
|
Shvartsman, Ilya | Penn State Harrisburg |
Keywords: Optimal control
Abstract: We consider an optimal control problem on infinite horizon with a vanishing discounting factor, state sufficient conditions of optimality and illustrate them with examples.
|
|
11:20-11:40, Paper FrA08.5 | Add to My Program |
Efficient and Real-Time Reinforcement Learning for Linear Quadratic Systems with Application to H-Infinity Control |
|
Aalipour, Ali | University of Minnesota |
Khani, Alireza | University of Minnesota-Twin Cities |
Keywords: Optimal control, Iterative learning control, Linear systems
Abstract: This paper presents a model-free, real-time, data-efficient Q-learning-based algorithm to solve the H_{infty} control of linear discrete-time systems. The computational complexity is shown to reduce from mathcal{O}(underline{q}^3) in the literature to mathcal{O}(underline{q}^2) in the proposed algorithm, where underline{q} is quadratic in the sum of the size of state variables, control inputs, and disturbance. An adaptive optimal controller is designed and the parameters of the action and critic networks are learned online without the knowledge of the system dynamics, making the proposed algorithm completely model-free. Also, a sufficient probing noise is only needed in the first iteration and does not affect the proposed algorithm. With no need for an initial stabilizing policy, the algorithm converges to the closed-form solution obtained by solving the Riccati equation. A simulation study is performed by applying the proposed algorithm to real-time control of an autonomous mobility-on-demand (AMoD) system for a real-world case study to evaluate the effectiveness of the proposed algorithm.
|
|
11:40-12:00, Paper FrA08.6 | Add to My Program |
Open-Loop and Feedback LQ Potential Differential Games for Multi-Agent Systems |
|
Scarpa, Maria Luisa | Imperial College London |
Mylvaganam, Thulasi | Imperial College London |
Keywords: Optimal control, Game theory, Linear systems
Abstract: Open-loop and feedback potential differential games for multi-agent systems are considered in this paper. Constructive sufficient conditions under which a linear quadratic differential game constitutes a potential differential game are provided. The conditions enable the construction of associated optimal control problems that yield (at significantly reduced computational complexity) solutions (in terms of open-loop and feedback Nash equilibrium strategies) of the original differential game. The results are demonstrated on a practically-motivated example that concerns spacecraft formation control.
|
|
FrA09 Regular Session, Simpor Junior 4811 |
Add to My Program |
Optimization II |
|
|
Chair: Jiang, Wei | Aalto University, Finland |
Co-Chair: Charalambous, Themistoklis | University of Cyprus |
|
10:00-10:20, Paper FrA09.1 | Add to My Program |
Distributed Markov Chain-Based Strategies for Multi-Agent Robotic Surveillance |
|
Diaz-Garcia, Gilberto | University of California, Santa Barbara |
Bullo, Francesco | Univ of California at Santa Barbara |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Optimization, Markov processes, Robotics
Abstract: Markov chains have been increasingly used to define persistent robotic surveillance schemes. Motivations for this design choice include their easy implementation, unpredictable surveillance patterns, and their well-studied mathematical background. However, applying previous results to scenarios with multiple agents can significantly increase the dimension of the problem, leading to intractable algorithms. In this work we analyze the hitting time minimization problem for multiple agents moving over a finite graph. We exploit the structure of this problem to propose a tractable algorithm to design Markov chains to cover the graph with multiple interacting agents. Using mathematical analysis, we provide guarantees for the convergence of our proposed solution. Also, through numerical simulations, we show the performance of our approach compared to the current state of art in multi-agent scenarios.
|
|
10:20-10:40, Paper FrA09.2 | Add to My Program |
A Unified Early Termination Technique for Primal-Dual Algorithms in Mixed Integer Conic Programming |
|
Chen, Yuwen | University of Oxford |
Ning, Catherine | University of Oxford |
Goulart, Paul J. | University of Oxford |
Keywords: Optimization, Numerical algorithms, Optimal control
Abstract: We propose an early termination technique for mixed integer conic programming for use within branch-and-bound based solvers. Our approach generalizes previous early termination results for ADMM-based solvers to a broader class of primal-dual algorithms, including both operator splitting methods and interior point methods. The complexity for checking early termination is O(n) for each termination check assuming a bounded problem domain. We show that this domain restriction can be relaxed for problems whose data satisfies a simple rank condition, in which case each check requires an O(n^2) solve using a linear system that must be factored only once at the root node. We further show how this approach can be used in hybrid model predictive control as long as system inputs are bounded. Numerical results show that our method leads to a moderate reduction in the total iterations required for branch-and-bound conic solvers with interior-point based subsolvers.
|
|
10:40-11:00, Paper FrA09.3 | Add to My Program |
Global Resolution of Chance-Constrained Optimization Problems: Minkowski Functionals and Monotone Inclusions |
|
Zhang, Peixuan | Pennsylvania State University |
Shanbhag, Uday V. | Pennsylvania State University |
Lagoa, Constantino M. | Pennsylvania State Univ |
Bardakci, Ibrahim Ekrem | Bartin University |
Keywords: Optimization, Numerical algorithms, Uncertain systems
Abstract: Chance-constrained optimization problems are generally complicated by uncertainty, nonsmoothness, and nonconvexity. Yet, non-asymptoic rates and complexity guarantees for computing an epsilon-global minimizer remain open questions. We consider a subclass of problems in which the probability is defined as mathbb{P}left{ zeta mid zeta in K(x) right}, K(x) is a set defined as K(x) , = , { zeta in Kscr , mid , c(x,zeta) leq 1} where c(x,bullet) is a positively homogenous function for any x in Xscr and Kscr is a nonempty, compact, and convex set, symmetric about the origin. We make two contributions in this context. (i) First, when zeta admits a log-concave density on Kscr, the chance-constrained formulation admits an equivalent convex representation and under a suitable regularity condition, the necessary and sufficient conditions are captured by a monotone inclusion with a compositional expectation-valed operator. (ii) Second, when zeta admits a uniform density, we present a variance-reduced stochastic proximal-point framework for resolving the monotone inclusion and provide amongst the first rate and complexity guarantees for such a problem.
|
|
11:00-11:20, Paper FrA09.4 | Add to My Program |
Variational Principles for Mirror Descent and Mirror Langevin Dynamics |
|
Tzen, Belinda | Columbia University |
Raj, Anant | INRIA - Ecole Normale Supérieure |
Raginsky, Maxim | University of Illinois at Urbana-Champaign |
Bach, Francis | INRIA - Ecole Normale Supérieure |
Keywords: Optimization, Optimal control, Stochastic optimal control
Abstract: Mirror descent, introduced by Nemirovski and Yudin in the 1970s, is a primal-dual convex optimization method that can be tailored to the geometry of the optimization problem at hand through the choice of a strongly convex potential function. It arises as a basic primitive in a variety of applications, including large-scale optimization, machine learning, and control. This paper proposes a variational formulation of mirror descent and of its stochastic variant, mirror Langevin dynamics. The main idea, inspired by the classic work of Brezis and Ekeland on variational principles for gradient flows, is to show that mirror descent emerges as a closed-loop solution for a certain optimal control problem, and the Bellman value function is given by the Bregman divergence between the initial condition and the global minimizer of the objective function.
|
|
11:20-11:40, Paper FrA09.5 | Add to My Program |
Leveraging Proximal Optimization for Differentiating Optimal Control Solvers |
|
Bounou, Oumayma | Inria, ENS |
Ponce, Jean | Ecole Normale Supérieure |
Carpentier, Justin | Inria |
Keywords: Optimization, Optimal control, Learning
Abstract: Over the past few years, differentiable optimization has gained interest within machine learning, control, and robotics communities. It consists in computing the derivatives of the solutions of a given optimization problem which can then be used in learning algorithms. Until now, dedicated approaches have been proposed to compute the derivatives of various optimization problems (LPs, QPs, SOCPs, etc.). However, these approaches assume the problems are well-conditioned, limiting textit{de facto} their application to general optimal control problems (OCP) widely used in robotics. In this work, we focus on the differentiation of such problems. We notably introduce a differentiable proximal formulation of equality-constrained LQR problems that accurately solves rank-deficient problems. This proximal formulation allows us to compute accurate gradients even in the case of problems that do not satisfy the standard linear independence constraint qualification (LICQ). Because any optimal control problem can be cast as an equality-constrained LQR problem in the vicinity of the optimal solution, we show that our robust LQR derivatives computation can be exploited to obtain the derivatives of general optimal control problems. We demonstrate the effectiveness of our approach in dynamics learning and parameter identification experiments in both linear and nonlinear optimal control problems.
|
|
11:40-12:00, Paper FrA09.6 | Add to My Program |
Distributed Optimization Via Gradient Descent with Event-Triggered Zooming Over Quantized Communication |
|
Rikos, Apostolos I. | KTH Royal Institute of Technology |
Jiang, Wei | Aalto University, Finland |
Charalambous, Themistoklis | University of Cyprus |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Optimization, Optimization algorithms, Agents-based systems
Abstract: In this paper, we study unconstrained distributed optimization strongly convex problems, in which the exchange of information in the network is captured by a directed graph topology over digital channels that have limited capacity (and hence information should be quantized). Distributed methods in which nodes use quantized communication yield a solution at the proximity of the optimal solution, hence reaching an error floor that depends on the quantization level used; the finer the quantization the lower the error floor. However, it is not possible to determine in advance the optimal quantization level that ensures specific performance guarantees (such as achieving an error floor below a predefined threshold). Choosing a very small quantization level that would guarantee the desired performance, requires information packets of very large size, which is not desirable (could increase the probability of packet losses, increase delays, etc) and often not feasible due to the limited capacity of the channels available. In order to obtain a communication-efficient distributed solution and a sufficiently close proximity to the optimal solution, we propose a quantized distributed optimization algorithm that converges in a finite number of steps and is able to adjust the quantization level accordingly. The proposed solution uses a finite-time distributed optimization protocol to find a solution to the problem for a given quantization level in a finite number of steps and keeps refining the quantization level until the difference in the solution between two successive solutions with different quantization levels is below a certain pre-specified threshold. Therefore, the proposed algorithm progressively refines the quantization level, thus eventually achieving low error floor with a reduced communication burden. The performance gains of the proposed algorithm are demonstrated via illustrative examples.
|
|
FrA10 Regular Session, Roselle Junior 4713 |
Add to My Program |
Neural Networks II |
|
|
Chair: Raginsky, Maxim | University of Illinois at Urbana-Champaign |
Co-Chair: Tóth, Roland | Eindhoven University of Technology |
|
10:00-10:20, Paper FrA10.1 | Add to My Program |
Gasoline Controlled Auto-Ignition with Learning-Based Uncertainty Using Stochastic Model Predictive Control |
|
Chen, Xu | RWTH Aachen University |
Basler, Maximilian | RWTH Aachen University |
Ketelhut, Maike | RWTH Aachen University |
Abel, Dirk | RWTH Aachen University |
Keywords: Neural networks, Predictive control for linear systems, Control applications
Abstract: The internal combustion engine faces a severe energy conservation and emission reduction challenge. In this regard, low-temperature combustion technology is a profitable solution, allowing for pollutant emission reduction while improving engine efficiency. However, the process is complex, and the cycles are mutually coupled, making it a huge challenge to stabilize the entire process behavior. Also, the model mismatch and the inherent stochasticity of the process bring considerable difficulties to the application of control technology. In this work, we propose a deep learning-based generative model to learn the distribution of system uncertainties. The uncertainty information is considered in the model predictive control (MPC) strategy design. We adopt the disturbance-affine stochastic MPC (sMPC) and transform the chance-constrained MPC problem into some tractable optimization problems. The results show better closed-loop performance with smaller output variance, given the prior knowledge of uncertainty realization from the proposed generative model.
|
|
10:20-10:40, Paper FrA10.2 | Add to My Program |
Computationally Efficient Predictive Control Based on ANN State-Space Models |
|
Hoekstra, Jan Hidde | Eindhoven University of Technology |
Cseppentő, Bence | Budapest University of Technology and Economics |
Beintema, Gerben Izaak | Eindhoven University of Technology |
Schoukens, Maarten | Eindhoven University of Technology |
Kollar, Zsolt | Budapest University of Technology and Economics |
Tóth, Roland | Eindhoven University of Technology |
Keywords: Neural networks, Predictive control for nonlinear systems, Linear parameter-varying systems
Abstract: Artificial neural networks (ANN) have been shown to be flexible and effective function estimators for the identification of nonlinear state-space models. However, if the resulting models are used directly for nonlinear model predictive control (NMPC), the resulting nonlinear optimization problem is often overly complex due to the size of the network, requires the use of high-order observers to track the states of the ANN model, and the overall control scheme does not exploit the available autograd tools for these models. In this paper, we propose an efficient approach to auto-convert ANN state-space models to linear parameter-varying (LPV) form and solve predictive control problems by successive solutions of linear model predictive problems, corresponding to quadratic programs (QPs). Furthermore, we show how existing deep-learning methods, such as SUBNET that uses a state encoder, enable efficient implementation of MPCs on identified ANN models. Performance of the proposed approach is demonstrated by a simulation study on an unbalanced disc system.
|
|
10:40-11:00, Paper FrA10.3 | Add to My Program |
An Analytic End-To-End Deep Learning Algorithm Based on Collaborative Learning |
|
Li, Sitan | Nanyang Technological University |
Cheah, Chien Chern | Nanyang Tech. Univ |
Keywords: Neural networks, Robotics, Lyapunov methods
Abstract: In most control applications, theoretical analysis of the systems is crucial in ensuring stability or convergence, so as to ensure safe and reliable operations and also to gain a better understanding of the systems for further developments. However, most current deep learning methods are black-box approaches that are more focused on empirical studies. This paper develops an analytic end-to-end learning algorithm for deep fully connected neural networks(FNN). Unlike existing end-to-end deep learning methods, the convergence analysis of the proposed method is assured based on smooth sigmoid activation functions and thus avoiding any potential chattering problem, but the proposed method does not easily lead to gradient vanishing problems. The proposed End-to-End algorithm trains multiple two-layer fully connected networks concurrently and collaborative learning can be used to further combine their strengths to improve accuracy. A classification case study based on fully connected networks and MNIST dataset was done to demonstrate the performance of the proposed approach. Then an online kinematics control task of a UR5e robot arm was performed to illustrate the regression approximation and online updating ability of our algorithm.
|
|
11:00-11:20, Paper FrA10.4 | Add to My Program |
Tight Certified Robustness Via Min-Max Representations of ReLU Neural Networks |
|
Anderson, Brendon G. | University of California, Berkeley |
Pfrommer, Samuel | University of California, Berkeley |
Sojoudi, Somayeh | UC Berkeley |
Keywords: Neural networks, Robust control, Optimization
Abstract: The reliable deployment of neural networks in control systems requires rigorous robustness guarantees. In this paper, we obtain tight robustness certificates over convex attack sets for min-max representations of ReLU neural networks by developing a convex reformulation of the nonconvex certification problem. This is done by "lifting" the problem to an infinite-dimensional optimization over probability measures, leveraging recent results in distributionally robust optimization to solve for an optimal discrete distribution, and proving that solutions of the original nonconvex problem are generated by the discrete distribution under mild boundedness, nonredundancy, and Slater conditions. As a consequence, optimal (worst-case) attacks against the model may be solved for exactly. This contrasts prior state-of-the-art that either requires expensive branch-and-bound schemes or loose relaxation techniques. Experiments on robust control and MNIST image classification examples highlight the benefits of our approach.
|
|
11:20-11:40, Paper FrA10.5 | Add to My Program |
Safety Filter Design for Neural Network Systems Via Convex Optimization |
|
Chen, Shaoru | University of Pennsylvania |
Chee, Kong Yao | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Hsieh, M. Ani | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Neural networks, Robust control, Uncertain systems
Abstract: With the increase in data availability, it has been widely demonstrated that neural networks (NN) can capture complex system dynamics precisely in a data-driven manner. However, the architectural complexity and nonlinearity of the NNs make it challenging to synthesize a provably safe controller. In this work, we propose a novel safety filter that relies on convex optimization to ensure safety for a NN system, subject to additive disturbances that are capable of capturing modeling errors. Our approach leverages tools from NN verification to over-approximate NN dynamics with a set of linear bounds, followed by an application of robust linear MPC to search for controllers that can guarantee robust constraint satisfaction. We demonstrate the efficacy of the proposed framework numerically on a nonlinear pendulum system.
|
|
11:40-12:00, Paper FrA10.6 | Add to My Program |
A Constructive Approach to Function Realization by Neural Stochastic Differential Equations |
|
Veeravalli, Tanya | University of Illinois, Urbana-Champaign |
Raginsky, Maxim | University of Illinois at Urbana-Champaign |
Keywords: Neural networks, Stochastic systems, Nonlinear systems
Abstract: The problem of function approximation by neural dynamical systems has typically been approached in a top-down manner: Any continuous function can be approximated to an arbitrary accuracy by a sufficiently complex model with a given architecture. This can lead to high-complexity controls which are impractical in applications. In this paper, we take the opposite, constructive approach: We impose various structural restrictions on system dynamics and consequently characterize the class of functions that can be realized by such a system. The systems are implemented as a cascade interconnection of a neural stochastic differential equation (Neural SDE), a deterministic dynamical system, and a readout map. Both probabilistic and geometric (Lie-theoretic) methods are used to characterize the classes of functions realized by such systems.
|
|
FrA11 Regular Session, Roselle Junior 4712 |
Add to My Program |
Large-Scale Systems I |
|
|
Chair: Weyer, Erik | Univ. of Melbourne |
Co-Chair: Ishii, Hideaki | Tokyo Institute of Technology |
|
10:00-10:20, Paper FrA11.1 | Add to My Program |
An Optimization-Based Network Partitioning Method Considering Local Controllability: Application to Water Distribution Networks |
|
Arastou, Alireza | University of Melbourne |
Wang, Ye | The University of Melbourne |
Weyer, Erik | Univ. of Melbourne |
Keywords: Large-scale systems, Control system architecture, Distributed control
Abstract: Complex networks pose significant challenges for design of control systems. This paper proposes a partitioning method for large-scale networks to be employed for use in decentralized and distributed control. The paper is divided into two parts. In the first part, a new formulation of the partitioning problem considering both computational and communication costs associated with control is established, and the controllability of the subsystems are also taken into account. In the second part, an effective algorithm is developed to find the solution to the network decomposition problem. The proposed approach is illustrated on a water distribution system.
|
|
10:20-10:40, Paper FrA11.2 | Add to My Program |
Low-Complexity Linear Parameter-Varying Approximations of Incompressible Navier-Stokes Equations for Truncated State-Dependent Riccati Feedback |
|
Heiland, Jan | Max Planck Institute for Dynamics of Complex Technical Systems |
Werner, Steffen W. R. | Virginia Tech |
Keywords: Large-scale systems, Model/Controller reduction, Numerical algorithms
Abstract: Nonlinear feedback design via state-dependent Riccati equations is well established but unfeasible for large-scale systems because of computational costs. If the system can be embedded in the class of linear parameter-varying (LPV) systems with the parameter dependency being affine-linear, then the nonlinear feedback law has a series expansion with constant and precomputable coefficients. In this work, we propose a general method to approximating nonlinear systems such that the series expansion is possible and efficient even for high-dimensional systems. We lay out the application for the stabilization of incompressible Navier-Stokes equations, discuss the numerical solution of the involved matrix valued equations, and confirm the performance of the approach in a numerical example.
|
|
10:40-11:00, Paper FrA11.3 | Add to My Program |
Conflict-Free Node-To-Robot Scheduling for Lifelong Operation in a Warehouse with Narrow-Corridor Environment |
|
Singh, Sharad Kumar | Addverb Technologies, Noida |
M, Hemantharaj | Addverb Technologies |
Bhattacharya, Sayantani | Addverb Technologies, Noida |
Jha, Manish | Addverb Technologies |
Keywords: Large-scale systems, Robotics, Autonomous robots
Abstract: This paper presents a solution to lifelong Multi-Agent Path Finding (MAPF) problems for long and narrow-corridor environments. In this setting, robots need to navigate conflict-free paths while adapting to new goals. We propose an algorithm called Conflict-Free Node-To-Robot Scheduling (CFNRS), which effectively coordinates the paths of robots on a given graph in a constrained environment. The algorithm assigns nodes of the graph, ensuring no conflicts with other robots. In particular, we introduce a Deadlock-Detection and Resolution mechanism to find and resolve conflicts and ensure conflict-free paths. We have introduced a problem-reduction technique for improved efficiency. The proposed algorithms are evaluated through simulations in narrow-corridor environments and compared to existing state-of-the-art MAPF solvers, demonstrating their validity and effectiveness in ensuring that robots can navigate conflict-free paths.
|
|
11:00-11:20, Paper FrA11.4 | Add to My Program |
Local Convergence of Multi-Agent Systems towards Rigid Lattices |
|
Giusti, Andrea | University of Naples Federico II |
Coraggio, Marco | Scuola Superiore Meridionale |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Large-scale systems, Stability of nonlinear systems, Autonomous systems
Abstract: Geometric pattern formation is an important emergent behavior in many applications involving large-scale multi-agent systems, such as sensor networks deployment and collective transportation. Attraction/repulsion virtual forces are the most common control approach to achieve such behavior in a distributed and scalable manner. Nevertheless, for most existing solutions only numerical and/or experimental evidence of their convergence is available. Here, we revisit the problem of achieving pattern formation in spaces of any dimension, giving sufficient conditions to prove analytically that under the influence of appropriate virtual forces, a large-scale multi-agent swarming system locally converges towards a stable and robust rigid lattice configuration. Our theoretical results are complemented by exhaustive numerical simulations confirming their effectiveness and estimating the region of asymptotic stability of the rigid lattice configuration.
|
|
11:20-11:40, Paper FrA11.5 | Add to My Program |
Ensemble Control of a Large Population of Stochastic Oscillators: Periodic–Feedback Control Approach |
|
Ito, Kaito | Tokyo Institute of Technology |
Kume, Haruhiro | Tokyo Institute of Technology |
Ishii, Hideaki | Tokyo Institute of Technology |
Keywords: Large-scale systems, Stochastic systems, Lyapunov methods
Abstract: In this paper, we address the problem of steering the distribution of oscillators all receiving the same control input to a given desired distribution. In a large population limit, the distribution of oscillators can be described by a probability density. Then, our problem can be seen as an ensemble control problem with a constraint on the steady-state density. In particular, we consider the case where oscillators are subjected to stochastic noise. One of the difficulties of this problem is that due to the stochasticity, it is generally impossible to design a control law under which oscillators converge to a target density exactly. To avoid this issue, we first give an alternative target density that is close enough to the original target. The modified target is carefully designed via a periodic input so that the distribution of oscillators can converge to it by an appropriate control strategy. Next, we construct a controller that decreases the Kullback–Leibler divergence between the distribution of oscillators and the modified target combining a periodic input and feedback control. We exhibit some convergence results for our proposed method. The effectiveness of the proposed method is demonstrated by a numerical example.
|
|
11:40-12:00, Paper FrA11.6 | Add to My Program |
Entropic Model Predictive Optimal Transport for Underactuated Linear Systems |
|
Ito, Kaito | Tokyo Institute of Technology |
Kashima, Kenji | Kyoto University |
Keywords: Optimal control, Large-scale systems, Predictive control for linear systems
Abstract: This paper investigates dynamical optimal transport of underactuated linear systems over an infinite time horizon. In our previous work, we proposed to integrate model predictive control and the celebrated Sinkhorn algorithm to perform efficient dynamical transport of agents. However, the proposed method requires the invertibility of input matrices, which severely limits its applicability. To resolve this issue, we extend the method to (possibly underactuated) controllable linear systems. In addition, we ensure the convergence properties of the method for general controllable linear systems. The effectiveness of the proposed method is demonstrated by a numerical example.
|
|
FrA12 Regular Session, Roselle Junior 4711 |
Add to My Program |
Distributed Control I |
|
|
Chair: Belkhatir, Zehor | University of Southampton |
Co-Chair: Jagtap, Pushpak | Indian Institute of Science |
|
10:00-10:20, Paper FrA12.1 | Add to My Program |
Distributed Multi-Robot Flocking Based on Acoustic Doppler Effect |
|
Zhou, Yizhi | Geroge Mason University |
Nowzari, Cameron | George Mason University |
Wang, Xuan | George Mason University |
Keywords: Distributed control, Agents-based systems, Autonomous robots
Abstract: This paper aims to design a distributed algorithm based on Doppler effect that allows multiple robots to achieve a consensus on their velocities. Instead of relying on a direct measurement of robots’ exact or relative velocities, which are usually challenging under denied environments (i.e., underwater), the novelty of our approach stems from utilizing the sound frequency as a medium to coordinate velocities among robots. Such a mechanism is achieved by exploiting the Doppler effect and establishing an equivalence between the velocity consensus of the robots and the frequency consensus of the sound they broadcast/receive. To address scalability and interference issues for large-scale systems, our use of the Doppler effect is broadcast-based, unlike most traditional Doppler devices that are reflection-based. Building on this, we develop a fully distributed algorithm for multi-robot flocking for the one-dimensional case, where the only control input for each robot is the sound frequencies it locally receives. The designed controller leads to highly nonlinear system dynamics. We employ a linearization method to theoretically prove the local convergence of the system to the desired equilibrium for velocity consensus. Simulations demonstrate the effectiveness of the proposed approach.
|
|
10:20-10:40, Paper FrA12.2 | Add to My Program |
Infinite-Dimensional Output-Feedback Bounded Bilinear Control of a Parallel-Flow Heat Exchanger |
|
Mechhoud, Sarah | University of 20 August 1955 Skikda |
Belkhatir, Zehor | University of Southampton |
Keywords: Distributed control, Constrained control, Output regulation
Abstract: In this paper, we consider the problem of output bounded controller design for a parallel-flow heat exchanger described by 2times 2 coupled linear hyperbolic partial differential equations (PDEs) of balance laws. We aim to drive the internal fluid outlet temperature to track a reference trajectory by manipulating the external fluid velocity. Due to physical limitations, this manipulated variable has to be bounded to avoid a laminar regime. Consequently, the control problem becomes bounded and bilinear. Based on the set-invariance concept and the Lyapunov's stability theory, first, we design a bounded state feedback controller. Then, since only boundary measurements are available, we synthesize an output feedback controller, and demonstrate the exponential stability of the closed-loop system. Finally, simulation results are provided to illustrate the performance of the proposed control technique.
|
|
10:40-11:00, Paper FrA12.3 | Add to My Program |
A Bound on the Existence of the Maximum Jointly Invariant Set of Input-Coupled Systems |
|
Sánchez-Amores, Ana | University of Seville |
Maestre, Jose Maria (Pepe) | University of Seville |
Trodden, Paul | University of Sheffield |
Camacho, Eduardo F. | Univ. of Sevilla |
Keywords: Distributed control, Constrained control, Robust control
Abstract: We present a set-theoretical characterization of a bound on the maximal portion that an agent can cede of its input variable to another agent. By ceding control authority, agents can decompose coupling variables into public and private parts, which is of interest in situations of partial cooperation. In particular, sufficient conditions under which the non-existence of the maximum robust control invariant set is guaranteed are provided, expressed in terms of support functions and the dominant system eigenvalue. Finally, the results are illustrated via stable and unstable example systems with different coupling.
|
|
11:00-11:20, Paper FrA12.4 | Add to My Program |
Distributed Optimal Solutions for Multiagent Pursuit-Evasion Games |
|
Zhou, Panpan | KTH Royal Institute of Technology |
Chen, Ben M. | Chinese University of Hong Kong |
Keywords: Distributed control, Control applications, Networked control systems
Abstract: In this paper, distributed optimal solutions are designed for networked multiagent pursuit-evasion (MPE) games for capture and formation control. In the games, the pursuers aim to minimize the distance from their target evaders while the evaders attempt to maximize it, and at the same time, all players desire to maintain cohesion with their teammates. The goals of agents are obviously reflected in the obtained optimal control strategies which consist of an attracting term and/or a repelling term. Nash equilibrium is obtained by means of optimal strategies using the solutions of the HJI equations. Furthermore, three scenarios are considered in the MPE game: one-pursuer-one-evader, multiple-pursuer-oneevader, and multiple-pursuer-multiple-evader, where sufficient conditions are given for pursuers in achieving capture or formation control with ultimate zero or bounded errors. It is shown that the conditions depend on the structure of the communication graph, the parameters in the controllers, and the expected formation configurations. Finally, both simulations and real flight experiments successfully demonstrate the effectiveness of the proposed strategies.
|
|
11:20-11:40, Paper FrA12.5 | Add to My Program |
Consensus Control Driven by Value Exchange |
|
Sugiyama, Daiki | Kyoto University |
Azuma, Shun-ichi | Kyoto University |
Ariizumi, Ryo | Nagoya University |
Asai, Toru | Nagoya University |
Keywords: Distributed control, Cooperative control, Networked control systems
Abstract: A multi-agent system is a system consisting of multiple agents that can make a global decision autonomously. So far, many studies have been conducted under the assumption that all agents behave cooperatively. On the other hand, in a large-scale multi-agent system such as a connected vehicle network, each agent must be owned by different owners. Thus, in such a system, the agents do not necessarily work cooperatively and each agent pursues its utility. Such a multi-agent system is modeled as a system driven by the exchange of certain values. In this paper, we address a consensus problem for a multi-agent system driven by the exchange between tokens and information. Unlike the typical consensus control, we assume that each agent has tokens and collects information from its neighbors in exchange for tokens. To this system, we derive a necessary and sufficient condition for the system to achieve consensus. This condition is characterized by the number of tokens and the network structure. Moreover, we disclose that the consensus value is given by the left eigenvector of the Perron matrix associated with the initial token distribution. Finally, we discuss the convergence property for several specific network structures.
|
|
11:40-12:00, Paper FrA12.6 | Add to My Program |
Scalable Distributed Controller Synthesis for Multi-Agent Systems Using Barrier Functions and Symbolic Control |
|
Sundarsingh, David Smith | Indian Institute of Science |
Bhagiya, Jay | Indian Institute of Science |
Saharsh, Saharsh | Indian Institute of Science, Bangalore |
Chatrola, Jeel | Indian Institute of Science |
Saoud, Adnane | CentraleSupelec |
Jagtap, Pushpak | Indian Institute of Science |
Keywords: Distributed control, Formal Verification/Synthesis, Agents-based systems
Abstract: In this paper, we propose a computationally efficient symbolic controller synthesis technique for multi-agent systems. The paper focuses on synthesizing distributed controllers enforcing local temporal logic specifications along with global safety specifications for multi-agent systems. To solve the problem in a computationally efficient way, we leverage the concept of control barrier functions. In particular, we use a three-step bottom-up approach: first, the symbolic controllers for individual agents are synthesized to enforce local temporal logic specifications, then we use a notion of control barrier functions for symbolic models to compose controlled agent systems by removing unsafe transitions, and finally, we synthesize controller for the reduced composed system to ensure the satisfaction of local temporal logic specifications while ensuring global safety specification. The effectiveness of our approach is demonstrated on a multi-robot system by comparing it with the conventional monolithic symbolic control approaches.
|
|
FrA13 Regular Session, Roselle Junior 4613 |
Add to My Program |
Networked Control Systems IV |
|
|
Chair: Steinberger, Martin | Graz University of Technology |
Co-Chair: Bastianello, Nicola | KTH Royal Institute of Technology |
|
10:00-10:20, Paper FrA13.1 | Add to My Program |
A Unified Approach to Solve the Dynamic Consensus on the Average, Maximum, and Median Values with Linear Convergence |
|
Deplano, Diego | University of Cagliari |
Bastianello, Nicola | KTH Royal Institute of Technology |
Franceschelli, Mauro | University of Cagliari |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Networked control systems, Optimization algorithms, Agents-based systems
Abstract: This manuscript proposes novel distributed algorithms for solving the dynamic consensus problem in discrete-time multi-agent systems on three different objective functions: the average, the maximum, and the median. In this problem, each agent has access to an external time-varying scalar signal and aims to estimate and track a function of all the signals by exploiting only local communications with other agents. By recasting the problem as an online distributed optimization problem, the proposed algorithms are derived based on the distributed implementation of the alternating direction method of multipliers (ADMM) and are thus amenable to a unified analysis technique. A major contribution is that of proving linear convergence of these ADMM-based algorithms for the specific dynamic consensus problems of interest, for which current results could only guarantee sub-linear convergence. In particular, the tracking error is shown to converge within a bound, whereas the steady-state error is zero. Numerical simulations corroborate the theoretical findings, empirically show the robustness of the proposed algorithms to re-initialization errors, and compare their performance with that of state-of-the-art algorithms.
|
|
10:20-10:40, Paper FrA13.2 | Add to My Program |
An Emulation Approach to Sampled-Data Synchronization |
|
Barkai, Gal | Technion—Israel Institute of Technology |
Mirkin, Leonid | Technion - IIT |
Zelazo, Daniel | Technion - Israel Institute of Technology |
Keywords: Networked control systems, Sampled-data control, Cooperative control
Abstract: This work presents a novel approach for achieving state synchronization of homogeneous LTI agents to a trajectory generated by a prescribed reference generator under intermit- tent and asynchronous communication. The proposed protocol involves emulating “ideal” global analog dynamics at each agent to generate the control signal between samples. Each agent transmits the centroid state of its local emulator rather than its own state vector, which is used to update the emulators at the receiving end. The paper guarantees synchronization with a prescribed reference generator under mild assumptions on the system’s structure, persistency of connectivity, and uniform boundedness of sampling intervals. Additionally, the controller parameters are independent of the sampling interval, allowing it to be designed without any a priori knowledge of the sampling sequence. Lastly, a simplified and scalable implementation whose dimension is independent of the number of agents is also proposed.
|
|
10:40-11:00, Paper FrA13.3 | Add to My Program |
Optimal Scheduling Policies for Remote Estimation of Autoregressive Markov Processes Over Time-Correlated Fading Channel |
|
Dutta, Manali | Indian Institute of Science |
Singh, Rahul | Indian Institute of Science |
Keywords: Networked control systems, Sensor networks, Filtering
Abstract: We consider the problem of optimally scheduling transmissions for remote estimation of a discrete-time autoregressive Markov process that is driven by white Gaussian noise. A sensor observes this process, and then decides to either encode the current state of this process into a data packet and attempts to transmit it to the estimator over an unreliable wireless channel modeled as a Gilbert-Elliott channel~cite{Gilbert1960capacity}~cite{Chakravorty2017structure}~cite{yao2022age}, or does not send any update. Each transmission attempt consumes lambda units of transmission power, and the remote estimator is assumed to be linear. The channel state is revealed only via the feedback (ACKslash NACK) of a transmission, and hence the channel state is not revealed if no transmission occurs. The goal of the scheduler is to minimize the expected value of an infinite-horizon cumulative discounted cost, in which the instantaneous cost is composed of the following two quantities: (i)~squared estimation error, (ii) transmission power. We posed this problem as a partially observable Markov decision process (POMDP), in which the scheduler maintains a belief about the current state of the channel, and makes decisions on the basis of the current value of the error e(t) (defined in~eqref{def:error_evolve}), and the belief state.~To aid its analysis, we introduce an easier-to-analyze ``folded POMDP.'' We then analyze this folded POMDP and show that there is an optimal scheduling policy that has threshold structure, i.e. for each value of the error e, there is a threshold bust(e) such that when the error is equal to e, this policy transmits only when the current belief state is greater than bust(e).
|
|
11:00-11:20, Paper FrA13.4 | Add to My Program |
Stability of Nonlinear Systems with Two Time Scales Over a Single Communication Channel |
|
Wang, Weixuan | The University of Melbourne |
Maass, Alejandro I. | Universidad De O'Higgins |
Nesic, Dragan | University of Melbourne |
Tan, Ying | The University of Melbourne |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Networked control systems, Stability of hybrid systems, Nonlinear systems
Abstract: This paper studies the stabilisation problem for a class of nonlinear systems with two time scales, where only a single communication channel is available to allocate both low and high-frequency transmissions from slow and fast subsystems, respectively. A clock mechanism is proposed to govern the transmissions, and the closed-loop system is modelled by a hybrid singularly perturbed system. Singular perturbation based analysis is used to obtain individual maximum allowable transmission intervals for both slow and fast transmissions, and also to guarantee semi-global practical asymptotic stability with respect to the minimum allowable transmission interval of slow transmissions. We illustrate the results via a numerical example.
|
|
11:20-11:40, Paper FrA13.5 | Add to My Program |
Stochastic Relaxation of the Maximum Allowable Delay for a Class of Networked Control Systems |
|
Schlotterbeck, Constanza | Pontificia Universidad Católica De Chile |
Gallegos, Javier A. | University of Chile |
Núñez, Felipe | Pontificia Universidad Catolica De Chile |
Keywords: Networked control systems, Stochastic systems, Stability of hybrid systems
Abstract: Stability in networked control systems has been typically addressed in the deterministic setting using the concepts of Maximum Allowable Transmit Interval (MATI) and Maximum Allowable Delay (MAD). This work looks to extend the analysis to the stochastic setting by giving conditions for uniform stability in probability when the communication delay follows a probability distribution that includes values over the deterministic MAD. Analytical conditions are given and validated through simulations using a concrete example.
|
|
11:40-12:00, Paper FrA13.6 | Add to My Program |
Switched Lyapunov Function-Based Controller Synthesis for Networked Control Systems: A Computationally Inexpensive Approach |
|
Stanojevic, Katarina | Graz University of Technology |
Steinberger, Martin | Graz University of Technology |
Horn, Martin | Graz University of Technology |
Keywords: Networked control systems
Abstract: This paper presents a Lyapunov function-based control strategy for networked control systems (NCS) affected by variable time delays and data loss. A special focus is put on the reduction of the computational complexity. The crucial step to achieving computational efficiency involves defining a specific buffering mechanism that introduces an additional delay not larger than one sampling period. This allows representing the buffered NCS as a switched system, thus leading to a significant simplification of the NCS model and subsequent controller synthesis. The novel approach does not only circumvent the need for any over-approximation technique, but also leads to a strongly decreased number of optimization variables and linear matrix inequalities, allowing hereby greater flexibility with respect to additional degrees of freedom affecting the transient behavior. The performance and computational efficiency of the control strategy are demonstrated in a simulation example.
|
|
FrA14 Regular Session, Roselle Junior 4612 |
Add to My Program |
Observers for Linear Systems |
|
|
Chair: Wang, Lili | University of California, Irvine |
Co-Chair: Becis-Aubry, Yasmina | Univ. of Orléans |
|
10:00-10:20, Paper FrA14.1 | Add to My Program |
Split-Spectrum Based Distributed Estimator for a Continuous-Time Linear System on a Time-Varying Graph |
|
Wang, Lili | Purdue University |
Liu, Ji | Stony Brook University |
Anderson, Brian D.O. | Australian National University |
Morse, A. Stephen | Yale Univ |
Keywords: Observers for Linear systems, Distributed control, Autonomous systems
Abstract: A simply structured distributed estimator is described for estimating the state of a continuous-time, jointly observable, input free, multi-channel linear system whose sensed outputs are distributed across a fixed multi-agent network. The estimator is then extended to non-stationary networks whose neighbor graphs switch according to a switching signal with a dwell time, or switch arbitrarily under appropriate assumptions. The estimator is guaranteed to solve the problem, provided a network-widely shared gain is sufficiently large. The lower bound of the gain is derived. This is accomplished by appealing to the “split-spectrum” approach and exploiting several well-known properties of invariant subspace. The proposed estimators are inherently resilient to abrupt changes in the number of agents and communication links in the inter-agent communication graph upon which the algorithms depend, provided the network is redundantly strongly connected and redundantly jointly observable.
|
|
10:20-10:40, Paper FrA14.2 | Add to My Program |
Minimum-Volume Set-Membership State Estimation of Time Varying Constrained Systems with Sporadic Measurements |
|
Becis-Aubry, Yasmina | Univ. of Orléans |
Ramdani, Nacim | University of Orléans |
Keywords: Observers for Linear systems, Estimation
Abstract: This paper presents a recursive ellipsoidal set-membership state estimation algorithm for discrete-time linear time-varying (LTV) models with additive bounded disturbances affecting state evolution and sporadic measurement equations. The state vector is subject to linear equality and/or inequality constraints, which are mathematically viewed as additional measurements. A novel approach is developed considering the unprecedented fact that, owing to equality constraints, the ellipsoid characterizing all possible values of the state vector has a zero volume and its shape matrix is non invertible. A new size criterion, the pseudo-volume, is introduced and minimized in both the prediction and correction phases.
|
|
10:40-11:00, Paper FrA14.3 | Add to My Program |
Simultaneous State and Unknown Input Interval Observer for Discrete-Time Linear Switched Systems Using Peak-To-Peak Analysis |
|
Marouani, Ghassen | University of Monastir |
Dinh, Thach N. | CNAM Paris |
Wang, Zhenhua | Harbin Institute of Technology |
Ping, Xubin | Xidian University |
Raïssi, Tarek | Conservatoire National Des Arts Et Métiers |
Keywords: Observers for Linear systems, LMIs, Uncertain systems
Abstract: This paper investigates interval observer design for uncertain discrete-time linear switched systems under unknown inputs and a known switching signal. The approach introduces weighting matrices which allow one to relax design difficulties caused by classical coordinate transformation. To improve the accuracy, an L_{infty} method minimizing the peak-to-peak gain is employed to reduce the influence of unknown uncertainties. The interval observer gains are computed by solving Linear Matrix Inequality (LMI) formulated based on multiple quadratic Lyapunov functions under average dwell time switching signals.
|
|
11:00-11:20, Paper FrA14.4 | Add to My Program |
Necessary and Sufficient Conditions for Simultaneous State and Input Recovery of Linear Systems with Sparse Inputs by L1-Minimization |
|
Poe, Kyle | Johns Hopkins University |
Mallada, Enrique | Johns Hopkins University |
Vidal, Rene | Johns Hopkins University |
Keywords: Observers for Linear systems, Optimization algorithms, Information theory and control
Abstract: The study of theoretical conditions for recovering sparse signals from compressive measurements has received a lot of attention in the research community. In parallel, there has been a great amount of work characterizing conditions for the recovery of both the state and the input to a linear dynamical system (LDS), including a handful of results on recovering sparse inputs. However, existing sufficient conditions for recovering sparse inputs to an LDS are conservative and hard to interpret, while necessary and sufficient conditions have not yet appeared in the literature. In this work, we provide (1) the first characterization of necessary and sufficient conditions for the existence and uniqueness of sparse inputs to an LDS, (2) the first necessary and sufficient conditions for a linear program to recover both an unknown initial state and a sparse input, and (3) simple, interpretable recovery conditions in terms of the LDS parameters. We conclude with a numerical validation of these claims and discuss implications and future directions.
|
|
11:20-11:40, Paper FrA14.5 | Add to My Program |
Robust Reference Tracking of Linear Uncertain Systems Via Uncertainty Estimation and Composite Control |
|
Yang, Jun | National University of Singapore |
Jia, Xinyu | National University of Singapore |
Hou, Zhimin | National University of Singapore |
Pan, Yongping | Sun Yat-Sen University |
Yu, Haoyong | National University of Singapore |
Keywords: Observers for Linear systems, Robust control, Uncertain systems
Abstract: For linear systems with uncertainties and external disturbances, we present an uncertainty estimation and composite control (UECC) to achieve reference tracking and uncertainty estimation simultaneously. In this proposed UECC, we extract the uncertainty information from the error dynamics equation instead of the system dynamics. By reformulating the error dynamics equation as an algebraic equation using auxiliary variables, the need to measure state derivatives in the estimator and controller design is eliminated. Unlike time-delay control (TDC) and uncertainty and disturbance estimator (UDE) methods, we avoid the use of time delay and additional filtering operations to circumvent the noise amplification and oscillations in the control signal. Comparative simulations are provided to verify the effectiveness of the proposed method.
|
|
11:40-12:00, Paper FrA14.6 | Add to My Program |
Synthesis of Robust State Estimation Algorithms under Unknown Sensor Inputs |
|
Khan, Shiraz | Purdue University |
Pant, Kartik Anand | Purdue University |
Hwang, Inseok | Purdue University |
|
FrA15 Regular Session, Roselle Junior 4611 |
Add to My Program |
Robust Control III |
|
|
Chair: Bhowmick, Parijat | Indian Institute of Technology Guwahati |
Co-Chair: Lessard, Laurent | Northeastern University |
|
10:00-10:20, Paper FrA15.1 | Add to My Program |
Robust Data-Driven Safe Control Using Density Functions |
|
Zheng, Jian | Northeastern University |
Dai, Tianyu | Northeastern University |
Miller, Jared | Northeastern University |
Sznaier, Mario | Northeastern University |
Keywords: Robust control, Uncertain systems, Optimization
Abstract: This paper presents a tractable framework for data-driven synthesis of robustly safe control laws. Given noisy experimental data and some priors about the structure of the system, the goal is to synthesize a state feedback law such that the trajectories of the closed loop system are guaranteed to avoid an unsafe set even in the presence of unknown but bounded disturbances (process noise). The main result of the paper shows that for polynomial dynamics, this problem can be reduced to a tractable convex optimization by combining elements from polynomial optimization and the theorem of alternatives. This optimization provides both a rational control law and a density function safety certificate. These results are illustrated with numerical examples.
|
|
10:20-10:40, Paper FrA15.2 | Add to My Program |
Safe and Robust Stabilization of Uncertain Nonlinear Systems Via Control Lyapunov-Barrier Function and Disturbance Observer: A Preliminary Study |
|
Byeon, Sunseok | University of Seoul |
Park, Gyunghoon | University of Seoul |
Keywords: Robust control, Uncertain systems, Stability of nonlinear systems
Abstract: In this paper, we address the problem of safe and robust stabilization for a class of uncertain nonlinear systems. The key idea is to employ the disturbance observer (DOB) to a nominal safety-critical controller designed for the control Lyapunov-barrier function (CLBF). The DOB estimates and compensates the lumped disturbance that represents all the effect of model uncertainty and disturbance to the system approximately but as accurately as possible. As a result, only a small perturbation remains in the control loop, which can be dealt with as long as the nominal closed-loop system is input-tostate safe (ISSf) in a sense. To ensure the ISSf property without restriction on the CLBF, we propose a modified version of the Sontag’s universal formula as a nominal controller. This preliminary study verifies the validity of the proposed approach for 2nd-order nonlinear systems, but with mathematical analysis and simulations for the inverted pendulum on a cart.
|
|
10:40-11:00, Paper FrA15.3 | Add to My Program |
Closing the Loop on Runtime Monitors with Fallback-Safe MPC |
|
Sinha, Rohan | Stanford University |
Schmerling, Edward | Stanford University |
Pavone, Marco | Stanford University |
Keywords: Robust control, Vision-based control, Fault tolerant systems
Abstract: When we rely on deep-learned models for robotic perception, we must recognize that these models may behave unreliably on inputs dissimilar from the training data, compromising the closed-loop system's safety. This raises fundamental questions on how we can assess confidence in perception systems and to what extent we can take safety-preserving actions when external environmental changes degrade our perception model's performance. Therefore, we present a framework to certify the safety of a perception-enabled system deployed in novel contexts. To do so, we leverage robust model predictive control (MPC) to control the system using the perception estimates while maintaining the feasibility of a safety-preserving fallback plan that does not rely on the perception system. In addition, we calibrate a runtime monitor using recently proposed conformal prediction techniques to certifiably detect when the perception system degrades beyond the tolerance of the MPC controller, resulting in an end-to-end safety assurance. We show that this control framework and calibration technique allows us to certify the system's safety with orders of magnitudes fewer samples than required to retrain the perception network when we deploy in a novel context on a photo-realistic aircraft taxiing simulator. Furthermore, we illustrate the safety-preserving behavior of the MPC on simulated examples of a quadrotor. We open-source our simulation platform and provide videos of our results at our project page: https://tinyurl.com/fallback-safe-mpc.
|
|
11:00-11:20, Paper FrA15.4 | Add to My Program |
Real-Time Optimisation-Based Robust Control: Heat Exchanger Comparative Analysis |
|
Horváthová, Michaela | Slovak University of Technology in Bratislava |
Galčíková, Lenka | Faculty of Chemical and Food Technology, Slovak University of Te |
Klauco, Martin | Slovak University of Technology in Bratislava |
Oravec, Juraj | Slovak University of Technology in Bratislava |
Keywords: Process Control, Robust control, Optimal control
Abstract: This paper investigates a possibility to improve the control performance of a laboratory-scale heat exchanger by introducing the convex-lifting-based robust controller into the closed-loop system. Robust model predictive control (MPC) design serves as the relevant reference control strategy. The improvements are expected in both, increased performance of the control trajectories and, simultaneously, reduced computational complexity. The performance is analyzed subject to the reference tracking implemented on the laboratory plate heat exchanger. This plant has a nonlinear and asymmetric behavior, affected by various uncertain parameters. The case study investigates the robust MPC, tunable convex-lifting-based robust control, and convex-lifting-based robust control with approximated control law. This paper also extends the convex-lifting-based robust control with approximated control law to provide the robust stability guarantees. The experimental case study evaluates and analyses various apprehensible criteria, e.g., energy consumption, carbon footprint, and computational demands.
|
|
11:20-11:40, Paper FrA15.5 | Add to My Program |
On State-Space Characterisation for Output Negative Imaginary Systems with Possible Poles at the Origin and Their Internal Stability Result (I) |
|
Devi, Salam Athoibi | Indian Institute of Technology Guwahati |
Bhowmick, Parijat | Indian Institute of Technology Guwahati |
Lanzon, Alexander | University of Manchester |
Keywords: Robust control, Stability of linear systems, LMIs
Abstract: This paper derives a new and unifying state-space characterisation for the entire class of real, rational, proper Output Negative Imaginary (ONI) systems, allowing poles on the imaginary axis even at the origin. The proposed result captures the existing versions of the NI state-space characterisations, particularly the ones that apply to the NI systems with poles at the origin. A necessary and sufficient LMI condition has been derived to test the strict/non-strict ONI properties of an LTI system with a given minimal state-space realisation. The LMI-based characterisation offers easy and convenient execution due to the easily accessible SDP solver packages. Finally, a necessary and sufficient internal stability theorem is also derived for a positive feedback ONI systems interconnection containing pole(s) at the origin. The proposed stability result specialises to the earlier versions when the earlier assumptions are imposed. Numerical examples are given to show the usefulness of the proposed theoretical results.
|
|
11:40-12:00, Paper FrA15.6 | Add to My Program |
Guaranteed Stability Margins for Decentralized Linear Quadratic Regulators |
|
Kashyap, Mruganka | Northeastern University |
Lessard, Laurent | Northeastern University |
Keywords: Decentralized control, Robust control, Distributed control
Abstract: It is well-known that linear quadratic regulators (LQR) enjoy guaranteed stability margins, whereas linear quadratic Gaussian regulators (LQG) do not. In this letter, we consider systems and compensators defined over directed acyclic graphs. In particular, there are multiple decision-makers, each with access to a different part of the global state. In this setting, the optimal LQR compensator is dynamic, similar to classical LQG. We show that when sub-controller input costs are decoupled (but there is possible coupling between sub-controller state costs), the decentralized LQR compensator enjoys similar guaranteed stability margins to classical LQR. However, these guarantees disappear when cost coupling is introduced.
|
|
FrA16 Regular Session, Peony Junior 4512 |
Add to My Program |
Smart Grid II |
|
|
Chair: Zheng, Wei Xing | Western Sydney University |
Co-Chair: Liu, Mingxi | University of Utah |
|
10:00-10:20, Paper FrA16.1 | Add to My Program |
Deep Learning for Scalable Optimal Design of Incremental Volt/VAR Control Rules |
|
Gupta, Sarthak | Virginia Tech |
Mehrizi-Sani, Ali | Virginia Tech |
Chatzivasileiadis, Spyros | Technical University of Denmark |
Kekatos, Vassilis | Virginia Tech |
Keywords: Smart grid, Optimization, Neural networks
Abstract: Volt/VAR control rules enable distributed energy resources (DER) to autonomously regulate voltage in distribution grids. The Volt/VAR rules provisioned by the IEEE Standard 1547 take on a piecewise-linear shape. However, its maximum slope is upper bounded to ensure stability, and that may hamper its voltage regulation performance. This limitation can be surpassed by adding a memory term to the control rule, and thus, obtaining a so-termed incremental control rule. This letter aims to optimally customize the shape of incremental rules across buses to attain desirable voltage profiles. Albeit this task can be posed as a bilevel program, we pursue a more scalable approach by reformulating it as a deep learning task. The idea is that Volt/VAR dynamics can be captured by a recursive neural network (RNN). Interestingly, the RNN weights correspond to the parameters of the control rule; the RNN input to the grid loading conditions; and the RNN output to the equilibrium voltages. Therefore, the optimal rule parameters can be found upon training the RNN so its output (equilibrium voltages) approach unity. Training is performed by feeding the RNN with representative scenarios of the anticipated grid loading conditions. The RNN depth depends on the settling time of Volt/VAR dynamics. Because the discrete-time Volt/VAR dynamics can be viewed as iterations of a proximal gradient descent (PGD) algorithm, we also leverage Nesterov's accelerated PGD iterations to reduce the RNN depth. The RNN is never implemented in the field. Training this RNN is equivalent to solving the optimal rule design in a more computationally efficient manner. Analytical findings and numerical tests corroborate that the proposed solution can be neatly adapted to single- and multi-phase feeders. The proposed approach could be of general interest in designing piecewise-linear controllers acting on linear plants.
|
|
10:20-10:40, Paper FrA16.2 | Add to My Program |
On Privacy Preservation of Electric Vehicle Charging Control Via State Obfuscation |
|
Huo, Xiang | University of Utah |
Liu, Mingxi | University of Utah |
Keywords: Smart grid, Optimization algorithms, Control Systems Privacy
Abstract: The electric vehicle (EV) industry is rapidly evolving owing to advancements in smart grid technologies and charging control strategies. While EVs are promising in decarbonizing the transportation system and providing grid services, their widespread adoption has led to notable and erratic load injections that can disrupt the normal operation of power grid. Additionally, the unprotected collection and utilization of personal information during the EV charging process cause prevalent privacy issues. To address the scalability and data confidentiality in large-scale EV charging control, we propose a novel decentralized privacy-preserving EV charging control algorithm via state obfuscation that 1) is scalable w.r.t. the number of EVs and ensures optimal EV charging solutions; 2) achieves privacy preservation in the presence of honest-but-curious adversaries and eavesdroppers; and 3) is applicable to eliminate privacy concerns for general multi-agent optimization problems in large-scale cyber-physical systems. The EV charging control is structured as a constrained optimization problem with coupled objectives and constraints, then solved in a decentralized fashion. Privacy analyses and simulations demonstrate the efficiency and efficacy of the proposed approach.
|
|
10:40-11:00, Paper FrA16.3 | Add to My Program |
Fast Distributed Resource Allocation of Smart Grid: A Zeroth-Order Optimization Algorithm |
|
Luan, Meng | Southeast University |
Wen, Guanghui | Southeast University |
Zheng, Wei Xing | Western Sydney University |
Keywords: Smart grid, Optimization algorithms
Abstract: This paper investigates a multi-objective distributed resource allocation problem, where the economic cost including the transmission loss, and the environmental pollution are taken into account simultaneously. To settle this problem, a Pareto-based zeroth-order fast distributed optimization algorithm is proposed, which can always balance the overall energy demand with generation. In the algorithm design, the acceleration idea of the momentum method is tailored for the gradient estimation update, which gives a more accurate descent direction. Moreover, the unknown effect causes the gradient of the objective function to be unavailable and only the function values to be observed. Different from the gradient-based methods, a zeroth-order method is proposed to solve the distributed resource allocation problem with gradient estimation. Furthermore, the convergence of the designed algorithm is proved theoretically, and the convergence rate of linear speedup can be achieved. Finally, numerical simulations verify the validity and applicability of the proposed algorithm.
|
|
11:00-11:20, Paper FrA16.4 | Add to My Program |
Multi-Objective Optimal Dispatching for Heterogeneous Multienergy Ship Microgrid (I) |
|
Ke, Shang | Yanshan University |
Li, Xiaolei | Yanshan University |
Luo, Xiaoyuan | Yanshan University |
Wang, Jiange | Yanshan University |
Xu, Qianwen | KTH Royal Institute of Technology |
Keywords: Smart grid, Optimization algorithms, Energy systems
Abstract: With the growth of energy and transportation demand, the integrated energy dispatching of ship power grid has become the focus of researchers. The optimization technique is used to reduce the total energy consumption and pollutant emissions of ships, optimizing the ship power generation planning. The purpose is to achieve environmental protection and energy saving while ensuring the continuous and reliable power supply of ships. However, heterogeneous ship microgrid poses new challenges to integrated energy dispatch. This paper proposes an integrated energy scheduling scheme that integrates photovoltaic, wind power, diesel engine, gas turbine, and battery for a heterogeneous multienergy ship microgrid. Under the system constraints, a multi-objective optimal scheduling model including operating costs and pollutant emissions is established, then the gravity search algorithm is applied to solve such an issue. The simulation results show that the scheme can effectively reduce the cost of energy consumption and pollutant emissions of ships, improving the economy, reliability and energy conservation, which verify the advantages of the proposed scheme.
|
|
11:20-11:40, Paper FrA16.5 | Add to My Program |
An Exact Characterisation of Flexibility in Populations of Electric Vehicles |
|
Mukhi, Karan | University of Oxford |
Abate, Alessandro | University of Oxford |
Keywords: Smart grid, Power systems, Energy systems
Abstract: Increasing penetrations of electric vehicles (EVs) presents a large source of flexibility, which can be used to assist balancing the power grid. The flexibility of an individual EV can be quantified as a convex polytope and the flexibility of a population of EVs is the Minkowski sum of these polytopes. In general computing the exact Minkowski sum is intractable. However, exploiting symmetry in a restricted but significant case, enables an efficient computation of the aggregate flexibility. This results in a polytope with exponentially many vertices and facets with respect to the time horizon. We show how to use a lifting procedure to provide a representation of this polytope with a reduced number of facets, which makes optimising over more tractable. Finally, a disaggregation procedure that takes an aggregate signal and computes dispatch instructions for each EV in the population is presented. The complexity of the algorithms presented is independent of the size of the population and polynomial in the length of the time horizon. We evaluate this work against existing methods in the literature, and show how this method guarantees optimality with lower computational burden than existing methods.
|
|
11:40-12:00, Paper FrA16.6 | Add to My Program |
Reducing Aggregate Electric Vehicle Battery Capacity through Sharing |
|
Alexeenko, Polina | Cornell University |
Charisopoulos, Vasileios | Cornell University |
Keywords: Smart grid, Power systems, Optimization
Abstract: Meeting demand for automotive battery resources is predicted to be costly from both economic and environmental perspectives. To minimize costs, battery resources should be deployed as efficiently as possible. A potential source of inefficiency in battery deployment is that the batteries of personal vehicles are typically much larger than necessary to meet most daily mobility needs. In this paper, we consider whether battery resources can be used more efficiently in a setting where drivers, in addition to having personal vehicle batteries, have access to a shared battery resource. More precisely, we consider the problem of minimizing aggregate battery capacity in settings with and without a shared resource subject to the requirement that driver commuting needs are met with high reliability. To assess capacity reduction potential, we quantify the difference in deployed battery capacity in settings with and without a shared resource in a case study using real-world longitudinal mobility data from Puget Sound, Washington. We find that access to a shared battery resource can substantially reduce deployed battery capacity. Furthermore, relative reductions in battery capacity increase with number of drivers and the level of reliability desired.
|
|
FrA17 Regular Session, Peony Junior 4511 |
Add to My Program |
Statistical Learning I |
|
|
Chair: Mahajan, Aditya | McGill University |
Co-Chair: Li, Tao | East China Normal University |
|
10:00-10:20, Paper FrA17.1 | Add to My Program |
Multi-Agent Reachability Calibration with Conformal Prediction |
|
Muthali, Anish | University of California, Berkeley |
Shen, Haotian | University of California, Berkeley |
Deglurkar, Sampada | University of California, Berkeley |
Lim, Michael H. | University of California, Berkeley |
Roelofs, Rebecca | University of California, Berkeley |
Faust, Aleksandra | Google |
Tomlin, Claire J. | UC Berkeley |
Keywords: Statistical learning, Autonomous vehicles, Iterative learning control
Abstract: We investigate methods to provide safety assurances for autonomous agents that incorporate predictions of other, uncontrolled agents' behavior into their own trajectory planning. Given a learning-based forecasting model that predicts agents' trajectories, we introduce a method for providing probabilistic assurances on the model's prediction error with calibrated confidence intervals. Through quantile regression, conformal prediction, and reachability analysis, our method generates probabilistically safe and dynamically feasible prediction sets. We showcase their utility in certifying the safety of planning algorithms, both in simulations using actual autonomous driving data and in an experiment with Boeing vehicles.
|
|
10:20-10:40, Paper FrA17.2 | Add to My Program |
Error Analysis of Regularized Trigonometric Linear Regression with Unbounded Sampling: A Statistical Learning Viewpoint |
|
Scampicchio, Anna | ETH Zurich |
Arcari, Elena | ETH Zurich |
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Statistical learning, Estimation
Abstract: The effectiveness of non-parametric, kernel-based methods for function estimation comes at the price of high computational complexity, which hinders their applicability in adaptive, model-based control. Motivated by approximation techniques based on sparse spectrum Gaussian processes, we focus on models given by regularized trigonometric linear regression. This paper provides an analysis of the performance of such an estimation set-up within the statistical learning framework. In particular, we derive a novel bound for the sample error in finite-dimensional spaces, accounting for noise with potentially unbounded support. Next, we study the approximation error and discuss the bias-variance trade-off as a function of the regularization parameter by combining the two bounds.
|
|
10:40-11:00, Paper FrA17.3 | Add to My Program |
Online Learning in Reproducing Kernel Hilbert Space with Non-IID Data |
|
Zhang, Xiwei | East China Normal University |
Li, Tao | East China Normal University / New York University Shanghai |
Keywords: Statistical learning, Distributed parameter systems, Adaptive systems
Abstract: We analyze the convergence of online regularized learning algorithm based on dependent and non-stationary online data streams for the nonparametric regression problem in reproducing kernel Hilbert space (RKHS). We show that the algorithm achieves mean-square convergence if the algorithm gain and regularization parameter are chosen appropriately, the online data streams are weakly dependent and satisfy the eigenvalue-wise persistence of excitation condition. Especially, for the case with independent but non-identically distributed online data streams, we give more intuitive convergence conditions on the drifts of the probability measures induced by the data.
|
|
11:00-11:20, Paper FrA17.4 | Add to My Program |
Estimation of Models with Limited Data by Leveraging Shared Structure |
|
Rui, Maryann | Massachusetts Institute of Technology |
Horel, Thibaut | MIT |
Dahleh, Munther A. | Massachusetts Inst. of Tech |
Keywords: Statistical learning, Identification, Learning
Abstract: Modern data sets, such as those in healthcare and e-commerce, are often derived from many individuals or systems but have insufficient data from each source alone to separately estimate individual, often high-dimensional, model parameters. If there is shared structure among systems however, it may be possible to leverage data from other systems to help estimate individual parameters, which could otherwise be non-identifiable. In this paper, we assume systems share a latent low-dimensional parameter space and propose a method for recovering d-dimensional parameters for N different linear systems, when there are only T
|
|
11:20-11:40, Paper FrA17.5 | Add to My Program |
On the Hardness of Learning to Stabilize Linear Systems |
|
Zeng, Xiong | University of Michigan, Ann Arbor |
Liu, Zexiang | University of Michigan |
Du, Zhe | University of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Sznaier, Mario | Northeastern University |
Keywords: Statistical learning, Linear systems, Uncertain systems
Abstract: Inspired by the work of Tsiamis et al. [1], in this paper we study the statistical hardness of learning to stabilize linear time-invariant systems. Hardness is measured by the number of samples required to achieve a learning task with a given probability. The work in [1] shows that there exist system classes that are hard to learn to stabilize with the core reason being the hardness of identification. Here we present a class of systems that can be easy to identify, thanks to a non-degenerate noise process that excites all modes, but the sample complexity of stabilization still increases exponentially with the system dimension. We tie this result to the hardness of co-stabilizability for this class of systems using ideas from robust control.
|
|
11:40-12:00, Paper FrA17.6 | Add to My Program |
Relative Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems |
|
Sayedana, Borna | McGill University |
Afshari, Mohammad | McGill University |
Caines, Peter E. | McGill University |
Mahajan, Aditya | McGill University |
Keywords: Statistical learning, Machine learning, Adaptive control
Abstract: In this paper, we consider learning and control problem in an unknown Markov jump linear system (MJLS) with perfect state observations. We first establish a generic upper bound on regret for any learning based algorithm. We then propose a certainty equivalence-based learning algorithm and show that this algorithm achieves a regret of O(sqrt(T)log(T)) relative to a certain subset of the sample space. As part of our analysis, we revisit the switched least squares system identification algorithm of [1], [2] for autonomous MJLS and generalize it to controlled MJLS, establishing strong consistency and almost sure rates of convergence of this method.
|
|
FrA18 Regular Session, Peony Junior 4412 |
Add to My Program |
Stability of Nonlinear Systems III |
|
|
Chair: Efimov, Denis | Inria |
Co-Chair: Mauroy, Alexandre | University of Namur |
|
10:00-10:20, Paper FrA18.1 | Add to My Program |
Homogeneity with Respect to a Part of Variables and Accelerated Stabilization |
|
Efimov, Denis | Inria |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Stability of nonlinear systems, Nonholonomic systems
Abstract: The problem of transforming a locally asymptotically stabilizing time-varying control law to a globally stabilizing one with accelerated finite/fixed-time convergence is studied. The solution is based on an extension of the theory of homogeneous systems to the setting where the symmetry and stability properties only hold with respect to a part of the state variables. The proposed control design advances the kind of approaches first studied in [M'Closkey&Murray,1997], and relies on the implicit Lyapunov function framework. Examples of finite-time and nearly fixed-time stabilization of a nonholonomic integrator are reported.
|
|
10:20-10:40, Paper FrA18.2 | Add to My Program |
On the Computation of Principal Eigenfunctions of the Koopman Operator |
|
Deka, Shankar | KTH Royal Institute of Technology, Sweden |
Krishnamoorthy Shankara Narayanan, Sriram Sundar | Clemson University |
Vaidya, Umesh | Clemson University |
Keywords: Stability of nonlinear systems, Lyapunov methods, Neural networks
Abstract: The paper is about the computation of the principal spectrum of the Koopman operator (i.e., eigenvalues and eigenfunctions). The principal eigenfunctions of the Koopman operator are the ones with the corresponding eigenvalues equal to the eigenvalues of the linearization of the nonlinear system at an equilibrium point. The main contribution of this paper is to provide a novel approach for computing the principal eigenfunctions using a path-integral formula. Furthermore, we provide conditions based on the stability property of the dynamical system and the eigenvalues of the linearization towards computing the principal eigenfunction using the path-integral formula. Further, we provide a Deep Neural Network framework that utilizes our proposed path-integral approach for eigenfunction computation in high-dimension systems. Finally, we present simulation results for the computation of principal eigenfunction and demonstrate their application for determining the stable and unstable manifolds and constructing the Lyapunov function.
|
|
10:40-11:00, Paper FrA18.3 | Add to My Program |
Design of Controls for Boundedness of Trajectories of Multistable State Periodic Systems |
|
Mendoza-Avila, Jesus | INRIA Lille-Nord Europe |
Efimov, Denis | Inria |
Mercado Uribe, José Angel | Brandenburgische Technische Universität Cottbus-Senftenberg |
Schiffer, Johannes | Brandenburg University of Technology |
Keywords: Stability of nonlinear systems, Nonlinear systems, Lyapunov methods
Abstract: In this paper, the concept of control Leonov functions is introduced, and it is shown that their information is enough to design continuous and periodic controllers that provide boundedness of the state for a class of multistable state-periodic systems. These feedback control laws are based on a mild adaptation of Sontag's universal formula and a kind of small control property. The proposed method is illustrated via application in a microgrid.
|
|
11:00-11:20, Paper FrA18.4 | Add to My Program |
On Systematic Criteria for the Global Stability of Nonlinear Systems Via the Koopman Operator Framework |
|
Zagabe, Christian Mugisho | University of Namur |
Mauroy, Alexandre | University of Namur |
Keywords: Stability of nonlinear systems, Lyapunov methods, Nonlinear systems
Abstract: We present novel sufficient conditions for the global stability of an equilibrium in the case of nonlinear dynamics with analytic vector fields. These conditions provide stability criteria that are directly expressed in terms of the Taylor expansion coefficients of the vector field (e.g. in terms of first order coefficients, maximal coefficient, sum of coefficients). Our main assumptions is that the vector field components be holomorphic, and the linearized system be locally exponentially stable and diagonalizable. These results are based on the properties of the Koopman operator defined on the Hardy space on the polydisc.
|
|
11:20-11:40, Paper FrA18.5 | Add to My Program |
Global Asymptotic Stabilization of Time-Invariant Bilinear Non-Homogeneous Complex Systems |
|
Zaitsev, Vasilii | Udmurt State University |
Keywords: Stability of nonlinear systems, Lyapunov methods
Abstract: The problem of global asymptotic stabilization by state feedback is considered for time-invariant bilinear non-homogeneous control systems in the complex space. For such systems, the possibility of applying the second Lyapunov method is proved, which is not valid for general nonlinear complex systems. The approach uses the Barbashin--Krasovsky--La Salle theorem on global asymptotic stability. Sufficient conditions for global asymptotic stabilization of a bilinear non-homogeneous complex system by real state feedback are obtained. Finally, an example of using the obtained results is presented.
|
|
11:40-12:00, Paper FrA18.6 | Add to My Program |
A Recurrence-Based Direct Method for Stability Analysis and GPU-Based Verification of Non-Monotonic Lyapunov Functions |
|
Siegelmann, Roy | Johns Hopkins University |
Shen, Yue | Johns Hopkins University |
Paganini, Fernando | Universidad ORT Uruguay |
Mallada, Enrique | Johns Hopkins University |
Keywords: Stability of nonlinear systems, Numerical algorithms, Lyapunov methods
Abstract: Lyapunov's direct method is a powerful tool that provides a rigorous framework for stability analysis and control design for dynamical systems. A critical step that enables the application of the method is the existence of a Lyapunov function V---a function whose value monotonically decreases along the trajectories of the dynamical system. Unfortunately, finding a Lyapunov function is often tricky and requires ingenuity, domain knowledge, or significant computational power. At the core of this challenge is the fact that the method requires every sub-level set of V (V_{leq c}) to be forward invariant, thus implicitly coupling the geometry of V_{leq c} and the trajectories of the system. In this paper, we seek to disentangle this dependence by developing a direct method that substitutes the concept of invariance with the more flexible notion of recurrence. A set is (tau-)recurrent if every trajectory that starts in the set returns to it (within tau seconds). We show that, under mild conditions, the recurrence of sub-level sets V_{leq c} is sufficient to guarantee stability and introduce the appropriate stronger notions to obtain asymptotic stability and exponential stability. We further provide a GPU-based algorithm to verify whether V satisfies such recurrence conditions up to an arbitrarily small neighborhood of the equilibrium.
|
|
FrA19 Regular Session, Peony Junior 4411 |
Add to My Program |
Predictive Control for Linear Systems II |
|
|
Chair: Allgöwer, Frank | University of Stuttgart |
Co-Chair: Schulze Darup, Moritz | TU Dortmund University |
|
10:00-10:20, Paper FrA19.1 | Add to My Program |
Implicit Predictors in Regularized Data-Driven Predictive Control |
|
Klädtke, Manuel | TU Dortmund University |
Schulze Darup, Moritz | TU Dortmund University |
Keywords: Predictive control for linear systems, Optimal control, Subspace methods
Abstract: We introduce the notion of implicit predictors, which characterize the input-(state)-output prediction behavior underlying a predictive control scheme, even if it is not explicitly enforced as an equality constraint (as in traditional model or subspace predictive control). To demonstrate this concept, we derive and analyze implicit predictors for some basic data-driven predictive control (DPC) schemes, which offers a new perspective on this popular approach that may form the basis for modified DPC schemes and further theoretical insights.
|
|
10:20-10:40, Paper FrA19.2 | Add to My Program |
Towards Grassmanian Dimensionality Reduction in MPC |
|
Schurig, Roland | TU Darmstadt, Control and Cyber-Physical Systems Laboratory |
Himmel, Andreas | TU Darmstadt |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for linear systems, Optimization, Model/Controller reduction
Abstract: Model predictive control presents remarkable po- tential for the optimal control of dynamic systems. However, the necessity for an online solution to an optimal control problem often renders it impractical for control systems with limited computational capabilities. To address this issue, specialized dimensionality reduction techniques designed for optimal con- trol problems have been proposed. In this paper, we introduce a methodology for designing a low-dimensional subspace that provides an ideal representation for a predefined finite set of high-dimensional optimizers. By characterizing the subspace as an element of a specific Riemannian manifold, we leverage the unique geometric structure of the subspace. Subsequently, the optimal subspace is identified through optimization on the Riemannian manifold. The dimensionality reduction for the model predictive control scheme is achieved by confining the search space to the optimized low-dimensional subspace, enhancing both efficiency and applicability.
|
|
10:40-11:00, Paper FrA19.3 | Add to My Program |
Efficient Computation of Lipschitz Constants for MPC with Symmetries |
|
Teichrib, Dieter | TU Dortmund University |
Schulze Darup, Moritz | TU Dortmund University |
Keywords: Predictive control for linear systems, Optimization, Optimal control
Abstract: Lipschitz constants for linear MPC are useful for certifying inherent robustness against unmodeled disturbances or robustness for neural network-based approximations of the control law. In both cases, knowing the minimum Lipschitz constant leads to less conservative certifications. Computing this minimum Lipschitz constant is trivial given the explicit MPC. However, the computation of the explicit MPC may be intractable for complex systems. The paper discusses a method for efficiently computing the minimum Lipschitz constant without using the explicit control law. The proposed method simplifies a recently presented mixed-integer linear program (MILP) that computes the minimum Lipschitz constant. The simplification is obtained by exploiting saturation and symmetries of the control law and irrelevant constraints of the optimal control problem.
|
|
11:00-11:20, Paper FrA19.4 | Add to My Program |
On Stochastic MPC Formulations with Closed-Loop Guarantees: Analysis and a Unifying Framework |
|
Köhler, Johannes | ETH Zurich |
Geuss, Ferdinand | Eidgenössische Technische Hochschule Zürich |
Zeilinger, Melanie N. | ETH Zurich |
Keywords: Predictive control for linear systems, Stochastic optimal control, Constrained control
Abstract: We investigate model predictive control (MPC) formulations for linear systems subject to i.i.d. stochastic disturbances with bounded support and chance constraints. Existing stochastic MPC formulations with closed-loop guarantees can be broadly classified in two separate frameworks: i) using robust techniques; ii) feasibility preserving algorithms. We investigate two particular MPC formulations representative for these two frameworks called robust-stochastic MPC and indirect feedback stochastic MPC. We provide a qualitative analysis, highlighting intrinsic limitations of both approaches in different edge cases. Then, we derive a unifying stochastic MPC framework that naturally includes these two formulations as limit cases. This qualitative analysis is complemented with numerical results, showcasing the advantages and limitations of each method.
|
|
11:20-11:40, Paper FrA19.5 | Add to My Program |
Stochastic Model Predictive Control Using Initial State and Variance Interpolation |
|
Schlüter, Henning | University Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Predictive control for linear systems, Stochastic optimal control, Stochastic systems
Abstract: We present a Stochastic Model Predictive Control (SMPC) framework for linear systems subject to Gaussian disturbances. In order to avoid feasibility issues, we employ a recent initialization strategy, optimizing over an interpolation of the initial state between the current measurement and previous prediction. By also considering the variance in the interpolation, we can employ variable-size tubes, to ensure constraint satisfaction in closed-loop. We show that this novel method improves control performance and enables following the constraint closer, then previous methods. Using a DC-DC converter as numerical example we illustrated the improvement over previous methods.
|
|
11:40-12:00, Paper FrA19.6 | Add to My Program |
Model Predictive Control in Partially Observable Multi-Modal Discrete Environments |
|
Rosolia, Ugo | Caltech |
Guastella, Dario Calogero | University of Catania |
Muscato, Giovanni | Universita Degli Studi Di Catania |
Borrelli, Francesco | Unversity of California at Berkeley |
Keywords: Predictive control for linear systems, Uncertain systems, Markov processes
Abstract: Autonomous systems operate in environments that can be observed only through noisy measurements. Thus, controllers should compute actions based on their beliefs about the surroundings. In these settings, we design a Model Predictive Controller (MPC) based on a continuous-state Linear Time-Invariant (LTI) system model operating in a discrete-state environment described by a Hidden Markov Model (HMM). Environment constraints are modeled as chance constraints and environment observations can be asynchronous with system state measurements and controller updates. We show how to approximate the solution of the MPC problem defined over the space of feedback policies by optimizing over a trajectory tree, where each branch is associated with an environment measurement. The proposed approach guarantees chance constraint satisfaction and recursive feasibility. Finally, we test the proposed strategy on navigation examples in partially observable environments, where the proposed MPC guarantees chance constraint satisfaction.
|
|
FrA20 Regular Session, Orchid Junior 4312 |
Add to My Program |
Autonomous Robots |
|
|
Chair: Markdahl, Johan | University of Luxembourg |
Co-Chair: Arslan, Omur | Eindhoven University of Technology |
|
10:00-10:20, Paper FrA20.1 | Add to My Program |
Adaptive Headway Motion Control and Motion Prediction for Safe Unicycle Motion Design |
|
Isleyen, Aykut | Eindhoven University of Technology |
Van De Wouw, Nathan | Eindhoven University of Technology |
Arslan, Omur | Eindhoven University of Technology |
Keywords: Autonomous robots, Nonholonomic systems, Feedback linearization
Abstract: Differential drive robots that can be modeled as a kinematic unicycle are a standard mobile base platform for many service and logistics applications. Safe and smooth autonomous motion around obstacles is a crucial skill for unicycle robots to perform diverse tasks in complex environments. A classical control approach for unicycle control is feedback linearization using a headway point at a fixed headway distance in front of the unicycle. The unicycle headway control brings the headway point to a desired goal location by embedding a linear headway reference dynamics, which often results in an undesired offset for the actual unicycle position. In this paper, we introduce a new unicycle headway control approach with an adaptive headway distance that overcomes this limitation, i.e., when the headway point reaches the goal the unicycle position is also at the goal. By systematically analyzing the closed-loop unicycle motion under the adaptive headway controller, we design analytical feedback motion prediction methods that bound the closed-loop unicycle position trajectory and so can be effectively used for safety assessment and safe unicycle motion design around obstacles. We present an application of adaptive headway motion control and motion prediction for safe unicycle path following around obstacles in numerical simulations.
|
|
10:20-10:40, Paper FrA20.2 | Add to My Program |
Min-Time Coverage in Constricted Environments with Arbitrary Guidepath Networks |
|
Kim, Young-In | ISyE, Georgia Tech |
Reveliotis, Spyros | Georgia Institute of Technology |
Keywords: Autonomous robots, Networked control systems, Optimization
Abstract: In a recent research program, we have undertaken the investigation of robotic traffic management problems arising when a fleet of networked mobile robots is employed in the support of certain coverage tasks that take place in physically constricted environments. But our past investigation of these problems is restricted to the case where the guidepath networks supporting the robot traffic have a dendritic topology. In the current work, we extend the investigation of the considered problems to the case where the underlying guidepath networks have an arbitrary topology. We provide (i) detailed descriptions of the considered problems in this new operational setting, (ii) analytical characterizations of these problems that take the form of integer programming formulations, and (iii) strong combinatorial relaxations for the derived formulations that are applicable to larger problem instances. A numerical experiment presented in the last part of the manuscript demonstrates and assesses the efficacy and the tractability of the analytical developments. We also notice that the undertaken extension of the past results is a nontrivial task, for the reasons that are explained in the manuscript.
|
|
10:40-11:00, Paper FrA20.3 | Add to My Program |
Swarm Bug Algorithms for Path Generation in Unknown Environments |
|
Johansson, Alexander | KTH |
Markdahl, Johan | The Swedish Defence Research Agency |
Keywords: Autonomous robots, Agents-based systems
Abstract: In this paper, we consider the problem of a swarm traveling between two points as fast as possible in an unknown environment cluttered with obstacles. Potential applications include search-and-rescue operations where damaged environments are typical. We present swarm generalizations, called SwarmCom, SwarmBug1, and SwarmBug2, of the classical path generation algorithms Com, Bug1, and Bug2. These algorithms were developed for unknown environments and require low computational power and memory storage, thereby freeing up resources for other tasks. We show the upper bound of the worst-case travel time for the first agent in the swarm to reach the target point for SwarmBug1. For SwarmBug2, we show that the algorithm underperforms in terms of worst-case travel time compared to SwarmBug1. For SwarmCom, we show that there exists a trivial scene for which the algorithm will not halt, and it thus has no performance guarantees. Moreover, by comparing the upper bound of the travel time for SwarmBug1 with a universal lower bound for any path generation algorithm, it is shown that in the limit when the number of agents in the swarm approaches infinity, no other algorithm has strictly better worst-case performance than SwarmBug1 and the universal lower bound is tight.
|
|
11:00-11:20, Paper FrA20.4 | Add to My Program |
Abstraction-Based Motion Coordination Control for Multi-Robot Systems |
|
Pan, Zhuo-Rui | Dalian University of Technology |
Ren, Wei | Dalian University of Technology |
Sun, Xi-Ming | Dalian University of Technology |
Keywords: Autonomous robots, Optimal control, Distributed control
Abstract: This paper studies the motion coordination control problem for multiple mobile robots under a common workspace and reach-avoid tasks. Using abstraction-based techniques, we combine the offline and online control methods to propose a distributed motion coordination control strategy. In the offline control strategy, we partition the workspace to derive the graph for the offline planning, and construct the symbolic abstraction for each robot to design the individual controller offline. In the online control strategy, we provide a detection mechanism to check the existence of the potential robot collision, and implement the constructed symbolic abstraction and graph-searching techniques to resolve the robot collision. The combination of the offline and online control strategies results in the overall motion coordination control strategy for all robots.
|
|
11:20-11:40, Paper FrA20.5 | Add to My Program |
Moving Target Estimation and Active Tracking in Multi-Robot Systems |
|
Xu, Jie | University of California, Riverside |
Zhu, Pengxiang | University of California, Riverside |
Zhang, Yanyu | University of California, Riverside |
Ren, Wei | University of California, Riverside |
Keywords: Autonomous robots, Sensor fusion, Kalman filtering
Abstract: In this paper, we propose a comprehensive solution for 3-D active target tracking with multiple robots in a fully distributed setting. Here multiple robots cooperatively estimate their own states and the target’s state and actively plan their motions to achieve better estimation of the target. For cooperative localization and target state estimation, each robot maintains a state vector consisting of its own state, the target’s state, and its own cloned history states. The challenge of localizing moving robots in 3-D is addressed by using multirobot cooperative visual-inertial odometry algorithm, which improves the estimation accuracy by using environmental common feature measurements. Each robot’s target measurement (if available) and its neighbors’ target estimators are then exploited for estimation updates. To preserve and update the correlations between the target and robot states while limiting the influence of bad target estimates on localization accuracy, the Schmidt-Kalman Filter framework is adopted. For motion planning, a gradient-based approach that uses differentiable field-of-view and potential functions is employed to achieve efficient and accurate active target tracking while avoiding collisions and maintaining communication connectivity. Numerous simulations show that our proposed algorithm provides an accurate and efficient solution for cooperative localization and active target tracking.
|
|
11:40-12:00, Paper FrA20.6 | Add to My Program |
An Integral Sliding–Mode–based Robust Interval Predictive Control for Perturbed Unicycle Mobile Robots |
|
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Mera, Manuel | Esime Upt Ipn |
Raïssi, Tarek | Conservatoire National Des Arts Et Métiers |
Efimov, Denis | Inria |
Keywords: Autonomous robots, Variable-structure/sliding-mode control, Constrained control
Abstract: This paper contributes to the design of a robust control strategy for the trajectory tracking problem in perturbed unicycle mobile robots. The proposed strategy comprises the design of a robust control law, which is based on an Integral Sliding–Mode Control (ISMC) approach together with an interval predictor–based state–feedback controller and a Model Predictive Control (MPC) scheme. The robust controller deals with some perturbations in the kinematic model, and with state and input constraints that are related to restrictions on the workspace and saturated actuators, respectively. The proposed approach guarantees the exponential convergence to zero of the tracking error. Furthermore, the performance of the proposed approach is validated through some simulations.
|
|
FrA21 Regular Session, Orchid Junior 4311 |
Add to My Program |
Extremum-Seeking Control |
|
|
Chair: Guay, Martin | Queens University |
Co-Chair: Wang, Shimin | Queen's University |
|
10:00-10:20, Paper FrA21.1 | Add to My Program |
Exponential Extremum Seeking with Unbiased Convergence |
|
Yilmaz, Cemal Tugrul | UC San Diego |
Diagne, Mamadou | University of California San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Extremum seeking, Optimization
Abstract: We present a multivariable extremum seeking (ES) algorithm for static and dynamic maps that achieves unbiased convergence to the optimum exponentially, referred to as exponential ES. The conventional ES approach, which uses constant amplitude sinusoids, results in steady-state oscillations around the optimum and is unable to guarantee unbiased convergence. In contrast, our ES approach employs exponential decay and growth functions to gradually decrease the amplitude of the perturbation signal and increase the amplitude of the demodulation signal, respectively. This eliminates the steady-state oscillation. To achieve unbiased convergence, we choose an adaptation gain that is sufficiently larger than the decay rate of the perturbation so that the learning process outpaces the perturbation’s waning. The stability analysis is based on state transformation, averaging, and singular perturbation methods applied to the transformed system resulting in local stability of the transformed system as well as local exponential stability of the original system. For numerical simulation, we consider the problem of source seeking by a 2D velocity actuated point.
|
|
10:20-10:40, Paper FrA21.2 | Add to My Program |
An Adaptive Extremum Seeking Scheme for Non-Convex Optimisation |
|
Mimmo, Nicola | University of Bologna |
Marconi, Lorenzo | Univ. Di Bologna |
Keywords: Extremum seeking, Adaptive systems, Optimization
Abstract: The paper presents an extremum-seeking scheme in which the dither is adaptively tuned to deal with non-convex cost functions. The adaptation law decreases the dither when local cost function trends are easily visible from output data. Contrarily, when the cost function does not have a dominant trend, the dither is increased to enrich the output data. This adaptive scheme can give advantages in practical applications when a conservatively large dither implies unnecessary high energy to optimise a cost function corrupted by non-uniform state-dependent disturbances. Numerical comparisons confirm the superior performance of the proposed solution.
|
|
10:40-11:00, Paper FrA21.3 | Add to My Program |
Data-Efficient Static Cost Optimization Via Extremum-Seeking Control with Kernel-Based Function Approximation |
|
Weekers, Wouter | Eindhoven University of Technology |
Saccon, Alessandro | Eindhoven University of Technology |
Van De Wouw, Nathan | Eindhoven University of Technology |
Keywords: Extremum seeking, Machine learning, Data driven control
Abstract: We present a novel type of sampled-data extremum-seeking control (ESC) aimed at speeding up convergence to the optimum and reducing the number of costly performance measurements in practical applications. The approach uses collected output measurements to construct online an approximation of the system’s steady-state performance function using kernel-based function approximation. In regions where this approximation is detected to be sufficiently accurate, the proposed approach utilizes it to determine the search direction and compute a suitable optimizer gain for the update step. In regions where the approximation is not yet accurate, additional data is collected and employed in a ‘standard’ ESC update step, while also using it to refine the approximation of the performance function. By using the approximation of the performance function to determine the search direction and optimizer gain when possible, the number of required performance measurements and parameter update steps can be significantly reduced, e.g., with respectively 75% and 45% in our simulation study involving a static cost function.
|
|
11:00-11:20, Paper FrA21.4 | Add to My Program |
A Robust Time-Delay Approach to Continuous-Time Extremum Seeking for Multi-Variable Static Map |
|
Yang, Xuefei | Harbin Institute of Technology |
Fridman, Emilia | Tel-Aviv Univ |
Keywords: Extremum seeking, Delay systems, Robust control
Abstract: In this article, we introduce a time-delay approach to gradient-based extremum seeking (ES) in the continuous domain for n-dimensional (nD) static quadratic maps. As in the recently introduced (for 2D maps in the continuous domain), we transform the system to the time-delay one (neutral type system). This system is O(varepsilon)-perturbation of the averaged linear ODE system. We further explicitly present the neutral system as the linear ODE, where O(varepsilon)-terms are considered as disturbances with distributed delays of the length of the small parameter varepsilon. Quantitative (for uncertain map) and qualitative (for unknown map) practical stability analyses are provided by employing a variation of constants formula that greatly simplifies the results compared to the previously used Lyapunov-Krasovskii (L-K) method. The new approach also simplifies the conditions and improves the results. Examples from the literature illustrate the efficiency of the new approach, allowing essentially large uncertainty of the Hessian matrix with bounds on varepsilon that are not too small.
|
|
11:20-11:40, Paper FrA21.5 | Add to My Program |
Semi-Global Practical Extremum Seeking with Practical Safety |
|
Williams, Alan | UCSD |
Krstic, Miroslav | University of California, San Diego |
Scheinker, Alexander | Los Alamos National Lab |
Keywords: Extremum seeking, Optimization algorithms
Abstract: We introduce a type of safe extremum seeking (ES) controller, which minimizes an unknown objective function while also maintaining practical positivity of an unknown barrier function. We show semi-global practical asymptotic stability of our algorithm and present an analogous notion of practical safety. The dynamics of the controller are inspired by the quadratic program (QP) based safety filter designs which, in the literature, are more commonly used in cases where the barrier function is known. Conditions on the barrier and objective function are explored showing that non convex problems can be solved. A Lyapunov argument is proposed to achieve the main results of the paper. Finally, an example is given of the algorithm which solves the constrained optimization problem.
|
|
11:40-12:00, Paper FrA21.6 | Add to My Program |
Output Regulation of Second-Order Nonlinear Systems Subject to Unknown Control Direction Using Extremum Seeking Control |
|
Harry, Telema | Queen's University |
Guay, Martin | Queens University |
Wang, Shimin | Queen's University |
Keywords: Extremum seeking, Output regulation, Adaptive systems
Abstract: The Nussbaum gain approach has been the standard technique in solving unknown control direction problems. In this paper, we propose control laws composed of extremum seeking control, an internal model, and a compensator signal to solve the robust practical output regulation problem of a second-order system subject to an unknown control direction. Using the Lie bracket approximation technique, we show that the closed-loop system is bounded and the origin is (epsilon)Semi-global Practical Uniform Asymptotic Stable. Finally, we illustrate the effectiveness of the proposed approach with a numerical example of a Van der Pol system.
|
|
FrA22 Invited Session, Orchid Junior 4212 |
Add to My Program |
Optimal Transport: Theory and Applications in Systems and Control |
|
|
Chair: Chen, Yongxin | Georgia Institute of Technology |
Co-Chair: Ringh, Axel | Chalmers University of Technology and the University of Gothenburg |
Organizer: Chen, Yongxin | Georgia Institute of Technology |
Organizer: Haasler, Isabel | École Polytechnique Fédérale De Lausanne |
Organizer: Karlsson, Johan | KTH Royal Institute of Technology |
Organizer: Ringh, Axel | Chalmers University of Technology and the University of Gothenburg |
|
10:00-10:20, Paper FrA22.1 | Add to My Program |
Mean Field Type Control with Species Dependent Dynamics Via Structured Tensor Optimization |
|
Ringh, Axel | Chalmers University of Technology and the University of Gothenbu |
Haasler, Isabel | École Polytechnique Fédérale De Lausanne |
Chen, Yongxin | Georgia Institute of Technology |
Karlsson, Johan | KTH Royal Institute of Technology |
Keywords: Cooperative control, Stochastic systems, Optimization
Abstract: In this work we consider mean field type control problems with multiple species that have different dynamics. We formulate the discretized problem using a new type of entropy-regularized multimarginal optimal transport problems where the cost is a decomposable structured tensor. A novel algorithm for solving such problems is derived, using this structure and leveraging recent results in entropy-regularized optimal transport. The algorithm is then demonstrated on a numerical example in robot coordination problem for search and rescue, where three different types of robots are used to cover a given area at minimal cost.
|
|
10:20-10:40, Paper FrA22.2 | Add to My Program |
A Matching Principle for Power Transfer in Stochastic Thermodynamics |
|
Movilla Miangolarra, Olga | University of Calfornia, Irvine |
Taghvaei, Amirhossein | University of Washington Seattle |
Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Stochastic optimal control, Stochastic systems
Abstract: Gradients in temperature and particle concentration fuel many processes in the physical and biological world. In the present work we study a thermodynamic engine powered by anisotropic thermal excitation (that may be due to e.g., a temperature gradient), and draw parallels with the well-known principle of impedance matching in circuit theory, where for maximal power transfer, the load voltage needs to be half of that of the supplying power source. We maximize power output of the thermodynamic engine at steady-state and show that the optimal reactive force is precise half of that supplied by the anisotropy.
|
|
10:40-11:00, Paper FrA22.3 | Add to My Program |
Optimal Transport Particle Filters (I) |
|
Al-Jarrah, Mohammad | University of Washington Seattle |
Hosseini, Bamdad | University of Washington Seattle |
Taghvaei, Amirhossein | University of Washington Seattle |
Keywords: Filtering, Optimization, Statistical learning
Abstract: This paper is concerned with the theoretical and computational development of a new class of nonlinear filtering algorithms called optimal transport particle filters (OTPF). The algorithm is based on a recently introduced variational formulation of the Bayes' rule, which aims to find the Brenier optimal transport map between the prior and the posterior distributions as the solution to a stochastic optimization problem. On the theoretical side, the existing methods for the error analysis of particle filters and stability results for optimal transport map estimation are combined to obtain uniform error bounds for the filter's performance in terms of the optimization gap in solving the variational problem. The error analysis reveals a bias-variance trade-off that can ultimately be used to understand if/when the curse of dimensionality can be avoided in these filters. On the computational side, the proposed algorithm is evaluated on a nonlinear filtering example in comparison with the ensemble Kalman filter (EnKF) and the sequential importance resampling (SIR) particle filter.
|
|
11:00-11:20, Paper FrA22.4 | Add to My Program |
Optimal Transport for Correctional Learning (I) |
|
Winqvist, Rebecka | KTH Royal Institute of Technology |
Lourenço, Inês | KTH Royal Institute of Technology |
Quinzan, Francesco | KTH Royal Institute of Technology |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Wahlberg, Bo | KTH Royal Institute of Technology |
Keywords: Identification, Estimation, Learning
Abstract: The contribution of this paper is a generalized formulation of correctional learning using optimal transport, which is about how to optimally transport one mass distribution to another. Correctional learning is a framework developed to enhance the accuracy of parameter estimation processes by means of a teacher-student approach. In this framework, an expert agent, referred to as the teacher, modifies the data used by a learning agent, known as the student, to improve its estimation process. The objective of the teacher is to alter the data such that the student's estimation error is minimized, subject to a fixed intervention budget. Compared to existing formulations of correctional learning, our novel optimal transport approach provides several benefits. It allows for the estimation of more complex characteristics as well as the consideration of multiple intervention policies for the teacher. We evaluate our approach on two theoretical examples, and on a human-robot interaction application in which the teacher's role is to improve the robots performance in an inverse reinforcement learning setting.
|
|
11:20-11:40, Paper FrA22.5 | Add to My Program |
High-Dimensional Optimal Density Control with Wasserstein Metric Matching (I) |
|
Ma, Shaojun | Goldman Sachs |
Hou, Mengxue | University of Notre Dame |
Ye, Xiaojing | Georgia State University |
Zhou, Hao-Min | Georgia Inst. of Tech |
Keywords: Optimal control, Computational methods, Neural networks
Abstract: We present a novel computational framework for density control in high-dimensional state spaces. The considered dynamical system consists of a large number of indistinguishable agents whose behaviors can be collectively modeled as a time-evolving probability distribution. The goal is to steer the agents without collision from an initial distribution to reach (or approximate) a given target distribution within a fixed time horizon at minimum cost. To tackle this problem, we propose to model the drift as a nonlinear reduced-order model, such as a deep network, and enforce the matching to the target distribution at terminal time either strictly or approximately using the Wasserstein metric. The resulting saddle-point problem can be solved by an effective numerical algorithm that leverages the excellent representation power of deep networks and fast automatic differentiation for this challenging high-dimensional control problem. A variety of numerical experiments were conducted to demonstrate the performance of our method.
|
|
11:40-12:00, Paper FrA22.6 | Add to My Program |
Optimal Mass Transport Over the Euler Equation (I) |
|
Yan, Charlie | University of California Santa Cruz |
Nodozi, Iman | University of California, Santa Cruz |
Halder, Abhishek | Iowa State University |
Keywords: Stochastic optimal control, Stochastic systems, Uncertain systems
Abstract: We consider the finite horizon optimal steering of the joint state probability distribution subject to the angular velocity dynamics governed by the Euler equation. The problem and its solution amounts to controlling the spin of a rigid body via feedback, and is of practical importance, for example, in angular stabilization of a spacecraft with stochastic initial and terminal states. We clarify how this problem is an instance of the optimal mass transport (OMT) problem with bilinear prior drift. We deduce both static and dynamic versions of the Eulerian OMT, and provide analytical and numerical results for the synthesis of the optimal controller.
|
|
FrA23 Invited Session, Orchid Junior 4211 |
Add to My Program |
Formal Methods for Time-Critical Decision Making and Control |
|
|
Chair: Lindemann, Lars | University of Southern California |
Co-Chair: Vasile, Cristian Ioan | Lehigh University |
Organizer: Lindemann, Lars | University of Southern California |
Organizer: Vasile, Cristian Ioan | Lehigh University |
Organizer: Belta, Calin | Boston University |
Organizer: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
|
10:00-10:20, Paper FrA23.1 | Add to My Program |
Diagnosis of Time-Sensitive Failures in Timed Discrete-Event Systems with Metric Interval Temporal Logics (I) |
|
Dong, Weijie | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Automata, Fault diagnosis
Abstract: In this paper, we address the problem of failure diagnosis in timed discrete-event systems modeled by timed automata. While existing works on this topic typically focus on failures modeled as particular events, many complex applica- tions, especially time-critical systems, require the ability to identify time-sensitive failures associated with real-time information rather than just the occurrence of events at any time. To address this challenge, we propose the use of metric interval temporal logic (MITL) with continuous semantics on Boolean signals to formally describe time-sensitive failures. We introduce a novel concept called time-sensitive diagnosability (TS-diagnosability) to characterize whether or not any violation of the MITL task (i.e., failure) can be determined within a finite time elapsing. Furthermore, we provide a necessary and sufficient condition for verifying TS-diagnosability. Our results offer a more general framework for failure diagnosis of timed discrete-event systems
|
|
10:20-10:40, Paper FrA23.2 | Add to My Program |
Model-Free Reinforcement Learning for Spatiotemporal Tasks Using Symbolic Automata (I) |
|
Balakrishnan, Anand | University of Southern California |
Jaksic, Stefan | Austrian Institute of Technology |
Aguilar, Edgar | Austrian Institute of Technology |
Nickovic, Dejan | AIT Austrian Institute of Technology GmbH |
Deshmukh, Jyotirmoy | University of Southern California |
Keywords: Learning, Formal Verification/Synthesis, Markov processes
Abstract: Reinforcement learning (RL) is a popular paradigm for synthesizing controllers in environments modeled as Markov Decision Processes (MDPs). The RL formulation assumes that users define local rewards that depend only on the current state (and action), and learning algorithms seek to find control policies that maximize cumulative rewards along system trajectories. An implicit assumption in RL is that policies that maximize cumulative rewards are desirable as they meet the intended control objectives. However, most control objectives are emph{global} properties of system trajectories, and meeting them with local rewards requires tedious, manual and error-prone process of hand-crafting the rewards. We propose a new algorithm for automatically inferring local rewards from high-level task objectives expressed in the form of emph{symbolic automata} (SA); a symbolic automaton is a finite state machine where edges are labeled with symbolic predicates over the MDP states. SA subsume many popular formalisms for expressing task objectives, such as discrete-time versions of Signal Temporal Logic (STL). We assume that a model-free RL setting, i.e., we assume emph{no prior knowledge} of the system dynamics. We give theoretical results that establish that an optimal policy learned using our shaped rewards also maximizes the probability of satisfying the given SA-based control objective. Our approach is empirically compared with other RL methods that use temporal logic and automata-based control objectives, and evaluate them based on training convergence time and the probability of satisfying finite horizon control objectives.
|
|
10:40-11:00, Paper FrA23.3 | Add to My Program |
Robustness Measures and Monitors for Time Window Temporal Logic (I) |
|
Ahmad, Ahmad | Boston University |
Vasile, Cristian Ioan | Lehigh University |
Tron, Roberto | Boston University |
Belta, Calin | Boston University |
Keywords: Formal Verification/Synthesis
Abstract: Temporal logics (TLs) have been widely used to formalize interpretable tasks for cyber-physical systems. Time Window Temporal Logic (TWTL) has been recently proposed as a specification language for dynamical systems. In particular, it can easily express robotic tasks, and it allows for efficient, automata-based verification and synthesis of control policies for such systems. In this paper, we define two quantitative semantics for this logic, and two corresponding monitoring algorithms, which allow for real-time quantification of satisfaction of formulas by trajectories of discrete-time systems. We demonstrate the new semantics and their runtime monitors on numerical examples.
|
|
11:00-11:20, Paper FrA23.4 | Add to My Program |
Efficient Control Synthesis under Asynchronous Temporal Robustness Constraints (I) |
|
Yu, Xinyi | University of Southern California |
Yin, Xiang | Shanghai Jiao Tong University |
Lindemann, Lars | University of Southern California |
Keywords: Formal Verification/Synthesis, Discrete event systems
Abstract: In time-critical systems, such as air traffic control systems, it is crucial to design control policies that are robust to timing uncertainty. Recently, the notion of asynchronous temporal robustness (ATR) was proposed to capture the robustness of a system trajectory against individual time shifts in its sub-trajectories. In a multi-robot system, this may correspond to individual robots being delayed or early. Control synthesis under ATR constraints is challenging and has not yet been addressed. In this paper, we propose a efficient control synthesis method under ATR constraints which are defined with respect to simple safety or complex signal temporal logic specifications. Given an ATR bound, we compute a sequence of control inputs so that the specification is satisfied by the system as long as each sub-trajectory is shifted not more than the ATR bound. We avoid combinatorially exploring all shifted sub-trajectories by first identifying redundancy between them. We capture this insight by the notion of instant-shift pair sets, and then propose an optimization program that enforces the specification only over the instant-shift pair sets. We show soundness and completeness of our method and analyze its computational complexity. Finally, we present various illustrative case studies.
|
|
11:20-11:40, Paper FrA23.5 | Add to My Program |
Signal Temporal Logic Meets Convex-Concave Programming: A Structure-Exploiting SQP Algorithm for STL Specifications (I) |
|
Takayama, Yoshinari | CentraleSupelec |
Hashimoto, Kazumune | Osaka University |
Ohtsuka, Toshiyuki | Kyoto Univ |
Keywords: Formal Verification/Synthesis, Optimization algorithms, Optimal control
Abstract: This study considers the control problem with signal temporal logic (STL) specifications. Prior works have adopted smoothing techniques to address this problem within a feasible time frame and solve the problem by applying sequential quadratic programming (SQP) methods naively. However, one of the drawbacks of this approach is that solutions can easily become trapped in local minima that do not satisfy the specification. In this study, we propose a new optimization method, termed CCP-based SQP, based on the convex-concave procedure (CCP). Our framework includes a new robustness decomposition method that decomposes the robustness function into a set of constraints, resulting in a form of difference of convex (DC) program that can be solved efficiently. We solve this DC program sequentially as a quadratic program by only approximating the disjunctive parts of the specifications. Our experimental results demonstrate that our method has a superior performance compared to the state-of-the-art SQP methods in terms of both robustness and computational time.
|
|
11:40-12:00, Paper FrA23.6 | Add to My Program |
Energy-Constrained Active Exploration under Incremental-Resolution Symbolic Perception (I) |
|
Kamale, Disha | Lehigh University |
Haesaert, Sofie | Eindhoven University of Technology |
Vasile, Cristian Ioan | Lehigh University |
Keywords: Formal Verification/Synthesis, Autonomous robots
Abstract: We consider the problem of autonomous exploration in search of targets while respecting a fixed energy budget. The robot is equipped with an incremental-resolution symbolic perception module wherein the perception of targets in the environment improves as the robot's distance from targets decreases. We assume no prior information about the number of targets, their locations, and possible distribution within the environment. This work proposes a novel decision-making framework for the resulting constrained sequential decision-making problem by first converting it into a reward maximization problem on a product graph computed offline. It is then solved online as a Mixed-Integer Linear Program (MILP) where the knowledge about the environment is updated at each step, combining automata-based and MILP-based techniques. We demonstrate the efficacy of our approach with the help of a case study and present empirical evaluation in terms of expected regret. Furthermore, the runtime performance shows that online planning can be efficiently performed for moderately-sized grid environments.
|
|
FrA24 Invited Session, Orchid Main 4201AB |
Add to My Program |
Event-Triggered and Self-Triggered Control III |
|
|
Chair: Diagne, Mamadou | University of California San Diego |
Co-Chair: Yang, Feisheng | Northwestern Polytechnical University |
Organizer: Heemels, W.P.M.H. | Eindhoven University of Technology |
Organizer: Hirche, Sandra | Technische Universität München |
Organizer: Nowzari, Cameron | George Mason University |
|
10:00-10:20, Paper FrA24.1 | Add to My Program |
Data-Driven Self-Triggered Control for Linear Networked Control Systems (I) |
|
Xin, Wang | Beijing Institute of Technology |
Li, Yifei | Beijing Institute of Technology |
Sun, Jian | Beijing Institute of Technology |
Wang, Gang | Beijing Institute of Technology |
Chen, Jie | Beijing Institute of Technology |
Dou, Lihua | Beijing Institute of Technology |
Keywords: Networked control systems, Data driven control, Sampled-data control
Abstract: This paper considers data-driven control of unknown linear discrete-time systems under a self-triggered transmission scheme. While self-triggered control has received much attention in the literature, its design and implementation typically require explicit model knowledge. Due to the difficulties in obtaining accurate models and the abundance of data in applications, this paper proposes a novel data-driven self-triggered control scheme for unknown systems. To this end, we begin by presenting a model-based self-triggered scheme (STS) in form of quadratic matrix inequalities, on the basis of an equivalent switched system representation. Combining the model-based triggering law and a data-based system representation, a data-driven STS is developed leveraging pre-collected input-state data for predicting the next transmission instant while ensuring system stability. A data-based method for co-designing the controller gain and the triggering matrix is then provided. Finally, a numerical simulation showcases the efficacy of STS in reducing transmissions as well as practicality of the proposed co-design methods.
|
|
10:20-10:40, Paper FrA24.2 | Add to My Program |
On the Digital Event-Based Implementation of a Glucose Regulator Via Subcutaneous Insulin Infusion (I) |
|
Di Ferdinando, Mario | University of L'Aquila |
Di Gennaro, Stefano | University of L'Aquila |
Borri, Alessandro | CNR-IASI |
Pola, Giordano | University of L'Aquila |
Pepe, Pierdomenico | University of L' Aquila |
Keywords: Biomedical, Sampled-data control, Nonlinear systems
Abstract: In this paper, an event-based quantized sampled-data control strategy is proposed for the plasma glucose regulation problem in Type 2 diabetic patients. In particular, the proposed event-triggered digital glucose regulator is designed by exploiting a nonlinear time-delay model of the glucose-insulin regulatory system which takes into account the subcutaneous infusion of insulin. It is proved that the provided quantized sampled-data glucose controller, updated via a proposed event-based mechanism, guarantees the semi-global practical stability property of the related closed--loop tracking error system, with arbitrarily small steady--state tracking error. The stabilization in the sample--and--hold sense theory is used as a tool to prove the results. An approximation scheme based on first--order splines is used in order to cope with the problem of the possible non--availability in the buffer of the value of the system variables at some past times which are needed for the implementation of the proposed digital controller. The possible non-uniform quantization of the input/output channels as well as the case of time--varying sampling periods are included in the theory here developed. The validation of the proposed glucose control strategy is carried out via simulations.
|
|
10:40-11:00, Paper FrA24.3 | Add to My Program |
Event-Triggered Control for Hamiltonian-Based Flexible-Joints Robots |
|
Zhang, Qi | Qufu Normal University |
Sun, Weiwei | Qufu Normal University |
Ding, Lusong | Qufu Normal University Institute of Automation |
Li, Yongshu | Qufu Normal University,School of Engineering |
Keywords: Robotics, Nonlinear systems, Network analysis and control
Abstract: This paper focuses on the event-triggered tracking control problem for Hamiltonian-based flexible-joints robots. Based on Hamiltonian theory and the delay system approach, an event-triggered modular control strategy is proposed to guarantee that the link and motor generalized position can track the target signal asymptotically while reducing the waste of transmission resources. The modular controller designed in this paper takes fully into account the structural characteristics of the flexible-joints robots. Moreover, the difficulties of tracking control and event-triggered control caused by the high coupling of Hamiltonian systems are overcome in the design process. This extends the existing theoretical results for flexible-joints robots and effectively improves energy efficiency. The validity of the proposed strategy is verified by a simulation example of a flexible joint robot.
|
|
11:00-11:20, Paper FrA24.4 | Add to My Program |
Self-Triggered Boundary Control of a Class of Reaction-Diffusion PDEs (I) |
|
Rathnayake, Bhathiya | Student (University of California San Diego) |
Diagne, Mamadou | University of California San Diego |
Keywords: Backstepping, Hybrid systems, Stability of linear systems
Abstract: This paper provides a novel self-triggered boundary control (STBC) strategy for a class of reaction-diffusion PDEs with Robin actuation using infinite-dimensional backstepping boundary control. Our goal is to offer a solution for the continuous monitoring of triggering functions in conventional event-triggered control. We propose a method for converting a certain class of continuous-time dynamic event-triggers that require continuous monitoring to self-triggers that proactively compute the time of the next event at the current event time using the knowledge of the available system states and dynamics. We achieve this by designing a positively and uniformly lower-bounded function which, when evaluated at the current event time, outputs the waiting time until the next event. The control input is updated only at events indicated by the self-trigger and is applied in a zero-order hold fashion between two events. We establish the closed-loop system well-posedness under the proposed STBC approach. Furthermore, we prove that the global L^2-exponential convergence to zero under continuous-time event-triggered boundary control (CETBC) is preserved under the proposed STBC approach. We provide a simulation result that validates the theoretical claims.
|
|
11:20-11:40, Paper FrA24.5 | Add to My Program |
Predefined-Time Distributed Optimal Consensus for Euler–Lagrangian Systems Based on Dynamic Event-Triggered Mechanism (I) |
|
Yang, Feisheng | Northwestern Polytechnical University |
Liu, Jiaming | Northwestern Polytechnical University |
Ma, Qian | Nanjing University of Science and Technology |
Keywords: Distributed control, Optimal control, Cooperative control
Abstract: It is a challenging problem to achieve fast distributed optimal consensus for Euler–Lagrangian (EL) systems meanwhile economizing communication resources. To solve the problem, a novel predefined-time distributed optimal consensus strategy for EL systems is proposed by applying time-base generator (TBG) and dynamic event-triggered mechanism, which can reach the optimal consensus in a completely distributed manner. It is proven that the algorithm can converge in predefined time by Lyapunov energy function and Zeno behavior is avoided. A dynamic event-triggered method is designed which event-triggered thresholds are replaced by dynamic variables. The numerical simulation is given to show the effectiveness and superiority of the algorithm.
|
|
11:40-12:00, Paper FrA24.6 | Add to My Program |
Event-Triggered Stabilization of Parabolic PDEs by Switching (I) |
|
Kang, Wen | Beijing Institute of Technology |
Fridman, Emilia | Tel-Aviv Univ |
Zhang, Jing | Beijing Institute of Technology |
Liu, Chuan-Xin | University of Science and Technology Beijing |
Keywords: Distributed parameter systems, Stability of nonlinear systems, Lyapunov methods
Abstract: Although switching-based stabilization of 1D parabolic systems was investigated by employing one actuator moving in spatial domain in our recent paper [18], this method increases the system cost since actuator and sensor switching happens at fixed time regardless of whether the switching is necessary or not. To further reduce operating and production cost, in the present paper, switching-based dynamic event-triggered control law is studied to stabilize the parabolic PDE systems via output-dependent switching law. Constructive exponential stability conditions are established by using Lyapunov method. A numerical example shows the effectiveness of the proposed methods.
|
|
FrA25 Invited Session, Lotus Junior 4DE |
Add to My Program |
Navigating Complexity: New Approaches for Discrete Event Systems |
|
|
Chair: Giua, Alessandro | University of Cagliari |
Co-Chair: Seatzu, Carla | Univ. of Cagliari |
Organizer: Giua, Alessandro | University of Cagliari |
Organizer: Seatzu, Carla | Univ. of Cagliari |
|
10:00-10:20, Paper FrA25.1 | Add to My Program |
Transformational Supervisor Localization |
|
Thuijsman, Sander | Eindhoven University of Technology |
Cai, Kai | Osaka Metropolitan University |
Reniers, Michel | Eindhoven University of Technology |
Keywords: Supervisory control, Discrete event systems, Automata
Abstract: Supervisor localization can be applied to distribute a monolithic supervisor into local supervisors. Performing supervisor localization can be computationally costly. In this work, we consider systems that evolve over time. We study how to reuse the results from a previous supervisor localization, to more efficiently compute local supervisors when the system is adapted. We call this approach transformational supervisor localization, and present algorithms for the procedure. The efficiency of the procedure is experimentally evaluated.
|
|
10:20-10:40, Paper FrA25.2 | Add to My Program |
Opacity from Observers with a Bounded Memory |
|
Wintenberg, Andrew | The University of Michigan, Ann Arbor |
Lafortune, Stephane | Univ. of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Discrete event systems
Abstract: Opacity is an information-flow property capturing privacy from observers that are aware of a system’s dynamics. The potential for an observer with perfect recall to reason about long histories of the system poses a challenge for opacity verification. In this paper, we address this challenge by proposing a new notion of opacity over automata, called bounded memory opacity, with respect to an observer with a bounded memory. We show that verifying this weaker notion of opacity has reduced computational complexity compared to general opacity (co-NP vs. PSPACE). Furthermore, we present a corresponding verification algorithm using an encoding to the Boolean satisfiability problem (SAT). We demonstrate this approach on randomly generated automata as well as a web server load-hiding example.
|
|
10:40-11:00, Paper FrA25.3 | Add to My Program |
Decentralized State Estimation Via Breadth-First Search through Partially Ordered Observation Sequences (I) |
|
Sun, Dajiang | Xidian University |
Hadjicostis, Christoforos N. | University of Cyprus |
Li, Zhiwu | Xidian University |
Keywords: Discrete event systems, Automata, Estimation
Abstract: We investigate the state estimation problem under a decentralized observation architecture. More specifically, we consider a discrete event system, modeled by a nondeterministic finite automaton, whose behavior is partially observed and recorded at a set of observation sites with distinct capabilities. When prompted, these observation sites send their sequences of observations to a coordinator that fuses and analyzes this information to estimate the specific system states of interest (current- and initial-states). The notion of S-builder is introduced to systematically infer possible (totally ordered) sequences of observations and an algorithm is proposed for constructing a synchronizer in a breadth-first search manner to efficiently perform current-state estimation. With slight extensions, the synchronizer construction algorithm can be also applied towards initial-state estimation.
|
|
11:00-11:20, Paper FrA25.4 | Add to My Program |
Data-Informativity for Data-Driven Supervisory Control of Discrete-Event Systems (I) |
|
Ohtsuka, Tomofumi | Kyoto University |
Cai, Kai | Osaka Metropolitan University |
Kashima, Kenji | Kyoto University |
Keywords: Discrete event systems, Supervisory control, Data driven control
Abstract: In this paper we develop a data-driven approach for supervisory control of discrete-event systems (DES). We consider a setup in which models of DES to be controlled are unknown, but a set of data concerning the behaviors of DES is available. We propose a new concept of data-informativity, which captures the notion that the available data set contains sufficient information such that a valid supervisor may be constructed for a family of DES models that all can generate the data set. We then characterize data-informativity with a necessary and sufficient condition, based on which we design an algorithm for its verification. Moreover, if the data set fails to be informative, we propose two related new concepts of limited data-informativity and informatizability. Their characterization conditions and verification algorithms are also presented.
|
|
11:20-11:40, Paper FrA25.5 | Add to My Program |
Logical and Probabilistic Aspects of State Estimation for Markovian Systems (I) |
|
Lefebvre, Dimitri | University Le Havre |
Seatzu, Carla | Univ. of Cagliari |
Hadjicostis, Christoforos N. | University of Cyprus |
Giua, Alessandro | University of Cagliari |
Keywords: Discrete event systems, Automata, Markov processes
Abstract: This paper is about state estimation in a class of labeled timed probabilistic automata. In detail, we consider continuous time Markov processes where the occurrence of some transitions produces observable events. Such observations can be used to update and refine the state estimation. In this setting, we discuss how a logical state estimation approach can be used to characterize the probabilistic state estimation whenever a new event is observed or when the system evolves without producing new observations (silent closure). The main results of the paper show that the final behaviour, as the silent closure goes to infinity, cannot be characterized only in terms of the graphical structure of the underlying automaton but also depends on the values of the firing rates.
|
|
11:40-12:00, Paper FrA25.6 | Add to My Program |
On the Reachability Space and Deadlock-Freeness in Flexible Nets (I) |
|
Julvez, Jorge | University of Zaragoza |
Keywords: Petri nets, Discrete event systems, Modeling
Abstract: Deadlock-freeness is a basic property of dynamical systems that ensures that at least one process of the system can operate indefinitely. Given that a system is deadlock-free if at any reachable state there is at least one process that can operate, the reachability space, i.e. the set of states that can be reached, and deadlock-freeness are closely related. This paper focuses on some fundamental properties of the reachability space of Flexible Nets, a modeling formalism that can easily account for uncertain parameters. After showing that the reachability space is convex, a sufficient condition for deadlock-freeness is derived.
|
|
FrB01 Invited Session, Melati Junior 4010A-4111 |
Add to My Program |
Learning, Optimization, and Game Theory IV |
|
|
Chair: Zhang, Kaiqing | University of Maryland |
Co-Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Doan, Thinh T. | Virginia Tech |
Organizer: Sayin, Muhammed Omer | Bilkent University |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Zhang, Kaiqing | University of Maryland |
|
13:30-13:50, Paper FrB01.1 | Add to My Program |
Lyapunov-Based Long Short-Term Memory (Lb-LSTM) Neural Network-Based Control |
|
Griffis, Emily | University of Florida |
Patil, Omkar Sudhir | University of Florida |
Bell, Zachary I. | Air Force |
Dixon, Warren E. | University of Florida |
Keywords: Adaptive control, Neural networks, Stability of nonlinear systems
Abstract: Recurrent neural networks (RNNs) are a dynamic mapping that can capture time-varying, accumulative effects in a sequence that static, feedforward neural networks (NNs) cannot. Long short-term memory (LSTM) NNs are a type of RNNs that have gained recent popularity because the cell structure allows them to retain long-term information more efficiently than traditional RNNs. Existing results develop LSTM-based controllers to compensate for uncertainties in nonlinear systems. However, these results use discrete-time LSTMs with offline-trained weights. In this paper, a Lyapunov-based LSTM controller is developed for general Euler-Lagrange systems. Specifically, an Lb-LSTM is implemented in the control design to adaptively estimate uncertain model dynamics, where the weight estimates of the LSTM cell are updated using Lyapunov-based adaptation laws. This allows the LSTM cell to adapt to system uncertainties without requiring offline training. A Lyapunov-based stability analysis yields uniform ultimate boundedness (UUB) of the tracking errors and LSTM state and weight estimation errors. Simulations indicate the developed Lb-LSTM-based controller yielded significant improvement in tracking and function approximation performance when compared to several DNN examples.
|
|
13:50-14:10, Paper FrB01.2 | Add to My Program |
High-Confidence Barrier-Certified Control Design Using Goal-Oriented Scenario Optimization and Experience Replay Model Learning (I) |
|
Marvi, Zahra | University of Minnesota |
Kiumarsi, Bahare | Michigan State University |
Modares, Hamidreza | Michigan State University |
Keywords: Learning, Optimization, Uncertain systems
Abstract: This paper presents a probabilistic framework for the safe control of linear systems under parametric uncertainties. To this end, a two-layer learning-enabled controller is presented that unifies the experience-replay model learning and the scenario-based optimization. The inner loop leverages the scenario optimization to impose probabilistic stability and safety specifications through sampling from a control Lyapunov function and a control barrier function, respectively. Each sample represents a plausible system dynamics realization within the range of the uncertainties. To quantify the model uncertainty and, thus, to facilitate a proactive and goal-oriented sampling of safety and stability constraints, an experience replay-based model learning is presented in the outer loop. The exponentially fast convergent guarantees of the presented approach and the quantification of the exponential rate using the collected data allow us to quantify the ambiguity set for the system parameters based on the data informativeness. The quantified modeling error acts as a vanishing perturbation to the true dynamics, from which samples can be taken at a specific frequency to solve an optimization problem in the inner loop. The presented approach provides safety and stability guarantees with high probability, even during learning. Simulation is used to depict the efficacy of the proposed approach.
|
|
14:10-14:30, Paper FrB01.3 | Add to My Program |
Quality of Non-Convergent Best Response Processes in Multi-Agent Systems through Sink Equilibria (I) |
|
Konda, Rohit | UC Santa Barbara |
Chandan, Rahul | Amazon.com |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Optimization, Autonomous systems
Abstract: Examining the behavior of multi-agent systems is vitally important to many emerging distributed applications - game theory has emerged as a powerful tool set in which to do so. The main approach of game-theoretic techniques is to model agents as players in a game, and predict the emergent behavior through the relevant Nash equilibria. The virtue from this viewpoint is that by assuming that self-interested decision-making processes lead to Nash equilibrium, system behavior can then be captured by Nash equilibrium without studying the decision-making processes explicitly. This approach has seen success in a wide variety of domains, such as sensor coverage, traffic networks, auctions, and network coordination. However, in many other problem settings, Nash equilibria are not necessarily guaranteed to exist or emerge from self-interested processes. Thus the main focus of the paper is on the study of sink equilibria, which are defined as the attractors of these decision-making processes. By classifying system outcomes through a global objective function, we can analyze the resulting approximation guarantees that sink equilibria have for a given game. Our main result is an approximation guarantee on the sink equilibria through defining an introduced metric of misalignment, which captures how uniform agents are in their self-interested decision making. Overall, sink equilibria are naturally occurring in many multi-agent contexts, and we display our results on their quality with respect to two practical problem settings.
|
|
14:30-14:50, Paper FrB01.4 | Add to My Program |
Asynchronous Distributed Optimization Via ADMM with Efficient Communication (I) |
|
Rikos, Apostolos I. | KTH Royal Institute of Technology |
Jiang, Wei | Aalto University, Finland |
Charalambous, Themistoklis | University of Cyprus |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Optimization, Optimization algorithms, Agents-based systems
Abstract: In this paper, we focus on an asynchronous distributed optimization problem. In our problem, each node is endowed with a convex local cost function, and is able to communicate with its neighbors over a directed communication network. Furthermore, we assume that the communication channels between nodes have limited bandwidth, and each node suffers from processing delays. We present a distributed algorithm which combines the Alternating Direction Method of Multipliers (ADMM) strategy with a finite time quantized averaging algorithm. In our proposed algorithm, nodes exchange quantized valued messages and operate in an asynchronous fashion. More specifically, during every iteration of our algorithm each node (i) solves a local convex optimization problem (for the one of its primal variables), and (ii) utilizes a finite-time quantized averaging algorithm to obtain the value of the second primal variable (since the cost function for the second primal variable is not decomposable). We show that our algorithm converges to the optimal solution at a rate of O(1/k) (where k is the number of time steps) for the case where the local cost function of every node is convex and not-necessarily differentiable. Finally, we demonstrate the operational advantages of our algorithm against other algorithms from the literature.
|
|
14:50-15:10, Paper FrB01.5 | Add to My Program |
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States (I) |
|
Banerjee, Chayan | Queensland University of Technology |
Chen, Zhiyong | The University of Newcastle |
Noman, Nasimul | The University of Newcastle |
Keywords: Learning, Stochastic optimal control, Neural networks
Abstract: Improvement of exploration and exploitation using more efficient samples is a critical issue in reinforcement learning algorithms. A basic strategy of a learning algorithm is to facilitate indiscriminate exploration of the entire environment state space, as well as to encourage exploration of rarely visited states rather than frequently visited ones. Under this strategy, we propose a new method to boost exploration through an intrinsic reward, based on the measurement of a state's novelty and the associated benefit of exploring the state, collectively called plausible novelty. By incentivizing exploration of plausible novel states, an actor-critic (AC) algorithm can improve its sample efficiency and, consequently, its training performance. The new method is verified through extensive simulations of continuous control tasks in MuJoCo environments, using a variety of prominent off-policy AC algorithms.
|
|
15:10-15:30, Paper FrB01.6 | Add to My Program |
Risk-Sensitive RL Using Sampling-Based Expectation-Maximization (I) |
|
Noorani, Erfaun | University of Maryland College Park |
Baras, John S. | University of Maryland |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Machine learning, Learning
Abstract: There is a need for robust Reinforcement Learning (RL) algorithms that can cope with model misspecification, parameter uncertainty, disturbances, etc. Risk-sensitive methods offer an approach to developing robust RL algorithms by hedging against undesirable outcomes in a probabilistic manner. The Probabilistic Graphical Model (PGM) framework offers systematic exploration for risk-sensitive RL. In this paper, we bridge the Markov Decision Process (MDP) and the PGM frameworks. We exploit the equivalence of optimizing a certain risk-sensitive criterion in the MDP formalism with optimizing a log-likelihood objective in the PGM formalism. By utilizing this equivalence, we offer an approach for developing risk-sensitive algorithms by leveraging the PGM framework. We explore the Expectation-Maximization (EM) algorithm under the PGM formalism. We show that risk-sensitive policy gradient methods can be obtained by applying sampling-based approaches to the EM algorithm, e.g., Monte-Carlo EM, with the log-likelihood. We show that Monte-Carlo EM leads to a risk-sensitive Monte-Carlo policy gradient algorithm. Our simulations illustrate the risk-sensitive nature of the resulting algorithm.
|
|
FrB02 Invited Session, Orchid Main 4202-4303 |
Add to My Program |
Data-Driven Verification and Control of Cyber-Physical Systems II |
|
|
Chair: Jungers, Raphaël M. | University of Louvain |
Co-Chair: Lavaei, Abolfazl | Newcastle University |
Organizer: Lavaei, Abolfazl | Newcastle University |
Organizer: Jungers, Raphaël M. | University of Louvain |
Organizer: Abate, Alessandro | University of Oxford |
|
13:30-13:50, Paper FrB02.1 | Add to My Program |
Data-Driven Synthesis of Safety Controllers Via Multiple Control Barrier Certificates |
|
Nejati, Ameneh | Technical University of Munich (TUM) |
Zamani, Majid | University of Colorado Boulder |
Keywords: Discrete event systems
Abstract: This work proposes a data-driven framework to synthesize safety controllers for nonlinear systems with finite input sets and unknown mathematical models. The proposed scheme leverages new notions of multiple control barrier certificates (M-CBC) and provides controllers ensuring the safety of systems with confidence 1. While there may not exist a common control barrier certificate with a fixed template, our proposed technique adaptively partitions the state set to potentially find M-CBC of the same template for different regions. In the proposed data-driven framework, we first cast our proposed conditions of M-CBC as a robust optimization program (ROP). Given that the unknown model appears in some of the constraints of the ROP, we propose a sampling approach for collecting data and provide a scenario optimization program (SOP) associated with the proposed ROP. We solve the resulted SOP and construct M-CBC together with safety controllers for the unknown system with 100% correctness guarantee. We apply our results to a nonlinear jet engine compressor with unknown dynamics to illustrate the efficacy of our data-driven approach. In the case study, we show that while there exists no common polynomial-type control barrier certificate of a given degree, there exist polynomial-type M-CBC of the same degree by partitioning the state set to different regions.
|
|
13:50-14:10, Paper FrB02.2 | Add to My Program |
Reachability Analysis of ARMAX Models (I) |
|
Lützow, Laura | Technical University Munich |
Althoff, Matthias | Technische Universität München |
Keywords: Formal Verification/Synthesis, Identification, Uncertain systems
Abstract: Reachability analysis is a powerful tool for computing the set of states or outputs reachable for a system. While previous work has focused on systems described by state-space models, we present the first methods to compute reachable sets of ARMAX models - one of the most common input-output models originating from data-driven system identification. The first approach we propose can only be used with dependency-preserving set representations such as symbolic zonotopes, while the second one is valid for arbitrary set representations but relies on a reformulation of the ARMAX model. By analyzing the computational complexities, we show that both approaches scale quadratically with respect to the time horizon of the reachability problem when using symbolic zonotopes. To reduce the computational complexity, we propose a third approach that scales linearly with respect to the time horizon when using set representations that are closed under Minkowski addition and linear transformation and that satisfy that the computational complexity of the Minkowski sum is independent of the representation size of the operands. Our numerical experiments demonstrate that the reachable sets of ARMAX models are tighter than the reachable sets of equivalent state space models in case of unknown initial states. Therefore, this methodology has the potential to significantly reduce the conservatism of various verification techniques.
|
|
14:10-14:30, Paper FrB02.3 | Add to My Program |
Stranding Risk for Underactuated Vessels in Complex Ocean Currents: Analysis and Controllers (I) |
|
Doering, Andreas | Technical University of Munich, University of California at Berk |
Wiggert, Marius | UC Berkeley |
Krasowski, Hanna | Technical University of Munich |
Doshi, Manan | Massachusetts Institute of Technology |
Lermusiaux, Pierre F. J. | Massachusetts Institute of Technology |
Tomlin, Claire J. | UC Berkeley |
Keywords: Maritime control, Autonomous robots, Predictive control for nonlinear systems
Abstract: Low-propulsion vessels can take advantage of powerful ocean currents to navigate towards a destination. Recent results demonstrated that vessels can reach their destination with high probability despite forecast errors. However, these results do not consider the critical aspect of safety of such vessels: because their propulsion is much smaller than the magnitude of surrounding currents, they might end up in currents that inevitably push them into unsafe areas such as shallow waters, garbage patches, and shipping lanes. In this work, we first investigate the risk of stranding for passively floating vessels in the Northeast Pacific. We find that at least 5.04% would strand within 90 days. Next, we encode the unsafe sets as hard constraints into Hamilton-Jacobi Multi-Time Reachability to synthesize a feedback policy that is equivalent to re-planning at each time step at low computational cost. While applying this policy guarantees safe operation when the currents are known, in realistic situations only imperfect forecasts are available. Hence, we demonstrate the safety of our approach empirically with large-scale realistic simulations of a vessel navigating in high-risk regions in the Northeast Pacific. We find that applying our policy closed-loop with daily re-planning as new forecasts become available reduces stranding below 1% despite forecast errors often exceeding the maximal propulsion. Our method significantly improves safety over the baselines and still achieves a timely arrival of the vessel at the destination.
|
|
14:30-14:50, Paper FrB02.4 | Add to My Program |
Data-Driven Reachability Analysis of Lipschitz Nonlinear Systems Via Support Vector Data Description (I) |
|
Wang, Zheming | Zhejiang University of Technology |
Chen, Bo | Zhejiang University of Technology |
Jungers, Raphaël M. | University of Louvain |
Yu, Li | Zhejiang University of Technology |
Keywords: Formal Verification/Synthesis, Learning, Randomized algorithms
Abstract: This paper is concerned with data-driven reachability analysis of discrete-time nonlinear systems without any dynamical model. We use only a number of observations of trajectories of the system to estimate the actual reachable set. With the data set, using the Support Vector Data Description (SVDD) technique, we propose a sample-based approximation method to solve the reachability analysis problem, which can be considered as a one-class classification problem. Under the framework of scenario optimization, we then derive over-approximations of the reachable set in a probabilistic sense with Lipschitz continuity and other regularity conditions. Finally, we demonstrate the proposed method on a numerical example.
|
|
14:50-15:10, Paper FrB02.5 | Add to My Program |
Learning Robust and Correct Controllers from Signal Temporal Logic Specifications Using BarrierNet (I) |
|
Liu, Wenliang | Boston University |
Xiao, Wei | Massachusetts Institute of Technology |
Belta, Calin | Boston University |
Keywords: Formal Verification/Synthesis, Neural networks, Optimal control
Abstract: We consider the problem of learning a neural network controller for a system required to satisfy a Signal Temporal Logic (STL) specification. We exploit STL quantitative semantics to define a notion of robust satisfaction. Guaranteeing the correctness of a neural network controller is a difficult problem that received a lot of attention recently. We provide a general procedure to construct a set of trainable High Order Control Barrier Functions (HOCBFs) enforcing the satisfaction of formulas in a fragment of STL. We use the BarrierNet, implemented by a differentiable Quadratic Program (dQP) with HOCBF constraints, as the last layer of the neural network controller, to guarantee the satisfaction of the STL formulas. We train the HOCBFs together with other neural network parameters to further improve the robustness of the controller. Simulation results demonstrate that our approach ensures satisfaction and outperforms existing algorithms.
|
|
15:10-15:30, Paper FrB02.6 | Add to My Program |
Verifying the Unknown: Correct-By-Design Control Synthesis for Networks of Stochastic Uncertain Systems (I) |
|
Schön, Oliver | Newcastle University |
van Huijgevoort, Birgit | Eindhoven University of Technology |
Haesaert, Sofie | Eindhoven University of Technology |
Soudjani, Sadegh | Newcastle University |
Keywords: Formal Verification/Synthesis, Uncertain systems, Large-scale systems
Abstract: In this paper, we present an approach for designing correct-by-design controllers for cyber-physical systems composed of multiple dynamically interconnected uncertain systems. We consider networked discrete-time uncertain nonlinear systems with additive stochastic noise and model parametric uncertainty. Such settings arise when multiple systems interact in an uncertain environment and only observational data is available. We address two limitations of existing approaches for formal synthesis of controllers for networks of uncertain systems satisfying complex temporal specifications. Firstly, whilst existing approaches rely on the stochasticity to be Gaussian, the heterogeneous nature of composed systems typically yields a more complex stochastic behavior. Secondly, exact models of the systems involved are generally not available or difficult to acquire. To address these challenges, we show how abstraction-based control synthesis for uncertain systems based on sub-probability couplings can be extended to networked systems. We design controllers based on parameter uncertainty sets identified from observational data and approximate possibly arbitrary noise distributions using Gaussian mixture models whilst quantifying the incurred stochastic coupling. Finally, we demonstrate the effectiveness of our approach on a nonlinear package delivery case study with a complex specification, and a platoon of cars.
|
|
FrB03 Invited Session, Orchid Main 4204-4305 |
Add to My Program |
Cyber-Physical Systems: Safety, Security, and Reliability |
|
|
Chair: Sadabadi, Mahdieh S. | Queen Mary University of London |
Co-Chair: Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Organizer: Sadabadi, Mahdieh S. | University of Manchester |
Organizer: Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Organizer: Selvi, Daniela | Università Di Pisa |
Organizer: Soudjani, Sadegh | Newcastle University |
Organizer: Murguia, Carlos | Eindhoven University of Technology |
Organizer: Chong, Michelle | Eindhoven University of Technology |
Organizer: Ferrari, Riccardo M.G. | Delft University of Technology |
Organizer: Sasahara, Hampei | Tokyo Institute of Technology |
Organizer: Zhu, Quanyan | New York University |
|
13:30-13:50, Paper FrB03.1 | Add to My Program |
Robust and Scalable Game-Theoretic Security Investment Methods for Voltage Stability of Power Systems (I) |
|
An, Lu | North Carolina State University |
Shukla, Pratishtha | Oak Ridge National Lab |
Chakrabortty, Aranya | North Carolina State University |
Duel-Hallen, Alexandra | North Carolina State University |
Keywords: Power systems, Game theory, Robust control
Abstract: We develop investment approaches to secure electric power systems against load attacks where a malicious intruder (the attacker) covertly changes reactive power setpoints of loads to push the grid towards voltage instability while the system operator (the defender) employs reactive power compensation (RPC) to prevent instability. Extending our previously reported Stackelberg game formulation for this problem, we develop a robust-defense sequential algorithm and a novel genetic algoirhtm that provides scalability to large-scale power system models. The proposed methods are validated using IEEE prototype power system models with time-varying load uncertainties, demonstrating that reliable and robust defense is feasible unless the operator's RPC investment resources are severely limited relative to the attacker's resources.
|
|
13:50-14:10, Paper FrB03.2 | Add to My Program |
The Security Requirement to Prevent Zero Dynamics Attacks and Perfectly Undetectable Cyber-Attacks in Cyber-Physical Systems (I) |
|
Taheri, Mahdi | Concordia University |
Khorasani, Khashayar | Concordia University |
Meskin, Nader | Qatar University |
Keywords: Cyber-Physical Security
Abstract: The impacts of zero dynamics attacks and perfectly undetectable cyber-attacks cannot be observed in outputs of cyber-physical systems (CPS). Adversaries are capable of executing these cyber-attacks and leading the CPS to undesirable trajectories while remaining undetected. In this paper, we introduce and formally introduce the notion of security effort (SE) as a novel security metric for CPS that determines the minimum number of actuators and sensors that should be secured and kept attack free in order to prevent adversaries from executing zero dynamics attacks, covert attacks, and controllable attacks. Moreover, since zero dynamics attacks, covert attacks, and controllable attacks belong to weakly unobservable and controllable weakly unobservable subspaces of the CPS, conditions under which these subspaces become zero are obtained and investigated. An illustrative numerical simulation is provided to demonstrate the effectiveness of our proposed security measure.
|
|
14:10-14:30, Paper FrB03.3 | Add to My Program |
Secure State Estimation with Asynchronous Measurements against Malicious Measurement-Data and Time-Stamp Manipulation (I) |
|
Li, Zishuo | Tsinghua University |
Nguyen, Anh Tung | Uppsala University |
Teixeira, André M. H. | Uppsala University |
Mo, Yilin | Tsinghua University |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Sensor fusion, Fault tolerant systems, Cyber-Physical Security
Abstract: This paper proposes a secure state estimation scheme with asynchronous non-periodic measurements for continuous LTI systems under false data attacks on measurement transmission channels. Each sensor transmits the measurement information in a triple comprised of its sensor index, the time-stamp, and the measurement value to the fusion center via unprotected communication channels. A malicious attacker can corrupt a subset of sensors by (i) manipulating the time- stamp and the measurement value; (ii) blocking transmitted measurement triples; or (iii) injecting fake measurement triples. To deal with such attacks, we propose a secure state estimator by designing decentralized local estimators and fusing all the local states by the median operator. The local estimators receive the sampled measurements and update their local state in an asynchronous manner, while the fusion center triggers the fusion and generates a secure estimation in the presence of a local update. We prove that local estimators of benign sensors are unbiased with stable error covariance. Moreover, the fused secure estimation error has bounded expectation and covariance against at most p corrupted sensors as long as the system is 2p-sparse observable. The efficacy of the proposed scheme is demonstrated through a benchmark example of the IEEE 14-bus system.
|
|
14:30-14:50, Paper FrB03.4 | Add to My Program |
Set-Based Anomaly Detector and Stealthy Attack Impact Using Constrained Zonotopes (I) |
|
Wagner, Jonas | University of Texas at Dallas |
Kogel, Tanner | University of Texas at Dallas |
Ruths, Justin | University of Texas at Dallas |
Keywords: Attack Detection, Cyber-Physical Security, Uncertain systems
Abstract: In this paper, a set-based detector is developed and analyzed based on the propagation of nominal (no attacks), actual (possibly with attacks), and perceived (corrupted by attacks) residuals due to bounded system and measurement noise uncertainties. The set of stealthy attacks (attacks that raise no alarms) is characterized and the impact of these stealthy attacks on the system state is quantified. When implemented through the set tools of constrained zonotopes, this approach provides accurate, time-varying attack-reachable sets that can be used to evaluate the safety and performance of systems in adversarial conditions. These tools are demonstrated through two case studies.
|
|
14:50-15:10, Paper FrB03.5 | Add to My Program |
Electrical Fault Localisation Over a Distributed Parameter Transmission Line (I) |
|
Selvaratnam, Daniel | KTH Royal Institute of Technology |
Das, Amritam | Eindhoven University of Technology |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Fault detection, Power systems, Estimation
Abstract: Motivated by the need to localise faults along electrical power lines, this paper adopts a frequency-domain approach to parameter estimation for an infinite-dimensional linear dynamical system with one spatial variable. Since the time of the fault is unknown, and voltages and currents are measured at only one end of the line, distance information must be extracted from the post-fault transients. To properly account for high-frequency transient behaviour, the line dynamics is modelled directly by the Telegrapher’s equation, rather than the more commonly used lumped-parameter approximations. First, the governing equations are non-dimensionalised to avoid ill-conditioning. A closed-form expression for the transfer function is then derived. Finally, nonlinear least-squares optimisation is employed to search for the fault location. Requirements on fault bandwidth, sensor bandwidth and simulation time-step are also presented. The result is a novel end-to-end algorithm for data generation and fault localisation, the effectiveness of which is demonstrated via simulation.
|
|
15:10-15:30, Paper FrB03.6 | Add to My Program |
Adversarial Attacks to Direct Data-Driven Control for Destabilization (I) |
|
Sasahara, Hampei | Tokyo Institute of Technology |
Keywords: Cyber-Physical Security, Data driven control, Resilient Control Systems
Abstract: This study investigates the vulnerability of direct data-driven control to adversarial attacks in the form of a small but sophisticated perturbation added to the original data. The directed gradient sign method (DGSM) is developed as a specific attack method, based on the fast gradient sign method (FGSM), which has originally been considered in image classification. DGSM uses the gradient of the eigenvalues of the resulting closed-loop system and crafts a perturbation in the direction where the system becomes less stable. It is demonstrated that the system can be destabilized by the attack, even if the eigenvalues of the original closed-loop matrix with the clean data are aligned far from the unstable region. To increase the robustness against the attack, regularization methods that have been developed to deal with random disturbances are considered. Their effectiveness is evaluated by numerical experiments using an inverted pendulum model.
|
|
FrB04 Regular Session, Simpor Junior 4913 |
Add to My Program |
Autonomous Vehicles III |
|
|
Chair: Malikopoulos, Andreas A. | University of Delaware |
Co-Chair: Oguri, Kenshiro | Purdue University |
|
13:30-13:50, Paper FrB04.1 | Add to My Program |
NMPC Strategy for Safe Robot Navigation in Unknown Environments Using Polynomial Zonotopes |
|
Baptista Pereira Nascimento, Iuro | Federal University of Minas Gerais |
Rego, Brenner | Federal University of Minas Gerais |
Pimenta, Luciano C. A. | Universidade Federal De Minas Gerais |
Raffo, Guilherme Vianna | Federal University of Minas Gerais |
Keywords: Autonomous vehicles, Predictive control for nonlinear systems, Robotics
Abstract: This work proposes a nonlinear model predictive control (NMPC) strategy for robot navigation in cluttered unknown environments using polynomial zonotopes. The infor- mation provided by a laser sensor is used in the computation of the collision-free area. The procedure splits the area into convex subregions which are converted into polynomial zonotopes (PZs) to generate constraints for the NMPC optimal control problem. The PZ is a set representation that can describe polytopes using fewer constraints than conventional half-space representations, thus being more efficient while maintaining the accuracy equiv- alent to the polytopic case. Numerical experiments demonstrate the advantages of the proposed strategy.
|
|
13:50-14:10, Paper FrB04.2 | Add to My Program |
A Hierarchical Approach to Optimal Flow-Based Routing and Coordination of Connected and Automated Vehicles |
|
Bang, Heeseung | University of Delaware |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Autonomous vehicles, Traffic control, Transportation networks
Abstract: This paper addresses the challenge of generating optimal vehicle flow at the macroscopic level. Although several studies have focused on optimizing vehicle flow, little attention has been given to ensuring it can be practically achieved. To overcome this issue, we propose a route-recovery and eco-driving strategy for connected and automated vehicles (CAVs) that guarantees optimal flow generation. Our approach involves identifying the optimal vehicle flow that minimizes total travel time, given the constant travel demands in urban areas. We then develop a heuristic route-recovery algorithm to assign routes to CAVs. Finally, we present an efficient coordination framework to minimize the energy consumption of CAVs while safely crossing intersections. The proposed method can effectively generate optimal vehicle flow and potentially reduce travel time and energy consumption in urban areas.
|
|
14:10-14:30, Paper FrB04.3 | Add to My Program |
Multi-Occlusions Inference from Multiple Social Vehicles |
|
Zhao, Xuhe | Tongji University |
Zhang, Chaojie | Tongji University |
Wang, Jun | Tongji University |
Keywords: Autonomous vehicles
Abstract: Occlusions at intersections threaten the safety of autonomous driving, especially in urban scenarios. Phantom vehicles are usually considered to mitigate the risks associated with the occlusions, and their states may be inferred from observable vehicles. This paper extends these methods to deal with multiple occlusions and multiple inferences. A multi-occlusion aware model allows for the concerns of occluded areas and the configuration of phantom vehicles. The evidence theory fuses multiple estimations to enhance occlusion inferences. Simulations for intersection scenarios are conducted to demonstrate the effectiveness of the proposed method for improving traffic efficiency.
|
|
14:30-14:50, Paper FrB04.4 | Add to My Program |
Robust Collision Avoidance of Quadric and Polygonal Surfaces Moving in Planar Environments |
|
Dhal, Kashish | The University of Texas at Arlington |
Kashyap, Abhishek | University of Texas at Arlington |
Chakravarthy, Animesh | University of Texas at Arlington |
Keywords: Autonomous vehicles
Abstract: The determination of robot trajectories that can achieve reactive collision avoidance with moving objects is of great interest. In cluttered environments with tight spaces, it becomes important to consider the shapes of the objects in computing these trajectories. The literature largely models the shapes of the objects as circles, and this can make the avoidance maneuvers very conservative, especially when the objects are elongated and/or non-convex. In this paper, we model the shapes of the objects using combinations of quadric surfaces or polygons, and employ a collision cone approach to achieve reactive collision avoidance, in the presence of measurement noise. The collision avoidance design employs a dual-loop control architecture where the inner loop uses a dynamic inversion-based method and the outer loop uses Linear Matrix Inequalities (LMIs). Simulation results demonstrating the the collision avoidance laws for dynamic, heterogeneous quadric and polygonal surfaces are presented.
|
|
14:50-15:10, Paper FrB04.5 | Add to My Program |
Capturability of the Pulsed Guidance Law Based on Differential Game Theory |
|
Lu, Yuting | Beihang University |
Yu, Yang | Beihang University |
Zhao, Qilun | Beijing Institute of Electronic System Engineering |
Han, Tuo | Beihang University |
Hu, Qinglei | Beihang University |
Li, Dongyu | Beihang University |
Keywords: Autonomous vehicles
Abstract: The capturability analysis of the pulsed guidance law using differential game theory is presented in this paper. The engagement of an interceptor with pulsed guidance constraint and a maneuvering target with bounded acceleration is considered. The differential game guidance laws for both the interceptor and the target are proposed. Then the engagement is converted into four different cases according to the guidance law parameters, and the capture boundary conditions for these four cases are given as functions of the guidance law parameters and the system parameters, including acceleration constraints, the acceptable miss distance and initial values of the engagement. Afterwards, the capture zone is given according to the boundary conditions and is decided by the guidance law parameters and the system parameters. Finally, various specific examples show that the interception can be guaranteed with lower acceleration constraint ratio and higher guidance law parameter ratio.
|
|
15:10-15:30, Paper FrB04.6 | Add to My Program |
Smooth Indirect Solution Method for State-Constrained Optimal Control Problems with Nonlinear Control-Affine Systems |
|
Oguri, Kenshiro | Purdue University |
Keywords: Aerospace, Optimal control, Autonomous vehicles
Abstract: This paper presents an indirect solution method for state-constrained optimal control problems to address the long-standing issue of discontinuous control and costate under state constraints. It is known in optimal control theory that a state inequality constraint introduces discontinuities in control and costate, rendering the classical indirect solution methods ineffective. This study re-examines the necessary conditions of optimality for a class of state-constrained optimal control problems, and shows the uniqueness of the optimal control that minimizes the Hamiltonian under state constraints, which leads to a unifying form of the necessary conditions. The unified form of the necessary conditions opens the door to addressing the issue of discontinuities in control and costate by modeling them via smooth functions. This paper exploits this insight to transform the originally discontinuous problems to smooth two-point boundary value problems that can be solved by existing nonlinear root-finding algorithms. This paper also shows the formulated solution method to have an anytime algorithm-like property, and then numerically demonstrates the solution method by an optimal orbit control problem.
|
|
FrB05 Invited Session, Simpor Junior 4912 |
Add to My Program |
Learning and Decision-Making in Multi-Agent Systems |
|
|
Chair: Anderson, James | Columbia University |
Co-Chair: Mitra, Aritra | University of Pennsylvania |
Organizer: Anderson, James | Columbia University |
Organizer: Mitra, Aritra | University of Pennsylvania |
Organizer: Pappas, George J. | University of Pennsylvania |
|
13:30-13:50, Paper FrB05.1 | Add to My Program |
Multi-Agent Reinforcement Learning for Resource Allocation in Large-Scale Robotic Warehouse Sortation Centers (I) |
|
Shen, Yi | Duke University |
McClosky, Benjamin | Amazon Robotics |
Durham, Joseph W. | Kiva Systems |
Zavlanos, Michael M. | Duke University |
Keywords: Agents-based systems, Building and facility automation, Learning
Abstract: Robotic sortation centers use mobile robots to sort packages by their destinations. The destination-to-sort- location (chute) mapping can significantly impact the volume of packages that can be sorted by the sortation floor. In this work, we propose a multi-agent reinforcement learning method to solve large-scale chute mapping problems with hundreds of agents (the destinations). To address the exponential growth of the state-action space, we decompose the joint action-value function as the sum of local action-value functions associated with the individual agents. To incorporate robot congestion effects on the rates at which packages are sorted, we couple the local action-value functions through the states of destinations mapped to nearby chutes on the sortation floor. We show that our proposed framework can solve large chute mapping problems and outperforms static or reactive policies that are commonly used in practice in robotic sortation facilities.
|
|
13:50-14:10, Paper FrB05.2 | Add to My Program |
Performance Bounds for Policy-Based Reinforcement Learning Methods in Zero-Sum Markov Games with Linear Function Approximation (I) |
|
Winnicki, Anna | University of Illinois at Urbana Champaign |
Srikant, R | Univ of Illinois, Urbana-Champaign |
Keywords: Stochastic optimal control, Game theory
Abstract: Until recently, efficient policy iteration algorithms for zero-sum Markov games that converge were unknown. Therefore, model-based RL algorithms for such problems could not use policy iteration in the planning modules of the algorithms. In an earlier paper, we showed that a convergent policy iteration algorithm can be obtained by using a commonly used technique in RL called lookahead. However, the algorithm could be applied to the function approximation setting only in the special case of linear MDPs (Markov Decision Processes). In this paper, we obtain performance bounds for policy-based RL algorithms for general settings, including one where policy evaluation is performed using noisy samples of (state, action, reward) triplets from a single sample path of a given policy.
|
|
14:10-14:30, Paper FrB05.3 | Add to My Program |
Motion Planning for the Estimation of Functions (I) |
|
Raghavan, Aneesh | KTH Royal Insitute of Technology |
Sartori, Giacomo | UNIPD |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Learning, Optimization, Numerical algorithms
Abstract: We consider the problem of estimation of an unknown real valued function with real valued input by an agent. The agent exists in 3D Euclidean space. It is able to traverse in a 2D plane while the function is depicted in a 2D plane perpendicular to the plane of traversal. By viewing the function from a given position, the agent is able to collect a data point lying on the function. By traversing through the plane while paying a control cost, the agent collects a finite set of data points. The set of data points are used by the agent to estimate the function. The objective of the agent is to find a control law which minimizes the control cost while estimating the function optimally. We formulate a control problem for the agent incorporating an inference cost and the control cost. The control problem is relaxed by finding a lower bound for the cost function. We present a kernel based linear regression model to approximate the cost-to-go and use the same in a control algorithm to solve the relaxed optimization problem. We present simulation results comparing the proposed approach with greedy algorithm based exploration.
|
|
14:30-14:50, Paper FrB05.4 | Add to My Program |
Learning Agent Interactions from Density Evolution in 3D Regions with Obstacles (I) |
|
Tirumalai, Amoolya | University of Maryland, Institute for Systems Research |
Mavridis, Christos | University of Maryland, College Park |
Baras, John S. | University of Maryland |
Keywords: Identification, Agents-based systems, Computational methods
Abstract: In this work, we study the inverse problem of identifying complex flocking dynamics in a domain cluttered with obstacles. We get inspiration from animal flocks moving in complex ways with capabilities far beyond what current robots can do. Owing to the difficulty of observing and recovering the trajectories of the agents, we focus on the dynamics of their probability densities, which are governed by partial differential equations (PDEs), namely compressible Euler equations subject to non-local forces. We formulate the inverse problem of learning interactions as a PDE-constrained optimization problem of minimizing the squared Hellinger distance between the histogram of the flock and the distribution associated to our PDEs. The numerical methods used to efficiently solve the PDE-constrained optimization problem are described. Realistic flocking data are simulated using the Boids model of flocking agents, which differs in nature from the reconstruction models used in our PDEs. Our analysis and simulated experiments show that the behavior of cohesive flocks can be recovered accurately with approximate PDE solutions.
|
|
14:50-15:10, Paper FrB05.5 | Add to My Program |
Learning Personalized Models with Clustered System Identification (I) |
|
Toso, Leonardo Felipe | Columbia University |
Wang, Han | Columbia University |
Anderson, James | Columbia University |
Keywords: Statistical learning, Linear systems, Optimization
Abstract: We address the problem of learning linear system models by observing multiple trajectories from systems with differing dynamics. This framework encompasses a collaborative scenario where several systems seeking to estimate their dynamics are partitioned into clusters according to system similarity. Thus, the systems within the same cluster can benefit from the observations made by the others. Considering this framework, we present an algorithm where each system alternately estimates its cluster identity and performs an estimation of its dynamics. This is then aggregated to update the model of each cluster. We show that under mild assumptions, our algorithm correctly estimates the cluster identities and achieves an epsilon-approximate solution with a sample complexity that scales inversely with the number of systems in the cluster, thus facilitating a more efficient and personalized system identification.
|
|
15:10-15:30, Paper FrB05.6 | Add to My Program |
Decentralized Conflict Resolution for Multi-Agent Reinforcement Learning through Shared Scheduling Protocol (I) |
|
Ingebrand, Tyler | UT Austin |
Smith, Sophia | The University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Decentralized control, Learning, Cooperative control
Abstract: Decentralized multi-agent reinforcement learning (MARL) is an inherently difficult problem because agents can have individual, unique objectives and no direct incentive to cooperate. Conflicts often arise over bottlenecks in the environment, such as a shared key or an intersection, where multiple agents need to access a single resource. To resolve these conflicts, we propose the use of a shared scheduling protocol. A scheduling protocol coordinates agent behavior such that one agent is allowed to greedily use the resource while the others are required to wait. In particular, we are interested in decentralized scheduling protocols that can be implemented independently by each agent without a centralized controller. We present three protocols and prove that they resolve conflicts when obeyed by all agents. In training, agents learn to obey the protocol as violations incur a penalty. Experimental results show that scheduling protocols increase the performance of multi-agent training fivefold compared to baseline decentralized MARL.
|
|
FrB06 Invited Session, Simpor Junior 4911 |
Add to My Program |
Estimation and Control of Quantum Systems |
|
|
Chair: Dong, Daoyi | University of New South Wales |
Co-Chair: Amini, Nina H. | CNRS, L2S, CentraleSupelec |
Organizer: Dong, Daoyi | Australian National University |
Organizer: Amini, Nina H. | CNRS, L2S, CentraleSupelec |
Organizer: Wang, Yuanlong | Chinese Academy of Sciences |
|
13:30-13:50, Paper FrB06.1 | Add to My Program |
Two-Stage Solution for Ancilla-Assisted Quantum Process Tomography: Error Analysis and Optimal Design (I) |
|
Xiao, Shuixin | Shanghai Jiao Tong University |
Wang, Yuanlong | Chinese Academy of Sciences |
Dong, Daoyi | University of New South Wales |
Zhang, Jun | Shanghai Jiao Tong University |
Keywords: Quantum information and control
Abstract: Quantum process tomography (QPT) is a fundamental task to characterize the dynamics of quantum systems. In contrast to standard QPT, ancilla-assisted process tomography (AAPT) framework introduces an extra ancilla system such that a single input state is needed. In this paper, we extend the two-stage solution, a method originally designed for standard QPT, to perform AAPT. Our algorithm has O(Md_A^2d_B^2) computational complexity where M is the type number of the measurement operators, d_A is the dimension of the quantum system of interest, and d_B is the dimension of the ancilla system. Then we establish an error upper bound and further discuss the optimal design on the input state in AAPT. A numerical example on a phase damping process demonstrates the effectiveness of the optimal design and illustrates the theoretical error analysis.
|
|
13:50-14:10, Paper FrB06.2 | Add to My Program |
Analyzing and Unifying Robustness Measures for Excitation Transfer Control in Spin Networks |
|
O'Neil, Sean Patrick | University of Southern California |
Khalid, Irtaza | Cardiff University |
Rompokos, Athanasios | University of Southern California |
Weidner, Carrie Ann | University of Bristol |
Langbein, Frank C. | Cardiff University |
Schirmer, Sophie | Swansea University |
Jonckheere, Edmond | University of Southern California |
Keywords: Robust control, Quantum information and control
Abstract: Recent achievements in quantum control have resulted in advanced techniques for designing controllers for applications in quantum communication, computing, and sensing. However, the susceptibility of such systems to noise and uncertainties necessitates robust controllers that perform effectively under these conditions to realize the full potential of quantum devices. The time-domain log-sensitivity and a recently introduced robustness infidelity measure (RIM) are two means to quantify controller robustness in quantum systems. The former can be found analytically, while the latter requires Monte-Carlo sampling. In this work, the correlation between the log-sensitivity and the RIM for evaluating the robustness of single excitation transfer fidelity in spin chains and rings in the presence of dephasing is investigated. We show that the expected differential sensitivity of the error agrees with the differential sensitivity of the RIM, where the expectation is over the error probability distribution. Statistical analysis also demonstrates that the log-sensitivity and the RIM are linked via the differential sensitivity, and that the differential sensitivity and RIM are highly concordant. This unification of two means (one analytic and one via sampling) to assess controller robustness in a variety of realistic scenarios provides a first step in unifying various tools to model and assess robustness of quantum controllers.
|
|
14:10-14:30, Paper FrB06.3 | Add to My Program |
On the Mean-Field Belavkin Filtering Equation |
|
Chalal, Sofiane | CentraleSupelec/Université Paris-Saclay |
Amini, Nina H. | CNRS, L2S, CentraleSupelec |
Guo, Gaoyue | MICS/CentraleSupelec |
Keywords: Quantum information and control, Stochastic systems, Mean field games
Abstract: Following Kolokoltsov’s work [1], we present an extension of mean-field control theory in quantum framework. In particular such an extension is done naturally by considering the Belavkin quantum filtering and control theory in a mean- field setting. In this setting, the dynamics is described by a controlled Belavkin equation of McKean-Vlasov type. We prove the well-posedness of such an equation under imperfect measurement records. Furthermore, we show under purification assumption the propagation of chaos for perfect measurements. Finally, we apply particle methods to simulate the mean- field Belavkin equation and we provide numerical simulations showing the stabilization of the mean-field Belavkin equation by a feedback control strategy towards a chosen target state.
|
|
14:30-14:50, Paper FrB06.4 | Add to My Program |
Quantum Optimal Control for the Shaping of Single Photons (I) |
|
Dong, Xue | Tsinghua University |
Wu, Re-Bing | Tsinghua University |
Keywords: Quantum information and control, Optimal control
Abstract: The transmission of flying qubits carried by itinerant photons is fundamental in quantum communication networks. To physically match the receiver system, the single photons must be prepared in proper shapes, and this leads to a variety of flying-qubit control problems. In this paper, we introduce the optimal control theory to the shaping of single photons, where the control to be optimized are coherent driving fields. We design gradient-based algorithms to minimize the shape difference between the emitted and desired single photons. Simulation results show that the optimization can achieve high-fidelity in the generation of decaying photon shapes with a two-level atom. However, its performance is limited and hence has to be combined with incoherence controls when the target shape has a rising part.
|
|
14:50-15:10, Paper FrB06.5 | Add to My Program |
On the Robustness of Stability for Quantum Stochastic Systems (I) |
|
Liang, Weichao | CentraleSupelec, University of Paris Saclay |
Ohki, Kentaro | Kyoto University |
Ticozzi, Francesco | Università Di Padova |
Keywords: Quantum information and control, Robust control, Lyapunov methods
Abstract: We investigate the impact of undesired Markovian couplings to external systems on the stabilization towards pure states or subspaces of continuously monitored quantum systems under open-loop and feedback control protocols. In particular, we provide sufficient conditions on perturbations to maintain stability of the target and demonstrate the boundedness in mean of the solutions of perturbed systems under open-loop protocols. Effect on the stability of feedback protocols is also analyzed. This work is a step toward a comprehensive robustness analysis of measurement-based control of quantum systems with respect to model perturbations.
|
|
15:10-15:30, Paper FrB06.6 | Add to My Program |
Heisenberg Formulation of Adiabatic Elimination for Open Quantum Systems with Two TimeScales (I) |
|
Le Régent, Francçois-Marie | Alice&Bob |
Rouchon, Pierre | Mines Paris PSL |
Keywords: Quantum information and control, Reduced order modeling
Abstract: Consider an open quantum system governed by a Gorini–Kossakowski–Sudarshan–Lindblad master equation with two times-scales: a fast one, exponentially converging towards a linear subspace of quasi-equilibria; a slow one resulting from small decoherence and Hamiltonian dynamics. Usually adiabatic elimination is performed in the Schrödinger picture. We propose here a Heisenberg formulation where the invariant operators attached to the fast decay dynamics towards the quasi-equilibria subspace play a key role. Based on geometric singular perturbations, asymptotic expansions of the Heisenberg slow dynamics and of the fast invariant linear subspaces are proposed. They exploit Carr's approximation lemma from center-manifold and bifurcation theory. Second-order expansions are detailed and shown to ensure preservation, up to second-order terms, of the complete positivity for the slow propagator on a slow timescale. Such expansions can be exploited numerically to derive reduced-order dynamical models.
|
|
FrB07 Regular Session, Simpor Junior 4813 |
Add to My Program |
Game Theory VI |
|
|
Chair: Wang, Bing-Chang | Shandong University |
Co-Chair: Fu, Jie | University of Florida |
|
13:30-13:50, Paper FrB07.1 | Add to My Program |
Open-Loop Saddle Points for Irregular Linear-Quadratic Two-Person Zero-Sum Games (I) |
|
Liang, Yong | Shandong University |
Wang, Bing-Chang | Shandong University |
Xu, Juanjuan | Shandong University |
Zhang, Huanshui | Shandong University of Science and Technology |
Keywords: Linear systems, Optimal control, Game theory
Abstract: In this paper, we consider the feedback representation of open-loop saddle points for irregular linear-quadratic (LQ) two-person zero-sum games, where the control weighting matrices in the cost functional are only semidefinite. The existence of an open-loop saddle point is characterized by the solvability of a system of constrained linear forward-backward differential equations (FBDEs), together with a convexity-concavity condition. In classical zero-sum games, the feedback representation is obtained by decoupling the FBDEs through the regular solution to a Riccati differential equation. But the associated Riccati equation cannot be used to decouple the FBDEs to obtain the feedback representation of open-loop saddle points. The essential differences between regular and irregular LQ zero-sum games are investigated. The irregular feedback representation can be derived from two equilibrium conditions in two different layers by using the ``two-layer optimization" approach.
|
|
13:50-14:10, Paper FrB07.2 | Add to My Program |
Linear Convergence in Time-Varying Generalized Nash Equilibrium Problems |
|
Bianchi, Mattia | ETH Zurich |
Benenati, Emilio | Technische Universiteit Delft |
Grammatico, Sergio | Delft Univ. of Tech |
Keywords: Optimization algorithms, Game theory, Variational methods
Abstract: We study generalized games with full row rank equality constraints and we provide a strikingly simple proof of strong monotonicity of the associated KKT operator. This allows us to show linear convergence to a variational equilibrium of the resulting primal-dual pseudo-gradient dynamics. Next, we propose a fully-distributed algorithm with linear convergence guarantee for aggregative games under partial-decision information. Based on these results, we establish stability properties for online GNE seeking in games with time-varying cost functions and constraints. Finally, we illustrate our findings numerically on an economic dispatch problem for peer-to-peer energy markets.
|
|
14:10-14:30, Paper FrB07.3 | Add to My Program |
Multi-Agent Reach-Avoid Games: Two Attackers versus One Defender and Mixed Integer Programming |
|
Hu, Hanyang | Simon Fraser University |
Bui, Nhat Minh | Simon Fraser University |
Chen, Mo | Simon Fraser University |
Keywords: Optimal control, Game theory, Intelligent systems
Abstract: We propose a hybrid approach that combines Hamilton-Jacobi (HJ) reachability and mixed-integer optimiza- tion for solving a reach-avoid game with multiple attackers and defenders. The reach-avoid game is an important problem with potential applications in air traffic control and multi- agent motion planning; however, solving this game for many attackers and defenders is intractable due to the adversarial nature of the agents and the high problem dimensionality. In this paper, we first propose an HJ reachability-based method for solving the reach-avoid game in which 2 attackers are playing against 1 defender; we derive the numerically convergent optimal winning sets for the two sides in environments with obstacles. Utilizing this result and previous results for the 1 vs. 1 game, we further propose solving the general multi-agent reach-avoid game by determining the defender assignments that can maximize the number of attackers captured via a Mixed Integer Program (MIP). Our method generalizes previous state- of-the-art results and is especially useful when there are fewer defenders than attackers. We validate our theoretical results in numerical simulations.
|
|
14:30-14:50, Paper FrB07.4 | Add to My Program |
Games for Efficient Supervisor Synthesis |
|
Jha, Prabhat Kumar | Chalmers University of Technology |
Hausmann, Daniel | Gothenburg University |
Piterman, Nir | University of Gothenburg |
Keywords: Supervisory control, Game theory, Discrete event systems
Abstract: In recent years, there was an increasing interest in the connections between supervisory control theory and reactive synthesis. As the two fields use similar techniques there is great hope that technologies from one field could be used in the other. In this spirit, we provide an alternative reduction from the supervisor synthesis problem to solving B"uchi games via games with a non-blocking objective. Our reduction is more compact and uniform than previous reductions. As a consequence, it gives an asymptotically better upper bound on the time complexity of the supervisory control synthesis problem. Our reduction also breaks a widely held belief about the impossibility of reduction of supervisory control synthesis problem to a game with linear winning condition.
|
|
14:50-15:10, Paper FrB07.5 | Add to My Program |
Synthesis of Opacity-Enforcing Winning Strategies against Colluded Opponent |
|
Shi, Chongyang | University of Florida |
Kulkarni, Abhishek | University of Florida at Gainesville |
Rahmani, Hazhar | University of Florida |
Fu, Jie | University of Florida |
Keywords: Discrete event systems, Formal Verification/Synthesis, Game theory
Abstract: This paper studies language-based opacity enforcement in a two-player, zero-sum game on a graph. In this game, player 1 (P1) wins if he can achieve a secret temporal goal described by the language of a finite automaton, no matter what strategy the opponent player 2 (P2) selects. In addition, P1 aims to win while making its goal opaque to a passive observer with imperfect information. However, P2 colludes with the observer to reveal P1's secret whenever P2 cannot prevent P1 from achieving its goal, and therefore, opacity must be enforced against P2. We show that a winning and opacity-enforcing strategy for P1 can be computed by reducing the problem to solving a reachability game augmented with the observer's belief states. Furthermore, if such a strategy does not exist, winning for P1 must entail the price of revealing his secret to the observer. We demonstrate our game-theoretic solution of opacity-enforcement control through a small illustrative example and in a robot motion planning problem.
|
|
FrB08 Regular Session, Simpor Junior 4812 |
Add to My Program |
Optimization Algorithms VI |
|
|
Chair: Daoutidis, Prodromos | Univ. of Minnesota |
Co-Chair: Ahn, Hyo-Sung | Gwangju Institute of Science and Technology (GIST) |
|
13:30-13:50, Paper FrB08.1 | Add to My Program |
Preconditioning Matrix Synthesis for a Projected Gradient Method for Solving Constrained Linear-Quadratic Optimal Control Problems |
|
Heuts, Yannick (Y.J.J.) | Eindhoven University of Technology |
Donkers, M.C.F. | Eindhoven University of Technology |
Keywords: Numerical algorithms, LMIs, Optimization algorithms
Abstract: This paper presents a method for synthesizing preconditioning matrices for a heavy-ball accelerated projected primal-dual method. The main focus lies on linear quadratic optimal control problems, as they have a specific structure that can be exploited for fast computational times. This gradient method is rewritten into a Lur'e-type system, such that convergence of the algorithm can be enforced through finding an appropriate Lyapunov function for the Lur'e system. It has been shown that for a small problem, it is possible to synthesize preconditioning matrices and that the method is 10^4 times faster than solving the projection using a dedicated solver.
|
|
13:50-14:10, Paper FrB08.2 | Add to My Program |
Distributed Least Square Approach for Solving a Multiagent Linear Algebraic Equation |
|
Pham, Viet Hoang | GIST |
Ahn, Hyo-Sung | Gwangju Institute of Science and Technology (GIST) |
Keywords: Agents-based systems, Optimization algorithms, Linear systems
Abstract: This paper considers a linear algebraic equation over a multiagent network. The coefficient matrix is partitioned into multiple blocks; each agent only knows a subset of these blocks in different row and column partitions. Based on a proximal ADMM algorithm, we design a distributed method for every agent to find its corresponding parts in one least square solution of the considered linear algebraic equation. Each agent uses only its information and communicates with its neighbors. We show that the designed method achieves an exponentially fast convergence for an arbitrary initial setup. Numerical simulations in MATLAB are provided to verify the effectiveness of the designed method.
|
|
14:10-14:30, Paper FrB08.3 | Add to My Program |
Data-Driven and Online Estimation of Linear Sensitivity Distribution Factors: A Low-Rank Approach |
|
Ospina, Ana M. | University of Colorado Boulder |
Dall'Anese, Emiliano | University of Colorado Boulder |
Keywords: Power systems, Smart grid, Optimization algorithms
Abstract: Estimation of sensitivity matrices in electrical transmission systems allows grid operators to evaluate in real-time how changes in power injections reflect into changes in power flows. In this paper, we propose a robust low-rank minimization approach to estimate sensitivity matrices based on measurements of power injections and power flows. An online proximal-gradient method is proposed to estimate sensitivities on-the-fly from real-time measurements. The proposed method obtains meaningful estimates with fewer measurements when the regression model is underdetermined, in contrast with existing methods based on least-squares approaches. In addition, our method can also identify faulty measurements and handle missing data. In this work, convergence results in terms of dynamic regret are presented. Numerical tests corroborate the effectiveness of the novel approach and the robustness of missing measurements and outliers.
|
|
14:30-14:50, Paper FrB08.4 | Add to My Program |
Exact Noise-Robust Distributed Gradient-Tracking Algorithm for Constraint-Coupled Resource Allocation Problems (I) |
|
Wu, Wenwen | Shanghai Jiao Tong University |
Zhu, Shanying | Shanghai Jiao Tong University |
Liu, Shuai | Shandong University |
Guan, Xin-Ping | Shanghai Jiao Tong University |
Keywords: Optimization algorithms, Distributed control, Agents-based systems
Abstract: The Industrial Internet of Things gradually becomes a new paradigm for information exchange in the industrial production environment. To ensure the high reliability of IIoT services, an efficient resource allocation method with good robustness is urgently needed under complex industrial environments. This paper considers the distributed constraint-coupled resource allocation problem with noisy information exchange over an undirected network, where each agent holds a private cost function and obtains the solution via only local communications. Communication noise poses a challenge to gradient-tracking based algorithm as the impact of noise will accumulate and its variance tends to infinity when the noise is persistent. Adopting noise-tracing scheme, we propose an exact noise-robust distributed gradient-tracking algorithm to achieve cost-optimal distribution of resources, which can avoid noise-accumulation in the tracking step. Moreover, noise suppression parameters are introduced to further attenuate the impact of noise. With diminishing suppression parameters, it is theoretically proved that the proposed algorithm is able to achieve exact convergence to the optimal solution. Finally, a numerical example is provided for verification.
|
|
14:50-15:10, Paper FrB08.5 | Add to My Program |
Optimal Design of Control-Lyapunov Functions by Semi-Infinite Stochastic Programming (I) |
|
Tang, Wentao | NC State University |
Daoutidis, Prodromos | Univ. of Minnesota |
Keywords: Lyapunov methods, Nonlinear systems, Optimization
Abstract: Lyapunov-based control is a common method to enforce closed-loop stability of nonlinear systems, where the choice of a control-Lyapunov function has a strong impact on the resulting performance. In this paper, we propose a generic semi-infinite stochastic programming formulation for the optimal control-Lyapunov function design problem and discuss its various specializations. Specifically, the expected performance evaluated on simulated trajectories under different scenarios is optimized subject to infinite constraints on stability and performance specifications. A stochastic proximal primal-dual algorithm is introduced to find a stationary solution of such a semi-infinite stochastic programming problem. The proposed method is illustrated by a chemical reactor case study.
|
|
FrB09 Regular Session, Simpor Junior 4811 |
Add to My Program |
Optimization III |
|
|
Chair: Pu, Ye | The University of Melbourne |
Co-Chair: Dai, Ran | Purdue University |
|
13:30-13:50, Paper FrB09.1 | Add to My Program |
Reinforcement Learning-Guided Quadratically Constrained Quadratic Programming for Enhanced Convergence and Optimality |
|
Chaoying, Pei | Purdue University |
Xu, Zhi | Purdue University |
You, Sixiong | Purdue University |
Sun, Jeffrey | Purdue University |
Dai, Ran | Purdue University |
Keywords: Optimization, Optimization algorithms, Learning
Abstract: In the context of Quadratically Constrained Quadratic Programming (QCQP) with dynamic parameters, the effectiveness of various optimization approaches is heavily influenced by the quality of the initial guess. To address this challenge, this paper proposes a novel approach that leverages reinforcement learning (RL) to generate high-performing initial guesses for iterative algorithms, with the dynamic parameters serving as inputs. Our approach aims to accelerate convergence and improve the objective value, thereby enabling efficient and effective solutions to the QCQP problem under variability. In this study, we evaluate the proposed approach by applying it to an iterative algorithm, specifically the Iterative Rank Minimization (IRM) algorithm. Our empirical evaluations demonstrate the efficacy of the proposed approach in solving QCQP problems with dynamic parameters. The RL-guided IRM algorithm yields high-quality solutions, as evidenced by significantly improved optimality and faster convergence when compared to the original IRM algorithm.
|
|
13:50-14:10, Paper FrB09.2 | Add to My Program |
Ergodic Convergence Results for the Arrow–Hurwicz Differential System |
|
Niederlaender, Simon | Siemens AG |
Keywords: Optimization, Optimization algorithms, Lyapunov methods
Abstract: In a real Hilbert space setting, we investigate the ergodic convergence properties of the solutions of the classical Arrow–Hurwicz differential system in view of solving linearly constrained convex minimization problems. Under the convexity hypothesis on the objective function of the minimization problem, we recover the fact that every solution of the Arrow–Hurwicz differential system weakly converges in average towards its asymptotic center. Moreover, it is shown that the primal-dual gap function relative to an averaged solution obeys the asymptotic estimate mathcal{O}(1/t) as tto+infty. If, in addition, the linear operator associated with the constraint function of the minimization problem is bounded from below, we find that the primal-dual gap function obeys the refined asymptotic estimate mathcal{O}(1/t^{2}) as tto+infty. Numerical experiments illustrate our theoretical findings.
|
|
14:10-14:30, Paper FrB09.3 | Add to My Program |
Multivariate Polynomial Optimization in Complex Variables Is a (Rectangular) Multiparameter Eigenvalue Problem |
|
Vermeersch, Christof | KU Leuven |
Lagauw, Sibren | KU Leuven |
De Moor, Bart L.R. | Katholieke Universiteit Leuven |
Keywords: Optimization, Optimization algorithms, Numerical algorithms
Abstract: We extend the relation between univariate polynomial optimization in one complex variable and the polynomial eigenvalue problem to the multivariate case. The first-order necessary conditions for optimality of the multivariate polynomial optimization problem, which are computed using Wirtinger derivatives, constitute a system of multivariate polynomial equations in the complex variables and their complex conjugates. Wirtinger calculus provides an elegant way to differentiate real-valued (cost) functions in complex variables. An elimination of the complex conjugate variables, via the Macaulay matrix, results in a (rectangular) multiparameter eigenvalue problem, (some of) the eigentuples of which correspond to the stationary points of the original real-valued cost function. We illustrate our novel globally optimal optimization approach with several (didactical) examples.
|
|
14:30-14:50, Paper FrB09.4 | Add to My Program |
Linearized ADMM for Nonsmooth Nonconvex Optimization with Nonlinear Equality Constraints |
|
El Bourkhissi, Lahcen | University Polytechnic of Bucharest |
Necoara, Ion | University Politehnica Bucharest and Institute of Mathematical S |
Patrinos, Panagiotis | KU Leuven |
Keywords: Optimization, Optimization algorithms, Predictive control for nonlinear systems
Abstract: This paper proposes a new approach for solving a structured nonsmooth nonconvex optimization problem with nonlinear equality constraints, where both the objective function and constraints are 2-blocks separable. Our method is based on a 2-block linearized ADMM, where we linearize the smooth part of the cost function and the nonlinear term of the functional constraints in the augmented Lagrangian at each outer iteration. This results in simple subproblems, whose solutions are used to update the iterates of the 2 blocks variables. We prove global convergence for the sequence generated by our method to a stationary point of the original problem. To demonstrate its effectiveness, we apply our proposed algorithm as a solver for the nonlinear model predictive control problem of an inverted pendulum on a cart.
|
|
14:50-15:10, Paper FrB09.5 | Add to My Program |
Tight Lower Bounds on the Convergence Rate of Primal-Dual Dynamics for Equality Constrained Convex Problems |
|
Ozaslan, Ibrahim Kurban | University of Southern California |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Optimization, Optimization algorithms, Stability of nonlinear systems
Abstract: We study the exponential stability of continuous-time primal-dual gradient flow dynamics for convex optimization problems with linear equality constraints. Without making any assumptions on the rank of the constraint matrix, we obtain a tight lower bound on the worst-case convergence rate for smooth strongly convex objective functions. Our analysis identifies two different convergence regimes depending on the ratio between primal and dual time scales. When the primal time scale is inversely proportional to the Lipschitz parameter of the objective function and the dual time scale is large enough, the convergence rate is inversely proportional to the condition number of the problem. In contrast to the existing results, our lower bound on the convergence rate does not depend on the condition number of the constraint matrix.
|
|
15:10-15:30, Paper FrB09.6 | Add to My Program |
Second-Order Properties of Noisy Distributed Gradient Descent |
|
Qin, Lei | The University of Melbourne |
Cantoni, Michael | University of Melbourne |
Pu, Ye | The University of Melbourne |
Keywords: Optimization, Optimization algorithms
Abstract: We study a fixed step-size distributed gradient descent algorithm for solving optimization problems in which the objective is a finite sum of smooth but possibly non-convex functions. Random perturbations of the gradient descent directions are introduced at each step to actively evade saddle points. Under certain regularity conditions, and with a suitable step-size, it is established that each agent converges to a neighborhood of a local minimizer; the size of the neighborhood depends on the step-size and a probabilistic confidence parameter. A numerical example is presented to illustrate the effectiveness of the random perturbations in terms of escaping saddle points in fewer iterations than without the perturbations.
|
|
FrB10 Regular Session, Roselle Junior 4713 |
Add to My Program |
Learning I |
|
|
Chair: Zeng, Shen | Washington University in St. Louis |
Co-Chair: Siami, Milad | Northeastern University |
|
13:30-13:50, Paper FrB10.1 | Add to My Program |
An Iterative Approach to Safe Learning in Unknown Constrained Environments |
|
Vu, Minh | Washington University in St. Louis |
Zeng, Shen | Washington University in St. Louis |
Keywords: Learning, Constrained control, Autonomous systems
Abstract: This paper presents an iterative learning technique to safely guide a nonlinear system with unknown dynamics through an environment with unspecified constraints. The presented approach leverages the system’s local dynamics to incrementally explore the environment and learn the appropriate control, which allows us to avoid the data-intensive task of learning an accurate global system model. Due to the local nature of this approach, the system’s safe operating region does not need to be pre-specified as long as local areas of the constraints can be identified when the system approaches those areas. The functionality and efficiency of the proposed approach are demonstrated through simulation of a unicycle and a high-dimensional nonlinear quadcopter, indicating the system’s ability to learn dynamics from data and safely navigate unknown environments.
|
|
13:50-14:10, Paper FrB10.2 | Add to My Program |
Optimal Containment Control of Nonlinear MASs: A Time-Aggregation-Based Policy Iteration Algorithm |
|
Shi, Xiongtao | Harbin Institute of Technology (Shenzhen) |
Li, Yanjie | Harbin Institue of Technology Shenzhen Graduate School |
Du, Chenglong | Central South University |
Keywords: Learning, Discrete event systems, Cooperative control
Abstract: In this paper, the optimal containment control of a class of unknown nonlinear multi-agent systems (MASs) is studied via a time-aggregation (TA) based model-free reinforcement learning (RL) algorithm. By proposing TA-based event-state, event-control, and integration-reward, the model-free TA-based policy iteration (TA-PI) approach is synthesized such that the policy evaluation and policy improvement steps are only executed for finite event-state, and the optimal control protocol is obtained with fewer computational requirements. Besides, the control input is intermittently updating only when the event-set is visited, which greatly reduce the updating frequency of control. Therefore, the proposed learning algorithm helps to save computational resources in both learning process and control updating. Moreover, armed with a finite predefined event-set, the developed TA-PI algorithm without employing function approximator and state discretization, resulting a strict convergence analysis via the mathematical induction. Finally, simulation results are given to show the feasibility and effectiveness of the proposed algorithm.
|
|
14:10-14:30, Paper FrB10.3 | Add to My Program |
Multi-Task System Identification of Similar Linear Time-Invariant Dynamical Systems |
|
Chen, Yiting | University of Colorado Boulder |
Ospina, Ana M. | University of Colorado Boulder |
Pasqualetti, Fabio | University of California, Riverside |
Dall'Anese, Emiliano | University of Colorado Boulder |
Keywords: Learning, Identification, Optimization algorithms
Abstract: This paper presents a system identification framework - inspired by multi-task learning - to estimate the dynamics of a given number of linear time-invariant (LTI) systems jointly by leveraging structural similarities across the systems. In particular, we consider LTI systems that model networked systems with similar connectivity, or LTI systems with small differences in their matrices. The system identification task involves the minimization of the least-squares (LS) fit for individual systems, augmented with a regularization function that enforces structural similarities. The proposed method is particularly suitable for cases when the recorded trajectories for one or more LTI systems are not sufficiently rich, leading to ill-conditioning of LS methods. We analyze the performance of the proposed method when the matrices of the LTI systems feature a common sparsity pattern (i.e., similar connectivity), and provide simulations based on real data for the estimation of the brain dynamics. We show that the proposed method requires a significantly smaller number of fMRI scans to achieve similar error levels of the LS.
|
|
14:30-14:50, Paper FrB10.4 | Add to My Program |
Derivative Feedback Control Using Reinforcement Learning |
|
Zaheer, Muhammad Hamad | University of New Hampshire |
Yoon, Se Young (Pablo) | University of New Hampshire |
Rizvi, Syed Ali Asad | Tennessee Technological University |
Keywords: Learning, Optimal control, Uncertain systems
Abstract: In this paper, the model-free state-derivative and output-derivative feedback control of continuous-time dynamic systems are considered. First, an online iterative algorithm is designed following the reinforcement learning framework, in which measurements of the system state derivatives are collected to synthesize an optimal state-derivative feedback control law. The proposed approach employs the integral reinforcement learning technique to iteratively solve a derivative feedback algebraic Riccati equation. The iterative algorithm is extended to the output-derivative feedback case by introducing a state-parametrization scheme that reconstructs the state-derivative signal from the output derivatives and input data. Based on this parametrization, we develop an online iterative algorithm based on the reinforcement learning framework to determine the optimal output-derivative feedback controller. Convergence of the iterative algorithms to the analytical optimal control solutions is demonstrated. Numerical simulations motivated by practical applications demonstrate the benefits of our method compared to the standard output feedback reinforcement learning algorithms.
|
|
14:50-15:10, Paper FrB10.5 | Add to My Program |
Dynamics and Perturbations of Overparameterized Linear Neural Networks |
|
Castello Branco de Oliveira, Arthur | Northeastern University |
Siami, Milad | Northeastern University |
Sontag, Eduardo | Northeastern University |
Keywords: Learning, Optimization, Nonlinear systems
Abstract: Recent research in neural networks and machine learning suggests that using many more parameters than strictly required by the initial complexity of a regression problem can result in more accurate or faster-converging models -- contrary to classical statistical belief. This phenomenon, sometimes referred to as "benign overfitting", raises questions regarding in what other ways might overparameterization affect the properties of a learning problem. In this work, we investigate the effects of overfitting on the robustness of gradient-descent training when subject to uncertainty on the gradient estimation, which arises naturally if the gradient is estimated from noisy data or directly measured. Our object of study is a linear neural network with a single, arbitrarily wide, hidden layer and an arbitrary number of inputs and outputs, which can be equivalently written as an overparameterized matrix factorization problem. In this paper we solve the problem for the case where the input and output of our neural network are one-dimensional, deriving sufficient conditions for the robustness of our system based on necessary and sufficient conditions for convergence in the undisturbed case. We then show that the general overparameterized formulation introduces a set of spurious equilibria that lay outside the set where the loss function is minimized, and discuss directions of future work that might extend our current results for more general settings.
|
|
15:10-15:30, Paper FrB10.6 | Add to My Program |
Convergence of Gradient-Based MAML in LQR |
|
Musavi, Negin | University of Illinois Urbana Champaign |
Dullerud, Geir E. | Univ of Illinois, Urbana-Champaign |
Keywords: Learning, Optimization, Optimal control
Abstract: The main objective of this research paper is to investigate the local convergence characteristics of Model-agnostic Meta-learning (MAML) when applied to linear system quadratic optimal control (LQR). MAML and its variations have become popular techniques for quickly adapting to new tasks by leveraging previous learning knowledge in areas like regression, classification, and reinforcement learning. However, its theoretical guarantees remain unknown due to non-convexity and its structure, making it even more challenging to ensure stability in the dynamic system setting. This study focuses on exploring MAML in the LQR setting, providing its local convergence guarantees while maintaining the stability of the dynamical system. The paper also presents simple numerical results to demonstrate the convergence properties of MAML in LQR tasks.
|
|
FrB11 Regular Session, Roselle Junior 4712 |
Add to My Program |
Large-Scale Systems II |
|
|
Chair: Tegling, Emma | Lund University |
Co-Chair: Stefansson, Elis | KTH Royal Institute of Technology |
|
13:30-13:50, Paper FrB11.1 | Add to My Program |
Multi-UAV Trajectory Planning Problem Using the Difference of Convex Function Programming |
|
Ngo, Anh Phuong | North Carolina A&T State University |
Thomas, Christan | Lockheed Martin Corporation |
Karimoddini, Ali | North Carolina A&T State University |
Nguyen, Hieu | North Carolina Agricultural and Technical State University |
Keywords: Air traffic management, Computational methods, Numerical algorithms
Abstract: Trajectory planning for a swarm of UAVs is known as a challenging nonconvex optimization problem, particularly due to a large number of collision avoidance constraints required for individual pairs of UAVs in the swarm. In this paper, we tackle this nonconvexity by leveraging the difference of convex function (DC) programming. We introduce the slack variables to relax and reformulate the collision avoidance conditions and employ the penalty function term to equivalently convert the problem into the DC form. Consequently, we construct a penalty DC algorithm in which we sequentially solve a set of convex optimization problems obtained by linearizing the collision avoidance constraint. The algorithm iteratively tightens the safety condition and reduces the objective cost of the planning problem and the additional penalty term. Numerical results demonstrate the effectiveness of the proposed approach in planning a large number of UAVs in congested space.
|
|
13:50-14:10, Paper FrB11.2 | Add to My Program |
New Formal Descriptions for Timed Coloured Petri Nets Using Formal Series |
|
Bal dit Sollier, Louis | ENS Paris-Saclay, EDF R&D |
Ourghanlian, Alain | Edf R&d Chatou |
Amari, Saïd | LURPA - Ecole Normale Paris Saclay |
Keywords: Petri nets
Abstract: This paper proposes to introduce a dioid of coloured formal series for the description of Timed Coloured Petri Nets (TCPN). These formal series allow us to express TCPN time/event shiftings. As coloured Petri nets can reach a high level of complexity, we first present the coloured formal series for linear systems. Nonetheless, we extend our work to the time and event shiftings in a conflict situation. This example shows the powerful expressiveness of these series and that we can build configurable models for which the transfer function computation can be automated via predictability assumptions.
|
|
14:10-14:30, Paper FrB11.3 | Add to My Program |
Efficient and Reconfigurable Optimal Planning in Large-Scale Systems Using Hierarchical Finite State Machines (I) |
|
Stefansson, Elis | KTH Royal Institute of Technology |
Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Automata, Hierarchical control, Large-scale systems
Abstract: In this paper, we consider a planning problem for a large-scale system modelled as a hierarchical finite state machine (HFSM) and develop a control algorithm for computing optimal plans between any two states. The control algorithm consists of two steps: a preprocessing step computing optimal exit costs for each machine in the HFSM, with time complexity scaling linearly with the number of machines, and a query step that rapidly computes optimal plans, truncating irrelevant parts of the HFSM using the optimal exit costs, with time complexity scaling near-linearly with the depth of the HFSM. The control algorithm is reconfigurable in the sense that a change in the HFSM is efficiently handled, updating only needed parts in the preprocessing step to account for the change, with time complexity linear in the depth of the HFSM. We validate our algorithm on a robotic application, comparing it with Dijkstra's algorithm and Contraction Hierarchies. Our algorithm outperforms both.
|
|
14:30-14:50, Paper FrB11.4 | Add to My Program |
A Closed-Loop Design for Scalable High-Order Consensus |
|
Hansson, Jonas | Lund University |
Tegling, Emma | Lund University |
Keywords: Decentralized control, Distributed control, Large-scale systems
Abstract: This paper studies the problem of coordinating a group of nth-order integrator systems. As in the well-studied conventional consensus problem, we consider linear and distributed control with only local and relative measurements. We propose a closed-loop dynamic that we call serial consensus and prove it achieves nth-order consensus regardless of model order and underlying network graph. This alleviates an important scalability limitation in conventional consensus dynamics of order n >= 2, whereby they may lose stability if the underlying network grows. The distributed control law which achieves the desired closed loop dynamics is shown to be localized and obey the limitation to relative state measurements. Furthermore, through use of the small-gain theorem, the serial consensus system is shown to be robust to both model and feedback uncertainties. We illustrate the theoretical results through examples.
|
|
14:50-15:10, Paper FrB11.5 | Add to My Program |
Controlling Identical Linear Multi-Agent Systems Over Directed Graphs |
|
Zaupa, Nicola | LAAS, CNRS |
Zaccarian, Luca | LAAS-CNRS |
Tarbouriech, Sophie | LAAS-CNRS |
Queinnec, Isabelle | LAAS-CNRS |
Giordano, Giulia | University of Trento |
Keywords: Decentralized control, LMIs, Linear systems
Abstract: We consider the problem of synchronizing a multi-agent system (MAS) composed of several identical linear systems connected through a directed graph. To design a suitable controller, we construct conditions based on Bilinear Matrix Inequalities (BMIs) that ensure state synchronization. Since these conditions are non-convex, we propose an iterative algorithm based on a suitable relaxation that allows us to formulate Linear Matrix Inequality (LMI) conditions. As a result, the algorithm yields a common state-feedback matrix for the controller that satisfies general linear performance constraints. Our results are achieved under the mild assumption that the graph is time-invariant and connected.
|
|
15:10-15:30, Paper FrB11.6 | Add to My Program |
Quantification of Distributionally Robust Risk of Cascade of Failures in Platoon of Vehicles (I) |
|
Pandey, Vivek | Lehigh University |
Liu, Guangyi | Lehigh University |
Amini, Arash | The University of Texas at Austin |
Motee, Nader | Lehigh University |
Keywords: Networked control systems, Stochastic systems, Delay systems
Abstract: Achieving safety is a critical aspect of attaining autonomy in a platoon of autonomous vehicles. In this paper, we propose a distributionally robust risk framework to investigate cascading failures in platoons. To examine the impact of network connectivity and system dynamics on the emergence of cascading failures, we consider a time-delayed network model of the platoon of vehicles as a benchmark. To study the cascading effects among pairs of vehicles in the platoon, we use the measure of conditional distributionally robust functional. We extend the risk framework to quantify cascading failures by utilizing a bi-variate normal distribution. Our work establishes closed-form risk formulas that illustrate the effects of time-delay, noise statistics, underlying communication graph, and sets of soft failures. The insights gained from our research can be applied to design safe platoons that are robust to the risk of cascading failures. We validate our results through extensive simulations.
|
|
FrB12 Regular Session, Roselle Junior 4711 |
Add to My Program |
Distributed Control II |
|
|
Chair: Watson, Jeremy | University of Canterbury |
Co-Chair: van Dijk, Stefan | Technical University of Eindhoven |
|
13:30-13:50, Paper FrB12.1 | Add to My Program |
State Aggregation for Distributed Value Iteration in Dynamic Programming |
|
Vertovec, Nikolaus | University of Oxford |
Margellos, Kostas | University of Oxford |
Keywords: Distributed control, Large-scale systems, Traffic control
Abstract: We propose a distributed algorithm to solve a dynamic programming problem with multiple agents, where each agent has only partial knowledge of the state transition probabilities and costs. We provide consensus proofs for the presented algorithm and derive error bounds of the obtained value function with respect to what is considered as the "true solution" obtained from conventional value iteration. To minimize communication overhead between agents, state costs are aggregated and shared between agents only when the updated costs are expected to influence the solution of other agents significantly. We demonstrate the efficacy of the proposed distributed aggregation method to a large-scale urban traffic routing problem. Individual agents compute the fastest route to a common access point and share local congestion information with other agents allowing for fully distributed routing with minimal communication between agents.
|
|
13:50-14:10, Paper FrB12.2 | Add to My Program |
Real-Time Deep-Learning-Driven Parallel MPC |
|
Kohút, Roman | Slovak University of Technology in Bratislava |
Pavlovičová, Erika | Slovak University of Technology in Bratislava |
Fedorová, Kristína | Slovak University of Technology in Bratislava |
Oravec, Juraj | Slovak University of Technology in Bratislava |
Kvasnica, Michal | Slovak University of Technology in Bratislava |
Keywords: Distributed control, Machine learning, Predictive control for linear systems
Abstract: A novel real-time approximated MPC control policy based on deep learning is proposed to address the high computational burden of model predictive control (MPC) for large-scale systems and those with fast dynamics. This control method approximates the optimal solution of the distributed optimization problems in the ALADIN-based parallel MPC design framework, resulting in a highly effective approach that outperforms other well-known methods for solving the MPC design problem. The numerical case study shows promising results, demonstrating the potential of this approach for real-time implementation.
|
|
14:10-14:30, Paper FrB12.3 | Add to My Program |
Local Input-To-State Stability for Consensus in the Presence of Intermittent Communication and Input Saturation |
|
Silva, Thales C. | University of Pennsylvania |
Hsieh, M. Ani | University of Pennsylvania |
Keywords: Distributed control, Networked control systems, LMIs
Abstract: This paper addresses the problem of reaching consensus under input saturation and intermittent communication, which can hinder the convergence of the system. We propose a method that translates the consensus into an equivalent stability problem. Then, we compute bounded sets that enclose the initial conditions and the evolution of trajectories leading to local input-to-state stability for systems interconnected over directed intermittent topologies. Our contributions include sufficient conditions for stability and stabilization of multi-agent systems under intermittent interactions and saturating inputs, with the ability to evaluate disturbance tolerance and rejection based on the regions that enclose the system’s trajectories. We define disturbance rejection in terms of the L2 gain, and formulate stability and controller design conditions as convex optimization problems. Our method enable the maximization of regions that ensure local input-to-state stability, we provide numerical examples highlighting the trade-offs between mean frequency of intermittent interactions, disturbance energy, and convergence region size.
|
|
14:30-14:50, Paper FrB12.4 | Add to My Program |
Decentralized Design for LQ Consensus in Multi-Agent Systems |
|
van Dijk, Stefan | Technical University of Eindhoven |
Chanfreut, Paula | Eindhoven University of Technology |
Heemels, W.P.M.H. | Eindhoven University of Technology |
Keywords: Distributed control, Optimal control, Decentralized control
Abstract: Consensus control of multi-agent systems (MAS) with integrator dynamics is a canonical and well-studied prob- lem in the literature. In contrast, the optimal distributed linear quadratic (LQ) consensus problem minimizing an appropriate quadratic cost is less studied. For this problem, most of the available design methods require global information on the interconnection graph and/or global (initial) state information. In this paper, we propose a suboptimal solution to the LQ-based distributed consensus control problem by approximating the quadratic cost function in a way that allows for decentralized design of the control gains. The resulting control protocol only uses information from neighboring agents for both control design and implementation. It has the additional benefit that the information needs to be only exchanged periodically, reducing the communication requirements of the agent. Despite the sub- optimality, asymptotic consensus is guaranteed for our control law, as we will formally prove. We illustrate by numerical simulations on a 6-agent system the effectiveness of our design and compare it to other approaches.
|
|
14:50-15:10, Paper FrB12.5 | Add to My Program |
Linear Quadratic Leader-Following Consensus of Multi-Agent Systems: A Decentralized Computation and Distributed Information Fusion Strategy |
|
Ren, Yunxiao | Peking University |
Qian, Jiachen | Peking University |
Duan, Zhisheng | Peking University |
Keywords: Distributed control, Optimal control, Networked control systems
Abstract: This study delves into the leader-following consensus problem in linear multi-agent systems that are structured on weight-balanced digraphs. The primary aim is to formulate a distributed, implementable optimal controller capable of achieving both leader consensus and simultaneous optimization of the linear quadratic cost function. To attain this objective, we introduce a decentralized computation approach and advocate for a distributed information strategy. Initially, the computation of the global Riccati equation is disassembled into the computation of local Riccati equations. Following that, we propose an information fusion algorithm utilizing the dynamic average consensus approach to unravel the optimal controller, enabling its implementation in a distributed fashion. Additionally, we offer numerical simulation examples to demonstrate the efficacy of our proposed approach.
|
|
15:10-15:30, Paper FrB12.6 | Add to My Program |
Discrete-Time Distributed Optimization for Linear Uncertain Multi-Agent Systems |
|
Liu, Tong | New York University |
Bin, Michelangelo | University of Bologna |
Notarnicola, Ivano | University of Bologna |
Parisini, Thomas | Imperial College & Univ. of Trieste |
Jiang, Zhong-Ping | New York University |
Keywords: Distributed control, Optimization, Uncertain systems
Abstract: The distributed optimization algorithm proposed by J. Wang and N. Elia in 2010 has been shown to achieve linear convergence for multi-agent systems with single-integrator dynamics. This paper extends their result, including the linear convergence rate, to a more complex scenario where the agents have heterogeneous multi-input multi-output linear dynamics and are subject to external disturbances and parametric uncertainties. Disturbances are dealt with via an internal-model-based control design, and the interaction among the tracking error dynamics, average dynamics, and dispersion dynamics is analyzed through a composite Lyapunov function and the cyclic small-gain theorem. The key is to ensure a small enough stepsize for the convergence of the proposed algorithm, which is similar to the condition for time-scale separation in singular perturbation theory.
|
|
FrB13 Regular Session, Roselle Junior 4613 |
Add to My Program |
Networked Systems |
|
|
Chair: Zhang, Yuan | School of Automation, Beijing Institute of Technology |
Co-Chair: Hurst, Winston | University of California, Santa Barbara |
|
13:30-13:50, Paper FrB13.1 | Add to My Program |
Stability of Non-Cooperative Load Balancing with Time-Varying Latency |
|
Giuseppi, Alessandro | University of Rome "La Sapienza" |
Menegatti, Danilo | DIAG, Università Degli Studi Di Roma La Sapienz |
Pietrabissa, Antonio | University of Rome "La Sapienza" |
Keywords: Control of networks, Switched systems, Network analysis and control
Abstract: The problem of non-cooperative load balancing arises in multi-agent scenarios where users/services compete for some limited resources. This study, leveraging on results from set stability and switched systems control theory, analyses the convergence properties of a class of load-balancing strategies towards a set of approximated adversarial equilibria in a scenario in which the performance of the resource providers is described by a time-varying latency function.
|
|
13:50-14:10, Paper FrB13.2 | Add to My Program |
Optimal Dynamic Trajectories for UAVs in Mobility-Enabled Relay System |
|
Hurst, Winston | University of California, Santa Barbara |
Mostofi, Yasamin | Univ. of California Santa Barbara |
Keywords: Control over communications, Markov processes, Machine learning
Abstract: We consider a UAV which acts as a mobile relay and must plan a trajectory to enable data transfer between multiple pairs of communication nodes. For each pair, successful data transfer is only possible in non-convex regions (relay regions) where a given quality of service requirement may be satisfied for both nodes. Trajectories consist of 1) locations where the UAV stops to relay (relay positions) and 2) a dynamic transition policy which determines the sequence in which the pairs are serviced. We are interested in minimizing the average time a bit waits at a source before being sent to a destination and first pose a general, non-convex problem that calls for optimization over both the relay positions and the dynamic transition policy. To find approximate solutions, we formulate an average cost semi-Markov decision process and propose a deep-reinforcement-learning-based algorithm to solve it. To validate our approach, we present the results of several simulation experiments, which show our approach significantly outperforms the state-of-the-art.
|
|
14:10-14:30, Paper FrB13.3 | Add to My Program |
Towards Minimal Data Rate Enforcing Regular Safety Properties: An Invariance Entropy Approach |
|
Tomar, Mahendra Singh | BITS Pilani K K Birla Goa Campus |
Zamani, Majid | University of Colorado Boulder |
Keywords: Control over communications, Networked control systems, Hybrid systems
Abstract: The study of minimal data rate for control using some notions of entropy has been so far limited to classical control tasks such as set invariance, state-estimation, or stabilization. In this work, for the first time, we present a study on sufficient data rates to enforce regular safety properties over uncertain systems with dynamics described by set valued maps. Every regular safety property has a set of bad prefixes which can be modelled by a deterministic finite automaton (DFA). The main idea is to construct a hybrid system by taking the product of the deterministic finite automata with the given system and studying the invariance feedback entropy (IFE) of controlled invariant sets of the hybrid system. If there exists a nonempty controlled invariant set for the hybrid system satisfying a certain property then there exists a coder-controller with a data rate not less than the IFE that can enforce the regular safety property over the original control system. We demonstrate the effectiveness of our results by designing a coder-controller enforcing a regular safety property over a linear control system.
|
|
14:30-14:50, Paper FrB13.4 | Add to My Program |
Faster Consensus Via a Sparser Controller |
|
Ballotta, Luca | University of Padova |
Gupta, Vijay | Purdue University |
Keywords: Control system architecture, Delay systems, Distributed control
Abstract: In this paper, we investigate the architecture of an optimal controller that maximizes the convergence speed of a consensus protocol with single-integrator dynamics. Under the assumption that communication delays increase with the number of hops from which information is allowed to reach each agent, we address the optimal control design under delayed feedback and show that the optimal controller features, in general, a sparsely connected architecture.
|
|
14:50-15:10, Paper FrB13.5 | Add to My Program |
Observability Blocking for Functional Privacy of Linear Dynamic Networks |
|
Zhang, Yuan | School of Automation, Beijing Institute of Technology |
Cheng, Ranbo | Beijing Institute of Technology |
Xia, Yuanqing | Beijing Institute of Technology |
Keywords: Control Systems Privacy, Network analysis and control, Observers for Linear systems
Abstract: This paper addresses the problem of determining the minimum set of state variables in a network that need to be blocked from direct measurements in order to protect functional privacy with respect to any output matrices. More precisely, the goal is to prevent adversarial observers or eavesdroppers from inferring a linear functional of states, either vector-wise or entry-wise. We relate the considered functional privacy to the concept of functional observability. Building on a PBH-like criterion for functional observability, we prove that both problems are NP-hard. However, by assuming a reasonable constant bound on the geometric multiplicities of the system's eigenvalues, we present an exact algorithm with polynomial time complexity for the vector-wise functional privacy protection problem. Based on this algorithm, we then provide a greedy algorithm for the entry-wise privacy protection problem. Finally, we provide an example to demonstrate the effectiveness of our proposed approach.
|
|
15:10-15:30, Paper FrB13.6 | Add to My Program |
Differentially Private Stochastic Convex Optimization for Network Routing Applications |
|
Tsao, Matthew | Stanford University |
Gopalakrishnan, Karthik | Stanford University |
Yang, Kaidi | National University of Singapore |
Pavone, Marco | Stanford University |
Keywords: Control Systems Privacy, Transportation networks, Optimization
Abstract: Network routing problems are common across many engineering applications. Computing optimal routing policies requires knowledge about network demand, i.e., the origin and destination (OD) of all requests in the network. However, privacy considerations make it challenging to share individual OD data that would be required for computing optimal policies. Privacy can be particularly challenging in standard network routing problems because sources and sinks can be easily identified from flow conservation constraints, making feasibility and privacy mutually exclusive. In this paper, we present a differentially private algorithm for network routing problems. The main ingredient is a reformulation of network routing which moves all user data-dependent parameters out of the constraint set and into the objective function. We then present an algorithm for solving this formulation based on a differentially private variant of stochastic gradient descent. In this algorithm, differential privacy is achieved by injecting noise, and one may wonder if this noise injection compromises solution quality. We prove that our algorithm is both differentially private and asymptotically optimal as the size of the training set goes to infinity. We corroborate the theoretical results with numerical experiments on a road traffic network which show that our algorithm provides differentially private and near-optimal solutions in practice.
|
|
FrB14 Regular Session, Roselle Junior 4612 |
Add to My Program |
Observers for Nonlinear Systems I |
|
|
Chair: Yong, Sze Zheng | Northeastern University |
Co-Chair: Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
|
13:30-13:50, Paper FrB14.1 | Add to My Program |
A Note on Observability of Nonlinear Discrete-Time Systems |
|
Kaldmäe, Arvo | Tallinn University of Technology |
Kotta, Ülle | Tallinn University of Technology |
Keywords: Observers for nonlinear systems, Algebraic/geometric methods, Nonlinear systems
Abstract: A concept of backward observability is introduced for nonlinear discrete-time control systems. According to the new definition the state variables can be expressed as functions of inputs, outputs and their backward shifts. It is shown that the new observability definition is more general for non-reversible systems than the one used typically in the literature. In case of reversible systems it is proved that backward observability is equivalent to the standard definition of observability. The new definition allows to enlarge the class of systems for which state variables can be estimated and the observers constructed.
|
|
13:50-14:10, Paper FrB14.2 | Add to My Program |
An Adaptive Observer for Time-Varying Nonlinear Systems - Application to a Crop Irrigation Model |
|
Dadjo, Mahugnon Gildas | MISTEA, Univ. Montpellier, INRAE, Institut Agro and Inria, Univ |
Efimov, Denis | Inria |
Harmand, Jérome | INRA |
Rapaport, Alain | INRAE & Univ. Montpellier |
Ushirobira, Rosane | Inria |
Keywords: Observers for nonlinear systems, Estimation, Lyapunov methods
Abstract: We propose an adaptive observer for a class of nonlinear time-varying systems, for which the regressor depends not only on the known input-output signals but also on all the unmeasured states. There are state disturbances, and measurements are corrupted by noise. The Lyapunov function method is used to prove that the coupled system-observer dynamics admits a state-independent input-to-output stability property in estimation errors from unknown inputs. Numerical simulations illustrate the presented approach on a three-dimensional crop irrigation model.
|
|
14:10-14:30, Paper FrB14.3 | Add to My Program |
A Masking Protocol for Private Communication and Attack Detection in Nonlinear Observers |
|
Cecilia, Andreu | Universitat Politècnica De Catalunya |
Astolfi, Daniele | Cnrs - Lagepp |
Casadei, Giacomo | Ecole Centrale Lyon |
Costa-Castelló, Ramon | Universitat Politècnica De Catalunya (UPC) |
Nesic, Dragan | University of Melbourne |
Keywords: Observers for nonlinear systems, Communication networks, Fault detection
Abstract: This work presents a novel masking protocol to secure the communication between a nonlinear plant and a nonlinear observer. Communication is secured in two senses. First, the privacy of the plant is preserved during the communication. Second, the protocol can detect a false-data injection attack in the communication link. The masking protocol is based on the use of washout-filters in nonlinear observers and the internal model principle.
|
|
14:30-14:50, Paper FrB14.4 | Add to My Program |
Interval Observers for Hybrid Dynamical Systems with Known Jump Times |
|
Pati, Tarun | Northeastern University |
Khajenejad, Mohammad | University of California, San Diego |
Daddala, Sai Praveen | Arizona State University |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Yong, Sze Zheng | Northeastern University |
Keywords: Observers for nonlinear systems, Hybrid systems
Abstract: This paper proposes a novel asymptotically stable interval estimator design for hybrid systems with nonlinear dynamics and observations under the assumption of known jump times. The proposed architecture leverages the concepts of mixed-monotone decompositions to construct a hybrid interval observer that is guaranteed to frame the true states (i.e., is correct) by construction. Moreover, using Lyapunov analysis and the positive system property of the framer error dynamics, we propose two approaches for computing the observer gains to achieve asymptotic stability of the error system based on mixed-integer semidefinite and linear programs. Further, our observer design incorporates additional degrees of freedom that may provide some advantages similar to coordinate transformations. Finally, we demonstrate the efficacy of the proposed hybrid observer design using two illustrative examples.
|
|
14:50-15:10, Paper FrB14.5 | Add to My Program |
Design of a Nonlinear Observer for a Class of Locally Lipschitz Systems by Using Input-To-State Stability: An LMI Approach |
|
Mohite, Shivaraj | PhD Student, University of Lorraine |
Alma, Marouane | CRAN Lorraine University |
Zemouche, Ali | CRAN UMR CNRS 7039 & Inria: EPI-DISCO |
Keywords: Observers for nonlinear systems, LMIs, Estimation
Abstract: This note addresses the problem of state estimation for a class of locally Lipschitz nonlinear systems under the effect of noise/disturbances. An observer structure based on Hilbert projection is used to handle the same class of nonlinearities. The boundedness of the state estimation error of the proposed observer is guaranteed by deploying the input-to-state stability (ISS) property. Based on ISS stability criterion, the new LMI condition is derived by combining a well-known LPV approach with a variant of Young inequality and novel matrix multipliers. The proposed LMIs have more decision variables than the methodologies presented in the literature, which provides extra degrees of freedom, and hence enhances LMI feasibility. Further, the effectiveness of the developed approach is highlighted through a numerical example. The performance of the observer is validated through the application of slip angle estimation in a nonlinear autonomous vehicle.
|
|
15:10-15:30, Paper FrB14.6 | Add to My Program |
State Estimation for Enhanced Low Dimensional Electrochemical Models of Lithium-Ion Batteries |
|
Khalil, Mira | Université De Lorraine |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Raël, Stéphane | Université De Lorraine |
Keywords: Observers for nonlinear systems, Modeling, Energy systems
Abstract: Safe and efficient operation of lithium-ion batteries requires an accurate estimation of the internal states. One approach is to design an observer based on an electrochemical model of the battery internal dynamics. However, electrochemical models, and their associated observers, typically require a high dimension to generate accurate variables. In this paper, we explain how to alleviate this limitation by correcting the lithium concentrations generated by a finite-dimensional electrochemical model derived from the spatial discretizations of partial differential equations (PDE). We show that the corrected concentrations asymptotically match those generated by the original PDEs for constant input currents, irrespectively of the order of the considered finite-dimensional model. We then exploit this fact to derive a new state space model for which we design an observer, whose global, robust convergence is supported by a Lyapunov analysis provided a linear matrix inequality holds. The estimated concentrations are then corrected to asymptotically match those of the original PDEs in absence of disturbances for constant currents. Simulation results show improvements both in terms of modeling and estimation accuracy as a result of the proposed corrections.
|
|
FrB15 Regular Session, Roselle Junior 4611 |
Add to My Program |
Resilient Control Systems |
|
|
Chair: Wen, Changyun | Nanyang Tech. Univ |
Co-Chair: Mallmann-Trenn, Frederik | King's College London |
|
13:30-13:50, Paper FrB15.1 | Add to My Program |
Resilient Temporal Logic Planning in the Presence of Robot Failures |
|
Kalluraya, Samarth | Washington University in St. Louis |
Pappas, George J. | University of Pennsylvania |
Kantaros, Yiannis | Washington University in St. Louis |
Keywords: Resilient Control Systems, Autonomous robots, Automata
Abstract: Several task and motion planning algorithms have been proposed recently to design paths for mobile robot teams with collaborative high-level missions specified using formal languages, such as Linear Temporal Logic (LTL). However, the designed paths often lack reactivity to failures of robot capabilities (e.g., sensing, mobility, or manipulation) that can occur due to unanticipated events (e.g., human intervention or system malfunctioning) which in turn may compromise mission performance. To address this novel challenge, in this paper we propose a new resilient mission planning algorithm for teams of heterogeneous robots with collaborative LTL missions. The robots are heterogeneous with respect to their capabilities while the mission requires applications of these skills at certain areas in the environment in a temporal/logical order. The proposed method designs paths that can adapt to unexpected failures of robot capabilities. This is accomplished by re-allocating sub-tasks to the robots based on their currently functioning skills while minimally disrupting the existing team motion plans. We provide experiments and theoretical guarantees demonstrating the efficiency and resiliency of the proposed algorithm.
|
|
13:50-14:10, Paper FrB15.2 | Add to My Program |
Reinforcement Learning-Based Optimal Control and Software Rejuvenation for Safe and Efficient UAV Navigation |
|
Chen, Angela | Carnegie Mellon University |
Mitsopoulos, Konstantinos | Institute for Human and Machine Cognition |
Romagnoli, Raffaele | Carnegie Mellon University |
Keywords: Resilient Control Systems, Machine learning, Autonomous vehicles
Abstract: Unmanned autonomous vehicles (UAVs) rely on effective path planning and tracking control to accomplish complex tasks in various domains. Reinforcement Learning (RL) methods are becoming increasingly popular in control applications, as they can learn from data and deal with unmodelled dynamics. Cyber-physical systems (CPSs), such as UAVs, integrate sensing, network communication, control, and computation to solve challenging problems. In this context, Software Rejuvenation (SR) is a protection mechanism that refreshes the control software to mitigate cyber-attacks, but it can affect the tracking controller's performance due to discrepancies between the control software and the physical system state. Traditional approaches to mitigate this effect are conservative, hindering the overall system performance. In this paper, we propose a novel approach that incorporates Deep Reinforcement Learning (Deep RL) into SR to design a safe and high-performing tracking controller. Our approach optimizes safety and performance, and we demonstrate its effectiveness during UAV simulations. We compare our approach with traditional methods and show that it improves the system's performance while maintaining safety constraints. Our approach takes 10 seconds less to reach the goal and we interpret this enhancement through a p-norm analysis.
|
|
14:10-14:30, Paper FrB15.3 | Add to My Program |
Resilient Containment Control of Multi-Agent Networks in Adversarial Environment |
|
Kuo, Chan-Yuan | Purdue University |
Du, Bin | Nanjing University of Aeronautics and Astronautics |
Sun, Dengfeng | Purdue University |
Keywords: Resilient Control Systems, Networked control systems, Fault tolerant systems
Abstract: We study the problem of containment control in an adversarial environment, where some of the agents may be adversarial. We identify the security issue of a containment control protocol in such adversarial environment. We propose a resilient containment control protocol that ensures the state of each non-adversarial (normal) follower agent converges to the convex hull spanned by the states of the normal leader agents. Specifically, our protocol is based on the method of resilient convex combination and works for agents with a vector state. For multi-agent networks consisting of one follower agent and multiple leaders, we provide a sufficient condition on the communication topology that guarantees the success of our proposed protocol. In addition, we provide a set of numerical simulations to demonstrate the success of our protocol and verify our theoretical results.
|
|
14:30-14:50, Paper FrB15.4 | Add to My Program |
Resilient Synchronization of Networked Euler-Lagrangian Systems in Adversarial Environments |
|
Chen, Hongjian | Nanyang Technological University |
Li, Xiaolei | Yanshan University |
Wen, Changyun | Nanyang Tech. Univ |
Fang, Xu | KTH Royal Institute of Technology |
Keywords: Resilient Control Systems, Nonlinear systems, Distributed control
Abstract: This paper considers the resilient synchronization problems of multiple Euler-Lagrange (EL) systems, where the communication network is affected by the external attacks or internal non-participant agents. The considered adversarial environments can cover several types of cyber-attacks, such as misbehaving agents and false data injection attacks. The remaining normal agents aim to reach a common decision despite the influence of faulty agents. To this end, a ``safe kernel” based secure control scheme is proposed for the networked Lagrangian systems. According to the scheme, each healthy agent generates a convex hull based on the states of its neighbors and updates its reference state toward this kernel only in each sampling instant to reduce the computational burden. The ``average sampling interval” is used to define the number of sampling instants for faster convergence. With an assumption on the number of misbehaving agents, the proposed scheme guarantees consensus of Euler-Lagrange systems even in adversarial environments. Mathematical proofs and a numerical example are presented to verify the resilience and validity of the proposed scheme.
|
|
14:50-15:10, Paper FrB15.5 | Add to My Program |
Dynamic Crowd Vetting: Collaborative Detection of Malicious Robots in Dynamic Communication Networks |
|
Cavorsi, Matthew | Harvard University |
Mallmann-Trenn, Frederik | King's College London |
Saldana, David | Lehigh University |
Gil, Stephanie | Harvard University |
Keywords: Resilient Control Systems, Robotics, Distributed control
Abstract: Coordination in a large number of networked robots is a challenging task, especially when robots are constantly moving around the environment and there are malicious attacks within the network. Various approaches in the literature exist for detecting malicious robots, such as message sampling or suspicious behavior analysis. However, these approaches require every robot to sample every other robot in the network, leading to a slow detection process that degrades team performance. This paper introduces a method that significantly decreases the detection time for legitimate robots to identify malicious robots in a scenario where legitimate robots are randomly moving around the environment. Our method leverages the concept of "Dynamic Crowd Vetting" by utilizing observations from random encounters and trusted neighboring robots' opinions to quickly improve the accuracy of detecting malicious robots. The key intuition is that as long as each legitimate robot accurately estimates the legitimacy of at least some fixed subset of the team, the second-hand information they receive from trusted neighbors is enough to correct any misclassifications and provide accurate trust estimations of the rest of the team. We show that the size of this fixed subset can be characterized as a function of fundamental graph and random walk properties. Furthermore, we formally show that as the number of robots in the team increases the detection time remains constant. We develop a closed form expression for the critical number of time-steps required for our algorithm to successfully identify the true legitimacy of each robot to within a specified failure probability. Our theoretical results are validated through simulations demonstrating significant reductions in detection time when compared to previous works that do not leverage trusted neighbor information.
|
|
15:10-15:30, Paper FrB15.6 | Add to My Program |
A Compositional Resilience Index for Computationally Efficient Safety Analysis of Interconnected Systems (I) |
|
Niu, Luyao | University of Washington |
Maruf, Abdullah Al | University of Washington |
Clark, Andrew | Washington University in St. Louis |
Mertoguno, Sukarno | Georgia Institute of Technology |
Poovendran, Radha | University of Washington |
Keywords: Resilient Control Systems, Nonlinear systems, Cyber-Physical Security
Abstract: Interconnected systems such as power systems and chemical processes are often required to satisfy safety properties in the presence of faults and attacks. Verifying safety of these systems, however, is computationally challenging due to nonlinear dynamics, high dimensionality, and combinatorial number of possible faults and attacks that can be incurred by the subsystems interconnected within the network. In this paper, we develop a compositional resilience index to verify safety properties of interconnected systems under faults and attacks. The resilience index is a tuple serving the following two purposes. First, it quantifies how a safety property is impacted when a subsystem is compromised by faults and attacks. Second, the resilience index characterizes the needed behavior of a subsystem during normal operations to ensure safety violations will not occur when future adverse events occur. We develop a set of sufficient conditions on the dynamics of each subsystem to satisfy its safety constraint, and leverage these conditions to formulate an optimization program to compute the resilience index. When multiple subsystems are interconnected and their resilience indices are given, we show that the safety constraints of the interconnected system can be efficiently verified by solving a system of linear inequalities. We demonstrate our developed resilience index using a numerical case study on chemical reactors connected in series.
|
|
FrB16 Regular Session, Peony Junior 4512 |
Add to My Program |
Emerging Control Applications |
|
|
Chair: Prandini, Maria | Politecnico Di Milano |
Co-Chair: Karlsson, Niklas | Amazon |
|
13:30-13:50, Paper FrB16.1 | Add to My Program |
Ancillary Services Provision Via Aggregation: Joint Power Flexibility Assessment and Disaggregation Policy Design |
|
Zamudio, Daniel | Politecnico Di Milano |
Falsone, Alessandro | Politecnico Di Milano |
Bianchi, Federico | Ricerca Sul Settore Energetico, RSE SpA |
Prandini, Maria | Politecnico Di Milano |
Keywords: Emerging control applications, Energy systems, Optimization
Abstract: We address the problem of assessing the power flexibility that a pool of prosumers equipped with a generalized storage device can offer to the electrical grid as an ancillary service for balancing power demand and generation. A key feature of the proposed approach is that the disaggregation policy is computed jointly with the aggregate flexibility set, and it is hence readily available for the pool to supply any (feasible) power profile request from the grid. Each prosumer is assumed to provide a contribution which is an affine function of the aggregated power profile. The coefficients of the affine policies are designed by solving a distributed optimization program where the volume of the aggregate flexibility set is maximized while satisfying the power and energy constraints of each storage device and additional constraints involving multiple (possibly all) devices. Simulation results show the superiority of the proposed approach with respect to a state-of-the-art method that inspired our work.
|
|
13:50-14:10, Paper FrB16.2 | Add to My Program |
Feedback-Control Based Hierarchical Multi-Constraint Ad Campaign Optimization |
|
Karlsson, Niklas | Amazon |
Keywords: Emerging control applications, Hierarchical control, Optimization algorithms
Abstract: Online advertising is typically implemented via real-time bidding, and advertising campaigns are then defined as extremely high-dimensional optimization problems. Advertisers often define a campaign by an order consisting of multiple lines. Campaign delivery constraints may be imposed on the order as a whole and on each ad line. E.g., there may be budget and cost per click constraints on the order and on each line individually. Furthermore, the sum of line budgets may exceed the order budget, and the cost per click constraint on lines may differ. This leaves room for cross-line budget optimization; i.e., budget may be shifted across lines to maximize the advertising value without violating the constraints. This paper derives the optimal bidding mechanism for a large family of constrained optimization problems. It is shown how the optimal bidding strategy can be implemented as scalable non-cooperating agents on the order and the individual lines.
|
|
14:10-14:30, Paper FrB16.3 | Add to My Program |
Entropy for Optimal Control on a Simplex with an Application to Behavioral Nudging |
|
Al Ahdab, Mohamad | Aalborg University |
Knudsen, Torben | Aalborg University, Denmark |
Stoustrup, Jakob | Aalborg University |
Leth, John | Aalborg University |
Keywords: Emerging control applications, Optimal control
Abstract: We study the utilization of the entropy function of inputs in solving two Optimal Control Problems (OCPs) with linear dynamics and inputs constrained to either a unit simplex, or a variable-sized simplex in which the size is also an input. By using the entropy function as part of the objective functional in the OCP, we are able to derive closed-form solutions for both cases. Additionally, we present an example of how the studied OCPs can be applied to choose between nudging techniques to encourage a specific behavior, such as adhering to medication, through the lens of behavioral momentum theory.
|
|
14:30-14:50, Paper FrB16.4 | Add to My Program |
Power Purchase Agreements with Renewables: Optimal Timing and Design |
|
Gao, Zuguang | University of California, Irvine |
Sunar, Nur | University of North Carolina at Chapel Hill |
Birge, John | University of Chicago |
Keywords: Emerging control applications, Power systems, Optimal control
Abstract: In this paper, we design a power purchase agreement (PPA) where the firm agrees to make a certain transfer payment to the renewable generator, and the generator invests that payment to build new renewable energy facilities. The firm will then have access to all electricity generation from the new facilities for a long-term period. The firm may dynamically decide when to start the PPA on an ongoing basis, based on the evolving market conditions, and the transfer payment (amount of investment) is also specified by the firm. The firm's objective is to maximize its long-term discounted benefit (total savings) from signing the PPA. We mathematically formulate the firm's decision problem as an optimal stopping problem and provide analytical solutions. We also provide insights on how the firm's optimal investment capacity, expected savings, and the expected total new generation change with respect to different problem parameters.
|
|
14:50-15:10, Paper FrB16.5 | Add to My Program |
An Error-Triggered Adaptive Model Reduction and Soil Moisture Estimation for Agro-Hydrological System |
|
Debnath, Sarupa | University of Alberta |
Sahoo, Soumya | University of Alberta |
Agyeman, Bernard | University of Alberta |
Yin, Xunyuan | Nanyang Technological University |
Liu, Jinfeng | University of Alberta |
Keywords: Emerging control applications, Reduced order modeling, Estimation
Abstract: Implementing a closed-loop irrigation system requires an accurate estimation of soil moisture given a limited number of sensors. However, the main challenge lies in the potential high dimensionality (10^{4}-10^{8}) of the agro-hydrological model used for water dynamics, which is modeled based on the nonlinear Richards equation. To address this challenge, we propose a state estimation method for large-scale agricultural fields. Our approach uses an error-triggered adaptive model reduction based on a trajectory-based clustering technique. An adaptive extended Kalman filter (EKF) is designed accordingly based on the adaptively reduced model. We apply our approach to an actual agricultural field and conduct extensive simulations to demonstrate its effectiveness and applicability.
|
|
15:10-15:30, Paper FrB16.6 | Add to My Program |
Who Gets the Whip? How Supplier Diversification Influences Bullwhip Effect in a Supply Chain |
|
Burrows, Tyler | Brigham Young University |
Hamilton, Maxwell | Brigham Young University |
Grimsman, David | Brigham Young University |
Keywords: Modeling, Emerging control applications, Network analysis and control
Abstract: To navigate the evolving terrain of Supply Chains (SC), firms require new tools with broader applicability. Cur- rent research ignores the forest in favor of trees, with focal firms and serial networks assumed. This paper explicates a novel and scalable model for SC study at a broad level. We utilize the core of the model to observe the effect of structure and policy on demand disturbance in a SC as a whole. We find that complex structure alone does not effect change in disturbances; dynamic policy is necessary and sufficient for amplification. The model and simulations build to our main result: diversifying a firm’s supplier-base can amplify disturbances more quickly.
|
|
FrB17 Regular Session, Peony Junior 4511 |
Add to My Program |
Statistical Learning II |
|
|
Chair: Qin, S. Joe | City University of Hong Kong |
Co-Chair: Farokhi, Farhad | The University of Melbourne |
|
13:30-13:50, Paper FrB17.1 | Add to My Program |
Distributionally-Robust Optimization with Noisy Data for Discrete Uncertainties Using Total Variation Distance |
|
Farokhi, Farhad | The University of Melbourne |
Keywords: Statistical learning, Optimization, Uncertain systems
Abstract: Stochastic programs, where uncertainty distribution must be inferred from noisy data samples, are considered. They are approximated with distributionally-robust optimizations that minimize the worst-case expected cost over ambiguity sets, i.e., sets of distributions that are sufficiently compatible with observed data. The ambiguity sets capture probability distributions whose convolution with the noise distribution is within a ball centered at the empirical noisy distribution of data samples parameterized by total variation distance. Using the prescribed ambiguity set, the solutions of the distributionally-robust optimizations converge to the solutions of the original stochastic programs when the number of the data samples grow to infinity. Therefore, the proposed distributionally-robust optimization problems are asymptotically consistent. The distributionally-robust optimization problems can be cast as tractable optimization problems.
|
|
13:50-14:10, Paper FrB17.2 | Add to My Program |
Robust Independence Tests with Finite Sample Guarantees for Synchronous Stochastic Linear Systems |
|
Tamás, Ambrus | SZTAKI |
Bálint, Dániel Ágoston | Independent Researcher |
Csáji, Balázs Cs. | SZTAKI |
Keywords: Statistical learning, Stochastic systems, Identification
Abstract: The paper introduces robust independence tests with non-asymptotically guaranteed significance levels for synchronous stochastic linear time-invariant systems, assuming that the observed outputs are simultaneous, which means that the systems are driven by jointly i.i.d. noises. Our method provides bounds for the type I error probabilities that are distribution-free, i.e., the innovations can have arbitrary distributions. The algorithm combines confidence region estimates with permutation tests and general dependence measures, such as the Hilbert–Schmidt independence criterion and the distance covariance, to detect any nonlinear dependence between the observed systems. We also prove the consistency of our hypothesis tests under mild assumptions and demonstrate the ideas through the example of autoregressive systems.
|
|
14:10-14:30, Paper FrB17.3 | Add to My Program |
Towards Dynamic Causal Discovery with Rare Events: A Nonparametric Conditional Independence Test |
|
Chiu, Chih-Yuan | University of California, Berkeley |
Kulkarni, Kshitij | University of California, Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Statistical learning, Sampled-data control, Machine learning
Abstract: Causal phenomena associated with rare events occur across a wide range of engineering problems, such as risk-sensitive safety analysis, accident analysis and prevention, and extreme value theory. However, current methods for causal discovery are often unable to uncover causal links, between random variables in a dynamic setting, that manifest only when the variables first experience low-probability realizations. To address this issue, we introduce a novel statistical independence test on data collected from time-invariant dynamical systems in which rare but consequential events occur. In particular, we exploit the time-invariance of the underlying data to construct a superimposed dataset of the system state before rare events happen at different timesteps. We then design a conditional independence test on the reorganized data. We provide sample complexity bounds for the consistency of our method, and validate its performance across various simulated and real-world datasets, including incident data collected from the Caltrans Performance Measurement System (PeMS). Code containing the datasets and experiments is publicly available.
|
|
14:30-14:50, Paper FrB17.4 | Add to My Program |
Distributionally Robust Stability of Payoff Allocations in Stochastic Coalitional Games |
|
Pantazis, George | TU Delft |
Franci, Barbara | Maastricht University |
Grammatico, Sergio | Delft University of Technology |
Margellos, Kostas | University of Oxford |
Keywords: Statistical learning, Uncertain systems, Game theory
Abstract: We consider multi-agent coalitional games with uncertainty in the coalitional values. We provide a novel methodology to study the stability of the grand coalition in the case where each coalition constructs ambiguity sets for the (possibly) unknown probability distribution of the uncertainty. As a less conservative solution concept compared to worst-case approaches for coalitional stability, we consider a stochastic version of the so-called core set, i.e., the expected value core. Unfortunately, without exact knowledge of the probability distribution, the evaluation of the expected value core is an extremely challenging task. Hence, we propose the concept of distributionaly robust (DR) core. Leveraging tools from DR optimization under the Wasserstein distance, we provide finite-sample guarantees that any allocation which lies in the DR core is also stable with respect to the true probability distribution and show the asymptotic consistency of the DR core. We dedicate the last section to the computational tractability of finding an allocation in the DR core.
|
|
14:50-15:10, Paper FrB17.5 | Add to My Program |
Probabilistic Reduced-Dimensional Vector Autoregressive Modeling for Dynamics Prediction and Reconstruction with Oblique Projections |
|
Mo, Yanfang | City University of Hong Kong |
Yu, Jiaxin | CityU of Hong Kong |
Qin, S. Joe | Lingnan University |
Keywords: Process Control, Estimation, Statistical learning
Abstract: In this paper, we propose a probabilistic reduced-dimensional vector autoregressive (PredVAR) model with oblique projections. This model partitions the measurement space into a dynamic subspace and a static subspace that do not need to be orthogonal. The partition allows us to extract dynamic latent variables (DLVs) from high-dimensional data with maximized predictability. We develop an alternating iterative PredVAR algorithm using expectation maximization (EM) that exploits the interaction between updating the latent VAR dynamics and estimating the oblique projection. In addition, the noise covariance matrices are estimated as a natural outcome of the EM method. A simulation case study of the nonlinear Lorenz oscillation system illustrates the advantages of the proposed approach over two alternatives.
|
|
15:10-15:30, Paper FrB17.6 | Add to My Program |
SIS Epidemic Propagation under Strategic Non-Myopic Protection: A Dynamic Population Game Approach |
|
Maitra, Urmee | Indian Institute of Technology, Kharagpur |
Hota, Ashish Ranjan | Indian Institute of Technology (IIT), Kharagpur |
Srivastava, Vaibhav | Michigan State University |
Keywords: Game theory, Stochastic systems
Abstract: We consider a dynamic game setting in which a large population of strategic individuals decides whether to adopt protective measures to protect themselves against an infectious disease, specifically the susceptible-infected-susceptible (SIS) epidemic. Protection is costly and partially effective, and adopting protection reduces the probability of becoming infected for susceptible individuals and the probability of transmitting the infection for infected individuals. In a departure from most prior works that assume the decision-makers to be myopic, we model individuals who choose their actions to maximize the infinite horizon discounted expected reward. We define the notion of best response and stationary Nash equilibrium in this class of games, and completely characterize the equilibrium policy and stationary state distribution for different parameter regimes. Numerical results illustrate the evolution and convergence of the infected proportion and the policy of protection adoption to the equilibrium.
|
|
FrB18 Regular Session, Peony Junior 4412 |
Add to My Program |
Stability of Nonlinear Systems IV |
|
|
Chair: Martins, Nuno C. | University of Maryland |
Co-Chair: Murguia, Carlos | Eindhoven University of Technology |
|
13:30-13:50, Paper FrB18.1 | Add to My Program |
A Submodular Energy Function Approach to Controlled Islanding with Provable Stability |
|
Cheng, Shiyu | Washington University in St. Louis |
Niu, Luyao | University of Washington |
Clark, Andrew | Washington University in St. Louis |
Poovendran, Radha | University of Washington |
Keywords: Stability of nonlinear systems, Optimization, Control of networks
Abstract: Cascading failures occur when failures of one or more nodes in a network lead to failures in neighboring nodes that propagate through the remainder of the network. One approach to mitigate cascading failures is through controlled islanding, in which a subset of edges is deliberately removed in order to partition the network into disjoint and stable islands. In this paper, we propose a submodular optimization algorithm for selecting edges to remove in order to create islands with provable stability. In contrast to existing approaches that optimize over stability-related metrics such as network coherence, our approach maps standard Lyapunov stability conditions to the objective function of an optimization problem. We prove that this optimization problem is equivalent to minimizing a supermodular function subject to a matroid basis constraint. We propose a local search algorithm for selecting the islands with provable optimality bounds, and discuss special cases including signed linear consensus and nonlinear synchronization dynamics. We simulate our approach using linear consensus dynamics with negative edges and find that our proposed algorithms partition the network into a stable island and an unstable island.
|
|
13:50-14:10, Paper FrB18.2 | Add to My Program |
Homogeneous Finite/Fixed-Time Stabilization with Quantization |
|
Zhou, Yu | INRIA |
Polyakov, Andrey | Inria, Univ. Lille |
Zheng, Gang | INRIA |
Ping, Xubin | Xidian University |
Keywords: Stability of nonlinear systems, Quantized systems, Control over communications
Abstract: This paper investigates the stability of a homogeneous control system that uses quantized states. A sufficient condition for globally asymptotic (finite/nearly fixed-time) stability is derived. The stability under both uniform and logarithmic quantizers is investigated. It is demonstrated that a homogeneous feedback stabilization system with a uniform quantizer can achieve only practical stability. However, in the case of a logarithmic quantizer, the system can achieve global (finite-time, nearly fixed-time, exponential) stability, provided that the quantization density is sufficiently high. Additionally, homogeneous stabilizer design for a linear plant with quantized measurements is presented. Theoretical results are supported by numerical simulations.
|
|
14:10-14:30, Paper FrB18.3 | Add to My Program |
Secondary Controller Design for the Safety of Nonlinear Systems Via Sum-Of-Squares Programming |
|
Lin, Yankai | Eindhoven University of Tecnology |
Chong, Michelle | Eindhoven University of Technology |
Murguia, Carlos | Eindhoven University of Technology |
Keywords: Stability of nonlinear systems, Resilient Control Systems, Numerical algorithms
Abstract: We consider the problem of ensuring the safety of nonlinear control systems under adversarial signals. Using Lyapunov-based reachability analysis, we first give sufficient conditions to assess safety, i.e., to guarantee that the states of the control system, when starting from a given initial set, always remain in a prescribed safe set. We consider polynomial systems with semi-algebraic safe sets. Using the S-procedure for polynomial functions, safety conditions can be formulated as a Sum-Of-Squares (SOS) programme, which can be solved efficiently. When safety cannot be guaranteed, we provide tools via SOS to synthesize polynomial controllers that enforce safety of the closed-loop system. The theoretical results are illustrated through numerical simulations.
|
|
14:30-14:50, Paper FrB18.4 | Add to My Program |
Do 0-GAS-Guaranteeing Impulse Sequences Preserve ISS or iISS Properties? Not Always |
|
Russo, Antonio | Università Degli Studi Della Campania Luigi Vanvitelli |
Liu, Shenyu | Beijing Institute of Technology |
Keywords: Stability of hybrid systems, Lyapunov methods, Stability of nonlinear systems
Abstract: Based on the recent work on determination of classes of impulse/switching signals, uniformly over which a switched or impulsive system is input-to-state stable (ISS) or integral input-to-state stable (iISS), it is conjectured that for a switched system with all ISS or iISS subsystems, as long as the impulse/switching signal makes the system 0-input globally asymptotically stable (0-GAS), then it also ensures ISS and iISS, respectively. This work disproves this conjecture by showing examples where the impulse sequence amplifies the input, causing 0-GAS impulsive system to lose the ISS or iISS properties. As a compensation, we provide sufficient conditions on the impulse sequences which guarantee ISS or iISS. It turns out that these conditions are strictly stronger than those ensuring 0-GAS; the differences between them are in fact necessary conditions for the impulsive system to have the asymptotic gain (AG) property or uniformly bounded energy bounded state (UBEBS) property.
|
|
14:50-15:10, Paper FrB18.5 | Add to My Program |
Counter-Adversarial Learning with Inverse Unscented Kalman Filter |
|
Singh, Himali | Indian Institute of Technology Delhi |
Mishra, Kumar Vijay | United States Army Research Laboratory |
Chattopadhyay, Arpan | Indian Institute of Technology, Delhi |
Keywords: Kalman filtering, Stability of nonlinear systems, Agents-based systems
Abstract: In counter-adversarial systems, to infer the strategy of an intelligent adversarial agent, the defender agent needs to cognitively sense the information that the adversary has gathered about the latter. Prior works on the problem employ linear Gaussian state-space models and solve this inverse cognition problem by designing inverse stochastic filters. However, in practice, counter-adversarial systems are generally highly nonlinear. In this paper, we address this scenario by formulating inverse cognition as a nonlinear Gaussian state-space model, wherein the adversary employs an unscented Kalman filter (UKF) to estimate the defender's state with reduced linearization errors. To estimate the adversary's estimate of the defender, we propose and develop an inverse UKF (IUKF) system. We then derive theoretical guarantees for the stochastic stability of IUKF in the mean-squared boundedness sense. Numerical experiments for multiple practical applications show that the estimation error of IUKF converges and closely follows the recursive Cramér-Rao lower bound.
|
|
15:10-15:30, Paper FrB18.6 | Add to My Program |
Epidemic Population Games for Policy Design: Two Populations with Viral Reservoir Case Study (I) |
|
Certorio, Jair | University of Maryland |
La, Richard J. | University of Maryland, College Park |
Martins, Nuno C. | University of Maryland |
Keywords: Compartmental and Positive systems, Game theory, Stability of nonlinear systems
Abstract: We extend to two populations a recently proposed system theoretic framework for studying an epidemic influenced by the strategic behavior of a single population's agents. Our framework couples the well-known susceptible-infected-susceptible (SIS) epidemic model with a population game that captures the strategic interactions among the agents of two large populations. This framework can also be employed to study a situation where a population of nonstrategic agents (such as animals) serves as a disease reservoir. Equipped with the framework, we investigate the problem of designing a suitable control policy that assigns dynamic payoffs to incentivize the agents to adopt costlier and more effective mitigating strategies subject to a long-term budget constraint. We formulate a non-convex constrained optimization program for minimizing the disease transmission rate at an endemic equilibrium, and explain how to obtain an approximate solution efficiently. A solution to the optimization problem is an aggregate strategy distribution for the population game which minimizes the basic reproduction number at the corresponding endemic equilibrium, thus minimizing the transmission of the disease. We then propose a dynamic payoff mechanism and use a Lyapunov function to prove the convergence of i) the strategy distribution, ii) infection level, and iii) the dynamic payoff. Such that the strategy choices of the population converge to an (approximate) solution to the optimization problem, leading the level of infected to converge to the endemic equilibrium associated with the solution of the optimization.
|
|
FrB19 Regular Session, Peony Junior 4411 |
Add to My Program |
Filtering |
|
|
Chair: Mahony, Robert | Australian National University, |
Co-Chair: Tanwani, Aneel | Laas -- Cnrs |
|
13:30-13:50, Paper FrB19.1 | Add to My Program |
Sequential Learning from Noisy Data: Data-Assimilation Meets the Echo-State Network |
|
Goswami, Debdipta | Ohio State University |
Keywords: Filtering, Neural networks, Kalman filtering
Abstract: This paper explores the problem of training a recurrent neural network from noisy data. While neural network based dynamic predictors perform well with noise-free training data, prediction with noisy inputs during training phase poses a significant challenge. Here a sequential training algorithm is developed for an echo-state network (ESN) by incorporating noisy observations using an ensemble Kalman filter. The resultant Kalman-trained echo-state network (KalT-ESN) outperforms the traditionally trained ESN with least square algorithm while still being computationally cheap. The proposed method is demonstrated on noisy observations from three systems: two synthetic datasets from chaotic dynamical systems and a set of real-time traffic data.
|
|
13:50-14:10, Paper FrB19.2 | Add to My Program |
H_infty Filter Based Functional Observers for Descriptor Systems |
|
Tunga, Pabitra Kumar | Indian Institute of Technology Patna |
Tomar, Nutan Kumar | Indian Institute of Technology Patna |
Keywords: Filtering, Observers for Linear systems, Differential-algebraic systems
Abstract: This paper considers the H_infty observers design problem for linear time-invariant descriptor systems. A sufficient condition is established for functional observers of order equal to the dimension of the vector to be estimated. This sufficient condition is milder than the other existing conditions in the literature. Furthermore, the observers are of the state space form, and the parameter matrices' existence is proved via elementary matrix theory. It is shown that the observer parameter matrices exist if a matrix equation is solvable. The solution of this matrix equation is not unique, and this non-uniqueness is utilized to meet other specifications of the observer via the solution theory of linear matrix inequalities (LMIs). The theoretical findings are illustrated by designing a functional observer for an electric circuit.
|
|
14:10-14:30, Paper FrB19.3 | Add to My Program |
A Note on the Extended Kalman Filter on a Manifold |
|
Ge, Yixiao | Australian National University |
van Goor, Pieter | Australian National University |
Mahony, Robert | Australian National University, |
Keywords: Filtering, Observers for nonlinear systems, Algebraic/geometric methods
Abstract: The kinematics of many control systems, especially in the robotics field, naturally live on smooth manifolds. Most classical state-estimation algorithms, including the extended Kalman filter, are posed on Euclidean space. Although any filter algorithm can be adapted to a manifold setting by implementing it in local coordinates and ignoring the geometric structure, it has always been clear that there would be advantages in taking the geometric structure into consideration in developing the algorithm. In this paper, we argue that the minimum geometric structure required to adapt the extended Kalman filter to a manifold is that of an affine connection. With this structure, we show that a naive coordinate implementation of the EKF fails to account for geometry of the manifold in the update step and in the reset step. We provide geometric modifications to the classical EKF based on parallel transport of the measurement covariance (for the update) and a-posteriori state covariance (for the reset) that address these limitations. Preliminary results for attitude estimation with two directional measurements demonstrate that the proposed modifications significantly improve the transient behavior of the filter.
|
|
14:30-14:50, Paper FrB19.4 | Add to My Program |
Transport Inspired Particle Filters with Poisson-Sampled Observations in Gaussian Setting |
|
Yufereva, Olga | LAAS CNRS |
Tanwani, Aneel | Laas -- Cnrs |
Keywords: Filtering, Sampled-data control, Randomized algorithms
Abstract: Motivated by the need for developing computationally efficient solutions to filtering problem with limited information, this article develops particle filtering algorithms for continuous-time stochastic processes with time-sampled observation process. The state process is modeled by a continuous-time linear stochastic differential equation driven by Wiener process, and the observation process is a linear mapping of the state with additive Gaussian noise. For practical reasons, we assume that the observations are time-sampled and the underlying sampling process is a Poisson counter. With the aim of developing particle filters for this system, we first propose a mean-field type process which is an observation-driven stochastic differential equation such that the conditional distribution of this process given the observations coincides with the optimal filtering distribution. This model is then used to simulate a collection of particles which are driven only by the sample mean and sample covariance, without simulating the differential equation for the covariance matrix. It is shown that the dynamics of the sample mean and the sample covariance coincide with the optimal ones. An academic example is included for illustration.
|
|
14:50-15:10, Paper FrB19.5 | Add to My Program |
Fusion of Distance Measurements between Agents with Unknown Correlations |
|
Cros, Colin | Université Grenoble Alpes |
Amblard, Pierre-Olivier | CNRS |
Prieur, Christophe | CNRS |
Da Rocha, Jean-François | Telespazio FRANCE |
Keywords: Filtering, Sensor networks, Cooperative control
Abstract: Cooperative localization is a promising solution to improve the accuracy and overcome the shortcomings of GNSS. Cooperation is often achieved by measuring the distance between users. To optimally integrate a distance measurement between two users into a navigation filter, the correlation between the errors of their estimates must be known. Unfortunately, in large scale networks the agents cannot compute these correlations and must use consistent filters. A consistent filter provides an upper bound on the covariance of the error of the estimator taking into account all the possible correlations. In this paper, a consistent linear filter for integrating a distance measurement is derived using Split Covariance Intersection. Its analysis shows that a distance measurement between two agents can only benefit one of them, i.e., only one of the two can use the distance measurement to improve its estimator. Furthermore, in some cases, none can. A necessary condition for an agent to benefit from the measurement is given for a general class of objective functions. When the objective function is the trace or the determinant, necessary and sufficient conditions are given.
|
|
15:10-15:30, Paper FrB19.6 | Add to My Program |
Metrics for Bayesian Optimal Experiment Design under Model Misspecifications |
|
Catanach, Tommie | Sandia National Laboratories |
Das, Niladri | Sandia National Laboratories |
Keywords: Kalman filtering, Filtering, Uncertain systems
Abstract: The conventional approach to Bayesian decision-theoretic experiment design involves searching over possible experiments to select a design that maximizes the expected value of a specified utility function. The expectation is over the joint distribution of all unknown variables implied by the statistical model that will be used to analyze the collected data. Utility functions define experiments’ objectives; a common utility function is information gain. This article introduces an expanded framework for experimental design, where we go beyond the traditional Expected Information Gain criteria. We introduce Expected General Information Gain which measures robustness to the model discrepancy, and Expected Discriminatory Information to quantify how well an experiment can detect model discrepancy. The functionality of the framework is showcased through its application to a scenario involving a linearized spring mass damper system and an F-16 model where the model discrepancy is taken into account while doing Bayesian optimal experiment design.
|
|
FrB20 Invited Session, Orchid Junior 4312 |
Add to My Program |
Mechatronics I |
|
|
Chair: Boker, Almuatazbellah | Virginia Tech |
Co-Chair: Al Janaideh, Mohammad | University of Guelph |
Organizer: Flores, Gerardo | Center for Research in Optics |
Organizer: Al Janaideh, Mohammad | University of Guelph |
Organizer: Rakotondrabe, Micky | ENIT Tarbes, INPT, University of Toulouse |
Organizer: Heertjes, Marcel | Eindhoven University of Technology |
Organizer: Boker, Almuatazbellah | Virginia Tech |
|
13:30-13:50, Paper FrB20.1 | Add to My Program |
Data-Driven Event-Triggered Bipartite Consensus for Multi-Agent Systems Preventing DoS Attacks |
|
Zhao, Huarong | JiangNan University |
Shan, Jinjun | York University |
Peng, Li | Jiangnan University |
Yu, Hongnian | Edinburgh Napier University |
Keywords: Agents-based systems, Cooperative control, Networked control systems
Abstract: This paper considers event-triggered bipartite consensus issues for discrete-time nonlinear networked multi-agent systems with antagonistic interactions and denial-of-service (DoS) attacks. Firstly, a pseudo partial derivative technology is applied to obtain an equivalent dynamic linearization model of the controlled system. A signed graph theory is employed to analyze the coopetition relationships among agents. Then, a distributed combined measurement error function is formulated to transform the bipartite consensus issue into a consensus issue. Meanwhile, an output predictive compensation scheme is proposed to offset the influence of DoS attacks. Furthermore, a dead-zone operator is designed to improve the flexibility of the proposed event-triggered mechanism. Subsequently, a data-driven event-triggered resilient bipartite consensus scheme is formulated. Then, the convergence of the proposed method is strictly proved by using the Lyapunov theory and the contraction mapping principle, which indicates that the bipartite consensus error will be cut to a small region around zero. Finally, hardware tasks are conducted to verify the effectiveness of the proposed method.
|
|
13:50-14:10, Paper FrB20.2 | Add to My Program |
Classical Bouc-Wen Hysteresis Modeling and Force Control of a Piezoelectric Robotic Hand Manipulating a Deformable Object |
|
Flores, Gerardo | Center for Research in Optics |
Rakotondrabe, Micky | ENIT Tarbes, INPT, University of Toulouse |
Keywords: Mechatronics, Robotics, Nonlinear output feedback
Abstract: This letter focuses on the modeling and control of a piezoelectric actuator that is designed to manipulate objects. This research considers both the non-linearity caused by the hysteresis of the actuator and the deformation of the object being manipulated. To approximate the hysteresis, a classical Bouc-Wen model is used. To stabilize the force-tracking error, we propose a novel control approach combining three advanced methodologies: an output-feedback method based on a nonlinear observer, a Barrier-Lyapunov function design, and bounded control based on saturation functions. Combining these three powerful techniques produces a bounded and highly robust controller that can effectively reject aggressive disturbances while maintaining the tracking error inside a predefined set. Under such a scenario, it is demonstrated that the equilibrium point of the closed-loop system is asymptotically stable. The effectiveness of the proposed control method is validated through extensive numerical simulations.
|
|
14:10-14:30, Paper FrB20.3 | Add to My Program |
Control Framework for a UAV Slung-Payload Transportation System |
|
Kang, Junjie | York University |
Shan, Jinjun | York University |
Alkomy, Hassan | York University |
Keywords: Autonomous vehicles, Hierarchical control, Stability of nonlinear systems
Abstract: This paper presents a control framework for transporting a UAV slung-payload, which can asymptotically stabilize not only the UAV but also the tether swing angles. By separating the system into two subsystems, the cascade control methodology is used to design the framework, which includes two sufficient conditions and suits for a large class of existing controllers. The control framework is strictly proved with the boundness of all states such that it can guarantee the asymptotic stability of the closed-loop overall system over the entire configuration space. Then, this framework is applied to a UAV trajectory tracking control problem with stability analysis. Samples of controllers are presented in the type of saturation or dynamic feedback. Finally, numerical and experimental validations are carried out on a UAV slung-payload transportation.
|
|
14:30-14:50, Paper FrB20.4 | Add to My Program |
Fourth-Order Reference Trajectories in Lithography Stages with Weakly-Damped Modes - a Frequency-Domain Perspective (I) |
|
Heertjes, Marcel | Eindhoven University of Technology |
Zenteno Torres, Jazmin | Universite De Bordeaux |
Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics, Linear systems, Control applications
Abstract: The tracking accuracy in motion control systems like the moving stages in lithography machines, e.g., wafer scanners or metrology inspection tools, is partly determined by how the frequency content of its reference trajectories is transferred to the closed loop tracking error. In this regard, fourth-order reference trajectories for point-to-point motion will be studied from a frequency-domain perspective. By appropriately pairing the maximum snap and maximum jerk values, weakly-damped modes in the closed-loop response can be robustly dealt with without introducing a penalty on throughput.
|
|
14:50-15:10, Paper FrB20.5 | Add to My Program |
Singular Perturbation-Based Approach for Robust Control of Reluctance-Actuated Motion Systems (I) |
|
Al Saaideh, Mohammad | Memorial University of Newfoundland |
Boker, Almuatazbellah | Virginia Tech |
Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics
Abstract: This paper introduces a new composite control approach for a motion system driven by a reluctance actuator. To achieve high-performance control, the method leverages singular perturbation theory to divide the problem into two components: a fast control problem and a slow control problem. A feedforward controller is developed based on the reduced system to address the fast control problem. The dynamic model is then formulated using the feedforward control law to address the slow control problem. Full-state feedback control chooses the input signal that results in the desired reference signal. The output signal of the feedback control is treated as the desired input for the feedforward controller. With the proposed approach, the feedforward controller for the fast dynamic eliminates the need for measuring the fast states. The effectiveness of the proposed approach is demonstrated through experimental testing.
|
|
15:10-15:30, Paper FrB20.6 | Add to My Program |
Data-Driven Bayesian Control of Port-Hamiltonian Systems (I) |
|
Beckers, Thomas | Vanderbilt University |
Keywords: Mechatronics, Machine learning, Uncertain systems
Abstract: Port-Hamiltonian theory is an established way to describe nonlinear physical systems widely used in various fields such as robotics, energy management, and mechanical engineering. This has led to considerable research interest in the control of Port-Hamiltonian systems, resulting in numerous model-based control techniques. However, the performance and stability of the closed-loop typically depend on the quality of the PH model, which is often difficult to obtain using first principles. We propose a Gaussian Processes (GP) based control approach for Port-Hamiltonian systems (GPC-PHS) by leveraging gathered data. The Bayesian characteristics of GPs enable the creation of a distribution encompassing all potential Hamiltonians instead of providing a singular point estimate. Using this uncertainty quantification, the proposed approach takes advantage of passivity-based robust control with interconnection and damping assignment to establish probabilistic stability guarantees.
|
|
FrB21 Regular Session, Orchid Junior 4311 |
Add to My Program |
Transportation Networks |
|
|
Chair: Chen, Minghua | City University of Hong Kong |
Co-Chair: Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
|
13:30-13:50, Paper FrB21.1 | Add to My Program |
Arc-Based Traffic Assignment: Equilibrium Characterization and Learning |
|
Chiu, Chih-Yuan | University of California, Berkeley |
Maheshwari, Chinmay | University of California Berkeley |
Su, Pan-Yang | University of California, Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Transportation networks, Game theory, Adaptive systems
Abstract: Arc-based traffic assignment models (TAMs) are a popular framework for modeling traffic network congestion generated by self-interested travelers who sequentially select arcs based on perceived arc latencies. However, existing arc-based TAMs either assign travelers to cyclic paths, or do not extend to networks with bi-directional edges between nodes. To overcome these difficulties, we propose a new modeling framework for stochastic arc-based TAMs. Given a traffic network with bidirectional edges, we replicate its arcs and nodes to construct a directed acyclic graph (DAG), which we call the Condensed DAG (CoDAG) representation of the original network. Self-interested travelers are then modeled as sequentially selecting arcs on the CoDAG. We show that the associated equilibrium flow, which we call the Condensed DAG equilibrium, exists, is unique, and can be characterized as the solution to a strictly convex optimization problem. Moreover, we propose an adaptive learning rule which ensures that self-interested travelers, who repeatedly update their arc selection rules, will converge to a neighborhood of the Condensed DAG equilibrium. To the best of our knowledge, this is the first learning scheme that provably converges to an arc-based traffic equilibrium. Finally, we present numerical results on simulated traffic networks that corroborate our theoretical results.
|
|
13:50-14:10, Paper FrB21.2 | Add to My Program |
Distributed Leader-Followers Constrained Platooning Control of Linear Homogeneous Vehicles |
|
Gaagai, Ramzi | Helmut-Schmidt-Universität - Universität Der Bundeswehr Hamburg |
Giaccagli, Mattia | Tel Aviv University |
Horn, Joachim | Helmut-Schmidt-University / University of the Federal Armed Forc |
Keywords: Transportation networks, Networked control systems, Linear systems
Abstract: This paper presents a feedback-distributed controller that guarantees exponential synchronization of a vehicle platoon described by linear dynamics. To achieve desired intervehicle spacing between two vehicles and to improve platoon cohesiveness, a leader-based bidirectional communication scheme is employed. To take into account the constraints of the system and to avoid collision between vehicles, safety features are implemented using control barrier functions via linear quadratic programming. String stability properties of the platoon are analyzed, where we show via simulations that the proposed design allows considering a gap-spacing policy for inter-vehicle distance with zero headway. To conclude, the effectiveness of the proposed controller is verified in a simulation study.
|
|
14:10-14:30, Paper FrB21.3 | Add to My Program |
Stability Analysis for a Platoon of Vehicles with Reaction-Time Delay |
|
Mousavi, Shima Sadat | ETH Zurich |
Bahrami, Somayeh | Razi University |
Kouvelas, Anastasios | ETH Zurich |
Keywords: Transportation networks, Nonlinear systems, Delay systems
Abstract: This paper explores collective dynamics in a group of multiple homogeneous vehicles on a ring-road using the Optimal Velocity Model (OVM). To account for real-world traffic scenarios, driver reaction time is incorporated as a time-delay system. Analyzing stability, we initially focus on the equilibrium motion regime, where vehicles maintain uniform speed and spacing. This is achieved through linearization and examining the impact of model parameters. Subsequently, we extend our analysis to the full nonlinear model, determining the equilibrium's region of attraction by solving Linear Matrix Inequalities (LMIs) and estimating ellipsoids.
|
|
14:30-14:50, Paper FrB21.4 | Add to My Program |
Minimizing Carbon Footprint for Timely E-Truck Transportation: Hardness and Approximation Algorithm (I) |
|
Su, Junyan | City University of Hong Kong |
Lin, Qiulin | City University of Hong Kong |
Chen, Minghua | City University of Hong Kong |
Zeng, Haibo | Virginia Tech |
Keywords: Transportation networks
Abstract: Carbon footprint optimization (CFO) is important for sustainable heavy-duty e-truck transportation. We consider the CFO problem for timely transportation of e-trucks, where the truck travels from an origin to a destination across a national highway network subject to a deadline. The goal is to minimize the carbon footprint by orchestrating path planning, speed planning, and intermediary charging planning. We first show that it is NP-hard even just to find a feasible CFO solution. We then develop a (1+eps F, 1+eps b) bi-criteria approximation algorithm that achieves a carbon footprint within a ratio of 1+eps F to the minimum with no deadline violation and at most a ratio of 1+eps b battery capacity violation (for any positive eps F and eps b). Its time complexity is polynomial in the size of the highway network, 1/eps F, and 1/eps b. Such algorithmic results are among the best possible unless P=NP. Simulation results based on real-world traces show that our scheme reduces up to 11% carbon footprint as compared to baseline alternatives considering only energy consumption but not carbon footprint.
|
|
14:50-15:10, Paper FrB21.5 | Add to My Program |
Impact on Traffic of Delayed Information in Navigation Systems |
|
Toso, Tommaso | Université Grenoble Alpes, CNRS, Inria, Grenoble INP, GIPSA-Lab |
Kibangou, Alain | Univ. Grenoble Alpes |
Frasca, Paolo | CNRS, GIPSA-Lab, Univ. Grenoble Alpes |
Keywords: Transportation networks, Delay systems
Abstract: Nowadays many car drivers resort to navigation apps to decide which route to take. To be efficient, these applications increasingly use real-time data rather than historical data. However, delay is unavoidable since data collection, communication, and processing are necessary before their usage in the App. For this purpose, we introduce a macroscopic dynamic traffic assignment model to describe the behaviour of drivers in choosing which route to follow to reach their destination. We assume that a part of the drivers follows the directions of a navigation App, whose directions are based on delayed traffic data. Through the stability analysis of the model, we show and quantify the excessive level of delay in traffic data that can be detrimental to the efficiency of the network by being responsible for the appearance of oscillating trajectories and unsatisfied demand.
|
|
15:10-15:30, Paper FrB21.6 | Add to My Program |
Assessing the Impact of Non-Compliant Users Response to System-Optimal Dynamic Traffic Assignment (I) |
|
Siri, Enrico | Inria Sophia, Université Côte D'Azur |
Goatin, Paola | Inria |
Keywords: Transportation networks, Optimal control, Modeling
Abstract: In the present work, we address a pseudo - System Optimum Dynamic Traffic Assignment optimization problem on road networks relying on trajectory control over a portion of the flows and limited knowledge on user response. The fractions of controlled flow moving between each origin-destination couple are defined as compliant, while the remaining portions, consisting of users free to make their own individual choices, are defined as non-compliant. The objective is to globally improve the state of the network by controlling a varying sub-set of compliant traffic flows. A Godunov discretization of the Lighthill-Williams-Richards model coupled with a triangular fundamental diagram is employed as the flow dynamics model. At junctions, a multi-class solver is applied which requires a class-density-weighted aggregate distribution matrix and incoming links priorities. On one hand, the selfish response of non-compliant users to changing traffic conditions is computed at each time step by updating the class related turn ratios accordingly to a discrete-choice multinomial Logit model to represent users imperfect information. On the other hand, the control action is actuated by varying the flow rates over a pre-computed set of routes while the coupled optimization problem takes into account an a priori fixed distribution of users at the nodes. We show how the effectiveness of the resulting finite horizon optimal control problem degrades by not considering the dynamic response of non-compliant users and how it varies according to the fraction of compliant ones.
|
|
FrB22 Regular Session, Orchid Junior 4212 |
Add to My Program |
Markov Processes |
|
|
Chair: Meyn, Sean P. | Univ. of Florida |
Co-Chair: Mahajan, Aditya | McGill University |
|
13:30-13:50, Paper FrB22.1 | Add to My Program |
Optimizing Sensor Allocation against Attackers with Uncertain Intentions: A Worst-Case Regret Minimization Approach |
|
Ma, Haoxiang | University of Florida |
Han, Shuo | University of Illinois Chicago |
Kamhoua, Charles | U.S. Army Research Laboratory |
Fu, Jie | University of Florida |
Keywords: Markov processes, Game theory, Optimization
Abstract: This paper focuses on the optimal allocation of multi-stage attacks with the uncertainty in attacker's intention. We model the attack planning problem using a Markov decision process and characterize the uncertainty in the attacker's intention using a finite set of reward functions---each reward represents a type of attacker. Based on this modeling, we employ the paradigm of the worst-case absolute regret minimization from robust game theory and develop mixed-integer linear program (MILP) formulations for solving the worst-case regret minimizing sensor allocation strategies for two classes of attack-defend interactions: one where the defender and attacker engage in a zero-sum game and another where they engage in a non-zero-sum game. We demonstrate the effectiveness of our algorithm using a stochastic gridworld example.
|
|
13:50-14:10, Paper FrB22.2 | Add to My Program |
Distributed TD(0) with Almost No Communication |
|
Liu, Rui | Boston University |
Olshevsky, Alexander | Boston University |
|
14:10-14:30, Paper FrB22.3 | Add to My Program |
The Curse of Memory in Stochastic Approximation |
|
Lauand, Caio Kalil | University of Florida |
Meyn, Sean P. | Univ. of Florida |
Keywords: Randomized algorithms, Stochastic optimal control, Markov processes
Abstract: Theory and application of stochastic approximation (SA) has grown within the control systems community since the earliest days of adaptive control. This paper takes a new look at the topic, motivated by recent results establishing remarkable performance of SA with (sufficiently small) constant step-size alpha>0. If averaging is implemented to obtain the final parameter estimate, then the estimates are asymptotically unbiased with nearly optimal asymptotic covariance. These results have been obtained for random linear SA recursions with i.i.d. coefficients. This paper obtains very different conclusions in the more common case of geometrically ergodic Markovian disturbance: (i) The target bias is identified, even in the case of non-linear SA, and is in general non-zero. The remaining results are established for linear SA recursions: (ii) the bivariate parameter-disturbance process is geometrically ergodic in a topological sense; (iii) the representation for bias has a simpler form in this case, and cannot be expected to be zero if there is multiplicative noise; (iv) the asymptotic covariance of the averaged parameters is within O(alpha) of optimal. The error term is identified, and may be massive if mean dynamics are not well conditioned. The theory is illustrated with application to TD-learning.
|
|
14:30-14:50, Paper FrB22.4 | Add to My Program |
Asymmetric Actor-Critic with Approximate Information State |
|
Sinha, Amit | McGill University |
Mahajan, Aditya | McGill University |
Keywords: Markov processes, Stochastic optimal control, Learning
Abstract: Reinforcement learning (RL) for partially observable Markov decision processes (POMDPs) is a challenging problem because decisions need to be made based on the entire history of observations and actions. However, in several scenarios, state information is available during the training phase. We are interested in exploiting the availability of this state information during the training phase to efficiently learn a history-based policy using RL. Specifically, we consider actor-critic algorithms, where the actor uses only the history information but the critic uses both history and state. Such algorithms are called asymmetric actor-critic, to highlight the fact that the actor and critic have asymmetric information. Motivated by the recent success of using representation losses in RL for POMDPs [1], we derive similar theoretical results for the asymmetric actor-critic case and evaluate the effectiveness of adding such auxiliary losses in experiments. In particular, we learn a history representation---called an approximate information state (AIS)---and bound the performance loss when acting using AIS.
|
|
14:50-15:10, Paper FrB22.5 | Add to My Program |
Weighted-Norm Bounds on Model Approximation in MDPs with Unbounded Per-Step Cost |
|
Bozkurt, Berk | McGill University |
Mahajan, Aditya | McGill University |
Nayyar, Ashutosh | University of Southern California |
Ouyang, Yi | Preferred Networks |
Keywords: Markov processes, Stochastic optimal control, Stochastic systems
Abstract: We consider the problem of designing a control policy for an infinite-horizon discounted cost Markov Decision Process (MDP) mathcal{M} when we only have access to an approximate model hat{mathcal M}. If we design an optimal policy hat pi^star for the approximate model, how well does it perform when used in the true model mathcal M? We provide an answer to this question by bounding a weighted norm of the difference between the value function of hat pi^star when used in mathcal M and the optimal value function of mathcal M. The use of weighted norm allows us to obtain meaningful bounds for performance loss even when the per-step cost function is unbounded. This is in contrast to much of the prior literature which has largely focused only on the case of bounded per-step cost. We illustrate our results for two specific instances — a finite MDP model for an inventory control problem and the discounted linear quadratic regulator problem.
|
|
FrB23 Regular Session, Orchid Junior 4211 |
Add to My Program |
Formal Verification and Synthesis |
|
|
Chair: Zhang, Zengjie | Eindhoven University of Technology |
Co-Chair: Pola, Giordano | University of L'Aquila |
|
13:30-13:50, Paper FrB23.1 | Add to My Program |
Model Predictive Control for Signal Temporal Logic Specifications with Time Interval Decomposition |
|
Yu, Xinyi | University of Southern California |
Wang, Chuwei | Shanghai Jiao Tong University |
Yuan, Dingran | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Formal Verification/Synthesis, Hybrid systems, Predictive control for nonlinear systems
Abstract: In this paper, we investigate the problem of Model Predictive Control (MPC) of dynamic systems for high- level specifications described by Signal Temporal Logic (STL) formulae. Recent works show that MPC has the great potential in handling logical tasks in reactive environments. However, existing approaches suffer from the heavy computational bur- den, especially for tasks with large horizons. In this work, we propose a computationally more efficient MPC framework for STL tasks based on time interval decomposition. Specifically, we still use the standard shrink horizon MPC framework with Mixed Integer Linear Programming (MILP) techniques for open-loop optimization problems. However, instead of applying MPC directly for the entire task horizon, we decompose the STL formula into several sub-tasks with disjoint time horizons, and shrinking horizon MPC is applied for each short-horizon sub-task iteratively. To guarantee the satisfaction of the entire STL formula and to ensure the recursive feasibility of the iterative process, we introduce new terminal constraints to connect each sub-task. We show how these terminal constraints can be computed by an effective inner-approximation approach. The computational efficiency of our approach is illustrated by a case study.
|
|
13:50-14:10, Paper FrB23.2 | Add to My Program |
On Nash Equilibria for Decentralized Symbolic Control of Interconnected Finite State Systems |
|
Pola, Giordano | University of L'Aquila |
De Santis, Elena | University of L'Aquila |
Di Benedetto, Maria Domenica | University of L'Aquila |
Keywords: Formal Verification/Synthesis, Decentralized control, Game theory
Abstract: In this paper we consider a pair of interconnected nondeterministic and metric finite state systems and address a decentralized symbolic control problem where controllers are designed for enforcing local specifications expressed in terms of regular languages, up to a desired accuracy. The considered control architecture is decentralized, i.e. each controller can only communicate with the corresponding plant. Since plant systems are interconnected, the part of the specification that can be enforced on one system depends on the one on the other system. We show how this dependency can be nicely formalized in terms of equilibria and in particular, of Nash equilibria. When controlled plants are at a Nash equilibrium, deviation of each plant from its control strategy may correspond to a loss in terms of the part of specification enforced. Algorithms are proposed which converge, when an equilibrium exists, to Nash equilibria.
|
|
14:10-14:30, Paper FrB23.3 | Add to My Program |
Exact and Cost-Effective Automated Transformation of Neural Network Controllers to Decision Tree Controllers |
|
Chang, Kevin | University of Southern California |
Dahlin, Nathan | University of Illinois at Urbana-Champaign |
Jain, Rahul | University of Southern California |
Nuzzo, Pierluigi | University of Southern California |
Keywords: Formal Verification/Synthesis, Hybrid systems, Machine learning
Abstract: Over the past decade, neural network (NN)-based controllers have demonstrated remarkable efficacy in a variety of decision-making tasks. However, their black-box nature and the risk of unexpected behaviors and surprising results pose a challenge to their deployment in real-world systems with strong guarantees of correctness and safety. We address these limitations by investigating the transformation of NN-based controllers into equivalent soft decision tree (SDT)-based controllers and its impact on verifiability. Differently from previous approaches, we focus on discrete-output NN controllers including rectified linear unit (ReLU) activation functions as well as argmax operations. We then devise an exact but cost-effective transformation algorithm, in that it can automatically prune redundant branches. We evaluate our approach using two benchmarks from the OpenAI Gym environment. Our results indicate that the SDT transformation can benefit formal verification, showing runtime improvements of up to 21x and 2x for MountainCar-v0 and CartPole-v1, respectively.
|
|
14:30-14:50, Paper FrB23.4 | Add to My Program |
Modularized Control Synthesis for Complex Signal Temporal Logic Specifications |
|
Zhang, Zengjie | Eindhoven University of Technology |
Haesaert, Sofie | Eindhoven University of Technology |
Keywords: Formal Verification/Synthesis, Optimization, Computational methods
Abstract: The control synthesis of a dynamic system subject to a signal temporal logic (STL) specification is commonly formulated as a mixed-integer linear/convex programming (MILP/MICP) problem. Solving such a problem is computationally expensive when the specification is long and complex. In this paper, we propose a framework to transform a long and complex specification into separate forms in time, to be more specific, the logical combination of a series of short and simple subformulas with non-overlapping timing intervals. In this way, one can easily modularize the synthesis of a long specification by solving its short subformulas, which improves the efficiency of the control problem. We first propose a syntactic timing separation form for a type of complex specifications based on a group of separation principles. Then, we further propose a complete specification split form with subformulas completely separated in time. Based on this, we develop a modularized synthesis algorithm that ensures the soundness of the solution to the original synthesis problem. The efficacy of the methods is validated with a robot monitoring case study in simulation. Our work is promising to promote the efficiency of control synthesis for systems with complicated specifications.
|
|
14:50-15:10, Paper FrB23.5 | Add to My Program |
Computing Controlled Invariant Sets of Nonlinear Control-Affine Systems |
|
Brown, Scott | University of California, San Diego |
Khajenejad, Mohammad | University of California, San Diego |
Yong, Sze Zheng | Northeastern University |
Martinez, Sonia | University of California at San Diego |
Keywords: Formal Verification/Synthesis, Nonlinear systems, Computational methods
Abstract: In this paper, we consider the computation of controlled invariant sets (CIS) of discrete-time nonlinear control-affine systems. We propose an iterative refinement procedure based on polytopic inclusion functions, which is able to inner-approximate the maximal controlled invariant set to within a guaranteed robustness margin. In particular, this procedure allows us to guarantee the invariance of the resulting near-maximal CIS while also computing sets of control inputs which enforce the invariance. Further, we propose an alternative version of this procedure which refines the CIS by computing backward reachable sets of individual components of set unions, rather than all at once. This reduces the total number of inclusion checking operations required for convergence,especially when compared with existing methods. Finally, we compare our methods to a sampling based approach and demonstrate the improved accuracy and faster convergence.
|
|
15:10-15:30, Paper FrB23.6 | Add to My Program |
Optimal Path Planning with Opacity-Preserving Temporal Logic Specifications Using Bipartite Synthesizers (I) |
|
Zheng, Yiwei | Xiamen University |
Lai, Aiwen | Xiamen University |
Lan, Weiyao | Xiamen University |
Yu, Xiao | Xiamen University |
Keywords: Formal Verification/Synthesis, Discrete event systems, Robotics
Abstract: This paper investigates an optimal planning problem with the requirement of preventing high-level mission specifications from being revealed to the intruder. We assume that the behavior of the robotic system at some specific locations is partially observable, and the intruder is modeled as a passive observer for the observable behavior and the corresponding overall transition cost of the trajectories. We first use the transition system to model the robot system. Then, we say that the transition system is LTL-based opaque with respect to high-level mission specifications if the intruder cannot infer the exact behavior of the system via the observable sequence of the system. We design a synthesizer for the product automaton of the transition system and the Büchi automaton to find the evolutions of possible reachable states. Based on the synthesizer, the corresponding run can be synthesized in which the optimality, correctness, and opacity can be guaranteed.
|
|
FrB24 Regular Session, Orchid Main 4201AB |
Add to My Program |
Sliding-Mode Control |
|
|
Chair: Ferrara, Antonella | University of Pavia |
Co-Chair: Kim, Yoonsoo | Gyeongsang National University |
|
13:30-13:50, Paper FrB24.1 | Add to My Program |
Multivariable Disturbance Observer-Based Finite-Time Sliding Mode Attitude Control for Fixed-Wing UAVs under Matched and Mismatched Disturbances |
|
Nguyen, Ngo Phong | Ulsan National Institute of Science and Technology |
Oh, Hyondong | Ulsan National University of Science and Technology |
Moon, Jun | Hanyang University |
Kim, Yoonsoo | Gyeongsang National University |
Keywords: Variable-structure/sliding-mode control, Control applications
Abstract: In this letter, we propose a multivariable disturbance observer-based finite-time sliding mode attitude control (MDOB-FT-SM-AC) for fixed-wing UAVs in the presence of both matched and mismatched disturbances. Compared with existing sliding mode attitude controllers, the significant improvements of the proposed MDOB-FT-SM-AC are the multi- variable control structure, strong robustness, and high precision performance with continuous control input signal. In the proposed MDOB-FT-SM-AC, we first develop multivariable finite-time disturbance observers such that the precise estimation of both matched and mismatched disturbances is ensured. Next, a nonsingular terminal sliding manifold is designed such that the fixed-wing UAV is driven to track its desired attitude command in finite time. We finally present a multivariable super-twisting reaching law such that the finite-time convergence of the sliding variable and its derivative to zero is guaranteed. Attentive finite- time convergence analysis is derived based on the Lyapunov and homogeneity theories. Simulation results are given to illustrate the superiority of the proposed MDOB-FT-SM-AC.
|
|
13:50-14:10, Paper FrB24.2 | Add to My Program |
Exact Differentiator with Lipschitz Continuous Output and Optimal Worst-Case Accuracy under Bounded Noise |
|
Aldana-López, Rodrigo | Universidad De Zaragoza |
Seeber, Richard | Graz University of Technology |
Haimovich, Hernan | CONICET and UNR |
Gomez-Gutierrez, David | Intel Labs |
Keywords: Variable-structure/sliding-mode control, Estimation, Time-varying systems
Abstract: The online differentiation of a signal contaminated with bounded noise is addressed. A differentiator is developed that generates a Lipschitz continuous output, is exact in the absence of noise, and provides the optimal worst-case accuracy among all possible exact differentiators when noise is present. This combination of features is not shared by any previously existing differentiator. Tuning of the developed differentiator is very simple, requiring only the knowledge of a bound for the second-order derivative of the signal. The approach consists in regularizing the possibly highly noisy output of a recently introduced linear adaptive robust exact differentiator and feeding it to a first-order sliding-mode filter designed to maintain optimal accuracy. The proposed regularization and filtering of this output allows trading the speed with which exactness is obtained for the feature of a Lipschitz continuous, hence less noisy, output. An illustrative example is provided to highlight the features of the developed differentiator.
|
|
14:10-14:30, Paper FrB24.3 | Add to My Program |
A Passivity Based Integral Sliding Mode Controller for Mechanical Port-Hamiltonian Systems |
|
Baba, Takahiro | Kyoto University |
Fujimoto, Kenji | Kyoto University |
Maruta, Ichiro | Kyoto University |
Keywords: Variable-structure/sliding-mode control, Lyapunov methods, Stability of nonlinear systems
Abstract: This letter proposes a passivity based integral sliding mode controller for mechanical port-Hamiltonian systems. Recently, passivity based sliding mode control (PBSMC) has been proposed for mechanical and electro-mechanical systems. This method has properties of both sliding mode control (SMC) and passivity based control. However, the robustness of SMC is not guaranteed in the reaching phase and so is PBSMC. For this problem, integral sliding mode control (ISMC), which eliminates the reaching phase, has been proposed. This letter proposes unified control method of passivity based control and integral sliding mode control based on the idea of PBSMC. In order to achieve ISMC in the port-Hamiltonian form, an integral term of the sliding variable of PBSMC is firstly added to the system equation. Next, by adding appropriate potential function to the Hamiltonian function, the dynamics of ISMC can be obtained. Proposed method is more robust than PBSMC and ensures Lyapunov stability when the chattering phenomena is alleviated. The effectiveness of the proposed method is demonstrated by a numerical example.
|
|
14:30-14:50, Paper FrB24.4 | Add to My Program |
Design of a Deep Neural Network-Based Integral Sliding Mode Control for Nonlinear Systems under Fully Unknown Dynamics |
|
Vacchini, Edoardo | University of Pavia |
Sacchi, Nikolas | University of Pavia |
Incremona, Gian Paolo | Politecnico Di Milano |
Ferrara, Antonella | University of Pavia |
Keywords: Variable-structure/sliding-mode control, Neural networks, Uncertain systems
Abstract: In this paper a novel deep neural network based integral sliding mode (DNN-ISM) control is proposed for controlling perturbed systems with fully unknown dynamics. In particular, two DNNs with an arbitrary number of hidden layers are exploited to estimate the unknown drift term and the control effectiveness matrix of the system, which are instrumental to design the ISM controller. The DNNs weights are adjusted according to adaptation laws derived directly from Lyapunov stability analysis, and the proposal is satisfactorily assessed in simulation relying on benchmark examples.
|
|
14:50-15:10, Paper FrB24.5 | Add to My Program |
Structural Conditions for Chattering Avoidance in Implicitly Discretized Sliding Mode Differentiators |
|
Seeber, Richard | Graz University of Technology |
Koch, Stefan | Graz University of Technology |
Keywords: Variable-structure/sliding-mode control, Sampled-data control, Estimation
Abstract: The present paper considers the implicit discretization of Levant's arbitrary order robust exact differentiator. It is shown that an improper implicit discretization may lead to an undesired bias in the differentiation error and, surprisingly, to discretization chattering despite the implicit discretization. Necessary and sufficient structural conditions for avoiding both of these problems are presented, which define a family of chattering-free discrete-time differentiators. A guideline for selecting a representative from this family is given. Numerical simulations illustrate the results.
|
|
15:10-15:30, Paper FrB24.6 | Add to My Program |
A Robust Interval MPC for Uncertain LPV Systems Via Integral Sliding–Mode Control |
|
Gutierrez, Susana | FIME-UANL |
Ríos, Héctor | Tecnológico Nacional De México/I.T. La Laguna |
Mera, Manuel | Esime Upt Ipn |
Raïssi, Tarek | Conservatoire National Des Arts Et Métiers |
Efimov, Denis | Inria |
Keywords: Linear parameter-varying systems, Variable-structure/sliding-mode control, Constrained control
Abstract: This paper presents the design of a robust control strategy for linear parameter–varying (LPV) systems. The proposed strategy involves the design of a robust control law based on an integral sliding–mode control (ISMC) approach with an interval predictor-based state feedback controller and a Model Predictive Control (MPC) scheme. The proposed controller is robust to some external disturbances and parameter uncertainties and deals with state and input constraints. The integral sliding–mode compensates for matched perturbations starting from the initial moment, i.e., ensuring the sliding–mode from the initial time instance. Then, the interval predictor–based state feedback controller and the MPC deal with the state and input constraints. The proposed strategy guarantees the exponential stability of the system. Furthermore, the simulation results show high performance of the proposed controller.
|
|
FrB25 Invited Session, Lotus Junior 4DE |
Add to My Program |
Security, Safety and Resilience of Discrete Event Systems |
|
|
Chair: Yin, Xiang | Shanghai Jiao Tong University |
Co-Chair: Cai, Kai | Osaka Metropolitan University |
Organizer: Yin, Xiang | Shanghai Jiao Tong University |
Organizer: Cai, Kai | Osaka Metropolitan University |
|
13:30-13:50, Paper FrB25.1 | Add to My Program |
Edit Mechanism Synthesis for Opacity Enforcement under Uncertain Observations |
|
Duan, Wei | Xidian Universisty |
Liu, Ruotian | Polytechnic University of Bari |
Fanti, Maria Pia | Polytechnic of Bari |
Hadjicostis, Christoforos N. | University of Cyprus |
Li, Zhiwu | Xidian University |
Keywords: Discrete event systems, Automata, Game theory
Abstract: This paper addresses the problem of opacity enforcement by using edit functions in discrete event systems modeled as deterministic finite automata under partial observation. The edit function is an output interface of the system that manipulates actual observations to confuse a malicious intruder. We assume that the edit function simply knows whether the intruder observes a larger or smaller set of events than itself, but does not know the exact set of events observed by the intruder. In this uncertain observation setting, the edit function aims to confuse the intruder while relying on its own set of observable events, which requires the edit function to be u-enforcing. The opacity enforcement problem is then transformed to a two-player game between the system and the edit function under partial information. A so-called edit mechanism is proposed in a game scheme to enumerate all possible edited operations following the system behavior. We show that an edit function synthesized from the edit mechanism (if any) can be used to enforce opacity in the system under the uncertain observation setting.
|
|
13:50-14:10, Paper FrB25.2 | Add to My Program |
Security-Preserving Multi-Robot Path Planning for Boolean Specification Tasks Using Labeled Petri Nets |
|
Shi, Weijie | Shaanxi University of Science and Technology |
He, Zhou | Shaanxi University of Science and Technology |
Ma, Ziyue | Xidian University |
Ran, Ning | Hebei University |
Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Petri nets, Optimal control
Abstract: This letter investigates the path planning of multirobot systems for high-level tasks described by Boolean specifications and security constraints. We assume that the behavior of each robot can be identified and partially monitored by a passive intruder. The problem aims to plan an optimal path for each robot such that the tasks expressed in conjunction, disjunction, and negation for trajectories and final states are coordinately completed. The security constraints require that the intruder should never infer the final locations of a set of robots called secret robots. In order to solve this problem, labeled Petri nets are adopted to model the mobile capability of the multirobot systems. Then an integer linear programming problem is proposed to find an optimal solution (if it exists) such that the Boolean specification is fulfilled, while the securities of secret robots are preserved. Finally, the effectiveness of the proposed method is illustrated through several simulation studies.
|
|
14:10-14:30, Paper FrB25.3 | Add to My Program |
Supervisor Fortification against Covert Actuator Attacks (I) |
|
Tai, Ruochen | Nanyang Technological University |
Lin, Liyong | Nanyang Technological University |
Su, Rong | Nanyang Technological University |
Keywords: Discrete event systems, Supervisory control, Cyber-Physical Security
Abstract: This work considers the supervisor fortification problem against covert actuator attacks. A supervisor S' is said to fortify the supervisor S, the latter of which is non-resilient against covert actuator attacks, if S' satisfies two conditions: 1) any covert actuator attack cannot cause damage infliction against S', and 2) S' is control equivalent to S. The key result of this work is that we show the problem of determining the existence of a fortified supervisor to defend against any covert actuator attack, an "exist-for all" decidability question, is decidable. To show the decidability result, we provide a complete and sound procedure that ensures to synthesize a fortified supervisor as long as there exists one.
|
|
14:30-14:50, Paper FrB25.4 | Add to My Program |
Abstraction-Based Synthesis of Controllers for Approximate Opacity (I) |
|
Hou, Junyao | ShanghaiJiaoTong University |
Liu, Siyuan | KTH Royal Institute of Technology |
Yin, Xiang | Shanghai Jiao Tong University |
Zamani, Majid | University of Colorado Boulder |
Keywords: Discrete event systems, Supervisory control, Automata
Abstract: Opacity is an important information-flow security property which characterizes the plausible deniability of certain “secret behaviors” in dynamical systems. In this paper, we study the problem of synthesizing controllers enforcing a notion of opacity over discrete-time control systems with continuous state sets. It is known that, for systems with uncountable state sets, the opacity-enforcing controller synthesis problem is undecidable in general. To address this undecidability issue, in this paper, we develop an abstraction-based approach to tackle the controller synthesis problem. Specifically, we adopt a notion of approximate opacity which is suitable for continuous-space control systems. We propose a notion of approximate initial- state opacity preserving alternating simulation relation which characterizes the closeness between two systems in terms of opacity preservation. We show that, based on this new notion of system relation, one can synthesize an opacity-enforcing controller for the abstract system which is finite and then refine it back to enforce opacity over the original control system. Finally, we present a method for constructing opacity- preserving finite abstractions for discrete-time control systems under some stability properties. Our results are illustrated on a two-room temperature control problem.
|
|
14:50-15:10, Paper FrB25.5 | Add to My Program |
Fault Diagnosis of Discrete Event Systems under Attack (I) |
|
Kang, Tenglong | Xidian University |
Seatzu, Carla | Univ. of Cagliari |
Li, Zhiwu | Xidian University |
Giua, Alessandro | University of Cagliari |
Keywords: Discrete event systems, Automata, Fault diagnosis
Abstract: In this paper, we study the problem of fault diagnosis under cyber attacks in the context of partially-observed discrete event systems. An operator monitors the evolution of a system through the received observations and computes its current diagnosis state. The observation is corrupted by an attacker which has the ability to edit a subset of sensor readings by inserting or erasing some events. In this sense, the attacker may induce the operator to draw incorrect diagnostic conclusions based on the corrupted observation regarding the fault occurrence. In particular, the attack is harmful if a fault can be detected by the operator when looking at an uncorrupted observation, while it is not detected when looking at the corresponding corrupted observation. In addition, the attacker must remain stealthy, i.e., its presence should not be discovered by the operator. We propose a special structure, called a stealthy joint diagnoser, which describes the set of all possible stealthy attacks. We show how to use the stealthy joint diagnoser to perform fault diagnosis under attack. Finally, such a structure also allows one to establish if a stealthy harmful attack may be implemented.
|
|
15:10-15:30, Paper FrB25.6 | Add to My Program |
Towards Supervisory Control Theory in Tactical Environments: A Stackelberg Game Approach (I) |
|
Cui, Bohan | Shanghai Jiao Tong University |
Giua, Alessandro | University of Cagliari |
Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Supervisory control, Automata
Abstract: In this paper, we propose a new framework for supervisory control of discrete-event systems in tactical environments. In contrast to the standard supervisory control theory, where the environments are considered fully adversarial, we consider the possibility of the presence of attackers who have their own objectives that may not necessarily be in opposition to the specification of the supervisor. We formulate this scenario as a Stakelberg game in the leader-follower setting, where the designer proposes a supervisor, and the attacker takes a best response to the supervisor. We characterize the solution to the Stakelberg supervisory control problem as having both cooperative and antagonistic solutions. Moreover, we provide an effective algorithm for synthesizing a cooperative supervisor that enables both players to achieve their objectives. Our work makes an initial step forward from the traditional zerosum setting of supervisory control theory to the non-zero-sum setting. Examples are provided to illustrate our results.
|
|
FrC01 Invited Session, Melati Junior 4010A-4111 |
Add to My Program |
Learning, Optimization, and Game Theory V |
|
|
Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Co-Chair: Doan, Thinh T. | Virginia Tech |
Organizer: Doan, Thinh T. | Virginia Tech |
Organizer: Sayin, Muhammed Omer | Bilkent University |
Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Organizer: Zhang, Kaiqing | University of Maryland |
|
16:00-16:20, Paper FrC01.1 | Add to My Program |
Reinforcement-Learning-Based Risk-Sensitive Optimal Feedback Mechanisms of Biological Motor Control (I) |
|
Cui, Leilei | New York University |
Pang, Bo | New York University |
Jiang, Zhong-Ping | New York University |
Keywords: Data driven control, Biological systems, Optimal control
Abstract: Risk sensitivity is a fundamental aspect of biological motor control that accounts for both the expectation and variability of movement cost in the face of uncertainty. However, most computational models of biological motor control rely on model-based risk-sensitive optimal control, which requires an accurate internal representation in the central neural system to predict the outcomes of motor commands. In reality, the dynamics of human-environment interaction is too complex to be accurately modeled, and noise further complicates system identification. To address this issue, this paper proposes a novel risk-sensitive computational mechanism for biological motor control based on reinforcement learning (RL) and adaptive dynamic programming (ADP). The proposed ADP-based mechanism suggests that humans can directly learn an approximation of the risk-sensitive optimal feedback controller from noisy sensory data without the need for system identification. Numerical validation of the proposed mechanism is conducted on the arm-reaching task under divergent force field. The preliminary computational results align with the experimental observations from the past literature of computational neuroscience.
|
|
16:20-16:40, Paper FrC01.2 | Add to My Program |
Adaptive Optimal Output Regulation of Discrete-Time Linear Systems: A Reinforcement Learning Approach (I) |
|
Chakraborty, Sayan | New York University |
Gao, Weinan | Northeastern University |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Jiang, Zhong-Ping | New York University |
Keywords: Optimal control, Output regulation
Abstract: In this paper, we solve the optimal output regulation problem for discrete-time systems without precise knowledge of the system model. Drawing inspiration from reinforcement learning and adaptive dynamic programming, a data-driven solution is developed that enables asymptotic tracking and disturbance rejection. Notably, it is discovered that the proposed approach for discrete-time output regulation differs from the continuous-time approach in terms of the persistent excitation condition required for policy iteration to be unique and convergent. To address this issue, a new persistent excitation condition is introduced to ensure both uniqueness and convergence of the data-driven policy iteration. The efficacy of the proposed methodology is validated by an inverted pendulum on a cart example.
|
|
16:40-17:00, Paper FrC01.3 | Add to My Program |
Online Learning for Incentive-Based Demand Response (I) |
|
Muthirayan, Deepan | University of California at Irvine |
Khargonekar, Pramod | Univ. of California, Irvine |
Keywords: Learning, Agents-based systems, Energy systems
Abstract: In this paper, we consider the problem of learning online to manage Demand Response (DR) resources. A typical DR mechanism requires the DR manager to assign a baseline to the participating consumer, where the baseline is an estimate of the counterfactual consumption of the consumer had it not been called to provide the DR service. A challenge of estimating the baseline is the incentive the consumers have to inflate the baseline. We consider the problem of learning online to estimate the baseline and to optimize the operating costs over a period of time under such incentives. We propose an online learning scheme that employs least-squares for estimation with a perturbation to the reward price (for the DR services or load curtailment) that is designed to balance the exploration and exploitation trade-off that arises with online learning. We show that, our proposed scheme is able to achieve a very low regret of mathcal{O}left((log{T})^2right) with respect to the optimal operating cost over T days of the DR program with full knowledge of the baseline, and is individually rational for the consumers to participate. Our scheme is significantly better than the averaging type approach, which only fetches mathcal{O}(T^{1/3}) regret.
|
|
17:00-17:20, Paper FrC01.4 | Add to My Program |
Game-Theoretic Deception Methods for Perfectly and Bounded Rational Stealthy Attackers (I) |
|
Fotiadis, Filippos | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Game theory, Cyber-Physical Security, Agents-based systems
Abstract: In this paper, we consider a system that is under the effect of multiple stealthy attackers, whose inputs we design using perfectly and imperfectly rational game-theoretic approaches. The goal of the attackers is to steer the state of the system as far as possible from the origin, so as to disrupt the nominal objective of system regulation. However, to remain stealthy, the attackers must ensure that the total magnitude of their inputs remains below a certain threshold, otherwise they are at risk of being exposed to a detection mechanism that monitors the system. To derive the optimal attack policies for the attackers, we interpret the aforementioned setup as a constrained game, and we solve it in two cases: in the first case, we assume that the attackers are perfectly rational and operate on the Nash equilibrium, which we derive in closed-form; and in the second case, we assume that the attackers are imperfectly rational, and we design two models of bounded rationality as a means to capture their different levels of rationality. Under certain conditions, it is proved that the corresponding bounded rationality models converge to a Nash equilibrium as the levels of rationality increase. Simulations demonstrate the efficiency of the derived attack policies in both the perfectly and the imperfectly rational case.
|
|
17:20-17:40, Paper FrC01.5 | Add to My Program |
Maximizing Reachability in Factored MDPs Via Near-Optimal Clustering with Applications to Control of Multi-Agent Systems (I) |
|
Fiscko, Carmel | Washington University in St Louis |
Kar, Soummya | Carnegie Mellon University |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Markov processes, Agents-based systems, Stochastic systems
Abstract: We consider cluster-based control of agents modeled as a transition-independent Markov decision process (MDP), and the objective of assigning agents to clusters to maximize the size of the reachable state space. This goal is relevant to applications for which the same MDP model may be used to compute policies for different reward functions. The system controller wishes to define clusters to maximize flexibility within the attainable outcomes. Under the transition-independent MDP formulation, we first show that the size of the reachable state space is a submodular function. While maximizing the reachable state space subject to a desired number of clusters is a hard problem, properties of submodular optimization can be leveraged to propose approximate clustering techniques. We next demonstrate that a greedy clustering approach is a viable approximate solution and has a bounded optimality gap. We compare the performance in terms of value and computation complexity in using the flexibility-optimized clustering assignment versus a clustering assignment optimized for a specific reward function; there will be a loss in value at a savings in complexity. Finally, we demonstrate the utility of the flexibility-optimized clustering assignment in simulation on the same MDP model with various reward functions.
|
|
17:40-18:00, Paper FrC01.6 | Add to My Program |
An Observer-Based Reinforcement Learning Solution for Model-Following Problems (I) |
|
Abouheaf, Mohammed | Bowling Green State University |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Mayyas, Mohammad | Bowling Green State University |
Hashim, Hashim A | Carleton University |
Keywords: Machine learning, Adaptive systems, Optimal control
Abstract: This paper introduces a novel model-free solution for a multi-objective model-following control problem, utilizing an observer-based adaptive learning approach. The goal is to regulate model-following error dynamics and optimize process variables simultaneously. Integral reinforcement learning is employed to adapt three key strategies, including observation, closed-loop stabilization, and reference trajectory tracking. Implementation uses an approximate projection estimation method under mild conditions on learning parameters.
|
|
FrC02 Invited Session, Orchid Main 4202-4303 |
Add to My Program |
Data-Driven Verification and Control of Cyber-Physical Systems III |
|
|
Chair: Zamani, Majid | University of Colorado Boulder |
Co-Chair: Lavaei, Abolfazl | Newcastle University |
Organizer: Lavaei, Abolfazl | Newcastle University |
Organizer: Jungers, Raphaël M. | University of Louvain |
Organizer: Abate, Alessandro | University of Oxford |
|
16:00-16:20, Paper FrC02.1 | Add to My Program |
Star Based Reachability Analysis of Interval Neural Networks (I) |
|
Bondalakunta, Vishnu | Kansas State University |
Prabhakar, Pavithra | Kansas State University |
Keywords: Formal Verification/Synthesis
Abstract: The paper explores the computation of output reachable sets for Interval Neural Networks (INNs) with ReLU activation functions. An INN is a generalization of a Neural Network (NN), where the weights and biases are intervals rather than numbers. We propose a novel algorithm for computing precise over-approximations of the output reachable sets for INNs by introducing a novel data structure called interval star set which is a generalized version of the star set. Specifically, when the INN is a traditional NN, our method reduces to the standard star-based verification of NNs. We present experimental results that demonstrate that our method outperforms the existing method based on mixed integer linear programming (MILP) for the problem of INN output reachable set computation.
|
|
16:20-16:40, Paper FrC02.2 | Add to My Program |
Data-Driven Safe Controller Synthesis for Deterministic Systems: A Posteriori Method with Validation Tests (I) |
|
Chen, Yu | Shanghai Jiao Tong University |
Shang, Chao | Tsinghua University |
Huang, Xiaolin | Shanghai Jiao Tong University |
Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Formal Verification/Synthesis, Data driven control
Abstract: In this work, we investigate the data-driven safe control synthesis problem for unknown dynamic systems. We first formulate the safety synthesis problem as a robust convex program (RCP) based on notion of control barrier function. To resolve the issue of unknown system dynamic, we follow the existing approach by converting the RCP to a scenario convex program (SCP) by randomly collecting finite samples of system trajectory. However, to improve the sample efficiency to achieve a desired confidence bound, we provide a new posteriori method with validation tests. Specifically, after collecting a set of data for the SCP, we further collect another set of independent validate data as posterior information to test the obtained solution. We derive a new overall confidence bound for the safety of the controller that connects the original sample data, the support constraints, and the validation data. The efficiency of the proposed approach is illustrated by a case study of room temperature control. We show that, compared with existing methods, the proposed approach can significantly reduce the required number of sample data to achieve a desired confidence bound
|
|
16:40-17:00, Paper FrC02.3 | Add to My Program |
Symbolic Abstractions with Guarantees: A Data-Driven Divide-And-Conquer Strategy (I) |
|
Lavaei, Abolfazl | Newcastle University |
Keywords: Data driven control, Large-scale systems, Formal Verification/Synthesis
Abstract: This article is concerned with a data-driven divide-and-conquer strategy to construct symbolic abstractions for interconnected control networks with unknown mathematical models. We employ a notion of alternating bisimulation functions (ABF) to quantify the closeness between state trajectories of an interconnected network and its symbolic abstraction. Consequently, the constructed symbolic abstraction can be leveraged as a beneficial substitute for the formal verification and controller synthesis over the interconnected network. In our data-driven framework, we first establish a relation between each unknown subsystem and its data-driven symbolic abstraction, so-called alternating pseudo-bisimulation function (APBF), with a guaranteed probabilistic confidence. We then provide compositional conditions based on max-type small-gain techniques to construct an ABF for an unknown interconnected network using APBF of its individual subsystems, constructed from data. We demonstrate the efficacy of our data-driven approach over a room temperature network composing 100 rooms with unknown models. We construct a symbolic abstraction from data for each room as an appropriate substitute of original system and compositionally synthesize controllers regulating the temperature of each room within a safe zone with some guaranteed probabilistic confidence.
|
|
17:00-17:20, Paper FrC02.4 | Add to My Program |
Transfer Learning for Barrier Certificates (I) |
|
Nadali, Alireza | University of Colorado, Boulder |
Trivedi, Ashutosh | University of Colorado Boulder |
Zamani, Majid | University of Colorado Boulder |
Keywords: Learning, Formal Verification/Synthesis, Neural networks
Abstract: A principled approach to safety verification of dynamical systems demands formal guarantees. Barrier certificates are an effective tool for searching safety proofs in the form of inductively verifiable invariants. However, finding barrier certificates is an expensive and time-consuming process that demands human expertise in selecting various templates, hyperparameters, and decision procedures. Is it possible to transfer the knowledge gained in finding a barrier certificate and control algorithm from a given environment source environment to a different but related environment target environment? This paper presents a transfer learning approach to adapt the barrier certificates (of any template) in the form of neural networks from the source to the target environment. We derive a validity condition to formally guarantee the correctness of network by leveraging its Lipschitz continuity. To demonstrate the effectiveness of our approach, we apply it to two case studies, namely the inverted pendulum and DC motor. Our results show that transfer learning can successfully adapt barrier certificates from the source to the target environment, reducing the need for human expertise and speeding up the verification process.
|
|
17:20-17:40, Paper FrC02.5 | Add to My Program |
Abstraction-Based Safety Analysis of Linear Dynamical Systems with Neural Network Controllers (I) |
|
Lal, Ratan | Northwest Missouri State University |
Prabhakar, Pavithra | Kansas State University |
Keywords: Neural networks, Linear systems, Formal Verification/Synthesis
Abstract: We consider the safety verification problem of a closed-loop discrete-time linear dynamical system with a neural network controller. The crux of safety verification relies on computing output reachable sets of the dynamical system and the neural network. Reachable set computation time of the neural network grows with the network size. To address the scalability issue, our main approach consists of abstracting the neural network controller into a smaller emph{annotated interval neural network} (AINN), and using this to compute an over-approximation of the reachable set of the closed-loop system. We present a novel approach for output reachable set computation of an AINN by decomposing it into two reachable set computation problems on neural networks, which we then compute using star-sets. Our experimental analysis on two benchmarks demonstrate the trade-off in the precision and time for reachable set computation.
|
|
17:40-18:00, Paper FrC02.6 | Add to My Program |
Towards Learning and Verifying Maximal Neural Lyapunov Functions (I) |
|
Liu, Jun | University of Waterloo |
Meng, Yiming | University of Waterloo |
Fitzsimmons, Maxwell | University of Waterloo |
Zhou, Ruikun | University of Waterloo |
Keywords: Formal Verification/Synthesis, Learning, Stability of linear systems
Abstract: The search for Lyapunov functions is a crucial task in the analysis of nonlinear systems. In this paper, we present a physics-informed neural network (PINN) approach to learning a Lyapunov function that is nearly maximal for a given stable set. A Lyapunov function is considered nearly maximal if its sub-level sets can be made arbitrarily close to the boundary of the domain of attraction. We use Zubov's equation to train a maximal Lyapunov function defined on the domain of attraction. Additionally, we propose conditions that can be readily verified by satisfiability modulo theories (SMT) solvers for both local and global stability. We provide theoretical guarantees on the existence of maximal Lyapunov functions and demonstrate the effectiveness of our computational approach through numerical examples.
|
|
FrC03 Invited Session, Orchid Main 4204-4305 |
Add to My Program |
Cyber-Physical Systems: Privacy and Security of Networked Systems |
|
|
Chair: Sadabadi, Mahdieh S. | Queen Mary University of London |
Co-Chair: Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Organizer: Sadabadi, Mahdieh S. | University of Manchester |
Organizer: Escudero, Cédric | INSA Lyon, Laboratoire Ampère |
Organizer: Selvi, Daniela | Università Di Pisa |
Organizer: Soudjani, Sadegh | Newcastle University |
Organizer: Murguia, Carlos | Eindhoven University of Technology |
Organizer: Chong, Michelle | Eindhoven University of Technology |
Organizer: Ferrari, Riccardo M.G. | Delft University of Technology |
Organizer: Sasahara, Hampei | Tokyo Institute of Technology |
Organizer: Zhu, Quanyan | New York University |
|
16:00-16:20, Paper FrC03.1 | Add to My Program |
Secure State Estimation under Actuator and Sensor Attacks Using Sliding Mode Observers |
|
Keijzer, Twan | Delft University of Technology |
Ferrari, Riccardo M.G. | Delft University of Technology |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Large-scale systems, Networked control systems, Fault diagnosis
Abstract: Interconnections in modern systems make them vulnerable to adversarial attackers both by corrupting communication channels and compromising entire subsystems. The field of secure state estimation (SSE) aims to provide correct state estimation even when an unknown part of the measurement signals is corrupted. In this paper, we propose a solution to a novel generalized SSE problem in which full subsystems can be compromised, corrupting both the actuation and measurement signals. For a full system with p measurements, the proposed sliding mode observer (SMO)-based solution allows for up to p attack channels which can be arbitrarily distributed amongst attacks on actuation and measurement signals. This is a much larger class of attacks than considered in the existing literature. The method is demonstrated on 10 interconnected mass-spring-damper subsystems.
|
|
16:20-16:40, Paper FrC03.2 | Add to My Program |
On the Trade-Offs between Accuracy, Privacy, and Resilience in Average Consensus Algorithms (I) |
|
Ramos, Guilherme | Instituto De Telecomunicações, 1049-001 Lisbon, Portugal |
Teixeira, André M. H. | Uppsala University |
Pequito, Sergio | Uppsala University |
Keywords: Network analysis and control, Control applications, Estimation
Abstract: There can be none. In this paper, we address the problem of a set of discrete-time networked agents reaching average consensus privately and resiliently in the presence of a subset of attacked agents. Existing approaches to the problem rely on trade-offs between accuracy, privacy, and resilience, sacrificing one for the others. We show that a separation-like principle for privacy-preserving and resilient discrete-time average consensus is possible. Specifically, we propose a scheme that combines strategies from resilient average consensus and private average consensus, which yields both desired properties. The proposed scheme has polynomial time-complexity on the number of agents and the maximum number of attacked agents. In other words, each agent that is not under attack is able to detect and discard the values of the attacked agents, reaching the average consensus of non-attacked agents while keeping each agent’s initial state private. Finally, we demonstrate the effectiveness of the proposed method with numerical results.
|
|
16:40-17:00, Paper FrC03.3 | Add to My Program |
Privacy-Informed Consensus-Based Secondary Control in Cyber-Physical DC Microgrids |
|
Sadabadi, Mahdieh S. | University of Manchester |
Keywords: Power electronics, Distributed control, Linear systems
Abstract: This letter proposes a privacy-informed consensus-based distributed secondary control strategy for cyber-physical dc microgrids. The proposed secondary control approach relies on output masks that transform the physical states of distributed generation (DG) units to some auxiliary states by adding local perturbation signals, whose functional form and parameters are local and chosen independently by each DG unit. The proposed privacy-preserving secondary control scheme ensures voltage balancing and proportional current sharing in dc microgrids without disclosing the physical states of distributed generation units to their neighbors. Simulation case studies confirm the theoretical results of this letter.
|
|
17:00-17:20, Paper FrC03.4 | Add to My Program |
Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning (I) |
|
Gohari, Parham | The University of Texas at Austin |
Hale, Matthew | University of Florida |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Control Systems Privacy, Agents-based systems, Machine learning
Abstract: In cooperative multi-agent reinforcement learning (Co-MARL), a team of agents must jointly optimize the team's long-term rewards to learn a designated task. Optimizing rewards as a team often requires inter-agent communication and data sharing, leading to potential privacy implications. We assume privacy considerations prohibit the agents from sharing their environment interaction data. Accordingly, we propose Privacy-Engineered Value Decomposition Networks (PE-VDN), a Co-MARL algorithm that models multi-agent coordination while provably safeguarding the confidentiality of the agents' environment interaction data. We integrate three privacy-engineering techniques to redesign the data flows of the VDN algorithm—an existing Co-MARL algorithm that consolidates the agents' environment interaction data to train a central controller that models multi-agent coordination—and develop PE-VDN. In the first technique, we design a distributed computation scheme that eliminates vanilla VDN's dependency on sharing environment interaction data. Then, we utilize a privacy-preserving multi-party computation protocol to guarantee that the data flows of the distributed computation scheme do not pose new privacy risks. Finally, we enforce differential privacy to preempt inference threats against the agents' training data—past environment interactions—when they take actions based on their neural network predictions. We implement PE-VDN in StarCraft Multi-Agent Competition (SMAC) and show that it achieves 80% of vanilla VDN's win rate while maintaining differential privacy levels that provide meaningful privacy guarantees. The results demonstrate that PE-VDN can safeguard the confidentiality of agents' environment interaction data without sacrificing multi-agent coordination.
|
|
17:20-17:40, Paper FrC03.5 | Add to My Program |
Privacy Assessment for Linear Consensus Using Constrained Convex Generators (I) |
|
Silvestre, Daniel | NOVA University of Lisbon |
Keywords: Identification, Linear systems, Agents-based systems
Abstract: The problem of designing privacy-preserving algorithms for multi-agent systems running distributed algorithms has attracted the attention of the Control community, especially for maintaining the privacy of the initial state. In this paper, we tackle the problem of checking the privacy of the algorithm itself in terms of the linear parameters used by each agent. We first start by introducing a metric of privacy that translates the uncertainty that an attacker has related to the parameters for the case that it can eavesdrop the state from other agents given the public nature of the network. We then propose to resort to techniques in the literature to compute such metric and show how these can be used by: i) the attacker to estimate the parameters from successive runs of the algorithm; ii) by a defender that can decide when to trigger a negotiation of new parameters to ensuse privacy of the overall system. The technique is illustrated in simulations for the specific example of a consensus protocol. The tools developed herein can complement resilient consensus algorithms based on reputation metrics in the sense that the defender triggering changes to the dynamics while maintaining the overall convergence value can render the calculations of the optimal attacks rather troublesome.
|
|
17:40-18:00, Paper FrC03.6 | Add to My Program |
Structural Analysis and Design for Security against Stealthy Attacks in Uncertain Systems (I) |
|
Zhang, Kangkang | Imperial College London |
Kasis, Andreas | University of Cyprus |
Polycarpou, Marios M. | University of Cyprus |
Parisini, Thomas | Imperial College & Univ. of Trieste |
Keywords: Fault diagnosis, Fault detection, Fault tolerant systems
Abstract: This paper considers the existence of stealthy integrity attacks for uncertain cyber-physical systems from a geometric point of view. We derive geometric structural conditions for the existence of stealthy integrity attacks and deduce the minimal actuator communication channels that, when protected, no stealthy integrity attacks exists. To examine different knowledge disclosure conditions for the attacker, we consider: (a) the attacker has full knowledge of the system linear terms but only the structure of the uncertain term, and (b) the attacker only knows the structures of the linear terms and the uncertain non-linear term. For scenario (a), the obtained existence condition of stealthy integrity attacks is that the uncertainty is decoupled with the maximal output-zeroing controlled-invariant subspace. In scenario (b), a graph is used to describe the uncertain system and we show that the existence of stealthy attacks is only possible if the uncertainty is decoupled with the fixed maximal output-zeroing controlled-invariant subspace. For each disclosure scenario, we deduce the minimum actuator communication channels to protect for guaranteeing the absence of stealthy integrity attacks. Our results are validated with a numerical example.
|
|
FrC04 Regular Session, Simpor Junior 4913 |
Add to My Program |
Traffic Control |
|
|
Chair: Delle Monache, Maria Laura | University of California, Berkeley |
Co-Chair: Consolini, Luca | University of Parma |
|
16:00-16:20, Paper FrC04.1 | Add to My Program |
Near Collision and Controllability Analysis of Nonlinear Optimal Velocity Follow-The-Leader Dynamical Model in Traffic Flow |
|
Nick Zinat Matin, Hossein | University of California, Berkeley |
Delle Monache, Maria Laura | University of California, Berkeley |
Keywords: Traffic control, Autonomous vehicles, Nonlinear systems
Abstract: This paper examines the optimal velocity follow-the-leader dynamics, a microscopic traffic model, and explores different aspects of the dynamical model, with particular emphasis on collision analysis. More precisely, we present a rigorous boundary-layer analysis of the model which provides a careful understanding of the behavior of the dynamics in trade-off with the singularity of the model at collision which is essential in the controllability of the system.
|
|
16:20-16:40, Paper FrC04.2 | Add to My Program |
Boundary Stabilization for Mixed Traffic Flow in the Presence of Autonomous Vehicle Platooning |
|
Zhan, Jingyuan | Beijing University of Technology |
Wu, Jiahao | Beijing University of Technology |
Zhang, Liguo | Beijing University of Technology |
Keywords: Traffic control, Distributed parameter systems, Autonomous vehicles
Abstract: This paper investigates the boundary stabilization problem for a mixed traffic flow system composed of the traditional human-driven traffic flow and a platoon of autonomous vehicles. Firstly, we employ the first-order LWR model to describe the mixed traffic flow, which is with a bilateral moving spatial domain governed by the platoon. In order to stabilize the mixed traffic flow into the desired density and platoon length, a downstream boundary controller is designed based on the information of upstream density and platoon length. To facilitate the well-posedness and stabilization analysis, we transform the system into a coupled PDE-ODE system with fixed spatial domain. Then we prove the well-posedness of the system, and we further derive sufficient conditions for ensuring the local exponential stability of the system by employing the Lyapunov function method. Finally, numerical simulations are provided to validate the theoretical results.
|
|
16:40-17:00, Paper FrC04.3 | Add to My Program |
Toward Efficient Traffic Signal Control: Smaller Network Can Do More |
|
Li, Shuya | Tsinghua University |
Mei, Hao | New Jersey Institue of Technology |
Li, Jianwei | Moffett.AI |
Wei, Hua | Arizona State University |
Xu, Dongkuan | North Carolina State University |
Keywords: Traffic control, Machine learning, Neural networks
Abstract: Reinforcement learning (RL)-based traffic signal control (TSC) optimizes signal switches through RL agents, adapting to intersection updates. Yet, existing RL-based TSC methods often demand substantial storage and computation resources, impeding real-world implementation. This study introduces a two-stage approach to compress the network, maintaining performance. Firstly, we identify a compact network via a removal-verification strategy. Secondly, pruning yields an even sparser network. In addition, Multi-task RL is adopted for multi-intersection TSC, reducing costs, and boosting performance. Our extensive evaluation shows a compressed network at 1/1432nd of original parameters, with an 11.2% enhancement over the best baseline. This work presents an efficient RL-based TSC solution for real-world contexts, offering insights into challenges and opportunities in the field.
|
|
17:00-17:20, Paper FrC04.4 | Add to My Program |
Inducing Desired Equilibrium in Taxi Repositioning Problem with Adaptive Incentive Design |
|
Li, Jianhui | Zhejiang University |
Niu, Youcheng | Zhejiang University |
Li, Shuang | The Chinese University of Hong Kong, Shenzhen |
Li, Yuzhe | Northeastern University |
Xu, Jinming | Zhejiang University |
Wu, Junfeng | The Chinese Unviersity of Hong Kong, Shenzhen |
Keywords: Traffic control, Optimization algorithms, Network analysis and control
Abstract: We study the problem of designing incentives to induce desired equilibrium in taxi repositioning problems. In this scenario, self-interested idle drivers will update their repositioning strategies with observed payoff. Meanwhile, the platform will adaptively design incentives to induce a better Nash equilibrium for global efficiency. We formulate the problem as a bi-level optimization problem where the incentive designer and idle drivers simultaneously update their decision variables. We prove that agents' strategies will reach Nash equilibrium, and the incentive designer's objective function will reach optimality under Polyak Lojasiewicz (PL) condition. Furthermore, we derive a sufficient condition for the PL condition to hold for the upper-level objective function and lower-level agents' payoff function. Finally, we demonstrate the efficiency of the proposed method by numerical results.
|
|
17:20-17:40, Paper FrC04.5 | Add to My Program |
Local Optimization of MAPF Solutions on Directed Graphs |
|
Ardizzoni, Stefano | University of Parma |
Saccani, Irene | Università Di Parma |
Consolini, Luca | Università Di Parma |
Locatelli, Marco | University of Parma |
Keywords: Optimization, Traffic control, Agents-based systems
Abstract: Among sub-optimal MAPF solvers, rule-based algorithms are particularly appealing since they are complete. Even in crowded scenarios, they allow finding a feasible solution that brings each agent to its target, preventing deadlock situations. However, generally, rule-based algorithms provide solutions that are much longer than the optimal one. The main contribution of this paper is the introduction of an iterative local search procedure in MAPF. We start from a feasible suboptimal solution and we perform a local search in a neighborhood of this solution, to find a shorter one. Iteratively, we repeat this procedure until the solution cannot be shortened any longer. At the end, we obtain a solution, that is still sub-optimal, but, in general, of much better quality than the initial one. We use dynamic programming for the local search procedure. Under this respect, the fact that our search is local is fundamental to reduce the time complexity of the algorithm. Indeed, if we apply a standard dynamic programming the number of explored states grows exponentially with the number of agents. As we will see, the introduction of a locality constraint allows solving the (local) dynamic programming problem in a time that grows only polynomially with respect to the number of agents.
|
|
17:40-18:00, Paper FrC04.6 | Add to My Program |
Modeling Multiday Route Choices of Strategic Commuters: A Mean Field Game Approach |
|
Wu, Minghui | University of Michigan |
Yin, Yafeng | University of Michigan |
Lynch, Jerome | Duke University |
Keywords: Mean field games, Traffic control
Abstract: In the era of connected and automated mobility, commuters will possess strong computation capabilities, enabling them to make foresighted and strategic route choices. This paper investigates the implications of such strategic planning on traffic patterns by modeling the commute problem as a mean field game, where every traveler plans for sequential route choices over a span of several days. We examine the concept of multiday user equilibrium, a special mean field equilibrium under commuter interactions, to derive network traffic flow patterns. Under mild conditions, our analysis establishes the existence and uniqueness of the equilibrium flow pattern and explores its relationship with conventional Wardrop equilibrium.
|
|
FrC05 Invited Session, Simpor Junior 4912 |
Add to My Program |
Strategic Coordination in Multi-Agent Systems and Its Applications |
|
|
Chair: Vasconcelos, Marcos M. | Florida State University |
Co-Chair: Park, Shinkyu | KAUST |
Organizer: Vasconcelos, Marcos M. | Florida State University |
Organizer: Park, Shinkyu | KAUST |
|
16:00-16:20, Paper FrC05.1 | Add to My Program |
Scenario-Game ADMM: A Parallelized Scenario-Based Solver for Stochastic Noncooperative Games (I) |
|
Li, Jingqi | University of California, Berkeley |
Chiu, Chih-Yuan | University of California, Berkeley |
Peters, Lasse | Delft University of Technology |
Palafox Escobedo, Fernando | University of Texas at Austin |
Karabag, Mustafa O. | The University of Texas at Austin |
Alonso-Mora, Javier | Delft University of Technology |
Sojoudi, Somayeh | UC Berkeley |
Tomlin, Claire J. | UC Berkeley |
Fridovich-Keil, David | The University of Texas at Austin |
Keywords: Game theory, Decentralized control, Autonomous systems
Abstract: Decision making in multi-agent games can be extremely challenging, particularly under uncertainty. In this work, we propose a new sample-based approximation to a class of stochastic, general-sum, pure Nash games, where each player has an expected-value objective and a set of chance constraints. This new approximation scheme inherits the accuracy of objective approximation from the established sample average approximation (SAA) method and enjoys a feasibility guarantee derived from the scenario optimization literature. We characterize the sample complexity of this new game-theoretic approximation scheme, and observe that high accuracy usually requires a large number of samples, which results in a large number of sampled constraints. To accommodate this, we decompose the approximated game into a set of small-size games with few constraints for each sampled scenario, and propose a decentralized, consensus ADMM algorithm to efficiently compute a generalized Nash equilibrium of the approximated game. We prove the convergence of our algorithm and empirically demonstrate superior performance compared to a recent baseline.
|
|
16:20-16:40, Paper FrC05.2 | Add to My Program |
Markov Games with Decoupled Dynamics: Price of Anarchy and Sample Complexity (I) |
|
Zhang, Runyu | Harvard University |
Zhang, Yuyang | Harvard University |
Konda, Rohit | UC Santa Barbara |
Ferguson, Bryce L. | University of California, Santa Barbara |
Marden, Jason R. | University of California, Santa Barbara |
Li, Na | Harvard University |
Keywords: Game theory, Markov processes, Learning
Abstract: This paper studies the finite-time horizon Markov games where the agents' dynamics are decoupled but the rewards can possibly be coupled across agents. The policy class is restricted to local policies where agents make decisions using their local state. We first introduce the notion of smooth Markov games which extends the smoothness argument for normal form games to our setting, and leverage the smoothness property to bound the price of anarchy of the Markov game. For a specific type of Markov game called the Markov potential game, we also develop a distributed learning algorithm, multi-agent soft policy iteration (MA-SPI), which provably converges to a Nash equilibrium. Sample complexity of the algorithm is also provided. Lastly, our results are validated using a dynamic covering game.
|
|
16:40-17:00, Paper FrC05.3 | Add to My Program |
Factorization of Multi-Agent Sampling-Based Motion Planning (I) |
|
Zanardi, Alessandro | ETH Zurich |
| |