| |
Last updated on May 9, 2024. This conference program is tentative and subject to change
Technical Program for Wednesday July 10, 2024
To show or hide the keywords and abstract (text summary) of a paper (if available), click on the paper title
Open all abstracts
Close all abstracts
Presentation In person On-line No presentation No information
|
WeA01 |
Metro E/C |
RI: Machine Learning in Control |
RI Session |
Chair: Shahbakhti, Mahdi | University of Alberta |
Co-Chair: Yoon, Se Young (Pablo) | University of New Hampshire |
|
10:00-10:03, Paper WeA01.1 | |
A Physics-Informed Machine Learning Approach to Predict Soil Water Content for Agricultural Decision-Making |
|
Bagheri, Amirsalar | Kansas State University |
Patrignani, Andres | Kansas State University |
Ghanbarian, Behzad | Kansas State University |
Babaei Pourkargar, Davood | Kansas State University |
Keywords: Machine learning, Modeling, Model Validation
Abstract: This paper presents a novel hybrid modeling approach that combines physics-based modeling and machine learning (ML) methods to enhance soil water content dynamics prediction for agricultural decision-making. The proposed approach outperforms current methods for predicting water content based on soil-water physics or purely on data-driven strategies. Initially, a Markov chain-based model is employed to estimate soil water content. The uncertainty associated with the model-based estimation is then quantified by applying ML algorithms, such as support vector machine (SVM), random forest (RF), and feedforward neural network (FNN), to the soil water content data obtained from the Kansas mesoscale network (Mesonet). This quantified uncertainty reveals the complex soil water content dynamics that analytical Markov chain-based models cannot capture. The term physics-informed ML (PIML) refers to integrating soil-water physics principles with the predictive capabilities of data-driven models. Furthermore, multiple Mesonet time series datasets are utilized to assess the influence of physics-based knowledge on ML predictions. The evaluation of the proposed PIML models demonstrates significant improvements in predicting soil water content.
|
|
10:03-10:06, Paper WeA01.2 | |
Transfer Learning for Dynamical Systems Models Via Autoencoders and GANs |
|
Damiani, Angelo | Gran Sasso Science Institute |
Viera López, Gustavo | Gran Sasso Science Institute |
Manganini, Giorgio | Gran Sasso Science Institute |
Metelli, Alberto Maria | Politecnico Di Milano |
Restelli, Marcello | Politecnico Di Milano |
Keywords: Machine learning
Abstract: Transfer learning has seen significant progress in the domains of supervised learning and reinforcement learning, yet there remains a noticeable gap in the area of regression. In supervised learning, various transfer learning methods have been developed to leverage knowledge from one task to improve performance on another, but these approaches are primarily designed for classification tasks. In the realm of reinforcement learning, many techniques focus on policy transfer, and those that do address samples transfer predominantly apply it within homogeneous contexts. This paper presents a novel algorithm that bridges the transfer learning gap between supervised learning and reinforcement learning while specifically addressing the regression problem of dynamical systems' model estimation. Our approach harnesses the feature extraction capabilities of autoencoders and the generative power of Generative Adversarial Networks (GANs) to train a mapping that facilitates the seamless transformation of samples between dynamical systems. This approach represents a significant advancement in the field of transfer learning, as it offers a versatile and effective solution for transferring regression models across heterogeneous domains. Our experimental results demonstrate the algorithm's efficacy and its potential to improve model generalization and adaptability in diverse scenarios with varying data distributions and dynamics.
|
|
10:06-10:09, Paper WeA01.3 | |
Concurrent Learning and Lyapunov-Based Updates of Deep Neural Networks for Euler-Lagrange Dynamic Systems |
|
Basyal, Sujata | Auburn University |
Ting, Jonathan | Auburn University |
Mishra, Kislaya | Auburn University |
Allen, Brendon C. | Auburn University |
Keywords: Machine learning, Lyapunov methods, Nonlinear systems identification
Abstract: This paper presents a deep neural network (DNN)- and concurrent learning (CL)-based adaptive control architecture for an Euler-Lagrange dynamic system that guarantees system performance for the first time. The developed controller includes two DNNs with the same output-layer weights to ensure feasibility of the control system. In this work, a Lyapunov-and CL-based update law is developed to update the output-layer DNN weights in real-time; whereas, the inner-layer DNN weights are updated offline using data that is collected in real-time. A Lyapunov-like analysis is performed to prove that the proposed controller yields semi-global exponential convergence to an ultimate bound for the output-layer weight estimation errors and for the trajectory tracking errors.
|
|
10:09-10:12, Paper WeA01.4 | |
Model Free Difference Feedback Control of Stochastic Systems |
|
Zaheer, Muhammad Hamad | University of New Hampshire |
Yoon, Se Young (Pablo) | University of New Hampshire |
Keywords: Machine learning, Adaptive control, Stochastic optimal control
Abstract: This paper presents a model-free reinforcement learning (RL) algorithm for the output-difference feedback control (ODFC) of stochastic systems with measurement and process noise. A policy iteration algorithm is presented using a quadratic output-difference reward function to learn the optimal ODFC from noisy output difference measurements. It is proved that the proposed method trains stable dynamic control laws that approach the analytical optimal solution, even in the presence of process and measurement noise. Simulation results are presented to demonstrate the effectiveness of the proposed scheme.
|
|
10:12-10:15, Paper WeA01.5 | |
Control-Based Graph Embeddings with Data Augmentation for Contrastive Learning |
|
Ahmad, Obaid Ullah | University of Texas at Dallas |
Said, Anwar | Vanderbilt University |
Shabbir, Mudassir | Information Technology University |
Koutsoukos, Xenofon | Vanderbilt University |
Abbas, Waseem | University of Texas at Dallas |
Keywords: Machine learning, Network analysis and control, Emerging control applications
Abstract: In this paper, we study the problem of unsupervised graph representation learning by harnessing the control properties of dynamical networks defined on graphs. Our approach introduces a novel framework for contrastive learning, a widely prevalent technique for unsupervised representation learning. A crucial step in contrastive learning is the creation of `augmented' graphs from the input graphs. Though different from the original graphs, these augmented graphs retain the original graph's structural characteristics. Here, we propose a unique method for generating these augmented graphs by leveraging the control properties of networks. The core concept revolves around perturbing the original graph to create a new one while preserving the controllability properties specific to networks and graphs. Compared to the existing methods, we demonstrate that this innovative approach enhances the effectiveness of contrastive learning frameworks, leading to superior results regarding the accuracy of the classification tasks. The key innovation lies in our ability to decode the network structure using these control properties, opening new avenues for unsupervised graph representation learning.
|
|
10:15-10:18, Paper WeA01.6 | |
Distributed Reinforcement Learning for Swarm Systems with Reward Machines |
|
Meshkat Alsadat, Shayan | Arizona State University |
Baharisangari, Nasim | Arizona State University |
Paliwal, Yash | Arizona State University |
Xu, Zhe | Arizona State University |
Keywords: Machine learning, Formal verification/synthesis, Agents-based systems
Abstract: We introduce a decentralized reinforcement learning (RL) algorithm for swarm systems where reward machines (a type of Mealy machines) are used to encode the non-Markovian reward functions. We use the generalized moments (GMs) to characterize swarm features. Each agent estimates the GM of the swarm state and uses it to estimate the state of the reward machine. The agent then uses the estimated state of the reward machine to update its q-values. We use the gossip algorithm for communication between the agents. Agents exploit this communication to update their estimated GM of the swarm state, which leads to an update of their estimated state of the reward machine. We derive an upper bound for the error between the estimate GM of the swarm state and the ground truth GM of the swarm state, and using that upper bound; we prove the convergence of our proposed algorithm, swarm q-learning with reward machines (Swarm-QRM). To demonstrate the efficiency of our approach, we present two case studies wherein, for Case Study 1, a swarm of agents will perform a pickup and delivery task, and in Case Study 2, the swarm of agents will conduct a search and rescue task. We also show that Swarm-QRM outperforms the baselines, q-learning in augmented state space (QAS) and Double Deep Q-Network (DDQN).
|
|
10:18-10:21, Paper WeA01.7 | |
Integrating Machine Learning in Process Control with LSTMc: A Case Study in Batch Crystallization |
|
Sitapure, Niranjan | Texas A&M University |
Kwon, Joseph | Texas A&M University |
Keywords: Adaptive control, Computer-aided control design, Manufacturing systems
Abstract: Over the past three decades, the primary methods for managing complex chemical processes have predominantly relied on PI/PID controllers and model predictive controllers (MPCs). Despite their advancements, both approaches have limitations. PI/PID controllers require specific tuning for each scenario, while MPCs are computationally intensive. To address these challenges, we introduce the LSTM-controller (LSTMc), which is a model-independent, data-driven framework that leverages the robust time-series prediction capabilities of LSTM networks. The LSTMc predicts future control inputs by analyzing the evolution of the system's state and error dynamics, taking into account both the current and previous time steps. Our experiments, particularly in dextrose batch crystallization, have demonstrated the effectiveness of this approach. Impressively, the LSTMc achieves a set-point deviation of less than 2% in 10+ case studies we tested. These results position the LSTMc as a promising alternative for process control, as it adeptly adapts to changing process conditions and set-points, and offers efficient computation for optimal control inputs.
|
|
10:21-10:24, Paper WeA01.8 | |
Learning-Based Model Predictive Control of an Ammonia Synthesis Reactor |
|
Oliveira Cabral, Thiago | Kansas State University |
Bagheri, Amirsalar | Kansas State University |
Babaei Pourkargar, Davood | Kansas State University |
Keywords: Chemical process control, Machine learning, Predictive control for nonlinear systems
Abstract: We investigate the application of state-of-the-art recurrent machine learning methods to tackle the computational challenges associated with the real-time solvability of a packed-bed ammonia synthesis reactor model for optimal control design. A high-fidelity multi-physics model is developed for the ammonia reactor by integrating reaction kinetics and complex transport phenomena. The model involves a system of interconnected algebraic, ordinary, and partial differential equations, providing a deeper understanding and improved prediction accuracy of pressure, temperature, and species concentration dynamics. However, due to its computational complexity, the resulting model is unsuitable for optimal control applications such as model predictive control (MPC). To overcome this limitation, we create a reduced-order model based on long short-term memory (LSTM) using time series data generated from offline simulations of process outputs. This surrogate model effectively captures the system’s nonlinear dynamics, enabling the successful implementation of MPC and achieving optimal closed-loop performance.
|
|
10:24-10:27, Paper WeA01.9 | |
Explainable Optimal Solutions Using Fuzzy Inference |
|
Deneke, Tewodros Lemma | University of Texas at Austin |
Dunia, Ricardo | The University of Texas at Austin |
Baldea, Michael | The University of Texas at Austin |
Keywords: Fuzzy systems, Optimization, Machine learning
Abstract: Optimization-based decision-making in practical settings often produces solutions that are counter-intuitive to operators and human decision makers. In this paper, we propose an approach for explaining optimization results. A fuzzy inference model based on a set of observations generated by solving an optimization problem is developed, and the logical rules are used to explain the optimization results. The proposed approach is demonstrated using a realistic example of the unit commitment problem.
|
|
10:27-10:30, Paper WeA01.10 | |
Solving Two-Player General-Sum Game between Swarms |
|
Ghimire, Mukesh | Arizona State University |
Zhang, Lei | Arizona State University |
Zhang, Wenlong | Arizona State University |
Ren, Yi | Arizona State University |
Xu, Zhe | Arizona State University |
Keywords: Game theory, Machine learning
Abstract: Hamilton-Jacobi-Isaacs (HJI) PDEs are the governing equations for the two-player general-sum games. Unlike Reinforcement Learning (RL) methods, which are data-intensive methods for learning value function, learning HJ PDEs provide a guaranteed convergence to the Nash Equilibrium value of the game when it exists. However, a caveat is that solving HJ PDEs becomes intractable when the state dimension increases. To circumvent the curse of dimensionality (CoD), physics-informed machine learning methods with supervision can be used and have been shown to be effective in generating equilibrial policies in two-player general-sum games. In this work, we extend the existing work on agent-level two-player games to a two-player swarm-level game, where two sub-swarms play a general-sum game. We consider the Kolmogorov forward equation as the dynamic model for the evolution of the densities of the swarms. Results show that policies generated from the physics-informed neural network (PINN) result in a higher payoff than a Nash Deep Q-Network (Nash DQN) agent and have comparable performance with numerical solvers.
|
|
10:30-10:33, Paper WeA01.11 | |
Empowering Hybrid Models with Attention-Based Time-Series Transformers: A Case Study in Batch Crystallization |
|
Sitapure, Niranjan | Texas A&M University |
Kwon, Joseph | Texas A&M University |
Keywords: Grey-box modeling, Machine learning, Neural networks
Abstract: Due to safety concerns, direct implementation of black-box tools for fully data-driven deep-neural-network (DNN) digital twins faces challenges. To overcome this, hybrid models that combine physics-based first principles (FP) with machine learning (ML) have gained popularity, seen as a `best of both worlds' solution. However, current simple DNN models struggle to predict long-term process data evolution. Recently, time-series transformers (TSTs) using multi-headed attention mechanisms have shown excellent predictive performance for a wide array of time-series prediction tasks. To this end, a novel TST-based hybrid modeling framework for batch crystallization was developed, offering enhanced accuracy and interpretability compared to traditional black-box models. Specifically, the developed TST-based hybrid model allows online estimation of instantaneous growth and nucleation rate (i.e., mathbf{G}&mathbf{B}) for any arbitrarily chosen operating condition in batch crystallization of dextrose. The estimated mathbf{G}&mathbf{B} values can then be directly fed to an FP crystallization model to generate the evolution of crystal size distribution (CSD) and other system states. Moreover, the developed TST-based hybrid model achieved a normalized mean square error in the range of [10, 50]times10^{-4} and an R^2 value exceeding 0.99 for predicting system states. Moreover, the proposed TST-based hybrid model utilizes powerful attention mechanisms to (a) under short and long-term contextual information, and (b) seamlessly learn the functional dependence of system states, thereby setting the foundation stones for next-generation attention-powered hybrid models.
|
|
10:33-10:36, Paper WeA01.12 | |
An Example of Synthetic Data Generation for Control Systems Using Generative Adversarial Networks: Zermelo Minimum-Time Navigation |
|
Bapat, Nachiket | Worcester Polytechnic Institute |
Paffenroth, Randy C. | Worcester Polytechnic Institute |
Cowlagi, Raghvendra V. | Worcester Polytechnic Institute |
Keywords: Grey-box modeling, Simulation, Machine learning
Abstract: Real-world data of the operation of control systems is often scarce because experiments are expensive and time-consuming. We address the problem of synthetic data generation, namely, the problem of generating data about the operation of a physical system through computational means. In this paper, we report generative adversarial network (GAN) models that can incorporate training data and governing equations underlying the operation of a control system. Instead of over-generalization, we restrict the discussion to the particular example, namely, the Zermelo navigation problem of a vehicle navigating a drift field (e.g., wind) in minimum time. Owing to the nature of the example chosen, we find algebraic governing equations. We propose GAN models that learn to generate trajectories that not only resemble the training data but also satisfy these governing equations. We compare the proposed models to a standard GAN model that uses training data, only, i.e., neglects the governing equations, to demonstrate that the proposed models outperform the standard model on several metrics. To the best of our knowledge, this is the first work on generative models for control systems.
|
|
10:36-10:39, Paper WeA01.13 | |
Safe Reinforcement Learning Using Model Predictive Control with Probabilistic Control Barrier Function |
|
Shen, Xun | Osaka University |
Wachi, Akifumi | LINE Corporation |
Hashimoto, Wataru | Osaka University |
Hashimoto, Kazumune | Osaka University |
Takai, Shigemasa | Osaka Univ |
Keywords: Learning, Markov processes, Machine learning
Abstract: Practical applications of reinforcement learning (RL) often demand that the agents explore safety by satisfying designed constraints. The constraints for safety cannot be satisfied with probability 1 when an unbounded stochastic uncertainty is present. One way is to relax the hard constraints into chance constraints and to consider safe RL with chance constraints. This paper addresses safe RL with chance constraints by Model Predictive Control (MPC) with Probabilistic Control Barrier Function (PCBF). MPC with PCBF is used as a function approximator to deliver the policy that satisfies the demanded chance constraints. RL is used to optimize the parameters in MPC with PCBF to improve the closed performance. We prove that using PCBF as a constraint in MPC ensures the safety imposed by the chance constraints. A scenario-based algorithm is designed for the proposed safe RL implementation. A quadrotor system control problem in turbulent conditions has been used as a numerical example to validate the proposed safe RL method.
|
|
10:39-10:42, Paper WeA01.14 | |
Min-Max Optimization under Delays |
|
Adibi, Arman | Princeton University |
Mitra, Aritra | North Carolina State University |
Hassani, Hamed | University of Pennsylvania |
Keywords: Optimization, Optimization algorithms, Machine learning
Abstract: Delays and asynchrony are inevitable in large-scale machine-learning problems where communication plays a key role. As such, several works have extensively analyzed stochastic optimization with delayed gradients. However, as far as we are aware, no analogous theory is available for min-max optimization, a topic that has gained recent popularity due to applications in adversarial robustness, game theory, and reinforcement learning. Motivated by this gap, we examine the performance of standard min-max optimization algorithms with delayed gradient updates. First, we show (empirically) that even small delays can cause prominent algorithms like Extra-gradient (texttt{EG}) to diverge on simple instances for which texttt{EG} guarantees convergence in the absence of delays. Our empirical study thus suggests the need for a careful analysis of delayed versions of min-max optimization algorithms. Accordingly, under suitable technical assumptions, we prove that Gradient Descent-Ascent (texttt{GDA}) and texttt{EG} with delayed updates continue to guarantee convergence to saddle points for convex-concave and strongly convex-strongly concave settings. Our complexity bounds reveal, in a transparent manner, the slow-down in convergence caused by delays.
|
|
10:42-10:45, Paper WeA01.15 | |
Developing an Efficient Model for a SOFC System Using Self-Supervised Convolutional Autoencoder and Stateful LSTM Network |
|
Tofigh, Mohamadali | University of Alberta |
Salehi, Zeynab | University of Alberta |
Smith, Daniel | Cummins |
Ali, Kharazmi | Cummins Inc |
Amir, Hanifi Yazdi | University of Alberta |
Koch, Charles Robert | University of Alberta |
Shahbakhti, Mahdi | University of Alberta |
Keywords: Neural networks, Identification for control, Machine learning
Abstract: The development of an efficient dynamic model for solid oxide fuel cell (SOFC) systems is essential for control strategies, process optimization, and fault diagnostics - key steps toward maximizing power generation and extending their lifetime. Long-Short-Term-Memory (LSTM), a potent type of recurrent neural network, has proven to be adept at modeling intricate physical dynamic systems using extensive input-output data. They are particularly well-suited for capturing long-range dependencies and temporal patterns. Nevertheless, due to memory and computing constraints in practice, training an LSTM network to effectively learn long-term dependency is challenging, which stems from the sequential processing nature of LSTMs. In this study, a self-supervised convolutional autoencoder (AE) is developed to learn a concise yet informative temporal representation of historical input data. These compressed hidden sequences are then input into an LSTM model, which is trained using a truncated backpropagation through time (TBPTT) algorithm. Once trained, an identical stateful LSTM model is reconstructed using the learned parameters, enabling the LSTM to predict the voltage using only the most recent past input data, rather than a sequence of data over which the model was trained. The efficacy of the proposed framework is validated utilizing diverse experimental data collected from a lab-scale SOFC. The results indicate while the model properly identifies the underlying dynamics of the SOFC, it significantly reduces the training and runtime prediction computation costs of a traditional LSTM model by 38% and 42% respectively. This enhancement holds promise for improving real-time control and diagnostics of SOFC systems, ultimately contributing to their improved performance and reliability.
|
|
10:45-10:48, Paper WeA01.16 | |
Data-Driven Nonlinear System Identification of a Throttle Valve Using Koopman Representation |
|
Bongiovanni, Nicolas | Université Côte d’Azur, CNRS, I3S |
Mavkov, Bojan | Université Côte D'Azur |
Martins, Renato | Université De Bourgogne |
Allibert, Guillaume | Univ Cote d'Azur, CNRS, I3S |
Keywords: Identification for control, Nonlinear systems identification, Machine learning
Abstract: Electrical Throttle Bodies (ETBs) are massively used in the automotive industry and their modeling and control are challenging because of their high nonlinearity and stochasticity. In this paper, we present a data-driven method grounded on the Koopman operator for the identification of a real ETB valve. The model obtained is control-oriented and represented in a quasi-linear-parameter varying framework. Different experiments are performed to evaluate the performance of the proposed method, including the comparison with two classical nonlinear system identification methods.
|
|
10:48-10:51, Paper WeA01.17 | |
An Effective Hyperparameter Tuning Method for Ising Machines in Practical Use |
|
Kakuko, Norihiro | Fujitsu LTD |
Parizy, Matthieu | Fujitsu LTD |
Keywords: Optimization, Optimization algorithms, Machine learning
Abstract: In this paper, we propose novel Ising machines hyperparameter tuning techniques for practical use when handling multiple combinatorial problem instances in a short period of time. Firstly, we confirm the performance of a well-known hyperparameter tuning technique, Tree-structured Parzen Estimator (TPE), when targeting a single input instance within a long period of time. Secondly, we propose an efficient method consisting of a generic hyperparameter tuning and a short-time specialized hyperparameter tuning for multiple instances. Since the generic hyperparameter tuning to cover a wide range of instances in a certain category is done in advance, taking time for it is acceptable. The specialized hyperparameter tuning targeting a tuning time of several minutes, reflecting practical use, runs when the performance of a generic hyperparameter set is below expectations. Thirdly, we compare the proposed method to the TPE based tuning to show its effectiveness. For experiments, well-known Travel Salesman Problem (TSP) and Quadratic Assignment Problem (QAP) instances are used as input. The Ising machine used is Fujitsu’s third generation Digital Annealer (DA). Results show that the generic hyperparameter tuning outperforms the TPE based tuning by 2 times in GAP, which indicates the quality of the solution, and the specialized hyperparameter sets have 1.4 times better performance than the generic ones in GAP.
|
|
10:51-10:54, Paper WeA01.18 | |
Data-Efficient Uncertainty-Guided Model-Based Reinforcement Learning with Unscented Kalman Bayesian Neural Networks |
|
Wu, Xinyang | Fraunhofer IPA |
Wedernikow, Elisabeth | Fraunhofer IPA |
Huber, Marco | University of Stuttgart |
Keywords: Predictive control for nonlinear systems, Kalman filtering, Machine learning
Abstract: In recent years, reinforcement learning (RL) has made substantial progress by providing feasible solutions to many planning and control problems. However, the majority of RL algorithms, particularly model-free RL, suffer from low learning efficiency. To address this issue, this paper proposes the utilization of the Kalman Bayesian neural network (KBNN) to learn a tractable dynamics model of a system from data that captures uncertainties of the system state. Additionally, we employ the unscented Kalman filter to propagate these uncertainties over the control horizon, and we exploit the propagated uncertainties explicitly in the cost function. This approach presents a novel solution to improve data-efficiency in RL. This is validated on classic control problems in a comparative analysis against state-of-the-art methods.
|
|
10:54-10:57, Paper WeA01.19 | |
Fast Long-Term Multi-Scenario Prediction for Maneuver Planning at Unsignalized Intersections |
|
Mertens, Max Bastian | Ulm University |
Ruof, Jona | Ulm University |
Strohbeck, Jan | Ulm University |
Buchholz, Michael | Universität Ulm |
Keywords: Multivehicle systems, Modeling, Machine learning
Abstract: Motion prediction for intelligent vehicles typically focuses on estimating the most probable future evolutions of a traffic scenario. Estimating the gap acceptance, i.e., whether a vehicle merges or crosses before another vehicle with the right of way, is often handled implicitly in the prediction. However, an infrastructure-based maneuver planning can assign artificial priorities between cooperative vehicles, so it needs to evaluate many more potential scenarios. Additionally, the prediction horizon has to be long enough to assess the impact of a maneuver. We, therefore, present a novel long-term prediction approach handling the gap acceptance estimation and the velocity prediction in two separate stages. Thereby, the behavior of regular vehicles as well as priority assignments of cooperative vehicles can be considered. We train both stages on real-world traffic observations to achieve realistic prediction results. Our method has a competitive accuracy and is fast enough to predict a multitude of scenarios in a short time, making it suitable to be used in a maneuver planning framework.
|
|
WeA02 |
Harbour |
RI: Network and Multi-Agent Systems |
RI Session |
Chair: Kumar, Gautam | San Jose State University |
Co-Chair: Yuan, Yukun | University of Tennessee at Chattanooga |
|
10:00-10:03, Paper WeA02.1 | |
A Model of Chaperone Competition in Bacterial Gene Regulatory Networks |
|
Nolan, Nicholas | Massachusetts Institute of Technology |
Laub, Michael | Massachusetts Institute of Technology |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Biomolecular systems, Systems biology, Cellular dynamics
Abstract: Chaperones are a global resource within cellular biomolecular systems, ensuring that proteins are properly folded and preventing that aggregation leads to cell death. Introducing genetic circuits to a cell may place a load on these folding resources, resulting in unintended coupling between otherwise independent circuits' behavior. Previous analyses have considered loading effects on other cellular resources --- such as gene expression resources --- but have not included chaperone-enabled folding. In this paper, we model two chaperone modalities, encapsulating two important classes of chaperones as well as their potential interactions. We identify distinct responses that arise from the different architectures which can be either competitive or activating. This work indicates that native cellular chaperones may have built-in control architectures to mitigate loading by an increased demand from chaperone-reliant proteins.
|
|
10:03-10:06, Paper WeA02.2 | |
Opinion-Based Task Allocation Strategy for Mobile Sensor Networks |
|
Zhang, Ziqiao | Georgia Institute of Technology |
Wu, Wencen | San Jose State University |
Zhang, Fumin | Georgia Institute of Technology |
Keywords: Sensor networks, Agents-based systems, Networked control systems
Abstract: In this paper, we propose an opinion-based task allocation strategy for mobile sensor networks to conduct simultaneous multi-source tracking while doing measurement collection and information estimation. Mobile sensors form opinions of the field based on the information obtained by sensors, and opinion dynamics describe how sensors communicate with each other and generate decisions on task assignments for multi-source tracking. Triggering conditions of running opinion dynamics for task allocation are also provided, determining when it is necessary to divide sensors into groups. Simulation results with a 12-sensor network tracking two different sources validate the proposed opinion-based task allocation strategy.
|
|
10:06-10:09, Paper WeA02.3 | |
Multi-Agent Deep Reinforcement Learning for Energy Management in Grid-Responsive Networked Greenhouses |
|
Ajagekar, Akshay | Cornell University |
Decardi-Nelson, Benjamin | University of Alberta |
You, Fengqi | Cornell University |
Keywords: Control applications, Process Control, Chemical process control
Abstract: Greenhouses play a pivotal role in achieving food security and paving the way for sustainable urban agriculture. Yet, to optimize crop growth, they demand substantial energy, mainly for climate control and artificial lighting. Given this energy-intensive nature, greenhouses are prime candidates for demand response programs, enhancing the power grid’s flexibility and resilience in urban areas. In this study, we propose a multi-agent deep reinforcement learning (DRL) framework to manage energy in interconnected renewable energy integrated greenhouses. The framework addresses the challenges of fluctuating on-site renewable energy outputs, and interaction with a dynamic electricity tariff grid. Our multi-agent DRL strategy employs an actor-critic algorithm infused with a shared attention mechanism to ensure scalability and performance. We demonstrate the efficacy of our approach using a New York City (NYC) case study with a network of five greenhouses in each borough. We also compare the performance of our multi-agent DRL method to that of the rule-based and the deep deterministic policy gradient algorithms.
|
|
10:09-10:12, Paper WeA02.4 | |
Optimization-Based Countering of Misinformation on Social Networks |
|
Bayiz, Yigit Ege | The University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Control of networks, Network analysis and control, Large-scale systems
Abstract: False information is prevalent on social media platforms, necessitating effective countering methods to combat its spread. We propose an algorithm that reduces the false information spread while preserving the spread of correct information. We model the social media network as a random network of users in which each news item propagates in the network in consecutive cascades. Existing studies suggest that the cascade behaviors of misinformation and correct information are affected differently by user polarization and reflexivity. We show that this difference can be used to alter network dynamics in a way that selectively hinders the spread of misinformation content. To implement these alterations, we introduce an optimization-based probabilistic dropout method that randomly removes connections between users to achieve minimal propagation of misinformation. We test the algorithm's effectiveness using simulated social networks. In our tests, we use both synthetic network structures based on stochastic block models, and natural network structures that are generated using random sampling of data collected from Twitter. The results show that on average the algorithm decreases the cascade size of misinformation content by up to 70% in synthetic network tests and up to 45% in natural network tests while maintaining a branching ratio of at least 1.5 for correct information.
|
|
10:12-10:15, Paper WeA02.5 | |
Distributed Multi-Agent Interaction Generation with Imagined Potential Games |
|
Sun, Lingfeng | University of California, Berkeley |
Hung, Pin-Yun | University of California, Berkeley |
Wang, Changhao | University of California, Berkeley |
Tomizuka, Masayoshi | Univ of California, Berkeley |
Xu, Zhuo | UC Berkeley |
Keywords: Distributed control, Simulation, Multivehicle systems
Abstract: Interactive behavior modeling of multiple agents is an essential challenge in simulation, especially in scenarios when agents need to avoid collisions and cooperate at the same time. Humans can interact with others without explicit communication and navigate in scenarios when cooperation is required. In this work, we aim to model human interactions in this realistic setting, where each agent acts based on its observation and does not communicate with others. We propose a framework based on distributed potential games, where each agent imagines a cooperative game with other agents and solves the game using its estimation of their behavior. We utilize iLQR to solve the games and closed-loop simulate the interactions. We demonstrate the benefits of utilizing distributed imagined games in our framework through various simulation experiments. We show the high success rate, the increased navigation efficiency, and the ability to generate rich and realistic interactions with interpretable parameters. Illustrative examples are available at https://sites.google.com/berkeley.edu/distributed-interacti on
|
|
10:15-10:18, Paper WeA02.6 | |
Control of Misinformation with Safety and Engagement Guarantees |
|
Amini, Arash | The University of Texas at Austin |
Bayiz, Yigit Ege | The University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Information theory and control, Information technology systems, Network analysis and control
Abstract: Misinformation (e.g., rumors and false information) on social networks poses substantial threats in various domains, underscoring the urgent need for interventions to address misinformation propagation. We present an edge-based optimal control algorithm to minimize the spread of misinformation over social networks. The proposed algorithm preserves the level of information each network node receives while preventing rumors from going viral. We introduce a localized version of the algorithm to enable scaling to large networks. Localization is based on removing any connecting edges between two nodes in the underlying graph that exceed the predefined localization distance in the latent space, whereby the probability of interaction between the two nodes decays exponentially with the distance between them in the latent space. The proposed localized control algorithm compensates for the truncation errors and retains the baseline algorithm's guarantees while reducing computational complexity. Empirical studies applying the baseline algorithm on random spatial networks and its localized counterpart on large-scale social networks extracted from 100 million Twitter messages reduced the ratio of infected users by 8% in synthetic networks and up to 6% on real-world social networks while preserving the level of information each node receives and preventing viral rumors.
|
|
10:18-10:21, Paper WeA02.7 | |
Controllability-Constrained Deep Network Models for Enhanced Control of Dynamical Systems |
|
Sharma, Suruchi | San Jose State University |
Makarenko, Volodymyr | San Jose State University |
Kumar, Gautam | San Jose State University |
Tiomkin, Stas | San Jose State University |
Keywords: Iterative learning control, Learning, Variational methods
Abstract: Control of a dynamical system without the knowledge of dynamics is an important and challenging task. Modern machine learning approaches, such as deep neural networks (DNNs), allow for the estimation of a dynamics model from control inputs and corresponding state observation outputs. Such data-driven models are often utilized for the derivation of model-based controllers. However, in general, there are no guarantees that a model represented by DNNs will be controllable according to the formal control-theoretical meaning of controllability, which is crucial for the design of effective controllers. This often precludes the use of DNN-estimated models in applications, where formal controllability guarantees are required. In this proof-of-the-concept work, we propose a control-theoretical method that explicitly enhances models estimated from data with controllability. That is achieved by augmenting the model estimation objective with a controlla- bility constraint, which penalizes models with a low degree of controllability. As a result, the models estimated with the proposed controllability constraint allow for the derivation of more efficient controllers, they are interpretable by the control- theoretical quantities and have a lower long-term prediction error. The proposed method provides new insights on the connection between the DNN-based estimation of unknown dynamics and the control-theoretical guarantees of the solution properties. We demonstrate the superiority of the proposed method in two standard classical control systems with state observation given by low resolution high-dimensional images.
|
|
10:21-10:24, Paper WeA02.8 | |
A Mixing-Accelerated Primal-Dual Proximal Algorithm for Distributed Nonconvex Optimization |
|
Ou, Zichong | ShanghaiTech University |
Qiu, Chenyang | University of Virginia |
Wang, Dandan | ShanghaiTech University |
Lu, Jie | ShanghaiTech University |
Keywords: Networked control systems, Optimization, Optimization algorithms
Abstract: In this paper, we develop a distributed mixing-accelerated primal-dual proximal algorithm, referred to as MAP-Pro, which enables nodes in multi-agent networks to cooperatively minimize the sum of their nonconvex, smooth local cost functions in a decentralized fashion. The proposed algorithm is constructed upon minimizing a computationally inexpensive augmented-Lagrangian-like function and incorporating a time-varying mixing polynomial to expedite information fusion across the network. The convergence results derived for MAP-Pro include a sublinear rate of convergence to a stationary solution and, under the Polyak-{L}ojasiewics (P-{L}) condition, a linear rate of convergence to the global optimal solution. Additionally, we may embed the well-noted Chebyshev acceleration scheme in MAP-Pro, which generates a specific sequence of mixing polynomials with given degrees and enhances the convergence performance based on MAP-Pro. Finally, we illustrate the competitive convergence speed and communication efficiency of MAP-Pro via a numerical example.
|
|
10:24-10:27, Paper WeA02.9 | |
Consensus Control with Safety Guarantee: An Application to the Kinematic Bicycle Model |
|
Niu, Kaicheng | Georgia Institute of Technology |
Abdallah, Chaouki T. | Georgia Institute of Technology |
Hayajneh, Mohammad | United Arab Emirates University |
Keywords: Networked control systems, Agents-based systems, Multivehicle systems
Abstract: This paper proposes a consensus controller for multi-agent systems that can guarantee the agents' safety. The controller, built with the idea of output prediction and the Newton-Raphson method, achieves consensus for a class of heterogeneous nonlinear systems. The Integral Control Barrier Function is applied in conjunction with the controller, such that the agents' states are confined within pre-defined safety sets. Due to the dynamically-defined control input, the resulting optimization problem from the barrier function is always a Quadratic Program, despite the nonlinearities that the system dynamics may have. We verify the proposed controller using a platoon of autonomous vehicles modeled by kinematic bicycles. A convergence analysis of the leader-follower consensus under the path graph topology is conducted. Simulation results show that the vehicles achieve consensus while keeping safe inter-agent distances, suggesting a potential in future applications.
|
|
10:27-10:30, Paper WeA02.10 | |
An Auxiliary Graph for Clock Rigidity Analysis |
|
Wen, Ruixin | University of Melbourne |
Schoof, Eric | University of Melbourne |
Chapman, Airlie | University of Melbourne |
Keywords: Networked control systems
Abstract: Clock rigidity theory argues that a specific graph condition can guarantee the solvability of the clock parameters in a time-of-arrival (TOA) based sensor network, where the graph represents the network topology. An auxiliary graph has been proposed to establish the connection between clock rigidity and bearing rigidity, but what is lacking in the literature are methods to leverage the auxiliary graph in the analysis of clock rigidity problems. In this paper, we propose a generalized definition of the auxiliary graph and demonstrate the usage of the auxiliary graph for examining clock configuration genericity in applications. We also study a bearing-based method for examining clock rigidity of a multigraph network, where multiple edges between the same node pairs represent multiple different TOA measurements, extending the families of graphs that can be analyzed using existing clock rigidity theory.
|
|
10:30-10:33, Paper WeA02.11 | |
Distributed Least-Squares Optimization Solvers with Differential Privacy |
|
Liu, Weijia | Zhejiang University |
Wang, Lei | Zhejiang University |
Guo, Fanghong | Zhengjiang University of Technology |
Wu, Zheng-Guang | Zhejiang University |
Su, Hongye | Zhejiang Univ |
Keywords: Optimization, Optimization algorithms, Networked control systems
Abstract: This paper studies the distributed least-squares optimization problem with differential privacy requirement of local cost functions, for which two differentially private distributed solvers are proposed. The first is established on the distributed gradient tracking algorithm, by appropriately perturbing the initial values and parameters that contain the privacy-sensitive data with Gaussian and truncated Laplacian noises, respectively. Rigorous proofs are established to show the achievable tradeoff between the (ϵ, δ)-differential privacy and the computation accuracy. The second solver is established on the combination of the distributed shuffling mechanism and the average consensus algorithm, which enables each agent to obtain a noisy version of parameters characterizing the global gradient. As a result, the least squares optimization problem can be eventually solved by each agent locally in such a way that any given (ϵ, δ)-differential privacy requirement can be preserved while the solution may be computed with the accuracy independent of the network size, which makes the latter more suitable for large scale distributed least-squares problems. Numerical simulations are presented to show the effectiveness of both solvers.
|
|
10:33-10:36, Paper WeA02.12 | |
Leveraging Untrustworthy Commands for Multi-Robot Coordination in Unpredictable Environments: A Bandit Submodular Maximization Approach |
|
Xu, Zirui | University of Michigan |
Lin, Xiaofeng | Boston University |
Tzoumas, Vasileios | University of Michigan, Ann Arbor |
Keywords: Optimization algorithms, Cooperative control, Autonomous robots
Abstract: We study the problem of multi-agent coordination in unpredictable and partially-observable environments with untrustworthy external commands. The commands are actions suggested to the robots, and are untrustworthy in that their performance guarantees, if any, are unknown. Such commands may be generated by human operators or machine learning algorithms and, although untrustworthy, can often increase the robots’ performance in complex multi-robot tasks. We are motivated by complex multi-robot tasks such as target tracking, environmental mapping, and area monitoring. Such tasks are often modeled as submodular maximization problems due to the information overlap among the robots. We provide an algorithm, Meta Bandit Sequential Greedy (MetaBSG), which enjoys performance guarantees even when the external commands are arbitrarily bad. MetaBSG leverages a meta-algorithm to learn whether the robots should follow the commands or a recently developed submodular coordination algorithm, Bandit Sequential Greedy (BSG) [1], which has performance guarantees even in unpredictable and partially-observable environments. Particularly, MetaBSG asymptotically can achieve the better performance out of the commands and the BSG algorithm, quantifying its suboptimality against the optimal time-varying multi-robot actions in hindsight. Thus, MetaBSG can be interpreted as robustifying the untrustworthy commands. We validate our algorithm in simulated scenarios of multi-target tracking.
|
|
10:36-10:39, Paper WeA02.13 | |
Controlled Sensing for Communication-Efficient Filtering and Smoothing in POMDPs |
|
Liu, Changrong | University of Melbourne |
Molloy, Timothy L. | Australian National University |
Nair, Girish N. | University of Melbourne |
Keywords: Sensor networks, Stochastic optimal control, Markov processes
Abstract: The trade-off between communication resources and estimation accuracy is widely considered in sensor networks. In this paper we consider the problem of estimating the trajectory of an event-triggered hidden Markov model, {where the controller decides at each time step whether or not the sensor should sample and transmit a measurement to the estimator}. Adopting a Shannon information-theoretic point of view, we quantify the required communication resources by the entropy of the {transmitted observation sequence}, with a special symbol to denote non-transmission. Furthermore we evaluate the trajectory uncertainty by the conditional entropy of the state sequence given the received observations. Simultaneous minimization of the communication resources and state uncertainty is formulated and solved within a partially observable Markov decision process framework, yielding a threshold policy for triggering transmissions.
|
|
10:39-10:42, Paper WeA02.14 | |
Distributed Model Predictive Control of Integrated Process Networks Based on an Adaptive Community Detection Approach |
|
Ebrahimi, AmirMohammad | Kansas State University |
Babaei Pourkargar, Davood | Kansas State University |
Keywords: Chemical process control, Distributed control, Predictive control for nonlinear systems
Abstract: An adaptive community detection-based system decomposition approach is developed for distributed model predictive control (DMPC) of integrated process networks. In this approach, the weighted graph representation of the nonlinear state space model is utilized as the foundation for system decomposition. The most modular decomposition is identified for designing a distributed architecture by applying spectral community detection and energy-based separation techniques. The resulting decomposition evolves as the process network transitions through different operating conditions. Consequently, the distributed architecture and the DMPC design are adjusted to improve closed-loop performance and robustness. To illustrate the efficacy of the proposed approach, a benzene alkylation process benchmark under different operating conditions is used. The results of the simulation studies demonstrate that the decompositions derived through spectral community detection, based on the weighted graph representations, lead to enhanced closed-loop performance and improved computational efficiency compared to the commonly used unweighted hierarchical community-based decompositions.
|
|
10:42-10:45, Paper WeA02.15 | |
Fairness-Aware Electric Taxi Fleet Coordination under Short-Term Power System Failures |
|
Yuan, Yukun | University of Tennessee at Chattanooga |
Ding, Zihan | Stony Brook University |
Lin, Shan | State University of New York |
Keywords: Transportation networks, Control of networks, Intelligent systems
Abstract: Power outages and shortages, among other disruptive events, can markedly diminish the charging efficiency of EV fleets, e.g., e-taxis, and compromise their service quality. This paper aims to address the challenge of coordinating e-taxis amidst such power system disruptions. To understand the extent of the problem, we employ a trace-driven simulation to measure how short-term power failures influence e-taxi service quality. Our observations highlight drops in passenger service in affected areas, pointing to a potential disparity in service quality across various regions of a city. In response, we introduce the Fairness-Aware e-taxi fleet Coordination (FAC) algorithm. FAC monitors the power system disruptions and dynamically switches between two control strategies: one that optimizes city-wide performance during normal operations, and another that emphasizes both service fairness and system performance during power disruptions. We put FAC to the trace-driven evaluation using a comprehensive dataset from an existing e-taxi ecosystem, comprising almost 8,000 taxis and averaging 62,100 taxi trips daily. Our data-driven evaluation shows the effectiveness of our solution in terms of providing fair service quality across regions and enhancing the service quality in the affected regions and a city.
|
|
10:45-10:48, Paper WeA02.16 | |
A Geometric Approach to Resilient Distributed Consensus Accounting for State Imprecision and Adversarial Agents |
|
Lee, Christopher | University of Texas at Dallas |
Abbas, Waseem | University of Texas at Dallas |
Keywords: Control of networks, Networked control systems, Cooperative control
Abstract: This paper presents a novel approach for resilient distributed consensus in multiagent networks when dealing with adversarial agents and imprecision in states observed by normal agents. Traditional resilient distributed consensus algorithms often presume that agents have exact knowledge of their neighbors' states, which is unrealistic in practical scenarios. We show that such existing methods are inadequate when agents only have access to imprecise states of their neighbors. To overcome this challenge, we adapt a geometric approach and model an agent's state by an `imprecision region' rather than a point in mathbb{R}^d. From a given set of imprecision regions, we first present an efficient way to compute a region that is guaranteed to lie in the convex hull of true, albeit unknown, states of agents. We call this region the emph{invariant hull} of imprecision regions and provide its geometric characterization. Next, we use these invariant hulls to identify a emph{safe point} for each normal agent. The safe point of an agent lies within the convex hull of its emph{normal} neighbors' states and hence is used by the agent to update it's state. This leads to the aggregation of normal agents' states to safe points inside the convex hull of their initial states, or an approximation of consensus. We also illustrate our results through simulations.
|
|
10:48-10:51, Paper WeA02.17 | |
Synchronize the Parafoil and the Vessel: A Hierarchical Distributed Nonlinear Model Predictive Control Approach |
|
Wei, Zhenyu | Zhejiang University |
Gao, Yan | Zhejiang University |
Shao, Zhijiang | Zhejiang University |
Keywords: Multivehicle systems, Cooperative control, Predictive control for nonlinear systems
Abstract: Emerging recovery missions bring the need to recover the parafoil system with the vessel. However, current literature of the parafoil system focuses on the fixed-point landing. This unavoidably hinders the cooperation between the parafoil system and the vessel, which limits the flexibility of payload recovery missions. This paper aims to design the distributed control algorithm for cooperative synchronization between the parafoil system and the vessel. Firstly, the cooperative synchronization process is formulated as a trajectory optimization problem based on the dynamics of the parafoil system and the vessel. To reduce the transmission burden in coordinating the two vehicles, the hierarchical distributed control approach is designed, which incorporates the synchronization point update algorithm to coordinate the two vehicles and the nonlinear model predictive control method to leverage the nonlinear dynamic model for generating the control command. Simulation results demonstrate the effectiveness of the proposed hierarchical distributed model predictive control approach for synchronizing the parafoil and the vessel under disturbances. The findings from this study will benefit the design of cooperative recovery algorithms for future payload recovery missions.
|
|
10:51-10:54, Paper WeA02.18 | |
Guarding a Target Area from a Heterogeneous Group of Cooperative Attackers |
|
Lee, Yoonjae | The University of Texas at Austin |
Das, Goutam | George Mason University |
Shishika, Daigo | George Mason University |
Bakolas, Efstathios | The University of Texas at Austin |
Keywords: Game theory, Agents-based systems, Autonomous systems
Abstract: In this paper, we investigate a multi-agent target guarding problem in which a single defender seeks to capture multiple attackers aiming to reach a high-value target area. In contrast to previous studies, the attackers herein are assumed to be emph{heterogeneous} in the sense that they have not only different speeds but also different weights representing their respective degrees of importance (e.g., the amount of allocated resources). The objective of the attacker team is to jointly minimize the weighted sum of their final levels of proximity to the target area, whereas the defender aims to maximize the same value. Using geometric arguments, we construct candidate equilibrium control policies that require the solution of a (possibly nonconvex) optimization problem. Despite its nonconvex nature, we establish a sufficient condition for this problem to admit a unique local minimum that is also global. Subsequently, we validate the optimality of the candidate control policies using parametric optimization techniques. Lastly, we provide numerical examples to illustrate how cooperative behaviors emerge within the attacker team due to their heterogeneity.
|
|
10:54-10:57, Paper WeA02.19 | |
Heterogeneous Multi-Agent Reinforcement Learning Based on Adaptive Curiosity for Traffic Signal Control |
|
Pan, Yue | Tongji University |
Lei, Jinlong | Tongji University |
Yi, Peng | Tongji University |
Keywords: Traffic control, Learning, Markov processes
Abstract: Intelligent Traffic Signal Control (TSC) aims to optimize urban traffic management. However, the traditional fixed-cycle intersection signal stands as one of the main factors causing traffic congestion. In addition, with the burgeoning complexity of traffic scenarios and escalating travel demands, TSC becomes a daunting challenge. Recently, the exploration of reinforcement learning in the domain of TSC has emerged as a significant research frontier because it can handle complex scenarios and offers scalability. We follow the framework and propose a multi-agent reinforcement learning (MARL) methodology, in which we incorporate two pivotal enhancements. First of all, to ensure efficacy and stability of training, our method seamlessly combines the Actor-Critic framework with the double Q-network. Secondly, we introduce an advanced adaptive curiosity mechanism to exploit intrinsic rewards founded on the agents' varied action space properties, and hence offers a personalized exploration guide for each agent. Our empirical validation, implemented on the Simulation of Urban Mobility (SUMO) platform, covers both homogeneous grid environment and heterogeneous city setting. Notably, in comparison with traditional cyclic scheduling methods and existing advanced MARL solutions, our algorithm exhibited better traffic flow efficiency and robustness.
|
|
WeA03 |
Frontenac |
RI: Autonomous Robots and Systems |
RI Session |
Chair: Leang, Kam K. | University of Utah |
Co-Chair: Beard, Randal W. | Brigham Young Univ |
|
10:00-10:03, Paper WeA03.1 | |
On XYZ-Motion Planning Using a Full Car Model |
|
Chakraborty, Sayan | New York University |
Jiang, Yu | ClearMotion, Inc |
Jiang, Zhong-Ping | New York University |
Keywords: Automotive systems, Autonomous systems, Simulation
Abstract: In this study, we tackle the XYZ-motion planning problem for autonomous vehicles equipped with active suspension systems. Using perception data, we derive two road surface approximations: the encoded road and the estimated road. Using the estimated road, we first form a nonlinear optimization problem to obtain an XYZ-motion plan. For practical applicability, this optimization problem is split into two steps. First, generating a XY path by using the information of soft and hard obstacles. Second, generating a vertical motion plan with the XY path and estimated road. We have tested our methodologies on two examples, where in the first example we have used a synthetic road to demonstrate online XYZ-motion planning and in the second example we have used real-world road surface data to demonstrate offline XYZ-motion planning. We have found that our algorithms give satisfactory results in both the test cases.
|
|
10:03-10:06, Paper WeA03.2 | |
Temporally Robust Multi-Agent STL Motion Planning in Continuous Time |
|
Verhagen, Joris | KTH Royal Institute of Technology |
Lindemann, Lars | University of Southern California |
Tumova, Jana | KTH Royal Institute of Technology |
Keywords: Autonomous robots, Formal verification/synthesis, Robust control
Abstract: Signal Temporal Logic (STL) is a formal language over continuous-time signals (such as trajectories of a multi-agent system) that allows for the specification of complex spatial and temporal system requirements (such as staying sufficiently close to each other within certain time intervals). To promote robustness in multi-agent motion planning with such complex requirements, we consider motion planning with the goal of maximizing the temporal robustness of their joint STL specification, i.e. maximizing the permissible time shifts of each agent's trajectory while still satisfying the STL specification. Previous methods presented temporally robust motion planning and control in a discrete-time optimization scheme. In contrast, we parameterize the trajectory by continuous Bézier curves, where the curvature and the time-traversal of the trajectory are parameterized individually. We show an algorithm generating continuous-time temporally robust trajectories and prove soundness of our approach. Moreover, we empirically show that our parametrization realizes this with a considerable speed-up compared to state-of-the-art methods based on constant interval time discretization.
|
|
10:06-10:09, Paper WeA03.3 | |
Structure from WiFi (SfW): RSSI-Based Geometric Mapping of Indoor Environments |
|
Kim, Junseo | Toronto Metropolitan University |
Aghyourli Zalat, Jill | Toronto Metropolitan University |
Bahoo, Yeganeh | Toronto Metropolitan University |
Saeedi, Sajad | Toronto Metropolitan University |
Keywords: Autonomous robots, Estimation, Intelligent systems
Abstract: With the rising prominence of WiFi in common spaces, efforts have been made in the robotics community to take advantage of this fact by incorporating WiFi signal measurements in indoor SLAM (Simultaneous Localization and Mapping) systems. SLAM is essential in a wide range of applications, especially in the control of autonomous robots. This paper describes recent work in the development of WiFi-based localization and addresses the challenges currently faced in achieving WiFi-based geometric mapping. Inspired by the field of research into k-visibility, this paper presents the concept of inverse k-visibility and proposes a novel algorithm that allows robots to build a map of the free space of an unknown environment, essential for planning, navigation, and avoiding obstacles. Experiments performed in simulated and real-world environments demonstrate the effectiveness of the proposed algorithm.
|
|
10:09-10:12, Paper WeA03.4 | |
Tracking Control of Optical Beam Transceivers Using Mean Field Models |
|
N'Doye, Ibrahima | King Abdullah University of Science and Technology (KAUST) |
Laleg-Kirati, Taous-Meriem | National Institute for Research in Digital Science and Technolog |
Keywords: Mean field games, Cooperative control
Abstract: This paper proposes mean field models to maintain an accurate line-of-sight and tracking between transceivers mounted in mobile unmanned aerial vehicles (UAVs) platforms in the presence of underlying mechanical vibration effects. We describe a two-way optical link beam tracking control that coordinates mobile UAVs in a network architecture to provide reliable network structure, distributed connectivity, and communicability, enhancing terrestrial public safety communication systems. We derive the optical transceiver trajectory tracking problem in which each agent dynamic and cost function is coupled with other optical beam transceiver agent states via a mean field term. We propose two optimal mean field beam tracking control frameworks through decentralized and centralized strategies in which the optical transceivers compete to reach a Nash equilibrium and cooperate to attain a social optimum, respectively. The solutions of these strategies are derived from forward-backward ordinary differential equations and rely on the linearity Hamilton-Jacobi-Bellman Fokker-Planck equations and stochastic maximum principle. Moreover, we numerically compute the solution pair of the resulting joint equations using Newton and fixed point iteration methods to verify the existence and uniqueness of the equilibrium and social optimum.
|
|
10:12-10:15, Paper WeA03.5 | |
Practical Considerations for Discrete-Time Implementations of Continuous-Time Control Barrier Function-Based Safety Filters |
|
Brunke, Lukas | University of Toronto |
Zhou, Siqi | University of Toronto |
Che, Mingxuan | Technical University of Munich |
Schoellig, Angela P | Technical University of Munich & University of Toronto |
Keywords: Autonomous robots
Abstract: Safety filters based on control barrier functions (CBFs) have become a popular method to guarantee safety for uncertified control policies, e.g., as resulting from reinforcement learning. Here, safety is defined as staying in a pre-defined set, the safe set, that adheres to the system's state constraints, e.g., as given by lane boundaries for a self-driving vehicle. In this paper, we examine one commonly overlooked problem that arises in practical implementations of continuous-time CBF-based safety filters. In particular, we look at the issues caused by discrete-time implementations of the continuous-time CBF-based safety filter, especially for cases where the magnitude of the Lie derivative of the CBF with respect to the control input is zero or close to zero. When overlooked, this filter can result in undesirable chattering effects or constraint violations. In this work, we propose three mitigation strategies that allow us to use a continuous-time safety filter in a discrete-time implementation with a local relative degree. Using these strategies in augmented CBF-based safety filters, we achieve safety for all states in the safe set by either using an additional penalty term in the safety filtering objective or modifying the CBF such that those undesired states are not encountered during closed-loop operation. We demonstrate the presented issue and validate our three proposed mitigation strategies in simulation and on a real-world quadrotor.
|
|
10:15-10:18, Paper WeA03.6 | |
Model Predictive Control with Reference Path Planning for Multi-UAV Formation Control System |
|
Chen, YuWen | National Taiwan University |
Chiang, Ming-Li | National Taiwan Ocean University |
Kuo, Guo-Rong | National Taiwan University |
Chuang, Che-Jung | National Taiwan University |
Fu, Li-Chen | National Taiwan University |
Keywords: Autonomous systems, Predictive control for nonlinear systems, Cooperative control
Abstract: In this paper, we propose a multi-UAV formation control system in obstacle-filled environments. Our design consists of two key components, which are the formation path planning algorithm and the model predictive control (MPC) for quadrotor agents. The proposed path planning algorithm generates a safe reference path for the formation center and secure waypoints for each agent to minimize pattern deformation. The designed MPC controller helps agents track the reference trajectory and maintain formation using data from neighboring agents. Our system also processes sensor data and models surrounding obstacles for the environment instead of only considering the scenarios with a known environment. Several simulation examples are provided to demonstrate the feasibility and effectiveness of the proposed design.
|
|
10:18-10:21, Paper WeA03.7 | |
Interaction-Aware Decision-Making for Autonomous Vehicles in Forced Merging Scenario Leveraging Social Psychology Factors |
|
Li, Xiao | University of Michigan, Ann Arbor |
Liu, Kaiwen | University of Michigan |
Tseng, H. Eric | Ford Motor Company |
Girard, Anouck | University of Michigan, Ann Arbor |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Autonomous systems, Stochastic optimal control, Intelligent systems
Abstract: Understanding the intention of vehicles in the surrounding traffic is crucial for an autonomous vehicle to successfully accomplish its driving tasks in complex traffic scenarios such as highway forced merging. In this paper, we consider a behavioral model that incorporates both social behaviors and personal objectives of the interacting drivers. Leveraging this model, we develop a receding-horizon control-based decision-making strategy, that estimates online the other drivers' intentions using Bayesian filtering and incorporates predictions of nearby vehicles' behaviors under uncertain intentions. The effectiveness of the proposed decision-making strategy is demonstrated and evaluated based on simulation studies in comparison with a game theoretic controller and a real-world traffic dataset.
|
|
10:21-10:24, Paper WeA03.8 | |
An Optimal Control Framework for Influencing Human Driving Behavior in Mixed-Autonomy Traffic |
|
Chari, Anirudh | Illinois Mathematics and Science Academy |
Chen, Rui | Carnegie Mellon University |
Grover, Jaskaran | CMU |
Liu, Changliu | Carnegie Mellon University |
Keywords: Autonomous systems, Multivehicle systems, Optimal control
Abstract: As autonomous vehicles (AVs) become increasingly prevalent, their interaction with human drivers presents a critical challenge. Current AVs lack social awareness, causing behavior that is often awkward or unsafe. To combat this, social AVs, which are proactive rather than reactive in their behavior, have been explored in recent years. With knowledge of robot-human interaction dynamics, a social AV can influence a human driver to exhibit desired behaviors by strategically altering its own behaviors. In this paper, we present a novel framework for achieving human influence. The foundation of our framework lies in an innovative use of control barrier functions to formulate the desired objectives of influence as constraints in an optimal control problem. The computed controls gradually push the system state toward satisfaction of the objectives, e.g. slowing the human down to some desired speed. We demonstrate the proposed framework’s feasibility in a variety of scenarios related to car-following and lane changes, including multi-robot and multi-human configurations. In two case studies, we validate the framework’s effectiveness when applied to the problems of traffic flow optimization and aggressive behavior mitigation. Given these results, the main contribution of our framework is its versatility in a wide spectrum of influence objectives and mixed-autonomy configurations.
|
|
10:24-10:27, Paper WeA03.9 | |
Data-Driven Monitoring with Mobile Sensors and Charging Stations Using Multi-Arm Bandits and Coordinated Motion Planners |
|
Nayak, Siddharth | Massachusetts Institute of Technology |
Greiff, Marcus Carl | Mitsubishi Electric Research Laboratries |
Raghunathan, Arvind | Mitsubishi Electric Research Labs |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Keywords: Autonomous systems, Statistical learning, Autonomous robots
Abstract: We study the problem of data-driven monitoring using an autonomous search team. We consider a typical monitoring task of classifying a search environment into interesting and uninteresting regions as quickly as possible using a search team comprised of mobile sensors (e.g., drones) and mobile charging stations (e.g., ground vehicles). For widespread deployment in the physical world, the search team must also accommodate noisy data collected by the mobile sensors, overcome energy constraints on the mobile sensors which limit their range, and ensure collision avoidance for the charging stations. We address these challenges using a novel, bi-level approach for the monitoring task, where a high-level planner uses past data to determine the potential regions of interest for the drones to visit, and a low-level path planner coordinates the paths for the entire search team to visit these regions subject to the posed constraints. We design the high-level planner using a multi-armed bandit framework. For the low-level planner, we propose two approaches: an optimal integer program-based motion planner and a real-time implementable graph-based heuristic planner. We characterize several theoretical properties of the proposed approaches, including anytime guarantees, upper bounds on computing time, and task completion time. We show the efficacy of our approach in simulations, including one in Gazebo where we identify harvest-ready trees using an autonomous heterogeneous search team.
|
|
10:27-10:30, Paper WeA03.10 | |
Coupled Sensor Configuration and Planning in Unknown Dynamic Environments with Context-Relevant Mutual Information-Based Sensor Placement |
|
Poudel, Prakash | Worcester Polytechnic Institute |
Cowlagi, Raghvendra V. | Worcester Polytechnic Institute |
Keywords: Autonomous systems, Sensor networks, Optimization
Abstract: We address path-planning for a mobile agent to navigate in an unknown environment with minimum exposure to a spatially and temporally varying threat field. The threat field is estimated using pointwise noisy measurements from a sensor network separate from the mobile agent. For this problem, we present a new metric for optimal sensor placement that quantifies reduction in uncertainty in the path cost, rather than the environment state. This metric, which we call the context-relevant mutual information (CRMI), couples the sensor placement and path-planning problem. We propose also an iterative coupled sensor configuration and path planning (CSCP) algorithm. At each iteration, the algorithm places sensors to maximize CRMI, updates the threat estimate using new measurements, and recalculates the path with minimum expected exposure to the threat. The iterations converge when the path cost variance, which is an indicator of risk, reduces below a desired threshold. Through numerical simulations, we demonstrate that the principal advantage of this algorithm is that near-optimal low-variance paths are achieved using far fewer sensor measurements as compared to a standard decoupled method.
|
|
10:30-10:33, Paper WeA03.11 | |
Safe Stabilizing Control for Polygonal Robots in Dynamic Elliptical Environments |
|
Long, Kehan | University of California San Diego |
Tran, Khoa | University of California, San Diego |
Leok, Melvin | University of California, San Diego |
Atanasov, Nikolay | University of California, San Diego |
Keywords: Constrained control, Autonomous robots, Stability of nonlinear systems
Abstract: This paper addresses the challenge of safe navigation for rigid-body mobile robots in dynamic environments. We introduce an analytic approach to compute the distance between a polygon and an ellipse, and employ it to construct a control barrier function (CBF) for safe control synthesis. Existing CBF design methods for mobile robot obstacle avoidance usually assume point or circular robots, preventing their applicability to more realistic robot body geometries. Our work enables CBF designs that capture complex robot and obstacle shapes. We demonstrate the effectiveness of our approach in simulations highlighting real-time obstacle avoidance in constrained and dynamic environments for both mobile robots and multi-joint robot arms.
|
|
10:33-10:36, Paper WeA03.12 | |
Aircraft Approach Management Using Reachability and Dynamic Programming |
|
P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Yamazaki, Sachiyo | Mitsubishi Electric Corporation |
Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Yoshikawa, Nobuyuki | Mitsubishi Electric Corp |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Air traffic management, Autonomous systems, Constrained control
Abstract: We study the problem of designing safe trajectories for aircraft approach management. Our tractable method designs aircraft trajectories that 1) use only limited admissible maneuvers near the airport, 2) maintains a user-specified separation between aircraft during the entire duration of the approach, and 3) minimizes deviations from user-specified times of arrival at the airport. We use a first-come, first-serve framework to design the trajectories for multiple aircraft by solving a collection of single-aircraft trajectory planning problems. We ensure the safety of the overall system by imposing reachability-based constraints on each planning problem. We identify the constraints as well as the trajectories for aircraft using dynamic programming in a three-dimensional space. We validate the efficacy and safety of our method using historical data from Japan's Haneda International Airport.
|
|
10:36-10:39, Paper WeA03.13 | |
Collision Cone Control Barrier Functions: Experimental Validation on UGVs for Kinematic Obstacle Avoidance |
|
Goswami, Bhavya Giri | Indian Institute of Science, Bengaluru |
Tayal, Manan | Indian Institute of Science, Bengaluru |
Rajgopal, Karthik | Indian Institute of Science, Bengaluru |
Jagtap, Pushpak | Indian Institute of Science |
Nadubettu Yadukumar, Shishir | Indian Institute of Science |
Keywords: Control applications, Autonomous robots, Optimal control
Abstract: This paper introduces an experimental platform designed for the validation and demonstration of a novel class of Control Barrier Functions (CBFs) tailored for Unmanned Ground Vehicles (UGVs) to proactively prevent collisions with kinematic obstacles by integrating the concept of collision cones. While existing CBF formulations excel with static obstacles, extensions to torque/acceleration-controlled unicycle and bicycle models have seen limited success. Conventional CBF applications in such nonholonomic UGV models have demonstrated control conservatism, particularly in scenarios where steering/thrust control was deemed infeasible. Drawing inspiration from collision cones in path planning, we present a pioneering C3BF formulation ensuring theoretical safety guarantees for such models. The core premise revolves around aligning the obstacle's velocity away from the vehicle, establishing a constraint to perpetually avoid vectors directed towards it. This control methodology is rigorously validated through simulations and experimental verification on the Copernicus mobile robot (Unicycle Model) and FOCAS-Car (Bicycle Model).
|
|
10:39-10:42, Paper WeA03.14 | |
Avoidance of Constant Velocity Targets Using Bearing and Time-To-Collision |
|
Adams, James J. | Brigham Young University |
Liu, Jen Jui | Brigham Young University |
Beard, Randal W. | Brigham Young Univ |
Keywords: Estimation, Autonomous systems
Abstract: This paper address the problem of path deconfliction and collision avoidance for unmanned aerial vehicles equipped with a monocular optical camera. The measurements observed from the camera do not include range, but they do allow for calculation of time-to-collision. In this paper we exploit the idea that a single time-to-collision estimate may have come from any number of intruders with different range and velocities, to define a family of potential intruders. The family of potential intruders is represented by a set of particles that can then be propagated forward in time to represent the set of all potential collision scenarios. A path planning algorithm is then introduced to minimize the collision risk. The method is illustrated with simulation results.
|
|
10:42-10:45, Paper WeA03.15 | |
Achieving and Maintaining Inverted Pose for Miniature Autonomous Blimps |
|
Wang, Junkai | Georgia Institute of Technology |
Zhang, Fumin | Georgia Institute of Technology |
Keywords: Flight control, Mechanical systems/robotics, Mechatronics
Abstract: Miniature autonomous blimps (MABs) have advantages of safe flight and long flight duration. This paper presents a flight controller design for an underactuated MAB to achieve and maintain the inverted pose. When the MAB starts from a bottom-heavy pose, an energy-shaping controller is engaged to achieve a swing-up motion for the MAB to reach the proximity of the inverted pose. Once the criteria for controller switching are met, a stabilizing controller is employed to maintain the inverted pose. The performance of the proposed controller is verified through the experimental results collected from the GT-MAB.
|
|
10:45-10:48, Paper WeA03.16 | |
Safe Control Synthesis for Hybrid Systems through Local Control Barrier Functions |
|
Yang, Shuo | University of Pennsylvania |
Black, Mitchell | Toyota Motor North America |
Fainekos, Georgios | Toyota NA-R&D |
Hoxha, Bardh | Toyota Motor North America |
Okamoto, Hideki | Toyota |
Mangharam, Rahul | University of Pennsylvania |
Keywords: Hybrid systems, Switched systems, Autonomous systems
Abstract: Control Barrier Functions (CBF) have provided a very versatile framework for the synthesis of safe control architectures for a wide class of nonlinear dynamical systems. Typically, CBF-based synthesis approaches apply to systems that exhibit nonlinear – but smooth – relationship in the state of the system and linear relationship in the control input. In contrast, the problem of safe control synthesis using CBF for hybrid dynamical systems, i.e., systems which have a discontinuous relationship in the system state, remains largely unexplored. In this work, we build upon the progress on CBF- based control to formulate a theory for safe control synthesis for hybrid dynamical systems. Under the assumption that local CBFs can be synthesized for each mode of operation of the hybrid system, we show how to construct CBF that can guarantee safe switching between modes. The end result is a switching CBF-based controller which provides global safety guarantees. The effectiveness of our proposed approach is demonstrated on two simulation studies.
|
|
10:48-10:51, Paper WeA03.17 | |
Real-Time Trajectory Generation Via Dynamic Movement Primitives for Autonomous Racing |
|
Weaver, Catherine | Catherine22@berkeley.edu |
Capobianco, Roberto | Sapienza University of Rome |
Wurman, Peter | Sony AI |
Stone, Peter | The University of Texas at Austin |
Tomizuka, Masayoshi | Univ of California, Berkeley |
Keywords: Mechanical systems/robotics, Autonomous systems, Machine learning
Abstract: We employ sequences of high-order motion primitives for efficient online trajectory planning, enabling competitive racecar control even when the car deviates from an offline demonstration. Dynamic Movement Primitives (DMPs) utilize a target-driven non-linear differential equation combined with a set of perturbing weights to model arbitrary motion. The DMP’s target-driven system ensures that online trajectories can be generated from the current state, returning to the demonstration. In racing, vehicles often operate at their handling limits, making precise control of acceleration dynamics essential for gaining an advantage in turns. We introduce the Acceleration goal (Acc. goal) DMP, extending the DMP’s target system to accommodate accelerating targets. When sequencing DMPs to model long trajectories, our Acc. goal DMP explicitly models acceleration at the junctions where one DMP meets its successor in the sequence. Applicable to DMP weights learned by any method, the proposed DMP generates trajectories with less aggressive acceleration and jerk during transitions between DMPs compared to second-order DMPs. Our proposed DMP sequencing method can recover from trajectory deviations, achieve competitive lap times, and maintain stable control in autonomous vehicle racing within the high-fidelity racing game Gran Turismo Sport. Video available: https://sites.google.com/berkeley.edu/racingdmp/home
|
|
10:51-10:54, Paper WeA03.18 | |
Encouraging Inferable Behavior for Autonomy: Repeated Bimatrix Stackelberg Games with Observations |
|
Karabag, Mustafa O. | The University of Texas at Austin |
Smith, Sophia | The University of Texas at Austin |
Fridovich-Keil, David | The University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Game theory, Autonomous systems, Optimal control
Abstract: When interacting with other non-competitive decision-making agents, it is critical for an autonomous agent to have inferable behavior: Their actions must convey their intention and strategy. For example, an autonomous car's strategy must be inferable by the pedestrians interacting with the car. We model the inferability problem using a repeated bimatrix Stackelberg game with observations where a leader and a follower repeatedly interact. During the interactions, the leader uses a fixed, potentially mixed strategy. The follower, on the other hand, does not know the leader's strategy and dynamically reacts based on observations that are the leader's previous actions. In the setting with observations, the leader may suffer from an inferability loss, i.e., the performance compared to the setting where the follower has perfect information of the leader's strategy. We show that the inferability loss is upper-bounded by a function of the number of interactions and the stochasticity level of the leader's strategy, encouraging the use of inferable strategies with lower stochasticity levels. As a converse result, we also provide a game where the required number of interactions is lower bounded by a function of the desired inferability loss.
|
|
10:54-10:57, Paper WeA03.19 | |
Where to Drop Sensors from Aerial Robots to Monitor a Surface-Level Phenomenon |
|
Shek, Chak Lam | University of Maryland |
Shi, Guangyao | University of Maryland |
Asghar, Ahmad Bilal | University of Maryland |
Tokekar, Pratap | University of Maryland |
Keywords: Multivehicle systems, Sensor networks, Autonomous robots
Abstract: We consider the problem of routing a team of energy-constrained Unmanned Aerial Vehicles (UAVs) to deploy sensors for monitoring a surface-level phenomenon in the presence of stochastic wind disturbances. Most of the environmental monitoring work considers the case where the sensors and their carrier are on one platform, and the sensing location can be controlled by controlling the carrier vehicle's position. In contrast, airdropping the sensors onto the ground can introduce stochasticity in the sensors' landing (i.e., measurement) locations. We focus on carefully choosing where to drop the sensors so that despite the measurement locations' stochasticity, we can still effectively monitor the surface-level phenomenon. Specifically, we introduce this problem where a team of UAVs must drop sensors within an allotted time budget. The objective is to maximize the mutual information between the phenomenon at Points of Interest and the measurements obtained at the random sensing locations. This is a variant of the Submodular Team Orienteering Problem with an additional constraint on the number of sensors each UAV can carry. We show that the stochasticity in the measurement location makes objective computationally expensive to compute. We propose a surrogate objective with a closed-form expression and an algorithm for the routing problem. We validate the proposed approach through extensive simulations using synthetic and a real-world dataset of Chlorophyll density from a Pacific Ocean sub-region.
|
|
10:57-11:00, Paper WeA03.20 | |
Leveraging Computational Fluid Dynamics in UAV Motion Planning |
|
Huang, Yunshen | Washington University in St. Louis |
Greiff, Marcus Carl | Mitsubishi Electric Research Laboratries |
P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Optimization, Control applications, Autonomous robots
Abstract: We propose a motion planner for quadrotor unmanned aerial vehicles (UAVs) in windy environments, where the motion is defined by a sequence of Bezier curves in the flat output space of the UAV. The real-time implementable planner incorporates wind information from high-fidelity computational fluid dynamics simulations performed offline and utilizes convexity properties of Bezier curves to enable real-time implementations. For this purpose, we: (i) identify a model for the UAV-wind interaction; (ii) use the OpenFoam software to compute a model of the wind speeds subject to world geometry and boundary conditions; (iii) describe a method for regressing this wind model into a more compact representation; and finally (iv) demonstrate how this representation is amenable to minimum-snap motion planning of quad-rotor UAVs in realistic environments. We validate our approach using simulations and hardware experiments, and show a significant improvement in the thrust used by the UAV in presence of strong winds.
|
|
WeA04 |
Metro W |
RI: Modeling, Estimation, and System Identification |
RI Session |
Chair: Zhang, Jun | University of Nevada Reno |
Co-Chair: Powell, Kody | University of Utah |
|
10:00-10:03, Paper WeA04.1 | |
Transformer Neural Networks with Spatiotemporal Attention for Predictive Control and Optimization of Industrial Processes |
|
Gallup, Ethan | University of Utah |
Tuttle, Jacob | Taber International |
Immonen, Jake | University of Utah |
Billings, Blake | University of Utah |
Powell, Kody | University of Utah |
Keywords: Modeling, Neural networks, Process Control
Abstract: In the context of real-time optimization and model predictive control of industrial systems, machine learning, and neural networks represent cutting-edge tools that hold promise for enhancing dynamic modeling. This work presents a transformer neural network architecture for real-time optimization and model predictive control. This network design includes a modified attention mechanism inspired by positional embedding attention from vision transformers and task-specific modifications to the input-output structure of the transformer’s decoder stack. Experiments were conducted using data from a 450 MW coal-fired power plant to evaluate this approach's effectiveness. The transformer neural network was compared with conventional recurrent models, including GRU and LSTM neural networks. The transformer exhibited a 6% increase in the R-squared value of predictions and an 83% reduction in mean squared error. Computation time was also reduced by 84% compared to conventional recurrent models.
|
|
10:03-10:06, Paper WeA04.2 | |
A Transition System Abstraction Framework for Neural Network Dynamical System Models |
|
Yang, Yejiang | Augusta University |
Mo, Zihao | Augusta University |
Tran, Hoang-Dung | University of Nebraska at Lincoln |
Xiang, Weiming | Augusta University |
Keywords: Modeling, Hybrid systems, Automata
Abstract: This paper proposes a transition system abstraction framework for neural network dynamical system models to enhance the model interpretability, with applications to complex dynamical systems such as human behavior learning and verification. To begin with, the localized working zone will be segmented into multiple localized partitions under the data-driven Maximum Entropy (ME) partitioning method. Then, the transition matrix will be obtained based on the set-valued reachability analysis of neural networks. Finally, applications to human handwriting dynamics learning and verification are given to validate our proposed abstraction framework, which demonstrates the advantages of enhancing the interpretability of the black-box model, i.e., our proposed framework is able to abstract a data-driven neural network model into a transition system, making the neural network model interpretable through verifying specifications described in Computational Tree Logic (CTL) languages.
|
|
10:06-10:09, Paper WeA04.3 | |
A Data-Driven Method for Safety-Critical Control: Designing Control Barrier Functions from State Constraints |
|
Lee, Jaemin | California Institute of Technology |
Kim, Jeeseop | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Identification for control, Nonlinear output feedback
Abstract: This paper addresses the challenge of integrating explicit hard constraints into the control barrier function (CBF) framework for ensuring safety in autonomous systems, including robots. We propose a novel data-driven method to derive CBFs from these hard constraints in practical scenarios. Our approach assumes that the forward invariant safe set is either a subset or equal to the constrained set. The process consists of two main steps. First, we randomly sample states within the constraint boundaries and identify inputs meeting the time derivative criteria of the hard constraint; this iterative process converges using the Jaccard index. Next, we formulate CBFs that enclose the safe set using the sampled boundaries. This enables the creation of a control-invariant safe set, approaching the maximum attainable level of control invariance. This approach, therefore, addresses the complexities posed by complex autonomous systems with constrained control input spaces, culminating in a control-invariant safe set that closely approximates the maximal control invariance set.
|
|
10:09-10:12, Paper WeA04.4 | |
Physics-Data-Hybrid Modeling of Tilt-Rotor Vertical Take-Off and Landing Aircraft |
|
Burton, Samantha | Utah State University |
He, Tianyi | Utah State University |
Su, Weihua | The University of Alabama |
Keywords: Aerospace, Linear parameter-varying systems, Modeling
Abstract: This paper presents a Physics-Data-Hybrid (PDH) approach to model the flight dynamics of tilt-rotor Vertical-TakeOff-and-Landing (VTOL) aircraft within the transition phase. The tilting rotors enable more agile flight by varying vectored thrusts; however, they render more complex aircraft dynamics and other challenges for modeling. The proposed PDH modeling approach starts from fundamental, nonlinear physical laws, and expresses them in a Linear Parameter-Varying (LPV) representation, which utilizes scheduling parameters to obtain kernels that encode the intrinsic nonlinearity. After that, flight data snapshots during the transition phase are used to determine the effective state space matrices, bypassing the laborious efforts to identify accurate flight coefficients. The proposed PDH approach is validated and its modeling accuracy is assessed via simulations on a tilt-rotor VTOL aircraft. The simulation first collects data snapshots by feeding a control input sequence to excite the aircraft dynamics and the PDH model is then derived. The state responses of the resulting PDH model are then compared against the true transition trajectory. The simulation results demonstrate the PDH model's excellent modeling accuracy.
|
|
10:12-10:15, Paper WeA04.5 | |
Disturbance Propagation in Vehicle Platoons: Symmetric Bidirectional Interconnections |
|
Farnam, Arash | Ghent University |
Farsi, Milad | Department of Applied Mathematics, University of Waterloo |
Ghorbani, Majid | Tallinn University of Technology |
L. Azad, Nasser | University of Waterloo |
Crevecoeur, Guillaume | Ghent University |
Keywords: Automotive control, Robust control, Modeling
Abstract: This paper is concerned with the problem of disturbance propagation in interconnected systems with double-integrator open-loop dynamics (e.g. acceleration-controlled automated vehicles). We consider a symmetric bidirectional linear interconnection to control the vehicles, in which each vehicle is interconnected solely to one immediate predecessor and to one immediate follower with the same control gain in these two directions. We prove that in this setting, regardless of the choice of stabilizing controller, it is not possible to keep the displacements between the vehicles bounded, and they grow unbounded depending on the number of vehicles N. In this case, we call the system string unstable. We show that this impossibility of achieving string stability remains under various boundary conditions for the leading vehicle, i.e., whether this is a virtual or physical vehicle.
|
|
10:15-10:18, Paper WeA04.6 | |
Physically Motivated Heater Model for Precise Gas Temperature Control in Fuel Cell Stack Test Beds |
|
Zeiringer, Thomas | Graz University of Technology |
Seeber, Richard | Graz University of Technology |
Horn, Martin | Graz University of Technology |
Keywords: Grey-box modeling, Identification for control, Control applications
Abstract: A physically motivated heater model for precise gas temperature control for use in fuel cell stack test beds is proposed. Additionally, a parameter identification method based on modulating functions is provided. The applicability of the identification method and the model is validated through simulations and real-world experiments. Lastly, a time-optimal controller based on the identified model is evaluated on a fuel cell stack test bed.
|
|
10:18-10:21, Paper WeA04.7 | |
Dynamic Modeling and Stability Analysis of Balancing in Riderless Electric Scooters |
|
Lin, Yun-Hao | National Cheng Kung University |
Jafari, Alireza | National Cheng Kung University |
Liu, Yen-Chen | National Cheng Kung University |
Keywords: Lyapunov methods, Feedback linearization, Stability of nonlinear systems
Abstract: Today, electric scooter is a trendy personal mobility vehicle. The rising demand and opportunities attract ride-share services. A common problem of such services is abandoned e-scooters. An autonomous e-scooter capable of moving to the charging station is a solution. This paper focuses on maintaining balance for these riderless e-scooters. The paper presents a nonlinear model for an e-scooter moving with simultaneously varying speed and steering. A PD and a feedback-linearized PD controller stabilize the model. The stability analysis shows that the controllers are ultimately bounded even with parameter uncertainties and measurement inaccuracy. Simulations on a realistic e-scooter with a general demanding path to follow verify the ultimate boundedness of the controllers. In addition, the feedback-linearized PD controller outperforms the PD controller because it has narrower ultimate bounds. Future work focuses on experiments using a self-balancing mechanism installed on an e-scooter.
|
|
10:21-10:24, Paper WeA04.8 | |
Identification of Multirotor Actuator Dynamics with RPM Feedback for Improved Control |
|
Charla, Sesha | Purdue University |
Yao, Bin | Purdue University |
Voyles, Richard | University of Minnesota |
Keywords: Identification for control, Mechanical systems/robotics, Mechatronics
Abstract: Multi-rotor actuator dynamics modeling spans from idealized to complex representations, often simplifying actuator nonlinearity. In the context of aerial manipulation, precise control is crucial. To address this, we introduce a method for nonlinear model identification, incorporating an adaptable input definition and a variable-resolution RPM measurement algorithm. Our study addresses challenges posed by commercial ESCs designed for manual control. We identify model parameters using small perturbation models, correlating them with the full nonlinear model. The paper encompasses an overview of the experimental setup, measurement algorithm, input definition, parameter identification, and validation.
|
|
10:24-10:27, Paper WeA04.9 | |
Spreading Dynamics of an SIQRS Epidemic Model and Quarantine Strategy Analysis |
|
Wang, Ruiyang | Beijing 101 Middle School |
Wang, Siqing | Peking University |
Mei, Wenjun | Peking University |
Keywords: Modeling
Abstract: During the COVID-19 pandemic, large-scale nucleic acid testing and quarantine were widely adopted as measures to contain the disease. However, it also aroused some controversy whether massive testing lead to increased interpersonal contacts and thus exacerbate the epidemic spreading. In this paper, we study a scalar SIQRS (Susceptible-Infected-Quarantines-Recovered-Susceptible) model incorporating infection, curing, quarantine and vaccination processes. To characterize the effects of massive testings, we assume that the interpersonal contact frequency between individuals increases linearly with the quarantine rate, which is in turn determined by the frequency of massive testings. Via equilibrium and stability analysis, we study the effects of the quarantine rate and find that there exists a threshold, above which massive testings are helpful in containing the disease and below which massive testings facilitate the epidemic spreading. Numerical simulations are conducted to demonstrate the correctness of the theoretical analysis and the effectiveness of the proposed quarantine strategy.
|
|
10:27-10:30, Paper WeA04.10 | |
Dimensionality Reduction of Dynamics on Lie Groups Via Structure-Aware Canonical Correlation Analysis |
|
Chung, Wooyoung | San Jose State University |
Polani, Daniel | University of Hertfordshire |
Tiomkin, Stas | San Jose State University |
Keywords: Nonlinear systems identification, Estimation, Reduced order modeling
Abstract: Incorporating prior knowledge into a data-driven modeling problem can drastically improve performance, reliability, and generalization outside of the training sample. The stronger the structural properties, the more effective these improvements become. Manifolds are a powerful nonlinear generalization of Euclidean space for modeling finite dimensions. When additionally assuming that the manifold carries (Lie) group structure, this imposes a drastically stricter global constraint. The range of their applications is very wide and includes the important case of robotic tasks. We apply this idea to Canonical Correlation Analysis (CCA). In traditional CCA one constructs a hierarchical sequence of maximal correlations of up to two paired data sets in Euclidean spaces. We here generalize the CCA concept to respect the structure of Lie groups and demonstrate its efficacy through the substantial improvements it achieves in making structure-consistent predictions about changes in the state of a robotic hand.
|
|
10:30-10:33, Paper WeA04.11 | |
Moving past Point-Contacts: Extending the ALIP Model to Humanoids with Non-Trivial Feet Using Hierarchical, Full-Body Momentum Control |
|
Paredes, Victor | The Ohio State University |
Hagen, Daniel | Sarcos Technology and Robotics Corp |
Chesebrough, Samuel | Sarcos Technology and Robotics Corporation |
Swann, Riley | Sarcos Technology and Robotics Corporation |
Garagic, Denis | The Ohio State University |
Hereid, Ayonga | Ohio State University |
Keywords: Reduced order modeling, Hierarchical control, Autonomous robots
Abstract: The Angular-Momentum Linear Inverted Pendulum (ALIP) model is a promising motion planner for bipedal robots. However, it relies on two assumptions: (1) the robot has point-contact feet or passive ankles, and (2) the angular momentum around the center of mass, known as centroidal angular momentum, is negligible. This paper addresses the question of whether the ALIP paradigm can be applied to more general bipedal systems with complex foot geometry (e.g., flat feet) and nontrivial torso/limb inertia and mass distribution (e.g., non-centralized arms). In such systems, the dynamics introduce non-negligible centroidal momentum and contact wrenches at the feet, rendering the assumptions of the ALIP model invalid. This paper presents the ALIP planner for general bipedal robots with non-point-contact feet through the use of a task-space whole-body controller that regulates centroidal momentum, thereby ensuring that the robot's behavior aligns with the desired template dynamics. To demonstrate the effectiveness of our proposed approach, we conduct simulations using the Sarcos Guardian XO robot, which is a hybrid humanoid/exoskeleton with large, offset feet. The results demonstrate the practicality and effectiveness of our approach in achieving stable and versatile bipedal locomotion.
|
|
10:33-10:36, Paper WeA04.12 | |
Exact Consistency Tests for Gaussian Mixture Filters Using Normalized Deviation Squared Statistics |
|
Ahmed, Nisar | University of Colorado Boulder |
Burks, Luke | Aurora Flight Sciences |
Cabral, Kailah | Aurora Flight Sciences |
Rose, Alyssa | Aurora Flight Sciences |
Keywords: Estimation, Filtering, Sensor fusion
Abstract: We consider the problem of evaluating dynamic consistency in discrete time probabilistic filters that approximate stochastic system state densities with Gaussian mixtures. Dynamic consistency means that the estimated probability distributions correctly describe the actual uncertainties. As such, the problem of consistency testing naturally arises in applications with regards to estimator tuning and validation. However, due to the general complexity of the density functions involved, straightforward approaches for consistency testing of mixture-based estimators have remained challenging to define and implement. This paper derives a new exact result for Gaussian mixture consistency testing within the framework of normalized deviation squared (NDS) statistics. It is shown that NDS test statistics for generic multivariate Gaussian mixture models exactly follow mixtures of generalized chi- square distributions, for which efficient computational tools are available. The accuracy and utility of the resulting consistency tests are numerically demonstrated on static and dynamic mixture estimation examples.
|
|
10:36-10:39, Paper WeA04.13 | |
DiProber: Estimating Relays Capacities in Underloaded Anonymous Communication Networks |
|
Darir, Hussein | University of Illinois Urbana-Champaign |
Dullerud, Geir E. | Univ of Illinois, Urbana-Champaign |
Borisov, Nikita | Univ. Illinois |
Keywords: Estimation, Network analysis and control, Simulation
Abstract: Tor is a widely used anonymous communication network that routes and anonymizes users’ internet traffic through thousands of relays. To create a path, Tor authorities estimate relay capacities based on the bandwidth of observation probes assigned to each relay. These estimates are used to generate a probability distribution over relays for incoming users to choose from. However, the currently implemented estimation algorithm generates inaccurate estimates, resulting in underutilization of the network and unfair distribution of capacities between users. To address this, we propose DiProber, a new algorithm that uses two probes per relay and maximum likelihood to more accurately estimate relay capacities. Our new technique works particularly well in underutilized networks where users have low demand on the Tor network.
|
|
10:39-10:42, Paper WeA04.14 | |
Optimal State Estimation in the Presence of Non-Gaussian Uncertainty Via Wasserstein Distance Minimization |
|
Prabhat, Himanshu | Texas A&M University |
Bhattacharya, Raktim | Texas A&M |
Keywords: Estimation, Kalman filtering, Filtering
Abstract: This paper presents a novel distribution-agnostic Wasserstein distance-based estimation framework. The goal is to determine an optimal map combining prior estimate with measurement likelihood such that posterior estimation error optimally reaches the Dirac delta distribution with minimal effort. The Wasserstein metric is used to quantify the effort of transporting from one distribution to another. We hypothesize that minimizing the Wasserstein distance between the posterior error and the Dirac delta distribution results in optimal information fusion and posterior state uncertainty. Framework validation is demonstrated by the successful recovery of the classical Kalman filter for linear systems with Gaussian uncertainties. Notably, the proposed Wasserstein filter does not rely on particle representation of uncertainty. Furthermore, the classical result for the Gaussian Sum Filter (GSF) is retrieved from the Wasserstein framework. This approach analytically exhibits the suboptimality of GSF and enables the use of nonlinear optimization techniques to enhance the accuracy of the Gaussian sum estimator.
|
|
10:42-10:45, Paper WeA04.15 | |
Adaptive Health Monitoring of Second-Life Batteries |
|
Cui, Xiaofan | Stanford University |
Khan, Muhammad Aadil | Stanford University |
Singh, Surinder | Relyion Energy |
Sharma, Ratnesh | NEC Laboratories America, Inc |
Onori, Simona | Stanford Univeristy |
Keywords: Energy systems, Estimation
Abstract: As surplus of second-life (SL) batteries becomes available, one critical obstacle between their widespread adoption is the accurate estimation and monitoring of their state-of-health (SOH). Various retired battery packs with a lack of knowledge of their historical usage are used to set up new SL battery systems. The paper presents a new online adaptive health estimation method designed to address this practical challenges associated with SL battery systems. This method utilizes the real-time data obtained from the SL batteries used for grid energy storage systems. The proposed approach is validated on a laboratory-aged experimental data set of retired EV batteries. Through dynamic adjustment of estimator gains, the approach can effectively accommodate the unique characteristics of individual cells, enhancing its adaptability and robustness. The bounded-input-bounded-output (BIBO) stability of this adaptive estimator is theoretically guaranteed.
|
|
10:45-10:48, Paper WeA04.16 | |
Simultaneous Parameter Estimation in Model-Free Control |
|
Waleed, Danial | University of Vermont |
Duffaut Espinosa, Luis Augusto | University of Vermont |
Keywords: Intelligent systems, Adaptive control, Optimization
Abstract: Model-Free control is a data-driven control methodology that utilizes a surrogate model, namely the ultra-local mode, to provide a short-time estimate of the dynamics and uncertainties related to a nonlinear system. Traditional Model-Free control requires two parameters, one is estimated online while the other is considered a tuning parameter for the performance of the system at hand. The tuning parameter is typically chosen heuristically and depends upon expert knowledge of the system. This paper provides a methodology for on-the-fly simultaneous selection of all parameters of the ultra-local model. The procedure relies on a robust Kalman Filter and an optimization routine that enforces the validity of the ultra-local model. An illustrative example is provided where the methodology developed is shown to improve the tracking performance of a system and, more importantly, that the estimated parameters converge to the expected values of the system.
|
|
10:48-10:51, Paper WeA04.17 | |
Deep Reinforcement Learning Based Distributed Active Joint Localization and Target Tracking |
|
Wang, Dongming | University of California, Riverside |
Su, Shaoshu | University of California, Riverside |
Ren, Wei | University of California, Riverside |
Hao, Ce | University of California, Berkeley |
Keywords: Multivehicle systems, Distributed control, Estimation
Abstract: In this paper, we investigate the problem of active joint localization and target tracking of mobile robots with onboard sensors. Our primary objective is concurrent target tracking while precisely localizing the robots through coordinated motion. A key constraint is the distributed setting, where each robot’s observations are limited to its immediate vicinity, and communication is restricted to neighboring robots. To address this, we propose a novel reinforcement learning-based approach for active motion planning, grounded in a distributed estimation framework called Joint Localization and Target Tracking (JLATT). The policy is trained to optimize robot coordination and trajectories for enhanced self-localization, target tracking, and collision avoidance. Empirical analysis demonstrates our algorithm’s effectiveness compared to benchmarks, both in collision avoidance and reducing estimation covariance, affirming its robustness for complex robotic systems.
|
|
10:51-10:54, Paper WeA04.18 | |
Observable GNSS-IMU Sliding Window Filtering Using Differential Flatness |
|
Johnson, Jacob Collin | Brigham Young University |
Beard, Randal W. | Brigham Young Univ |
Keywords: Sensor fusion, Estimation
Abstract: Measurements from the global navigation satellite system (GNSS) and inertial measurement units (IMU) complement one another. Global position is observed from GNSS at low frequencies, and attitude and angular velocity are estimated from IMU data at high frequencies. However, in order to observe global attitude, there must be sufficient motion to excite each axis of the IMU, which may not be possible for large vehicles with constrained dynamics. We propose to model the trajectory estimate on the flat output space of a motion model whose heading is constrained to be in the direction of motion. This mitigates the need to observe heading from the measurements. In this method we use a continuous-time spline and optimize the control points such that the flat output trajectory fits the available GNSS and IMU measurements. We validate the proposed method with simulated data and show that it achieves a higher accuracy and lower solve time than continuous-time estimation on the configuration manifold SE(3).
|
|
10:54-10:57, Paper WeA04.19 | |
Sloppiness of Structured Systems with a Matrix Fraction Description |
|
Ma, Yunxiang | Tsinghua University |
Zhou, Tong | Tsinghua University, Beijing, 100084, CHINA |
Keywords: Identification
Abstract: This paper attacks practical difficulties in estimating parameters of a matrix fraction described system. Results on the sloppiness of a linear functional transformation (LFT) described descriptor system have been extended. Besides the assumptions that both the numerator and denominator polynomial matrices depend affinely on system parameters, and the denominator polynomial matrix is invertible, not any other hypothesis is adopted. An ellipsoid approximation is obtained for the parameter set constituted from all the parameter values, with the associated system frequency responses deviating within a specified range from those of the system with a known parameter value. For some significant application cases, explicit expressions were derived for the absolute and relative sloppiness. Comparisons with the well-known Fisher information matrix have also been performed through a mechanical system, revealing significant differences between them in quantifying parameter estimation hardness.
|
|
WeB01 |
Metro E/C |
Machine Learning I |
Regular Session |
Chair: Shakeri, Heman | University of Virginia |
Co-Chair: Kamalapurkar, Rushikesh | Oklahoma State University |
|
13:30-13:45, Paper WeB01.1 | |
Dynamic Mode Decomposition of Control-Affine Nonlinear Systems Using Discrete Control Liouville Operators |
|
Morrison, Zachary | Oklahoma State University |
Abudia, Moad | Oklahoma State University |
Rosenfeld, Joel A. | University of South Florida |
Kamalapurkar, Rushikesh | Oklahoma State University |
Keywords: Machine learning, Modeling
Abstract: The representation of a nonlinear dynamical system as infinite-dimensional linear operator over a Hilbert space of functions enables the study of the nonlinear system via pseudo-spectral analysis of the corresponding operator. In this paper, we develop a novel operator representation of discrete-time, control-affine nonlinear dynamical systems. We also demonstrate that this representation can be used to predict the behavior of the closed-loop system in response to a given feedback law. The representation is learned using recorded snapshots of the system state resulting from arbitrary, potentially open-loop control inputs. We thereby extend the predictive capabilities of dynamic mode decomposition to discrete-time nonlinear systems that are affine in control. We validate the method using two numerical experiments by predicting the response of a controlled Duffing oscillator to a known feedback law. The advantages of the developed method relative to existing techniques in the literature are also demonstrated.
|
|
13:45-14:00, Paper WeB01.2 | |
Enhanced Joint Angle Estimation Using Support Vector Machine-Long Short-Term Memory Fusion with Electromyography Signals |
|
Wahid, MD Ferdous | TAMUQ |
Tafreshi, Reza | Texas A&M University at Qatar |
Keywords: Machine learning, Biomedical, Biotechnology
Abstract: Due to its non-invasive nature, the Electromyography (EMG) signal has become crucial in predicting human motor intention and facilitating human-robot collaboration across various fields, including rehabilitation, assistive technologies, ergonomics, clinical diagnosis, and sport science. This study aims to enhance joint angle prediction using a two-step algorithm, leveraging a substantial dataset of twenty human subjects. The EMG data were acquired from four upper-limb muscles during an elbow flexion-extension and shoulder abduction-adduction task using Kinect V2 and Noraxon’s TeleMyo Mini DTS system. In stage one, a Support Vector Machine (SVM) is used to predict joint angles directly, followed by refining predictions with a Long Short-Term Memory (LSTM) network. The impact of different sliding window sizes on algorithm performance is analyzed, and a statistical comparison between direct angle prediction and the two-stage approach is conducted to assess the proposed method’s effectiveness. The proposed SVM-LSTM algorithm consistently yields enhanced Root Mean Square Error (RMSE) values (ranging from 8.45o to 9.36o) for both elbow and shoulder angles, compared to RMSE values reported in the literature (ranging from 8.2o to 19.04o). Statistical analysis through Friedman Analysis of Variance (ANOVA) underscores significant differences between SVM-direct and SVM-LSTM for both elbow and shoulder angles (p < 0.001). In the dynamic field of EMG-based joint angle prediction, these results hold profound significance for applications like prosthetics and rehabilitation robotics, where precise joint angle estimations are crucial.
|
|
14:00-14:15, Paper WeB01.3 | |
Operator-Based Detecting, Learning, and Stabilizing Unstable Periodic Orbits of Chaotic Attractors |
|
Tavasoli, Ali | University of Virginia |
Shakeri, Heman | University of Virginia |
Keywords: Machine learning, Chaotic systems, Reduced order modeling
Abstract: This paper examines the use of operator-theoretic approaches to the analysis of chaotic systems through the lens of their unstable periodic orbits (UPOs). Our approach involves three data-driven steps for detecting, identifying, and stabilizing UPOs. We demonstrate the use of kernel integral operators within delay coordinates as an innovative method for UPO detection. For identifying the dynamic behavior associated with each individual UPO, we utilize the Koopman operator to present the dynamics as linear equations in the space of Koopman eigenfunctions. This allows for characterizing the chaotic attractor by investigating its principal dynamical modes across varying UPOs. We extend this methodology into an interpretable machine learning framework aimed at stabilizing strange attractors on their UPOs. To illustrate the efficacy of our approach, we apply it to the Lorenz attractor as a case study.
|
|
14:15-14:30, Paper WeB01.4 | |
Counterfactually-Guided Causal Reinforcement Learning with Reward Machines |
|
Baharisangari, Nasim | Arizona State University |
Paliwal, Yash | Arizona State University |
Xu, Zhe | Arizona State University |
Keywords: Machine learning, Formal verification/synthesis, Agents-based systems
Abstract: In causal reinforcement learning (RL), counterfactual reasoning deals with “what if” situations and allows for investigating the potential consequences of actions or events that did not actually happen. In this paper, we combine counterfactual reasoning and reinforcement learning (RL) and propose Counterfactually-Guided Causal Reinforcement Learning with Reward Machines (CGC-RL). In CGC-RL, using observational data, we first compute the optimal textit{counterfactual sequence} with the highest probability of completing a given task. Then, we construct an RM compatible with the textit{counterfactual sequence}. We use the constructed RM to apply dynamic potential-based reward shaping to encourage the agent to follow the textit{counterfactual sequence}. We prove the policy-invariance under dynamic reward shaping with RMs. Finally, we implement CGC-RL in one case study and compare the results with three baselines. Our results show that CGC-RL outperforms the baselines.
|
|
14:30-14:45, Paper WeB01.5 | |
Machine Learning Modeling of Nonlinear Processes with Lyapunov Stability Guarantees |
|
Tan, Wallace | Gian Yion |
Xiao, Ming | National University of Singapore |
Wu, Guoquan | National University of Singapore |
Wu, Zhe | National University of Singapore |
Keywords: Machine learning, Lyapunov methods, Chemical process control
Abstract: Machine learning (ML) methods such as neural networks provide an efficient way to build nonlinear dynamic models from data that can be used in the model predictive control system. In this work, we develop – for the first time – the method of ML modeling of nonlinear processes with Lyapunov stability guarantees, and the theory of its generalization performance and closed-loop stability. By designing a novel loss function that accounts for Lyapunov stability conditions in the training phase, the resulting Lyapunov-stable neural network (LSNN) is able to capture the nonlinear dynamics and guarantee closed-loop stability when incorporated in a stabilizing controller simultaneously. Provable closed-loop stability properties are derived based on the analysis of its generalization performance using statistical learning theory. Finally, a chemical reactor example is used to demonstrate the efficacy of the proposed ML modeling method.
|
|
14:45-15:00, Paper WeB01.6 | |
A Q-Learning Approach for Adherence-Aware Recommendations |
|
Faros, Ioannis | Cornell University |
Dave, Aditya Deepak | Cornell University |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Human-in-the-loop control, Stochastic optimal control, Machine learning
Abstract: In many real-world scenarios involving high-stakes and safety implications, a human decision-maker (HDM) may receive recommendations from an artificial intelligence while holding the ultimate responsibility of making decisions. In this letter, we develop an "adherence-aware Q-learning" algorithm to address this problem. The algorithm learns the "adherence level" that captures the frequency with which an HDM follows the recommended actions and derives the best recommendation policy in real time. We prove the convergence of the proposed Q-learning algorithm to the optimal value and evaluate its performance across various scenarios.
|
|
WeB02 |
Harbour |
Network Control Systems I |
Regular Session |
Chair: Mousavi, Shima Sadat | ETH Zurich |
Co-Chair: Takaba, Kiyotsugu | Ritsumeikan University |
|
13:30-13:45, Paper WeB02.1 | |
Event-Triggered Distributed Control of Multiagent Systems: A Performance Recovery Consideration |
|
Kurtoglu, Deniz | University of South Florida |
Yucelen, Tansel | University of South Florida |
Tran, Dzung | AFRL |
Casbeer, David W. | Air Force Research Laboratory |
Garcia, Eloy | Air Force Research Laboratory |
Keywords: Networked control systems, Distributed control
Abstract: Although event-triggered architectures reduce the total number of agent-to-agent information exchange in distributed control of multiagent systems, their closed-loop performance can significantly deviate from their ideal, non-event-triggered counterparts. To address this issue, this paper presents a decentralized corrective signal, which is system-theoretically designed to achieve the ideal closed-loop multiagent system performance in event-triggered distributed control. In particular, the stability of the event-triggered closed-loop multiagent system with this decentralized corrective signal is first given and it is then demonstrated that the closed-loop performance approaches its ideal one as the gain of the decentralized corrective signal increases. A numerical example is also presented to illustrate the effectiveness of our approach when it is applied to a team of nonholonomic agents. While the results of this paper focus on performance recovery in distributed control of leader-follower multiagent systems over connected and undirected graphs, the presented decentralized corrective signal can be adapted for use in other distributed control architectures (e.g., consensus, containment, and formation control) as well as for other graph topologies, with minor changes.
|
|
13:45-14:00, Paper WeB02.2 | |
Multi-Agent Target Position Estimation Using Bearing-Only Measurements Via Spatial Excitation |
|
Hyeon, Soojeong | Seoul National University, |
Shames, Iman | Australian National University |
Shim, Hyungbo | Seoul National University |
Keywords: Networked control systems, Agents-based systems
Abstract: This paper solves the problem of estimating a single target position by a group of agents in two-dimensional space. Each agent is assumed to know its own position and to measure only the bearing to the target. We develop a distributed estimation scheme that processes individual bearing measurements by utilizing continuous-time dynamics with communication among the agents. Under the assumption of connectivity on the communication graph, it is shown that, even if each bearing measurement does not satisfy the persistency of excitation condition, each agent is able to estimate the target position provided that the agents are properly positioned. We examine this condition in terms of spatial excitation and its relationship with geometric configuration of the agents. Additionally, we analyze the convergence rate of the estimation error. Simulation results are provided to illustrate the performance of the proposed estimator.
|
|
14:00-14:15, Paper WeB02.3 | |
Strong Structural Controllability of Linear Descriptor Systems |
|
Mousavi, Shima Sadat | ETH Zurich |
Bahrami, Somayyeh | Razi University |
Fekih, Afef | University of Louisiana at Lafayette |
Keywords: Networked control systems, Control of networks, Network analysis and control
Abstract: This paper explores the controllability of linear descriptor systems on graphs. These systems, which are a generalization of linear time-invariant systems, are widely used for modeling various real-life phenomena with equations that include both algebraic and differential terms. There are three main types of controllability for these systems: controllability within a reachable set (R-controllability), complete controllability (C-controllability), and impulse controllability (I-controllability). This work focuses on strong structural controllability, which refers to the ability to control a family of linear descriptor systems with the same underlying directed graph structure. To this end, different graphs are defined, and a correspondence between zero forcing sets of these graphs and the sets of control nodes required for strong structural R-controllability, C-controllability, and I-controllability is established.
|
|
14:15-14:30, Paper WeB02.4 | |
Cooperative Learning with Gaussian Processes for Euler-Lagrange Systems Tracking Control under Switching Topologies |
|
Yang, Zewen | Robert Koch Institute |
Dong, Songbo | Technical University of Munich |
Lederer, Armin | ETH Zurich |
Dai, Xiaobing | Technical University of Munich |
Chen, Siyu | Technical University of Munich |
Sosnowski, Stefan | Technical University of Munich |
Hattab, Georges | Center for Artificial Intelligence in Public Health Research, Ro |
Hirche, Sandra | Technische Universität München |
Keywords: Cooperative control, Machine learning, Networked control systems
Abstract: This work presents an innovative learning-based approach to tackle the tracking control problem of Euler-Lagrange multi-agent systems with partially unknown dynamics operating under switching communication topologies. The approach leverages a correlation-aware cooperative algorithm framework built upon Gaussian process regression, which adeptly captures inter-agent correlations for uncertainty predictions. A standout feature is its exceptional efficiency in deriving the aggregation weights achieved by circumventing the computationally intensive posterior variance calculations. Through Lyapunov stability analysis, the distributed control law ensures bounded tracking errors with high probability. Simulation experiments validate the protocol's efficacy in effectively managing complex scenarios, establishing it as a promising solution for robust tracking control in multi-agent systems characterized by uncertain dynamics and dynamic communication structures.
|
|
14:30-14:45, Paper WeB02.5 | |
Cloud-Mediated Self-Triggered Synchronization of Physically Coupled Linear Agents |
|
Namba, Takumi | Ritsumeikan University |
Takaba, Kiyotsugu | Ritsumeikan University |
Keywords: Networked control systems, Distributed control, Cooperative control
Abstract: This letter deals with a cloud-mediated self-triggered synchronization of physically coupled linear agents. In such a multi-agent system, each agent has not only information communication but also physical couplings with its neighboring agents. In the cloud-mediated control, each agent asynchronously communicates with its neighbors through a cloud repository. At every access time, each agent estimates current and future behaviors of its neighbors as well as of itself, and locally determines its next access time to the cloud according to a certain access rule. A major difficulty in the cloud-mediated control of physically coupled agents is that each agent cannot accurately estimate its neighbors’ behaviors because they are continuously influenced by other agents through the physical couplings. To overcome this difficulty, we propose a novel self-triggered synchronizing control law for the physically coupled linear agents. The proposed control law achieves bounded synchronization by appropriately bounding the adverse effects of the physical couplings.
|
|
WeB03 |
Frontenac |
Autonomous Robots I |
Regular Session |
Chair: P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Co-Chair: Afghah, Fatemeh | Clemson University |
|
13:30-13:45, Paper WeB03.1 | |
Safety-Critical Control with Uncertainty Quantification Using Adaptive Conformal Prediction |
|
Zhou, Hao | University of North Carnolina at Charlotte |
Zhang, Yanze | University of North Carolina at Charlotte |
Luo, Wenhao | University of North Carolina at Charlotte |
Keywords: Autonomous robots, Autonomous systems, Robotics
Abstract: Safety assurance is critical in the planning and control of robotic systems. For robots operating in the real world, the safety-critical design often needs to explicitly address uncertainties and the pre-computed guarantees often rely on the assumption of the particular distribution of the uncertainty. However, it is difficult to characterize the actual uncertainty distribution beforehand and thus the established safety guarantee may be violated due to possible distribution mismatch. In this paper, we propose a novel safe control framework that provides a high-probability safety guarantee for stochastic dynamical systems following unknown distributions of motion noise. Specifically, this framework adopts adaptive conformal prediction to dynamically quantify the prediction uncertainty from online observations and combines that with the probabilistic extension of the control barrier functions to characterize the uncertainty-aware control constraints. By integrating the constraints in the model predictive control scheme, it allows robots to adaptively capture the true prediction uncertainty online in a distribution-free setting and enjoys formally provable high-probability safety assurance. Simulation results on multi-robot systems with stochastic single-integrator dynamics and unicycle dynamics are provided to demonstrate the effectiveness of our framework.
|
|
13:45-14:00, Paper WeB03.2 | |
Cascaded Nonlinear Control Design for Highly Underactuated Balance Robots |
|
Han, Feng | Rutgers University |
Yi, Jingang | Rutgers University |
Keywords: Autonomous robots, Autonomous systems, Stability of nonlinear systems
Abstract: This paper presents a nonlinear control design for highly underactuated balance robots, where the number of unactuated degree-of-freedom is greater than that of actuated one. To address the challenge of simultaneously trajectory tracking and balancing, the control converts a robot dynamics into a series of cascaded subsystems and each of them is considered virtually actuated. We sequentially design and update the virtual and actual control inputs to incorporate the balance task such that the unactuated coordinates are balanced to their instantaneous equilibrium. The closed-loop dynamics are shown to be stable and the tracking errors exponentially converge towards a neighborhood near the origin. The simulation results demonstrate the effectiveness of the proposed control design by using a triple-inverted pendulum cart system.
|
|
14:00-14:15, Paper WeB03.3 | |
Decoupled Trajectory Planning for Monitoring UAVs and Their UGV Carrier by Reachable Sets |
|
Kim, Taewan | University of Washington |
P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Autonomous robots, Constrained control, Multivehicle systems
Abstract: We consider the trajectory generation for a UGV and multiple UAVs that are tasked with monitoring certain specified target areas, where the former carries and re-charges the latter ones. We decouple the motion planning of UGVs and UAVs using reachable sets constructed from Lyapunov functions. The reachable sets are used as constraints for UGV trajectory generation resulting in existence guarantees of feasible UAVs trajectories with respect to flight and energy constraints. The reachable sets also provide an initial trajectory for UAVs rendezvous from and launch to the targets, which may be refined by optimization. We show simulation results of a case study with multiple UAVs monitoring multiple target sites.
|
|
14:15-14:30, Paper WeB03.4 | |
Probabilistic Visibility-Aware Trajectory Planning for Target Tracking in Cluttered Environments |
|
Gao, Han | Peking University |
Wu, Pengying | Peking University |
Su, Yao | Univeristy of California, Los Angeles (UCLA) |
Zhou, Kangjie | Peking University |
Ma, Ji | Peking University |
Liu, Hangxin | Virginia Tech |
Liu, Chang | Peking University |
Keywords: Autonomous robots, Control applications, Stochastic optimal control
Abstract: Target tracking has numerous significant civilian and military applications, and maintaining the visibility of the target plays a vital role in ensuring the success of the tracking task. Existing visibility-aware planners primarily focus on keeping the target within the limited field of view of an onboard sensor and avoiding obstacle occlusion. However, the negative impact of system uncertainty is often neglected, rendering the planners delicate to uncertainties in practice. To bridge the gap, this work proposes a model predictive control (MPC)-based trajectory planner for real-time visibility-aware and safe target tracking in the presence of system uncertainty. For more accurate target motion prediction, we introduce the concept of belief-space probability of detection (BPOD) to measure the predictive visibility of the target under stochastic robot and target states. An Extended Kalman Filter variant incorporating BPOD is developed to predict target belief state under uncertain visibility within the planning horizon. To reach real-time trajectory planning, we propose a computationally efficient algorithm to uniformly calculate both BPOD and the chance-constrained collision risk by utilizing linearized signed distance function (SDF), and subsequently solve the MPC problem by sequential convex programming. Extensive simulation results with benchmark comparisons show the capacity of the proposed approach to robustly maintain the visibility of the target under high system uncertainty. The practicality of the proposed trajectory planner is validated by real-world experiments.
|
|
14:30-14:45, Paper WeB03.5 | |
PyroTrack: Belief-Based Deep Reinforcement Learning Path Planning for Aerial Wildfire Monitoring in Partially Observable Environments |
|
Khoshdel, Sahand | Clemson University |
Luo, Qi | Clemson University |
Afghah, Fatemeh | Clemson University |
Keywords: Autonomous robots, Markov processes, Intelligent systems
Abstract: Motivated by agility, 3D mobility, and low-risk operation compared to human-operated management systems of autonomous unmanned aerial vehicles (UAVs), this work studies UAV-based active wildfire monitoring where a UAV detects fire incidents in remote areas and tracks the fire frontline. A UAV path planning solution is proposed considering realistic wildfire management missions, where a single low-altitude drone with limited power and flight time is available. Noting the limited field of view of commercial low-altitude UAVs, the problem formulates as a partially observable Markov decision process (POMDP), in which wildfire progression outside the field of view causes inaccurate state representation that prevents the UAV from finding the optimal path to track the fire front in limited time. Common deep reinforcement learning (DRL)-based trajectory planning solutions require diverse drone-recorded wildfire data to generalize pre-trained models to real-time systems, which is not currently available at a diverse and standard scale. To narrow down the gap caused by partial observability in the space of possible policies, a belief-based state representation with broad, extensive simulated data is proposed where the beliefs (i.e., ignition probabilities of different grid areas) are updated using a Bayesian framework for the cells within the field of view. The performance of the proposed solution in terms of the ratio of detected fire cells and monitored ignited area (MIA) is evaluated in a complex fire scenario with multiple rapidly growing fire batches, indicating that the belief state representation outperforms the observation state representation both in fire coverage and the distance to fire frontline.
|
|
14:45-15:00, Paper WeB03.6 | |
Collision-Free Platooning of Mobile Robots through a Set-Theoretic Predictive Control Approach |
|
Rajkumar, Suryaprakash | Concordia University |
Tiriolo, Cristian | Concordia University |
Lucia, Walter | Concordia University |
Keywords: Autonomous robots, Multivehicle systems, Constrained control
Abstract: This paper proposes a control solution to achieve collision-free platooning control of input-constrained mobile robots. The platooning policy is based on a leader-follower approach where the leader tracks a reference trajectory while followers track the leader's pose with an inter-agent delay. First, the leader and the follower kinematic models are feedback linearized and the platoon's error dynamics and input constraints characterized. Then, a set-theoretic model predictive control strategy is proposed to address the platooning trajectory tracking control problem. An ad-hoc collision avoidance policy is also proposed to guarantee collision avoidance amongst the agents. Finally, the effectiveness of the proposed control architecture is validated through experiments performed on a formation of Khepera IV differential drive robots.
|
|
WeB04 |
Metro W |
Estimation and Identification I |
Regular Session |
Chair: Ifqir, Sara | CRIStAL Laboratory, Centrale Lille Institut |
Co-Chair: Khosravi, Mohammad | Delft University of Technology |
|
13:30-13:45, Paper WeB04.1 | |
A New Switched Interval Observer Design for Vehicle Lateral Dynamics Estimation |
|
Ifqir, Sara | CRIStAL Laboratory, Centrale Lille Institut |
Ichalal, Dalil | IBISC-Lab, Univ Evry, Paris Saclay University |
Ait Oufroukh, Naima | IBISC, Université D'Evry |
Mammar, Said | Université D'Evry |
Keywords: Estimation, Autonomous vehicles, Switched systems
Abstract: A new interval observer (IO) for uncertain switched linear dynamical systems is derived in this paper. We show that estimation accuracy and robustness can be achieved by combining Input-to-State Stability (ISS) concept and common quadratic Lyapunov function. The stability and positivity conditions are expressed in terms of Linear Matrix Inequalities (LMIs). This new observer overcomes the drawbacks of existing IO's, which are designed based on similarity transformations. The strengths of the proposed observer structure lie in having additional decision variables. The design procedure is very simple and uniform, it does not require changes of coordinates, which are costly in terms of computational time. The increased robustness of the presented method, compared to other commonly used approaches, is demonstrated by accurately estimating, at each time instant, the interval including vehicle sideslip angle and yaw rate. The proposed algorithm shows high potential for practical implementation.
|
|
13:45-14:00, Paper WeB04.2 | |
Closed-Form Information-Theoretic Roughness Measures for Mixture Densities |
|
Hanebeck, Uwe D. | Karlsruhe Institute of Technology (KIT) |
Frisch, Daniel | Karlsruhe Institute of Technology (KIT) |
Prossel, Dominik | Karlsruhe Institute of Technology (KIT) |
Keywords: Estimation, Filtering, Stochastic systems
Abstract: In estimation, control, and machine learning under uncertainties, latent variables are usually described by a probability density function (pdf). The optimal reconstruction of a continuous pdf from given samples or moments is an important and ubiquitous task. Unfortunately, it typically results in an underdetermined optimization problem, as the pdf is not fully constrained by the given samples or moments. For regularization, we use Fisher Information (FI) that acts as a roughness measure, i.e., selects the smoothest pdf fulfilling the constraints, in an information-theoretic sense. For the important class of mixture densities, FI can only be computed numerically. In this paper, we derive a closed-form solution for FI for mixtures by transforming the problem to the space cal R of root mixtures (RMs). This results in a tandem processing scheme simultaneously working in the original mixture space cal M and the corresponding RM space: The density parameters are optimized in root mixture space based on the closed-form FI. The desired constraints are evaluated in the original mixture space cal M.
|
|
14:00-14:15, Paper WeB04.3 | |
Linear Time-Varying Parameter Estimation: Maximum a Posteriori Approach Via Semidefinite Programming |
|
Vakili, Sasan | Delft University of Technology |
Khosravi, Mohammad | Delft University of Technology |
Mohajerin Esfahani, Peyman | TU Delft |
Mazo Jr., Manuel | Delft University of Technology |
Keywords: Estimation, Identification, LMIs
Abstract: We study the problem of identifying a linear time-varying output map from measurements and linear time-varying system states, which are perturbed with Gaussian observation noise and process uncertainty, respectively. Employing a stochastic model as prior knowledge for the parameters of the unknown output map, we reconstruct their estimates from input/output pairs via a Bayesian approach to optimize the posterior probability density of the output map parameters. The resulting problem is a non-convex optimization, for which we propose a tractable linear matrix inequalities approximation to warm-start a first-order subsequent method. The efficacy of our algorithm is shown experimentally against classical Expectation Maximization and Dual Kalman Smoother approaches.
|
|
14:15-14:30, Paper WeB04.4 | |
MARG Sensor-Based Attitude Estimation on SO(3) under Unknown External Acceleration |
|
Shaaban, Ghadeer | University Grenoble Alpes |
Fourati, Hassen | University Grenoble Alpes |
Kibangou, Alain | Univ. Grenoble Alpes |
Prieur, Christophe | CNRS |
Keywords: Estimation, Kalman filtering, Sensor fusion
Abstract: In many applications, attitude estimation algorithms rely mainly on magnetic and inertial measurements from MARG sensors (consisting of a magnetometer, a gyroscope, and an accelerometer). One of the main challenges facing these algorithms is that the accelerometer measures both gravity and an unknown external acceleration, while these algorithms assume that the accelerometer measures only the gravity. In this paper, an attitude estimation algorithm on the special orthogonal group SO(3) is designed, considering the external acceleration as an unknown input with direct feedthrough to the output, with a local approximation approach. The proposed algorithm is validated through Monte Carlo simulations and real datasets, demonstrating better accuracy and enhanced performance than existing solutions.
|
|
14:30-14:45, Paper WeB04.5 | |
Distributed Fact Checking: Learning Unreliability |
|
Verma, Ashwin | University of California San Diego |
Mohajer, Soheil | University of Minnesota |
Touri, Behrouz | University of California San Diego |
Keywords: Estimation, Lyapunov methods
Abstract: We formulate the problem of fake news detection using distributed fact-checkers (agents) with unknown reliability. The stream of news/statements is modeled as an independent and identically distributed binary source (to represent true and false statements). Upon observing a news, agent i labels the news as true or false that reflects the true validity of the statement with some probability 1-pi_i. In other words, agent i misclassified each statement with error probability pi_iin (0,1), where the parameter pi_i models the (un)trustworthiness of agent i. We present an algorithm to learn the unreliability parameters, resulting in a distributed fact checking algorithm. For the two-agent case, we extensively analyze the discrete-time limit of our algorithm.
|
|
14:45-15:00, Paper WeB04.6 | |
Domain-Adaptation with Knowledge Accumulation through Parallel Stacked Autoencoders: Methodology and Application to Sulfur Recovery |
|
Mou, Tianhao | Shanghai Jiao Tong University |
Liu, Jinfeng | University of Alberta |
Zou, Yuanyuan | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Xibilia, Maria Gabriella | University of Messina |
Keywords: Estimation, Machine learning, Chemical process control
Abstract: Data-driven predictive models for end-point quality variables are important tools in industrial process modeling. However, establishing an effective predictive model with limited labeled data remains challenging. Transfer learning (TL) offers a solution by leveraging knowledge from similar but different tasks. This paper introduces a novel TL-based predictive model, domain-adaptation parallel stacked autoencoders (DA-PSAE), which can extract and accumulate knowledge from multiple similar processes. First, a parallel model structure is designed for the simultaneous extraction of static and plastic latent features. Besides, an effective TL-based training strategy is proposed, which utilizes data from multiple similar processes. The proposed model is applied to a sulfur recovery unit composed of four parallel sub-units. Experimental results verify the effectiveness of the proposed model.
|
|
WeB05 |
Marine |
Optimization I |
Regular Session |
Chair: Poveda, Jorge I. | University of California, San Diego |
Co-Chair: Dong, Roy | University of Illinois at Urbana-Champaign |
|
13:30-13:45, Paper WeB05.1 | |
Tradeoffs between Convergence Speed and Noise Amplification in First-Order Optimization: The Role of Averaging |
|
Samuelson, Samantha | University of Southern California |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Optimization algorithms, Linear systems, Robust control
Abstract: We study the effect of averaging on convergence speed and noise amplification of the heavy-ball algorithm in the presence of additive white stochastic disturbances. For strongly convex quadratic problems, we show that averaging over the entire algorithmic history eliminates steady-state variance of the averaged output at the expense of slowing down convergence to a sub-linear rate. In contrast, finite window averaging converges with a linear rate but it leads to a non-zero value of the steady-state variance. While this value is smaller than the steady-state variance of the iterates of the heavy-ball algorithm, it has the same orderwise dependence on the condition number. We also show that the finite window averaging increases the upper bound on the expected error at iteration t by a constant factor that depends on the length of the averaging window.
|
|
13:45-14:00, Paper WeB05.2 | |
Online Linear Quadratic Tracking with Regret Guarantees |
|
Karapetyan, Aren | ETH Zürich |
Bolliger, Diego | Zurich University of Applied Sciences (ZHAW) |
Tsiamis, Anastasios | ETH Zurich |
Balta, Efe C. | Inspire AG |
Lygeros, John | ETH Zurich |
Keywords: Optimization algorithms, Machine learning, Optimal control
Abstract: Online learning algorithms for dynamical systems provide finite time guarantees for control in the presence of sequentially revealed cost functions. We pose the classical linear quadratic tracking problem in the framework of online optimization where the time-varying reference state is unknown a priori and is revealed after the applied control input. We show the equivalence of this problem to the control of linear systems subject to adversarial disturbances and propose a novel online gradient descent-based algorithm to achieve efficient tracking in finite time. We provide a dynamic regret upper bound scaling linearly with the path length of the reference trajectory and a numerical example to corroborate the theoretical guarantees.
|
|
14:00-14:15, Paper WeB05.3 | |
An Interconnected Systems Approach to Convergence Analysis of Discrete-Time Primal-Dual Algorithms |
|
Kelly, Spencer | University of Toronto |
Simpson-Porco, John W. | University of Toronto |
Keywords: Optimization algorithms, Optimization, Lyapunov methods
Abstract: We study the geometric convergence rate of discrete-time primal-dual algorithms for solving strongly-convex equality-constrained optimization problems. Our approach separates the primal-dual algorithm into an interconnection of two exponentially stable systems, and a composite Lyapunov approach is used to establish stability of the interconnection and provide new bounds on the geometric rate of convergence. Analogous convergence results are developed for two variations of the primal-dual algorithm: an extrapolated version which accelerates convergence, and an inner-loop version which interpolates between the vanilla primal-dual method and dual ascent. The obtained bounds are compared and contrasted with existing bounds from the literature.
|
|
14:15-14:30, Paper WeB05.4 | |
On Distributed Nonconvex Optimisation Via Modified ADMM |
|
Mafakheri, Behnam | University of Melbourne |
Manton, Jonathan H. | The University of Melbourne |
Shames, Iman | Australian National University |
Keywords: Optimization algorithms, Network analysis and control, Machine learning
Abstract: This paper addresses the problem of nonconvex nonsmooth decentralised optimisation in multi-agent networks with undirected connected communication graphs. Our contribution lies in introducing an algorithmic framework designed for the distributed minimisation of the sum of a smooth (possibly nonconvex and non-separable) function and a convex (possibly nonsmooth and non-separable) regulariser. The proposed algorithm can be seen as a modified version of the ADMM algorithm where, at each step, an ``inner loop'' needs to be iterated for a number of iterations. The role of the inner loop is to aggregate and disseminate information across the network. We observe that a naive decentralised approach (one iteration of the inner loop) may not converge. We establish the asymptotic convergence of the proposed algorithm to the set of stationary points of the nonconvex problem where the number of iterations of the inner loop increases logarithmically with the step count of the ADMM algorithm. We present numerical results demonstrating the proposed method's correctness and performance.
|
|
14:30-14:45, Paper WeB05.5 | |
Connection of Optimal Stopping Time to S-T Cut Problems on Trees |
|
Wang, Yijin | University of Illinois Urbana-Champaign |
Ornik, Melkior | University of Illinois Urbana-Champaign |
Dong, Roy | University of Illinois at Urbana-Champaign |
Keywords: Optimization algorithms, Stochastic optimal control, Control of networks
Abstract: We present a method to transform any optimal stopping time problem with an underlying tree structure into an s-t min-cut problem on the same tree but with modified capacities, the details of which are lacking in existing optimal stopping time research. We also show that any s-t min/max-cut problem on a tree has an equivalent optimal stopping problem formulation. We provide a dynamic programming algorithm to solve this problem and also perform a sensitivity analysis on it. Our results imply that the s-t max-cut problem on a tree can be solved using an algorithm with runtime that is linear in the tree size.
|
|
14:45-15:00, Paper WeB05.6 | |
Decentralized Laplacian Gradient Flows with Vanishing Anchors for Resource Allocation Problems with Arbitrary Initialization |
|
Barreiro-Gomez, Julian | New York University Abu Dhabi (NYUAD) |
Poveda, Jorge I. | University of California, San Diego |
Keywords: Optimization algorithms, Time-varying systems, Distributed control
Abstract: Resource allocation problems are ubiquitous across different engineering applications where a fixed and limited asset needs to be optimally allocated among multiple agents in an online manner. While a plethora of algorithms exist for the solution of such problems, most existing approaches are restricted to either centralized initializations or the use of multiple auxiliary states that increase the complexity of the dynamics. In this paper, we tackle these challenges by introducing a novel and simple approach for the solution of decentralized resource allocation problems in multi-agent networked systems. The approach relies on a family of time-varying Laplacian gradient flows with vanishing anchors that remove any restriction on the initialization of the dynamics, without the need of incorporating additional auxiliary states. The convergence properties of the proposed dynamics are studied for a general class of time-varying vanishing gains. Analytical and numerical examples are also presented.
|
|
WeB06 |
Queens Quay 1 |
Control of Large-Scale Battery Energy Storage Systems |
Invited Session |
Chair: Lin, Xinfan | University of California, Davis |
Co-Chair: Soudbakhsh, Damoon | Temple University |
Organizer: Zhang, Dong | University of Oklahoma |
Organizer: Soudbakhsh, Damoon | Temple University |
Organizer: Jain, Neera | Purdue University |
Organizer: Dey, Satadru | The Pennsylvania State University |
Organizer: Tang, Shuxia | Texas Tech University |
Organizer: Roy, Tanushree | Texas Tech University |
Organizer: Moura, Scott | University of California, Berkeley |
Organizer: Lin, Xinfan | University of California, Davis |
Organizer: De Castro, Ricardo | University of California, Merced |
Organizer: Song, Ziyou | University of Michigan, Ann Arbor |
Organizer: Fogelquist, Jackson | University of California, Davis |
|
13:30-13:45, Paper WeB06.1 | |
Optimal Power Management of Battery Energy Storage Systems Via Ensemble Kalman Inversion (I) |
|
Farakhor, Amir | University of Kansas |
Askari, Iman | University of Kansas |
Wu, Di | Pacific Northwest National Laboratory |
Fang, Huazhen | University of Kansas |
Keywords: Energy systems, Estimation, Optimal control
Abstract: Optimal power management of battery energy storage systems (BESS) is crucial for their safe and efficient operation. Numerical optimization techniques are frequently utilized to solve the optimal power management problems. However, these techniques often fall short of delivering real-time solutions for large-scale BESS due to their computational complexity. To address this issue, this paper proposes a computationally efficient approach. We introduce a new set of decision variables called power-sharing ratios corresponding to each cell, indicating their allocated power share from the output power demand. We then formulate an optimal power management problem to minimize the system-wide power losses while ensuring compliance with safety, balancing, and power supply-demand match constraints. To efficiently solve this problem, a parametrized control policy is designed and leveraged to transform the optimal power management problem into a parameter estimation problem. We then implement the ensemble Kalman inversion to estimate the optimal parameter set. The proposed approach significantly reduces computational requirements due to 1) the much lower dimensionality of the decision parameters and 2) the estimation treatment of the optimal power management problem. Finally, we conduct extensive simulations to validate the effectiveness of the proposed approach. The results show promise in accuracy and computation time compared with explored numerical optimization techniques.
|
|
13:45-14:00, Paper WeB06.2 | |
Optimal Charging with Active Thermal Management for eVTOL Aircraft Battery Packs (I) |
|
Goshtasbi, Alireza | University of Michigan |
Han, Sangwoo | Joby Aviation |
Zhao, Ruxiu | Joby Aviation |
Neubauer, Jeremy | Joby Aviation |
Keywords: Optimal control, Energy systems, Optimization
Abstract: Electric vertical takeoff and landing (eVTOL) aircraft operation often requires the battery packs to be charged and thermally conditioned to a given range of state-of-charge (SOC) and temperature for the next flight in minimum time to maximize aircraft utility. Where charging favors higher temperatures than what is required at launch, an optimal solution must trade-off faster charging times with the required thermal conditioning time. Here we develop a coupled electro-thermal battery, charger, and cooling system model, formulate an optimal control problem for fast charging, and solve it using a direct collocation approach. The resulting algorithm simultaneously provides the optimal current and thermal conditioning profiles that bring the pack from a starting temperature and SOC to their respective targets in minimum time, while ensuring degradation and safety constraints are satisfied. Insights from the results can be used to develop simple on-line control algorithms for charging with active thermal management that are critical for eVTOL applications.
|
|
14:00-14:15, Paper WeB06.3 | |
Depreciation Cost Is a Poor Proxy for Revenue Lost to Aging in Grid Storage Optimization (I) |
|
Kumtepeli, Volkan | University of Oxford |
Hesse, Holger | Kempten University of Applied Sciences |
Morstyn, Thomas | University of Oxford |
Nosratabadi, Seyyed Mostafa | University of Oxford |
Aunedi, Marko | Brunel University London |
Howey, David A. | University of Oxford |
Keywords: Energy systems, Optimization
Abstract: Dispatch of a grid energy storage system for arbitrage is typically formulated into a rolling-horizon optimization problem that includes a battery aging model within the cost function. Quantifying degradation as a depreciation cost in the objective can increase overall profits by extending lifetime. However, depreciation is just a proxy metric for battery aging; it is used because simulating the entire system life is challenging due to computational complexity and the absence of decades of future data. In cases where the depreciation cost does not match the loss of possible future revenue, different optimal usage profiles result and this reduces overall profit significantly compared to the best case (e.g., by 30–50%). Representing battery degradation perfectly within the rolling-horizon optimization does not resolve this—in addition, the economic cost of degradation throughout life should be carefully considered. For energy arbitrage, optimal economic dispatch requires a trade-off between overuse, leading to high return rate but short lifetime, vs. underuse, leading to a long but not profitable life. We reveal the intuition behind selecting representative costs for the objective function, and propose a simple moving average filter method to estimate degradation cost. Results show that this better captures peak revenue, assuming reliable price forecasts are available.
|
|
14:15-14:30, Paper WeB06.4 | |
Optimal Sizing, Operation, and Efficiency Evaluation of Battery Swapping Station for Electric Heavy-Duty Trucks (I) |
|
Wang, Ruiting | University of California, Berkeley |
Ju, Yi | University of California, Berkeley |
Allybokus, Zaid | TotalEnergies OneTech |
Zeng, Wente | Total S.A |
Obrecht, Nicolas | TotalEnergies OneTech |
Moura, Scott | University of California, Berkeley |
Keywords: Optimization, Energy systems, Transportation networks
Abstract: Decarbonization and electrification of long-haul trucks are notoriously difficult due to the high energy demand and limited gravimetric energy density of lithium-ion cells. In this study, we investigate the optimal deployment and operation of a grid-connected battery swapping station (BSS) for electric long-haul trucks as a mixed-integer optimization problem. We construct a model for reliably meeting customer energy needs while providing grid services, to demonstrate the business case and the operation of such a system. The impact of optimal sizing of the station is explored. A comparative study of two alternative charging infrastructure solutions using battery swapping and fast charging stations (FCS) has been performed through numerical experiments. We examine ton-mile-per-hour as a metric for the relative efficiency of cargo movement and the labor requirements. We find that FCS works better for large battery packs with short and regional hauls, and BSS is well suited for relatively medium-sized battery packs and long-haul truck movements. The study also found that the potential to use smaller batteries, and the expansion of the network can increase the battery utilization efficiency of BSS, which may be able to approach a similar level to the utilization efficiency of FCS.
|
|
14:30-14:45, Paper WeB06.5 | |
Comparison between Battery Cell Level Dynamics and Pack Level Dynamics Using Equivalent Circuit Models (I) |
|
Ross, Joseph Peter | Universtiy of Oxford, Brill Power Limited |
Frost, Damien Francis | Brill Power Limited |
Chatzinikolaou, Efstratios | Brill Power Limited |
Duncan, Stephen | University of Oxford |
Howey, David A. | University of Oxford |
Keywords: Estimation, Model Validation, Energy systems
Abstract: Large battery systems include parallel-connected cells and modules, and these can exhibit complex and unexpected behaviours. In this paper, we investigate parallel-connected battery equivalent circuit models and show that cell inhomogeneity leads to differences between parallel groups that are not typically accounted for in battery management system (BMS) models. The theoretical analysis is complemented by experimental results using impedance data to parameterise models from individual cells, and from a pack assembled from the individual cells. We show that cell-level parameterisation is sufficient for parameterising a pack model---but only if parasitic impedances and inhomogeneity between cells is well understood.
|
|
WeB07 |
Queens Quay 2 |
Safety of Advanced Driver Assistance Systems and Automated Driving Systems |
Invited Session |
Chair: Rastgoftar, Hossein | University of Arizona |
Co-Chair: Nazari, Shima | UC Davis |
Organizer: Zhao, Junfeng | Arizona State University |
Organizer: Rastgoftar, Hossein | University of Arizona |
Organizer: Nazari, Shima | UC Davis |
|
13:30-13:45, Paper WeB07.1 | |
Energy-Critical Control Using Control Barrier Functions (I) |
|
Alan, Anil | University of Michigan |
Ivanco, Andrej | Allison Transmission |
Orosz, Gabor | University of Michigan |
Keywords: Automotive control, Control applications, Energy systems
Abstract: This study proposes an energy-critical control scheme for road vehicles. The framework is designed to ensure an energy-critical goal that is defined in the form of an upper bound on the energy consumption. In order to respond to the changing traffic environment, this bound is changed based on the preceding vehicle's energy consumption. Control barrier functions are used to obtain a condition on the control input such that the energy-critical goal is formally guaranteed. This condition is used in a quadratic program scheme as a constraint, and the resulting controller intervenes the driver input in a minimally-invasive fashion. The effect of the energy-critical controller is demonstrated in data-based simulations, where up to 25% improvement in energy consumption is observed compared to an energy-inefficient driver, without drastically changing the car-following performance.
|
|
13:45-14:00, Paper WeB07.2 | |
Safety-Guaranteed Learning-Based Flocking Control Design (I) |
|
Liu, Mingzhe | Arizona State University |
Chen, Yan | Arizona State University |
Keywords: Agents-based systems, Machine learning, Constrained control
Abstract: This research aims to develop a new learning-based flocking control framework that ensures inter-agent free collision. To achieve this goal, a leader-following flocking control based on a deep Q-network (DQN) is designed to comply with the three Reynolds’ flocking rules. However, due to the inherent conflict between the navigation attraction and inter agent repulsion in the leader-following flocking scenario, there exists a potential risk of inter-agent collisions, particularly with limited training episodes. Failure to prevent such collision not only caused penalties in training but could lead to damage when the proposed control framework is executed on hardware. To address this issue, a control barrier function (CBF) is incorporated into the learning strategy to ensure collision-free flocking behavior. Moreover, the proposed learning framework with CBF enhances training efficiency and reduces the complexity of reward function design and tuning. Simulation results demonstrate the effectiveness and benefits of the proposed learning methodology and control framework.
|
|
14:00-14:15, Paper WeB07.3 | |
Adaptive Control of Vehicle Steering-By-Wire System with Varying-Degree Lyapunov Function and Deterministic Robust Control Augmentation (I) |
|
Zhou, Xingyu | University of Texas at Austin |
Ahn, Hyunjin | The University of Texas at Austin |
Kung, Yung-Chi | The University of Texas at Austin |
Shen, Heran | The University of Texas at Austin |
Wang, Junmin | University of Texas at Austin |
Keywords: Adaptive control, Automotive control, Direct adaptive control
Abstract: Existing adaptive control designs for vehicle steering-by-wire (SbW) systems mainly rely on quadratic Lyapunov functions, providing (global) stability and, at best, (global) asymptotic convergence of certain closed-loop signals. However, these approaches generally lack assurance in transient performance. In this paper, we introduce a novel adaptive control scheme aimed to enhance and guarantee the transient performance of the adaptive SbW control system. This approach integrates a varying-degree Lyapunov function with deterministic robust control. The new adaptive control scheme is derived in a general context, applicable to a class of single-input, parametrically uncertain, nonlinear dynamic systems in Brunovsky form. We then apply this general theoretical result to develop an adaptive controller for the SbW system. Using a high-fidelity moving-base driving simulator, we demonstrate the transient performance improvement of the new adaptive SbW controller compared to a baseline method.
|
|
14:15-14:30, Paper WeB07.4 | |
Teleoperated Steering Using Estimated Position and Orientation of Remote Ego Vehicle (I) |
|
Sharma, Gaurav | University of Minnesota |
Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Automotive control, Estimation, Mechatronics
Abstract: Teleoperation of a vehicle requires displaying the road environment of the remote vehicle accurately on a teleoperation station, so that a human teleoperator can use the display to control the vehicle safely and efficiently. Limited bandwidth and latencies in wireless communication may prevent transmission of camera images and Lidar data at a sufficiently high frequency for rapid updates of the display. This paper describes how frequent transmission of just GPS and IMU data can enable accurate vehicle position and orientation estimation with which realistic intermediate updates of the remote vehicle environment can be provided. A nonlinear dynamic motion model and an extended Kalman filter are utilized for estimating vehicle position and orientation. A study with 5 human subjects is used to compare steering control of a remote vehicle with and without intermediate position updates. Experimental data show that a 0.5 second delay in real-time display makes it extremely difficult for a human teleoperator to control the vehicle to stay in its lane on curved roads. However, using an estimation-based predictive display system to update the vehicle position and orientation with respect to the road environment enables safe remote vehicle control with almost as accurate a performance as the delay-free case.
|
|
14:30-14:45, Paper WeB07.5 | |
Safety-Critical Stabilization of Mixed Traffic by Pairs of CAVs (I) |
|
Zhao, Chenguang | The Hong Kong University of Science and Technology (Guangzhou) |
Molnar, Tamas G. | Wichita State University |
Yu, Huan | The Hong Kong University of Science and Technology(Guangzhou) |
Keywords: Traffic control, Cooperative control, Autonomous systems
Abstract: Connected and automated vehicles (CAVs) have been widely applied to vehicle-based traffic control in mixed-autonomy systems that consist of CAVs and human-driven vehicles (HVs). The control designs of CAVs allow them to drive smoothly, cooperate, and stabilize the flow of traffic. However, these controllers must also ensure safe behaviors for CAVs as well as consider their potential impact on the safety of following HVs. In this paper, we propose nonlinear controllers for a pair of CAVs that respond to each other whilst traveling amongst HVs. The controllers seek to stabilize traffic, while the safety of CAVs in terms of~front-end collisions is formally guaranteed via control barrier functions. We analyze how the coordination of CAVs affects stability and safety in the mixed traffic system. The efficacy of the proposed controllers is demonstrated by numerical simulations.
|
|
WeB08 |
Bay |
Advanced Methods in Diagnostics and Prognostics |
Tutorial Session |
Chair: Castillo, Ivan | The Dow Chemical Company |
Co-Chair: Wang, Zhenyu | Dow Chemical |
Organizer: Castillo, Ivan | The Dow Chemical Company |
Organizer: Wang, Zhenyu | Dow Chemical |
Organizer: Makki, Imad | Ford Motor Company |
|
13:30-13:45, Paper WeB08.1 | |
Cycle Life Prediction for Lithium-Ion Batteries: Machine Learning and More (I) |
|
Schaeffer, Joachim | Technischen Universität Darmstadt |
Galuppini, Giacomo | University of Pavia |
Rhyu, Jinwook | Massachusetts Institute of Technology |
Asinger, Patrick | Massachusetts Institute of Technology |
Droop, Robin | Massachusetts Institute of Technology |
Findeisen, Rolf | TU Darmstadt |
Braatz, Richard D. | Massachusetts Institute of Technology |
Keywords: Energy systems, Machine learning, Modeling
Abstract: Batteries are dynamic systems with complicated nonlinear aging, highly dependent on cell design, chemistry, manufacturing, and operational conditions. Prediction of battery cycle life and estimation of aging states is important to accelerate battery R&D, testing, and to further the understanding of how batteries degrade. Beyond testing, battery management systems rely on real-time models and onboard diagnostics and prognostics for safe operation. Estimating the state of health and remaining useful life of a battery is important to optimize performance and use resources optimally. This tutorial begins with an overview of first-principles, machine learning, and hybrid battery models. Then, a typical pipeline for the development of interpretable, machine learning models is explained and showcased for cycle life prediction from laboratory testing data. We highlight the challenges of machine learning models, motivating the incorporation of physics in hybrid modeling approaches, which are needed to decipher the aging trajectory of batteries but require more data and further work on the physics of battery degradation. The tutorial closes with a discussion on generalization and further research directions.
|
|
13:45-14:00, Paper WeB08.2 | |
Advanced Methods in Diagnostics and Prognostics (I) |
|
Mohr, Fabian | Massachusetts Institute of Technology |
Sun, Weike | Massachusetts Institute of Technology |
Braatz, Richard D. | Massachusetts Institute of Technology |
Keywords: Fault diagnosis, Machine learning, Fault detection
Abstract: This article provides a tutorial on data-driven methods for diagnostics and prognostics. The methods span the range from classical methods to machine learning-based methods including ensemble methods and support vector machine. The main challenges associated with diagnostics/prognostics are discussed, and decision rules are provided for selecting which data-driven modeling method to apply to a particular dataset based on the characteristics of the data. The autonomous selection of data-driven methods is described, as an application-specific automated machine learning (AutoML) approach. A case study of a simulated chemical plant provided by Tennessee Eastman Company is used to illustrate the characteristics of the data-driven methods for diagnostics and prognostics.
|
|
WeB09 |
Dockside 1 |
Aerospace Systems |
Regular Session |
Chair: Castillo, Pedro | Univ De Technologie De Compiegne |
Co-Chair: Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
|
13:30-13:45, Paper WeB09.1 | |
A Comparative Study of Machine Learning Techniques for Aircraft Loss of Control Prediction |
|
Khatri, Amit | CSIR-National Aerospace Laboratories |
Subbarao, Kamesh | The University of Texas, Arlington |
Keywords: Aerospace, Machine learning, Intelligent systems
Abstract: This paper presents a comparative analysis of different machine learning algorithms for aircraft loss of control (LOC) prediction. Aircraft LOC is a flight dynamics and control phenomenon that has been associated with a significant number of aircraft accidents worldwide. The NASA Generic Transport Model (GTM) is used as a representative aircraft to study the efficacy of machine learning based prediction algorithms under LOC scenarios. Extensive simulation of the GTM model in normal and LOC flight conditions are carried out to generate training and testing data for the machine learning algorithms. Different machine learning methods, viz. neural networks, support vector machines, decision trees, ensemble subspace kNN, ensemble boosted trees, and ensemble bagged trees are examined for their effectiveness in predicting LOC. A multi class classification problem is solved to build the machine learning models. It is observed from the analysis of prediction results that neural network method provides the best predictive model. Furthermore, the neural network predictive model performs quite well in correctly classifying the flight conditions under LOC, with an overall accuracy of 97 %
|
|
13:45-14:00, Paper WeB09.2 | |
Optimal Impact Angle Guidance Via First-Order Optimization under Nonconvex Constraints |
|
Park, Gyubin | Inha University |
Choi, Jiwoo | Inha University |
Jeong, Da Hoon | Hyundai Motors Company |
Kim, Jong-Han | Inha University |
Keywords: Aerospace, Optimal control, Optimization algorithms
Abstract: Most of the optimal guidance problems can be formulated as nonconvex optimization problems, which can be solved indirectly by relaxation, convexification, or linearization. Although these methods are guaranteed to converge to the global optimum of the modified problems, the obtained solution may not guarantee global optimality or even the feasibility of the original nonconvex problems. In this paper, we propose a computational optimal guidance approach that directly handles the nonconvex constraints encountered in formulating the guidance problems. The proposed computational guidance approach alternately solves the least squares problems and projects the solution onto nonconvex feasible sets, which rapidly converges to feasible suboptimal solutions or sometimes to the globally optimal solutions. The proposed algorithm is verified via a series of numerical simulations on impact angle guidance problems under state dependent maneuver vector constraints, and it is demonstrated that the proposed algorithm provides superior guidance performance than conventional techniques.
|
|
14:00-14:15, Paper WeB09.3 | |
Trajectory Tracking for Aerobatics Maneuvers in Quadrotors Vehicles |
|
Ibarra, Efrain | Universidad Autonoma De Nuevo Leon |
Castillo, Pedro | Univ De Technologie De Compiegne |
Keywords: Aerospace, Robotics, Autonomous robots
Abstract: Some inspection missions for quadrotors require a previous definition of the trajectories before being tracked. The well design of the trajectory can contribute for the success of the mission when aggressive maneuvers are demanded. In this paper, we propose a solution for designing and tracking unconventional trajectories requiring aerobatics maneuvers in quadrotors. The trajectory is based on the Hopf bifurcations and the sliding mode approach to form spiral loops in 3D around a pillar that need to be inspected by a quadrotor. The control architecture is based on the sliding mode methodology adding new parameters for achieving a desired angular velocity around the stable limit cycle solution. The proposed solution is validated in simulations and numerical results illustrate the good performance of the proposed trajectory-tracking control scheme.
|
|
14:15-14:30, Paper WeB09.4 | |
Capturing a Non-Cooperative Resident Space Object: A Control Barrier Function Approach |
|
Edwards, Sage | University of Florida |
Isaly, Axton | Air Force Research Laboratory |
Brewer, John Matthew | Georgia Institute of Technology |
Dixon, Warren E. | University of Florida |
Keywords: Aerospace, Robotics, Hybrid systems
Abstract: Control barrier functions are commonly used for providing safety guarantees to a controlled dynamical system, such as a guarantee of collision avoidance. This work leverages the recent advancements in multiple higher-order control barrier functions to develop a control strategy for a free-flying space manipulator system (SMS) with an arbitrary number of redundant manipulator arms to capture a non-cooperative resident space object (RSO). The impact of this result is a real-time controller capable of being used on a computationally constrained system while satisfying several safety constraints necessary for the approach, synchronization, and capture of the RSO.
|
|
14:30-14:45, Paper WeB09.5 | |
Spiral-Based Guidance Strategy for Interception of Stationary Targets |
|
Mishra, Kushagra | Indian Institute of Technology Bombay |
Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
Keywords: Aerospace, Variable-structure/sliding-mode control
Abstract: This paper introduces a novel impact angle-constrained guidance law for tackling the problem of stationary target interception within a planar framework. The guidance law leverages a class of bounded curvature spirals as the fundamental geometric framework, allowing the pursuer to enforce a desired impact angle by choosing a suitable spiral trajectory. Another key aspect that this paper addresses is that of minimizing the maximum lateral acceleration required by the pursuer since this is a crucial determinant of a guidance law's effectiveness. The geometric law is implemented using a robust nonlinear super-twisting sliding-mode control-based guidance command to ensure finite-time interception of targets. Extensive numerical simulations have been performed to validate the theoretical results and the robustness of the guidance law.
|
|
14:45-15:00, Paper WeB09.6 | |
Three-Dimensional Nonlinear Impact Time Guidance Considering Field-Of-View Constraints |
|
Majumder, Kakoli | Indian Institute of Technology Bombay |
Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
Keywords: Aerospace
Abstract: In this letter, we propose a three-dimensional nonlinear guidance strategy that guarantees the interception of stationary targets at the pre-specified impact time while accounting for the onboard seeker’s field-of-view constraints. The proposed approach for designing guidance law does not require a time-to-go estimation. It utilizes range-to-go, free from time-to-go approximations, allowing the interceptor to control the time of target interception. The guidance command, derived using the prescribed constraint function and sliding mode control considering coupled and nonlinear engagement kinematics, continuously maintains the target within its field-of-view and achieves guidance objectives. The proposed guidance strategy performs effectively, even for large deviations from the collision course, where pitch and yaw channels may be strongly coupled. Finally, the efficacy of the proposed guidance strategy is demonstrated using extensive numerical simulations.
|
|
WeB10 |
Dockside 2 |
Control of Cyber-Physical Systems: Multidisciplinary Approaches in
Robotics, Autonomy, Optimization, and Safety |
Invited Session |
Chair: Sinha, Abhinav | The University of Cincinnati |
Co-Chair: Cao, Yongcan | University of Texas, San Antonio |
Organizer: Sinha, Abhinav | The University of Cincinnati |
Organizer: Cao, Yongcan | University of Texas, San Antonio |
Organizer: Casbeer, David W. | Air Force Research Laboratory |
|
13:30-13:45, Paper WeB10.1 | |
Path Integral Control with Rollout Clustering and Dynamic Obstacles (I) |
|
Patrick, Steven | University of Texas at Austin |
Bakolas, Efstathios | The University of Texas at Austin |
Keywords: Autonomous robots, Optimization algorithms, Stochastic systems
Abstract: Model Predictive Path Integral (MPPI) control has proven to be a powerful tool for the control of uncertain systems. One important limitation of the baseline MPPI algorithm is that it does not utilize simulated trajectories to their fullest extent. For one, it assumes that the average of all trajectories weighted by their performance index will be a safe trajectory. In this paper, multiple examples are shown where the previous assumption does not hold, and a trajectory clustering technique is presented that reduces the chances of the weighted average crossing in an unsafe region. Secondly, MPPI does not account for dynamic obstacles, so the authors put forward a novel cost function that accounts for dynamic obstacles without adding significant computation time to the overall algorithm. The novel contributions proposed in this paper were evaluated with extensive simulations to demonstrate improvements upon the state-of-the-art MPPI techniques.
|
|
13:45-14:00, Paper WeB10.2 | |
State-Constrained Adaptive Guidance for Three-Body Pursuit-Evasion Using Super Twisting Algorithm (I) |
|
Gurjar, Bhagyashri | Indian Institute of Technology, Bombay |
Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Keywords: Aerospace, Control applications, Variable-structure/sliding-mode control
Abstract: In this paper, we address the problem of robust guidance for a three-body pursuit-evasion problem involving a target, a defender, and an attacker, where the goal is to safeguard the target (an aircraft) from the attacker, using the defender. Thus, the target and the defender may act as a team of vehicles whose common goal is to safeguard the target from the attacker. Additionally, it is desired to maintain necessary constraints on certain states. An adaptive guidance law is proposed for the target-defender team to ensure optimal performance within this set-up. The target-defender dynamics are considered for designing the guidance law with only approximate knowledge of the unknown parameters of the attacker. A robust guidance command is designed using a super-twisting algorithm with adaptive gains. The unknown disturbance bounds required for control implementation are obtained using a novel norm observer. We consider the complete nonlinear model of the system for control design, thereby enabling a wider domain of applicability of the proposed approach. The convergence is established using Lyapunov analysis, which also ensures satisfaction of the required state constraints. Relevant simulation results are presented to study the efficacy of the proposed approach.
|
|
14:00-14:15, Paper WeB10.3 | |
LQ-OCP: Energy-Optimal Control for LQ Problems (I) |
|
Beaver, Logan E. | Old Dominion University |
Keywords: Optimal control, Optimization algorithms, Linear systems
Abstract: This article presents a method to automatically generate energy-optimal trajectories for systems with linear dynamics, linear constraints, and a quadratic cost functional (LQ systems). First, using recent advancements in optimal control, we derive the optimal motion primitive generator for LQ systems. This yields linear differential equations that describe all dynamical motion primitives that the optimal system trajectory follows. We demonstrate the performance of our approach in simulation on an energy-minimizing submersible robot with state and control constraints and compare our approach to an energy-optimizing Linear Quadratic Regulator.
|
|
14:15-14:30, Paper WeB10.4 | |
Semi-Autonomous Full 3D Robot Operation with Variable Autonomy through Gaussian Process Regression (I) |
|
Kitashiba, Atsushi | Tokyo Institute of Technology |
Oda, Ryo | Tokyo Institute of Technology |
Hatanaka, Takeshi | Tokyo Institute of Technology |
Keywords: Human-in-the-loop control, Robotics, Machine learning
Abstract: In this paper, we develop a semi-autonomous full 3D robot operation system for a target reaching and holding task through Gaussian Process Regression (GPR). In order to mitigate the limited dimensionality of the human manipulable command, a virtual reality (VR) controller is employed as a command interface that links the operator’s own arm movements with those of the robot. We start by having a well-trained operator operate the robot from various initial positions away from an object to holding the object with grippers while directly looking at the manipulator, instead of through a 2D display. We then construct a GPR model of the operator, and design an automatic control law based on the mean function of the model, where we reveal the need for thinning out the training data to eliminate a computational issue stemming from excessive data. It is further pointed out that the high dimensionality of the robot motion poses a new challenge, namely a trade-off between data coverage and computational effort. To address this issue, we present a semi-autonomous system that blends manual control and automatic control based on the variance function of the GPR model. Finally, the proposed operation support system is demonstrated through experiments.
|
|
14:30-14:45, Paper WeB10.5 | |
Multi Agent Pathfinding for Noise Restricted Hybrid Fuel Unmanned Aerial Vehicles (I) |
|
Scott, Drew | University of Cincinnati |
Manyam, Satyanarayana Gupta | Air Force Research Labs |
Casbeer, David W. | Air Force Research Laboratory |
Kumar, Manish | University of Cincinnati |
Weintraub, Isaac | Air Force Research Laboratory |
Keywords: Optimization, Cooperative control, Transportation networks
Abstract: Multi Agent Path Finding (MAPF) seeks the optimal set of paths for multiple agents from respective start to goal locations such that no paths conflict. We address the MAPF problem for a fleet of hybrid-fuel unmanned aerial vehicles which are subject to location-dependent noise restrictions. We solve this problem by searching a constraint tree for which the subproblem at each node is a set of shortest path problems subject to the noise and fuel constraints and conflict zone avoidance. A labeling algorithm is presented to solve this subproblem, including the conflict zones which are treated as dynamic obstacles. We present the experimental results of the algorithms for various graph sizes and number of agents.
|
|
14:45-15:00, Paper WeB10.6 | |
Resilient Fleet Management for Energy-Aware Intra-Factory Logistics (I) |
|
Goutham, Mithun | The Ohio State University |
Stockar, Stephanie | The Ohio State University |
Keywords: Cooperative control, Autonomous robots, Manufacturing systems
Abstract: This paper presents a novel fleet management strategy for battery-powered robot fleets tasked with intra-factory logistics in an autonomous manufacturing facility. In this environment, repetitive material handling operations are subject to real-world uncertainties such as blocked passages, and equipment or robot malfunctions. In such cases, centralized approaches enhance resilience by immediately adjusting the task allocation between the robots. To overcome the computational expense, a two-step methodology is proposed where the nominal problem is solved a priori using a Monte Carlo Tree Search algorithm for task allocation, resulting in a nominal search tree. When a disruption occurs, the nominal search tree is rapidly updated a posteriori with costs to the new problem while simultaneously generating feasible solutions. Computational experiments prove the real-time capability of the proposed algorithm for various scenarios and compare it with the case where the search tree is not used and the decentralized approach that does not attempt task reassignment.
|
|
WeB11 |
Dockside 3 |
Game Theory I |
Regular Session |
Chair: Ramazi, Pouria | Brock University |
Co-Chair: Brown, Philip N. | University of Colorado Colorado Springs |
|
13:30-14:15, Paper WeB11.1 | |
Topology of Nash Equilibrium Set with Quadratic Vector Payoff Functions |
|
Guo, Zehui | Tokyo Institute of Technology |
Hayakawa, Tomohisa | Tokyo Institute of Technology |
Keywords: Game theory, Agents-based systems, Finance
Abstract: The condition when the Nash equilibrium set has a simple topological structure is presented. Specifically, we first define the terms `weakly simplicial' and `simplicial' for the Nash equilibrium set characterization problem (NESCP), which is a generalization of the terms for the Pareto optimality characterization problem. Then, we give a condition for when the NESCP is weakly simplicial so that the Nash equilibrium set is compact. After that, we present a condition for the NESCP to be simplicial, when each face of the simplices corresponds to a face of the Nash equilibrium set. Finally, we give a couple of examples to illustrate our results.
|
|
14:15-15:00, Paper WeB11.2 | |
From Discrete to Continuous Best-Response Dynamics: Discrete Fluctuations Do Not Scale with Population Size |
|
Aghaeeyan, Azadeh | Brock University |
Ramazi, Pouria | Brock University |
Keywords: Game theory, Agents-based systems
Abstract: In binary decision-makings, individuals often go for a common or rare action. In the framework of evolutionary game theory the best-response update rule can be used to model this dichotomy. Those who prefer a common action are called coordinators and those who prefer a rare one are called anticoordinators. A finite mixed population of the two types may undergo perpetual fluctuations, the characterization of which appears to be challenging. It is particularly unknown, whether the fluctuations scale with population size. To fill this gap, we approximate the discrete finite population dynamics of coordinators and anticoordinators with the corresponding mean dynamics in the form of (semi-)continuous differential inclusions. We show that the family of the state sequences of the discrete dynamics for increasing population sizes forms a generalized stochastic approximation process for the differential inclusion. On the other hand, we show that the differential inclusions always converge to an equilibrium. This implies that the reported perpetual fluctuations in the finite discrete dynamics of coordinators and anticoordinators do not scale as the population size do. The results encourage to first analyze the often simpler (semi-)continuous mean dynamics of discrete population dynamics as the continuous dynamics partly reveal the asymptotic behaviour of the discrete dynamics.
|
|
14:15-15:00, Paper WeB11.3 | |
On the Intrinsic Fragility of the Price of Anarchy |
|
Seaton, Joshua | University of Colorado at Colorado Springs |
Brown, Philip N. | University of Colorado Colorado Springs |
Keywords: Game theory, Agents-based systems
Abstract: In problems of multiagent coordination, it is known that equilibria arising from self-interested behavior can be highly inefficient. This inefficiency, quantified by the Price of Anarchy (PoA), has been studied for a wide range of problems. Typically, only worst-case boundsare known on the PoA, leaving significant questions: are highly inefficient equilibria likely? Are they stable? In this paper, we comprehensively prove that worst-case Nash equilibria in a large class of games are not stable, in a game-theoretically relevant sense: at these equilibria, every agent can switch to its system-optimal action without incurring any loss in individual payoff. We parameterize the inefficiency of equilibria by a new measure of stability which quantifies agents' aggregate "satisfaction" with their equilibrium choices; we show that if agents are highly "satisfied" with their actions at a particular equilibrium (i.e., the equilibrium is highly stable), then that equilibrium must be relatively efficient. Our proof techniques are highly general and apply to all lambda-mu-smooth games, which account for a large fraction of games with known PoA bounds including routing, scheduling, resource allocation, and auctions. We also perform numerical experiments to contextualize our work in specific games of interest.
|
|
14:15-15:00, Paper WeB11.4 | |
A Robust Distributed Nash Equilibrium Seeking Algorithm for Aggregative Games under Byzantine Attacks |
|
Zhao, Jishu | Department of Control Science and Engineering, Tongji University |
Yi, Peng | Tongji University |
Keywords: Game theory, Distributed control, Agents-based systems
Abstract: This paper investigates the Nash equilibrium seeking problem for aggregative games, where some players are under Byzantine attacks. Since the previous distributed algorithms might be seriously affected by Byzantine attacks that inject malicious information to the system, we propose a resilient distributed geometric-median based method to improve the robustness of the algorithm. When less than half of the players are Byzantine attackers and under strongly monotone assumption on the pseudo-gradient mapping, we show that the algorithm converges linearly to a neighborhood of the Nash Equilibrium with an error bound related to the ratio of Byzantine attackers. Finally, numerical examples are implemented to demonstrate the robustness of the proposed algorithm under various types of Byzantine attacks.
|
|
14:15-15:00, Paper WeB11.5 | |
Large-Scale Multi-Agent System Optimization with Fixed Final Density Constraints: An Imbalanced Mean-Field Game Theory |
|
Dey, Shawon | University of Nevada, Reno |
Xu, Hao | University of Nevada, Reno |
Keywords: Game theory, Distributed control, Stochastic optimal control
Abstract: This paper presents a novel distributed optimization algorithm for large-scale multi-agent systems (LS-MAS), particularly with a given fixed final density constraint. Although the Mean field game (MFG) theory provides a distribution solution to overcome the ``Curse of dimensionality" in LS-MAS, it significantly sacrifices LS-MAS optimality and also not be capable of achieving arbitrary fixed final probability density function (PDF) constraint. To overcome these challenges, a novel Imbalanced Mean-Field Game (Imb-MFG) theory is developed along with an adaptive PDF decomposition algorithm and distributed reinforcement learning. Specifically, an induction-based PDF parameter estimation is developed to decompose the final density constraints into multiple imbalanced norm distributions. Then, the Imb-MFG theory is designed by integrating multi-group MFG with a constrained K-means clustering algorithm. To solve the developed Imb-MFG and further obtain the distributed optimal solution, a multi-actor-critic-mass (Multi-ACM) algorithm is designed to learn the solution of multi-group coupled Hamilton-Jacobi-Bellman (HJB) and Fokker-Planck-Kolmogorov (FPK) equations simultaneously. Finally, the convergence of the developed Multi-ACM algorithm is guaranteed through Lyapunov analysis.
|
|
14:15-15:00, Paper WeB11.6 | |
On the Optimal Cost and Asymptotic Stability in Two-Player Zero-Sum Set-Valued Hybrid Games |
|
J. Leudo, Santiago | University of California, Santa Cruz |
Ferrante, Francesco | Universita Degli Studi Di Perugia |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Game theory, Hybrid systems, Optimal control
Abstract: In this paper, we formulate a two-player zero-sum game under dynamic constraints formulated in terms of a hybrid inclusion. The game consists of a min-max problem involving a cost functional associated to the actions and corresponding (potentially nonunique) solutions to the system. We present sufficient conditions given in terms of Hamilton-Jacobi- Isaacs-like equations to establish a bound on the worst-case cost under the optimal strategy and to exactly evaluate it. Under additional conditions, we show that the proposed optimal state-feedback laws render a set of interest pre-symptotically stable for the resulting hybrid closed-loop system. The results are illustrated in a numerical example.
|
|
WeB12 |
Dockside 9 |
Predictive Control for Nonlinear Systems I |
Regular Session |
Chair: Huan, Xun | University of Michigan |
Co-Chair: Shi, Yang | University of Victoria |
|
13:30-13:45, Paper WeB12.1 | |
Eco-Driving for Connected and Automated Vehicles in Mixed Traffic Urban Environments with Signalized Intersections |
|
Ebrahimi, Alireza | Clemson University |
Mosharafian, Sahand | University of Georgia |
Mohammadpour Velni, Javad | Clemson University |
Keywords: Automotive systems, Predictive control for nonlinear systems, Autonomous systems
Abstract: This paper presents a trajectory planning method for connected and automated vehicles (CAVs) operating on a two-lane urban road that includes signalized intersections with a mix of human-driven vehicles (HVs) and CAVs. The proposed approach aims to find the optimal trajectory for CAVs, considering factors such as energy consumption, travel time and passengers' comfort. To achieve this objective, each CAV utilizes data obtained from various sources. Vehicle-to-infrastructure (V2I) communication enables access to information such as traffic light cycle lengths, timings and positions. Vehicle-to-vehicle (V2V) communication allows CAVs to gather information from other CAVs, including their positions, velocities and predicted trajectories. Additionally, on-board sensors can also provide data about surrounding vehicles. Dynamic programming (DP) is employed to predict the velocity profile of the CAVs over a long horizon, incorporating information from traffic lights and the vehicle's specifications. The predicted velocity profile is then fed into a model predictive controller (MPC) to follow the CAV's trajectory. The MPC is capable of handling disturbances encountered during the maneuvers, such as stopping behind a red traffic light or performing lane changes. Extensive simulation studies demonstrate how various objectives in terms of saving in energy consumption and travel time are achieved.
|
|
13:45-14:00, Paper WeB12.2 | |
Data-Driven Model Predictive Control of a Nonlinear Ball-On-A-Wheel System |
|
Kruse, Niklas | University of Rostock |
Wache, Alexander | University of Rostock |
Aschemann, Harald | University of Rostock |
Starke, Jens | University of Rostock |
Keywords: Identification for control, Mechatronics, Predictive control for nonlinear systems
Abstract: The topic of this paper is the design of a data-driven model predictive controller for a nonlinear ball-on-a-wheel system. A Koopman predictor is identified using simulation data of a ball-on-a-wheel system model. The challenge lies in using data, which is characterized by the instability of the given system. This limits the time horizon during which meaningful time series can be extracted. Within this scenario, the predictor is included in a model predictive control scheme. This controller successfully tracks a specified non-flat output trajectory subject to input and output constraints. Finally, the prediction quality of the Koopman predictor is compared with a linear predictor based on dynamic mode decomposition with control.
|
|
14:00-14:15, Paper WeB12.3 | |
Deep Koopman-Based Control of Quality Variation in Multistage Manufacturing Systems |
|
Chen, Zhiyi | University of Michigan |
Maske, Harshal | Ford Motor Company |
Upadhyay, Devesh | Ford |
Shui, Huanyi | Ford Motor Company |
Huan, Xun | University of Michigan |
Ni, Jun | University of Michigan |
Keywords: Manufacturing systems and automation, Predictive control for nonlinear systems, Control applications
Abstract: This paper presents a modeling-control synthesis to address the quality control challenges in multistage manufacturing systems (MMSs). A new feedforward control scheme is developed to minimize the quality variations caused by process disturbances in MMSs. Notably, the control framework leverages a stochastic deep Koopman (SDK) model to capture the quality propagation mechanism in the MMSs, highlighted by its ability to transform the nonlinear propagation dynamics into a linear one. Two roll-to-roll case studies are presented to validate the proposed method and demonstrate its effectiveness. The overall method is suitable for nonlinear MMSs and does not require extensive expert knowledge.
|
|
14:15-14:30, Paper WeB12.4 | |
Training and Generalization Errors for Underparameterized Neural Networks |
|
Martin Xavier, Daniel | Université Paris-Saclay, CentraleSupélec, ENS Paris-Saclay, CNRS |
Chamoin, Ludovic | Université Paris-Saclay, CentraleSupélec, ENS Paris-Saclay, CNRS |
Fribourg, Laurent | CNRS |
Keywords: Neural networks, Predictive control for nonlinear systems, Network analysis and control
Abstract: It has been theoretically explained, through the notion of Neural Tangent Kernel, why the training error of overparameterized networks converges linearly to 0. In this work, we focus on the case of small (or underparameterized) networks. An advantage of small networks is that they are faster to train while often retaining sufficient precision to perform useful tasks in many applications. Our main theoretical contribution is to prove that the training error of small networks converges linearly to a (nonnull) constant, of which we give a precise estimate. We verify this result on a neural network of 10 neurons simulating a Model Predictive Controller. We also observe that an upper bound of the generalization error follows a double-peak curve as the number of training data increases.
|
|
14:30-14:45, Paper WeB12.5 | |
Tube MPC-Based Tracking Control of AUVs Using Contraction Metric |
|
Zhang, Kunwu | University of Victoria |
Shi, Yang | University of Victoria |
Keywords: Predictive control for nonlinear systems, Constrained control, Optimal control
Abstract: This paper investigates trajectory tracking of autonomous underwater vehicles (AUV) subject to thruster saturation and environmental disturbances. We propose a tube-based model predictive control (MPC) framework to robustly stabilize the AUV tracking error defined in the local frame. The essence of our design centers on leveraging a robust control contraction metric (RCCM) to construct a disturbance invariant set, ensuring bounded deviation between the actual and nominal system states under the RCCM-based feedback control law. Subsequently, an outer approximation of this RCCM-based invariant set is developed to design the tube cross-section and tighten the input constraint. The resulting RCCM-based tube MPC (RCCM-MPC) scheme is independent of the spatially varying metric, enhancing the computational efficiency of the proposed scheme. Under mild assumptions, we prove that the proposed RCCM-MPC scheme is recursively feasible and can asymptotically steer the AUV tracking error to a small region around the origin. Simulation results demonstrate the effectiveness of proposed RCCM-MPC approach.
|
|
14:45-15:00, Paper WeB12.6 | |
Learning-Based Distributed Model Predictive Control with State-Dependent Uncertainty Using Neural Network |
|
Tong, Junbo | Tsinghua University |
Du, Shuhan | Tsinghua University |
Fan, Wenhui | Tsinghua University |
Keywords: Predictive control for nonlinear systems, Machine learning, Stability of nonlinear systems
Abstract: This paper presents a novel learning-based distributed model predictive control strategy addressing the consensus problem in multi-agent systems with unknown state-dependent uncertainties. Assuming the availability of input-output data, a neural network-based model is used to describe state-dependent uncertainties. The Lipschitz constant of the neural network is estimated, and an upper bound on the prediction error of the neural network is derived based on this estimation. The multi-agent system is proven to be input-to-state stable concerning the prediction error. A numerical example is introduced to validate the theoretical results of the proposed controller. A comparison with other learning-based controllers illustrates the efficiency of the neural network-based controller. This research contributes to the advancement of learning-based model predictive control in scenarios involving multi-agent systems with inaccurate models.
|
|
WeB13 |
Richmond |
Constrained Control I |
Regular Session |
Chair: Liu, Changliu | Carnegie Mellon University |
Co-Chair: Danielson, Claus | University of New Mexico |
|
13:30-13:45, Paper WeB13.1 | |
Composing Control Barrier Functions for Complex Safety Specifications |
|
Molnar, Tamas G. | Wichita State University |
Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Lyapunov methods
Abstract: The increasing complexity of control systems necessitates control laws that guarantee safety w.r.t. complex combinations of constraints. In this letter, we propose a framework to describe compositional safety specifications with control barrier functions (CBFs). The specifications are formulated as Boolean compositions of state constraints, and we propose an algorithmic way to create a single continuously differentiable CBF that captures these constraints and enables safety-critical control. We describe the properties of the proposed CBF, and we demonstrate its efficacy by numerical simulations.
|
|
13:45-14:00, Paper WeB13.2 | |
Minimum-Time Planar Paths with up to Two Constant Acceleration Inputs and L_2 Velocity and Acceleration Constraints |
|
Montano, Victor | University of Houston |
Zhao, Haoran | University of Houston |
Abdurahiman, Nihal | Department of Surgery, Hamad Medical Corporation, Doha, Qatar |
Navkar, Nikhil Vishwas | Department of Surgery, Hamad Medical Corporation, Doha, Qatar |
Becker, Aaron | University of Houston |
Keywords: Constrained control, Algebraic/geometric methods
Abstract: Given starting and ending positions and velocities, L2 bounds on the acceleration and velocity, and the restriction to no more than two constant control inputs, this paper provides routines to compute the minimal-time path. Closed form solutions are provided for reaching a position in minimum time with and without a velocity bound, and for stopping at the goal position. A numeric solver is used to reach a goal position and velocity with no more than two constant control inputs. If a cruising phase at the terminal velocity is needed, this requires solving a non-linear equation with a single parameter. Code is provided on GitHub https://github.com/RoboticSwarmControl/MinTimeL2pathsConstraints/.
|
|
14:00-14:15, Paper WeB13.3 | |
Data-Driven Synthesis of Configuration-Constrained Robust Invariant Sets for Linear Parameter-Varying Systems |
|
Mejari, Manas | University of Applied Sciences and Arts of Southern Switzerland |
Mulagaleti, Sampath Kumar | IMT School of Advanced Studies Lucca |
Bemporad, Alberto | IMT School for Advanced Studies Lucca |
Keywords: Constrained control, Linear parameter-varying systems, Robust control
Abstract: We present a data-driven method to synthesize robust control invariant (RCI) sets for linear parameter-varying (LPV) systems subject to unknown but bounded disturbances. A finite-length data set consisting of state, input, and scheduling signal measurements is used to compute an RCI set and invariance-inducing controller, without identifying an LPV model of the system. We parameterize the RCI set as a configuration-constrained polytope whose facets have a fixed orientation and variable offset. This allows us to define the vertices of the polytopic set in terms of its offset. By exploiting this property, an RCI set and associated vertex control inputs are computed by solving a single linear programming (LP) problem, formulated based on a data-based invariance condition and system constraints. We illustrate the effectiveness of our approach via two numerical examples. The proposed method can generate RCI sets that are of comparable size to those obtained by a model-based method in which exact knowledge of the system matrices is assumed. We show that RCI sets can be synthesized even with a relatively small number of data samples, if the gathered data satisfy certain excitation conditions.
|
|
14:15-14:30, Paper WeB13.4 | |
Safety Index Synthesis with State-Dependent Control Space |
|
Chen, Rui | Carnegie Mellon University |
Zhao, Weiye | Carnegie Mellon University |
Liu, Changliu | Carnegie Mellon University |
Keywords: Constrained control, Lyapunov methods, Optimization
Abstract: This paper introduces an approach for synthesizing feasible safety indices to derive safe control laws under state-dependent control spaces. The problem, referred to as Safety Index Synthesis (SIS), is challenging because it requires the existence of feasible control input in all states and leads to an infinite number of constraints. The proposed method leverages Positivstellensatz to formulate SIS as a nonlinear programming (NP) problem. We formally prove that the NP solutions yield safe control laws with two imperative guarantees: forward invariance within user-defined safe regions and finite-time convergence to those regions. A numerical study validates the effectiveness of our approach.
|
|
14:30-14:45, Paper WeB13.5 | |
Constraint Admissible Positive Invariant Sets for Vehicles in SE(3) |
|
Danielson, Claus | University of New Mexico |
Brandt, Teo | University of New Mexico |
Keywords: Constrained control, Aerospace, Robust control
Abstract: This paper characterizes a class of constraint admissible positive invariant (CAPI) sets for vehicles with constrained nonlinear dynamics operating in SE(3). We present a robust linearization wherein the nonlinearities are eliminated using a combination of feedback-linearization and norm-bounded disturbances. We show that robust positive invariant sets for this robust linearization are positive invariant sets for the nonlinear dynamics. We present a trio of linear matrix inequalities for synthesizing the desired positive invariant sets. We present another pair of linear matrix inequalities which ensure that the feedback-linearizing controller is constraint admissible. We empirically show our candidate CAPI sets are invariant and constraint admissible through nonlinear simulation and compare with a candidate CAPI set synthesized using a non-robust linearization
|
|
14:45-15:00, Paper WeB13.6 | |
Safe Whole-Body Task Space Control for Humanoid Robots |
|
Paredes, Victor | The Ohio State University |
Hereid, Ayonga | Ohio State University |
Keywords: Constrained control, Optimal control, Autonomous robots
Abstract: Complex robotic systems require whole-body controllers to handle contact interactions, handle closed kinematic chains, and track task-space control objectives. However, for many applications, safety-critical controllers are essential to steer away from undesired robot configurations and prevent unsafe behaviors. A prime example is legged robotics, where we can have tasks such as balance control, regulation of torso orientation, and, most importantly, walking. As the coordination of multi-body systems is non-trivial, following a combination of those tasks might lead to configurations that are deemed dangerous, such as stepping on its support foot during walking, leaning the torso excessively, or producing excessive centroidal momentum, resulting in non-human-like walking. To address these challenges, we propose a formulation of an inverse dynamics control enhanced with control barrier functions that allow general higher-order relative degree safe sets for robotic systems with numerous degrees of freedom. Our approach utilizes a quadratic program that respects closed kinematic chains, minimizes the control objectives, and imposes desired constraints on the Zero Moment Point, friction cone, and torque. More importantly, it also ensures the forward invariance of a general user-defined high Relative-Degree safety set. We demonstrate the effectiveness of our method by applying it to the 3D biped robot Digit, both in simulation and with hardware experiments.
|
|
WeB14 |
Wellington |
Modeling, Control and Estimation of Soft Material and Continuum Systems |
Invited Session |
Chair: Vikas, Vishesh | University of Alabama |
Co-Chair: Chen, Zheng | University of Houston |
Organizer: Vikas, Vishesh | University of Alabama |
Organizer: Gilbert, Hunter B. | Louisiana State University |
Organizer: Zhao, Jianguo | Colorado State University |
Organizer: Tan, Xiaobo | Michigan State University |
|
13:30-13:45, Paper WeB14.1 | |
Physics-Informed Online Estimation of Stiffness and Shape of Soft Robotic Manipulators (I) |
|
Fairchild, Preston | Michigan State University |
Mei, Yu | Michigan State University |
Tan, Xiaobo | Michigan State University |
Keywords: Robotics, Identification for control, Modeling
Abstract: Soft robots are designed to be highly compliant to reduce the risk of injury or damage to humans and the environment. This compliance can lead to large deformations when handling payloads, significantly altering the operation of the robot. The stiffness of a soft robot, actively tuned or passively influenced by inputs (e.g., pneumatic pressures), is an important state variable, yet difficult to measure directly for the control of a soft robot. In this paper we propose a novel physics-informed approach to online estimation of the stiffness and shape of a soft manipulator under a payload, based on measurements that are readily available (e.g., positions of five points on the manipulator, or the position and orientation of the tip). The same approach is also adapted for estimating the payload when the stiffness is known. The proposed method is illustrated and supported with experimental results on a soft pneumatic actuator. In particular, it is shown to produce more accurate shape estimate than a commonly adopted piecewise constant curvature (PCC) model (which cannot produce a stiffness estimate), with an average error 57% smaller than the PCC method. The stiffness values estimated are also shown to be consistent and fall within the expected physical range.
|
|
13:45-14:00, Paper WeB14.2 | |
Morphological Computation by Exploiting Partial Feedback Linearizable Underactuated Soft-Bodied Systems (I) |
|
Haghshenas-Jaryani, Mahdi | New Mexico State University |
Keywords: Feedback linearization, Nonlinear systems identification, Learning
Abstract: The emerging concept of morphological computation has shown that the complex dynamics of soft robots can be utilized as computational resources that facilitate their control. This work presents new insights into the capabilities of underactuated soft-bodied systems, as a class of partial feedback linearizable systems, for emulating dynamic responses of arbitrary nonlinear systems with stable limit cycles. An underactuated mass-spring-damper network (MSDN) was used as a physical reservoir. The dynamic equations are derived for the actuated and unactuated subsystems. A control law was derived to linearize the actuated subsystem, while the remaining portion is nonlinear and represents the internal dynamics. The stability of the linearized subsystem and corresponding zero dynamics were discussed for linear and nonlinear output functions. The effectiveness of the physical reservoir computing was demonstrated for emulating the Van der Pol (VDP) and Quadratic limit cycle (QLC) dynamics. A dynamics-informed learning process was established, which simulates the dynamics with a 1 kHz rate and trains the readout weights using the "teacher forcing" method subjected to the stability constraints with a 1 Hz rate. After training the readout weights, the feedback loop was closed, and the system was simulated where both oscillators were regenerated.
|
|
14:00-14:15, Paper WeB14.3 | |
Modeling and Control of Dielectric Actuator Enabled Prosthetic Finger (I) |
|
Kaaya, Theophilus | University of Houston |
Koc, Denizcan | University of Houston |
Zhu, Qiang | University of Houston |
Chen, Zheng | University of Houston |
Keywords: Modeling, H-infinity control, Smart structures
Abstract: Dielectric actuators (DEAs) are lauded for their inherent softness, lightweight composition, and remarkable flexibility. These qualities empower DEAs with a unique capacity for finely tuned control, rendering them exceptionally suited for facilitating human-robot interactions. Their integration into prosthetic fingers introduces a unique advantage, enabling both delicate precision through soft actuation and broader manipulation via rigid activation. This paper presents a physics-based model for a prosthetic finger driven by DEAs, an effort enhanced by incorporation of feedback control mechanisms utilizing a self-sensing approach. This work not only underscores the potential of DEAs in creating lifelike human prostheses with enhanced dexterity, but also underscores their role in expanding the horizons of robotics for augmented human capabilities.
|
|
14:15-14:30, Paper WeB14.4 | |
Modeling and Inverse Compensation of the Non-Smooth Coiling-Induced Actuation in Twisted and Coiled String Actuators |
|
Konda, Revanth | University of Nevada Reno |
Zhang, Jun | University of Nevada Reno |
Keywords: Modeling, Identification, Mechatronics
Abstract: Twisted and coiled string (TCS) actuators are recently discovered motor-driven compliant actuators which generate actuation in two sequential phases, namely, twisting and coiling. TCS actuators' coiling-induced actuation is under-explored and exhibits unique characteristics: At a constant motor speed, twisting of strings generates contraction at increasing rate as the coil evolves from coil initiation stage to the coil formation stage. As the coiling process is discrete with individual coils forming sequentially, the aforementioned behavior generates non-smooth input-output correlation. To facilitate the application of TCS actuators in robotic and mechatronic devices, modeling and control of their behavior needs to be established. As a first step towards this objective, in this study, modeling and compensation for the unique non-smooth behavior of the TCS actuators in the coiling phase is presented. The proposed strategies leverage the geometric configuration of the coils. The model is formulated by relating the bias angle of each coil to the corresponding motor twist inputs. The proposed strategies were experimentally validated to perform satisfactorily: The model validation and control errors with the proposed strategies were 1.27%, and 1.11% of the overall actuation range, respectively.
|
|
14:30-14:45, Paper WeB14.5 | |
Efficient Learning and Control of String-Type Artificial Muscle Driven Robotic Systems |
|
Tao, Jiyue | Peking University |
Rajendran, Sunil Kumar | BSS Technologies Inc |
Zhang, Yunsong | Peking University |
Zhang, Feitian | Peking University |
Zhao, Dexin | National Innovation Institute of Defense Technology |
Shen, Tongsheng | National Innovation Institute of Defense Technology |
Keywords: Mechatronics, Robotics, Machine learning
Abstract: This paper investigates the learning-based control of robotic systems driven by string-type artificial muscles. Due to the highly nonlinear dynamics of the actuators and the complicated mechanical structure, it is typically very challenging to design traditional model-based controllers that exhibit desired control performances. With the rapid development of machine learning techniques, deep reinforcement learning (DRL) algorithms have been utilized to control a variety of complex robotic systems. However, these DRL algorithms usually require a huge amount of training data and iterations to converge, which is generally unacceptable for the robotic systems of interest. Therefore, this paper designs an efficient learning-based control algorithm, aiming to improve the training efficiency for robotic systems driven by string-type artificial muscle actuators. Specially, two training improvements are proposed including imitation learning and data augmentation. This paper applies the proposed learning methods to three popular DRL algorithms and tests the control performances in three case studies using three string-type artificial muscle-driven robots, including a parallel robotic wrist, a two degrees-of-freedom (DOF) robotic eye and a robotic finger. Simulation results show that the proposed learning-based control methods significantly accelerate the convergence speed and improve the data efficiency in all the three case studies.
|
|
WeB15 |
Yonge |
Estimation and Control of Distributed Parameter Systems I |
Invited Session |
Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Co-Chair: Hu, Weiwei | University of Georgia |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
13:30-13:45, Paper WeB15.1 | |
Limit Cycle Generation in Van Der Pol Flavored PDE Setting (I) |
|
Aguilar, Luis T. | Instituto Politecnico Nacional |
Orlov, Yury | CICESE |
Keywords: Distributed parameter systems, Stability of nonlinear systems, Lyapunov methods
Abstract: The nonlinear Van der Pol oscillator is well-recognized for modeling limit cycles in electrical circuits. It was recently modified to a model whose parameters explicitly stood for the limit cycle amplitude and frequency. Due to this, the latter model has successfully been used in control engineering as a reference model for self-generation of limit cycles in the closed-loop. The popular Lotka-Volterra predator-prey partial differential equation (PDE) is another model which is capable of self-generating periodic orbits in infinite-dimensional setting. The present work aims to flavor the Van der Pol equation in PDE setting. It is shown that similar to the modified Van der Pol oscillator, its PDE-flavored model explicitly relies on the amplitude and frequency of the periodic orbit, which is self-generated by the model while also possessing a unique equilibrium in the origin similar to that of its finite-dimensional progenitor. Theoretical analysis is additionally supported by simulation results.
|
|
13:45-14:00, Paper WeB15.2 | |
Rates of Convergence in a Class of Native Spaces for Reinforcement Learning and Control (I) |
|
Bouland, Ali | Virginia Tech |
Niu, Shengyuan | Virginia Tech |
Paruchuri, Sai Tej | Lehigh University |
Kurdila, Andrew J. | Virginia Tech |
Burns, John A | Virginia Tech |
Schuster, Eugenio | Lehigh University |
Keywords: Optimal control, Machine learning, Statistical learning
Abstract: This paper studies convergence rates for some value function approximations that arise in a collection of reproducing kernel Hilbert spaces (RKHS) {mathrm{ H}}(Omega). By casting an optimal control problem in a specific class of native spaces, strong rates of convergence are derived for the operator equation that enables offline approximations that appear in policy iteration. Explicit upper bounds on error in value function and control law approximations are derived in terms of power function mathcal {P}_{mathrm{ H,N}} for the space of finite dimensional approximants rm H_{N} in the native space {mathrm{ H}}(Omega). These bounds exhibit a distinctive geometric nature, refine and build upon some well-known, now classical results concerning the convergence of approximations of value functions.
|
|
14:00-14:15, Paper WeB15.3 | |
Distributed Dynamic Encirclement Control for First-Order Multi-Agent Systems with Communication Delay (I) |
|
Hasanzadeh, Milad | Texas Tech University |
Tang, Shuxia | Texas Tech University |
Keywords: Cooperative control, Delay systems, Distributed parameter systems
Abstract: This paper presents a novel distributed encirclement control for first-order multi-agent systems that specifically considers the issue of communication delay. Unlike previous studies, this research analyzes its impact. One key aspect of our approach is the utilization of two distributed estimators affected by communication delay to accurately estimate the location of the geometric center of targets and to estimate the maximum distance of targets to the estimated geometric center of targets. The relative desired position of agents is achievable by employing the estimated maximum distance of targets to the estimated geometric center of targets. The employment of the estimated profile of targets and the relative desired position of agents allows for improved tracking and coordination among the agents, enhancing the effectiveness of the engaged encirclement control. To assess the stability of the closed-loop system, we employ the Lyapunov technique for analysis. Finally, we conduct simulations to evaluate the effectiveness of our proposed methodology.
|
|
14:15-14:30, Paper WeB15.4 | |
Predictor-Based Prescribed-Time Output Feedback for a Parabolic PDE (I) |
|
Zekraoui, Salim | Centrale Lille Institute |
Espitia, Nicolas | University of Lille - CNRS - CRIStAL Lab |
Perruquetti, Wilfrid | Ecole Centrale De Lille |
Krstic, Miroslav | University of California, San Diego |
Keywords: Distributed parameter systems, Delay systems, Control applications
Abstract: In this paper, we consider a 1D reaction-diffusion system with boundary input delay and propose a general method for studying the problem of prescribed-time output boundary stabilization. We first reformulate the system as a PDE-PDE cascade system (i.e., a cascade of a linear transport partial differential equation (PDE) with a linear reaction-diffusion PDE), where the transport equation represents the effect of the input delay. We then apply a time-varying infinite-dimensional backstepping transformation to convert the cascade system and the proposed observer system into two prescribed-time stable (PTS) target systems. The stability analysis is conducted on the target systems, and the desired stability property is transferred back to the closed-loop system and the error system using the inverse transformation. The effectiveness of the proposed approach is demonstrated through numerical simulations.
|
|
14:30-14:45, Paper WeB15.5 | |
Practical Observers for Velocity Field Estimation of Normal Flow Equations (I) |
|
Alessandri, Angelo | University of Genoa |
Bagnerini, Patrizia | University of Genoa |
Gaggero, Mauro | National Research Council of Italy |
Mantelli, Luca | Università Di Genova |
Keywords: Distributed parameter systems, Estimation, Modeling
Abstract: We propose a new approach for the estimation of the velocity field of normal flow partial differential equations by means of practical observers. In more detail, we suppose to know the evolution over time in each point of the domain of the solution of a normal flow equation subject to an unknown velocity field, which is assumed to be smooth but time-varying in general. Numerical results are presented to showcase the effectiveness of the proposed approach in one- and two-dimensional examples. In particular, the performance of the practical observer has been tested also to estimate the rate of spread of a fire propagation model based on the normal flow equation. Predicting wildfire evolution presents a multitude of challenges due to the variable nature of wildfires (influenced by a range of factors, from weather conditions and vegetation type to landscape topography), thus motivating the need to have at disposal an accurate estimate of the velocity vector to predict the evolution of the fire front.
|
|
14:45-15:00, Paper WeB15.6 | |
Distributed Flocking Control with Ellipsoidal Level Sets |
|
Hastedt, Philipp | Hamburg University of Technology |
Datar, Adwait | Hamburg University of Technology |
Kocev, Kliment | TUHH |
Werner, Herbert | Hamburg University of Technology |
Keywords: Agents-based systems, Networked control systems, Distributed control
Abstract: This paper proposes a framework for two- and three-dimensional distributed flocking control based on potential functions with ellipsoidal level sets. Contrary to most flocking frameworks in the existing literature, the presented approach allows for tuning the inter-agent distances in different directions individually. By defining a desired ellipsoidal lattice configuration based on an elliptic transformation matrix, the proposed flocking protocol can be formulated as a generalization of established spherical flocking protocols. Furthermore, it is shown how standard stability analysis can be extended seamlessly to the proposed framework without additional difficulties or assumptions. Simulation scenarios demonstrate the superior performance of the proposed scheme compared to existing ones.
|
|
WeB16 |
Dockside 4 |
Wind Turbines and Wind Farms |
Invited Session |
Chair: Sinner, Michael | National Renewable Energy Laboratory |
Co-Chair: Mulders, Sebastiaan Paul | Delft University of Technology |
Organizer: Mulders, Sebastiaan Paul | Delft University of Technology |
Organizer: Sinner, Michael | National Renewable Energy Laboratory |
Organizer: Bay, Christopher | National Renewable Energy Laboratory |
Organizer: Fleming, Paul | National Renewable Energy Laboratory |
Organizer: van Wingerden, Jan-Willem | Delft University of Technology |
|
13:30-13:45, Paper WeB16.1 | |
Short-Term Wind Forecasting Using Surface Pressure Measurements (I) |
|
Abootorabi, Seyedalireza | University of Texas at Dallas |
Leonardi, Stefano | The University of Texas at Dallas |
Rotea, Mario | University of Texas at Dallas |
Zare, Armin | University of Texas at Dallas |
Keywords: Kalman filtering, Linear parameter-varying systems, Energy systems
Abstract: We propose a short-term wind forecasting framework that enables model-based control systems to preemptively adapt ahead of atmospheric variations in improving turbine efficiency and reducing structural loads and failures. Our approach relies on a combination of linear stochastic estimation and Kalman filtering algorithms to assimilate and process real-time nacelle-mounted anemometer and surface air-pressure readings with the predictions of a stochastic reduced-order model of the hub-height velocity field. Our results serve as a proof of concept for a wind forecasting strategy based on ground-level pressure sensor measurements.
|
|
13:45-14:00, Paper WeB16.2 | |
Analysis of Extremum Seeking Control for Wind Turbine Torque Controller Optimization by Aerodynamic and Generator Power Objectives (I) |
|
Mulders, Sebastiaan Paul | Delft University of Technology |
Gallo, Alexander J. | Delft University of Technology |
Rotea, Mario | University of Texas at Dallas |
Keywords: Adaptive control, Energy systems
Abstract: Wind turbines degrade over time, resulting in varying structural, aeroelastic, and aerodynamic properties. In contrast, the turbine controller calibrations generally remain constant, leading to suboptimal performance and potential stability issues. The calibration of wind turbine controller parameters is therefore of high interest. To this end, several adaptive control schemes based on extremum seeking control (ESC) have been proposed in the literature. These schemes have been successfully employed to maximize turbine power capture by optimization of the Kw^2-type torque controller. In practice, ESC is performed using electrical generator power, which is easily obtained. This paper analyses the feasibility of torque gain optimization using aerodynamic and generator powers. It is shown that, unlike aerodynamic power, using the generator power objective limits the dither frequency to lower values, reducing the convergence rate unless the phase of the demodulation ESC signal is properly adjusted. A frequency-domain analysis of both systems shows distinct phase behavior, impacting ESC performance. A solution is proposed by constructing an objective measure based on an estimate of the aerodynamic power.
|
|
14:00-14:15, Paper WeB16.3 | |
Voltage Restoration in MVDC Shipboard Microgrids with Economic Nonlinear Model Predictive Control (I) |
|
Putri, Saskia | Lehigh University |
Hosseinipour, Ali | Lehigh University |
Ge, Xiaoyu | Lehigh University |
Moazeni, Farrah | Lehigh University |
Khazaei, Javad | Lehigh University |
Keywords: Maritime control, Power systems, Optimal control
Abstract: Future Naval Microgrids (MGs) will include hybrid energy storage systems (ESS), including battery and supercapacitors to respond to emerging constant power loads (CPLs) and fluctuating pulsed power loads (PPLs). Voltage regulation of naval microgrids and power sharing among these resources become critical for success of a mission. This paper presents a novel control strategy using nonlinear model predictive controller embedded with a complex droop control architecture for voltage restoration and power sharing in medium voltage DC (MVDC) Naval MGs. The complex droop control ensures allocating supercapacitors (SCs) for high-frequency loads (i.e., PPLs), while battery energy storage system (BESS) and auxiliary generators share the steady-state load (i.e., CPL). Compared to state-of-the-art control of the naval ship MGs that relies on linear models, the proposed method incorporates the nonlinear behavior of the MGs in the closed-loop control framework via nonlinear model predictive control (NMPC). A reduced order representation of the MVDC dynamic is employed as the prediction model, augmented with a multi-objective, constraints-based, optimal control formulation. The results demonstrate the effectiveness of the proposed control framework for voltage restoration and power sharing of resources in naval MGs.
|
|
14:15-14:30, Paper WeB16.4 | |
Reinforcement Learning Control for Enhancing Marine Hydrokinetic Turbine Energy Generation (I) |
|
Barton, Samuel | Oregon State University |
Brekken, Ted | Oregon State University |
Cao, Yue | Oregon State University |
Keywords: Adaptive control, Fault accomodation, Electrical machine control
Abstract: This paper proposes a reinforcement learning-based method to maximize power generation for a direct-drive marine hydrokinetic turbine. A high levelized cost of energy (LCOE) is preventative in the widespread adoption of many marine energy conversion technologies. A straightforward way to reduce LCOE is to increase conversion efficiency and ensure maximum energy generation. The proposed method utilizes a damping control methodology, varying applied generator torque via a linear relationship between the applied damping coefficient and rotor speed. A state-action-reward-state-action (SARSA) algorithm has been used to learn the optimal con- trol action for a given flow velocity. The proposed SARSA methodology uses Gaussian radial basis functions to create a three-dimensional surface to estimate the relationship between damping coefficient, incoming flow velocity, and coefficient of power (Cp). With an example flow velocity profile derived from discharge data from a USGS test site while considering the effects of biofouling on the hydrodynamics of the turbine, the SARSA algorithm is compared against a baseline optimal tip speed ratio controller, where it has been found that the proposed RL generates 0.92% more energy than the baseline.
|
|
14:30-14:45, Paper WeB16.5 | |
H Infinity Phase Locking Control for Wave Induced Wake Mixing (I) |
|
van den Berg, Daniel | Delft University of Technology |
De Tavernier, Delphine | Delft University of Technology |
van Wingerden, Jan-Willem | Delft University of Technology |
Keywords: Identification for control, H-infinity control
Abstract: The dynamic induction control wake mixing strategy has the potential to increase the energy yield of floating wind farms. These floating turbines will be subjected to surface waves, caused by the wind, and swell. When dynamic induction control is applied in open-loop, the effect of second-order wave forces and dynamic induction control on the thrust force can be out-of-phase and have destructive interference. In this work, we propose a method to synchronize the dynamic induction control input to the effect of the second-order wave forces. This is achieved by formulating the synchronization problem within an H-Infinity optimization framework and designing a controller that minimizes the difference between the effect of wave-induced thrust variation and thrust variation. Time domain simulations show that synchronization at a desired frequency can be achieved and that the overall performance of the dynamic induction control method can be enhanced.
|
|
14:45-15:00, Paper WeB16.6 | |
Self-Learning Data-Driven Wind Farm Control Strategy Using Field Measurements (I) |
|
Hulsman, Paul | ForWind |
Howland, Michael | Massachusetts Institute of Technology |
Göçmen, Tuhfe | DTU Risø Campus |
Petrović, Vlaho | University of Oldenburg |
Kühn, Martin | University of Oldenburg |
Keywords: Control applications, Iterative learning control, Optimization
Abstract: This paper presents a novel data-driven wind farm control strategy to steer the wake. The approach uses the turbine power output and standard turbine measurement equipment as input, such as the nacelle anemometer and the wind vane. By designing the control strategy based on the measured data, sub-optimal yaw angle estimates from an analytical model can be compensated and inaccuracies induced by the turbine sensors can be corrected. Thereby advancing wake steering controllers to be independent of external sensors, such as a met mast or lidars. Measurements acquired from a wake steering field campaign are used to train the data-driven model and to evaluate the predictions of the model. The proposed data-driven approach highlights a consistent increase in the power gain compared to an analytical approach with a potential improvement ranging from 3.4% up to 8.0%.
|
|
WeB17 |
Dockside 5 |
Cooperative Control |
Regular Session |
Chair: Chen, Lijun | University of Colorado at Boulder |
Co-Chair: Liu, Junwei | Southern University of Science and Technology |
|
13:30-13:45, Paper WeB17.1 | |
Fully Distributed Consensus of Multi-Agent Systems with Improved Minimum Inter-Event Times |
|
Su, Ruchao | Shanghai Jiao Tong University |
Li, Xianwei | Shanghai Jiao Tong University |
Li, Shaoyuan | Shanghai Jiao Tong University |
Keywords: Cooperative control, Control of networks, Distributed control
Abstract: We investigate the consensus problem of multiagent systems with linear dynamics via a fully distributed event-triggered approach. To solve this problem, we present a fully distributed control strategy with an innovative event-triggered mechanism, where a clock variable is introduced to guarantee the existence of strictly positive minimum inter-event times (MIETs). Compared with existing results, the proposed event-triggered mechanism provides more flexibility for the clock variable, improving inter-event times. A positive lower bound of MIETs is derived, and is shown to be larger than those in previous works. The main results are illustrated with a numerical example.
|
|
13:45-14:00, Paper WeB17.2 | |
Dynamic Event-Triggered Control for Multi-Agent Consensus with Relative Output Feedback |
|
Zhan, Sikang | Shanghai Jiao Tong University |
Li, Xianwei | Shanghai Jiao Tong University |
Keywords: Cooperative control, Distributed control, Control of networks
Abstract: This paper studies the consensus problem of linear multi-agent systems (MASs) with relative output measurement on undirected graphs. Output-feedback protocols are designed for both leaderless and leader-follower consensus. Different from [1], dynamic event-triggered sampling strategies are integrated to reduce the communication burdens between agents. Compared with most of the existing results which are based absolute output, the proposed protocols are based on sampled relative output. Finally, a simulation example is provided to illustrate the theoretical results.
|
|
14:00-14:15, Paper WeB17.3 | |
Unbounded Cooperative Pursuit Using a Linearized Safe-Reachable Set |
|
Ouyang, Zikai | Southern University of Science and Technology |
Liu, Junwei | Southern University of Science and Technology |
Lu, Haibo | Peng Cheng Laboratory |
Zhang, Wei | Southern University of Science and Technology |
Keywords: Cooperative control, Game theory, Optimization
Abstract: This paper proposes an online cooperative pursuit strategy for multiple pursuers within an unbounded domain. One of the popular cooperative pursuit strategies is Voronoi-based pursuit, which is designed by minimizing the area of the evader's Voronoi cell. However, this pursuit strategy only works for bounded domains as the Voronoi cell is well-defined in a bounded environment. Instead, our strategy is built upon the minimization of the farthest squared distance between the evader and its safe-reachable set (SRS), which formulates the cooperative pursuit as an online concave optimization problem. To overcome the computational burden arising from solving the concave optimization problem, we linearize the SRS as a convex polygon by sampling vertices on the boundary of SRS, which transforms the concave optimization problem into a more tractable one. Moreover, we show that if the vertex number is large enough, our approach can guarantee a capture of the evader in finite time for cases with non-zero capture radii. Simulation results demonstrate the efficacy and efficiency of our approach.
|
|
14:15-14:30, Paper WeB17.4 | |
ROMA-iQSS: An Objective Alignment Approach Via State-Based Value Learning and ROund-Robin MultiAgent Scheduling |
|
Lin, Chi-Hui | University of Colorado Boulder |
Koh, Joewie J. | University of Colorado Boulder |
Roncone, Alessandro | University of Colorado Boulder |
Chen, Lijun | University of Colorado at Boulder |
Keywords: Cooperative control, Machine learning, Distributed control
Abstract: Effective multi-agent collaboration is imperative for solving complex, distributed problems. In this context, two key challenges must be addressed: first, autonomously identifying optimal objectives for collective outcomes; second, aligning these objectives among agents. Traditional frameworks, often reliant on centralized learning, struggle with scalability and efficiency in large multi-agent systems. To overcome these issues, we introduce a decentralized state-based value learning algorithm that enables agents to independently discover optimal states. Furthermore, we introduce a novel mechanism for multi-agent interaction, wherein less proficient agents follow and adopt policies from more experienced ones, thereby indirectly guiding their learning process. Our theoretical analysis shows that our approach leads decentralized agents to an optimal collective policy. Empirical experiments further demonstrate that our method outperforms existing decentralized state-based and action-based value learning strategies by effectively identifying and aligning optimal objectives.
|
|
14:30-14:45, Paper WeB17.5 | |
MR.CAP: Multi-Robot Joint Control and Planning for Object Transport |
|
Jaafar, Hussein Ali | Toronto Metropolitan University |
Kao, Cheng-Hao | Toronto Metropolitan University |
Saeedi, Sajad | Toronto Metropolitan University |
Keywords: Cooperative control, Robotics, Optimization
Abstract: With the recent influx in demand for multi-robot systems throughout industry and academia, there is an increasing need for faster, robust, and generalizable path planning algorithms. Similarly, given the inherent connection between control algorithms and multi-robot path planners, there is in turn an increased demand for fast, efficient, and robust controllers. We propose a scalable joint path planning and control algorithm for multi-robot systems with constrained behaviours based on factor graph optimization. We demonstrate our algorithm on a series of hardware and simulated experiments. Our algorithm is consistently able to recover from disturbances and avoid obstacles while outperforming state-of-the-art methods in optimization time, path deviation, and inter-robot errors. See the code and supplementary video for experiments.
|
|
14:45-15:00, Paper WeB17.6 | |
Distributed Dual-Layer Adaptive Event-Triggered Formation Tracking for Quadrotor UAVs |
|
Chen, Tianxing | Harbin Institute of Technology, Shenzhen |
Zhang, Hongwei | Harbin Institute of Technology, Shenzhen |
Keywords: Cooperative control, Robust adaptive control, Networked control systems
Abstract: Achieving resource-efficient robust tracking is critical for micro aerial robotics subject to external disturbances. In this paper, a novel distributed dual-layer adaptive event-triggered controller is proposed for cooperative formation tracking under unknown disturbances. Two different triggering sequences are designed to determine the appropriate instants for communication and control updates, thus greatly reducing the update frequency of the controller and communication consumption. Moreover, adaptive event-triggered conditions with heterogeneous parameters are well-designed to cope with unknown disturbances, while improving the flexibility of parameter selection and excluding Zeno behavior. By employing Lyapunov theory, new sufficient conditions for ensuring the feasibility of the proposed algorithm are rigorously derived to eliminate the conservatism. The performance of the proposed algorithm is illustrated through numerical validations of cooperative formation tracking under cluttered nonlinear perturbations, and a detailed comparison with existing algorithms is made to fully demonstrate the robustness and advantage of the proposed algorithm.
|
|
WeB18 |
Dockside 6 |
Stability of Linear Systems |
Regular Session |
Chair: Yedavalli, Rama K. | Ohio State Univ |
Co-Chair: Ito, Hiroshi | Kyushu Institute of Technology |
|
13:30-13:45, Paper WeB18.1 | |
Convex Stability of Interconnections-Free X Shaped Real Square Matrices: New Conditions Using Transformation Allergic Indices and Proper X^{0} Definition |
|
Yedavalli, Rama K. | Ohio State Univ |
Keywords: Stability of linear systems, Linear systems, Linear parameter-varying systems
Abstract: In this paper (with its IP protected content), we introduce the concept of Convex Stability for analyzing the real state variable convergence issue (stability) of any Linear Time Invariant State Space (LTISS) system, x˙ = Ax, x(0) = xo. This is different from the current literature’s Hurwitz Stability concept because the definition of A^0 is different in the conditions we present for the Convex Stability of the matrix A. We propose Indices labeled, Transformation Allergic Indices, (TAIs) as real state variable convergence measures to guarantee the Convex Stability of the given real matrix A. For simpler exposition of this concept, in this paper, we present conditions for the Convex Stability property of a special class of X shaped real, square matrices, making them belonging to matrices with zero interconnections or interconnections-free (in the Qualitative ecological principles language). For such matrices, the proposed conditions of this paper do not require any eigenvalue concept at all. For the special case of the X matrices considered, the proposed conditions of Convex stability point out the misleading conclusions eigenvalue based conditions draw about the real state variable convergence of the same X matrix, thereby concluding that in general, Hurwitz stability conditions of any real matrix A (not just the X type matrices) can never guarantee Convex stability of the same real matrix A.
|
|
13:45-14:00, Paper WeB18.2 | |
Transformation Allergic Index Singularity: A Hidden Premature Instability Unrecognizable by Hurwitz Stable Matrices with Serious Implications to Safety Critical Systems |
|
Yedavalli, Rama K. | Ohio State Univ |
Keywords: Stability of linear systems, Linear systems, Linear parameter-varying systems
Abstract: In this paper, with its IP protected content, we introduce a new approach or philosophy (different from the Hurwitz Stability concept of the current literature) labeled as the Transformation Allergic (TA) Approach for analyzing the real state variable convergence of any real Linear Time Invariant State Space (LTISS) system, x˙ = Ax, x(0) = xo. The proposed TA Approach in turn defines a new concept labeled as Convex Stability of any real nth order real matrix which then hinges on the Convex Stability of a simple 2nd order matrix. The Convex Stability of a 2nd order real matrix is then analyzed by using recently introduced scalars labeled as Transformation Allergic Indices (TAIs). With the help of these TAIs, we identify a hidden instability labeled as Transformation Allergic Index Singularity which happens prematurely prior to the Determinantal Singularity predicted by Hurwitz Stable matrices. Then it is also shown that all Hurwitz Stable matrices turn out to be always Non-Convex. Thus it is emphasized that Non-Convexity itself is to be treated as Instability. Clearly, such premature Instability (Non-Convexity) needs to be considered as highly undesirable in safety critical LTISS systems such as those of autopilots for flight vehicles. Simple tests to identify Non-Convexity in any Hurwitz stable matrix are also presented.
|
|
14:00-14:15, Paper WeB18.3 | |
Distributed Stability Conditions for Interconnected LTI Systems Based on Differential Interconnection Neutral Functions |
|
Kristović, Pietro | The University of Zagreb, Faculty of Mechanical Engineering And |
Jokic, Andrej | University of Zagreb |
Keywords: Stability of linear systems, LMIs, Large-scale systems
Abstract: In this paper we are concerned with stability of interconnected linear time invariant systems. It is shown that stability conditions involving specifically structured Lyapunov function candidates can be decomposed to a set of local and coupled stability-related conditions, without introducing any additional conservatism. Specific feature of the considered Lyapunov functions is that they include (higher order) time derivatives of the state vector. Decomposition of stability conditions is based on the notion of differential interconnection neutral functions, which are conceptually related to interconnection neutral supply functions from the theory of dissipative dynamical systems.
|
|
14:15-14:30, Paper WeB18.4 | |
Stabilization of Almost Periodic Piecewise Linear Systems with Norm-Bounded Uncertainty for Roll-To-Roll Dry Transfer Manufacturing Processes |
|
Martin, Christopher | University of Texas at Austin |
Li, Wei | University of Texas at Austin |
Chen, Dongmei | The University of Texas at Austin |
Keywords: Stability of linear systems, Switched systems, Uncertain systems
Abstract: This paper presents the first stabilization result for almost periodic piecewise linear systems (APPLSs) with both norm-bounded additive modeling uncertainty and dwell-time uncertainty. The technique employs a mixed-mode time-varying Lyapunov function to generate a sequence of controller gains that stabilize the uncertain APPLS. This modelling structure aligns with the R2R dry transfer of patterned two-dimensional (2D) materials, an emerging technology for continuous, chemical-free flexible material and device transfer. Thus, the proposed controller synthesis method provides stabilization guarantees for a novel modeling framework with applications in the expanding realm of advanced 2D device manufacturing.
|
|
14:30-14:45, Paper WeB18.5 | |
A Dissipativity Framework for Input-To-State Stability with Positivity of Dynamical Systems with Interior Equilibria |
|
Ito, Hiroshi | Kyushu Institute of Technology |
Keywords: Stability of nonlinear systems, Lyapunov methods, Large-scale systems
Abstract: This paper proposes a module-based framework for analyzing and designing dynamical systems evolving on positive orthants and having stationary points in their interior. Dissipativity theory is a widely-used concept that allows one to assess a property of a system by aggregating dissipativity of its components. Popular storage functions and supply rates do apply to positive systems, but do not give information about the positivity. This paper employs non-vector space storage functions encompassing logarithmic functions to formulate input-to-state stability (ISS) with positivity guarantee, and shows how its dissipativity characterization can effectively address system interconnections. The developed supply rates and criteria are not only asymmetric to fit non-vector state spaces, but also carry equilibrium information which is inherent in positive systems exhibiting interior equilibria. Mathematical and practical examples illustrate that the proposed framework offers flexibility in applying to various system interconnections.
|
|
14:45-15:00, Paper WeB18.6 | |
Hybrid Feedback Control for Global and Optimal Safe Navigation |
|
Cheniouni, Ishak | Lakehead University |
Berkane, Soulaimane | Université Du Québec En Outaouais |
Tayebi, Abdelhamid | Lakehead University |
Keywords: Stability of hybrid systems, Autonomous robots, Autonomous systems
Abstract: We propose a hybrid feedback control strategy that safely steers a point-mass robot to a target location optimally from all initial conditions in the n-dimensional Euclidean space with a single spherical obstacle. The robot moves straight to the target when it has a clear line-of-sight to the target location. Otherwise, it engages in an optimal obstacle avoidance maneuver via the shortest path inside the cone enclosing the obstacle and having the robot's position as a vertex. The switching strategy that avoids the undesired equilibria, leading to global asymptotic stability (GAS) of the target location, relies on using two appropriately designed virtual destinations, ensuring control continuity and shortest path generation. Simulation results illustrating the effectiveness of the proposed approach are presented.
|
|
WeB19 |
Dockside 7 |
Robust Control I |
Regular Session |
Chair: Ratnam, Elizabeth | The Australian National University |
Co-Chair: Liu, Jun | University of Waterloo |
|
13:30-13:45, Paper WeB19.1 | |
Robust Model Predictive Control for Networked Control Systems with Timing Perturbations |
|
Wang, Renke | George Mason University |
Yao, Ningshi | George Mason University |
Keywords: Robust control, Delay systems, Networked control systems
Abstract: Our earlier work established a contention-resolving model predictive control (or MPC) framework to co-design priorities and control for networked control systems (or NCSs), assuming the system models are perfect. However, due to the differences between a model and the real world, there are inevitable perturbations. In this paper, we focus on perturbations of the predicted timings, which are rarely addressed in the literature, and present a robust MPC design to achieve guaranteed performance under such timing perturbations. We propose a state-feedback correction term with dynamic gain, added to the nominal contention-resolving MPC policy, which can eliminate the state deviation between perturbed and nominal systems at the predicted task completion time. Additionally, we identified the largest tolerable timing perturbation for such a robust MPC design. Under the tolerable timing perturbations, we analytically proved that the state deviation can be bounded by a forward invariant set (or FIS) for all time. The robust MPC policy can be then designed based on the FIS, such that perturbed system trajectories are guaranteed to satisfy all the original state and control constraints. The effectiveness of our proposed method is verified through simulation.
|
|
13:45-14:00, Paper WeB19.2 | |
Data-Driven Superstabilization of Linear Systems under Quantization |
|
Miller, Jared | ETH Zurich |
Zheng, Jian | Northeastern University |
Sznaier, Mario | Northeastern University |
Hixenbaugh, Chris | Naval Undersea Warfare Center |
Keywords: Robust control, Linear systems, Optimization
Abstract: This paper focuses on the stabilization and regulation of linear systems affected by quantization in state-transition data and actuated input. The observed data are composed of tuples of current state, input, and the next state's interval ranges based on sensor quantization. Using an established characterization of input-logarithmically-quantized stabilization based on robustness to sector-bounded uncertainty, we formulate a nonconservative infinite-dimensional linear program that enforces superstabilization of all possible consistent systems under assumed priors. We solve this problem by posing a pair of exponentially-scaling linear programs, and demonstrate the success of our method on example quantized systems.
|
|
14:00-14:15, Paper WeB19.3 | |
Achieving Optimal Performance with Data-Driven Frequency-Based Control Synthesis Methods |
|
Schuchert, Philippe | EPFL |
Karimi, Alireza | EPFL |
Keywords: Robust control, Optimal control, Optimization
Abstract: Frequency Response Function (FRF)-based control synthesis methods for Linear Time-Invariant (LTI) systems have been widely used in control theory and industry. Recently, there has been renewed interest in these methods, employing numerical optimization tools to enhance their performance. In this letter, we analyze the convergence properties of two data-driven frequencybased H2 and H∞ synthesis methods (Karimi and Kammer, 2017), (Schuchert et al., 2024) as the controller order increases. We demonstrate that, in the limit, an optimal controller can be designed using only the FRF, providing an estimate of the best achievable performance or enabling the design of an optimal high-order controller.
|
|
14:15-14:30, Paper WeB19.4 | |
Design and Stability of Angle Based Feedback Control in Power Systems: A Negative-Imaginary Approach |
|
Chen, Yijun | University of Sydney |
Petersen, Ian R. | Australian National University |
Ratnam, Elizabeth | The Australian National University |
Keywords: Power systems, Robust control, Stability of nonlinear systems
Abstract: This paper considers a power transmission network model characterized by interconnected nonlinear swing dynamics on generator buses. At the steady state, frequencies across different buses synchronize to a common nominal value such as 50Hz or 60Hz, and power flows on transmission lines are at steady-state values. We assume that fast measurements of generator rotor angles are available. Our approach to frequency and angle control centers on equipping generator buses with large-scale batteries that are controllable on a fast timescale. We link angle based feedback linearization control with negative-imaginary systems theory. Angle based feedback controllers are designed for these large-scale batteries and can be implemented in a distributed manner with local information. Our analysis demonstrates the internal stability of the interconnection between the power transmission network and the angle based feedback controllers. This internal stability underscores the benefits of achieving frequency synchronization and preserving steady-state power flows within the network through the use of feedback controllers. This will enable transmission lines to be operated at maximum power capacity since stability robustness is ensured by the use of feedback controllers rather than conservative criteria such as the equal area criterion. Simulation results illustrate our results.
|
|
14:30-14:45, Paper WeB19.5 | |
Distributionally Robust Path Integral Control |
|
Park, Hyuk | University of Illinois Urbana-Champaign |
Zhou, Duo | University of Illinois Urbana Champaign |
Hanasusanto, Grani A. | University of Illinois Urbana-Champaign |
Tanaka, Takashi | University of Texas at Austin |
Keywords: Robust control, Stochastic optimal control, Optimization
Abstract: We consider a continuous-time continuous-space stochastic optimal control problem, where the controller lacks exact knowledge of the underlying diffusion process, relying instead on a finite set of historical disturbance trajectories. In situations where data collection is limited, the controller synthesized from empirical data may exhibit poor performance. To address this issue, we introduce a novel approach named Distributionally Robust Path Integral (DRPI). The proposed method employs distributionally robust optimization (DRO) to robustify the resulting policy against the unknown diffusion process. Notably, the DRPI scheme shows similarities with risk-sensitive control, which enables us to utilize the path integral control (PIC) framework as an efficient solution scheme. We derive theoretical performance guarantees for the DRPI scheme, which closely aligns with selecting a risk parameter in risk-sensitive control. We validate the efficacy of our scheme and showcase its superiority when compared to risk-neutral and risk-averse PIC policies in the absence of the true diffusion process.
|
|
14:45-15:00, Paper WeB19.6 | |
Safe Tracking Control of Discrete-Time Nonlinear Systems Using Backward Reachable Sets |
|
Serry, Mohamed | University of Waterloo |
Yang, Liren | University of Michigan |
Ozay, Necmiye | Univ. of Michigan |
Liu, Jun | University of Waterloo |
Keywords: Robust control, Uncertain systems, Formal verification/synthesis
Abstract: Tracking controllers are often integrated into control systems to ensure their robustness against uncertainties and disturbances during trajectory following maneuvers, where design methods in the literature either lack formal guarantees or suffer from conservatism. In this paper, we propose a new tracking control approach for discrete-time nonlinear uncertain systems using set-based computations. In particular, we compute zonotopic backward reachable sets along prescribed nominal trajectories, and utilize such sets to synthesize tracking controllers that ensure safety and reachability in the presence of non-convex input/state constraints and uncertainties. We illustrate our approach through two numerical examples (Dubin's car and planar quadrotor).
|
|
WeB20 |
Dockside 8 |
Kalman Filtering |
Regular Session |
Chair: Molloy, Timothy L. | Australian National University |
Co-Chair: Chen, Tongwen | University of Alberta |
|
13:30-13:45, Paper WeB20.1 | |
Data-Driven Stealthy Attacks on Remote State Estimation with Sliding-Window Anomaly Detectors |
|
Guo, Ziyi | University of Alberta |
Zhou, Jing | University of Alberta |
Chen, Tongwen | University of Alberta |
Keywords: Kalman filtering, Estimation, Communication networks
Abstract: This paper proposes a data-driven attack strategy capable of circumventing sliding-window chi-square detectors in remote state estimation. The developed strategy is designed to operate based on only the intercepted output data from the plants, estimators, and anomaly detectors, without the knowledge of system parameters. Moreover, in scenarios where sufficient data has been collected before the occurrence of attacks, the proposed strategy exhibits optimality among all feasible attack policies using the same historical information. Through simulations, the effectiveness of the data-driven strategy is verified, with the stealthiness sustained by consistent empirical alarm rates.
|
|
13:45-14:00, Paper WeB20.2 | |
State and Parameter Estimation of Non-Ideal Batch Reactors with Heel Masses |
|
Crouse, Steven | Georgia Institute of Technology |
Prasad, Rupanjali | Georgia Institute of Technology |
Rousseau, Ronald | Georgia Institute of Technology |
Grover, Martha | Georgia Institute of Technology |
Keywords: Kalman filtering, Estimation, Fault detection
Abstract: In chemical batch reactors, proper mixing is important for consistent process and product quality. Heel masses (mass that is left behind from batch to batch) can be a feature of process design, or may occur inadvertently. The current practice at the Hanford nuclear waste processing site is to include heel masses (up to 30% by volume) to reduce batch to batch variation of chemical waste that will be vitrified. To monitor this process, the proposed process uses laboratory measurements with a mass balance to propagate composition estimates from tank to tank. Incorporating in-line monitoring tools can improve the ability to detect faults when they occur in processing, particularly faults concerning tank transfer and heel masses. A simulation study is performed based on reported data on expected tank concentrations, sensor accuracies, and analytical laboratory accuracies. Observability of model states and parameters are shown. By incorporating in-line sensors, the mean absolute prediction error for the heel mass of kyanite, an insoluble solid, is improved by a factor of 3.88 over mass-balance model approaches in a simulation study including a fault in heel mass.
|
|
14:00-14:15, Paper WeB20.3 | |
A Novel Variational Bayesian Adaptive Kalman Filter for Systems with Unknown State-Dependent Noise Covariance Matrices |
|
Uzzaman, Nahid | Oklahoma State University |
Bai, He | Oklahoma State University |
Keywords: Kalman filtering, Machine learning, Estimation
Abstract: We consider state estimation for a dynamical system that has unknown state-dependent dynamic process and measurement noise covariance matrices. When the noise covariances are state-dependent the typical Kalman filter (KF) fails to accurately estimate the states of the system. To estimate the states of such a system, we model the covariance matrices via the Wishart process and propose a novel variational Bayesian adaptive Kalman filter (VB-AKF). The proposed VB-AKF combines the variational Bayesian inference of the Wishart process with the KF. The resulting VB-AKF can estimate the states of the system together with the state-dependent dynamic process and measurement noise covariances. Through simulations, we show that the developed VB-AKF is effective and achieves satisfactory performance.
|
|
14:15-14:30, Paper WeB20.4 | |
Computationally Efficient Implementation of the Weighted Kalman Filter for Quadratic Systems |
|
Rotondo, Damiano | University of Stavanger |
Witczak, Marcin | University of Zielona Gora |
Seybold, Lothar | RAFI GmbH & Co. KG |
Keywords: Kalman filtering, Observers for nonlinear systems, Computational methods
Abstract: The weighted Kalman filter (WKF) can be perceived as a variant of the extended Kalman filter (EKF), which incorporates the so-called weighted linearisation. The price behind such an extension is expressed by an increased computational complexity, which aims at improving the overall convergence of the filter. Owing to the increased complexity, the WKF is in general not suitable for high-order nonlinear systems. To overcome this difficulty, the objective of this paper is to show that a closed-form expression of the WKF can be obtained for a specific class of nonlinear systems, i.e., those characterised by a quadratic state equation. This closed-form of the WKF scales well with the order of the system, enabling the application of the WKF to high-order nonlinear quadratic systems, as exemplified in the final part of the paper using a comprehensive Monte Carlo analysis applied to a large number of random systems.
|
|
14:30-14:45, Paper WeB20.5 | |
Two-Channel Extended Kalman Filtering with Intermittent Measurements |
|
Maer, Vicu-Mihalis | Technical University of Cluj-Napoca |
Lendek, Zsofia | Technical University of Cluj-Napoca |
Pirje, Stefan | Technical University of Cluj-Napoca |
Tolic, Domagoj | Rochester Institute of Technology, Croatia |
Đuraš, Antun | University of Dubrovnik |
Prkačin, Vicko | University of Dubrovnik |
Palunko, Ivana | University of Dubrovnik |
Busoniu, Lucian | Technical University of Cluj-Napoca |
Keywords: Kalman filtering, Sensor networks, Autonomous robots
Abstract: We consider two nonlinear state estimation problems in a setting where an extended Kalman filter receives measurements from two sets of sensors via two channels (2C). In the stochastic-2C problem, the channels drop measurements stochastically, whereas in 2C scheduling, the estimator chooses when to read each channel. In the first problem, we generalize linear-case 2C analysis to obtain -- for a given pair of channel arrival rates -- boundedness conditions for the trace of the error covariance, as well as a worst-case upper bound. For scheduling, an optimization problem is solved to find arrival rates that balance low channel usage with low trace bounds, and channels are read deterministically with the expected periods corresponding to these arrival rates. We validate both solutions in simulations for linear and nonlinear dynamics; as well as in a real experiment with an underwater robot whose position is being intermittently found in a UAV camera image.
|
|
14:45-15:00, Paper WeB20.6 | |
Extended Kalman Filtering for Recursive Online Discrete-Time Inverse Optimal Control |
|
Zhao, Tian | The Australian National University |
Molloy, Timothy L. | Australian National University |
Keywords: Optimal control, Estimation, Filtering
Abstract: We formulate the discrete-time inverse optimal control problem of inferring unknown parameters in the objective function of an optimal control problem from measurements of optimal states and controls as a nonlinear filtering problem. This formulation enables us to propose a novel extended Kalman filter (EKF) for solving inverse optimal control problems in a computationally efficient recursive online manner that requires only a single pass through the measurement data. Importantly, we show that the Jacobians required to implement our EKF can be computed efficiently by exploiting recent Pontryagin differentiable programming results, and that our consideration of an EKF enables the development of first-of-their-kind theoretical error guarantees for online inverse optimal control with noisy incomplete measurements. Our proposed EKF is shown to be significantly faster than an alternative unscented Kalman filter-based approach.
|
|
WeB21 |
Pier 4 |
Linear Systems |
Regular Session |
Chair: Drummond, Ross | University of Sheffield |
Co-Chair: Jokic, Andrej | University of Zagreb |
|
13:30-13:45, Paper WeB21.1 | |
Data-Driven State-Feedback Controller Synthesis for Dissipativity: A Dualization-Based Approach |
|
Kristović, Pietro | The University of Zagreb, Faculty of Mechanical Engineering And |
Jokic, Andrej | University of Zagreb |
Keywords: Linear systems, LMIs, H-infinity control
Abstract: In this paper we propose a non-conservative static state-feedback controller synthesis method for discrete-time linear time-invariant systems. The synthesis goal is to render a closed-loop system dissipative with respect to a given generic unstructured quadratic supply function, while the system dynamics is partially represented by noisy state-input data. While the same problem has already been considered in the literature, the main novelty is the solution approach which is based on the dualization lemma to obtain solvable linear matrix inequalities for controller synthesis.
|
|
13:45-14:00, Paper WeB21.2 | |
Formula for Estimating the Frequency Response of LTI Systems from Noisy Finite-Length Datasets |
|
Ossareh, Hamid | University of Vermont |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Keywords: Linear systems, Identification, Subspace methods
Abstract: In this paper, we revisit the classical problem of estimating the frequency response of an LTI system from noisy, non-periodic input-output data. Existing solutions fall into two categories: indirect methods, which compute the frequency response using identified models, and direct methods, which estimate the response directly from data. Direct methods bypass system identification, but have challenges when applied to noisy, finite-length datasets with unknown initial conditions. This paper proposes a new direct method that addresses these challenges, and provides an explicit formula for computing the frequency response. To develop this method, the paper leverages ideas from behavioral system theory and poses the problem as an optimization problem, whose objective is to minimize the projection of the solution onto the nullspace of an input-output data matrix. The paper also offers an alternative derivation of the formula, based on identifying an ARX model, thereby bridging the gap with classical indirect approaches. The proposed method is applied to experimental data collected from a DC motor, where we show that the proposed method outperforms other direct approaches based on Fourier transforms and low-rank approximations, and performs equally as good as direct subspace identification methods, even though no model class is prescribed.
|
|
14:00-14:15, Paper WeB21.3 | |
Externally Positive Linear Systems from Transfer Function Properties |
|
Drummond, Ross | University of Sheffield |
Turner, Matthew C. | University of Southampton |
Keywords: Linear systems
Abstract: The characterisation of single-input-single-output externally positive linear systems is considered. A complete characterisation of the class of externally positive second and a class of underdamped third order systems is given and connections to negative-imaginary systems are highlighted. It is shown that negative-imaginary systems have non-negative step responses, leading to a condition for external positivity based on negative imaginary systems theory. Finally, a counter-example is introduced for a recently developed numerical test for external positivity based upon linear matrix inequalities. These results extend the class of system for which external positivity can be verified, facilitating large-scale control and less conservative absolute stability analysis.
|
|
14:15-14:30, Paper WeB21.4 | |
Can Model-Free Controllers for Complex Systems Stabilize and Provide Satisfactory Response? |
|
Narendra, Kumpati S. | Yale Univ |
George, Koshy | Gandhi Institute of Technology and Management (GITAM) |
Keywords: Linear systems, Modeling, Adaptive control
Abstract: In the history of control theory, many methods have been proposed to simplify the procedure for determining the control inputs for complex dynamical systems. The most recent one of that series is “Model Free Control” which is very popular at the present time. The first three sections of the paper deal with these topics to provide the appropriate background to evaluate the main contribution of the paper contained in Section IV. This includes simulation studies on the following three dynamical systems: i) The roll stabilization of an aircraft, ii) The speed control of a motor, and iii) The stabilization of the longitudinal dynamics of an aircraft. Both model-free control and model-based control were attempted in the above studies. While stability of the overall system and satisfaction of design specifications could be assured using model-based control, neither one was possible with model-free control. The results raise many questions which need to be addressed by those advocating model-free control. In particular, the limitations of the approach need to be clearly stated.
|
|
14:30-14:45, Paper WeB21.5 | |
On Formalisation of Martin Distance for Linear Dynamical Systems |
|
Sinha, Subhrajit | Pacific Northwest National Laboratory |
Nandanoori, Sai Pushpak | Pacific Northwest National Laboratory |
Huang, Bowen | PNNL |
Ramachandran, Thiagarajan | Pacific Northwest National Laboratory |
Bakker, Craig | Pacific Northwest National Laboratory |
Keywords: Linear systems
Abstract: The Martin distance was first developed to compare Autoregressive–Moving-Average (ARMA) models and has since been extended to compare linear dynamical systems (LDS). In this paper, we formalise the concept of Martin distance and put it on a firm mathematical footing. In particular, we prove that the Martin distance is not a metric on the space of LDSs. Furthermore, we show that unlike ARMA processes, where one can use the Martin distance to compare LDSs with different dimensions, this is not the case for LDSs. Moreover, we show that even on the space of LDSs with n states, p inputs, and q outputs, the Martin distance is only a pseudo-metric. We use this to construct a full-fledged metric on an appropriate quotient space. Finally, we extend the Martin distance to compare control-affine nonlinear systems.
|
|
WeC01 |
Metro E/C |
Machine Learning II |
Regular Session |
Chair: Xu, Zeyuan | National University of Singapore |
Co-Chair: Jin, Ming | Virginia Tech |
|
15:30-15:45, Paper WeC01.1 | |
Is Data All That Matters? the Role of Control Frequency for Learning-Based Sampled-Data Control of Uncertain Systems |
|
Römer, Ralf | Technical University of Munich |
Brunke, Lukas | University of Toronto |
Zhou, Siqi | University of Toronto |
Schoellig, Angela P | Technical University of Munich & University of Toronto |
Keywords: Machine learning, Uncertain systems, Robotics
Abstract: Learning models or control policies from data has become a powerful tool to improve the performance of uncertain systems. While a strong focus has been placed on increasing the amount and quality of data to improve performance, data can never fully eliminate uncertainty, making feedback necessary to ensure stability and performance. We show that the control frequency at which the input is recalculated is a crucial design parameter, yet it has hardly been considered before. We address this gap by combining probabilistic model learning and sampled-data control. We use Gaussian processes (GPs) to learn a continuous-time model and compute a corresponding discrete-time controller. The result is an uncertain sampled-data control system, for which we derive robust stability conditions. We formulate semidefinite programs to compute the minimum control frequency required for stability and to optimize perfor- mance. As a result, our approach enables us to study the effect of both control frequency and data on stability and closed-loop performance. We show in numerical simulations of a quadrotor that performance can be improved by increasing either the amount of data or the control frequency, and that we can trade off one for the other. For example, by increasing the control frequency by 33%, we can reduce the number of data points by half while still achieving similar performance.
|
|
15:45-16:00, Paper WeC01.2 | |
Federated Learning-Based Distributed Model Predictive Control of Nonlinear Systems |
|
Xu, Zeyuan | National University of Singapore |
Wu, Zhe | National University of Singapore |
Keywords: Machine learning, Predictive control for nonlinear systems, Chemical process control
Abstract: This work develops a federated learning-based distributed model predictive control (FL-DMPC) method for nonlinear systems with multiple subsystems to address the cybersecurity issue of data transmission among subsystems and heterogeneity issue due to non-independent and identically distributed data among subsystems. Specifically, a novel FL framework is proposed to aggregate submodels into a global FL model with a sufficiently small modeling error with provable convergence properties derived based on iteration theory. Subsequently, by incorporating the FL model into a DMPC scheme, an FL-DMPC method is presented to achieve the expected performance of nonlinear systems. Finally, a chemical process network is adopted to demonstrate the effectiveness of the proposed FL-DMPC method.
|
|
16:00-16:15, Paper WeC01.3 | |
Optimization Solution Functions As Deterministic Policies for Offline Reinforcement Learning |
|
Khattar, Vanshaj | Virginia Polytechnic Institute and State University |
Jin, Ming | Virginia Tech |
Keywords: Machine learning, Predictive control for nonlinear systems, Statistical learning
Abstract: Offline reinforcement learning (RL) is a promising approach for many control applications but faces challenges such as limited data coverage and value function overestimation. In this paper, we propose an implicit actor-critic (iAC) framework that employs optimization solution functions as a deterministic policy (actor) and a monotone function over the optimal value of optimization as a critic. By encoding “optimality” in the actor policy, we show that the learned policies are robust to the suboptimality of the learned actor parameters via the exponentially decaying sensitivity (EDS) property. We obtain performance guarantees for the proposed iAC framework and show its benefits over general function approximation schemes. Finally, we validate the proposed framework on two real-world applications and show a significant improvement over state-of-the-art (SOTA) offline RL methods.
|
|
16:15-16:30, Paper WeC01.4 | |
A Practical Reinforcement Learning (RL) Controller Design for Nonlinear Systems |
|
Hassanpour, Hesam | McMaster University |
Mhaskar, Prashant | McMaster University |
Corbett, Brandon | McMaster University |
Keywords: Machine learning, Process Control
Abstract: This paper presents a practically implementable reinforcement learning (RL) approach for process control applications. Standard model-free RL approaches are not applicable in practice because the learning process of an RL agent requires random exploration of state and action spaces, which can compromise process safety and economic objectives. To tackle this issue, an offline training strategy is proposed by leveraging existing model predictive control (MPC) to pre-train the RL agent. MPC actions, calculated offline by solving the MPC optimization problem based on a wide range of initial conditions, along with its objective function are utilized to pre-train the actor and critics of the RL agent. The pre-trained RL controller, with similar performance to the MPC performance, is then utilized for online control for further training. The efficacy of the proposed RL controller to improve the tracking performance is demonstrated using a simulation example for a pH neutralization process.
|
|
16:30-16:45, Paper WeC01.5 | |
Promises of Deep Kernel Learning for Control Synthesis |
|
Reed, Robert | University of Colorado Boulder |
Laurenti, Luca | TU Delft |
Lahijanian, Morteza | University of Colorado Boulder |
Keywords: Machine learning, Robust control, Stochastic systems
Abstract: Deep Kernel Learning (DKL) combines the representational power of neural networks with the uncertainty quantification of Gaussian Processes. Hence, it is potentially a promising tool to learn and control complex dynamical systems. In this work, we develop a scalable abstraction-based framework that enables the use of DKL for control synthesis of stochastic dynamical systems against complex specifications. Specifically, we consider temporal logic specifications and create an end-to-end framework that uses DKL to learn an unknown system from data and formally abstracts the DKL model into an interval Markov decision process to perform control synthesis with correctness guarantees. Furthermore, we identify a deep architecture that enables accurate learning and efficient abstraction computation. The effectiveness of our approach is illustrated on various benchmarks, including a 5-D nonlinear stochastic system, showing how control synthesis with DKL can substantially outperform state-of-the-art competitive methods.
|
|
WeC02 |
Harbour |
Network Control Systems II |
Regular Session |
Chair: Rojas, Alejandro J. | Universidad De Concepción |
Co-Chair: Davoodi, Mohammadreza | University of Georgia |
|
15:30-15:45, Paper WeC02.1 | |
Nonminimum Phase Zeros Effect on the Signal-To-Noise Ratio Channel Input Constraint in Continuous Time |
|
Rojas, Alejandro J. | Universidad De Concepción |
Keywords: Networked control systems, Linear systems, Optimal control
Abstract: It is well known that the presence of nonminimum phase (NMP) zeros increases already existing fundamental limitations. In this work we consider the minimal Signal-to-Noise Ratio (SNR) as the fundamental limitation in a control loop and, for an additive white Gaussian noise (AWGN) channel, propose a 2 degree of freedom (2DOF) controller to avoid the increase on the minimal SNR due to the presence of NMP zeros for a continuous time linear time invariant (LTI) plant model. We then characterize the presence of NMP zeros in the controller achieving the minimal SNR, when the plant model is minimum phase, but it is subject to a time delay. These controller NMP zeros then reveal that, if correctly placed to the same values, plant NMP zeros can result in a minimal SNR without the effect of NMP zeros, even in a 1 DOF controller strategy.
|
|
15:45-16:00, Paper WeC02.2 | |
Multi-Event-Triggered Control with Reduced Packet Sizes for Quantized Discrete-Time Linear Systems |
|
Batmani, Yazdan | University of Kurdistan |
Karimi, Zahra | University of Kurdistan |
Davoodi, Mohammadreza | The University of Memphis |
Keywords: Networked control systems, Linear systems
Abstract: This paper proposes a multi-event-triggering policy with reduced packet sizes for networked and quantized discrete-time linear systems. Unlike conventional event-triggered state feedback control methods in which all the system states must be sent to the controller at any triggering times, the proposed method samples each state based on its dynamical behavior and leads to properly-tuned inter-event intervals. Towards this aim, proceeding by emulation and in the absence of any event-triggered sampling, a state feedback control law is first designed. Then, the obtained closed-loop system is taken into account as a multi-time scale system based on its real Jordan form. Next, even-triggering mechanisms (ETMs) are designed for the Jordan blocks by considering their transient behavior, which result in different sequences of triggering times for the transmission of the states. This makes it possible to manage both the size and number of packets sent to the controller. The ultimate boundedness of the closed-loop system with state quantization is proved through precise analysis. To showcase the validity of the theoretical results, a simulation study is conducted where the proposed controller is applied to an inverted pendulum system.
|
|
16:00-16:15, Paper WeC02.3 | |
Risk Assessment of Multi-Agent System under Denial-Of-Service Cyberattacks Using Reachable Set Synthesis |
|
Cho, Minhyun | Purdue University |
Hwang, Sounghwan | Purdue University |
Hwang, Inseok | Purdue University |
Keywords: Networked control systems, LMIs, Lyapunov methods
Abstract: Multi-agent systems (MASs) have vulnerabilities to various types of cyberattacks disrupting inter-agent communication. To assess the potential risk associated with these cyberattacks, this paper proposes a proactive risk assessment method using reachable set synthesis. Denial-of-Service (DoS) attacks, where adversaries can disrupt communication by a sequence of link disconnections (dynamic alterations), are specifically considered. Our method employs the calculation of reachable sets using Lyapunov functions and linear matrix inequalities (LMIs) derived from them. The proposed method can evaluate the risk of DoS attacks for both individual agents and the entire system in two levels by computing over-approximated ellipsoidal reachable sets. To demonstrate the applicability of our method, we provide an illustrative example involving a leader-follower MAS performing formation control in an adversarial environment with scattered obstacles.
|
|
16:15-16:30, Paper WeC02.4 | |
Second-Order Heterogeneous Multi-Agent Target Tracking without Relative Velocities |
|
Nino, Cristian F. | University of Florida |
Patil, Omkar Sudhir | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Networked control systems, Stability of nonlinear systems, Observers for nonlinear systems
Abstract: The multi-agent target tracking problem has received growing interest in the robotics and controls community in recent years. In particular, distributed target tracking is an abstraction for many potential applications. However, typical results assume that full state information is available. This paper addresses the multi-agent target tracking problem, where only the relative distance between neighbors is known. To yield this result, a novel distributed observer is designed that employs an auxiliary distributed filter. The distributed observer achieves network-to-target regulation by enabling the network of agents to estimate the relative velocities of all agents and the target. The distributed filter/observer structure is motivated by a Lyapunov-based stability analysis, which is provided to ensure that all agents are exponentially regulated to a neighborhood of the target. Comparative simulations are provided to demonstrate the performance of the developed method. The simulation results indicate that six agents modeled by an unknown heterogeneous second-order system can successfully track a target agent. The developed method provides approximately 90% and 30% improvements in the RMS tracking and velocity estimation errors, respectively, when compared to a baseline.
|
|
16:30-16:45, Paper WeC02.5 | |
Output-Feedback Stabilization of Stochastically-Sampled Networked Control System under Packet Dropouts |
|
Basu, Himadri | University of California Santa Cruz |
Fiacchini, Mirko | GIPSA-Lab, UMR CNRS 5216 |
Ferrante, Francesco | Universita Degli Studi Di Perugia |
Gomes da Silva Jr, Joao Manoel | Universidade Federal Do Rio Grande Do Sul |
Keywords: Networked control systems, Sampled-data control, Stochastic optimal control
Abstract: This letter deals with the mean-square output feedback stabilization of sampled-data linear time-invariant (LTI) systems in the presence of sporadically sampled measurement streams and packet dropouts. To address the problem we propose a control structure composed of: a) a hybrid observer, which resets with the arrival of a new measurement sample; and b) a feedback of the latest estimated state and the value of the control signal computed in the previous sampling instant, generating the control to be applied to the continuous-time plant. The control signal is kept constant, by means of a zero-order hold, between two successive sampling instants. The overall closedloop system exhibits a deterministic behavior except for jumps that occur at random sampling times resulting in a piecewise deterministic Markov process (PDMP). Using Lyapunov-based stability analysis for stochastic systems, we determine sufficient conditions for mean exponential stability (MES) of the overall closed-loop system, which are turned into Linear Matrix Inequalities (LMI) for the design of the proposed hybrid stabilizer. Finally, the effectiveness of the theoretical results is verified by an illustrative example.
|
|
WeC03 |
Frontenac |
Autonomous Robots II |
Regular Session |
Chair: Seo, Joohwan | University of California, Berkeley |
Co-Chair: Coogan, Samuel | Georgia Institute of Technology |
|
15:30-15:45, Paper WeC03.1 | |
Optimal Path Planning of a Solar-Powered Unmanned Ground Vehicle in an Unknown Solar Environment with Multi-Objective Optimization |
|
Strebe, Luke | Los Alamos National Lab |
Lee, Kooktae | New Mexico Tech |
Keywords: Autonomous robots, Robotics, Optimization algorithms
Abstract: Solar Powered Unmanned Ground Vehicles (SPUGV) can be used for long-term environmental monitoring of an area, however, there is limited research regarding battery maximizing path planning in an unknown solar environment. In this paper, a novel approach for optimal path planning in a completely unknown solar environment is investigated using Multi-Objective Optimization (MOO). The Feasibility Space Path Planner (FSPP) is proposed to update the path of the SPUGV in a receding-horizon fashion as it gains information about solar availability in the environment from onboard sensors. The use of MOO in path planning provides optimal path planning for maximizing the final battery of the SPUGV. The path that provides the maximum battery while also minimizing the distance toward the goal is chosen, and re-evaluated at each time step the SPUGV moves. Therefore, the SPUGV will create battery maximization optimal trajectories without any prior information about an area. Simulation results concluded drastically improved final battery values using the FSPP algorithm compared to straight path trajectories towards a goal.
|
|
15:45-16:00, Paper WeC03.2 | |
Cooperative 3-D Active Multi-Robot Multi-Target Tracking |
|
Xu, Jie | University of California, Riverside |
Zhu, Pengxiang | University of California, Riverside |
Ren, Wei | University of California, Riverside |
Keywords: Autonomous robots, Sensor fusion, Cooperative control
Abstract: In this paper, we present a novel algorithm for the 3-D active multi-robot multi-target tracking problem in a distributed manner. For a team of robots equipped with sensors, we design the estimation framework by integrating a distributed target state estimation algorithm with the cooperative visual inertial odometry (CVIO) algorithm. This approach allows moving robots to continuously update their localization and target estimates by using available measurements in their neighborhoods. Motion planning for the robots is tackled through a distributed optimization paradigm as a function of the estimation results. In the distributed optimization setting, all robots cooperate to find their control actions using local information. Cost functions are defined using a differentiable field of view concept for the targets’ uncertainty reduction, alongside potential functions for collision avoidance and connectivity maintenance.
|
|
16:00-16:15, Paper WeC03.3 | |
Energy Optimal Obstacle Avoidance Motion Planning for Wheeled Mobile Robots |
|
Kim, Youngjin | University at Buffalo |
Singh, Tarunraj | State Univ. of New York at Buffalo |
Keywords: Optimal control, Autonomous robots, Nonholonomic systems
Abstract: Energy optimal motion planning of a wheeled mobile robot with a circular obstacle is addressed in this article. The trajectory planning problem is posed as an optimal control problem with state inequality constraint where the benchmark L2 norm of the control inputs is considered as the cost function. The necessary conditions for optimality are formally derived using the variational principle and the fact that the control inputs remain continuous at the time instant of entering the constraint boundary is proven. The necessary conditions permit decomposing the problem in two intervals: prior to and after activation of the constraint. Parametric studies are completed to study the impact of the size of the obstacle of the optimal trajectory of the wheeled mobile robot.
|
|
16:15-16:30, Paper WeC03.4 | |
Motion Planning for Autonomous Vehicles: When Model Predictive Control Meets Ensemble Kalman Smoothing |
|
Askari, Iman | University of Kansas |
Wang, Yebin | Mitsubishi Electric Research Labs |
Deshpande, Vedang M. | Mitsubishi Electric Research Laboratories |
Fang, Huazhen | University of Kansas |
Keywords: Predictive control for nonlinear systems, Autonomous robots, Estimation
Abstract: Safe and efficient motion planning is of fundamental importance for autonomous vehicles. This paper investigates motion planning based on nonlinear model predictive control (NMPC) over a neural network vehicle model. We aim to overcome the high computational costs that arise in NMPC of the neural network model due to the highly nonlinear nonconvex optimization. In a departure from numerical optimization solutions, we reformulate the problem of NMPC-based motion planning as a Bayesian estimation problem, which seeks to infer optimal planning decisions from planning objectives. Then, we use a sequential ensemble Kalman smoother to accomplish the estimation task, exploiting its high computational efficiency for complex nonlinear systems. The simulation results show an improvement in computational speed by orders of magnitude, indicating the potential of the proposed approach for practical motion planning.
|
|
16:30-16:45, Paper WeC03.5 | |
A Comparison between Lie Group and Lie Algebra Based Potential Functions for Geometric Impedance Control |
|
Seo, Joohwan | University of California, Berkeley |
Potu Surya Prakash, Nikhil | UC BERKELEY |
Choi, Jongeun | Yonsei University |
Horowitz, Roberto | Univ. of California at Berkeley |
Keywords: Robotics, Autonomous robots, Algebraic/geometric methods
Abstract: In this paper, a comparison analysis between geometric impedance controls (GICs) derived from two different potential functions on SE(3) for robotic manipulators is presented. The first potential function is defined on the Lie group, utilizing the Frobenius norm of the configuration error matrix. The second potential function is defined utilizing the Lie algebra, i.e., log-map of the configuration error. Using a differential geometric approach, the detailed derivation of the distance metric and potential function on SE(3) is introduced. The GIC laws are respectively derived from the two potential functions, followed by extensive comparison analyses. In the qualitative analysis, the properties of the error function and control laws are analyzed, while the performances of the controllers are quantitatively compared using numerical simulation.
|
|
16:45-17:00, Paper WeC03.6 | |
Local-Global Interval MDPs for Efficient Motion Planning with Learnable Uncertainty |
|
Jiang, Jesse | Georgia Institute of Technology |
Zhao, Ye | Georgia Tech |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Markov processes, Formal verification/synthesis, Autonomous robots
Abstract: We study the problem of computationally efficient control synthesis for Interval Markov Decision Processes (IMDPs), that is, MDPs with interval uncertainty on the transition probabilities, against tasks specified in linear temporal logic. To address the scalability challenge when synthesizing this control policy in a holistic way, we propose decomposing the monolithic global IMDP into a collection of interconnected local IMDPs. We focus on the problem of robotic motion planning. Specifically, we assume a setting in which the transition probabilities can be learned and their interval uncertainty reduced by observing the dynamics of the system at runtime. This creates an objective of exploration to ensure that the planning task can be completed with sufficient probability of success. We perform decoupled exploration and learning on the local IMDPs and then combine local control policies to guarantee global task satisfaction. In a simulation-based case study, we show that, compared to existing approaches, our proposed decomposition leads to faster learning and satisfaction of the planning task and provides a feasible controller when other methods are infeasible.
|
|
WeC04 |
Metro W |
Estimation and Identification II |
Regular Session |
Chair: Anderson, Logan | University of Minnesota |
Co-Chair: Arezki, Hasni | University of Genova (Italy) University of Lorraine (France) |
|
15:30-15:45, Paper WeC04.1 | |
Nonlinear Observer Design for Vehicle Lateral Load Transfer Ratio Estimation |
|
Meng, Shengya | Universite De Lorraine, CRAN UMR CNRS |
Meng, Fanwei | School of Control Engineering, Northeastern University at Qinhua |
Zhang, Fan | Sun Yat-Sen University |
Alma, Marouane | CRAN Lorraine University |
Haddad, Madjid | SEGULA Technologie |
Zemouche, Ali | CRAN UMR CNRS 7039 & Université De Lorraine |
Keywords: Estimation, Observers for nonlinear systems, Automotive control
Abstract: This paper proposes a nonlinear observer design for estimating the lateral load transfer ratio (LTR), a type of rollover index for vehicle safety, with a reduced reliance on measurable signals, applicable to both untripped and tripped rollovers. The dynamics of the four-degree-of-freedom vehicle are modeled to include tripped rollover, treating tire forces as unknown inputs. To address output nonlinearity, the observer employs a generalized inverse, offering an innovative solution. Benefiting from the unique structure of the state-space equations of the vehicle model, only three measurable signals are required for state and unknown input estimation. An algorithm for the observer is presented, ensuring the asymptotic stability of error through parametric solutions for matrix equations and the Lyapunov stability theorem. Validation conducted via CarSim simulations demonstrates the effectiveness of the proposed nonlinear observer in accurately estimating the vehicle states and tire forces. This accuracy positions it as a valuable tool for rollover prediction in vehicle safety applications.
|
|
15:45-16:00, Paper WeC04.2 | |
Simple but Useful Contributions to High-Gain Observer for Non-Triangular Systems |
|
Arezki, Hasni | University of Genova (Italy) University of Lorraine (France) |
Zemouche, Ali | CRAN UMR CNRS 7039 & Université De Lorraine |
Keywords: Estimation, Observers for nonlinear systems, LMIs
Abstract: This paper deals with nonlinear observer design for a class of non-triangular nonlinear systems. Simple but useful contributions to the design of high-gain observers are proposed for systems with arbitrary nonlinear structures contrary to the standard results on high-gain observer methodology developed for triangular nonlinearities. First, based on the use of a convenient decomposition of the nonlinear function, a general design method is provided. Compared to the standard high-gain observer, the proposed method requires an extra constraint on the tuning parameter to dominate the non-triangular components of the nonlinearity of the system. To reduce the conservatism of the extra-condition, further results are established by exploiting the Linear Matrix Inequality (LMI) based approach. A numerical design algorithm is provided to build all the observer parameters. Finally, an illustrative example is presented to show the effectiveness and validity of the proposed methods.
|
|
16:00-16:15, Paper WeC04.3 | |
Controlling UAVs by Sensing the Electric or the Magnetic Field Around Power Lines |
|
Satici, Aykut C | Boise State University |
Peterson, Alex | Boise State University |
Chiasson, John | Boise State University |
Adams, Zachary | Pitch Aeronautics |
Keywords: Estimation, Robotics, Control applications
Abstract: We provide a procedure for estimating the relative position of a drone with respect to three-phase power lines. This is done by making use of the root mean square (rms) measurements of the electric/magnetic fields emanating from the power lines. Maxwell's equations are used to derive the sensor measurement model showing that the total rms electric/magnetic field is a measure of the relative position of the drone. The squared inverse of the total rms electric/magnetic field serves as a potential function for a Hamiltonian, which in turn is a Lyapunov function. It is shown that the gradient of this potential function always points to the (convex hull of the) power lines. Based on Lyapunov analysis, a control algorithm is developed that forces a drone to follow the gradient of the potential to the power lines. This controller provides the capability to inspect power lines or carry out installation tasks on them. Simulations are presented to illustrate the efficacy of the approach.
|
|
16:15-16:30, Paper WeC04.4 | |
Two-Layer Diffusion Adaptive Filters Over Directed Markovian Switching Networks |
|
Xie, Siyu | University of Electronic Science and Technology of China |
Gan, Die | Chinese Academy of Science |
Liu, Zhixin | Academy of Mathematics and Systems Science, ChineseAcademyof Scie |
Keywords: Estimation, Sensor networks, Time-varying systems
Abstract: We consider the problem of distributed adaptive filtering, where a set of nodes in the network is required to estimate an unknown parameter of interest from noisy measurements cooperatively. Based on normalized least mean squares (NLMS) adaptive filters, we focus on a two-layer diffusion strategy to diffuse data more thoroughly. We propose and analyze the two-layer diffusion NLMS algorithm, where the communications between nodes in the network are described by directed Markovian switching graphs. The directed graphs are not only used for the combination of local estimates but also used for the adaptation step in the two-layer strategy. The stability results of the two-layer diffusion adaptive filters are established under a general cooperative information assumption, without independence and stationarity signal conditions which were widely used in the literature. Moreover, the stability result indicates that even if any node cannot estimate the unknown parameter individually, the whole network can still fulfill the estimation task through communications. Simulation results show that the proposed two-layer diffusion NLMS algorithm has better performance compared with the consensus one.
|
|
16:30-16:45, Paper WeC04.5 | |
Outlier Accommodation for GNSS Precise Point Positioning Using Risk-Averse State Estimation |
|
Hu, Wang | University of California Riverside |
Uwineza, Jean-Bernard | University of California, Riverside |
Farrell, Jay A. | University of California Riverside |
Keywords: Fault accomodation, Kalman filtering, Estimation
Abstract: Reliable and precise absolute positioning is necessary in the realm of Connected Automated Vehicles (CAV). Global Navigation Satellite Systems (GNSS) provides the foundation for absolute positioning. Recently enhanced Precise Point Positioning (PPP) technology now offers corrections for GNSS on a global scale, with the potential to achieve accuracy suitable for real-time CAV applications. However, in obstructed sky conditions, GNSS signals are often affected by outliers; therefore, addressing outliers is crucial. In GNSS applications, there are many more measurements available than are required to meet the specification. Therefore, selecting measurements to avoid outliers is of interest. The recently developed Risk-Averse Performance-Specified (RAPS) state estimation optimally selects measurements to minimize outlier risk while meeting a positive semi-definite constraint on performance; at present, the existing solution methods are not suitable for real-time computation and have not been demonstrated using challenging real-world data or in Real-time PPP (RT-PPP) applications. This article makes contributions in a few directions. First, it uses a diagonal performance specification, which reduces computational costs relative to the positive semi-definite constraint. Second, this article considers GNSS RT-PPP applications. Third, the experiments use real-world GNSS data collected in challenging environments. The RT-PPP experimental results show that among the compared methods: all achieve comparable performance in open-sky conditions, and all exceed the Society of Automotive Engineers (SAE) specification; however, in challenging environments, the diagonal RAPS approach shows improvement of 6-19% over traditional methods. Throughout, RAPS achieves the lowest estimation risk.
|
|
16:45-17:00, Paper WeC04.6 | |
Statistical Bounds on Identified QSR Dissipative Properties from Input-Output Data |
|
Anderson, Logan | University of Minnesota |
Caverly, Ryan James | University of Minnesota |
Lamperski, Andrew | University of Minnesota |
Keywords: Identification for control, Estimation, Linear systems
Abstract: Dissipativity is a powerful tool in control design as it can be used to ensure closed-loop stability using open-loop input-output properties. This paper presents a framework to estimate the QSR-dissipative parameters of a discrete-time system from estimates of either the power spectral response or transfer function of the system. Specific methods are presented for estimating the input feedforward passivity index, conic sector bounds and L2-gain of the system. Methods are also presented to propagate the bound on the worst-case error from the power spectral response to these parameters with high probability. Numerical simulations are performed using the proposed methods on a randomly generated system. The estimators for the QSR-dissipative parameters closely match the true values even when the signal-to-noise ratio is small, while the computed worst-case error bound is not violated in any simulation.
|
|
WeC05 |
Marine |
Optimization II |
Regular Session |
Chair: Chen, Xu | University of Washington |
Co-Chair: Oliveira, Tiago Roux | State University of Rio De Janeiro |
|
15:30-15:45, Paper WeC05.1 | |
Extremum Seeking for a Class of Wave Partial Differential Equations with Kelvin-Voigt Damping |
|
Silva, Paulo Cesar Souza | Military Institute of Engineering (IME) |
Pellanda, Paulo Cesar | IME - Military Institut of Engineering |
Oliveira, Tiago Roux | State University of Rio De Janeiro |
de Andrade, Gustavo Artur | Universidade Federal De Santa Catarina |
Krstic, Miroslav | University of California, San Diego |
Keywords: Optimization, Adaptive control, Distributed parameter systems
Abstract: This paper presents the design and analysis of gradient extremum seeking applied to scalar static maps maps within the context of infinite-dimensional dynamics governed by Partial Differential Equations (PDEs) of wave type featuring a small amount of Kelvin-Voigt damping. Notably, this particular class of PDEs for extremum seeking still needs to be explored in the existing literature. First, to compensate for the influence of PDE actuation dynamics, we employ a boundary control law via backstepping transformation and averaging-based estimates of the gradient and Hessian. Finally, we prove the local exponential convergence to a small neighborhood surrounding the unknown optimal point by means of an Input-to-State Stability analysis, as well as by employing Lyapunov functionals and averaging theory in infinite dimensions.
|
|
15:45-16:00, Paper WeC05.2 | |
Distributed Optimization of Network Weights for Improved Performance |
|
Xu, Yicheng | University of California Irvine |
Jabbari, Faryar | Univ. of California at Irvine |
Keywords: Optimization, Agents-based systems, Computational methods
Abstract: This paper investigates a discrete-time multi-agent system, focusing on weight assignments in Laplacian matrices to enhance control performance. Leveraging leader-following network topologies and fast eigenvalue estimators, the algorithm minimizes the largest to smallest non-zero eigenvalue ratio. Max-consensus is used to guarantee finite convergence steps. An example demonstrates the improved closed-loop system performance.
|
|
16:00-16:15, Paper WeC05.3 | |
On-Line Motion Planning Using Bernstein Polynomials for Enhanced Target Localization in Autonomous Vehicles |
|
Tabasso, Camilla | University of Iowa |
Cichella, Venanzio | University of Iowa |
Keywords: Optimization, Autonomous systems
Abstract: The use of autonomous vehicles for target localization in modern applications has emphasized their superior efficiency, improved safety, and cost advantages over human-operated methods. For localization tasks, autonomous vehicles can be used to increase efficiency and ensure that the target is localized as quickly and precisely as possible. However, devising a motion planning scheme to achieve these objectives in a computationally efficient manner suitable for real-time implementation is not straightforward. In this paper, we introduce a motion planning solution for enhanced target localization, leveraging Bernstein polynomial basis functions to approximate the probability distribution of the target's trajectory. This allows us to derive estimation performance criteria which are used by the motion planner to enhance the estimator efficacy. To conclude, we present simulation results that validate the effectiveness of the suggested algorithm.
|
|
16:15-16:30, Paper WeC05.4 | |
Safe Online Convex Optimization with First-Order Feedback |
|
Hutchinson, Spencer | University of California, Santa Barbara |
Alizadeh, Mahnoosh | University of California Santa Barbara |
Keywords: Optimization, Learning
Abstract: We study an online convex optimization problem where the player must satisfy an unknown constraint at all rounds, while only observing the gradient and function value of the constraint at the chosen actions. For this problem, we develop an algorithm that uses an optimistic set, which overestimates the constraint, to identify low-regret actions while using a pessimistic set, which underestimates the constraint, to ensure constraint satisfaction. Our analysis shows that this algorithm satisfies the constraint at all rounds while enjoying O(sqrt{T}) regret when the constraint function is smooth and strongly convex. We then extend our algorithm to a setting with time-varying constraints and prove that it enjoys similar guarantees in this setting. Lastly, we demonstrate the effectiveness of our algorithm with a set of numerical experiments.
|
|
16:30-16:45, Paper WeC05.5 | |
Sparsity Via Sparse Group K-Max Regularization |
|
Tao, Qinghua | KU Leuven |
Xi, Xiangming | Zhejiang Lab |
Xu, Jun | Harbin Institute of Technology, Shenzhen |
Suykens, J.A.K. | Katholieke Univ. Leuven |
Keywords: Optimization, Machine learning, Identification
Abstract: For the linear inverse problem with sparsity constraints, the l0 regularized problem is NP-hard, and existing approaches either utilize greedy algorithms to find almost optimal solutions or approximate the l0 regularization with its convex counterparts. In this paper, we propose a novel regularization technique, namely the sparse group k-max regularization. This approach has the advantages of enhancing both the group-wise and in-group sparsity without imposing any additional constraints on the magnitude of variables in each group. Our approach approximates the l0 norm more closely than other relaxation methods, making it especially important for problems with variables at varying scales. We have also established an iterative soft thresholding algorithm and have provided local optimality conditions and complexity analysis. We have tested our method with numerical experiments on both synthetic and real-world datasets, and our results show that our technique is effective and flexible.
|
|
16:45-17:00, Paper WeC05.6 | |
Optimal Loop Shaping and Disturbance Rejection Beyond the Nyquist Frequency Using a Forward Model Disturbance Observer and Convex Optimization Based Filter Design |
|
Chu, Thomas | University of Washington |
Hu, Xiaohai | University of Washington |
Chen, Xu | University of Washington |
Keywords: Computer-aided control design, LMIs, Optimization
Abstract: Loop shaping based on the disturbance observer (DOB) offers great flexibility in designing a control system's closed-loop sensitivity to external disturbances and noises. While it is well understood how to design feedback control to reject band-limited disturbances with little trade-off when a closed-loop system has a single sampling control rate, challenges arise when disturbances appear beyond the Nyquist frequency and when the speed of the feedback sensor cannot be conveniently increased due to hardware and/or process constraints. Such is the case in hard disk drives and emerging vision-based motion control. In this paper, we propose an optimal multirate forward model disturbance observer (MFMDOB) for intuitive, flexible, and exact rejection of band-limited disturbances beyond the Nyquist frequency. Based on the tools from Youla-Kucera parameterization, the internal model principle, multirate analysis, and convex optimization, we translate the design objective into a set of model-based convex optimization and multirate prediction problems, enabling optimal local loop shaping. We provide different optimal design formulations with finite and infinite impulse response filters. Verification of the MFMDOB is conducted on a galvo scanning process model in selective laser sintering for additive manufacturing.
|
|
WeC06 |
Queens Quay 1 |
Modeling and State Estimation for Batteries |
Invited Session |
Chair: Song, Ziyou | University of Michigan, Ann Arbor |
Co-Chair: De Castro, Ricardo | University of California, Merced |
Organizer: Zhang, Dong | University of Oklahoma |
Organizer: Soudbakhsh, Damoon | Temple University |
Organizer: Jain, Neera | Purdue University |
Organizer: Dey, Satadru | The Pennsylvania State University |
Organizer: Tang, Shuxia | Texas Tech University |
Organizer: Roy, Tanushree | Texas Tech University |
Organizer: Moura, Scott | University of California, Berkeley |
Organizer: Lin, Xinfan | University of California, Davis |
Organizer: De Castro, Ricardo | University of California, Merced |
Organizer: Song, Ziyou | University of Michigan, Ann Arbor |
Organizer: Fogelquist, Jackson | University of California, Davis |
|
15:30-15:45, Paper WeC06.1 | |
Bias-Compensated State Estimation Algorithm for LFP Batteries with Flat OCV-SOC Curves (I) |
|
Yi, Baozhao | National University of Singapore |
Zhang, Jiawei | National University of Singapore |
Song, Ziyou | University of Michigan, Ann Arbor |
Keywords: Estimation, Kalman filtering
Abstract: The estimation of the state of charge (SOC) and state of health (SOH) for lithium-ion batteries is essential to ensure their safety and reliable operation. Although the current model-based estimation algorithms can mitigate the impact of measurement noise, they often struggle to address voltage measurement bias that can significantly undermine the estimation accuracy. This challenge is particularly pronounced in the case of Lithium Iron Phosphate (LFP) batteries that exhibit flat open-circuit voltage curves within the middle SOC range, rendering them highly susceptible to bias and noise. In this work, the effect of voltage measurement bias on the joint SOC/SOH estimation is quantitatively analyzed. Moreover, a bias-compensated algorithm is proposed based on the Extended Kalman Filter (EKF) to achieve accurate and robust state estimation for LFP batteries by estimating and compensating the bias in voltage measurements. Simulation results demonstrate the effectiveness of the proposed approach, which outperforms the conventional method that continuously estimates states without considering the effects of voltage measurement bias. Finally, the proposed algorithm is also experimentally verified, further validating its real-world applicability and advantages.
|
|
15:45-16:00, Paper WeC06.2 | |
Nonlinear Fractional Dynamics Integrated Physics-Informed Neural Network Model for LiFePO4 Batteries in Electric Vehicles (I) |
|
Borah, Manashita | 1) University of California, Berkeley, USA, 2) Tezpur University |
Jiang, Shida | Mr. Shida Jiang |
Shi, Junzhe | UC Berkeley |
Moura, Scott | University of California, Berkeley |
Keywords: Energy systems, Nonlinear systems identification, Model Validation
Abstract: This paper addresses the long-standing challenge of attaining high-precision models for LiFePO4 batteries which suffer from weakly observable dynamics. We introduce a new paradigm of integrating a nonlinear fractional-order physics-based model with a hybrid neural network model. First, a fractional-order model (FOM) is proposed to capture the physics of the battery that existing integer-order models (IOMs) fail to replicate, such as the solid phase diffusion. The FOM parameters are state dependent as they vary along with the progression of the state of charge (SOC). Second, the unknown and unmodelled physics is captured by a hybrid neural network model integrated with the FOM. The physical states of the FOM are used to guide the neural network resulting in a state dependent nonlinear fractional-order physics-informed neural network (FO-PINN) to predict the terminal voltage of the battery. Validation with experimental results and comparisons with existing modelling techniques reveal that the proposed scheme delivers improved predictive accuracy with decreased computational cost and enhanced physically meaningful information. The scheme has potential in applications that demand high propulsive power and accuracy, such as electric aircraft.
|
|
16:00-16:15, Paper WeC06.3 | |
Lightweight Electrochemical Hybrid Modeling Approach for Li-Ion Batteries Using Gaussian Process Regression (I) |
|
Fogelquist, Jackson | University of California, Davis |
Lin, Xinfan | University of California, Davis |
Keywords: Modeling, Energy systems, Machine learning
Abstract: The development of next-generation battery management systems needs models with enhanced performance to enable advanced control, diagnostic, and prognostic techniques for improving the safety and performance of lithium-ion battery systems. Specifically, battery models must deliver efficient and accurate predictions of physical internal states and output voltage, despite the inevitable presence of various system uncertainties. To facilitate this, we propose a lightweight hybrid modeling framework that couples a high-fidelity physics-based electrochemical battery model with a computationally-efficient Gaussian process regression (GPR) machine learning model to predict and compensate for errors in the electrochemical model output. This is the first time that GPR has been implemented to predict the output residual of an electrochemical battery model, which is significant for the following reasons. First, we demonstrate that GPR is capable of considerably improving output prediction accuracy, as evidenced by an observed average root-mean-square prediction error of 7.3 mV across six testing profiles, versus 119 mV for the standalone electrochemical model. Second, we employ a data sampling procedure to exhibit how GPR can use sparse training data to deliver accurate predictions at minimal computational expense. Our framework yielded a ratio of computation time to modeled time of 0.003, indicating ample suitability for online applications.
|
|
16:15-16:30, Paper WeC06.4 | |
Weaknesses and Improvements of the Extended Kalman Filter for Battery State-Of-Charge and State-Of-Health Estimation (I) |
|
Jiang, Shida | Mr. Shida Jiang |
Shi, Junzhe | UC Berkeley |
Borah, Manashita | University of California, Berkeley |
Moura, Scott | University of California, Berkeley |
Keywords: Kalman filtering, Estimation, Energy systems
Abstract: Battery management systems (BMS) are essential for ensuring battery performance and safety. Accurate estimation of the State of Charge (SOC) and State of Health (SOH), for example, are critical. However, utilizing the conventional Extended Kalman Filter (EKF) for SOC and SOH co-estimation is often challenging due to problems such as overconfident covariance estimation, overly simplistic assumptions about process noise and measurement noise covariance matrices, and the shift of the open circuit voltage (OCV) curve as the cell ages. To address these issues, this paper introduces an improved EKF design for co-estimating the SOC and SOH. The proposed approach incorporates innovative strategies to counteract covariance pitfalls, calculates the optimal covariance matrix configuration objectively, and incorporates OCV shifts from aging. Comparative simulations underscore the superiority of our method against traditional EKF and Unscented Kalman Filter (UKF) techniques.
|
|
16:30-16:45, Paper WeC06.5 | |
Interconnected Sigma-Point Kalman Filter Application for Electrochemical State Estimation of Lithium-Ion Batteries |
|
Kawakita de Souza, Aloisio Henrique | University of Colorado at Colorado Springs |
Plett, Gregory L. | University of Colorado Colorado Springs |
Trimboli, Michael | University of Colorado, Colorado Springs |
Keywords: Energy systems, Kalman filtering, Estimation
Abstract: Lithium-ion batteries (LIB) are the energy-storage element of choice due to their high energy density and low self-discharge. Accurate estimation of battery state-of-charge (SOC), state-of-health (SOH) and state-of-power (SOP) depend on accurate mathematical models of underlying electrochemical behavior. High-accuracy models of LIBs requires knowledge of the potentials associated with both positive and negative electrodes. Viewing each electrode as a proper subsystem of a cell, it is natural to consider estimating the behavior of the interacting subsystems using a pair of interacting estimators. This paper introduces an interconnected sigma-point Kalman filter applied to a reduced-complexity single particle model with electrolyte dynamics (ESPM). The resultant estimator is able to calculate approximations of a variety of internal electrochemical variables (e.g., lithium concentrations and internal potentials) useful for advanced control applications.
|
|
WeC07 |
Queens Quay 2 |
Traffic Control I |
Regular Session |
Chair: Malikopoulos, Andreas A. | Cornell University |
Co-Chair: Timotheou, Stelios | University of Cyprus |
|
15:30-15:45, Paper WeC07.1 | |
Safe Optimal Interactions between Automated and Human-Driven Vehicles in Mixed Traffic with Event-Triggered Control Barrier Functions |
|
Li, Anni | Boston University |
Cassandras, Christos G. | Boston University |
Xiao, Wei | Massachusetts Institute of Technology |
Keywords: Traffic control, Human-in-the-loop control, Optimal control
Abstract: This paper studies safe driving interactions between Human-Driven Vehicles (HDVs) and Connected and Automated Vehicles (CAVs) in mixed traffic where the dynamics and control policies of HDVs are unknown and hard to predict. In order to address this challenge, we employ event-triggered Control Barrier Functions (CBFs) to estimate the HDV model online, construct data-driven and state-feedback safety controllers, and transform constrained optimal control problems for CAVs into a sequence of event-triggered quadratic programs. We show that we can ensure collision-free interactions between HDVs and CAVs and demonstrate the robustness and flexibility of our framework on different types of human drivers in lane-changing scenarios while guaranteeing the satisfaction of safety constraints.
|
|
15:45-16:00, Paper WeC07.2 | |
Parameter Estimation in Optimal Tolling for Traffic Networks under the Markovian Traffic Equilibrium |
|
Chiu, Chih-Yuan | University of California, Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Traffic control, Learning, Game theory
Abstract: Tolling, or congestion pricing, has emerged as an effective tool for preventing gridlock in traffic systems. However, tolls are currently mostly designed on route-based traffic assignment models (TAM), which may be unrealistic and computationally expensive. Existing approaches also impractically assume that the central tolling authority can access latency function parameters that characterize the time required to traverse each network arc (edge), as well as the entropy parameter beta that characterizes commuters' stochastic arc-selection decisions on the network. To address these issues, this work formulates an online learning algorithm that simultaneously refines estimates of linear arc latency functions and entropy parameters in an arc-based TAM, while implementing tolls on each arc to induce equilibrium flows that minimize overall congestion on the network. We prove that our algorithm incurs regret upper bounded by O(sqrt{T} ln(T) |arcsMod| max{|nodesMod| ln(|arcsMod|/|nodesMod|), B }), where T denotes the total iteration count, |arcsMod| and |nodesMod| denote the total number of arcs and nodes in the network, respectively, and B describes the number of arcs required to construct an estimate of beta (usually ll |I|). Finally, we present numerical results on simulated traffic networks that validate our theoretical contributions.
|
|
16:00-16:15, Paper WeC07.3 | |
Decentralized Optimal Merging Control for Mixed Traffic with Vehicle Inference |
|
Xiao, Wei | Massachusetts Institute of Technology |
Cassandras, Christos G. | Boston University |
Keywords: Traffic control, Optimal control, Constrained control
Abstract: This paper addresses the optimal control of vehicles arriving from two curved roads at a merging point where the objective is to jointly minimize the travel time, energy consumption, and passenger discomfort. Unlike prior work where traffic consists entirely of Connected and Automated Vehicles (CAVs), we consider optimal controls for CAVs in mixed traffic including Human-Driven Vehicles (HDVs) behaving according to some car-following model that includes random actions. The control applied to CAVs is based on partial information about the presence and states of HDVs which is inferred from local observations available to the CAVs. The passing order of HDVs at the merging point is determined and assisted by CAVs using a proposed Minimum-Effort Merging Contract (MEMC) that uses Control Barrier Functions (CBFs) to guarantee safety. A coordinator is used to manage both the CAV information and inferred HDV information such that the problem can still be solved in a decentralized way. Our approach first determines an analytically tractable unconstrained optimal solution. We then use the joint Optimal Control and Barrier Function (OCBF) method to obtain a controller which optimally tracks such a solution while also guaranteeing all safety and control constraints, including a safe merging contract between CAVs and HDVs. Simulation examples are included to compare the performance under different CAV penetration rates.
|
|
16:15-16:30, Paper WeC07.4 | |
Optimizing the Crossing Sequence in Autonomous Intersection Management with Travel Time and Energy Considerations |
|
Hadjigeorgiou, Andreas | University of Cyprus |
Timotheou, Stelios | University of Cyprus |
Keywords: Traffic control, Optimization, Control applications
Abstract: Connected and Automated Vehicles (CAVs) offer fine-grained control and connectivity with other vehicles and the surrounding infrastructure that enable safe and energy efficient navigation. In this paper, mathematical programming is employed to construct optimal CAV crossing sequences with the objective of minimizing a combination of fuel consumption and travel time. The developed approach is based on the construction of the minimum energy profile for each CAV which allows the derivation of efficient crossing times, using Mixed Integer Programming, without the need to consider vehicle dynamics. Safety is then guaranteed by ensuring that the vehicles achieve the corresponding crossing times using Quadratic Programming. The effectiveness of the derived CAV sequences compared to FIFO is illustrated through extensive simulations. The results indicate that resequencing yields minor reduction in travel time but significant improvement in fuel consumption. In fact, fuel consumption improves for increasing number of CAVs arriving at the intersection. CAV rescheduling further supports observations of previous works that small sacrifices in travel time result in considerable fuel savings.
|
|
16:30-16:45, Paper WeC07.5 | |
Global Stabilization of Nash Equilibrium for Mixed Traffic |
|
Scruggs, Jeff | University of Michigan |
Lee, Richard | University of Michigan |
Yin, Yafeng | University of Michigan |
Keywords: Traffic control, Transportation networks, Networked control systems
Abstract: We consider a traffic network in which the traffic is a mix of regular and connected-autonomous vehicles. We presume the headways for vehicles in each link in the network are distinct for the regular and autonomous vehicles, and the autonomous vehicle headways differ depending on type of vehicle being followed. We analyze the network in the context of a population game, with each population corresponding to an origin-destination pair and vehicle type. We assume the evolutionary dynamics of each population distribution are governed by an Impartial Pairwise Comparison (IPC) Protocol. For the regular vehicles, we presume the payoff mechanism is the negative of the travel time. For the autonomous vehicles, we presume the payoff mechanism is an algorithm that is controlled centrally, using feedback about the current state of the system. For this scenario, we propose a dynamic payoff control algorithm for the autonomous vehicles that guarantees global convergence to Nash equilibrium. Additionally, the algorithm assures that in steady-state, the regular and autonomous vehicles for each origin-destination pair equilibrate to the same optimum routes.
|
|
16:45-17:00, Paper WeC07.6 | |
Routing in Mixed Transportation Systems for Mobility Equity |
|
Bang, Heeseung | University of Delaware |
Dave, Aditya Deepak | Cornell University |
Malikopoulos, Andreas A. | Cornell University |
Keywords: Traffic control, Transportation networks, Smart cities/houses
Abstract: This paper proposes a routing framework in mixed transportation systems for improving mobility equity. We present a strategic routing game that governs interactions between compliant and noncompliant vehicles, where noncompliant vehicles are modeled with cognitive hierarchy theory. Then, we introduce a mobility equity metric (MEM) to quantify the accessibility and fairness in the transportation network. We integrate the MEM into the routing framework to optimize it with adjustable weights for different transportation modes. The proposed approach bridges the gap between technological advancements and societal goals in mixed transportation systems to enhance efficiency and equity. We provide numerical examples and analysis of the results.
|
|
WeC08 |
Bay |
Fault Diagnosis |
Regular Session |
Chair: Kravaris, Costas | Texas A&M University |
Co-Chair: Bollas, George | University of Connecticut |
|
15:30-15:45, Paper WeC08.1 | |
Fault Identification Enhancement with Reinforcement Learning (FIERL) |
|
Sartor, Davide | University of Padua |
Zaccaria, Valentina | Università Degli Studi Di Padova |
Del Favero, Simone | University of Padova |
Susto, Gian Antonio | University of Padova |
Keywords: Fault diagnosis, Machine learning, Fault detection
Abstract: This work presents a novel approach in the field of Active Fault Detection (AFD), by explicitly separating the task into two parts: Passive Fault Detection (PFD) and control input design. This formulation is very general, and most existing AFD literature can be viewed through this lens. By recognizing this separation, methods can be leveraged to provide components that make efficient use of the available information, while the control input is designed in order to optimize the gathering of information. The core contribution of this work is FIERL, a general simulation-based approach for the design of such control strategies, using Constrained Reinforcement Learning (CRL) to optimize the performance of arbitrary passive detectors. The control policy is learned without the need of knowing the passive detector innerworkings, making FIERL broadly applicable. However, it is especially useful when paired with the design of an efficient passive component. Unlike most AFD approaches, FIERL can handle fairly complex scenarios such as continuous sets of fault modes. The effectiveness of FIERL is tested on a benchmark problem for actuator fault diagnosis, where FIERL is shown to be fairly robust, being able to generalize to fault dynamics not seen in training.
|
|
15:45-16:00, Paper WeC08.2 | |
Fault Detection in Closed-Loop Systems Based on Inferential Sensors |
|
Safikou, Efi | University of Connecticut |
Bollas, George | University of Connecticut |
Keywords: Fault diagnosis, Sensor fusion, Information theory and control
Abstract: Due to their ability to reduce the influence of external disturbances on system operation, closed-loop systems have been proven more robust and stable than open-loop systems. However, while numerous fault detection schemes have been developed for open-loop systems, they cannot be applied directly to systems with feedback due to the foundational differences in their operation. In this study, we develop a passive fault detection method for improving the identification of faults in closed-loop systems. We design inferential sensors by combining information theory and symbolic regression. These information-rich sensors are created using a genetic algorithm, which accounts for fault information through the Fisher Information Matrix. For evaluation purposes, we employ a k-Nearest Neighbors classification scheme, that compares system performance with and without the inclusion of inferential sensor(s). The proposed framework is applied to a dynamic model of a cross-flow plate-fin heat exchanger system, which encompasses various levels of measurement noise and uncertainty.
|
|
16:00-16:15, Paper WeC08.3 | |
Dual-Stream Cross-Modal Feature Fusion Based on Multi-Scale Attention for Industrial Fault Diagnosis |
|
Lian, Penglong | University of Electronic Science and Technology of China (UESTC) |
Shang, Penghui | Zhiyuan Research Institute |
Zhang, Jiyang | University of Electronic Science and Technology of China |
Su, Zhiheng | University of Electronic Science and Technology of China |
Zou, Jianxiao | University of Electronic Science and Technology of China |
Fan, Shicai | University of Electronic Science and Technology of China |
Keywords: Fault diagnosis
Abstract: The characteristics of non-stationarity and non-linearity present a challenging task for the fault diagnosis of bearings. Traditional feature fusion methods did not show satisfying capacity in information extraction and therefore confront difficulties in differentiating bearing fault states. To fully exploit correlated information and enhance cross-domain feature fusion, a novel dual-stream cross-modal feature fusion approach based on multi-scale attention for fault diagnosis is proposed in this paper. To effectively preserve the sequence signals and their temporal dependencies, the Gramian Angular Field (GAF) encoding technique is leveraged to transform raw time-domain signals into two-dimensional images. Besides, a multi-scale channel attention module (MSA) is introduced in the convolutional neural networks which could address the challenges related to inconsistent scales and feature distributions when fusing different modalities, and it enables dual-stream cross-modal (DSCM) data to exchange information and integrate complementary features. Experimental evaluations conducted on the Case Western Reserve University dataset demonstrate that the proposed method DSCM-MSA achieves higher accuracy and efficiency in cross-modal bearing fault diagnosis tasks compared with other methods, and can therefore provide a reliable foundation for industrial fault diagnosis.
|
|
16:15-16:30, Paper WeC08.4 | |
Detection of Valve Stiction in Industrial Control Loops through Continuous Wavelet Transformation with a CNN |
|
Gunnell, LaGrande | Brigham Young University |
Perez, Krystian X | Dow Chemical Company |
Castillo, Ivan | The Dow Chemical Company |
Hoogerwerf, Rob | Company |
Smith, Alexander | University of Minnesota |
Peng, You | Dow |
Hedengren, John | Brigham Young University |
Keywords: Pattern recognition and classification, Neural networks, Fault diagnosis
Abstract: Control valve stiction is a common equipment problem where the valve exhibits delayed response to control output and becomes stuck due to static friction, which can lead to undesired nonlinear behavior and oscillations. It is critical to identify and correct this problem to ensure consistent operation in control loops. This paper introduces the novel technique continuous wavelet transform - convolutional neural network (CWT-CNN) for non-intrusive valve stiction detection. Industrial Process data is converted to an image with continuous wavelet transformation and then fed into a deep convolutional neural network to classify stiction behavior. The CWT-CNN is fine-tuned from pretrained models like GoogleNet and ResNet via transfer learning for better classification and faster training while requiring less data. This work uses control loops from various chemical plants for training. The best performing CWT-CNN using GoogleNet can accurately predict 95.62% loops in the validation set, and has a true positive rate of 83.9% on the test set
|
|
16:30-16:45, Paper WeC08.5 | |
Disturbance Decoupled Functional Observers for Fault Estimation in Nonlinear Systems |
|
Venkateswaran, Sunjeev | Texas A&M University, College Station |
Kravaris, Costas | Texas A&M University |
Keywords: Observers for nonlinear systems, Fault diagnosis, Chemical process control
Abstract: This work deals with the problem of designing disturbance decoupled observers for the estimation of a function of the states in nonlinear systems. Necessary and sufficient conditions for the existence of lower order disturbance decoupled functional observers with linear dynamics and linear output map are derived. Based on this methodology, a fault estimation scheme based on disturbance decoupled observers will be presented. Throughout the paper, the application of the results will be illustrated through a chemical reactor case study. Simulation case studies demonstrate the effectiveness of the proposed methodology in reactor process monitoring and fault estimation.
|
|
16:45-17:00, Paper WeC08.6 | |
Fully Distributed Unknown Input Observer Based Fault Detection for Interconnected Systems |
|
Liang, Dingguo | Peking University |
He, Zhichen | Peking University |
Zhao, Zhengen | Nanjing University of Aeronautics and Astronautics |
Li, Wenlong | Peking University |
Yang, Ying | Peking University |
Keywords: Fault diagnosis, Control system architecture, Estimation
Abstract: In this article, a set of fully distributed unknown input observers are designed for continuous-time linear interconnected systems. As a fully distributed scheme, each subsystem uses only local and interconnected information with neighbors to design the UIO, and then their data to monitor the local state. By unknown input decomposition, the UIOs can asymptotically estimate the state of interconnected systems in the presence of actuator faults. The presented scheme is scalable, allowing plug and play operations. Then, a Luenberger observer is constructed for each subsystem, which interconnected with UIOs for the other subsystems is utilized to construct the fault detector. The residual generated by the fault detector is sensitive to the local fault while decoupled with faults in the other subsystems such that distributed fault detection for interconnected systems with more than one faulty subsystems is feasible. Finally, simulations are conducted for fault detection on power networks to show the effectiveness of the proposed method.
|
|
WeC09 |
Dockside 1 |
Flight Control |
Regular Session |
Chair: Michieletto, Giulia | University of Padova |
Co-Chair: Kidambi, Krishna Bhavithavya | University of Dayton |
|
15:30-15:45, Paper WeC09.1 | |
Multi-Outer Loop Dynamic Inversion Control: An Application to a VTOL Free-Wing Aircraft |
|
Axten, Rachel | Penn State University |
Khamvilai, Thanakorn | The Pennsylvania State University |
Johnson, Eric | Pennsylvania State University |
Keywords: Feedback linearization, Flight control, Autonomous systems
Abstract: This letter extends a typical series cascade control structure to an N-outer loop system applicable to coupled rigid body systems, including a VTOL free-wing aircraft. In this case, there are now external command signals from multiple, outer subsystems that are designed simultaneously for a single inner subsystem. For example, in the case of the VTOL free-wing aircraft, the standard outer loop modifies the wing pitch attitude command to achieve the desired position and velocity, and a newly implemented “swing loop” also modifies the wing pitch attitude command to dampen fuselage pitch attitude oscillations. The proposed multi-outer loop dynamic inversion architecture is successful in the autonomous control of a 3-loop VTOL free-wing aircraft with both simulation and flight test results presented.
|
|
15:45-16:00, Paper WeC09.2 | |
Multi-Agent Reinforcement Learning for the Low-Level Control of a Quadrotor UAV |
|
Yu, Beomyeol | The George Washington University |
Lee, Taeyoung | George Washington University |
Keywords: Flight control, Machine learning, Cooperative control
Abstract: By leveraging the underlying structures of the quadrotor dynamics, we propose multi-agent reinforcement learning frameworks to innovate the low-level control of a quadrotor, where independent agents operate cooperatively to achieve a common goal. While single-agent reinforcement learning has been successfully applied in quadrotor controls, training a large monolithic network is often data-intensive and time-consuming. Moreover, achieving agile yawing control remains a significant challenge due to the strongly coupled nature of the quadrotor dynamics. To address this, we decompose the quadrotor dynamics into translational and yawing components and assign collaborative reinforcement learning agents to each part to facilitate more efficient training. Additionally, we introduce regularization terms to mitigate steady-state errors and prevent excessive maneuvers. Benchmark studies, including sim-to-sim transfer verification, demonstrate that our proposed training schemes substantially improve the convergence rate of training, while enhancing flight control performance and stability compared to traditional single-agent approaches.
|
|
16:00-16:15, Paper WeC09.3 | |
Hybrid Control Framework of UAVs under Varying Wind and Payload Conditions |
|
Coursey, Austin | Vanderbilt University |
Zhang, Allan | Vanderbilt University |
Quinones-Grueiro, Marcos | Vanderbilt University |
Biswas, Gautam | Vanderbilt University |
Keywords: Flight control, Robust control, Supervisory control
Abstract: Reinforcement learning (RL) algorithms are increasingly applied to engineering control applications. They offer a promising alternative to traditional control methods, which often rely on complex physics-based models. However, RL algorithms may struggle with generalization beyond the training environment, making class control techniques preferable, particularly in safety-critical applications. This paper presents a hybrid control framework that combines a well-established cascade control architecture and data-driven methods to accommodate varying wind conditions and payloads for unmanned aerial vehicles (UAVs). We reframe the role of the data-driven methods to compensate for the limited adaptability of the traditional control approaches by dynamically modifying the reference velocities to account for disturbances that manifest as adverse wind and payload changes. We demonstrate the advantage of the proposed framework on a Tarot T18 octorotor simulation under aggressive wind field changes and payload changes mid-flight. We also show that our learned disturbance rejection controller generalizes to a different octorotor, the DJI-S1000. As suggested by recent literature, these results confirm that combining RL and established control techniques can significantly improve the generalization capabilities of operating modes not considered in the training stage while also maintaining stability guarantees.
|
|
16:15-16:30, Paper WeC09.4 | |
Trajectory Tracking for Tilted Hexarotors with Concurrent Attitude Regulation |
|
Perin, Marco | University of Padova |
Bertoni, Massimiliano | University of Padova |
Michieletto, Giulia | University of Padova |
Oboe, Roberto | University of Padova |
Cenedese, Angelo | University of Padova |
Keywords: Robotics, Flight control, Control applications
Abstract: Focusing on tilted hexarotors, we propose two control architectures that can tackle the position trajectory tracking task, ensuring also the attitude regulation. One is designed resting on the differential flatness property of the system, overcoming the standard implementation for under-actuated multi-rotors, and the other is a hierarchical nonlinear controller, building upon an advanced state-of-art hovering regulator. We comparatively discuss the performance of the two control schemes, in terms of the accuracy of both the tracking control action and the attitude regulation, the actuators effort, and the robustness in windy flight conditions. Numerical results reveal both the robustness of the hierarchical approach in the presence of external disturbances and the accuracy of the differential flatness-based controller in ideal flight scenarios.
|
|
16:30-16:45, Paper WeC09.5 | |
A Hammerstein-Weiner Modification of Adaptive Autopilot for Parameter Drift Mitigation with Experimental Results |
|
Chee, Yin Yong | University of Michigan Ann Arbor |
Oveissi, Parham | University of Maryland, Baltimore County |
Shao, Siyuan | University of Michigan |
Lee, Joonghyun | University of Michigan |
Paredes Salazar, Juan Augusto | University of Michigan |
Bernstein, Dennis S. | Univ. of Michigan |
Goel, Ankit | University of Maryland Baltimore County |
Keywords: Adaptive control, Flight control, Robotics
Abstract: A crucial challenge in the safe operation of adaptive controllers is the problem of parameter drift, where an underlying optimization problem, if ill-conditioned, may lead to parameter drift. This paper presents a Wiener adaptive autopilot for multicopters to mitigate instabilities caused by adaptive parameter drift and presents simulation and experimental results to validate the modified autopilot. The modified adaptive controller is obtained by including a static nonlinearity in the adaptive loop, updated by the retrospective cost adaptive control algorithm. It is shown in simulation and physical test experiments that the adaptive autopilot with proposed modifications can continually improve the fixed-gain autopilot as well as prevent the drift of the adaptive parameters, thus improving the robustness of the adaptive autopilot.
|
|
16:45-17:00, Paper WeC09.6 | |
Application of a Robust Nonlinear Control Strategy for Disturbance-Resilient Tilt-Rotor Quadcopter Trajectory Tracking |
|
Ijoga, Emmanuel Ogbanje | Embry Riddle Aeronautical University |
Kidambi, Krishna Bhavithavya | University of Dayton |
MacKunis, William | Embry-Riddle Aeronautical University |
Keywords: Control applications, Flight control, Robust control
Abstract: A robust nonlinear control strategy is developed for a Tilt-Rotor Quadcopter (TRQ), which is shown to achieve reliable trajectory tracking performance in the presence of external disturbances. The control method is based on the robust integral of the sign of the error (RISE) control method, which is modified to address the control challenges inherent in TRQ dynamic model. To the best of the authors' knowledge, this is the first result that applies a RISE-based nonlinear control method to a TRQ system. A challenge in the TRQ dynamic model is that the control input is pre-multiplied by a state-dependent input gain matrix. This challenge is mitigated by endowing the control law with a robust feedback element designed to provide enhanced compensation for the resulting perturbations in the input gain matrix. A rigorous Lyapunov-based stability analysis is utilized to prove that the proposed tracking control law achieves semi-global asymptotic stability in the presence of norm-bounded external disturbances, where the region of convergence can be made arbitrarily large through judicious control gain selection. A detailed numerical comparison study is provided to demonstrate the improved disturbance-rejection capability of the proposed control law as compared to a standard non-RISE control law.
|
|
WeC10 |
Dockside 2 |
Adaptive Control I |
Regular Session |
Chair: Cenedese, Angelo | University of Padova |
Co-Chair: Kiumarsi, Bahare | Michigan State University |
|
15:30-15:45, Paper WeC10.1 | |
Data-Driven Model Predictive Control of Airfoil Flow Separation |
|
Vander Schaaf, Jacob | University of Michigan |
Lu, Qizhi | University of Michigan, Ann Arbor |
Fidkowski, Krzysztof | University of Michigan |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Adaptive control, Closed-loop identification, Predictive control for nonlinear systems
Abstract: This paper numerically investigates the performance of active flow control based on adaptive model predictive control without prior analytical modeling of any kind. Recursive least squares with variable-rate forgetting is used for online closed-loop system identification. The identified model is then used for receding-horizon optimization based on quadratic programming. This technique is applied to an airfoil model simulated using computational fluid dynamics. Two cases are considered. In the first case, the tangential component of the flow velocity is commanded at the location of a single flow-velocity sensor. In the second case, an array of flow-velocity sensors is used to estimate the location of the flow-separation point, which is used to command the flow-separation point.
|
|
15:45-16:00, Paper WeC10.2 | |
Safe Reinforcement Learning Based on Off-Policy Approach for Nonlinear Discrete-Time Systems |
|
Jha, Mayank Shekhar | University of Lorraine |
Kiumarsi, Bahare | Michigan State University |
Theilliol, Didier | Universite De Lorraine |
Keywords: Adaptive control, Learning, Neural networks
Abstract: This paper presents a control barrier function-based method for learning safe optimal controllers for discrete-time (DT) nonlinear systems such that safety, stability, and performance are guaranteed on an infinite time horizon. The paper investigates the fusion of reinforcement learning (RL) and control barrier functions (CBFs) and remains novel in that the approach is developed for DT nonlinear systems and develops off-policy safe RL approach for DT systems. Formulation of a novel generalised safety-aware Hamilton Jacobi Bellman (G-SHJB) equation is proposed to assure safety, stability and optimality during exploitation phase. The G-SHJB is solved in an iterative sense to obtain improved control policy. The invariance of safe set as well as stability and optimality of the system under learnt control law is established using mathematically rigorous proofs and studied using simulation.
|
|
16:00-16:15, Paper WeC10.3 | |
Enhancing Human Operator Performance with Long Short-Term Memory Networks in Adaptively Controlled Systems |
|
Uzun, Muhammed Yusuf | Bilkent University |
Inanc, Emirhan | Bilkent University |
Yildiz, Yildiray | Bilkent University |
Keywords: Adaptive control, Human-in-the-loop control, Neural networks
Abstract: The focus of this paper is developing a Long Short-Term Memory (LSTM) network-based control framework that works in collaboration with the human operator to enhance the overall closed-loop system performance in adaptively controlled systems. The domain of investigation is chosen to be flight control, although the proposed approach can be generalized for other domains such as automotive control. In accordance with this choice, an adaptive human pilot model is used as the mathematical representation of the pilot during the technical development of the method. An LSTM network is designed in such a way that it predicts and compensates for the inadequacies of the human operator's decisions while they fly an aircraft that has an adaptive inner loop controller. The simulation results demonstrate that the tracking performance is improved, and the pilot workload is reduced.
|
|
16:15-16:30, Paper WeC10.4 | |
Newton Bases and Event-Triggered Adaptive Control in Native Spaces |
|
Powell, Nathan | EPFL |
Kurdila, Andrew J. | Virginia Tech |
Wang, Haoran | Virginia Tech |
L'Afflitto, Andrea | Virginia Tech |
Guo, Jia | Georgia Institute of Technology |
Keywords: Adaptive control, Lyapunov methods, Nonlinear systems identification
Abstract: This paper extends recent methods of native space embedding for adaptive control by deriving event-driven controllers that modify the basis used for approximation of the functional uncertainty. Using the power function for the native space, trigger conditions are defined that determine when and how many new basis functions are introduced. Bases are selected using a greedy selection and augmentation process at each triggering event. By using a recursive Newton basis, coordinate implementations of the adaptive law do not require the inversion of the leading Grammian matrix, which yields a substantial improvement in numerical efficiency and numerical stability. This paper derives upper bounds on the ultimate tracking performance of the closed-loop control scheme in terms of known functions of the approximation dimension after each triggering event. These upper bounds are quite general and hold for bounded uncertainty classes for a variety of reproducing kernel Hilbert spaces (RKHS). The paper concludes by examining the qualitative performance of the closed-loop control system for a typical nonlinear model problem that features functional uncertainty in a native space.
|
|
16:30-16:45, Paper WeC10.5 | |
Adaptive Augmentation with Exponential Command Limiting for Aerial Vehicle Attitude Protection |
|
Sun, Donglei | University of Nottingham Ningbo China |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Keywords: Adaptive control, Constrained control, Flight control
Abstract: This paper presents an adaptive augmentation design with an exponential command limiting component for attitude protection of aerial vehicles equipped with rate control augmentation system. The baseline control law is designed using exponential potential functions for attitude protection, based on which, an adaptive augmentation is then developed to deal with disturbances and uncertainties. Stability analysis is provided for the closed-loop system to guarantee the command limiting performance in the presence of matched uncertainties. Simulation examples are presented to verify the design.
|
|
16:45-17:00, Paper WeC10.6 | |
A Natural Indirect Adaptive Controller for a Satellite-Mounted Manipulator |
|
Giordano, Jacopo | University of Padua |
Cenedese, Angelo | University of Padova |
Serrani, Andrea | The Ohio State University |
Keywords: Indirect adaptive control, Aerospace, Robotics
Abstract: The work considers the design of an indirect adaptive controller for a satellite equipped with a robotic arm manipulating an object. Uncertainty on the manipulated object can considerably impact the overall behavior of the system. In addition, the dynamics of the actuators of the base satellite are non-linear and can be affected by malfunctioning. Neglecting these two phenomena may lead to excessive control effort or degrade performance. An indirect adaptive control approach is pursued, which allows consideration of relevant features of the actuators dynamics, such as loss of effectiveness. Furthermore, an adaptive law that preserves the physical consistency of the inertial parameters of the various rigid bodies comprising the system is employed. The performance and robustness of the controller are first analyzed and then validated in simulation.
|
|
WeC11 |
Dockside 3 |
Game Theory II |
Regular Session |
Chair: Brown, Philip N. | University of Colorado Colorado Springs |
Co-Chair: Sanjari, Sina | Queen's University and Royal Military College |
|
15:30-15:45, Paper WeC11.1 | |
Learning How to Strategically Disclose Information |
|
Velicheti, Raj Kiriti | University of Illinois at Urbana Champaign |
Bastopcu, Melih | University of Illinois Urbana Champaign |
Etesami, Rasoul | University of Illinois at Urbana-Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Learning, Optimization
Abstract: Strategic information disclosure, in its simplest form, considers a game between an information provider (sender) who has access to some private information that an information receiver is interested in. While the receiver takes an action that affects the utilities of both the players, the sender can design information (or modify beliefs) of the receiver through signal commitment, hence posing a Stackelberg game. However, obtaining a Stackelberg equilibrium for this game traditionally requires the sender to have access to the receiver's objective. In this work, we consider an online version of information design where a sender interacts with a receiver of an unknown type who is adversarially chosen at each round. Restricting attention to Gaussian prior and quadratic costs for the sender and receiver, we show that O(sqrt (T)) regret is achievable with full information feedback, where T is the total number of interactions between the sender and the receiver. Further, we propose a novel parametrization that allows the sender to achieve O(sqrt(T)) regret for a general convex utility function. We then consider the Bayesian Persuasion problem with an additional cost term in the objective function, which penalizes signaling policies that are more informative and obtain O(log(T)) regret. Finally, we establish a sublinear regret bound for the partial information feedback setting and provide simulations to support our theoretical results.
|
|
15:45-16:00, Paper WeC11.2 | |
Nash Equilibrium for Multi-Player Regime Switching Stochastic Differential Games |
|
Wang, Mingrui | The Pennsylvania State University |
Chakraborty, Prakash | The Pennsylvania State University |
Keywords: Game theory, Markov processes, Stochastic optimal control
Abstract: In this study, we consider multi-player stochastic differential games with regime switching in the player dynamics modeled by a Markov chain with a finite state space. We study the associated Nash system, which comprise a system of coupled nonlinear partial differential equations. We prove the existence and uniqueness of solutions to the Nash system, thereby establishing the existence of a unique Nash equilibrium.
|
|
16:00-16:15, Paper WeC11.3 | |
Large Decentralized Continuous-Time Convex Stochastic Teams and Their Mean-Field Limits |
|
Sanjari, Sina | Queen's University and Royal Military College |
Saldi, Naci | Bilkent University |
Yuksel, Serdar | Queen's University |
Keywords: Game theory, Mean field games, Decentralized control
Abstract: We study a class of continuous-time convex stochastic exchangeable teams with a finite number of decision makers (DMs) as well as their mean-field limits with infinite numbers of DMs. We establish the existence of a globally optimal solution and show that it is Markovian and symmetric (identical) for both the finite DM regime and the infinite one. In particular, for a general class of finite-N exchangeable stochastic teams satisfying a convexity condition, we establish the existence of a globally optimal solution that is symmetric among DMs and Markovian. As the number of DMs drives to infinity (that is for the mean-field limit), we establish the existence of a possibly randomized globally optimal solution and show that it is symmetric among DMs and Markovian.
|
|
16:15-16:30, Paper WeC11.4 | |
Rationality and Connectivity in Stochastic Learning for Networked Coordination Games |
|
Zhang, Yifei | Florida State University |
Vasconcelos, Marcos M. | Florida State University |
Keywords: Game theory, Stochastic systems, Learning
Abstract: Coordination is a desirable feature in many multi-agent systems such as robotic and socioeconomic networks. We consider a task allocation problem as a binary networked coordination game over an undirected regular graph. Each agent in the graph has bounded rationality, and uses a distributed stochastic learning algorithm to update its action choice conditioned on the actions currently played by its neighbors. After establishing that our framework leads to a potential game, we analyze the regime of bounded rationality, where the agents are allowed to make sub-optimal decisions with some probability. Our analysis shows that there is a relationship between the connectivity of the network, and the rationality of the agents. In particular, we show that in some scenarios, an agent can afford to be less rational and still converge to a near optimal collective strategy, provided that its connectivity degree increases. Such phenomenon is akin to the wisdom of crowds.
|
|
16:30-16:45, Paper WeC11.5 | |
Information Design under Uncertainty for Vehicle-To-Vehicle Communication |
|
Gould, Brendan | University of Colorado Colorado Springs |
Brown, Philip N. | University of Colorado Colorado Springs |
Keywords: Game theory, Transportation networks
Abstract: The emerging technology of Vehicle-to-Vehicle (V2V) communication aims to improve road safety by allowing vehicles to share information about the world. However, information design is in general a non-trivial problem, and is only made more difficult by uncertainty about the world or agents. In this work, using an existing model of V2V communication with endogenous accident probability, we study an information designer’s optimization problem under uncertainty about the "danger level" (the sensitivity of accident probability to agent behavior). First, we consider an information designer who does not know the danger level designing for agents who do; second, an informed designer designing for uninformed agents. In both cases, we present a simple characterization of the worst-case (i.e. largest accident probability) outcome that is possible under the uncertainty. When an information designer is uncertain about the world, the worst case occurs with the largest danger level. By contrast, when agents are uninformed, the worst case is caused by agents’ beliefs being the lowest danger level. Both of these results simplify the optimization problem, allowing an optimal signaling policy to be more easily determined.
|
|
16:45-17:00, Paper WeC11.6 | |
Equilibrium Selection in Data Markets: Multiple-Principal, Multiple-Agent Problems with Non-Rivalrous Goods |
|
Wadhwa, Samir | University of Illinois at Urbana-Champaign |
Dong, Roy | University of Illinois at Urbana-Champaign |
Keywords: Game theory, Variational methods
Abstract: The advent of data-driven tools has led to the rise of data markets. These data markets are often characterized by multiple data purchasers interacting with a set of data sources. There are several aspects of data markets that distinguish them from a typical commodity market. First, data sellers have more information about the quality of data than the data purchasers. Second, data is a non-rivalrous good that can be shared with multiple parties at negligible marginal cost. Third, the value of data is coupled, and there are strong informational externalities. Formally, this gives rise to a new class of games which we call multiple-principal, multiple-agent problem with non-rivalrous goods. We show that there is a fundamental degeneracy in the market of non-rivalrous goods: specifically, for a general class of payment contracts, there will be an infinite set of generalized Nash equilibria. This multiplicity of equilibria also affects common refinements of equilibrium definitions intended to uniquely select an equilibrium: both variational equilibria and normalized equilibria will be non-unique in general. This implies that most existing equilibrium concepts cannot provide predictions on the outcomes of data markets emerging today. The results support the idea that modifications to payment contracts themselves are unlikely to yield a unique equilibrium, and either changes to the models of study or new equilibrium concepts will be required to determine unique equilibria in settings with multiple principals and a non-rivalrous good.
|
|
WeC12 |
Dockside 9 |
Predictive Control for Nonlinear Systems II |
Regular Session |
Chair: Han, Kyoungseok | Kyungpook National University |
Co-Chair: Mohammadpour Velni, Javad | Clemson University |
|
15:30-15:45, Paper WeC12.1 | |
Deep Neural Network NMPC for Computationally Tractable Optimal Power Management of Hybrid Electric Vehicles |
|
Park, Suyong | Kyungpook National University |
Nguyen, Duc Giap | Kyungpook National University |
Park, Jinrak | Hyundai Motor Company |
Kim, Dohee | Hyundai Motor Company |
Eo, Jeong Soo | Hyundai Motor Company |
Han, Kyoungseok | Kyungpook National University |
Keywords: Predictive control for nonlinear systems, Neural networks, Autonomous vehicles
Abstract: This study presents a method for deep neural network nonlinear model predictive control (DNN-MPC) with the aim of reducing computational complexity, and we show its practical utility through its application in optimizing the energy management of hybrid electric vehicles (HEVs). For optimal power management of HEVs, we first design the online NMPC to collect the data set, and the deep neural network is trained to approximate the NMPC solutions. We assess the effectiveness of our approach by conducting comparative simulations with rule and online NMPC-based power management strategies for HEV, evaluating both fuel consumption and computational complexity. Lastly, we verify the real-time feasibility of our approach through process-in-the-loop (PIL) testing. The test results demonstrate that the proposed method closely approximates the NMPC performance while substantially reducing the computational burden.
|
|
15:45-16:00, Paper WeC12.2 | |
Neural Horizon Model Predictive Control -- Increasing Computational Efficiency with Neural Networks |
|
Alsmeier, Hendrik | TU Darmstadt, CCPS Laboratroy |
Savchenko, Anton | Technical University of Darmstadt |
Findeisen, Rolf | TU Darmstadt |
Keywords: Predictive control for nonlinear systems, Neural networks, Machine learning
Abstract: The expansion in automation of increasingly fast applications and low-power edge devices poses a particular challenge for optimization based control algorithms, like model predictive control. Our proposed machine-learning supported approach addresses this by utilizing a feed-forward neural network to reduce the computation load of the online-optimization. We propose approximating part of the problem horizon, while maintaining safety guarantees – constraint satisfaction – via the remaining optimization part of the controller. The approach is validated in simulation, demonstrating an improvement in computational efficiency, while maintaining guarantees and near-optimal performance. The proposed MPC scheme can be applied to a wide range of applications, including those requiring a rapid control response, such as robotics and embedded applications with limited computational resources.
|
|
16:00-16:15, Paper WeC12.3 | |
Model Predictive Control Barrier Functions: Guaranteed Safety with Reduced Conservatism and Shortened Horizon |
|
Abdi, Hossein | The University of Manchester |
Zhao, Pan | University of Alabama |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Ghabcheloo, Reza | Tampere University |
Keywords: Predictive control for nonlinear systems, Optimization
Abstract: In this study, we address the problem of safe control in systems subject to state and input constraints by integrating the Control Barrier Function (CBF) into the Model Predictive Control (MPC) formulation. While CBF offers a conservative policy and traditional MPC lacks the safety guarantee beyond the finite horizon, the proposed scheme takes advantage of both MPC and CBF approaches to provide a guaranteed safe control policy with reduced conservatism and a shortened horizon. The proposed methodology leverages the sum-of-square (SOS) technique to construct CBFs that make forward invariant safe sets in the state space that are then used as a terminal constraint on the last predicted state. CBF invariant sets cover the state space around system fixed points. These islands of forward invariant CBF sets will be connected to each other using MPC. To do this, we proposed a technique to handle MPC optimization problem subject to the combination of intersections and union of constraints. Our approach, termed Model Predictive Control Barrier Functions (MPCBF), is validated using numerical examples to demonstrate its efficacy, showing improved performance compared to classical MPC and CBF.
|
|
16:30-16:45, Paper WeC12.5 | |
Learning-Based Safety Critical Model Predictive Control Using Stochastic Control Barrier Functions |
|
Nejatbakhsh Esfahani, Hossein | Clemson University |
Ahmadi, Sajad | Clemson University |
Mohammadpour Velni, Javad | Clemson University |
Keywords: Stochastic optimal control, Machine learning, Predictive control for nonlinear systems
Abstract: This paper presents a learning-based safety-critical Model Predictive Control (MPC) design approach based on stochastic Control Barrier Functions (CBFs). To address the safety concerns and tackle model uncertainties in both the MPC and CBF, we first propose to use a parameterized stochastic CBF in the MPC scheme. We next devise a Reinforcement Learning (RL)-based algorithm based on the proposed stochastic CBF-MPC scheme to learn the approximate version of the proposed stochastic CBF for coping with an unknown CBF model, which cannot capture the correct structure of the CBF used in the real environment. To illustrate the performance of the proposed safety-critical control approach, we examine two test cases including trajectory tracking and path planning for a wheeled mobile robot.
|
|
16:45-17:00, Paper WeC12.6 | |
A Heuristic for Dynamic Output Predictive Control Design for Uncertain Nonlinear Systems |
|
Alamir, Mazen | CNRS / University of Grenoble |
Keywords: Uncertain systems, Machine learning, Predictive control for nonlinear systems
Abstract: In this paper, a simple heuristic is proposed for the design of uncertainty aware predictive controllers for nonlinear models involving uncertain parameters. The method relies on Machine Learning-based approximation of ideal deterministic MPC solutions with perfectly known parameters. An efficient construction of the learning data set from these off-line solutions is proposed in which each solution provides many samples in the learning data. This enables a drastic reduction of the required number of Non Linear Programming problems to be solved off- line while explicitly exploiting the statistics of the parameters dispersion. The learning data is then used to design a fast on-line output dynamic feedback that explicitly incorporate information of the statistics of the parameters dispersion. An example is provided to illustrate the efficiency and the relevance of the proposed framework. It is in particular shown that the proposed solution recovers up to 78% of the expected advantage of having a perfect knowledge of the parameters compared to nominal design.
|
|
WeC13 |
Richmond |
Constrained Control II |
Regular Session |
Chair: Nicotra, Marco M | University of Colorado Boulder |
Co-Chair: Richards, Christopher | University of Louisville |
|
15:30-15:45, Paper WeC13.1 | |
A Constrained Tracking Controller for Ramp and Sinusoidal Reference Signals Using Robust Positive Invariance |
|
Franca dos Santos, Geovana | Concordia University |
Castelan, Eugenio B. | Univ. Federal De Santa Catarina |
Lucia, Walter | Concordia University |
Keywords: Constrained control, PID control
Abstract: This paper proposes an output feedback controller capable of ensuring steady-state offset-free tracking for ramp and sinusoidal reference signals while ensuring local stability and state and input constraints fulfillment. The proposed solution is derived by jointly exploiting the internal model principle, polyhedral robust positively invariant arguments, and the Extended Farkas' Lemma. In particular, by considering a generic class of output feedback controller equipped with a feedforward term, a proportional effect, and a double integrator, we offline design the controller's gains by means of a single bilinear optimization problem. A peculiar feature of the proposed design is that the sets of all the admissible reference signals and the plant's initial conditions are also offline determined. Simulation results are provided to testify to the effectiveness of the proposed tracking controller and its capability to deal with both state and input constraints.
|
|
15:45-16:00, Paper WeC13.2 | |
Control Barrier Function for Linearizable Systems with High Relative Degrees from Signal Temporal Logics: A Reference Governor Approach |
|
Liang, Kaier | Lehigh University |
Cai, Mingyu | Lehigh University |
Vasile, Cristian Ioan | Lehigh University |
Keywords: Constrained control, Predictive control for linear systems, Model/Controller reduction
Abstract: This paper considers the safety-critical navigation problem with Signal Temporal Logic (STL) tasks. We developed an explicit reference governor-guided control barrier function (ERG-guided CBF) method that enables the application of first-order CBFs to high-order linearizable systems. This method significantly reduces the conservativeness of the existing CBF approaches for high-order systems. Furthermore, our framework provides safety-critical guarantees in the sense of obstacle avoidance by constructing the margin of safety and updating direction of safe evolution in the agent's state space. To improve control performance and enhance STL satisfaction, we employ efficient gradient-based methods for iteratively learning optimal parameters of ERG-guided CBF. We validate the algorithm through both high-order linear and nonlinear systems. A video demonstration can be found on: url{https://youtu.be/ZRmsA2FeFR4}
|
|
16:00-16:15, Paper WeC13.3 | |
A Terminal Set Feasibility Governor for Nonlinear Model Predictive Control |
|
Skibik, Terrence | University of Colorado Boulder |
Nicotra, Marco M | University of Colorado Boulder |
Keywords: Constrained control, Predictive control for nonlinear systems
Abstract: The terminal set feasibility governor (FG) is a model predictive control (MPC) add-on unit that guarantees the feasibility of the underlying optimal control problem by manipulating its target reference. Since the FG requires little information from the closed-loop system, it can be attached to any MPC law that tracks a piecewise constant reference. This paper extends the FG from linear to nonlinear systems as long as the nonlinear MPC features an invariant terminal set. The closed-loop FG+MPC satisfies constraints, is asymptotically stable, and exhibits finite-time convergence of the reference. Moreover, numerical simulations show that adding the FG significantly reduces the computational burden of the MPC for only a minor cost in performance.
|
|
16:15-16:30, Paper WeC13.4 | |
Safe Motion Planning for Serial-Chain Robotic Manipulators Via Invariant Sets |
|
Brandt, Teo | University of New Mexico |
Fierro, Rafael | University of New Mexico |
Danielson, Claus | University of New Mexico |
Keywords: Constrained control, Robotics, PID control
Abstract: Ongoing research is focused on developing autonomous motion-planning algorithms capable of addressing nonlinear robot manipulator dynamics and nonconvex collision avoidance constraints. This paper extends the application of the invariant-set motion planner (ISMP) to robot motion planning. We broaden the proof of output admissibility for configuration-space bubbles, accommodating robots with both prismatic and revolute joints. We derive a constraint admissible positive invariant (CAPI) subset within the configuration-space bubble for closed-loop system dynamics, integrating proportional-derivative joint controllers. Furthermore, we outline conditions for CAPI sets to be input admissible. Utilizing random graphsearch techniques, we identify a sequence of CAPI sets to guide the robot from an initial configuration to a goal equilibrium state while avoiding collisions. We illustrate the effectiveness and feasibility of the ISMP through a simulation involving the Universal Robots UR5e mounted on an actuated rail, modeled as a prismatic joint. Simulation results validate the ISMP for robot motion planning.
|
|
16:30-16:45, Paper WeC13.5 | |
Anti-Windup Compensator Design for Guidance and Control of Quadrotors |
|
Shahbazzadeh, Majid | University of Louisville |
Richards, Christopher | University of Louisville |
Keywords: Constrained control, Stability of nonlinear systems, Aerospace
Abstract: In this paper, the problem of anti-windup compensator (AWC) design for guidance and control of quadrotors in an unknown environment is addressed. Quadrotors can be affected by disturbances (such as wind), which potentially result in saturation of the propellers. When saturation occurs, the flight can become unstable, leading to a crash. On the other hand, designing an AWC to mitigate the saturation effects in the control system of a quadrotor can be a challenging task due to the heavy couplings and complex nonlinear dynamics. For this reason, we propose a new structure to design an AWC-based control system to solve this problem. Simulation results are presented in three cases: 1-without saturation, 2-with saturation - without AWC, 3-with saturation - with AWC. The effectiveness of the proposed theoretical results are verified by comparisons.
|
|
16:45-17:00, Paper WeC13.6 | |
Modelling a Broad Class of Actuator Saturations Using Takagi-Sugeno Models with a Reduced Number of Local Models |
|
Bainier, Gustave | Université De Lorraine |
Marx, Benoit | University of Lorraine |
Ponsart, Jean-Christophe | Université De Lorraine |
Keywords: Constrained control, Stability of nonlinear systems, LMIs
Abstract: In this paper, the Takagi–Sugeno (T-S) models are suggested as modelling tools to represent both the behavior of a nonlinear system and its saturated actuators. Given a T-S system with r local models, previous works on input saturation usually required 2^{n_u}r or 3^{n_u}r local models to represent an actuator saturated outside of an orthotope (n_u stands for the input vector dimension). In this paper, an elementary representation is suggested, which only takes 2r local models (thus, independently of n_u), and is able to capture a much broader class of actuator saturations. Local stabilization conditions expressed as linear matrix inequalities (LMI) are provided using the conventional static parallel distributed compensation (PDC) state feedback scheme. A heuristic solution is given in order to ensure a large guaranteed domain of attraction. Numerical examples are given in order to demonstrate the proposed approach and initiate a discussion on its contributions and limitations.
|
|
WeC14 |
Wellington |
Advanced Control for Safe Process Operations |
Invited Session |
Chair: Durand, Helen | Wayne State University |
Co-Chair: Tian, Yuhe | West Virginia University |
Organizer: Tian, Yuhe | West Virginia University |
Organizer: Durand, Helen | Wayne State University |
|
15:30-15:45, Paper WeC14.1 | |
Dynamic Risk-Based Model Predictive Quality Control with Online Model Updating (I) |
|
Braniff, Austin | West Virginia University |
Tian, Yuhe | West Virginia University |
Keywords: Chemical process control, Control applications, Machine learning
Abstract: In this work, we have introduced a multi-parametric control and optimization approach to integrate the decision making of optimal control, end-point quality control, and process safety management in batch reactor processes. The central research question lies in how to optimize batch operation over multiple time scales while proactively reducing process failures. The proposed approach features: (i) Statistical dynamic risk modeling for online process safety monitoring accounting for fault probability and severity, (ii) Short-term risk-aware model predictive controller for disturbance rejection and set- point tracking, (iii) Long-term control-aware safety and quality optimizer to oversee the entire batch operation for safe and optimal operation, (iv) Online model updating using recurrent neural network to address parameter uncertainty. The decisions of the short-term controller and long-term optimizer are fully integrated via multi-parametric programming with explicit optimal solutions generated offline a priori, which serves as a key advantage to design fit-for-purpose risk control strategies and to ensure computational efficiency for multi-scale dynamic optimization. The potential and efficacy of the approach is demonstrated on a real-world safety-critical batch reactor from T2 Laboratories Inc. to maintain safe and optimal operations while ensuring the desired quality of the end-product.
|
|
15:45-16:00, Paper WeC14.2 | |
Synthesis of Data-Driven Nonlinear State Observers Using Lipschitz-Bounded Neural Networks (I) |
|
Tang, Wentao | NC State University |
Keywords: Observers for nonlinear systems, Neural networks, Process Control
Abstract: This paper focuses on the model-free synthesis of state observers for nonlinear autonomous systems without knowing the governing equations. Specifically, the Kazantzis-Kravaris/Luenberger (KKL) observer structure is leveraged, where the outputs are fed into a linear time-invariant (LTI) system to obtain the observer states, which can be viewed as the states nonlinearly transformed by an immersion mapping, and a neural network is used to approximate the inverse of the nonlinear immersion and estimate the states. In view of the possible existence of noises in output measurements, this work proposes to impose an upper bound on the Lipschitz constant of the neural network for robust and safe observation. A relation that bounds the generalization loss of state observation according to the Lipschitz constant, as well as the H_2-norm of the LTI part in the KKL observer, is established, thus reducing the model-free observer synthesis problem to that of Lipschitz-bounded neural network training, for which a direct parameterization technique is used. The proposed approach is demonstrated on a chaotic Lorenz system.
|
|
16:00-16:15, Paper WeC14.3 | |
Bootstrapped Gross Error Detection for Efficient and Fault-Tolerant Real-Time Optimization (I) |
|
Patron, Gabriel David | University of Waterloo |
Ricardez-Sandoval, Luis | University of Waterloo |
Keywords: Fault tolerant systems, Chemical process control, Estimation
Abstract: Real-time optimization (RTO) is a model-based approach for generating economically optimal steady-state process set points. The process model used in RTO requires reconciliation with the plant through parameter estimation, which uses online measurements. In the presence of sensor faults causing measurement bias, the estimation layer can result in suboptimal set points that may violate safety, environmental, or operational constraints. Herein, a gross error detection approach is proposed to determine measurement sets that exclude faults, thus avoiding estimation errors propagating to the set points and ensuring safe operation. This is achieved by computing parameter estimate samples offline using varying measurement combinations and bootstrapping available plant data. The resulting parameter estimates are subjected to single-sample t-tests to determine which estimates are significantly different; these correspond to the measurement that have the highest probability of being faulty. The computational complexity of the algorithm is discussed, whereby it is shown to be related to the observability criteria and number of measurements. A continuously stirred tank reactor with an upper bound on heat generation is used to exemplify the proposed approach in a process safety setting. The incidence of constraint-violating operation is observed to decrease in both frequency and severity when using the proposed framework; thus, the resulting set points are economical while ensuring safe heating limits are respected during operation.
|
|
16:15-16:30, Paper WeC14.4 | |
A Set-Based Control Mode Selection Approach for Active Detection of False Data Injection Cyberattacks (I) |
|
Narasimhan, Shilpa | University of California, Davis |
El-Farra, Nael H. | University of California, Davis |
Ellis, Matthew | University of California, Davis |
Keywords: Process Control, Chemical process control, Fault detection
Abstract: In the last two decades, several highly sophisticated cyberattacks have targeted process control systems (PCSs) that operate chemical processes. To enhance PCS cybersecurity, cyberattack detection schemes utilizing operational data to reveal the presence of attacks on PCSs have received extensive attention. Stealthy attacks are designed to evade detection by an operational technology-based detection scheme. Their detection may require an active detection method, which perturbs the process by utilizing an external intervention for attack detection. In this work, two control modes that may be used to induce perturbations for active attack detection of stealthy false-data injection cyberattacks are presented. A reachability analysis is used to develop a set-based condition indicating that if met by a specific stealthy attack, the attack will be detected and therefore, the control mode is considered to be ``attack-revealing''. Leveraging the condition, a screening algorithm that may be used to select an attack-revealing control mode is presented. Using an illustrative process, the application of the screening algorithm is demonstrated.
|
|
16:30-16:45, Paper WeC14.5 | |
Lyapunov-Based Model Predictive Control Using Operable Adaptive Sparse Identification of Systems (OASIS) (I) |
|
Bhadriraju, Bhavana | Texas A&M University |
Kwon, Joseph | Texas A&M University |
Khan, Faisal | Memorial University of Newfoundland |
Keywords: Adaptive systems, Computer-aided control design, Computational methods
Abstract: Effectively managing process uncertainties, which stem from factors such as disturbances, throughput fluctuations, and time-varying parameters, poses a significant challenge in the realm of model predictive control (MPC). These uncertainties often lead to discrepancies between the plant and the model, diminishing the overall effectiveness of the control strategy. To address this issue, real-time model updating becomes crucial, allowing for the continuous adaptation of the model to dynamic changes and ensuring an accurate, real-time representation of the process. As a result, adaptive modeling emerges as a valuable tool in controller design. Within this context, this work explores the use of an adaptive modeling algorithm called operable adaptive sparse identification of systems (OASIS) in designing a Lyapunov-based MPC to ensure system stability while achieving the desired control objectives. Theoretical guarantees regarding the stabilizability and recursive feasibility of the resulting OASIS-based LMPC are also analyzed. The application of the developed control framework is demonstrated using a nonlinear chemical reactor example.
|
|
WeC15 |
Yonge |
Estimation and Control of Distributed Parameter Systems II |
Invited Session |
Chair: Hu, Weiwei | University of Georgia |
Co-Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
15:30-15:45, Paper WeC15.1 | |
Finite-Time Boundary Stabilization for LWR Traffic Flow Model (I) |
|
Zhao, Hanxu | Beijing University of Technology |
Zhan, Jingyuan | Beijing University of Technology |
Zhang, Liguo | Beijing University of Technology |
Keywords: Distributed parameter systems, Traffic control, Lyapunov methods
Abstract: Finite-time stability means that the trajectory of dynamical system converges to a Lyapunov stable equilibrium state in finite time. The finite time stabilization of traffic flow model can improve the transportation efficiency and reduce the occurrence of traffic congestion. In this paper, the finite-time boundary stabilization problem for the Lighthill-Whitham-Richards (LWR) traffic flow model is considered, in which a variable speed limit (VSL) device is applied at the downstream boundary and a finite-time boundary controller is designed to drive the traffic density to the steady state in finite time. By utilizing the method of characteristics to illustrate the property of finite-time convergence and the Lyapunov function approach to prove stability, the local finite-time stability of LWR traffic flow system is ensured in H^2-norm. Moreover, the settling-time can be given depended on the initial value and the parameters of finite-time boundary controller. Lastly, a numerical simulation example is presented to verify the effectiveness of the theoretical results.
|
|
15:45-16:00, Paper WeC15.2 | |
Adaptive and Optimal Spatial PD Coupling in Synchronization Control of Networked Second-Order Infinite Dimensional Systems (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems
Abstract: The consensus protocol employed for the synchronization control of networked systems uses the pairwise difference of the system states or their outputs as a means to minimize the mismatch between the networked states. A penalization of the derivative of the pairwise difference in the consensus protocol enhances the synchronization by adding an anticipatory action. Extending to networked partial differential equations, the derivative component takes a new interpretation as the spatial derivative of the pairwise differences can minimize the spatial gradient of the pairwise differences and thus improve the pointwise in space synchronization. The spatial proportional and derivative (PD) coupling in the synchronization control of partial differential equations is extended to second order PDEs representing many structural and mechanical systems. The optimization of the spatial proportional and derivative coupling is extended and applied to a class of second order systems with position and velocity measurements. When system parameters are not available, then the adaptation of the spatial proportional and derivative gains is also extended to second order infinite dimensional systems without explicitly imposing a positive realness condition. Simulation studies showcasing both the optimal and the adaptive spatial PD gains are included.
|
|
16:00-16:15, Paper WeC15.3 | |
Controllability and Optimal Control of Water Networks – a Comparison of Three Lumped Models (I) |
|
Baumann, Henry | Karlsruhe Institute of Technology |
Schaum, Alexander | University of Hohenheim |
Meurer, Thomas | Karlruhe Institute of Technology |
Keywords: Transportation networks, Distributed parameter systems, Reduced order modeling
Abstract: When large scale systems or transport networks, which are accurately modelled by partial differential equations, should be steered using optimization-based controllers, reduced order models are typically utilized to describe the plant dynamics in a fast but sufficiently accurate manner. Three different approximative models for water systems are recalled and used for the lumped description of an exemplary network for which, in addition, a structural controllability analysis is carried out. For the approximate models optimal control problems are solved and the results are compared for both, open and closed loop scenarios to evaluate accuracy, calculation time and parametrization effort of the reduced models. The results are visualized on the basis of numerical experiments for a test scenario.
|
|
16:15-16:30, Paper WeC15.4 | |
Neural Operator Approximations of Backstepping Kernels for 2x2 Hyperbolic PDEs (I) |
|
Wang, Shanshan | University of Shanghai for Science and Technology |
Diagne, Mamadou | University of California San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Distributed parameter systems, Machine learning, Fluid flow systems
Abstract: Deep neural network approximation of nonlinear operators, commonly referred to as DeepONet, has so far proven capable of approximating PDE backstepping designs in which a single Goursat-form PDE governs a single feedback gain function. In boundary control of coupled PDEs, coupled Goursat-form PDEs govern two or more gain kernels - a PDE structure unaddressed thus far with DeepONet. In this note we open the subject of approximating systems of gain kernel PDEs for hyperbolic PDE plants by considering a simple counter-convecting 2times 2 coupled system in whose control a 2times 2 Goursat form kernel PDE system arises. Such a coupled kernel PDE problem arises in several canonical 2times 2 hyperbolic PDE problems: oil drilling, Saint-Venant model of shallow water waves, and Aw-Rascle model of stop-and-go instability in congested traffic flow. In this paper we establish the continuity of the mapping from (a total of five) plant PDE functional coefficients to the kernel PDE solutions, prove the existence of an arbitrarily close DeepONet approximation to the kernel PDEs, and establish that the DeepONet-approximated gains guarantee global exponential stability of the equilibrium when replacing the exact backstepping gain kernels. The DeepONet operator speeds the computation of the controller gains by multiple orders of magnitude and its theoretically proven stabilizing capability is illustrated by simulations.
|
|
16:30-16:45, Paper WeC15.5 | |
Modeling and Detection of Cyber-Attacks on Highway Networks Using a 2D-LWR Model and Gaussian Processes (I) |
|
Kashyap, Abhishek | University of Texas at Arlington |
Chakravarthy, Animesh | University of Texas at Arlington |
Menon, Prathyush P | Faculty of Environment, Science and Economy |
Keywords: Distributed parameter systems, Transportation networks
Abstract: This paper considers a class of cyber-attacks that can occur over a two-dimensional road network. Our focus is on scenarios wherein an attacker may hack into a subset of vehicles driving on the road network and create subtle changes in their driving parameters. These hacked vehicles (referred to as malicious vehicles) are subsequently able to modify the driving behavior of the overall vehicle network. The traffic system comprising the mix of malicious and normal vehicles is modeled using a system of coupled Partial Differential Equations (PDEs) in a two-dimensional LWR model. We develop a methodology that combines Gaussian Processes (GP) with this two-species 2D PDE model, and use this method for detecting the presence of such malicious vehicles in the road network. A Bayesian Optimization scheme is employed to determine the optimal choice of basis and kernel functions that constitute the GP. Simulation results demonstrate that this detection architecture performs successful detection of the malicious vehicles, and also their mode of attack on the traffic.
|
|
16:45-17:00, Paper WeC15.6 | |
Distributed Biconnecitvity Achievement and Preservation in Multi-Agent Systems |
|
Restrepo, Esteban | CNRS, INRIA Rennes – Bretagne Atlantique |
Robuffo Giordano, Paolo | Centre National De La Recherche Scientifique (CNRS) |
Keywords: Distributed control, Constrained control, Networked control systems
Abstract: We propose a distributed control law to increase the robustness of a multi-agent network to node failure or removal. More precisely, our approach is able to maintain biconnectivity of an initially biconnected graph. Remarkably, if the graph is not initially biconnected or if the property is lost after a node removal or failure, our approach is also able to render the graph biconnected in a finite time which can be tuned by the user. The proposed control algorithm is completely distributed using only local available information and requires the estimation of a single global parameter akin to existing connectivity-maintenance algorithms in the literature. Numerical simulations illustrate the effectiveness of our approach.
|
|
WeC16 |
Dockside 4 |
Smart Grid |
Regular Session |
Chair: Barooah, Prabir | Indian Institute of Technology, Guwahati |
Co-Chair: Caiazzo, Bianca | University of Naples Federico II |
|
15:30-15:45, Paper WeC16.1 | |
Resilient Decentralized Control of Power Buffers in DC Microgrids |
|
Qian, Yangyang | University of Virginia |
Zhou, Siyu | University of Texas at Arlington |
Lin, Zongli | University of Virginia |
Wan, Yan | University of Texas at Arlington |
Shamash, Yacov | SUNY |
Keywords: Smart grid, Decentralized control, Power electronics
Abstract: This paper investigates resilient decentralized control of power buffers within active loads in a DC microgrid subject to false data injection (FDI) attacks. The attack signals are imposed on local controllers and are assumed to be time-varying and bounded. For each power buffer, a resilient decentralized controller is proposed based on its local information. Differently from the conventional decentralized controller, the proposed resilient decentralized controller incorporates an additional nonlinear feedback term. This nonlinear feedback term is constructed by using the boundary layer method and the adaptive control technique such that the effect of the associated attack signal is suppressed without requiring the knowledge of its upper bound. It is shown that the states of the resulting closed-loop system are uniformly ultimately bounded. Simulation results demonstrate that the proposed controller enhances resilience against FDI attacks when compared to the conventional decentralized controller.
|
|
15:45-16:00, Paper WeC16.2 | |
Prescribed-Time Consensus Control for the Voltage Restoration in Inverter-Based Islanded Microgrids |
|
Caiazzo, Bianca | University of Naples Federico II |
Lui, Dario Giuseppe | University of Naples Federico II |
Petrillo, Alberto | University of Naples Federico II |
Leccese, Sara | University of Naples Federico II |
Santini, Stefania | Univ. Di Napoli Federico II |
Andreotti, Amedeo | University of Naples Federico II |
Keywords: Smart grid, Energy systems, Distributed control
Abstract: Unlike Finite-Time (FT) and Fixed-Time (FxT) control strategies, Prescribed-Time (PT) control is able to alleviate the issue about the overestimation of the settling time, as well as the achievement of the system stability within a prescribed user-assignable settling time. Besides these benefits, no distributed PT solutions have been already proposed for distributed voltage secondary control in islanded Microgrid (MG). Therefore, the aim of this article is to fill this gap by introducing a novel distributed PT voltage controller in islanded MG able to guarantee that each distributed generation tracks the desired voltage value within a fixed time interval, which can be prescribed arbitrarily in advance and does not depend on initial conditions and control parameters. These features are crucial in such realistic applications due to the highly-varying operating conditions requiring the timely recovery of nominal voltage value. The tuning of the control gains is properly derived by leveraging Lyapunov theory, which allows analytically proving the PT stability of the entire MG after each transient phase. An extensive numerical analysis carried out on the standard IEEE 14-bus test system confirms the effectiveness of the theoretical derivations.
|
|
16:00-16:15, Paper WeC16.3 | |
Competitive Equilibrium in Microgrids with Dynamic Loads |
|
Salehi, Zeinab | The Australian National University |
Chen, Yijun | University of Sydney |
Petersen, Ian R. | Australian National University |
Ratnam, Elizabeth | The Australian National University |
Shi, Guodong | The University of Sydney |
Keywords: Smart grid, Optimal control, Agents-based systems
Abstract: In this paper, we consider microgrids that interconnect prosumers with distributed energy resources and dynamic loads. Prosumers are connected through the microgrid to trade energy and gain profit while respecting the network constraints. We establish a local energy market by defining a competitive equilibrium which balances energy and satisfies voltage constraints within the microgrid for all time. Using duality theory, we prove that under some convexity assumptions, a competitive equilibrium is equivalent to a social welfare maximization solution. Additionally, we show that a competitive equilibrium is equivalent to a Nash equilibrium of a standard game. In general, the energy price for each prosumer is different, leading to the concept of locational prices. We investigate a case under which all prosumers have the same locational prices. Additionally, we show that under some assumptions on the resource supply and network topology, locational prices decay to zero after a period of time, implying the available supply will be more than the demand required to stabilize the system. Finally, two numerical examples are provided to validate the results, one of which is a direct application of our results on electric vehicle charging control.
|
|
16:15-16:30, Paper WeC16.4 | |
Comments on Characterizing Demand Flexibility to Provide Power Grid Services |
|
Barooah, Prabir | Indian Institute of Technology, Guwahati |
Keywords: Smart grid, Power systems, Emerging control applications
Abstract: Many loads have flexibility in demand that can be used to provide ancillary services to power grids. A large body of literature exists on designing algorithms to coordinate actions of many loads to provide such a service. The topic of characterizing the flexibility of one or a collection of loads - to determine what kinds of demand deviation from the baseline is feasible - has also been studied. However, there is a large diversity in definitions of flexibility and methods proposed to characterize flexibility. As a result, there are several gaps in the literature on flexibility characterization. Some approaches on flexibility characterization are based on ad-hoc approximations that lead to highly conservative estimates. In this paper we point out some of these issues and their implications, with the hope to encourage additional research to address them.
|
|
16:30-16:45, Paper WeC16.5 | |
Robust Microgrid Energy Management System through a Scenario Approach |
|
Del Duca, Alessandro | Politecnico Di Milano |
Ruiz, Fredy | Politecnico Di Milano |
Scattolini, Riccardo | Politecnico Di Milano |
Keywords: Smart grid, Randomized algorithms, Hierarchical control
Abstract: The growing adoption of distributed Renewable Energy Sources and Battery Energy Storage in micro-grids requires robust Energy Management Systems (EMS) to handle power generation and consumption uncertainties. In this paper, we propose a hierarchical EMS architecture comprised of two controllers operating at different time scales: a supervisory Model Predictive Controller which optimises the micro-grid costs, and a low-level Model Predictive Controller which manages micro-grid uncertainties ensuring smooth operations. The proposed EMS decouples economic objectives from robustness concerns reducing operation costs, grid intervention, and operational constraint violations. The computational complexity is kept low by relying on a data-driven scenario approach to solve the resulting stochastic optimization problem. The architecture is tested using data from a Japanese micro-grid in Tsukuba to prove its effectiveness during daily operations.
|
|
WeC17 |
Dockside 5 |
Distributed Control I |
Regular Session |
Chair: Sarsilmaz, Selahattin Burak | Utah State University |
Co-Chair: Jensen, Emily | University of California, Berkeley |
|
15:30-15:45, Paper WeC17.1 | |
Trade-Off between Privacy and Accuracy in Resilient Vector Consensus |
|
Liu, Bing | Zhejiang University |
Zhao, Chengcheng | Zhejiang University |
Keywords: Distributed control, Communication networks
Abstract: This paper studies privacy-preserving resilient vector consensus in multi-agent systems where the number of tolerant faulty agents is limited. The objective is to guarantee that state vectors of all normal agents converge to a common state vector in the convex hull formed by their initial state vectors privately. To address this issue, we consider a modification of an existing algorithm called Approximate Distributed Robust Convergence Using Centerpoints (ADRC), named Differentially Private ADRC (DP-ADRC), where each agent adds noise to its state at each iteration. We first show the convergence and ε-differential privacy of DP-ADRC. Then, we characterize the change of the convex hull caused by the noise from two perspectives, i.e., the distance between the convex hulls with and without noises and the area change of the convex hull under a special case, respectively. Specifically, we use Chebyshev’s inequality to characterize the upper bound of the Hausdorff distance between the two convex hulls given a probability, where only partial dimensions of each state are added with noises. To characterize the area change, we consider adding noise to protect initial states during the first iteration and obtain the probability density function of the area difference between two convex hulls in the two-dimensional case. Finally, we perform numerical simulations to demonstrate the theoretical results
|
|
15:45-16:00, Paper WeC17.2 | |
Scalable Reinforcement Learning for Linear-Quadratic Control of Networks |
|
Olsson, Johan | Lund University |
Zhang, Runyu | Harvard University |
Tegling, Emma | Lund University |
Li, Na | Harvard University |
Keywords: Distributed control, Network analysis and control
Abstract: Distributed optimal control is known to be challenging and can become intractable even for linear-quadratic regulator problems. In this work, we study a special class of such problems where distributed state feedback controllers can give near-optimal performance. More specifically, we consider networked linear-quadratic controllers with decoupled costs and spatially exponentially decaying dynamics. We aim to exploit the structure in the problem to design a scalable reinforcement learning algorithm for learning a distributed controller. Recent work has shown that the optimal controller can be well approximated only using information from a kappa-neighborhood of each agent. Motivated by these results, we show that similar results hold for the agents' individual value and Q-functions. We continue by designing an algorithm, based on the actor-critic framework, to learn distributed controllers only using local information. Specifically, the Q-function is estimated by modifying the Least Squares Temporal Difference for Q-functions method to only use local information. The algorithm then updates the policy using gradient descent. Finally, we evaluate the algorithm through simulations that indeed suggest near-optimal performance.
|
|
16:00-16:15, Paper WeC17.3 | |
Joint Design of Estimation and Control for Multi-Agent Systems with Bearing Measurements |
|
Fang, Xu | KTH Royal Institute of Technology |
Li, Xiaolei | Yanshan University |
Xie, Lihua | Nanyang Tech. Univ |
Keywords: Distributed control, Cooperative control, Estimation
Abstract: In this paper, dynamic position estimation and formation control problems are considered for second-order multi-agent systems with bearing measurements. Different from the existing approaches that focus on static network localization and formation, this paper deals with this issue for a dynamic network. The key novelty of this paper is that the proposed method does not need to guarantee localization conditions of the followers at all times in dynamic second-order multi-agent systems. Specifically, a finite-time controller is first designed for the leader agents such that the expected behavior of the dynamic formation can be realized. For the follower agents, thanks to the designed two motion modes, i.e. maintaining mode and maneuvering mode, the localizability condition is not needed at all times for the position estimator and controller design. Finally, the simulation results verified the effectiveness of the proposed dynamic formation control framework.
|
|
16:15-16:30, Paper WeC17.4 | |
A Fully Distributed, Air-Ground Coordinated Coverage Control for Multi-Robot Systems with Limited Sensing Range |
|
Zhang, Hang | Zhejiang University |
Zheng, Ronghao | Zhejiang University, ZJU |
Zhang, Senlin | Zhejiang University |
Liu, Meiqin | Zhejiang University |
Keywords: Distributed control, Cooperative control, Multivehicle systems
Abstract: This paper presents a fully distributed coverage control scheme to address the challenge of limited sensing range in multi-robot systems. Unlike most existing methods that employ homogeneous robot teams, which exhibit sensitivity to sensing ranges in their coverage performance, our proposed method uses heterogeneous robot teams. We use aerial robots equipped with low-resolution but high-range sensors to gain rough features of the entire area first. Then aerial robots communicate with their neighbors, using local information to guide ground robots equipped with low-range but high-resolution sensors, thereby achieving local, fine-grained coverage. Moreover, each aerial robot dynamically adjusts its domains based on the final number of ground robots under its dominance, optimizing the coverage performance of each ground robot. The convergence of the method is proved and its performance is evaluated through simulations.
|
|
16:30-16:45, Paper WeC17.5 | |
Cooperative Output Regulation with Disturbance Decoupling |
|
Sarsilmaz, Selahattin Burak | Utah State University |
Gul, Kursad Metehan | Utah State University |
Acikmese, Behcet | University of Washington |
Keywords: Distributed control, Cooperative control, Output regulation
Abstract: This paper studies cooperative output regulation of linear multi-agent systems subject to external unmodeled disturbances. The objective is to solve the cooperative output regulation problem (CORP) while keeping tracking errors insensitive to all these disturbances. Given our objective and the related literature, we refer to this problem as the cooperative output regulation problem with disturbance decoupling (CORPDD). We present necessary and sufficient conditions for the solvability of the CORPDD by the distributed control law proposed in [1] to solve the CORP. The connections between the CORP and CORPDD, along with the spectral abscissa minimization of the closed-loop system matrix under the constraints of the CORPDD, are also provided.
|
|
WeC18 |
Dockside 6 |
Stability of Nonlinear Systems I |
Regular Session |
Chair: Liu, Xinzhi | University of Waterloo |
Co-Chair: Umathe, Bhagyashree | Clemson University |
|
15:30-15:45, Paper WeC18.1 | |
Global Exponential Stability or Contraction of an Unforced System Do Not Imply Entrainment to Periodic Inputs |
|
Duvall, Alon | Northeastern University |
Sontag, Eduardo | Northeastern University |
Keywords: Stability of nonlinear systems, Lyapunov methods
Abstract: It is often of interest to know which systems will approach a periodic trajectory when given a periodic input. Results are available for certain classes of systems, such as contracting systems, showing that they always entrain to periodic inputs. In contrast to this, we demonstrate that there exist systems which are globally exponentially stable yet do not entrain to a periodic input. This could be seen as surprising, as it is known that globally exponentially stable systems are in fact contracting with respect to some Riemannian metric. The paper also addresses the broader issue of entrainment when an input is added to a contractive system.
|
|
15:45-16:00, Paper WeC18.2 | |
Adaptive Meshes and Contraction Condition Certification for Nonlinear Control Synthesis Using Machine Learning |
|
Wei, Lai | UNSW |
McCloy, Ryan Josef | UNSW Sydney |
Bao, Jie | The University of New South Wales |
Keywords: Stability of nonlinear systems, Neural networks, Chemical process control
Abstract: In response to the continuously changing feedstock supply, time-varying utility (e.g., energy) costs and product market demand, the process operation strategy, modern chemical plants (which are typically nonlinear) need to be operated to achieve time-varying operation targets (e.g., production rates and specifications) to improve the process economy. As such the control system should be able to track time-varying references, i.e., flexible manufacturing. By leveraging the differential dynamics, contraction theory provides a tool for control design to deliver reference-independent stability (incremental stability). To address the difficulties in contraction-based control synthesis, a machine learning approach was proposed to determine the contraction metric and controller. However, this approach often requires a very large data set for neural network training which can be very computationally complex. In this article, we extend the above approach with an adaptive mesh to reduce the required size of training data sets and thus improve the efficiency of neural network training. The main contribution of this work is the development of the contraction condition certification ensuring that the contraction metric and corresponding controller, which are learned from limited number of data points, are valid for the entire state and input space.
|
|
16:00-16:15, Paper WeC18.3 | |
Spectral Koopman Method for Identifying Stability Boundary |
|
Umathe, Bhagyashree | Clemson University |
Vaidya, Umesh | Clemson University |
Keywords: Stability of nonlinear systems, Optimization
Abstract: The paper is about characterizing the stability boundary of an autonomous dynamical system using the Koopman spectrum. For a dynamical system with an asymptotically stable equilibrium point, the domain of attraction constitutes a region consisting of all initial conditions attracted to the equilibrium point. The stability boundary is a separatrix region that separates the domain of attraction from the rest of the state space. For a large class of dynamical systems, this stability boundary consists of the union of stable manifolds of all the unstable equilibrium points on the stability boundary. We characterize the stable manifold in terms of the zero-level curve of the Koopman eigenfunction. A path-integral formula is proposed to compute the Koopman eigenfunction for a saddle-type equilibrium point on the stability boundary. The algorithm for identifying stability boundary based on the Koopman eigenfunction is attractive as it does not involve explicit knowledge of system dynamics. We present simulation results to verify the main results of the paper.
|
|
16:15-16:30, Paper WeC18.4 | |
A Neural-Lyapunov-Based Adaptive Resilient Cruise Control of Platoons Subject to Cyber-Attacks on Leaders |
|
Khoshnevisan, Ladan | University of Waterloo |
Liu, Xinzhi | University of Waterloo |
Keywords: Stability of nonlinear systems, Robust adaptive control, Lyapunov methods
Abstract: In the realm of Intelligent Transportation Systems (ITSs), ensuring the safety and stability of connected automated vehicles (CAVs) is of paramount importance due to their susceptibility to vulnerabilities in interactions. The potential for system-wide disruption stemming from a cyber-attack on the leader underscores this need. Therefore, this paper introduces a nonlinear neural-Lyapunov-based adaptive resilient cruise control approach aimed at ensuring that all vehicles maintain safe tracking of the leader's profile, even in the presence of cyber-attacks and external disturbances. To achieve this, we employ an adaptive neural network to estimate the system's nonlinear characteristics. Subsequently, the control procedure is proposed, utilizing a virtual disturbance observer and Lyapunov theorem for stability analysis and adaptive laws to deal with nonlinearity, external disturbances, deception attacks, and singular control gain. Notably, our proposed approach eliminates the need for restrictive assumptions such as Lipschitz conditions on the nonlinear component and avoids the requirement for additional algorithms to switch between controllers in the event of a cyber-attack. The paper provides compelling evidence of system stability and the achievement of control objectives. Additionally, simulation and comparative results validate the theoretical analysis, highlighting the efficacy of the proposed methodology.
|
|
16:30-16:45, Paper WeC18.5 | |
Dynamic Output-Feedback Switching Control for Discrete-Time LPV Switched Systems |
|
Souza, Andressa | University of Campinas |
Oliveira, Ricardo C. L. F. | University of Campinas - UNICAMP |
Peres, Pedro L. D. | University of Campinas |
Keywords: Switched systems, Linear parameter-varying systems, LMIs
Abstract: This note addresses the problem of robust stabilization of discrete-time switched Linear Parameter-Varying (LPV) systems, where each mode depends on time-varying parameters in a polytope. The aim is to ensure robust stabilization and a bound on the decay rate of the closed-loop trajectories. This is achieved by jointly designing switching rules and fixed-order output feedback controllers by means of Lyapunov-Metzler (LM) inequalities with multiple quadratic Lyapunov functions. The novelty of this work lies on rewriting the LM conditions in an appropriate way that can be solved by an iterative procedure based on linear matrix inequalities. This approach eliminates the need for grid-based procedures, which are commonly used to solve LM inequalities. Numerical experiments are conducted to validate the effectiveness and efficiency of the proposed method.
|
|
16:45-17:00, Paper WeC18.6 | |
Guaranteed Stabilization and Safety of Nonlinear Systems Via Sliding Mode Control |
|
Ding, Fan | Shanghai Jiao Tong University |
Ke, Jin | Department of Automation, School of Aerospace Engineering, Xiame |
Jin, Wanxin | Purdue University |
He, Jianping | Shanghai Jiao Tong University |
Duan, Xiaoming | Shanghai Jiao Tong University |
Keywords: Stability of nonlinear systems, Variable-structure/sliding-mode control, Lyapunov methods
Abstract: We study simultaneous stabilization and safety for nonlinear affine systems by unifying the control Lyapunov function (CLF) and control barrier function (CBF). With the sliding mode control, we develop a CLF-CBF control strategy in joint and switching forms based on a general and a particular class of sliding mode manifolds, respectively. Then, the sufficient conditions on the parameters that achieve the stabilization and safety objectives simultaneously are provided. These conditions are less conservative than the conditions required by the existing quadratic programming methods, where both CLF and CBF constraints need to be satisfied for all time. Finally, numerical simulations are conducted to verify the effectiveness and superiority of our proposed method.
|
|
WeC19 |
Dockside 7 |
Robust Control II |
Regular Session |
Chair: Yong, Sze Zheng | Northeastern University |
Co-Chair: Caverly, Ryan James | University of Minnesota |
|
15:30-15:45, Paper WeC19.1 | |
Robust Adaptive MPC Using Uncertainty Compensation |
|
Tao, Ran | University of Illinois at Urbana-Champaign |
Zhao, Pan | University of Alabama |
Kolmanovsky, Ilya V. | The University of Michigan |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Keywords: Robust adaptive control, Optimal control, Constrained control
Abstract: This paper presents an uncertainty compensation-based robust adaptive model predictive control (MPC) framework for linear systems with both matched and unmatched nonlinear uncertainties subject to both state and input constraints. In particular, the proposed control framework leverages an L1 adaptive controller (L1AC) to compensate for the matched uncertainties and to provide guaranteed uniform bounds on the error between the states and control inputs of the actual system and those of a nominal i.e., uncertainty-free, system. The performance bounds provided by the L1AC are then used to tighten the state and control constraints of the actual system, and a model predictive controller is designed for the nominal system with the tightened constraints. The proposed control framework, which we denote as uncertainty compensation-based MPC (UC-MPC), guarantees constraint satisfaction and achieves improved performance compared with existing methods. Simulation results on a flight control example demonstrate the benefits of the proposed framework.
|
|
15:45-16:00, Paper WeC19.2 | |
Data-Driven Control Synthesis Using Koopman Operator: A Robust Approach |
|
Eyuboglu, Mert | EPFL |
Powell, Nathan | EPFL |
Karimi, Alireza | EPFL |
Keywords: Nonlinear systems identification, Linear parameter-varying systems, Robust control
Abstract: This paper presents a data-driven control design method for nonlinear systems using the Koopman operator framework. The Koopman operator lifts nonlinear dynamics to a higher-dimensional space, where observable functions evolve linearly. We initially consider an approximate linear time-invariant (LTI) lifted representation of the nonlinear system. To address residual errors, we calculate an approximation of the two-norm-gain of the error system based on data. Subsequently, we synthesize a robust dynamic feedback controller, relying solely on the LTI frequency response, providing closed-loop guarantees. Additionally, we consider linear parameter-varying (LPV) lifted models to further minimize the error systems two-norm-gain. In that case, we propose control design based on robustly stabilising the LTI dynamics and compensating for the parameter-varying part. The proposed strategy ensures internal stability under the assumption that the parameter-varying dynamics are open-loop BIBO stable. Moreover, it provides nominal performance guarantees for specific input-output channels for which the parameter-varying dynamics are fully cancelled.
|
|
16:00-16:15, Paper WeC19.3 | |
State Feedback Synthesis for Robust Performance with Probabilistic Parametric Uncertainty |
|
Caverly, Ryan James | University of Minnesota |
Bageshwar, Vibhor | Honeywell |
Keywords: Robust control, Uncertain systems, LMIs
Abstract: The paper presents a convex-optimization-based approach to synthesize robust full-state feedback controllers in the presence of probabilistic parametric uncertainty. The known probability distribution of the uncertain parameters is used to determine probabilistic sector bounds on the uncertainty. The proposed synthesis method results in a controller that ensures robust stability with high probability, while maximizing closed-loop performance for the most likely values of uncertainty. The method involves iteratively solving semidefinite programs within a bisection or coordinate descent scheme. A numerical example demonstrates the performance improvement achieved by the proposed method in the presence of probabilisitic parametric uncertainty compared to a controller designed with the typical assumption of uniform uncertainty distributions.
|
|
16:15-16:30, Paper WeC19.4 | |
Optimal Design of Disturbance Attenuation Feedback Controllers for Linear Dynamical Systems |
|
Mannini, Davide | University of California, Santa Barbara |
Strässer, Robin | University of Stuttgart |
Rawlings, James B. | University of California, Santa Barbara |
Keywords: H-infinity control, Optimization, Robust control
Abstract: This paper finds the optimal feedback controller for the discrete time, finite horizon disturbance attenuation problem under bounded disturbances. We consider a linear dynamical system and a quadratic objective function with a resulting nonlinear optimization and optimal nonlinear controller. In the space of initial states, two regions are identified. One region, containing the zero initial state, features the linear optimal H_{infty} controller, while the other region features nonlinear optimal control, and converges to the linear quadratic regulator (LQR) controller for large initial states. The transition between the two regions offers a unified framework that spans from H_{infty} control to LQR control as a function of the relative magnitude of the initial state and the imposed disturbance. This study enhances the versatility of disturbance attenuation feedback controllers and expands on the previous work on the dynamic game theory approach to optimal robust control design.
|
|
16:30-16:45, Paper WeC19.5 | |
Closed Loop Intent-Expressive Trajectory Planning and Intent Estimation |
|
Gah, Elikplim | Northeastern University |
Yong, Sze Zheng | Northeastern University |
Keywords: Robust control, Autonomous robots, Model Validation
Abstract: This paper presents a tractable approach for designing set-based closed-loop intent-expressive trajectory planning and intent estimation algorithms with (noisy) output feedback for multi-agent/robot teams with discrete-time affine dynamics. The algorithms allow an observed agent to implicitly convey intent information to observer agents/teammates while guaranteeing that the agent robustly satisfies state and input constraints, avoids obstacles and achieves its intended goal state under worst-case realizations of uncertainties. In particular, the intent-expressive trajectory planning algorithm encodes intent information by ensuring that the output reachable sets (i.e., all possible measured outputs by the observer agents) to all intended goals are distinct from each other, while the intent estimation algorithm enables the observer agents to decode the intent by eliminating all intent models that are incompatible with run-time observations. Another contribution of this paper that extends the state-of-the-art is a novel closed-loop output feedback control policy design that borrows from the tube-based predictive control literature to reduce conservatism.
|
|
16:45-17:00, Paper WeC19.6 | |
A Generalized Accelerated Gradient Optimization Method |
|
Wu, Alex (Xinting) | Australian National University |
Petersen, Ian R. | Australian National University |
Ugrinovskii, Valery | University of New South Wales |
Shames, Iman | Australian National University |
Keywords: Optimization algorithms, Robust control, Uncertain systems
Abstract: In this paper, we extend the recently developed generalized heavy ball optimization algorithm by introducing an additional parameter. This yields an improved optimization algorithm which has a similar form to the triple momentum method. The global convergence of the proposed algorithm for a class of functions with sector-bounded gradients is established. This is achieved by using the circle criterion. The proposed algorithm is designed to have the best possible R-convergence rate consistent with global convergence established using the circle criterion.
|
|
WeC20 |
Dockside 8 |
Filtering |
Regular Session |
Chair: Georgiou, Tryphon T. | University of California, Irvine |
Co-Chair: Ozay, Necmiye | Univ. of Michigan |
|
15:30-15:45, Paper WeC20.1 | |
A Projection Filter Algorithm for Stochastic Systems with Correlated Noise and State-Dependent Measurement Covariance |
|
Fuady Emzir, Muhammad | King Fahd University of Petroleum and Minerals |
Cheded, Lahouari | Independent Researcher |
Keywords: Filtering, Kalman filtering, Estimation
Abstract: The solution of the optimal filtering problem for nonlinear stochastic systems can be efficiently approximated by using a projection filter. A recent work by Emzir et al. (2023) proposed a novel framework that employs automatic differentiation to implement the projection filter for exponential family manifolds. This approach can effectively overcome the curse of dimensionality in the optimal filtering problem by exploiting the sparse-grid scheme. In this paper, we extend this framework to a class of stochastic differential equations where both state and measurement noises are correlated and the measurement processes have state-dependent covariances. We derive the projection filter equation for this class of systems on exponential family manifolds and present a numerical example to demonstrate its performance in solving a challenging nonlinear filtering problem.
|
|
15:45-16:00, Paper WeC20.2 | |
Can Transformers Learn Optimal Filtering for Unknown Systems? |
|
Du, Zhe | University of Michigan |
Balim, Haldun | ETH Zurich |
Oymak, Samet | University of California, Riverside |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Filtering, Machine learning, Neural networks
Abstract: Transformer models have shown great success in natural language processing; however, their potential remains mostly unexplored for dynamical systems. In this work, we investigate the optimal output estimation problem using transformers, which generate output predictions using all the past ones. Particularly, we train the transformer using various distinct systems and then evaluate the performance on unseen systems with unknown dynamics. Empirically, the trained transformer adapts exceedingly well to different unseen systems and even matches the optimal performance given by the Kalman filter for linear systems. In more complex settings with non-i.i.d. noise, time-varying dynamics, and nonlinear dynamics like a quadrotor system with unknown parameters, transformers also demonstrate promising results. To support our experimental findings, we provide statistical guarantees that quantify the amount of training data required for the transformer to achieve a desired excess risk. Finally, we point out some limitations by identifying two classes of problems that lead to degraded performance, highlighting the need for caution when using transformers for control and estimation.
|
|
16:00-16:15, Paper WeC20.3 | |
Computational Optimal Transport and Filtering on Riemannian Manifolds |
|
Grange, Daniel | Stony Brook University |
Al-Jarrah, Mohammad | University of Washington Seattle |
Baptista, Ricardo | California Institute of Technology |
Taghvaei, Amirhossein | University of Washington Seattle |
Georgiou, Tryphon T. | University of California, Irvine |
Tannenbaum, Allen | Stony Brook University |
Keywords: Filtering, Optimal control, Neural networks
Abstract: In this paper we extend recent developments in computational optimal transport to the setting of Riemannian manifolds. In particular, we show how to learn optimal transport maps from samples that relate probability distributions defined on manifolds. Specializing these maps for sampling conditional probability distributions provides an ensemble approach for solving nonlinear filtering problems defined on such geometries. The proposed computational methodology is illustrated with examples of transport and nonlinear filtering on Lie groups, including the circle S1, the special Euclidean group SE(2), and the special orthogonal group SO(3).
|
|
16:15-16:30, Paper WeC20.4 | |
H∞ Filter Design for Discrete-Time LPV Systems |
|
Kang, Dongyeop | Electronics and Telecommunications Research Institute |
Park, Chaneun | Kyungpuk National University |
Keywords: Linear parameter-varying systems, Filtering, Estimation
Abstract: A new design condition for H∞ filter of discrete- time linear parameter-varying systems is addressed in this paper. The proposed method uses parameter-dependent Lyapunov functions and Finsler’s Lemma to derive the filter design conditions in terms of linear matrix inequalities. The presented design method has an advantage that it is possible to obtain less conservative results compared to existing methods in the view of H∞ performance and stability. The effectiveness and performance of the proposed filter design method are illustrated by numerical examples.
|
|
16:30-16:45, Paper WeC20.5 | |
Differential Privacy in Nonlinear Dynamical Systems with Tracking Performance Guarantees |
|
Chowdhury, Dhrubajit | Palo Alto Research Center |
Goyal, Raman | Palo Alto Research Center |
Rane, Shantanu | Palo Alto Research Center |
Keywords: Nonlinear output feedback, Filtering, Randomized algorithms
Abstract: We introduce a novel approach to make the tracking error of a class of nonlinear systems differentially private in addition to guaranteeing the tracking error performance. We use funnel control to make the tracking error evolve within a performance funnel that is pre-specified by the user. We make the performance funnel differentially private by adding a bounded continuous noise generated from an Ornstein-Uhlenbeck-type process. Since the funnel controller is a function of the performance funnel, the noise adds randomized perturbation to the control input. We show that, as a consequence of the differential privacy of the performance funnel, the tracking error is also differentially private. As a result, the tracking error is bounded by the noisy funnel boundary while maintaining privacy. We show a simulation result to demonstrate the framework.
|
|
16:45-17:00, Paper WeC20.6 | |
Optimal Control for Discrete-Time Systems under Bounded Disturbances |
|
Dogadin, Egor | ITMO University |
Peregudin, Alexey | ITMO University |
Shirokih, Dmitriy | ITMO University |
Keywords: Optimal control, Filtering, Linear systems
Abstract: This paper introduces a novel approach to the optimal control of linear discrete-time systems subject to bounded disturbances. Our approach is based on the newly established duality between ellipsoidal approximations of reachable and hardly observable sets. We provide exact solutions for state-feedback control and filtering problems, aligning with existing methods while offering improved computational efficiency. Moreover, our main contribution is the optimal solution to the output-feedback control problem for discrete-time systems which was not known before. Numerical simulations demonstrate the superiority of this result over previous sub-optimal ones.
|
|
WeC21 |
Pier 4 |
Parameter Identification for Battery Systems |
Invited Session |
Chair: Zhang, Dong | University of Oklahoma |
Co-Chair: Roy, Tanushree | Texas Tech University |
Organizer: Zhang, Dong | University of Oklahoma |
Organizer: Soudbakhsh, Damoon | Temple University |
Organizer: Jain, Neera | Purdue University |
Organizer: Dey, Satadru | The Pennsylvania State University |
Organizer: Tang, Shuxia | Texas Tech University |
Organizer: Roy, Tanushree | Texas Tech University |
Organizer: Moura, Scott | University of California, Berkeley |
Organizer: Lin, Xinfan | University of California, Davis |
Organizer: De Castro, Ricardo | University of California, Merced |
Organizer: Song, Ziyou | University of Michigan, Ann Arbor |
Organizer: Fogelquist, Jackson | University of California, Davis |
|
15:30-15:45, Paper WeC21.1 | |
System Identification for Lithium-Ion Batteries with Nonlinear Coupled Electro-Thermal Dynamics Via Bayesian Optimization (I) |
|
Tu, Hao | University of Kansas |
Lin, Xinfan | University of California, Davis |
Wang, Yebin | Mitsubishi Electric Research Labs |
Fang, Huazhen | University of Kansas |
Keywords: Energy systems, Nonlinear systems identification, Machine learning
Abstract: Essential to various practical applications of lithium-ion batteries is the availability of accurate equivalent circuit models. This paper presents a new coupled electro-thermal model for batteries and studies how to extract it from data. We consider the problem of maximum likelihood parameter estimation, which, however, is nontrivial to solve as the model is nonlinear in both its dynamics and measurement. We propose to leverage the Bayesian optimization approach, owing to its machine learning-driven capability in handling complex optimization problems and searching for global optima. To enhance the parameter search efficiency, we dynamically narrow and refine the search space in Bayesian optimization. The proposed system identification approach can efficiently determine the parameters of the coupled electro-thermal model. It is amenable to practical implementation, with few requirements on the experiment, data types, and optimization setups, and well applicable to many other battery models.
|
|
15:45-16:00, Paper WeC21.2 | |
Investigating Identification Input Designs for Modelling Lithium-Ion Batteries with Hysteresis Using LPV Framework (I) |
|
Sheikh, Abdul Muiz Ahmad | Eindhoven University of Technology |
Bergveld, Hendrik Johannes | Eindhoven University of Technology |
Donkers, M.C.F. | Eindhoven University of Technology |
Keywords: Modeling, Linear parameter-varying systems, Identification
Abstract: This paper proposes a methodology to identify models for lithium-ion batteries exhibiting pronounced hysteresis, such as the lithium-iron-phosphate (LFP) batteries. The proposed battery model structure considers a multiple-input single-output linear parameter-varying (LPV) framework, wherein a second input representing maximum hysteresis overpotential is introduced along with the battery current. Furthermore, three designs for the input-current profile used for model identification are realized to investigate the extent of the hysteretic battery behaviour relevant to certain model applications. Finally, several battery models with varying model order and basis-function complexity are identified using each of the three current profiles, which are subsequently validated using a real drive-cycle dataset that includes occasional charging spikes due to regenerative braking as well as prolonged constant-current charging conditions.
|
|
16:00-16:15, Paper WeC21.3 | |
Physics-Informed Neural Network for Discovering Systems with Unmeasurable States with Application to Lithium-Ion Batteries (I) |
|
Kajiura, Yuichi | The University of Oklahoma |
Espin, Jorge Esteban | University of Oklahoma |
Zhang, Dong | University of Oklahoma |
Keywords: Energy systems, Identification, Neural networks
Abstract: Combining machine learning with physics is a trending approach for discovering unknown dynamics, and one of such frameworks gaining attention is the physics-informed neural network (PINN). However, PINN often fails to optimize the network due to its difficulty in concurrently minimizing multiple losses originating from the system's governing equations. This problem can be more serious when the system's states are unmeasurable, as is the case for lithium-ion batteries. In this work, we introduce a robust method for training PINN that uses fewer loss terms and thus constructs a less complex landscape for optimization. In particular, instead of constructing losses from each differential equation, this method embeds the dynamics into a single loss function such that predicted states expanded in time given the dynamics are converted to a series of predicted outputs and compared with observed system outputs. Minimizing such a loss optimizes the network to predict states consistent with the physics. Further, system's unknown parameters are to be identified during training. For a demonstration, we apply it to a simple battery model to concurrently estimate its states and parameters.
|
|
16:15-16:30, Paper WeC21.4 | |
Degradation Modes Identification of Lithium-Ion Batteries Based on Flexible Discharge Data |
|
Wang, Shuquan | Shandong University |
Gao, Feng | Shandong University |
Zhang, Yusen | Shandong University |
Keywords: Pattern recognition and classification, Machine learning, Identification
Abstract: The precise identification of battery degradation modes stands as a critical aspect for the enhancement of battery system management, augmenting system safety, and fortifying overall system stability. However, the complex and variable nature of electrochemical processes, alongside the constraints posed by current testing apparatus, presents formidable obstacles to the real-time assessment of battery aging. In this study, we initially established the profound linkage between discharge curves and battery aging through an array of battery aging experiments. A meticulous examination was conducted on the correlation between segments of the discharge curve within specific voltage intervals and battery aging. This was followed by the accurate determination of battery aging patterns through the utilization of flexible discharge curve segments, integrated within a deep learning framework. Furthermore, we shed light on the critical importance of potential data encapsulated in discharge curves for depicting battery aging, corroborated by visual demonstrations of our model’s intermediate processes. Our findings highlight the significant promise of employing readily available battery data for unlocking deeper understanding of the dynamics driving battery aging.
|
|
16:30-16:45, Paper WeC21.5 | |
Sensitivity Analysis of Lithium-Ion Battery SoH Indicators: An Analytical Study (I) |
|
Sepasiahooyi, Sara | Texas Tech |
Tang, Shuxia | Texas Tech University |
Keywords: Identification, Energy systems, Distributed parameter systems
Abstract: The aging processes of lithium-ion (Li-ion) batteries are complex, involving various State-of-Health (SoH) indicators. These mechanisms significantly impact the overall longevity of Li-ion batteries. Sensitivity Analysis (SA) of the terminal voltage to the SoH indicators provides insight into how each aging factor contributes to the battery’s lifespan and performance. In this article, SA is applied to six SoH indicators, including, thickness of the Solid Electrolyte Interface (SEI) film in the negative electrode, electrolyte concentration on boundaries, lumped resistance, shell thickness on the positive electrode, volume fraction of active material, and reaction rate constant. Analytical derivation is employed as the SA approach. This generic approach empowers us to apply the results for all current inputs and different types of batteries with various parameters, avoiding data collection and fitting which are typically required in numerical methods. Single Particle Model with electrolyte (SPMe) dynamics is employed as the battery model. The simulation is performed on a Li-ion battery cell with manganese dioxide (LiyMn2O4) and carbon (LixC6) as the positive and negative electrode, respectively, and an Urban Dynamometer Driving Schedule (UDDS) current profile.
|
|
16:45-17:00, Paper WeC21.6 | |
Parameter Estimation of Solid-Electrolyte-Interphase Based Ageing in the Doyle-Fuller-Newman Model Framework |
|
le Roux, Francis Anne | Eindhoven University of Technology |
Bergveld, Hendrik Johannes | Eindhoven University of Technology |
Donkers, M.C.F. | Eindhoven University of Technology |
Keywords: Modeling, Estimation, Optimization
Abstract: Ageing model implementation and parameter esti- mation remains an active research area in the field of Lithium- ion battery modelling. This paper proposes a parameter esti- mation procedure based on multiobjective opmization, where both objectives of having accurate voltage predictions and capacity degradation are simultaneously optimized. An iterative solution strategy is proposed to obtain a Nash equilibrium of the non-cooperative multiobjective optimization problem. The implementation of the parameter estimation procedure results in parameters for both a Doyle-Fuller-Newman model equations as well as an ageing modeling implementation, which assumes ageing due to the formation of the Solid-Electrolyte-Interphase layer. The proposed strategy shows convergence of the model parameters while resulting in an improved accuracy of both the modeled voltage as well as the modeled capacity fade. Furthermore, voltage predictions remain more accurate over the lifetime by including ageing modeling
|
| |